ICIP 2024 Panel on Video Coding for Machines

Video accounts for about 80% of today’s Internet traffic. A growing share of this video is consumed by machines, in applications such as video surveillance, healthcare monitoring, transportation, and smart cities. Machine vision differs from human vision in many respects. In recent years, interest in and demand for video coding technologies and solutions tailored to machine vision requirements and use cases have grown considerably. This panel will discuss and envision technology and standards development, applications, challenges, and trends related to video coding for machines (VCM).

Moderator:

Name: Shan Liu

Job Title / Position: Distinguished Scientist and General Manager

Organization: Tencent

Email: shanl@global.tencent.com

Brief Bio
Shan Liu (Fellow, IEEE) received the B.Eng. degree in electronic engineering from Tsinghua University and the M.S. and Ph.D. degrees in electrical engineering from the University of Southern California. She is a Distinguished Scientist and General Manager at Tencent. She was formerly Director of the Media Technology Division at MediaTek USA, and was previously with MERL and Sony, among others. She has been a long-time contributor to international standardization, with many technical proposals adopted into standards such as VVC, HEVC, OMAF, DASH, MMT and PCC, and served as a Project Editor of the ISO/IEC | ITU-T H.266/VVC standard. She is a recipient of the ISO&IEC Excellence Award, the Technology Lumiere Award, and the USC SIPI Distinguished Alumni Award, and a two-time recipient of the IEEE TCSVT Best AE Award. She currently serves as Associate Editor-in-Chief of IEEE Transactions on Circuits and Systems for Video Technology and Vice Chair of the IEEE Data Compression Standards Committee. She also serves or has served on a few other boards and committees. She holds more than 600 granted US patents and has published more than 100 peer-reviewed papers and one book. Her interests include audio-visual, volumetric, immersive and emerging media compression, intelligence, transport and systems.

Panelists:

Name: Marek Domański

Job Title / Position: Full professor, Director of the institute

Organization: Poznań University of Technology, Institute of Multimedia Telecommunications, Poland

Email: marek.domanski@put.poznan.pl

Brief Bio
Marek Domański (Life Senior Member, IEEE) received the M.Sc., Ph.D., and Habilitation degrees from the Poznań University of Technology, Poland, in 1978, 1983, and 1990, respectively. Since 1993, he has been a Professor with the Poznań University of Technology, where he is the Director of the Institute of Multimedia Telecommunications. He coauthored one of the very first AVC decoders for TV set-top boxes (2004) as well as significant technology proposals to MPEG standardization for scalable video compression (2004), 3-D video coding (2011-2012), immersive video coding (2019-2022), and video coding for machines (2022-2024). He has authored 3 books and over 300 papers in journals and conference proceedings. He holds 18 patents granted by the European Patent Office and the United States Patent and Trademark Office. He has supervised 25 candidates to the doctoral degree. His contributions are mostly in image, video and audio compression, virtual navigation, free-viewpoint television, image processing, multimedia systems, 3-D video and color image technology, digital filters, and multidimensional signal processing.

Name: Jin Lee

Job Title / Position: Principal Researcher

Organization: Electronics and Telecommunications Research Institute (ETRI), Korea

Email: jinlee@etri.re.kr

Brief Bio
Jin Young Lee is a researcher at the Electronics and Telecommunications Research Institute (ETRI), Republic of Korea. He received his Ph.D. degree in Electrical and Computer Engineering from Michigan State University, USA.

Since 2009, he has been involved in various fields of standardization such as MPEG, ATSC, IoT, Digital Twin, and smart city. He has successfully led ISO 23247-1 (Digital Twin), ISO 37110 (Smart City), ISO/IEC 20924-1 (IoT), ISO/IEC 21823-1 (IoT), ISO/IEC 23008-1 (MMT), and ATSC A/104, and is currently in charge of ISO/IEC 23888-1 (MPEG-AI), ISO/IEC 23888-2 (VCM), and ISO/IEC 29093-6 (IoMT). His main standardization activities within SC29 include MPEG-AI, MPEG-VCM, MPEG-FCM, G-PCC, AI-GC, IoMT, ISOBMFF, MPEG-DASH, and MPEG-MMT.

Name: Dong Tian

Job Title / Position: Principal Researcher

Organization: InterDigital Inc., USA

Email: dong.tian@interdigital.com

Brief Bio
Dong Tian (Senior Member, IEEE) is currently a Senior Director with InterDigital in New York, NY. His research interests include image processing, 3D video, point cloud processing, and deep learning. He has been actively contributing to MPEG industry standards and academic communities for 20+ years. Prior to InterDigital, Dr. Tian was a Senior Principal Research Scientist at MERL, Cambridge, MA from 2010 to 2018, a Senior Researcher with Thomson Corporate Research, Princeton, NJ from 2006 to 2010, and a Visiting Researcher at Tampere University of Technology (TUT) from 2002 to 2005. He holds 30+ US-granted patents and has 50+ recent publications in top-tier journals/transactions and conferences. Dr. Tian serves or has served as a chair of MPEG-AI, a chair of MPEG 3DGH on AI-based Graphic Coding (2021-), a chair of MSA TC (2023-2025), an Associate Editor of TIP (2018-2024), General Co-Chair of MMSP’20 and MMSP’21, TPC chair of MMSP’19, etc. He is currently an advisory member of IEEE MMSP. Dong Tian received the B.S. and M.Sc. degrees from the University of Science and Technology of China (USTC), Hefei, China, in 1995 and 1998, and the Ph.D. degree from Beijing University of Technology, Beijing, in 2001.

Name: Giuseppe Valenzise

Job Title / Position: CNRS researcher and head of the Multimedia and Networking team

Organization: Université Paris-Saclay, CentraleSupélec, France

Email: giuseppe.valenzise@l2s.centralesupelec.fr

Brief Bio
Giuseppe Valenzise is a CNRS researcher at Université Paris-Saclay, CentraleSupélec, in the Laboratory of Signals and Systems (L2S), France, where he is the head of the Multimedia and Networking team. He completed a Ph.D. in Information Technology at the Politecnico di Milano, Italy, in 2011. His research interests span different fields of image and video processing, including traditional and learning-based image and video compression, immersive video (light fields, point clouds), image/video quality assessment, high dynamic range imaging, and applications of machine learning to image and video analysis. He is co-author of more than 120 research publications, one book and award-winning papers. Giuseppe is the Editor in Chief of the EURASIP Journal on Image and Video Processing, published by Springer, and serves or has served as associate editor for several journals, including IEEE Transactions on Circuits and Systems for Video Technology, IEEE Transactions on Image Processing, and Elsevier Signal Processing: Image Communication. He is the Chair of the MMSP technical committee of the IEEE Signal Processing Society for 2024-2025, and General Co-Chair for ICME 2025.

Name: Honglei Zhang

Job Title / Position: Principal Researcher

Organization: Nokia Technologies, Finland

Email: honglei.1.zhang@nokia.com

Brief Bio
Honglei Zhang is a principal researcher specializing in machine learning at Nokia, Finland. He earned his Bachelor’s and Master’s degrees in Electrical Engineering from Harbin Institute of Technology, China, in 1994 and 1996, respectively. With a career spanning both China and Finland, Honglei has excelled in various roles, including software engineering and architecture at Nokia in Finland from 1999 to 2013. Transitioning to academia, he earned his Ph.D. in Signal Processing from Tampere University, Finland, in 2019. Honglei has authored over 40 papers and holds over 50 patents, focusing on image/video compression, graph data analysis, and artificial intelligence. Notably, Honglei received outstanding inventor awards at Nokia from 2021 to 2023. His research focuses on neural network-based video coding for humans and machines.