情報処理学会 第80回全国大会 会期:2018年3月13日~15日 会場:早稲田大学 西早稲田キャンパス 情報処理学会 第80回全国大会 会期:2018年3月13日~15日 会場:早稲田大学 西早稲田キャンパス
招待講演(2)
Shi-Min Hu(Executive member of the council, China Computer Federation, Professor, Tsinghua University)
Intelligent Processing of Visual media and it's applications

日時:3月14日(水)13:50-14:20
会場:第1イベント会場(57号館 201)

【講演概要】In recent years, with the rapid development of digital technology, the public can capture and produce high-quality visual media conten. However, the current visual media processing techniques cannot satisfy the increasing demands of automatically organize, understand and enhance the media data according to the human intentions and potentially billions of personalized editing. In my talk, I will introduce recent research results in Visual Media Research Center at Tsinghua, then I will focus on a new research topic on visual distractors, which are objects accidentally captured and usually distract viewers from focusing on the main subjects. We propose methods to automatically detect and localize these distractors. A rule-based method is firstly proposed to detect common distractors. Then a learning based framework is proposed with a manually-labeled dataset. To achieve spatially- and temporally-coherent detection, we propose to extract features at the Temporal-Superpixel (TSP) level in a traditional SVM-based learning framework. We have also experimented with end-to-end learning with Convolutional Neural Networks (CNNs), which achieve slightly higher performance. The classification result is further refined in a post-processing step based on graph-cut optimization. Experimental results show that our method achieves an accuracy of 81% and a recall of 86%. We demonstrate several ways to remove the detected distractors to improve the video quality, including video hole filling; video frame replacement; and camera path re-planning.

【略歴】Shi-Min Hu is currently a professor in the department of Computer Science and Technology, Tsinghua University, Beijing. He received the PhD degree from Zhejiang University in 1996. His research interests include digital geometry processing, video processing, rendering, computer animation, and computer-aided geometric design. He has published more than 100 papers in journals and refereed conference. He is Editor-in-Chief of Computational Visual media (Springer), associate Editor-in-Chief of IEEE Transactions on Visualization and on editorial board of several journals, including Computer Aided Design (Elsevier) and Computer & Graphics (Elsevier).