Shin'ichi Satoh's Lab

Welcome to Shin'ichi Satoh's Lab homepage at NII!

Research in our lab focuses on multimedia understanding and knowledge discovery. Especially, we aim to create an intelligent computer system which can see and understand the visual world.

We accept graduate students from Department of Information and Communication Engineering, Graduate School of Information Science and Technology, the University of Tokyo. Our lab is in National Institute of Informatics, Japan.


  • [2017.6.16] Dr. Uchida (our former PhD student) and Prof. Satoh received the best paper award at ICMR 2017! [paper]
  • [2017.5.2] We will present our demo at NII Open House on June 9 and 10.
  • [2017.5.2] Entrance Exam Briefing for the University of Tokyo will be held on May 27.
  • [2017.4.21] Benjamin's paper was awarded as Honorable Mention Poster in IEEE PacificVis 2017.
  • [2017.2.21] Dr. Matsui will give an invited talk at CGVI symposium
  • [2017.1.13] A paper of Ryota Hinami (D1) was accepted by IEEE Multimedia
  • [2016.12.13] We will present 2 papers at COLING
  • [2016.12.11] We will present 2 papers at ISM

Research projects

Large-scale fast object detection

We extended R-CNN to larger scale, which enables immediate and accurate object category detection from a large image databas. R. Hinami and S. Satoh, "Large-scale R-CNN with Classifier Adaptive Quantization", ECCV 2016

Multimedia Analytics

Explore, analyze, and visualize archives of multimedia content by bringing together data science and computer vision for the support of real world applications such as social sciences, media studies, and even marketing.

Temporal Matching Kernel with Explicit Feature Maps for Video Event Retrieval

We propose a new video representation for video event retrieval. Given a video query, the method is able to efficiently retrieve similar video events or near-duplicates along with a precise temporal alignment. ``Temporal matching kernel with explicit feature maps,'' ACM Multimedia 2015.

Video Event Detection by Exploiting Word Dependencies

We exploited word dependencies as a new semantic video representation for recognizing complex events S. Phan, Y. Miyao, D-D Le and S Satoh, "Video Event Detection by Exploiting Word Dependencies from Image Captions", COLING 2016