Learning Visual Semantics: Models, Massive Computation, and Innovative Applications

Tutorial at CVPR 2014

June 23rd, 1:00pm-5:00pm, Columbus, OH


Shih-Fu Chang John Smith Rogerio Feris Liangliang Cao


The explosion of digital multimedia data - including visual content from surveillance cameras, mobile phones, personal photo collections, news footage, or medical images is creating significant opportunities for automated visual analysis. However, the most interesting content in multimedia files is often unconstrained and complex in nature, reflecting a diversity of human behaviors, scenes, activities, and events, which poses serious challenges for computer vision approaches. In this tutorial, we will present the state-of-the-art on large-scale visual semantic modeling, covering methods for obtaining intuitive mid-level semantic feature representations, while presenting innovative applications. The organizers will share their experience in achieving top performance on several recent competitions, including TRECVID, ImageNet, and ImageCLEF, and developing large-scale data and tool resources.


Innovative Applications and Datasets:

We will provide pointers to datasets and cover various applications for large-scale visual semantic analysis, including: