ACM Multimedia 2014 papers on the web

This page is maintained by Yusuke Matsui. If you have additions or changes, please send an e-mail (matsui(at)

Full papers

Rescue Tail Queries: Learning to Image Search Re-rank via Click-wise Multimodal Fusion
Xiaopeng Yang (Chinese Academy of Sciences)
Tao Mei (Microsoft Research Asia)
Yongdong Zhang (Chinese Academy of Sciences)
Mining Cross-network Association for YouTube Video Promotion (pdf, project)
Ming Yan (Chinese Academy of Sciences)
Jitao Sang (Chinese Academy of Sciences)
Changsheng Xu (Chinese Academy of Sciences)
Scalable Visual Instance Mining with Threads of Features (pdf)
Wei Zhang (City University of Hong Kong)
Hongzhi Li (Columbia University)
Chong-Wah Ngo (City University of Hong Kong)
Shih-Fu Chang (Columbia University)
Object-Based Visual Sentiment Concept Analysis and Application (pdf)
Tao Chen (Columbia University)
Felix X. Yu (Columbia University)
Jiawei Chen (Columbia University)
Yin Cui (Columbia University)
Yan-Ying Chen (National Taiwan University, Columbia University)
Shih-Fu Chang (Columbia University)
Quality-adaptive Prefetching for Interactive Branched Video using HTTP-based Adaptive Streaming (pdf, project)
Vengatanathan Krishnamoorthi (Linköping University)
Niklas Carlsson (Linköping University)
Derek Eager (University of Saskatchewan)
Anirban Mahanti (NICTA)
Nahid Shahmehri (Linköping University)
ADVISOR – Personalized Video Soundtrack Recommendation by Late Fusion with Heuristic Rankings (pdf)
Rajiv Ratn Shah (National University of Singapore)
Yi Yu (National University of Singapore)
Roger Zimmermann (National University of Singapore)
Exploring Inter-feature and Inter-class Relationships with Deep Neural Networks for Video Classification
Zuxuan Wu (Fudan University)
Yu-Gang Jiang (Fudan University)
Jun Wang (Fudan University)
Jian Pu (Fudan University)
Xiangyang Xue (Fudan University)
3D Activity Recognition with Reconfigurable Convolutional Neural Networks
Keze Wang (Sun Yat-Sen University)
Xiaolong Wang (Carnegie Mellon University)
Liang Lin (Sun Yat-Sen University)
Meng Wang (Hefei University of Technology)
Wangmeng Zuo (Harbin Institute of Technology)
Fashion Parsing with Video Context
Si Liu (National University of Singapore)
Xiaodan Liang (Sun Yat-Sen University)
Luoqi Liu (National University of Singapore)
Liang Lin (Sun Yat-Sen University)
Ke Lv (National University of Singapore)
Shuicheng Yan (National University of Singapore)
A Group Testing Framework for Similarity Search in High-dimensional Spaces (pdf)
Miaojing Shi (Peking University)
Teddy Furon (INRIA)
Hervé Jégou (INRIA)
Optimized Distances for Binary Code Ranking (pdf)
Jianfeng Wang (University of Science and Technology of China)
Heng Tao Shen (The University of Queensland)
Shuicheng Yan (National University of Singapore)
Nenghai Yu (University of Science and Technology of China)
Shipeng Li (Microsoft Research Asia)
Jingdong Wang (Microsoft Research Asia)
Say Cheese vs. Smile: Reducing Speech-Related Variability for Facial Emotion Recognition
Yelin Kim (University of Michigan)
Emily Mower Provost (University of Michigan)

Short papers

Food Detection and Recognition Using Convolutional Neural Network
Hokuto Kagaya (The University of Tokyo)
Kiyoharu Aizawa (The University of Tokyo)
Makoto Ogawa (Foo.log)
A Social DJ Interface with Remote Audience Feedback (project)
Lasse Farnung Laursen (The University of Tokyo)
Masataka Goto (AIST)
Takeo Igarashi (The University of Tokyo)
Clothing Retrieval Based on Local Similarity with Multiple Images (video)
Masaru Mizouchi (The University of Tokyo)
Asako Kanezaki (The University of Tokyo)
Tatsuya Harada (The University of Tokyo)
Automatic Image Synthesis from Keywords Using Scene Context
Sho Inaba (The University of Tokyo)
Asako Kanezaki (The University of Tokyo)
Tatsuya Harada (The University of Tokyo)
A Robust Panel Extraction Method for Manga (pdf)
Xufang Pang (City University of Hong Kong)
Ying Cao (City University of Hong Kong)
Rynson W. H. Lau (City University of Hong Kong)
Antoni B. Chan (City University of Hong Kong)
Just Browsing? Understanding User Journeys in Online TV (pdf)
Yehia Elkhatib (Lancaster University)
Rebecca Killick (Lancaster University)
Mu Mu (Lancaster University)
Nicholas Race (Lancaster University)
Chic or Social: Visual Popularity Analysis in Online Fashion Networks (pdf)
Kota Yamaguchi (Tohoku University)
Tamara L. Berg (University of North Carolina at Chapel Hill)
Luis E. Ortiz (Stony Brook University)
Predicting Viewer Perceived Emotions in Animated GIFs (pdf)
Brendan Jou (Columbia University)
Subhabrata Bhattacharya (Columbia University)
Shih-Fu Chang (Columbia University)
Modeling Attributes from Category-Attribute Proportions (pdf)
Felix X. Yu (Columbia University)
Liangliang Cao (IBM T. J. Watson Research Center)
Michele Merler (IBM T. J. Watson Research Center)
Noel Codella (IBM T. J. Watson Research Center)
Tao Chen (Columbia University)
John R. Smith (IBM T. J. Watson Research Center)
Shih-Fu Chang (Columbia University)
Organizing Video Search Results to Adapted Semantic Hierarchies for Topic-based Browsing (pdf)
Jiajun Wang (Fudan University)
Yu-Gang Jiang (Fudan University)
Qiang Wang (Fudan University)
Kuiyuan Yang (Microsoft Research Asia)
Chong-Wah Ngo (City University of Hong Kong)
Real-Time Summarization of User-Generated Videos Based on Semantic Recognition (pdf)
Xi Wang (Fudan University)
Yu-Gang Jiang (Fudan University)
Zhenhua Chai (Huawei Technologies)
Zichen Gu (Huawei Technologies)
Xinyu Du (Huawei Technologies)
Dong Wang (Huawei Technologies)
Multi-modal Language Models for Lecture Video Retrieval (pdf, project)
Huizhong Chen (Stanford University)
Matthew Cooper (FX Palo Alto Laboratory)
Dhiraj Joshi (FX Palo Alto Laboratory)
Bernd Girod (Stanford University)
Automatic Image Cropping using Visual Composition, Boundary Simplicity and Content Preservation Models (pdf)
Chen Fang (Dartmouth College)
Zhe Lin (Adobe Research)
Radomír Měch (Adobe Research)
Xiaohui Shen (Adobe Research)
Automatic Fine-grained Hyperlinking of Videos within a Closed Collection using Scene Segmentation (pdf)
Evlampios Apostolidis (CERTH)
Vasileios Mezaris (CERTH)
Mathilde Sahuguet (EURECOM)
Benoit Huet (EURECOM)
Barbora Červenková (University of Economics)
Daniel Stein (Fraunhofer IAIS)
Stevens’ Power Law in 3D Tele-immersion (pdf)
Sabrina Schulte (RWTH Aachen University)
Shannon Chen (University of Illinois at Urbana-Champaign)
Klara Nahrstedt (University of Illinois at Urbana-Champaign)
Twitter-driven YouTube Views: Beyond Individual Influencers (pdf)
Honglin Yu (Australian National University, NICTA)
Lexing Xie (Australian National University, NICTA)
Scott Sanner (Australian National University, NICTA)