ACM Multimedia 2015 papers on the web

This page is maintained by Yusuke Matsui. To suggest additions or changes, please e-mail matsui(at)hal.t.u-tokyo.ac.jp.

Best Paper

An Affordable Solution for Binocular Eye Tracking and Calibration in Head-mounted Displays (pdf, project)
Michael Stengel (TU Braunschweig)
Steve Grogorick (TU Braunschweig)
Martin Eisemann (TU Braunschweig, FH Koeln)
Elmar Eisemann (TU Delft)
Marcus Magnor (TU Braunschweig)
Analyzing Free-standing Conversational Groups: A Multimodal Approach (pdf)
Xavier Alameda-Pineda (University of Trento)
Yan Yan (University of Trento, ADSC, UIUC)
Elisa Ricci (Fondazione Bruno Kessler, University of Perugia)
Oswald Lanz (Fondazione Bruno Kessler)
Nicu Sebe (University of Trento)
SINGA: Putting Deep Learning in the Hands of Multimedia Users (pdf, project)
Wei Wang (National University of Singapore)
Gang Chen (Zhejiang University)
Tien Tuan Anh Dinh (National University of Singapore)
Jinyang Gao (National University of Singapore)
Beng Chin Ooi (National University of Singapore)
Kian-Lee Tan (National University of Singapore)
Sheng Wang (National University of Singapore)
Weakly-Shared Deep Transfer Networks for Heterogeneous-Domain Knowledge Propagation (pdf)
Xiangbo Shu (Nanjing University of Science and Technology)
Guo-Jun Qi (University of Central Florida)
Jinhui Tang (Nanjing University of Science and Technology)
Jingdong Wang (Microsoft Research)

Multimedia Indexing and Search

Deep Compositional Cross-modal Learning to Rank via Local-Global Alignment
Xinyang Jiang (Zhejiang University)
Fei Wu (Zhejiang University)
Zhou Zhao (Zhejiang University)
Weiming Lu (Zhejiang University)
Siliang Tang (Zhejiang University)
Yueting Zhuang (Zhejiang University)
Effective Multi-Query Expansions: Robust Landmark Retrieval
Yang Wang (The University of New South Wales)
Xuemin Lin (The University of New South Wales)
Lin Wu (The University of Adelaide)
Wenjie Zhang (The University of New South Wales)
Fast and Accurate Content-based Semantic Search in 100M Internet Videos (pdf, project)
Lu Jiang (Carnegie Mellon University)
Shoou-I Yu (Carnegie Mellon University)
Deyu Meng (Xi'an Jiaotong University)
Yi Yang (University of Technology Sydney)
Teruko Mitamura (Carnegie Mellon University)
Alexander G. Hauptmann (Carnegie Mellon University)
Visual Coding in a Semantic Hierarchy
Yang Yang (University of Electronic Science and Technology of China)
Hanwang Zhang (National University of Singapore)
Mingxing Zhang (University of Electronic Science and Technology of China)
Fumin Shen (University of Electronic Science and Technology of China)
Xuelong Li (Chinese Academy of Sciences)

Social Multimedia

Cross-Domain Collaborative Learning in Social Multimedia
Shengsheng Qian (Chinese Academy of Sciences)
Tianzhu Zhang (Chinese Academy of Sciences)
Richang Hong (Hefei University of Technology)
Changsheng Xu (Chinese Academy of Sciences)
Learning Socially Embedded Visual Representation from Scratch
Shaowei Liu (Tsinghua University)
Peng Cui (Tsinghua University)
Wenwu Zhu (Tsinghua University)
Shiqiang Yang (Tsinghua University)
Spatial-aware Multimodal Location Estimation for Social Images
Jiewei Cao (The University of Queensland)
Zi Huang (The University of Queensland)
Yang Yang (University of Electronic Science and Technology of China)
What are Popular: Exploring Twitter Features for Event Detection, Tracking and Visualization
Hongyun Cai (The University of Queensland)
Yang Yang (University of Electronic Science and Technology of China)
Xuefei Li (The University of Queensland)
Zi Huang (The University of Queensland)

Emotional and Social Signals in Multimedia

A Multimodal Predictive Model of Successful Debaters or How I Learned to Sway Votes (pdf)
Maarten Brilman (University of Twente)
Stefan Scherer (The University of Southern California)
Collaborative Fashion Recommendation: A Functional Tensor Factorization Approach
Yang Hu (University of Maryland)
Xi Yi (University of Maryland)
Larry S. Davis (University of Maryland)
Predicting and Understanding Urban Perception with Convolutional Neural Networks
Lorenzo Porzi (Fondazione Bruno Kessler, University of Perugia)
Samuel Rota Bulò (Fondazione Bruno Kessler)
Bruno Lepri (Fondazione Bruno Kessler)
Elisa Ricci (Fondazione Bruno Kessler, University of Perugia)
Visual Affect Around the World: A Large-scale Multilingual Visual Sentiment Ontology (pdf)
Brendan Jou (Columbia University)
Tao Chen (Columbia University)
Nikolaos Pappas (EPFL)
Miriam Redi (Yahoo Labs)
Mercan Topkara (JW Player)
Shih-Fu Chang (Columbia University)

Multimedia and Vision

Dancing with Turks (project)
I-Kao Chiang (University of Pennsylvania)
Ian Spiro (New York University)
Alyssa Lees (New York University)
Jingchen Liu (The Pennsylvania State University)
Chris Bregler (New York University)
Yanxi Liu (The Pennsylvania State University)
Eye of the Dragon: Exploring Discriminatively Minimalist Sketch-based Abstractions for Object Categories (pdf)
Ravi Kiran Sarvadevabhatla (Indian Institute of Science)
R. Venkatesh Babu (Indian Institute of Science)
Single Image Spectral Reconstruction for Multimedia Applications
Antonio Robles-Kelly (NICTA)
SkyStitch: a Cooperative Multi-UAV-based Real-time Video Surveillance System with Stitching (pdf)
Xiangyun Meng (National University of Singapore)
Wei Wang (National University of Singapore)
Ben Leong (National University of Singapore)

Multimedia Art, Entertainment and Culture

A Distributed Theatre Experiment with Shakespeare
Douglas L Williams (BT)
Ian C Kegel (BT)
Marian Ursu (University of York)
Pablo Cesar (Centrum Wiskunde & Informatica)
Jack Jansen (Centrum Wiskunde & Informatica)
Erik Geelhoed (Falmouth University)
Andras Horti (Joanneum Research)
Michael Frantzis (Goldsmiths, University of London)
Bill Scott (Miracle Theatre Company)
Image Profiling for History Events on the Fly
Jia Chen (Shanghai Jiao Tong University)
Qin Jin (Renmin University of China)
Yong Yu (Shanghai Jiao Tong University)
Alexander G. Hauptmann (Carnegie Mellon University)
Modeling Perspective Effects in Photographic Composition (pdf)
Zihan Zhou (The Pennsylvania State University)
Siqiong He (The Pennsylvania State University)
Jia Li (The Pennsylvania State University)
James Z. Wang (The Pennsylvania State University)
Who’s Afraid of Itten: Using the Art Theory of Color Combination to Analyze Emotions in Abstract Paintings (pdf)
Andreza Sartori (University of Trento, Telecom Italia)
Dubravko Culibrk (University of Trento, University of Novi Sad)
Yan Yan (University of Trento, ADSC, UIUC)
Nicu Sebe (University of Trento)

Telepresence, Virtual, and Augmented Reality

Gradient-based 2D-to-3D Conversion for Soccer Videos (pdf)
Kiana Calagari (Simon Fraser University)
Mohamed Elgharib (Qatar Computing Research Institute)
Piotr Didyk (Saarland University)
Alexandre Kaspar (Massachusetts Institute of Technology)
Wojciech Matusik (Massachusetts Institute of Technology)
Mohamed Hefeeda (Qatar Computing Research Institute)
Image2Scene: Transforming Style of 3D Room
Xiaowu Chen (Beihang University)
Jianwei Li (Beihang University)
Qing Li (Beijing Union University)
Bo Gao (Beihang University)
Dongqing Zou (Beihang University)
Qinping Zhao (Beihang University)
Smart Beholder: An Open-Source Smart Lens for Mobile Photography (pdf)
Chun-Ying Huang (National Taiwan Ocean University)
Chih-Fan Hsu (Academia Sinica)
Tsung-Han Tsai (Academia Sinica)
Ching-Ling Fan (National Tsing Hua University)
Cheng-Hsin Hsu (National Tsing Hua University)
Kuan-Ta Chen (Academia Sinica)
Ubii: Towards Seamless Interaction between Digital and Physical Worlds (pdf)
Zhanpeng Huang (Hong Kong University of Science and Technology)
Weikai Li (Hong Kong University of Science and Technology)
Pan Hui (Hong Kong University of Science and Technology)

Actions and Events

Coherent Motion Detection with Collective Density Clustering
Yunpeng Wu (Zhengzhou University)
Yangdong Ye (Zhengzhou University)
Chenyang Zhao (Zhengzhou University)
Efficient Activity Retrieval through Semantic Graph Queries (pdf)
Gregory Castanon (Boston University)
Yuting Chen (Boston University)
Ziming Zhang (Boston University)
Venkatesh Saligrama (Boston University)
Temporal Localization of Fine-Grained Actions in Videos by Domain Transfer from Web Images (pdf)
Chen Sun (The University of Southern California)
Sanketh Shetty (Google)
Rahul Sukthankar (Google)
Ram Nevatia (The University of Southern California)
Temporal Matching Kernel with Explicit Feature Maps
Sébastien Poullot (CNRS, National Institute of Informatics)
Shunsuke Tsukatani (The University of Tokyo, National Institute of Informatics)
Anh Phuong Nguyen (University of Information Technology)
Hervé Jégou (INRIA)
Shin'ichi Satoh (National Institute of Informatics)

Video Systems

Dependency-Aware Unequal Error Protection for Layered Video Coding
Mohammad Reza Zakerinasab (University of Calgary)
Mea Wang (University of Calgary)
Exploring QoE for Power Efficiency: A Field Study on Mobile Videos with LCD Displays (pdf)
Zhisheng Yan (State University of New York at Buffalo)
Qian Liu (State University of New York at Buffalo)
Tong Zhang (Rensselaer Polytechnic Institute)
Chang Wen Chen (State University of New York at Buffalo)
HiFi: A Hierarchical Filtering Algorithm for Caching of Online Video
Shahid Akhtar (Alcatel-Lucent)
Andre Beck (Alcatel-Lucent)
Ivica Rimac (Alcatel-Lucent)
Video Killed The Data Store: Extending the n-Dimensional Display Interface for Full Screen Video
Charles D Estes (University of North Carolina at Chapel Hill)
Ketan Mayer-Patel (University of North Carolina at Chapel Hill)

Deep Learning and Multimedia

Automatic Image Dataset Construction from Click-through Logs Using Deep Neural Network (project)
Yalong Bai (Harbin Institute of Technology)
Kuiyuan Yang (Microsoft Research)
Wei Yu (Harbin Institute of Technology)
Chang Xu (Nankai University)
Wei-Ying Ma (Microsoft Research)
Tiejun Zhao (Harbin Institute of Technology)
DeepFont: Identify Your Font from An Image (pdf)
Zhangyang Wang (University of Illinois at Urbana-Champaign)
Jianchao Yang (Snapchat)
Hailin Jin (Adobe Research)
Eli Shechtman (Adobe Research)
Aseem Agarwala (Google)
Jonathan Brandt (Adobe Research)
Thomas S. Huang (University of Illinois at Urbana-Champaign)
EventNet: A Large Scale Structured Concept Library for Complex Event Detection in Video (pdf)
Guangnan Ye (Columbia University)
Yitong Li (Columbia University)
Hongliang Xu (Columbia University)
Dong Liu (Columbia University)
Shih-Fu Chang (Columbia University)
Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification (pdf)
Zuxuan Wu (Fudan University)
Xi Wang (Fudan University)
Yu-Gang Jiang (Fudan University)
Hao Ye (Fudan University)
Xiangyang Xue (Fudan University)

Multimedia Quality Perception

Biologically Inspired Media Quality Modeling
Luming Zhang (Hefei University of Technology)
Meng Wang (Hefei University of Technology)
Liqiang Nie (National University of Singapore)
Richang Hong (Hefei University of Technology)
Roger Zimmermann (National University of Singapore)
Yingjie Xia (Zhejiang University)
Modelling Human Factors in Perceptual Multimedia Quality: On The Role of Personality and Culture
Michael James Scott (Brunel University London)
Sharath Chandra Guntuku (Nanyang Technological University)
Yang Huan (Nanyang Technological University)
Weisi Lin (Nanyang Technological University)
Gheorghita Ghinea (Brunel University London)
QoE Modelling for VP9 and H.265 Videos on Mobile Devices
Wei Song (Queensland University of Technology)
Yao Xiao (Queensland University of Technology)
Dian Tjondronegoro (Queensland University of Technology)
Antonio Liotta (Eindhoven University of Technology)
Towards Solving the Bottleneck of Pitch-based Singing Voice Separation
Bilei Zhu (Fudan University)
Wei Li (Fudan University)
Linwei Li (Fudan University)

Multimedia Networking

Bandwidth-aware Prefetching for Proactive Multi-video Preloading and Improved HAS Performance (pdf)
Vengatanathan Krishnamoorthi (Linköping University)
Niklas Carlsson (Linköping University)
Derek Eager (University of Saskatchewan)
Anirban Mahanti (NICTA)
Nahid Shahmehri (Linköping University)
Distributed Optimal Datacenter Bandwidth Allocation for Dynamic Adaptive Video Streaming
Fanxin Kong (McGill University)
Xingjian Lu (McGill University, East China University of Science and Technology)
Mingyuan Xia (McGill University)
Xue Liu (McGill University)
Haibing Guan (Shanghai Jiao Tong University)
Enhancing the Quality of Interactive Multimedia Services by Proactive Monitoring and Failure Prediction
Mohammed Shatnawi (Simon Fraser University)
Mohamed Hefeeda (Qatar Computing Research Institute)
HTTP/2-Based Methods to Improve the Live Experience of Adaptive Streaming
Rafael Huysegems (Bell Labs)
Jeroen van der Hooft (Ghent University)
Tom Bostoen (Bell Labs)
Patrice Rondao Alface (Bell Labs)
Stefano Petrangeli (Ghent University)
Tim Wauters (Ghent University)
Filip De Turck (Ghent University)

Data Imperfectness for Multimedia

Beyond Doctors: Future Health Prediction from Multimedia and Multimodal Observations
Liqiang Nie (National University of Singapore)
Luming Zhang (Hefei University of Technology)
Yi Yang (University of Technology Sydney)
Meng Wang (Hefei University of Technology)
Richang Hong (Hefei University of Technology)
Tat-Seng Chua (National University of Singapore)
If You Can’t Beat Them, Join Them: Learning with Noisy Data (pdf)
Pravin Kakar (Institute for Infocomm Research)
Alex Yong-Sang Chia (Rakuten Institute of Technology)
Multi-View Visual Recognition of Imperfect Testing Data (pdf)
Qilin Zhang (Stevens Institute of Technology)
Gang Hua (Stevens Institute of Technology)
Searching Persuasively: Joint Event Detection and Evidence Recounting with Limited Supervision (pdf)
Xiaojun Chang (University of Technology Sydney)
Yao-Liang Yu (Carnegie Mellon University)
Yi Yang (University of Technology Sydney)
Alexander G. Hauptmann (Carnegie Mellon University)

Multimedia Experiences and Expectations

HyperMeeting: Supporting Asynchronous Meetings with Hypervideo (pdf, project)
Andreas Girgensohn (FX Palo Alto Laboratory)
Jennifer Marlow (FX Palo Alto Laboratory)
Frank Shipman (Texas A&M University)
Lynn Wilcox (FX Palo Alto Laboratory)
Interactive Scene Flow Editing for Improved Image-based Rendering and Virtual Spacetime Navigation (pdf, project, video)
Kai Ruhl (TU Braunschweig)
Martin Eisemann (TU Braunschweig, FH Koeln)
Anna Hilsmann (Fraunhofer HHI)
Peter Eisert (Fraunhofer HHI)
Marcus Magnor (TU Braunschweig)
MMToC: A Multimodal Method for Table of Content Creation in Educational Videos
Arijit Biswas (Xerox Research Centre India)
Ankit Gandhi (Xerox Research Centre India)
Om Deshmukh (Xerox Research Centre India)
Multi-sensor Self-Quantification of Presentations (pdf)
Tian Gan (National University of Singapore)
Yongkang Wong (National University of Singapore)
Bappaditya Mandal (Institute for Infocomm Research)
Vijay Chandrasekhar (Institute for Infocomm Research)
Mohan S. Kankanhalli (National University of Singapore)

Short Papers

Jointly Estimating Interactions and Head, Body Pose of Interactors from Social Scenes (pdf)
Ramanathan Subramanian (ADSC, UIUC)
Jagannadan Varadarajan (ADSC, UIUC)
Elisa Ricci (Fondazione Bruno Kessler, University of Perugia)
Oswald Lanz (Fondazione Bruno Kessler)
Stefan Winkler (ADSC, UIUC)
Probabilistic Semi-Canonical Correlation Analysis
Chie Kamada (The University of Tokyo)
Asako Kanezaki (The University of Tokyo)
Tatsuya Harada (The University of Tokyo)
R2P: Recomposition and Retargeting of Photographic Images
Hui-Tang Chang (National Taiwan University)
Po-Chen Pan (National Taiwan University)
Yu-Chiang Frank Wang (Academia Sinica)
Ming-Syan Chen (Academia Sinica)
Selective K-means Tree Search
Tuan Anh Nguyen (The University of Tokyo)
Yusuke Matsui (The University of Tokyo)
Toshihiko Yamasaki (The University of Tokyo)
Kiyoharu Aizawa (The University of Tokyo)
The Quest for Visual Interest (pdf, project)
Mohammad Soleymani (University of Geneva)
Unsupervised Cosegmentation based on Global Graph Matching (pdf)
Takanori Tamanaha (The University of Tokyo)
Hideki Nakayama (The University of Tokyo)
Vision-Inertial Hybrid Tracking for Robust and Efficient Augmented Reality on Smartphones (pdf)
Xin Yang (Huazhong University of Science and Technology)
Xun Si (Huazhong University of Science and Technology)
Tangli Xue (Huazhong University of Science and Technology)
Liheng Zhang (Huazhong University of Science and Technology)
Kwang-Ting Tim Cheng (University of California, Santa Barbara)