2011 IEEE International Conference on Multimedia and Expo

Workshops index

Acoustic and Video Coding and Communication
Streaming and Media Communications
Visual Content Identification and Search

Multimedia Services and Technologies for E-health

Multimedia-Aware Networking
Hot Topics in Multimedia Delivery
Interactive Ambient Intelligent Multimedia Environments
Content Protection & Forensics
Advances in Automated Multimedia Surveillance for Public Safety
Advances in Music Information Research
Hot Topics in Multimedia

Edutaiment

Human Factors in Multimedia

Multimodal Audio-based Multimedia Content Analysis

Vision and Graphics Computing for Multimedia Communications

Hot Topics in 3D Multimedia

Acoustic and Video Coding and Communication

Room: JH1.08 Date: 7/11/2011 Time: 9:00-18:00

Session Chair(s): Peng Yin, Yan Ye

Introduction

Multimedia communication through mobile networks and the Internet has rapidly become an essential way for us to communicate in the recent years. Multimedia communication includes and is not limited to peer-to-peer video calls and chatting, multi-point video conferences, live or adaptive streaming of movies and events, interactive viewing of graphic applications fully running in the cloud, etc. It involves audio, speech and video, all at the same time. To guarantee QoS, bandwidth limitation and network congestion are the main issues to be tackled. Furthermore, arising importance of low-delay interactive multimedia communications and applications requires special care and robustness against network delay and jitter. In this workshop, we aim to bring together speech, audio, video, and communication experts to present their latest research ideas on how to improve the QoS of multimedia communication and applications from both source coding and transport perspectives.

For the video part, what is next for video coding is a frequently asked question since the finalization of H.264/MPEG4-AVC in 2003. In early 2010, ITU-T and MPEG joined force again and launched together a standardization effort to develop HEVC (High Efficiency Video Coding), the next generation video coding standard. Such an effort represents simultaneously new opportunities and new challenges. Any future video standard needs to address not only the basic question of higher compression efficiency, but also better support for existing and emerging features such as 3D video and high dynamic range video. Furthermore, since digital video is being processed more commonly by a combination of software, GPU, and dedicated ASIC, new architecture friendly designs and better parallelization capabilities are also important topics to be considered. When delivering video over the Internet or mobile network, , and specially for low-delay real-time interactive applications, in order to ensure QoS, congest control and error control must also be taken into consideration.

For the speech and audio part, speech and audio coding has long been an important component of communication system design, especially when voice communication service was the main traffic in telecommunication system. With the ever-increasing high data rate available in new generation of communication systems, especially wireless communication systems, the role of speech and audio coding becomes different and more challenging: intelligibility requirement is being replaced by audio quality requirement. Users expect high quality, multi-channel, high dynamic-range audio experience over wireless and internet. Such a content-driven trend challenges researchers to provide better speech and audio coding/decoding schemes that not only provide high compression efficiency, but offer more enjoyable listening experience as well. Hence, cross-disciplinary research ranging from signal processing, machine learning, to psychoacoustics and auditory model is essential for developing successful next generation speech and audio coding.

All these factors together create the right moment for the AVCC workshop. The AVCC workshop provides a unique opportunity for people from both audio/speech and video backgrounds to hold joint discussions. It also provides an exciting forum for top researchers from both industry and academia to gather together and to share the latest research ideas and industry trends.

This workshop aims at high-quality original papers. Topics of interest include: advanced video coding technologies, including high efficiency video coding, scalable video coding, 3D video coding; speech, audio, image and video processing; detection of speech and audio signal in complex environments; statistical methods for speech and audio signal processing; low delay, interactive multimedia communications; error control and congestion control; error resilience coding and error concealment; complexity optimization, GPU and ASIC friendly design and implementation of speech, audio and video signal processing system.

The keynote speaker is Dr. John Apostolopoulos from Hewlett-Packard Laboratories. The title of his talk is “Future Multimedia Communications for Mobile Devices and Immersive Environments”.

Organizers:

Peng Yin, Dolby Laboratories Inc, USA

Yan Ye, InterDigital Communications Inc, USA

Pei-jung Chung, University of Edinburgh, UK

Zhiqiang Wu, Wright State University, USA

IMPROVED INTRA MODE SIGNALING FOR HEVC
Glenn Van Wallendael, Sebastiaan Van Leuven, Jan De Cock, Peter Lambert, Joeri Barbarien, Adrian Munteanu, Rik Van de Walle

RATE-DISTORTION-COMPLEXITY OPTIMIZATION FOR VLSI IMPLEMENTATION OF INTEGER M0TION ESTIMATION IN H.264/AVC ENCODER
Alireza Aminlou, Zahra Najafihaghi, Majid Namaki, M. R. Hashemi

SALIENCY-PRESERVING VIDEO COMPRESSION
Hadi Hadizadeh, Ivan Bajic

HYBRID VIDEO CODEC BASED JOINT RATE CONTROL OF STEREOSCOPIC VIDEO FOR TERRESTRIAL BROADCASTING SERVICES
Yongjun Chang, Munchurl Kim

CLASSIFIED QUADTREE-BASED ADAPTIVE LOOP FILTER
Qian Chen, Yunfei Zheng, Peng Yin, Xiaoan Lu, Joel Sole, Qian Xu, Edouard Francois, Dapeng Wu

MULTI-MODALITY LIKELIHOOD BASED PARTICLE FILTERING FOR 2-D DIRECTION OF ARRIVAL TRACKING USING A SINGLE ACOUSTIC VECTOR SENSOR
Xionghu Zhong, A. Premkumar, A. Madhukumar, Lau Chiew Tong

TECHNICAL OVERVIEW OF VP8, AN OPEN SOURCE VIDEO CODEC FOR THE WEB
Jim Bankoski, Paul Wilkins, Yaowu Xu

FAST MOTION ESTIMATION WITH DUAL SEARCH WINDOW FOR STEREO 3D VIDEO ENCODING
Michal Joachimiak, Kemal Ugur, Jani Lainema, Moncef Gabbouj

QUALITY ASSESSMENT AND ERROR CONCEALMENT FOR SVC TRANSMISSION OVER UNRELIABLE CHANNELS
Marco Brandas, Mikko Uitto, Maria Martini, Janne Vehkapera

MEDIA RATE CONTROL FOR LARGE SCALE IMMERSIVE COMMUNICATIONS
Mohammed Raad, Raad Raad, Farzad Safaei

UNEQUAL ERROR PROTECTION BASED ON SLICE VISIBLITY FOR TRANSMISSION OF COMPRESSED VIDEO OVER OFDM CHANNELS
Laura Toni, Pamela Cosman, Laurence Milstein

A RATE CONTROL ALGORITHM FOR X264 HIGH DEFINITION VIDEO CONFERENCING
Abbas Javadtalab, Mona Omidyeganeh, Shervin Shirmohammadi, Mojtaba Hosseini

Streaming and Media Communications

Room: JH1.02 Date: 7/11/2011 Time: 9:00-18:00

Session Chair(s): Luigi Atzori, Maria Martini, Ivan Bajić

HTTP ADAPTIVE STREAMING WITH MEDIA FRAGMENT URIS
Wim Van Lancker, Davy Van Deursen, Erik Mannens, Rik Van de Walle

EVALUATING THE PERCEIVED QUALITY OF INFRASTRUCTURE-LESS VOIP
Kun Chan Lan, Tsung Hsun Wu

QOE-BASED DYNAMIC RESOURCE ALLOCATION FOR MULTIMEDIA TRAFFIC IN IEEE 802.11 WIRELESS NETWORKS
Xinghua Sun, Kandaraj Piamrat, Cesar Viho

GAME THEORETICAL ANALYSIS OF WIRELESS MULTIVIEW VIDEO MULTICAST USING COOPERATIVE PEER-TO-PEER REPAIR
Vicky Zhao, Gene Cheung

INTERACTIVE MULTIMEDIA FOR ENGINEERING TELE-OPERATION
Benjamin Jailly, Marius Preda, Christophe Gravier, Jacques Fayolle

NOVEL CROSS-LAYER SCHEME FOR VIDEO TRANSMISSION OVER LTE-BASED WIRELESS SYSTEMS
Sotiris Karachontzits, Tasos Dagiuklas, Lambros Dounis

ERROR CONCEALMENT STRATEGIES FOR MOTION CAPTURE DATA STREAMING
Choong Hoon Kwak, Ivan Bajic

DISTRIBUTED MARKOV DECISION PROCESS IN COOPERATIVE PEER-TO-PEER REPAIR FOR WWAN VIDEO BROADCAST
Zhi Liu, Gene Cheung, Yusheng Ji

OFFERING STREAMING RATE ADAPTATION TO COMMON MEDIA PLAYERS
George Toma, Laurent Schumacher, Christophe De Vleeschouwer

SCALABLE VIDEO DISSEMINATION WITH PRIORITIZED NETWORK CODING
Eymen Kurdoglu, Nikolaos Thomos, Pascal Frossard

P2P VIDEO STREAMING WITH INTER-SESSION NETWORK CODING
Jonnahtan Saltarin, Nikolaos Thomos, Eirina Bourtsoulatze, Pascal Frossard

A PACKET LOSS ESTIMATION MODEL AND ITS APPLICATION TO RELIABLE MESH-BASED P2P VIDEO STREAMING
Chi Wen Lo, Chia Wen Lin, Yung Chang Chen, Jen Yu Yu

Visual Content Identification and Search

Room: JH1.03 Date: 7/11/2011 Time: 9:00-18:00

Session Chair(s): Wenjun (Kevin) Zeng, Miroslaw Bober, Ton Kalker

EXTRACTING MULTI-SIZE LOCAL DESCRIPTORS BY GPU COMPUTING
Naoyuki Ichimura

CREDIT-TITLE DETECTION OF VIDEO CONTENTS BASED ON ESTIMATION OF SUPERIMPOSED REGION USING CHARACTER DENSITY DISTRIBUTION
Ryota Mase, Ryoma Oami, Toshiyuki Nomura

A NEW ROI GROUPING SCHEMA FOR AUTOMATIC IMAGE ANNOTATION
Rami Albatal, Philippe Mulhem, Yves Chiaramella

AN EFFICIENT, ROBUST VIDEO FINGERPRINTING SYSTEM
Randall Cook

DETECTING HIGHLIGHTS IN SPORTS VIDEOS: CRICKET AS A TEST CASE
Hao Tang, Vivek Kwatra, Mehmet Emre Sargin, Ullas Gargi

MOBILE AUGMENTED REALITY FOR BOOKS ON A SHELF
David Chen, Sam Tsai, Cheng Hsin Hsu, Jatinder Singh, Bernd Girod

STATISTICAL MODELLING OF OUTLIERS FOR FAST VISUAL SEARCH
Skjalg Lepsoy, Gianluca Francini, Giovanni Cordara, Pedro De Gusmão

DIRECTED IMAGE SEARCH WITH LOCAL PARALLEL FEATURE AXES
Stefan Gumhold, Marcel Spehr

ALBUM-BASED OBJECT-CENTRIC EVENT RECOGNITION
Shen Fu Tsai, Feng Tang, Thomas Huang

AN EFFICIENT ANGLE-BASED SHAPE MATCHING APPROACH TOWARDS OBJECT RECOGNITION
Zhiyuan Zhang, Aixin Zhang, Jianhua Li, Shenghong Li

ANALYZING THE INNER WORKINGS OF THE SIGNATURE QUADRATIC FORM DISTANCE
Christian Beecks, Thomas Seidl

P-VCD: A PIVOT-BASED APPROACH FOR CONTENT-BASED VIDEO COPY DETECTION
Juan Manuel Barrios, Benjamin Bustos

Special Session: Multimedia Services and Technologies for E-health

Room: JH1.04 Date: 7/11/2011 Time: 9:00-18:00

Session Chair(s): TBA

Introduction

It is our great pleasure to welcome you to the ICME 2011 Workshop on Multimedia Services and Technologies for E-health (MUST-EH). MUST-EH workshop was one of the first of its kind and aiming at a series of workshop in coming years. This workshop aimed to report on the recent advancement and developments in various aspects of multimedia applications in e-health such as health monitoring, ubiquitous health care, Adaptive exergames for health, QoS for health and so on.

The applications of multimedia services, tools and technologies revolutionize today’s healthcare system. These applications opened up on demand smart healthcare to innovative possibilities: healthcare professional can treat and monitor patient more effectively and efficiently as multimedia services and technologies facilitate medical multimedia data to be sent to clinic or travelling doctor’s locations from an ambulance or patient accident site; present multimedia service and technologies also enable remote consultations which result in reduced cost, accessing undeserved population and continuous quality care of patients.

The call for papers attracted 16 submissions from USA, Canada, Europe and different parts of Asia. The program committee accepted 9 papers that cover different aspect of multimedia services and tools in e-health. We hope that the proceedings will serve a useful reference for scientists, researchers and academics in the field of multimedia in e-health.

We would like to thank our program committee for their valuable support from their busy schedule. We also like to thank main workshop organizers for their kind follow up.

Organizers:

M. Shamim Hossain, King Saud University, KSU, KSA

Stefan Goebel, Technical University Darmstadt, Germany

Atif Alamri, King Saud University, KSU, KSA

Manzur Murshed, Monash University, Australia

CLUSTERING MEAL IMAGES IN A WEB-BASED DIETARY MANAGEMENT SYSTEM
Gamhewage De Silva, Kiyoharu Aizawa

OPTIMAL SOURCE RATE ALLOCATION IN BODY SENSOR NETWORKS WITH ENERGY HARVESTING
Yifeng He, Wenwu Zhu, Ling Guan

OPTIMAL RESOURCE ALLOCATION TO PROVIDE QOS GUARANTEE IN PERVASIVE HEALTH MONITORING SYSTEMS
Yifeng He, Wenwu Zhu, Ling Guan

INTEGRATED DATABASE SYSTEMS FOR MOBILE DIETARY ASSESSMENT AND ANALYSIS
Marc Bosch Ruiz, Tusarebecca Schap, Nitin Khanna, Fengqing Zhu, Carol Boushey, Edward Delp

AUTOMATIC VOICE DISORDER CLASSIFICATION USING VOWEL FORMANTS
Ghulam Muhammad, Mansour Alsulaiman, Awais Mahmood, Zulfiqar Ali

A HAND-HELD MULTIMEDIA TRANSLATION AND INTERPRETATION SYSTEM FOR DIET MANAGEMENT
Albert Parra Pozo, Andrew Haddad, Mireille Boutin, Edward Delp

E-GLOVE: AN ELECTRONIC GLOVE WITH VIBRO-TACTILE FEEDBACK FOR WRIST REHABILITATION OF POST-STROKE PATIENTS
Ali Karime, Hussein Al Osman, Wail Gueaieb, Abdulmotaleb El Saddik

INTEROPERABILITY AND DATA SHARING SETTINGS IN A HEALTHCARE DATA MANAGEMENT SYSTEM
Mehedi Masud

A CONTEXT-AWARE E-HEALTH FRAMEWORK FOR STUDENTS WITH MODERATE INTELLECTUAL AND LEARNING
Abdulmotaleb El Saddik

Multimedia-Aware Networking

Room: JH1.05 Date: 7/11/2011 Time: 9:00-18:00

Session Chair(s): C. Timmerer, G. Xilouris

Introduction

Latest advances in multimedia content encoding and representation, including HDTV, 3DTV, multi-view video and associated added-value interactive services, are offering to the end user a truly rich multimedia experience. At the same time advances in communications systems and network technologies give the ability to apply in-network processing techniques in order to take advantage of the available user and media contextual information for efficient, flexible and auto-configurable media transmission. On the other hand network-aware applications and cross-layer mechanisms are being developed in order to take advantage of network information and to adapt the media to the current network conditions, terminal capabilities, and user preferences.

This workshop solicits novel contributions and breaking results on all aspects of multimedia-aware networking. In particular, workshop papers should describe algorithms, issues and experiences related to content-aware networking and network-aware applications, future (media) Internet architectures, self-* and adaptivity, cross-layer design and optimization, applications, and interoperability. We are particularly interested in (but not limited to) areas such as

* Content aware networking and network aware application including content creation (coding (e.g., 2D/3D, SVC, HEVC), preparation, packaging), content delivery (transport, streaming, live, on-demand, real-time, download), content adaptation (server, in-network, client), content-aware and media-aware forwarding and routing, content consumption (widget), and context aware applications and networks.

* Future Internet architectures, in particular, content/media centric aspects, network/infrastructure management, network virtualization, and cross-layer design and optimization.

* Self-* and adaptivity in the area of self-organization, self-configuration, and self-healing.

* Interoperability (i.e., standardization) efforts related to high-efficiency video coding, 3D video coding, dynamic adaptive streaming over HTTP, and modern media transport.

The call for papers attracted 14 submissions (WoMANΥ11 only without special session) from Austria, Belgium, France, Germany, Greece, Italy, Romania, Taiwan, and United States of America. The program committee accepted eight papers covering a variety of topics, all in the context of multimedia-aware networking:

* Content-awareness and cross-layer optimizations for service and network providers;

* Performance evaluations of home gateways and interactive spherical videos;

* Enhancing legacy infrastructures with content-awareness;

* Scalable video coding packetization in peer-to-peer networks;

* Quality impact of media-aware network delivery;

* Home-gateway support and implementation of content-aware network nodes;

Furthermore, WoMANΥ11 hosts a special session on “Hot Topics in Multimedia Delivery (HotMD)” introduced elsewhere and Claude Perron (Vice President of Thomson Video Networks) provides a keynote in the area of multimedia-aware networking.

As workshop chairs, we would like to thank all people who have contributed to the success of this workshop: the authors, the invited speaker, the program committee members and all reviewers, and the members of the organizing committee of ICME 2011. The support of the workshop sponsors is also greatly acknowledged, in particular DOLBY (http://www.dolby.com/) and RADVISION (http://www.radvision.com/) for sponsoring the best paper award.

We sincerely hope that the carefully crafted technical program we have arranged for, the scientific discussions that the workshop will hopefully stimulate, and your additional activities in Barcelona, will make your participation worthwhile and a memorable experience.

Organizers:

Christian Timmerer, Alpen-Adria-Universitt Klagenfurt, Austria

Daniel Negru, LaBRI, France

Eugen Borcoci, UPB, Romania

George Xilouris, Demokritos, Greece

EVALUATING THE NETWORKING PERFORMANCE OF LINUX-BASED HOME ROUTER PLATFORMS FOR MULTIMEDIA SERVICES
Ingo Kofler, Robert Kuschnig, Hermann Hellwagner

QUALITY IMPACT OF SCALABLE VIDEO CODING TUNNELING FOR MEDIA-AWARE CONTENT DELIVERY
Michael Grafl, Christian Timmerer, Hermann Hellwagner

EVALUATION OF BANDWIDTH PERFORMANCE FOR INTERACTIVE SPHERICAL VIDEO
Patrice Rondao Alface, Jean Francois Macq, Nico Verzijp

HOME BOXES SUPPORT FOR AN EFFICIENT VIDEO-ON-DEMAND DISTRIBUTION
Soraya Ait Chellouche, Willy Aubry, Daniel Negru, Yiping Chen

ENHANCING LEGACY INFRASTRUCTURES WITH CONTENT-AWARE ENABLERS TOWARDS A NETWORKED-MEDIA PLATFORM
Petros Anapliotis, Evangelos Pallis, Daniel Negru, Vasilleios Zacharopoulos, Greece

A PRELIMINARY IMPLEMENTATION OF A CONTENT-AWARE NETWORK NODE
Nikolaos Vorniotakis, George Xilouris, Georgios Gardikis, Nikolaos Zotos, Evangelos Pallis, Anastasios Kourtis

SERVICE PROVIDER AND CONTENT AWARE NETWORK PROVIDER CROSS-LAYER OPTIMISATION OF MULTIMEDIA DISTRIBUTION
Eugen Borcoci, Serban Obreja, Christian Timmerer, George Xilouris

PACKETIZING SCALABLE STREAMS IN HETEROGENUS PEER-TO-PEER NETWORKS
Alexandro Sentinelli, Tea Anselmo, Pasqualina Fragneto, Amit Kumar, Beatrice Rossi

Hot Topics in Multimedia Delivery

Room: JH1.05 Date: 7/11/2011 Time: 13:30-15:30

Session Chair(s): TBA

Introduction

Multimedia delivery platforms are evolving in several axes. From the transmission point of view, moving from purely broadcast based to hybrid broadcast-broadband and broadband networks, being of special interest the converged traditional use of media with mobile internet devices. Also, from point of view of content, moving from SD and HD images to 3D and multiview immersive experiences. From the user perspective where, user personalization and context are becoming more and more important as well as new enhanced ways of user interaction. And, finally, the convergence of media delivery and consumption with other technologies, as the ones of videogames, where real and virtual media are combined to provide a unique immersive user experience.

Multimedia content distribution has always been a fascinating and motivating topic to the multimedia research community. After years of active research and industrial efforts, it has reached a stage that “all of us can get our 15 minutes on YouTube”. However, there is still a long way towards the ultimate goal of anyone being able to consume multimedia content anytime, anywhere. As more right holders of professional multimedia content are ready to test water of new distribution methods, such as Internet video and IPTV; as the ever increasing capabilities of emerging devices, such as smartphones and tablets, enable new multimedia experiences on-the-go; as rapidly expanding cloud services start to offer greater assistance to multimedia distribution, we are now at a time when these are ample opportunities for the research of multimedia content distribution. Solving the real challenges towards the ultimate goal of multimedia content distribution is not only inspiring and exciting, but also bound to have significant and long lasting impact.

Organizers:

Cheng Huang, Microsoft Research, USA

Yong Liu, Polytechnic Institute of NYU, USA

Francesc Pinyol Margalef, La Salle - Universitat Ramon Llull, Spain

Gabriel Fernàndez Ubiergo, La Salle - Universitat Ramon Llull, Spain

Alejandro López González, La Salle - Universitat Ramon Llull, Spain

SKYMEDIA – UAV-BASED CAPTURING OF HD/3D CONTENT WITH WSN AUGMENTATION FOR IMMERSIVE MEDIA EXPERIENCES
Massimo Neri, Aldo Campi, Rosalba Suffritti, Francesco Grimaccia, Pedro Sinogas, Olivier Guye, Christophe Papin, Theodoros Michalareas, Laszlo Gazdag, Ismo Rakkolainen

A METRIC FOR NO-REFERENCE VIDEO QUALITY ASSESSMENT FOR HD TV DELIVERY BASED ON SALIENCY MAPS
Hugo Boujut, Jenny Benois Pineau, Toufik Ahmed, Ofer Hadar, Patrick Bonnet

REVERSE ENGINEERING THE YOUTUBE VIDEO DELIVERY CLOUD
Vijay Adhikari, Sourabh Jain, Yingying Chen, Zhi Li Zhang

AN EVALUATION OF PIECE-PICKING ALGORITHMS FOR LAYERED CONTENT IN BITTORRENT-BASED PEER-TO-PEER SYSTEMS
Michael Eberhard, Tibor Szkaliczki, Hermann Hellwagner, László Szobonya, Christian Timmerer

A NOVEL MARKING MECHANISM FOR PACKET VIDEO DELIVERY OVER DIFFSERV NETWORKS
Haidong Wang, Guizhong Liu, Lishui Chen, Qinli Wang

Interactive Ambient Intelligent Multimedia Environments

Room: JH1.07 Date: 7/11/2011 Time: 9:00-12:30

Session Chair(s): Ali Asghar, Xenophon Zabulis

Introduction

Interactive Ambient Intelligence Multimedia Environments (AIME) has the vision of enhancing our everyday environment and our interaction with its objects by sensing, computing, and communication capabilities. Major characteristics of such environments are the increasing number of intelligent devices (ubiquity), their complexity, and their integration into the background (transparency). As pervasive devices grow in complexity and sophistication, they become harder and harder to use, particularly because of their tiny display screens and limited interfaces. At the same time, devices will disappear or blend into the background and will be invisible to the user. However, because of this transparency, users fail to develop an adequate mental concept for AIME and its interaction concept. The reasons for this are the integration of the infrastructure into the background and the missing or invisible user interfaces.

To overcome these challenges, new interaction models are required. How can one interact with tiny devices that do not provide their own user interfaces? Or how to find and access devices that are invisible to the user? How to access physical devices in an unfamiliar environment without having knowledge about the technical infrastructure such as device’s physical address or IP address? Identifying and activating the right device or service from a huge amount of existing devices to perform a specific task is a challenge for users of these environments.

Some researchers have suggested fully automated and intelligent interaction to overcome the above-mentioned challenges. However, when analyzing implicit interaction approaches, some other challenges come up. Major challenges are lack of control and over-automation. Several works have reported already that people do not accept a full-adaptive and over-automated environment. Instead, users prefer to always be in control. Therefore, concepts for natural interaction are needed that allow implicit interaction while avoiding loss of control and over-automation. In such environments, multimedia play two key roles: they support new ways of interaction that apply to multiple human senses, and they diffuse the presentation of content in the environment of the user.

Facing these challenges is a community-spanning effort, necessitating the pooling of resources and experiences of different research groups. The aim of this workshop thus is to provide a link (1) between different AIME research groups and (2) between the academic communities and industry groups that work in the AIME area. These different groups will come together to foster the developments of highly intuitive, multimedia supported interaction solutions for AIME. To reflect this, we will invite a keynote speaker from the industry, promote the workshop among industry participants, and help to set up cooperation between the industry and academic groups.

Organizers:

Ali A. Nazari Shirehjini

Xenophon Zabulis

Sahin Albayrak

DESIGN ISSUES OF AMBIENT SOCIAL MEDIA FOR BETTER DECISION MAKING
Tatsuo Nakajima, Tetsuo Yamabe

A SMART ENVIRONMENT FOR AUGMENTED LEARNING THROUGH PHYSICAL BOOKS
George Margetis, Panagiotis Koutlemanis, Xenophon Zabulis, Margherita Antona, Constantine Stephanidis

TV AS A HUMAN INTERFACE FOR AMBIENT INTELLIGENCE ENVIRONMENTS
Gorka Epelde, Julio Abascal, Xabier Valencia, Unai Díaz, Christian Husodo Schulz, Ingo Zinnikus

OBJECT DETECTION USING HOMOGENEOUS PROBABILITY REDISTRIBUTION OF HISTOGRAMS
Mehdi Madani, Mahdi Bagheri, Amir Masood Eftekhari

MOBILE POINTME BASED PERVASIVE GAMING INTERACTION WITH LEARNING OBJECTS ANNOTATED PHYSICAL ATLAS
Abu Saleh Md Mahfuju Rahman, Abdulmotaleb El Saddik

A FRAMEWORK FOR ADAPTIVE INTERACTION SUPPORT BASED ON QUALITY OF CONTEXT INFORMATION
M. Anwar Hossain

CONTEXT-AWARE PRIORITIZED GAME STREAMING
Hesam Rahimi, Ali Asghar Nazari Shirehjini, Shervin Shirmohammadi

Content Protection & Forensics

Room: JH1.02 Date: 7/15/2011 Time: 9:00-18:00

Session Chair(s): Yongdong Wu, Sabu Emmanuel

A COPYRIGHT-PROTECTION WATERMARK MECHANISM BASED ON GENERALIZED BRAIN-STATE-IN-A-BOX NEURAL NETWORK AND ERROR DIFFUSION HALFTONING
Li Fan, Tiegang Gao, Qunting Yang, Yanjun Cao

DRM VIOLATION DETECTION USING CONSUMER LOGS ANALYSIS
Amit Sachan, Sabu Emmanuel

UNDERSTANDING GEOMETRIC MANIPULATIONS OF IMAGES THROUGH BOVW-BASED HASHING
Sebastiano Battiato, Giovanni Maria Farinella, Enrico Messina, Giovanni Puglisi

REVERSIBLE IMAGE WATERMARKING BASED ON PREDICTION-ERROR EXPANSION AND COMPENSATION
Yi Luo, Fei Peng, Xiaolong Li, Bin Yang

SECURITY AND PRIVACY IN ONLINE SOCIAL NETWORKS: A SURVEY
Prateek Joshi, C.-C. Jay Kuo

DATA HIDING FOR QUALITY ACCESS CONTROL AND ERROR CONCEALMENT IN DIGITAL IMAGES
Amit Phadikar, Santi Maity, Claude Delpha

OBSCURE SPREAD SPECTRUM WATERMARKING
Yongdong Wu, Szeling Yeo

DATA HIDING IN DOT DIFFUSED HALFTONE IMAGES
Yuanfang Guo, Oscar Au, Ketan Tang, Lu Fang, Zhiding Yu

DEMYSTIFYING HISTOGRAMS OF MULTI-QUANTISED DCT COEFFICIENTS
Thomas Gloe

A VIDEO ENCRYPTION ALGORITHM IN H.264 COMPRESSED DOMAIN FOR RESOURCE-LIMITED SYSTEMS
Niansheng Mao, Zhuo Li, Xiaoguang Li, Jing Zhang

A MULTI-LAYER KEY STREAM BASED APPROACH FOR JOINT ENCRYPTION AND COMPRESSION OF H.264 VIDEO
Suman Deb Roy, Jun Tian, Heather Yu, Wenjun Zeng

AN ANALYSIS ON BITWISE OPERATIONS IN THE ENCRYPTED DOMAIN
Sung Him Chui, Oscar Au, Chun Wing Kwok, Yujun Li, Lingfeng Xu, Wenxiu Sun

PREVENTATIVE STEGANALYSIS IN WIRELESS VISUAL SENSOR NETWORKS: CHALLENGES AND SOLUTIONS.
Julien Jainsky, Deepa Kundur

Advances in Automated Multimedia Surveillance for Public Safety

Room: JH1.03 Date: 7/15/2011 Time: 9:00-18:00

Session Chair(s): TBA

Introduction

It is our great pleasure to welcome you to the 1st IEEE International Workshop on Advances in Automated Multimedia Surveillance for Public Safety (AAMS-PS 2011) in conjunction with IEEE International Conference on Multimedia and Expo (ICME 2011). Unlike other workshops on surveillance, this special workshop aims to report on the recent developments in various aspects of automated multimedia surveillance, including the state-of-the-art techniques, methodologies and systems for protecting people and ensuring their safety through effective surveillance.

Automated multimedia surveillance systems utilize different tools and techniques in capturing, processing and analyzing multimedia data coming from heterogeneous sensors. These systems are often designed to support (semi-) automatic decision making, such as generating an alarm in response to a surveillance event, as well as to provide useful information to human decision makers for ensuring public safety. Many researchers in industry and academia are working on various aspects of automated surveillance - automatic analysis of media streams, real-time processing, event detection, system architecture, quality evaluation and so on, while some argue on the level of automation to be exercised in such systems. Therefore, AAMS-PS 2011 aims to cover a broad spectrum of research aspects in automated multimedia surveillance that is often multidisciplinary in nature.

Despite the fact that it was the first edition, the AAMS-PS 2011 workshop attracted a good number of submissions, a total of 25 submissions (none as invited) from Spain, Italy, Belgium, Saudi Arabia, China, Korea, Singapore, Canada, UAE, Bosnia and Herzegovina, and USA. The program committee consisting of experienced researchers has accepted 10 papers for oral and 5 papers for poster presentation. The selected papers cover different aspects of automated multimedia surveillance including fast video search and indexing, multi-object tracking, queuing models, QoS, decision support system, anonymity, middleware, and social networking based surveillance.

We would like to thank all the organizing committee and TPC members for their hard work in advertising the workshop, reviewing the papers and providing valuable suggestions. We hope that the outcome of this workshop proceeding will serve as a valuable reference for researchers and developers in the area of multimedia surveillance. Finally, we would like to thank the authors and the keynote speaker for providing the content of the program, and to Abdulmotaleb El Saddik and Xian-Sheng Hua for helping us in realizing this event.

Organizers:

Mohan S. Kankanhalli, National University of Singapore, Singapore

Pradeep K. Atrey, University of Winnipeg, Canada

M. Anwar Hossain, King Saud University, Riyadh, KSA

FAST VIDEO SEARCH AND INDEXING FOR VIDEOSURVEILLANCE APPLICATIONS WITH OPTIMALLY CONTROLLED FALSE ALARM RATES
Jose Luis Lisani, Antoni Buades, Lenny Rudin

THE LARGE-SCALE CROWD DENSITY ESTIMATION BASED ON SPARSE SPATIOTEMPORAL LOCAL BINARY PATTERN
Hua Yang, Hang Su, Shibao Zheng, Sha Wei, Yawen Fan

WIDE-AREA MULTI-OBJECT TRACKING WITH NON-OVERLAPPING CAMERA VIEWS
Youlu Wang, Senem Velipasalar, Mustafa Cenk Gursoy

A DECISION SUPPORT ENGINE FOR VIDEO SURVEILLANCE SYSTEMS
Dewan Ahmed, Shervin Shirmohammadi

A FAST ALGORITHM FOR RAIN DETECTION AND REMOVAL FROM VIDEOS
Minmin Shen, Ping Xue

A MIDDLEWARE PLATFORM FOR REAL-TIME PROCESSING OF MULTIPLE VIDEO STREAMS BASED ON THE DATA-FLOW PARADIGM
Pasquale Foggia, Mario Vento

MULTI-SENSOR FIRE DETECTION USING VISUAL AND TIME-OF-FLIGHT IMAGING
Steven Verstockt, Pieterjan De Potter, Sofie Van Hoecke, Peter Lambert, Rik Van de Walle

DETECTION AND TRACKING OF MULTIPLE MOVING OBJECTS IN VIDEO SEQUENCE USING ENTROPY MASK METHOD AND FAST LEVEL SET METHOD
Wanhyun Cho, Sunworl Kim, Gukdong Ahn, Sangcheol Park

A SURVEILLANCE SYSTEM BASED ON SOCIAL NETWORKING AND LOCALIZATION
Wonyoung Kwak, Jinyoung Han, Nakjung Choi, Ted Kwon, Yanghee Choi

VIVIE: A VIDEO-SURVEILLANCE INDEXER VIA IDENTITY EXTRACTION
Daniel Riccio, Maria De Marsico, Michele Nappi, Andrea Abate

FUSION OF FACE NETWORKS THROUGH THE SURVEILLANCE OF PUBLIC SPACES TO ADDRESS SOCIOLOGICAL SECURITY RECOMMENDATIONS
Sk Alamgir Hossain, Abu Saleh Md Mahfuju Rahman, Abdulmotaleb El Saddik

ANONYMOUS SURVEILLANCE
Mukesh Saini, Pradeep Atrey, Sharad Mehrotra, Mohan Kankanhall

TOWARDS OPTIMAL PLACEMENT OF SURVEILLANCE CAMERAS IN A BUS
Khaled Amriki, Pradeep Atrey

QOS-AWARE SERVICE COMPOSITION FOR VIDEO SURVEILLANCE
M. Shamim Hossain

AN ANALYTICAL G/M/1 QUEUEING MODEL FOR PROVIDING GUARANTEED QOS IN MULTIMEDIA SURVEILLANCE
Mohsin Iftikhar, M. Anwar Hossain

Advances in Music Information Research

Room: JH1.04 Date: 7/15/2011 Time: 9:00-18:00

Session Chair(s): Peter Knees, Òscar Celma, Markus Schedl

Introduction

Music information research has been a fast growing scientific field during the past decade. In traditional content-based MIR, music-related information is extracted from the audio signal using signal processing techniques. These methods, however, cannot capture semantic information that is not encoded in the audio signal, but nonetheless essential to many consumers, e.g., the meaning of the lyrics of a song or the political motivation or background of a singer.

In recent years, the emergence of various Web 2.0 platforms and services dedicated to the music and audio domain, like last.fm, MusicBrainz, or echonest, has provided novel and powerful, albeit noisy, sources for high level, semantic information on music artists, albums, songs, and others. The abundance of such information provided by the power of the crowd can therefore contribute to music information research and development considerably. On the other hand, the wealth of newly available, semantically meaningful information offered on Web 2.0 platforms also poses new challenges, e.g., dealing with the huge amount and the noisiness of this kind of data, various user biases, hacking, or the cold start problem.

Another recent trend, not at last addressable to platforms like Apple's iPhone or Google's Android, are intelligent user interfaces to access the large amounts of music available on today's mobile music players. Mobile devices that offer high speed Web access allow for even more music to be consumed via Web services. Dealing with these vast amounts of music requires intelligent services on mobile devices that provide, for example, personalized and context-aware music recommendations. The current emergence and confluence of these challenges make this an interesting field for researchers and industry practitioners alike.

Addressing these challenges, the 2011 edition of the AdMIRe workshop series served as a forum for presenting cutting edge research and discussing emerging trends in the fields covered. The workshop brought together researchers and developers from music and audio, multimedia, Web mining, and recommender systems communities.

In its third successful year, the workshop has established itself as a truly international venue with 18 high-quality submissions by authors from 12 countries.

Keeping an acceptance rate similar to previous years' editions, finally 10 papers have been selected as the result of a rigorous reviewing process.

The included topics span a wide field, such as mobile music computing and browsing, similarity measurement, tag and text analysis, music ontology design, and social studies. Several papers further present improvements in content-based music information research, more precisely, in chord estimation, signal-based genre classification, and content-based music retrieval.

We would like to thank all authors, participants, and presenters for their high-quality submissions and presentations as well as for their interesting discussions that made AdMIRe an outstanding event. Special thanks go to all members of the program committee and all reviewers for ensuring the high scientific quality of the workshop program.

The AdMIRe 2011 workshop was supported by the Austrian National Science Fund (FWF) under project numbers L511-N15 and P22856-N23, by the Department of Computational Perception at the Johannes Kepler University Linz, by the Linzer Hochschulfonds, by Barcelona Music & Audio Technology (BMAT), and by the organizers of the IEEE International Conference on Multimedia and Expo 2011.

Organizers:

Markus Schedl, Johannes Kepler University, Linz, Austria

Peter Knees, Johannes Kepler University, Linz, Austria

Oscar Celma, Barcelona Music and Audio Technologies, Barcelona,

Spain

WHEN USERS GENERATE MUSIC PLAYLISTS: WHEN WORDS LEAVE OFF, MUSIC BEGINS?
Simone Stumpf, Sam Muscroft

ANALYZING THE IMPACT OF DATA VECTORIZATION ON DISTANCE RELATIONS
Sebastian Stober, Andreas Nuernberger

AMP: ARTIST-BASED MUSICAL PREFERENCES DERIVED FROM FREE VERBAL RESPONSES AND SOCIAL TAGS
Rafael Ferrer Flores, Tuomas Eerola

IMPROVING THE KEY EXTRACTION PERFORMANCE OF A SIMULTANEOUS LOCAL KEY AND CHORD ESTIMATION SYSTEM
Johan Pauwels, Jean Pierre Martens, Marc Leman

MUSICAL GENRE CLASSIFICATION OF MPEG-4 TWINVQ AUDIO DATA
Michihiro Kobayakawa, Mamoru Hoshi

THE AMBLR: A MOBILE SPATIAL AUDIO MUSIC BROWSER
Rebecca Stewart, Mark Sandler

THE SEGMENT ONTOLOGY: BRIDGING MUSIC-GENERIC AND DOMAIN-SPECIFIC
Ben Fields, Kevin Page, David De Roure, Tim Crawford

KOLMOGOROV COMPLEXITY IN LYRICS
Teppo Ahonen

TRANSPOSITION AND TIME-WARP INVARIANT GEOMETRIC MUSIC RETRIEVAL ALGORITHMS
Kjell Lemström, Mika Laitinen

REAL-TIME SYNCHRONISATION OF MULTIMEDIA STREAMS IN A MOBILE DEVICE
Rob Macrae, Joachim Neumann, Xavier Anguera, Nuria Oliver, Simon Dixon

Hot Topics in Multimedia

Room: JH1.05 Date: 7/15/2011 Time: 9:00-18:00

Special Session: Edutaiment

Room: JH1.05 Date: 7/15/2011 Time: 13:30-15:30

Session Chair(s): Maiga Chang, Lluis Vicent, Oscar Garcia-Panella

Introduction

The appropriateness of the workshop in the program is to bring the best researchers in multimedia closer to one of the most important applications of their expertise: learning. At the same time, we expect that learning professionals come to see how multimedia can improve their learning methodologies.

Multimedia Productions in general and Games in particular, can be considered as interactive experiences that feature structured and complex activities. These can be used for solving problems and therefore for helping within the learning process. “Edutainment” is a transversal and strategic field that can be applied to different sectors and disciplines, providing with novel and innovative solutions.

The main aim of this workshop is to collect and disseminate the latest innovations in the field of applying multimedia technologies for learning and to create the synergies between the specialist in Multimedia and the specialist in Education to work together in this challenging field. To mention some, interesting ares of study include practical cases regarding the application of Multimedia in Education, Edutainment and Serious Games, Affective computing, Augmented, Virtual and Mixed Realities in Education, Interactive Arts in Education and Collaboration, competition and community Collaborative Environments and Learning, among many others.

As technology continues to permeate all areas of education the significance of multimedia is growing. Nowadays, the use of a Learning Management System is widespread in schools and universities. Also the use of simulations, videos, etc is becoming more and more important, and of course, distance programs are profiting from this technology and the number of students of open universities has increased steadily in the last 10 years. Education can be for multimedia what Medicine has been for Image Processing, a capital applications perfectly defined.

Entertainment implies a part of amusement plus some pleasure in the execution. A game provides with an interaction scheme and some sort of manipulative user interface for the generation of sensory feedback. It seems that educating through a medium that both serves for the purpose of learning and entertaining can be a pleasant, motivational and satisfying experience. And there are several real experiences that demonstrate that with clear facts and conclusions.

Organizers:

Dr. Maiga Chang,

Dr. Oscar Garcia-Panella,

Dr. Lluis Vicent-Safont

Dr. Olenka Bilash

A STUDY OF DEVELOPING THE PROCEDURAL LOGIC LEARNING SYSTEM USING THE CONCEPT OF THERBLIGS
Yu Ren Yen

INVESTIGATING STUDENTS’ PERCEIVED SATISFACTION, BEHAVIORAL INTENTION, AND EFFECTIVENESS OF ENGLISH LEARNING USING AUGMENTED REALITY
Yuan Jen Chang, Chin Hsing Chen, Wen Tzeng Huang, Wei Shiun Huang

MEECO: GAMIFYING ECOLOGY THROUGH A SOCIAL NETWORKING PLATFORM
Danae Vara, Enric Macias, Sergio Gracia, Alba Torrents, Simon Lee

Special Session: Human Factors in Multimedia

Room: JH1.05 Date: 7/15/2011 Time: 10:30-12:30

Session Chair(s): David Fonseca

Introduction

Chairman: David Fonseca

Organizers:

David Fonseca, La Salle

William A. Gruver, IEEE SMC Society

Anup Basu, Alberta

Chunrong Yuan, Tuebingen University

Marc Pifarre, La Salle

Eva Villegas, La Salle

QOE ASSESSMENT IN TELE-OPERATION WITH 3D VIDEO AND HAPTIC MEDIA
Ayano Tatematsu, Yutaka Ishibashi, Norishige Fukushima, Shinji Sugawara

GESTURE RECOGNITION ON A MOBILE DEVICE FOR REMOTE EVENT GENERATION
Eric Torunski, Abdulmotaleb El Saddik, Emil Petriu

HAND TRACKING BASED ON THE COMBINATION OF 2D AND 3D MODEL IN GAZE-DIRECTED VIDEO
Li Sun, Guizhong Liu

EVALUATING NEW INTERACTION SYSTEMS ON QUIZ SHOWS
Marc Pifarré, Eva Villegas, David Fonseca

HKISS: REAL WORLD BASED HAPTIC INTERACTION WITH VIRTUAL 3D AVATARS
Abu Saleh Md Mahfuju Rahman, Abdulmotaleb El Saddik

Special Session: Multimodal Audio-based Multimedia Content Analysis

Room: JH1.05 Date: 7/15/2011 Time: 9:00-10:10

Session Chair(s): Xavier Anguera Miro

Introduction

By definition, multimedia content is composed of multiple forms, including audio, video, text/ subtitles, and others. Traditionally, applications and algorithms that work with such content have considered only a single modality, allowing for example searching of textual tags, thereby ignoring any information available from others modalities. The limitations of this approach are obvious, and there is a recent trend towards multimodal processing, in which different content modalities complement each other, or are used for bootstrapping analysis of new modalities.

Audio is a prominent part of multimedia content, which is backed up by extensive research by the speech and music communities, although usually performed on audio-only systems. Utility of audio-only systems is often limited by the quality of the acoustic environment or the information contained therein, so they can benefit from a multimodal analysis of multimedia data, to enhance the resulting performance, robustness, and efficiency.

The main goal of the special session is to explore ways in which audio processing can be enhanced, bootstrapped, or facilitated by other available information modalities. We are interested not only in applications that show successful combinations of audio and other sources of information, but also on algorithms that effectively integrate them and leverage complementary information from each modality to obtain an enhanced result, in terms of degree of detail, coverage of the corpus, or other enabling factors.

The special session wants to provide a forum for publication of high-quality, novel research on multimedia applications and multimodal processing, with a special focus on the audio modality.

Organizers:

Xavier Anguera, Telefonica Research

Florian Metze, Carnegie Mellon University

Gerald Friedland, International Computer Science Institute

INDEXING OF PLAYER EVENTS USING MULTIMODAL CUES IN GOLF VIDEOS
Hyoung Gook Kim, Jinho Lee

AUTOMATIC SYNCHRONIZATION OF ELECTRONIC AND AUDIO BOOKS VIA TTS ALIGNMENT AND SILENCE FILTERING
Xavier Anguera Miro, Andreu Urruela, Nestor Perez, Nuria Oliver

SOCIAL FOCUS OF ATTENTION AS A TIME FUNCTION DERIVED FROM MULTIMODAL SIGNALS
Danil Korchagin, Hamid Reza Abutalebi

JUST-IN-TIME MULTIMODAL ASSOCIATION AND FUSION FROM HOME ENTERTAINMENT
Danil Korchagin, Petr Motlicek, Stefan Duffner, Hervé Bourlard

Special Session: Vision and Graphics Computing for Multimedia Communications

Room: JH1.05 Date: 7/15/2011 Time: 16:00-18:00

Session Chair(s): Pietro Zanuttigh

Introduction

The explosion and far-reaching of internet multimedia, especially those with new and emerging new media types, such as various 3D visual entities including 3DTV, stereoscopic movies and sports, and PhotoSynth (i.e., 3D photo collections), calls for advanced multimedia communication systems to deliver the most compelling experiences to the end user. While signal processing based multimedia communication systems have been very successful in dealing with traditional multimedia, they are not sufficient in dealing with emerging new types of multimedia. Presumably, due to the rich and structured visual information, smart content analysis techniques from computer vision and visual synthesis techniques from computer graphics shall provide effective solutions for efficient visual communications of these new media. Furthermore an interactive 3D experience requires all the four steps of 3D acquisition, compression, transmission and visualization at the client side and these tasks must be considered together in order to fully exploit the potentiality of this new media.

This special session will explore how vision and graphics computing can be leveraged for advanced multimedia communications systems, both for traditional and emerging new media types. It will bring researchers from 3 different fields, i.e., computer vision, computer graphics, and multimedia communications together, and provide a forum to researchers to identify critical problems and proposed effective solutions in this emerging interdisciplinary area.

The special session will cover many different interesting topics in this rapidly growing field. These include visual and 3D data processing, management, summarization and analysis for various communication applications. All the steps required for a 3D interactive experience will also be covered: firstly the 3D reconstruction and acquisition of static and dynamic scenes; then compression algorithms for 3D and multi-view data; after that the transmission of this information, including error resilience and concealment techniques; and finally 3D data visualization and interaction, including augmented and virtual reality applications.

Concluding the aim of the special session is to show how vision and graphics computing provides key enabling technologies to effectively utilize the rich structured visual information behind new 3D entities in order to facilitate multimedia communications.

General Chairs

Gang Hua, IBM T. J. Watson Research Center

Pietro Zanuttigh, Department of Information Engineering, University of Padova,

Program Chair:

Philippos Mordohai, Stevens Institute of Technology

REAL 3D INTERACTION BEHIND MOBILE PHONES FOR AUGMENTED ENVIRONMENTS
Farid Abedan Kondori, Shahrouz Yousefi, Haibo Li

AUTOMATIC LINE MARK RECOGNITION AND ITS APPLICATION IN CAMERA CALIBRATION IN SOCCER VIDEO
Jiang Bu, Songyang Lao, Liang Bai

TOPOLOGICAL SYNCHRONIZATION MECHANISM FOR ROBUST WATERMARKING ON 3D SEMI-REGULAR MESHES
Ali Beddiaf, William Puech, Med Chaouki Babahenini

DYNAMIC PROGRAMMING-BASED OPTIMIZATION FOR AUDIO-VISUAL SKIMS
Yu Huang, Jizhou Gao, Heather Yu

Hot Topics in 3D Multimedia

Room: JH1.08 Date: 7/15/2011 Time: 9:00-18:00

Session Chair(s): TBA

Introduction

The second international workshop on Hot Topics in 3D Multimedia continues the mission of the first Hot3D workshop which was held in conjunction with ICME 2010 in Singapore during July 2010. The first Hot3D workshop was organized as a full‐day workshop aligned with other ICME workshops and activities. It has been a great success with outstanding contributions and very good attendance. The steering committee, consisting of Dinei Florencio, Murat Tekalp, Anthony Vetro, and Cha Zhang, at that time stated the Hot3D mission as follows:

“The 3D research community is at a potentially revolutionary point. From one side, 3D display technology is at the verge of becoming widespread and reasonably priced, and that includes autostereoscopic displays. From another direction, increases in computational power – including powerful GPUs – has allowed an ever-increasing realism in 3D scene generation. 3D audio is now often tightly integrated with 3D environments, including 5.1 (and higher) and even 3D soundfield reproduction. Haptic systems are also being tightly integrated within 3D systems. Quality evaluation of 3D systems itself is of great importance, and is also showing fast progress. Finally, new depth cameras, coupled with new 3D analysis and synthesis algorithms are close to enable commercial-quality 3D rendering of real scenes, instead of being restricted to synthetic scenes as in the past. All these factors together create the perfect storm: an environment prone for an explosion of related technology and applications, with a speed of development that will not fit in the (slower) cycle of traditional conferences and journals. In other words, while appropriate venues for presenting research at advanced stages is plentiful, the 3D Multimedia community lacks an appropriate venue for receiving feedback during early or initial stages of the development of radical or potentially disruptive technology. This is exactly the void that Hot3D tries to fill, providing an environment for lively discussion of early-stage, potentially disruptive research.”

In 2011, Touradj Ebrahimi, Aljoscha Smolic, and Eckehard Steinbach took over the organization from the 2010 organizers. The technical program committee grew to 36 experts working in the area of 3D multimedia and the call for papers attracted a total of 27 submissions. Each paper received at least three reviews and finally, 12 high quality contributions were selected for inclusion in the workshop program, which corresponds to an acceptance rate of 44%.

The workshop will take place on Friday, July 15, 2011 in Barcelona in conjunction with ICME 2011. We are very happy that Tibor Balogh from Holografika has accepted our invitation to deliver the keynote talk at the workshop.

We would like to thank the Hot3D Steering Committee to entrust us with the organization of the second workshop and we hope that Hot3D will grow into a permanent venue for 3D multimedia researchers to meet and exchange ideas. We are excited to see that the workshop has attracted an excellent number of submissions and we are looking forward to an exciting and lively workshop.

Organizers:

Touradj Ebrahimi

Aljoscha Smolic

Eckehard Steinbach

OPTIMIZING FRAME STRUCTURE FOR INTERACTIVE MULTIVIEW VIDEO STREAMING WITH VIEW SYNTHESIS
Xiaoyu Xiu, Gene Cheung, Antonio Ortega, Jie Liang

REAL-TIME LOCAL STEREO MATCHING USING GUIDED IMAGE FILTERING
Asmaa Hosni, Michael Bleyer, Christoph Rhemann, Margrit Gelautz, Carsten Rother

BIT ALLOCATION FOR MULTIVIEW IMAGE COMPRESSION USING CUBIC SYNTHESIZED VIEW DISTORTION MODEL
Vladan Velisavljevic, Gene Cheung, Jacob Chakareski

VIRTUAL OBJECT DISTORTIONS IN 3D DISPLAYS WITH ONLY HORIZONTAL PARALLAX
Amir Said, Bruce Culbertson

A NO-REFERENCE QUALITY MEASURE FOR DIBR-BASED 3D VIDEOS
Mashhour Solh, Ghassan Al Regib

3-D VIDEO QUALITY IMPROVEMENT USING DEPTH TRANSITION DATA
Woo Shik Kim, Antonio Ortega, Jaejoon Lee, Hocheon Wey

EVALUATION OF BACKWARD MAPPING DIBR FOR FVV APPLICATIONS
Daniel Berjón, Alexander Hornung, Francisco Morán Burgos, Aljosa Smolic

AUTOMATIC CONTENT CREATION FOR MULTIVIEW AUTOSTEREOSCOPIC DISPLAYS USING IMAGE DOMAIN WARPING
Miquel Farre, Oliver Wang, Manuel Lang, Nikolce Stefanoski, Alexander Hornung, Aljosa Smolic