


3rd MLMI 2006: Bethesda, MD, USA
- Steve Renals, Samy Bengio, Jonathan G. Fiscus: Machine Learning for Multimodal Interaction, Third International Workshop, MLMI 2006, Bethesda, MD, USA, May 1-4, 2006, Revised Selected Papers. Lecture Notes in Computer Science 4299, Springer 2006, ISBN 3-540-69267-3
Invited Paper
- Parisa Eslambolchilar, Roderick Murray-Smith: Model-Based, Multimodal Interaction in Document Browsing. 1-12
Multimodal Processing
- Martial Michel, Jerome Ajot, Jonathan G. Fiscus: The NIST Meeting Room Corpus 2 Phase 1. 13-23
- Marc A. Al-Hames, Thomas Hain, Jan Cernocký, Sascha Schreiber, Mannes Poel, Ronald Müller, Sébastien Marcel, David A. van Leeuwen, Jean-Marc Odobez, Sileye O. Ba, Hervé Bourlard, Fabien Cardinaux, Daniel Gatica-Perez, Adam Janin, Petr Motlícek, Stephan Reiter, Steve Renals, Jeroen van Rest, Rutger Rienks, Gerhard Rigoll, Kevin Smith, Andrew H. C. Thean, Pavel Zemcík: Audio-Visual Processing in Meetings: Seven Questions and Current AMI Answers. 24-35
- Lei Chen, Mary P. Harper, Amy Franklin, R. Travis Rose, Irene Kimbara, Zhongqiang Huang, Francis K. H. Quek: A Multimodal Analysis of Floor Control in Meetings. 36-49
- Xiao Huang, Sharon L. Oviatt, Rebecca Lunsford: Combining User Modeling and Machine Learning to Predict Users' Multimodal Integration Patterns. 50-62
- Marc A. Al-Hames, Benedikt Hörnler, Christoph Scheuermann, Gerhard Rigoll: Using Audio, Visual, and Lexical Features in a Multi-modal Virtual Meeting Director. 63-74
Image and Video Processing
- Sileye O. Ba, Jean-Marc Odobez: A Study on Visual Focus of Attention Recognition from Head Pose in a Meeting Room. 75-87
- Kevin Smith, Sascha Schreiber, Igor Potucek, Vítezslav Beran, Gerhard Rigoll, Daniel Gatica-Perez: Multi-person Tracking in Meetings: A Comparative Study. 88-101
- Andreas Humm, Jean Hennebert, Rolf Ingold: Gaussian Mixture Models for CHASM Signature Verification. 102-113
- Aristodemos Pnevmatikakis, Lazaros Polymenakos: Kalman Tracking with Target Feedback on Adaptive Background Learning. 114-122
- Dennis J. Lin, Jilin Tu, Shyamsundar Rajaram, ZhenQiu Zhang, Thomas S. Huang: Da Vinci's Mona Lisa. 123-128
HCI and Applications
- Maria Danninger, Erica Robles, Leila Takayama, Qianying Wang, Tobias Kluge, Rainer Stiefelhagen, Clifford Nass: The Connector Service-Predicting Availability in Mobile Contexts. 129-141
- Agnes Lisowska, Susan Armstrong: Multimodal Input for Meeting Browsing and Retrieval Interfaces: Preliminary Findings. 142-153
Discourse and Dialogue
- Jacob Eisenstein, Randall Davis: Gesture Features for Coreference Resolution. 154-165
- Weiqun Xu, Jean Carletta, Johanna D. Moore: Syntactic Chunking Across Different Corpora. 166-177
- Alfred Dielmann, Steve Renals: Multistream Recognition of Dialogue Acts in Meetings. 178-189
- Matthias Zimmermann, Dilek Hakkani-Tür, Elizabeth Shriberg, Andreas Stolcke: Text Based Dialog Act Classification for Multiparty Meetings. 190-199
- Matthew Purver, Patrick Ehlen, John Niekrasz: Detecting Action Items in Multi-party Meetings: Annotation and Initial Experiments. 200-211
- Özgür Çetin, Elizabeth Shriberg: Overlap in Meetings: ASR Effects and Analysis by Dialog Factors, Speakers, and Collection Site. 212-224
Speech and Audio Processing
- Mikko Parviainen, Tuomo W. Pirinen, Pasi Pertilä: A Speaker Localization System for Lecture Room Environment. 225-235
- Dusan Macho, Climent Nadeu, Andrey Temko: Robust Speech Activity Detection in Interactive Smart-Room Environments. 236-247
- Xavier Anguera, Chuck Wooters, Javier Hernando: Automatic Cluster Complexity and Quantity Selection: Towards Robust Speaker Diarization. 248-256
- José M. Pardo, Xavier Anguera, Chuck Wooters: Speaker Diarization for Multi-microphone Meetings Using Only Between-Channel Differences. 257-264
- Matthias Wölfel: Warped and Warped-Twice MVDR Spectral Estimation With and Without Filterbanks. 265-274
- Martin Karafiát, Frantisek Grézl, Petr Schwarz, Lukás Burget, Jan Cernocký: Robust Heteroscedastic Linear Discriminant Analysis and LCRC Posterior Features in Meeting Data Recognition. 275-284
- Darren Moore, John Dines, Mathew Magimai-Doss, Jithendra Vepa, Octavian Cheng, Thomas Hain: Juicer: A Weighted Finite-State Transducer Speech Decoder. 285-296
- Sebastian Stüker, Chengqing Zong, Jürgen Reichert, Wenjie Cao, Muntsin Kolss, Guodong Xie, Kay Peterson, Peng Ding, Victoria Arranz, Jian Yu, Alex Waibel: Speech-to-Speech Translation Services for the Olympic Games 2008. 297-308
NIST Meeting Recognition Evaluation
- Jonathan G. Fiscus, Jerome Ajot, Martial Michel, John S. Garofolo: The Rich Transcription 2006 Spring Meeting Recognition Evaluation. 309-322
- Etienne Marcheret, Gerasimos Potamianos, Karthik Visweswariah, Jing Huang: The IBM RT06s Evaluation System for Speech Activity Detection in CHIL Seminars. 323-335
- Dominique Vaufreydaz, Rémi Emonet, Patrick Reignier: A Lightweight Speech Detection System for Perceptive Environments. 336-345
- Xavier Anguera, Chuck Wooters, José M. Pardo: Robust Speaker Diarization for Meetings: ICSI RT06S Meetings Evaluation System. 346-358
- Corinne Fredouille, Grégory Senay: Technical Improvements of the E-HMM Based Speaker Diarization System for Meeting Records. 359-370
- David A. van Leeuwen, Marijn Huijbregts: The AMI Speaker Diarization System for NIST RT06s Meeting Data. 371-384
- Elias Rentzeperis, Andreas Stergiou, Christos Boukis, Aristodemos Pnevmatikakis, Lazaros C. Polymenakos: The 2006 Athens Information Technology Speech Activity Detection and Speaker Diarization Systems. 385-395
- Xuan Zhu, Claude Barras, Lori Lamel, Jean-Luc Gauvain: Speaker Diarization: From Broadcast News to Lectures. 396-406
- Christian Fügen, Shajith Ikbal, Florian Kraft, Ken'ichi Kumatani, Kornel Laskowski, John W. McDonough, Mari Ostendorf, Sebastian Stüker, Matthias Wölfel: The ISL RT-06S Speech-to-Text System. 407-418
- Thomas Hain, Lukás Burget, John Dines, Giulia Garau, Martin Karafiát, Mike Lincoln, Jithendra Vepa, Vincent Wan: The AMI Meeting Transcription System: Progress and Performance. 419-431
- Jing Huang, Martin Westphal, Stanley F. Chen, Olivier Siohan, Daniel Povey, Vit Libal, Alvaro Soneiro, Henrik Schulz, Thomas Ross, Gerasimos Potamianos: The IBM Rich Transcription Spring 2006 Speech-to-Text System for Lecture Meetings. 432-443
- Adam Janin, Andreas Stolcke, Xavier Anguera, Kofi Boakye, Özgür Çetin, Joe Frankel, Jing Zheng: The ICSI-SRI Spring 2006 Meeting Recognition System. 444-456
- Lori Lamel, Éric Bilinski, Gilles Adda, Jean-Luc Gauvain, Holger Schwenk: The LIMSI RT06s Lecture Transcription System. 457-468
