6th LREC 2008: Marrakech, Morocco
- Proceedings of the International Conference on Language Resources and Evaluation, LREC 2008, 26 May - 1 June 2008, Marrakech, Morocco. European Language Resources Association 2008
Session O1 - Information Extraction and Question Answering
- Kathrin Eichler, Holmer Hemsen, Günter Neumann:
Unsupervised Relation Extraction From Web Documents.
- Muath Alzghool, Diana Inkpen:
Combining Multiple Models for Speech Information Retrieval.
- Chun-Yuan Teng, Hsin-Hsi Chen:
Event Detection and Summarization in Weblogs with Temporal Collocations.
- Cvetana Krstev, Ranka Stankovic, Dusko Vitas, Ivan Obradovic:
The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines.
Session O2 - LRs: Infrastructures, Projects, Centers
- Steven Bird, Robert Dale, Bonnie J. Dorr, Bryan R. Gibson, Mark Thomas Joseph, Min-Yen Kan, Dongwon Lee, Brett Powley, Dragomir R. Radev, Yee Fan Tan:
The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics.
- Marian Reed, Denise DiPersio, Christopher Cieri:
The Linguistic Data Consortium Member Survey: Purpose, Execution and Results.
- Dieter Van Uytvanck, Alex Dukers, Jacquelijn Ringersma, Paul Trilsbeek:
Language-Sites: Accessing and Presenting Language Resources via Geographic Information Systems.
- Tamás Váradi, Steven Krauwer, Peter Wittenburg, Martin Wynne, Kimmo Koskenniemi:
CLARIN: Common Language Resources and Technology Infrastructure.
Session O3 - Corpus, Lexicon and Evaluation
- Jeroen Geertzen, Volha Petukhova, Harry Bunt:
Evaluating Dialogue Act Tagging with Naive and Expert Annotators.
- Drahomíra "johanka" Spoustová, Pavel Pecina, Jan Hajic, Miroslav Spousta:
Validating the Quality of Full Morphological Annotation.
- Kremena Ivanova, Ulrich Heid, Sabine Schulte im Walde, Adam Kilgarriff, Jan Pomikálek:
Evaluating a German Sketch Grammar: A Case Study on Noun Phrase Case.
- Mark McConville, Myroslava O. Dzikovska:
Evaluating Complement-Modifier Distinctions in a Semantically Annotated Corpus.
Session O4 - Multiparty and non-Verbal Communication
- Petra-Maria Strauß, Holger Hoffmann, Wolfgang Minker, Heiko Neumann, Günther Palm, Stefan Scherer, Harald C. Traue, Ulrich Weidenbacher:
The PIT Corpus of German Multi-Party Dialogues.
- Martine Adda-Decker, Claude Barras, Gilles Adda, Patrick Paroubek, Philippe Boula de Mareüil, Benoit Habert:
Annotation and analysis of overlapping speech in political interviews.
- Nicolas Moreau, Djamel Mostefa, Rainer Stiefelhagen, Susanne Burger, Khalid Choukri:
Data Collection for the CHIL CLEAR 2007 Evaluation Campaign.
- Susanne Burger, Kornel Laskowski, Matthias Wölfel:
A Comparative Cross-Domain Study of the Occurrence of Laughter in Meeting and Seminar Corpora.
Session O5 - Spatio-Temporal Annotation
- Inderjeet Mani, Janet Hitzeman, Justin Richer, Dave Harris, Rob Quimby, Ben Wellner:
SpatialML: Annotation Scheme, Corpora, and Tools.
- Steven Bethard, William J. Corvey, Sara Klingenstein, James H. Martin:
Building a Corpus of Temporal-Causal Structure.
- Alessandra Zarcone, Alessandro Lenci:
Computational Models for Event Type Classification in Context.
- Corina Forascu:
GMT to +2 or how can TimeML be used in Romanian.
- Nianwen Xue, Hua Zhong, Kai-Yun Chen:
Annotating "tense" in a Tense-less Language.
Session O6 - Syntax and Parsing
- Barbara Plank, Khalil Sima'an:
Subdomain Sensitive Statistical Parsing using Raw Corpora.
- Laura Kallmeyer, Timm Lichte, Wolfgang Maier, Yannick Parmentier, Johannes Dellert:
Developing a TT-MCTAG for German with an RCG-based Parser.
- Peter Adolphs, Stephan Oepen, Ulrich Callmeier, Berthold Crysmann, Dan Flickinger, Bernd Kiefer:
Some Fine Points of Hybrid Natural Language Parsing.
- Jeremy Nicholson, Valia Kordoni, Yi Zhang, Timothy Baldwin, Rebecca Dridan:
Evaluating and Extending the Coverage of HPSG Grammars: A Case Study for German.
- Yi Zhang, Valia Kordoni:
Robust Parsing with a Large HPSG Grammar.
Session O7 - Document Classification
- Jahna Otterbacher, Dragomir R. Radev:
Modeling Document Dynamics: an Evolutionary Approach.
- Dominic Widdows, Kathleen Ferraro:
Semantic Vectors: a Scalable Open Source Package and Online Technology Management Application.
- Magnus Rosell, Sumithra Velupillai:
Revealing Relations between Open and Closed Answers in Questionnaires through Text Clustering Evaluation.
- Kim Luyckx, Walter Daelemans:
Personae: a Corpus for Author and Personality Prediction from Text.
- Leanne Spracklin, Diana Inkpen, Amiya Nayak:
Using the Complexity of the Distribution of Lexical Elements as a Feature in Authorship Attribution.
Session O8 - Multimodal Annotation Tools
- Thomas Schmidt, Susan Duncan, Oliver Ehmer, Jeffrey Hoyt, Michael Kipp, Dan Loehr, Magnus Magnusson, R. Travis Rose, Han Sloetjes:
An Exchange Format for Multimodal Annotations.
- Laura Stoia, Darla Magdalena Shockley, Donna K. Byron, Eric Fosler-Lussier:
SCARE: a Situated Corpus with Annotated Referring Expressions.
- Han Sloetjes, Peter Wittenburg:
Annotation by Category: ELAN and ISO DCR.
- Hennie Brugman, Véronique Malaisé, Laura Hollink:
A Common Multimedia Annotation Framework for Cross Linking Cultural Heritage Digital Collections.
- Philippe Blache, Roxane Bertrand, Gaëlle Ferré:
Creating and Exploiting Multimodal Annotated Corpora.
Session O9 - Lexicon, Corpus and Semantics
- Annie Zaenen, Daniel G. Bobrow, Cleo Condoravdi:
The Encoding of lexical implications in VerbNet Predicates of change of locations.
- Aljoscha Burchardt, Marco Pennacchiotti:
FATE: a FrameNet-Annotated Corpus for Textual Entailment.
- Stephen A. Boxwell, Michael White:
Projecting Propbank Roles onto the CCGbank.
- Piek Vossen, Isa Maks, Roxane Segers, Hennie VanderVliet:
Integrating Lexical Units, Synsets and Ontology in the Cornetto Database.
- Javier Álvez, Jordi Atserias, Jordi Carrera, Salvador Climent, Egoitz Laparra, Antoni Oliver, German Rigau:
Complete and Consistent Annotation of WordNet using the Top Concept Ontology.
Session O10 - Multimodal and Speech Data over the Web
- Adrian Popescu, Gregory Grefenstette:
A Conceptual Approach to Web Image Retrieval.
- Gwénolé Lecorvé, Guillaume Gravier, Pascale Sébillot:
On the Use of Web Resources and Natural Language Processing Techniques to Improve Automatic Speech Recognition Systems.
- Stanislas Oger, Georges Linarès, Frédéric Béchet:
Local Methods for On-Demand Out-of-Vocabulary Word Retrieval.
- Marc Kemps-Snijders, Alexander Klassmann, Claus Zinn, Peter Berck, Albert Russel, Peter Wittenburg:
Exploring and Enriching a Language Resource Archive via the Web.
- Florian Schiel, Hannes Mögele:
Talking and Looking: the SmartWeb Multimodal Interaction Corpus.
Session O11 - Coreference and Discourse
- Erhard W. Hinrichs, Monica Lau:
In Contrast - A Complex Discourse Connective.
- Georg Rehm, Marina Santini, Alexander Mehler, Pavel Braslavski, Rüdiger Gleim, Andrea Stubbe, Svetlana Symonenko, Mirko Tavosanis, Vedrana Vidulin:
Towards a Reference Corpus of Web Genres for the Evaluation of Genre Identification Systems.
- Olga Uryupina:
Error Analysis for Learning-based Coreference Resolution.
- Lucie Mladová, Sárka Zikánová, Eva Hajicová:
From Sentence to Discourse: Building an Annotation Scheme for Discourse Based on Prague Dependency Treebank.
- David Day, Janet Hitzeman, Michael L. Wick, Keith Crouch, Massimo Poesio:
A Corpus for Cross-Document Co-reference.
Session O12 - Named Entity Recognition
- Antonio Toral, Rafael Muñoz, Monica Monachini:
Named Entity WordNet.
- Cristina Mota, Ralph Grishman:
Is this NE tagger getting old?
- Benjamin Farber, Dayne Freitag, Nizar Habash, Owen Rambow:
Improving NER in Arabic Using a Morphological Tagger.
- Stephan Busemann, Yajing Zhang:
Identifying Foreign Person Names in Chinese Text.
- Marius Pasca:
Low-Complexity Heuristics for Deriving Fine-Grained Classes of Named Entities from Web Textual Data.
Session O13 - Parallel and Multilingual Resources
- Jinji Li, Dong-Il Kim, Jong-Hyeok Lee:
Annotation Guidelines for Chinese-Korean Word Alignment.
- Ondrej Bojar, Miroslav Janícek, Zdenek Zabokrtský, Pavel Ceska, Peter Bena:
CzEng 0.7: Parallel Corpus with Community-Supplied Translations.
- Jonathan Clark, Robert E. Frederking, Lori S. Levin:
Toward Active Learning in Data Selection: Automatic Discovery of Language Features During Elicitation.
- Michael Mohler, Rada Mihalcea:
Babylon Parallel Text Builder: Gathering Parallel Texts for Low-Density Languages.
Session O14 - Evaluation Tools and Methodologies
- Cong-Phap Huynh, Christian Boitet, Hervé Blanchon:
SECTra_w.1: an Online Collaborative System for Evaluating, Post-editing and Presenting MT Translation Corpora.
- Mark Arehart, Chris Wolf, Keith J. Miller:
Adjudicator Agreement and System Rankings for Person Name Search.
- Paulo C. F. de Oliveira, Edson Wilson Torrens, Alexandre Cidral, Sidney Schossland, Evandro Bittencourt:
Evaluating Summaries Automatically - A system Proposal.
- Thierry Poibeau, Cédric Messiant:
Do we Still Need Gold Standards for Evaluation?
Session O15 - LRs: Large Programs, Policies, Strategies
- Peter Spyns, Elisabeth D'Halleweyn, Catia Cucchiarini:
The Dutch-Flemish Comprehensive Approach to HLT Stimulation and Innovation: STEVIN, HLT Agency and beyond.
- Christopher Cieri, Mark Liberman:
15 Years of Language Resource Creation and Sharing: a Progress Report on LDC Activities.
- Anil Kumar Singh, Kiran Pala, Harshit Surana:
Estimating the Resource Adaption Cost from a Resource Rich Language to a Similar Resource Poor Language.
- Valérie Mapelli, Victoria Arranz, Hélène Mazo, Khalid Choukri:
Latest Developments in ELRA's Services.
- Carol Peters, Martin Braschler, Giorgio Maria Di Nunzio, Nicola Ferro, Julio Gonzalo, Mark Sanderson:
From Research to Application in Multilingual Information Access: the Contribution of Evaluation.
Session O16 - Biomedical Resources
- Scott Piao, John McNaught, Sophia Ananiadou:
Clustering Related Terms with Definitions.
- Ngan L. T. Nguyen, Jin-Dong Kim, Jun'ichi Tsujii:
Challenges in Pronoun Resolution System for Biomedical Text.
- Barry Haddow, Beatrice Alex:
Exploiting Multiply Annotated Corpora in Biomedical Information Extraction Tasks.
- Yuka Tateisi, Yusuke Miyao, Kenji Sagae, Jun'ichi Tsujii:
GENIA-GR: a Grammatical Relation Corpus for Parser Evaluation in the Biomedical Domain.
- Xinglong Wang, Claire Grover:
Learning the Species of Biomedical Named Entities from Annotated Corpora.
Session O17 - Semantics in Lexicons and Corpora
- Tony Veale, Yanfen Hao:
Acquiring Naturalistic Concept Descriptions from the Web.
- Ulrich Heid, Marion Weller:
Tools for Collocation Extraction: Preferences for Active vs. Passive.
- Francis Bond, Hitoshi Isahara, Kyoko Kanzaki, Kiyotaka Uchimoto:
Boot-Strapping a WordNet Using Multiple Existing WordNets.
- Bartosz Broda, Magdalena Derwojedowa, Maciej Piasecki, Stan Szpakowicz:
Corpus-based Semantic Relatedness for the Construction of Polish WordNet.
- Rafiya Begum, Samar Husain, Lakshmi Bai, Dipti Misra Sharma:
Developing Verb Frames for Hindi.
Session O18 - Affect and Emotion in Speech
- Katherine Forbes-Riley, Diane J. Litman, Scott Silliman, Amruta Purandare:
Uncertainty Corpus: Resource to Study User Affect in Complex Spoken Dialogue Systems.
- Milan Gnjatovic, Dietmar F. Rösner:
On the Role of the NIMITEK Corpus in Developing an Emotion Adaptive Spoken Dialogue System.
- Stefan Scherer, Hansjörg Hofmann, Malte Lampmann, Martin Pfeil, Steffen Rhinow, Friedhelm Schwenker, Günther Palm:
Emotion Recognition from Speech: Stress Experiment.
- Laure Charonnat, Gaëlle Vidal, Olivier Boëffard:
Automatic Phone Segmentation of Expressive Speech.
- Márk Fék, Nicolas Audibert, János Szabó, Albert Rilliard, Géza Németh, Véronique Aubergé:
Multimodal Spontaneous Expressive Speech Corpus for Hungarian.
Session O19 - Opinion Mining and Summarization
- Wei-Hao Lin, Alexander G. Hauptmann:
Vox Populi Annotation: Measuring Intensity of Ideological Perspectives by Aggregating Group Judgments.
- Carmen Banea, Rada Mihalcea, Janyce Wiebe:
A Bootstrapping Method for Building Subjectivity Lexicons for Languages with Scarce Resources.
- Josef Ruppenhofer, Swapna Somasundaran, Janyce Wiebe:
Finding the Sources and Targets of Subjective Expressions.
- Veselin Stoyanov, Claire Cardie:
Annotating Topics of Opinions.
- Zhuli Xie, Barbara Di Eugenio, Peter C. Nelson:
From Extracting to Abstracting: Generating Quasi-abstractive Summaries.
Session O20 - Coreference and Discourse
- Jette Viethen, Robert Dale, Emiel Krahmer, Mariët Theune, Pascal Touset:
Controlling Redundancy in Referring Expressions.
- Massimo Poesio, Ron Artstein:
Anaphoric Annotation in the ARRAU Corpus.
- Mark-Christoph Müller, Margot Mieskes, Michael Strube:
Knowledge Sources for Bridging Resolution in Multi-Party Dialog.
- Rashmi Prasad, Nikhil Dinesh, Alan Lee, Eleni Miltsakaki, Livio Robaldo, Aravind K. Joshi, Bonnie L. Webber:
The Penn Discourse TreeBank 2.0.
- Iris Hendrickx, Gosse Bouma, Frederik Coppens, Walter Daelemans, Véronique Hoste, Geert Kloosterman, Anne-Marie Mineur, Joeri Van Der Vloet, Jean-Luc Verschelde:
A Coreference Corpus and Resolution System for Dutch.
Session O21 - Semantic Resources and Acquisition
- Kirk Baker, Chris Brew:
Statistical Identification of English Loanwords in Korean Using Automatically Generated Training Data.
- Diana Trandabat, Maria Husarciuc:
Romanian Semantic Role Resource.
- Alessandro Lenci, Barbara McGillivray, Simonetta Montemagni, Vito Pirrelli:
Unsupervised Acquisition of Verb Subcategorization Frames from Shallow-Parsed Corpora.
- Daisuke Kawahara, Kiyotaka Uchimoto:
A Method for Automatically Constructing Case Frames for English.
- Núria Bel, Sergio Espeja, Montserrat Marimon:
Automatic Acquisition for low frequency lexical items.
Session O22 - Speaker and Dialect Identification
- Doroteo T. Toledano, Daniel Hernández López, Cristina Esteve-Elizalde, Julian Fiérrez, Javier Ortega-Garcia, Daniel Ramos, Joaquin Gonzalez-Rodriguez:
BioSec Multimodal Biometric Database in Text-Dependent Speaker Recognition.
- Iker Luengo, Eva Navas, Iñaki Sainz, Ibon Saratxaga, Jon Sánchez, Igor Odriozola, Inma Hernáez:
Text Independent Speaker Identification in Multilingual Environments.
- Udhyakumar Nallasamy, Alan W. Black, Tanja Schultz, Robert E. Frederking:
NineOneOne: Recognizing and Classifying Speech for Handling Minority Language Emergency Calls.
- Christopher Cieri, Stephanie M. Strassel, Meghan Lammie Glenn, Reva Schwartz, Wade Shen, Joseph P. Campbell:
Bridging the Gap between Linguists and Technology Developers: Large-Scale, Sociolinguistic Annotation for Dialect and Speaker Recognition.
- Linda Brandschain, Christopher Cieri, David Graff, Abby Neely, Kevin Walker:
Speaker Recognition: Building the Mixer 4 and 5 Corpora.
Session O23 - Corpus Annotation and Classification
- Nancy Ide, Collin F. Baker, Christiane Fellbaum, Charles J. Fillmore, Rebecca J. Passonneau:
MASC: the Manually Annotated Sub-Corpus of American English.
- Chu-Ren Huang, Lung-Hao Lee, Jia-Fei Hong, Weiguang Qu, Shiwen Yu:
Quality Assurance of Automatic Annotation of Very Large Corpora: a Study based on heterogeneous Tagging System.
- Claire Cardie, Cynthia Farina, Matt Rawding, Adil Aijaz:
An eRulemaking Corpus: Identifying Substantive Issues in Public Comments.
- Branimir Boguraev, Mary S. Neff:
Navigating through Dense Annotation Spaces.
- David Guthrie, Louise Guthrie, Yorick Wilks:
An Unsupervised Probabilistic Approach for the Detection of Outliers in Corpora.
Session O24 - Machine Translation and Multilinguality
- Michael Carl:
Using Log-linear Models for Tuning Machine Translation Output.
- Bogdan Babych, Serge Sharoff, Anthony Hartley:
Generalising Lexical Translation Strategies for MT Using Comparable Corpora.
- Masaki Itagaki, Takako Aikawa:
Post-MT Term Swapper: Supplementing a Statistical Machine Translation System with a User Dictionary.
- Germán Sanchis-Trilles, Joan-Andreu Sánchez:
Using Parsed Corpora for Estimating Stochastic Inversion Transduction Grammars.
- Mark Fishel, Heiki-Jaan Kaalep:
Experiments on Processing Overlapping Parallel Corpora.
Session O25 - Evaluation
- Jennifer Foster, Josef van Genabith:
Parser Evaluation and the BNC: Evaluating 4 constituency parsers with 3 metrics.
- Patrick Paroubek, Isabelle Robba, Anne Vilnat, Christelle Ayache:
EASY, Evaluation of Parsers of French: what are the Results?
- Xavier Tannier, Philippe Muller:
Evaluation Metrics for Automatic Temporal Annotation of Texts.
- Lena Grothe, Ernesto William De Luca, Andreas Nürnberger:
A Comparative Study on Language Identification Methods.
- Éric Villemonte de la Clergerie, Olivier Hamon, Djamel Mostefa, Christelle Ayache, Patrick Paroubek, Anne Vilnat:
PASSAGE: from French Parser Evaluation to Large Sized Treebank.
Session O26 - Broadcast News Processing
- Jáchym Kolár, Jan Svec:
Structural Metadata Annotation of Speech Corpora: Comparing Broadcast News and Broadcast Conversations.
- Markpong Jongtaveesataporn, Chai Wutiwiwatchai, Koji Iwano, Sadaoki Furui:
Thai Broadcast News Corpus Construction and Evaluation.
- Ingunn Amdal, Ole Morten Strand, Jørn Almberg, Torbjørn Svendsen:
RUNDKAST: an Annotated Norwegian Broadcast News Speech Corpus.
- Sopheap Seng, Sethserey Sam, Laurent Besacier, Brigitte Bigi, Eric Castelli:
First Broadcast News Transcription System for Khmer Language.
- Chomicha Bendahman, Meghan Lammie Glenn, Djamel Mostefa, Niklas Paulsson, Stephanie M. Strassel:
Quick Rich Transcriptions of Arabic Broadcast News Speech Data.
Session O27 - Ontologies
- Dennis Spohr:
A General Methodology for Mapping EuroWordNets to the Suggested Upper Merged Ontology.
- Satoshi Sekine:
Extended Named Entity Ontology with Attribute Information.
- Mari Carmen Suárez-Figueroa,