


default search action
2nd AND 2008: Singapore (SIGIR Workshop)
- Daniel P. Lopresti, Shourya Roy, Klaus U. Schulz, L. Venkata Subramaniam:

Proceedings of the Second Workshop on Analytics for Noisy Unstructured Text Data, AND 2008, Singapore, July 24, 2008. ACM International Conference Proceeding Series 303, ACM 2008, ISBN 978-1-60558-196-5 - Donna Harman:

Some thoughts on failure analysis for noisy data. - John Tait:

Noise and information. - Laurianne Sitbon

, Patrice Bellot
:
How to cope with questions typed by dyslexic users. 1-8 - Daniel P. Lopresti

:
Optical character recognition errors and their effects on natural language processing. 9-16 - Ulrich Reffle, Annette Gotscharek, Christoph Ringlstetter, Klaus U. Schulz:

Successfully detecting and correcting false friends using channel profiles. 17-22 - Valentin Jijkoun, Mahboob Alam Khalid, Maarten Marx

, Maarten de Rijke
:
Named entity normalization in user generated content. 23-30 - Rema Ananthanarayanan, Vijil Chenthamarakshan, Prasad M. Deshpande, Raghuram Krishnapuram:

Rule based synonyms for entity extraction from noisy text. 31-38 - Jiyin He, Wouter Weerkamp, Martha A. Larson, Maarten de Rijke

:
Blogger, stick to your story: modeling topical noise in blogs with coherence measures. 39-46 - Robert McArthur:

Uncovering deep user context from blogs. 47-54 - Jinfeng Zhuang, Steven C. H. Hoi

, Aixin Sun
:
On profiling blogs with representative entries. 55-62 - Soumya Datta, Sudeshna Sarkar

:
A comparative study of statistical features of language in blogs-vs-splogs. 63-66 - Sreangsu Acharyya, Sumit Negi, L. Venkata Subramaniam, Shourya Roy:

Unsupervised learning of multilingual short message service (SMS) dialect from noisy examples. 67-74 - Antti Järvelin, Tuomas Talvensaari, Anni Järvelin:

Data driven methods for improving mono- and cross-lingual IR performance in noisy environments. 75-82 - Lipika Dey, S. K. Mirajul Haque:

Opinion mining from noisy text data. 83-90 - Rachit Arora, Balaraman Ravindran

:
Latent dirichlet allocation based multi-document summarization. 91-97 - Amaresh Kumar Pandey, Tanveer J. Siddiqui

:
An unsupervised Hindi stemmer with heuristic improvements. 99-105 - Anurag Bhardwaj, Faisal Farooq, Huaigu Cao, Venu Govindaraju:

Topic based language models for OCR correction. 107-112 - Eiman Tamah Al-Shammari

, Jessica Lin
:
A novel Arabic lemmatization algorithm. 113-118

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














