


INTERSPEECH 2004: Jeju Island, Korea
- 8th International Conference on Spoken Language Processing, INTERSPEECH-ICSLP 2004, Jeju Island, Korea, October 4-8, 2004. ISCA 2004

Plenary Talks
- Chin-Hui Lee:

From decoding-driven to detection-based paradigms for automatic speech recognition. - Hyun-Bok Lee:

In search of a universal phonetic alphabet - theory and application of an organic visible speech. - Jacqueline Vaissière:

From X-ray or MRI data to sounds through articulatory synthesis: towards an integrated view of the speech communication process.
Speech Recognition - Adaptation
- Sreeram Balakrishnan, Karthik Visweswariah, Vaibhava Goel:

Stochastic gradient adaptation of front-end parameters. 1-4 - Antoine Raux, Rita Singh:

Maximum-likelihood adaptation of semi-continuous HMMs by latent variable decomposition of state distributions. 5-8 - Chao Huang, Tao Chen, Eric Chang:

Transformation and combination of hidden Markov models for speaker selection training. 9-12 - Brian Kan-Wing Mak, Roger Wend-Huu Hsiao:

Improving eigenspace-based MLLR adaptation by kernel PCA. 13-16 - Nikos Chatzichrisafis, Vassilios Digalakis, Vassilios Diakoloukas, Costas Harizakis:

Rapid acoustic model development using Gaussian mixture clustering and language adaptation. 17-20 - Karthik Visweswariah, Ramesh A. Gopinath:

Adaptation of front end parameters in a speech recognizer. 21-24 - Diego Giuliani, Matteo Gerosa, Fabio Brugnara:

Speaker normalization through constrained MLLR based transforms. 2893-2896 - Xiangyu Mu, Shuwu Zhang, Bo Xu:

Multi-layer structure MLLR adaptation algorithm with subspace regression classes and tying. 2897-2900 - Georg Stemmer, Stefan Steidl, Christian Hacker, Elmar Nöth:

Adaptation in the pronunciation space for non-native speech recognition. 2901-2904 - Xuechuan Wang, Douglas D. O'Shaughnessy:

Robust ASR model adaptation by feature-based statistical data mapping. 2905-2908 - Zhaobing Han, Shuwu Zhang, Bo Xu:

A novel target-driven generalized JMAP adaptation algorithm. 2909-2912 - Brian Mak, Simon Ka-Lung Ho, James T. Kwok:

Speedup of kernel eigenvoice speaker adaptation by embedded kernel PCA. 2913-2916 - Hyung Bae Jeon, Dong Kook Kim:

Maximum a posteriori eigenvoice speaker adaptation for Korean connected digit recognition. 2917-2920 - Wei Wang, Stephen A. Zahorian:

Vocal tract normalization based on spectral warping. 2921-2924 - Koji Tanaka, Fuji Ren, Shingo Kuroiwa, Satoru Tsuge:

Acoustic model adaptation for coded speech using synthetic speech. 2925-2928 - Motoyuki Suzuki, Hirokazu Ogasawara, Akinori Ito, Yuichi Ohkawa, Shozo Makino:

Speaker adaptation method for CALL system using bilingual speakers' utterances. 2929-2932 - Shinji Watanabe:

Acoustic model adaptation based on coarse/fine training of transfer vectors and its application to a speaker adaptation task. 2933-2936 - Wei-Ho Tsai, Shih-Sian Cheng, Hsin-Min Wang:

Speaker clustering of speech utterances using a voice characteristic reference space. 2937-2940 - Young Kuk Kim, Hwa Jeon Song, Hyung Soon Kim:

Performance improvement of connected digit recognition using unsupervised fast speaker adaptation. 2941-2944 - Hyung Soon Kim, Hwa Jeon Song:

Simultaneous estimation of weights of eigenvoices and bias compensation vector for rapid speaker adaptation. 2945-2948 - Matthias Wölfel:

Speaker dependent model order selection of spectral envelopes. 2949-2952 - Enrico Bocchieri, Michael Riley, Murat Saraclar:

Methods for task adaptation of acoustic models with limited transcribed in-domain data. 2953-2956 - Atsushi Fujii, Tetsuya Ishikawa, Katsunobu Itou, Tomoyosi Akiba:

Unsupervised topic adaptation for lecture speech retrieval. 2957-2960 - Haibin Liu, Zhenyang Wu:

Mean and covariance adaptation based on minimum classification error linear regression for continuous density HMMs. 2961-2964 - Goshu Nagino, Makoto Shozakai:

Design of ready-made acoustic model library by two-dimensional visualization of acoustic space. 2965-2968
Spoken Language Identification, Translation and Retrieval I
- Jean-Luc Gauvain, Abdelkhalek Messaoudi, Holger Schwenk:

Language recognition using phone lattices. 25-28 - Mark A. Huckvale:

ACCDIST: a metric for comparing speakers' accents. 29-32 - Michael Levit, Allen L. Gorin, Patrick Haffner, Hiyan Alshawi, Elmar Nöth:

Aspects of named entity processing. 33-36 - Josep Maria Crego, José B. Mariño, Adrià de Gispert:

Finite-state-based and phrase-based statistical machine translation. 37-40 - Tanja Schultz, Szu-Chen Stan Jou, Stephan Vogel, Shirin Saleem:

Using word lattice information for a tighter coupling in speech translation systems. 41-44 - Teruhisa Misu, Tatsuya Kawahara, Kazunori Komatani:

Confirmation strategy for document retrieval systems with spoken dialog interface. 45-48 - Shi-wook Lee, Kazuyo Tanaka, Yoshiaki Itoh:

Multilayer subword units for open-vocabulary spoken document retrieval. 1553-1556 - Yoshiaki Itoh, Kazuyo Tanaka, Shi-wook Lee:

An efficient partial matching algorithm toward speech retrieval by speech. 1557-1560 - Celestin Sedogbo, Sébastien Herry, Bruno Gas, Jean-Luc Zarader:

Language detection by neural discrimination. 1561-1564 - Ricardo de Córdoba, Javier Ferreiros, Valentín Sama, Javier Macías Guarasa, Luis Fernando D'Haro, Fernando Fernández Martínez:

Language identification techniques based on full recognition in an air traffic control task. 1565-1568 - John H. L. Hansen, Umit H. Yapanel, Rongqing Huang, Ayako Ikeno:

Dialect analysis and modeling for automatic classification. 1569-1572 - Emmanuel Ferragne, François Pellegrino:

Rhythm in read British English: interdialect variability. 1573-1576 - Pascale Fung, Yi Liu, Yongsheng Yang, Yihai Shen, Dekai Wu:

A grammar-based Chinese to English speech translation system for portable devices. 1577-1580 - Gökhan Tür:

Cost-sensitive call classification. 1581-1584 - Mikko Kurimo, Ville T. Turunen, Inger Ekman:

An evaluation of a spoken document retrieval baseline system in Finnish. 1585-1588 - Hui Jiang, Pengfei Liu, Imed Zitouni:

Discriminative training of naive Bayes classifiers for natural language call routing. 1589-1592 - Nicolas Moreau, Hyoung-Gook Kim, Thomas Sikora:

Phonetic confusion based document expansion for spoken document retrieval. 1593-1596 - Euisok Chung, Soojong Lim, Yi-Gyu Hwang, Myung-Gil Jang:

Hybrid named entity recognition for question-answering system. 1597-1600 - Jitendra Ajmera, Iain McCowan, Hervé Bourlard:

An online audio indexing system. 1601-1604 - Eric Sanders, Febe de Wet:

Histogram normalisation and the recognition of names and ontology words in the MUMIS project. 1605-1608 - Rui Amaral, Isabel Trancoso:

Improving the topic indexation and segmentation modules of a media watch system. 1609-1612 - Melissa Barkat-Defradas, Rym Hamdi, Emmanuel Ferragne, François Pellegrino:

Speech timing and rhythmic structure in Arabic dialects: a comparison of two approaches. 1613-1616 - Hsin-Min Wang, Shih-Sian Cheng:

METRIC-SEQDAC: a hybrid approach for audio segmentation. 1617-1620 - Jen-Wei Kuo, Yao-Min Huang, Berlin Chen, Hsin-Min Wang:

Statistical Chinese spoken document retrieval using latent topical information. 1621-1624 - Masahiko Matsushita, Hiromitsu Nishizaki, Seiichi Nakagawa, Takehito Utsuro:

Keyword recognition and extraction by multiple-LVCSRs with 60,000 words in speech-driven WEB retrieval task. 1625-1628 - Ruiqiang Zhang, Gen-ichiro Kikui, Hirofumi Yamamoto, Frank K. Soong, Taro Watanabe, Eiichiro Sumita, Wai Kit Lo:

Improved spoken language translation using n-best speech recognition hypotheses. 1629-1632 - Kakeung Wong, Man-Hung Siu:

Automatic language identification using discrete hidden Markov model. 1633-1636 - Bowen Zhou, Daniel Déchelotte, Yuqing Gao:

Two-way speech-to-speech translation on handheld devices. 1637-1640 - Hervé Blanchon:

HLT modules scalability within the NESPOLE! project. 1641-1644
Linguistics, Phonology, and Phonetics
- Midam Kim:

Correlation between VOT and F0 in the perception of Korean stops and affricates. 49-52 - Aude Noiray, Lucie Ménard, Marie-Agnès Cathiard, Christian Abry, Christophe Savariaux:

The development of anticipatory labial coarticulation in French: a pioneering study. 53-56 - Melvyn John Hunt:

Speech recognition, syllabification and statistical phonetics. 57-60 - Jilei Tian:

Data-driven approaches for automatic detection of syllable boundaries. 61-64 - Anne Cutler, Dennis Norris, Núria Sebastián-Gallés:

Phonemic repertoire and similarity within the vocabulary. 65-68 - Sameer Maskey, Alan W. Black, Laura Tomokiyo:

Bootstrapping phonetic lexicons for new languages. 69-72 - Mirjam Broersma, K. Marieke Kolkman:

Lexical representation of non-native phonemes. 1241-1244 - Jong-Pyo Lee, Tae-Yeoub Jang:

A comparative study on the production of inter-stress intervals of English speech by English native speakers and Korean speakers. 1245-1248 - Emi Zuiki Murano, Mihoko Teshigawara:

Articulatory correlates of voice qualities of good guys and bad guys in Japanese anime: an MRI study. 1249-1252 - Sorin Dusan:

Effects of phonetic contexts on the duration of phonetic segments in fluent read speech. 1253-1256 - Qiang Fang:

A study on nasal coda loss in continuous speech. 1257-1260 - Hua-Li Jian:

An improved pair-wise variability index for comparing the timing characteristics of speech. 1261-1264 - Hua-Li Jian:

An acoustic study of speech rhythm in Taiwan English. 1265-1268 - Sung-A. Kim:

Language specific phonetic rules: evidence from domain-initial strengthening. 1269-1272 - Hansang Park:

Spectral characteristics of the release bursts in Korean alveolar stops. 1273-1276 - Rob van Son, Olga Bolotova, Louis C. W. Pols, Mietta Lennes:

Frequency effects on vowel reduction in three typologically different languages (Dutch, Finnish, Russian). 1277-1280 - Julia Abresch, Stefan Breuer:

Assessment of non-native phones in anglicisms by German listeners. 1281-1284 - Sunhee Kim:

Phonology of exceptions for Korean grapheme-to-phoneme conversion. 1285-1288 - Shigeyoshi Kitazawa, Shinya Kiriyama:

Acoustic and prosodic analysis of Japanese vowel-vowel hiatus with laryngeal effect. 1289-1292 - Kimiko Tsukada:

A cross-linguistic acoustic comparison of unreleased word-final stops: Korean and Thai. 1293-1296 - Taehong Cho, Elizabeth K. Johnson:

Acoustic correlates of phrase-internal lexical boundaries in Dutch. 1297-1300 - Taehong Cho, James M. McQueen:

Phonotactics vs. phonetic cues in native and non-native listening: Dutch and Korean listeners' perception of Dutch and English. 1301-1304 - Svetlana Kaminskaia, François Poiré:

Comparing intonation of two varieties of French using normalized F0 values. 1305-1308 - Mira Oh, Kee-Ho Kim:

Phonetic realization of the suffix-suppressed accentual phrase in Korean. 1309-1312 - H. Timothy Bunnell, James B. Polikoff, Jane McNicholas:

Spectral moment vs. Bark cepstral analysis of children's word-initial voiceless stops. 1313-1316 - Nobuaki Minematsu:

Pronunciation assessment based upon the compatibility between a learner's pronunciation structure and the target language's lexical structure. 1317-1320 - Kenji Yoshida:

Spread of high tone in Akita Japanese. 1321-1324
Biomedical Applications of Speech Analysis
- Juan Ignacio Godino-Llorente, María Victoria Rodellar Biarge, Pedro Gómez-Vilda, Francisco Díaz Pérez, Agustín Álvarez-Marquina, Rafael Martínez-Olalla:

Biomechanical parameter fingerprint in the mucosal wave power spectral density. 73-76 - Cheolwoo Jo, Soo-Geon Wang, Byung-Gon Yang, Hyung-Soon Kim, Tao Li:

Classification of pathological voice including severely noisy cases. 77-80 - Qiang Fu, Peter Murphy:

A robust glottal source model estimation technique. 81-84 - Hiroki Mori, Yasunori Kobayashi, Hideki Kasuya, Hajime Hirose, Noriko Kobayashi:

F0 and formant frequency distribution of dysarthric speech - a comparative study. 85-88 - Hideki Kawahara, Yumi Hirachi, Masanori Morise, Hideki Banno:

Procedure "senza vibrato": a key component for morphing singing. 89-92 - Claudia Manfredi, Giorgio Peretti, Laura Magnoni, Fabrizio Dori, Ernesto Iadanza:

Thyroplastic medialisation in unilateral vocal fold paralysis: assessing voice quality recovering. 93-96 - Gernot Kubin, Martin Hagmüller:

Voice enhancement of male speakers with laryngeal neoplasm. 541-544 - Jong Min Choi, Myung-Whun Sung, Kwang Suk Park, Jeong-Hun Hah:

A comparison of the perturbation analysis between PRAAT and Computerized Speech Lab. 545-548
Robust Speech Recognition on AURORA
- Ji Ming, Baochun Hou:

Evaluation of universal compensation on Aurora 2 and 3 and beyond. 97-100 - Hugo Van hamme:

PROSPECT features and their application to missing data techniques for robust speech recognition. 101-104 - Hugo Van hamme, Patrick Wambacq, Veronique Stouten:

Accounting for the uncertainty of speech estimates in the context of model-based feature enhancement. 105-108 - Hans-Günter Hirsch, Harald Finster:

Applying the Aurora feature extraction schemes to a phoneme based recognition task. 109-112 - Zhipeng Zhang, Tomoyuki Ohya, Sadaoki Furui:

Evaluation of tree-structured piecewise linear transformation-based noise adaptation on AURORA2 database. 113-116 - Tor André Myrvoll, Satoshi Nakamura:

Online minimum mean square error filtering of noisy cepstral coefficients using a sequential EM algorithm. 117-120 - Akira Sasou, Kazuyo Tanaka, Satoshi Nakamura, Futoshi Asano:

HMM-based feature compensation method: an evaluation using the AURORA2. 121-124 - Xuechuan Wang, Douglas D. O'Shaughnessy:

Noise adaptation for robust AURORA 2 noisy digit recognition using statistical data mapping. 125-128 - Benjamin J. Shannon, Kuldip K. Paliwal:

MFCC computation from magnitude spectrum of higher lag autocorrelation coefficients for robust speech recognition. 129-132 - Muhammad Ghulam, Takashi Fukuda, Junsei Horikawa, Tsuneo Nitta:

A noise-robust feature extraction method based on pitch-synchronous ZCPA for ASR. 133-136 - José C. Segura, Ángel de la Torre, Javier Ramírez, Antonio J. Rubio, M. Carmen Benítez:

Including uncertainty of speech observations in robust speech recognition. 137-140 - Takeshi Yamada, Jiro Okada, Nobuhiko Kitawaki:

Integration of n-best recognition results obtained by multiple noise reduction algorithms. 141-144 - Panji Setiawan, Sorel Stan, Tim Fingscheidt:

Revisiting some model-based and data-driven denoising algorithms in Aurora 2 context. 145-148 - Guo-Hong Ding, Bo Xu:

Exploring high-performance speech recognition in noisy environments using high-order Taylor series expansion. 149-152 - Wing-Hei Au, Man-Hung Siu:

A robust training algorithm based on neighborhood information. 153-156 - Siu Wa Lee, Pak-Chung Ching:

In-phase feature induction: an effective compensation technique for robust speech recognition. 157-160 - Jeff Siu-Kei Au-Yeung, Man-Hung Siu:

Improved performance of Aurora 4 using HTK and unsupervised MLLR adaptation. 161-164 - Shang-nien Tsai, Lin-Shan Lee:

A new feature extraction front-end for robust speech recognition using progressive histogram equalization and multi-eigenvector temporal filtering. 165-168
Spoken / Multimodal Dialogue System
- Christian Fügen, Hartwig Holzapfel, Alex Waibel:

Tight coupling of speech recognition and dialog management - dialog-context dependent grammar weighting for speech recognition. 169-172 - Akinobu Lee, Keisuke Nakamura, Ryuichi Nisimura, Hiroshi Saruwatari, Kiyohiro Shikano:

Noise robust real world spoken dialogue system using GMM based rejection of unintended inputs. 173-176 - Hironori Oshikawa, Norihide Kitaoka, Seiichi Nakagawa:

Speech interface for name input based on combination of recognition methods using syllable-based n-gram and word dictionary. 177-180 - Imed Zitouni, Minkyu Lee, Hui Jiang:

Constrained minimization technique for topic identification using discriminative training and support vector machines. 181-184 - Jason D. Williams, Steve J. Young:

Characterizing task-oriented dialog using a simulated ASR channel. 185-188 - Takashi Konashi, Motoyuki Suzuki, Akinori Ito, Shozo Makino:

A spoken dialog system based on automatic grammar generation and template-based weighting for autonomous mobile robots. 189-192 - Akinori Ito, Takanobu Oba, Takashi Konashi, Motoyuki Suzuki, Shozo Makino:

Noise adaptive spoken dialog system based on selection of multiple dialog strategies. 193-196 - Mikko Hartikainen, Markku Turunen, Jaakko Hakulinen, Esa-Pekka Salonen, J. Adam Funk:

Flexible dialogue management using distributed and dynamic dialogue control. 197-200 - Keith Houck:

Contextual revision in information seeking conversation systems. 201-204 - Ian M. O'Neill, Philip Hanna, Xingkun Liu, Michael F. McTear:

Cross domain dialogue modelling: an object-based approach. 205-208 - Hirohiko Sagawa, Teruko Mitamura, Eric Nyberg:

A comparison of confirmation styles for error handling in a speech dialog system. 209-212 - Fan Yang, Peter A. Heeman:

Using computer simulation to compare two models of mixed-initiative. 213-216 - Fan Yang, Peter A. Heeman, Kristy Hollingshead:

Towards understanding mixed-initiative in task-oriented dialogues. 217-220 - Peter Wolf, Joseph Woelfel, Jan C. van Gemert, Bhiksha Raj, David Wong:

SpokenQuery: an alternate approach to choosing items with speech. 221-224 - Shona Douglas, Deepak Agarwal, Tirso Alonso, Robert M. Bell, Mazin G. Rahim, Deborah F. Swayne, Chris Volinsky:

Mining customer care dialogs for "daily news". 225-228 - Jens Edlund, Gabriel Skantze, Rolf Carlson:

Higgins - a spoken dialogue system for investigating error handling techniques. 229-232 - Fuliang Weng, Lawrence Cavedon, Badri Raghunathan, Danilo Mirkovic, Hua Cheng, Hauke Schmidt, Harry Bratt, Rohit Mishra, Stanley Peters, Sandra Upson, Elizabeth Shriberg, Carsten Bergmann, Lin Zhao:

A conversational dialogue system for cognitively overloaded users. 233-236 - Gerhard Hanrieder, Stefan W. Hamerich:

Modeling generic dialog applications for embedded systems. 237-240 - Matthew N. Stuttle, Jason D. Williams, Steve J. Young:

A framework for dialogue data collection with a simulated ASR channel. 241-244 - Shimei Pan:

A multi-layer conversation management approach for information seeking applications. 245-248 - Thomas K. Harris, Roni Rosenfeld:

A universal speech interface for appliances. 249-252 - Keita Hayashi, Yuki Irie, Yukiko Yamaguchi, Shigeki Matsubara, Nobuo Kawaguchi:

Speech understanding, dialogue management and response generation in corpus-based spoken dialogue system. 253-256 - Fernando Fernández Martínez, Valentín Sama, Luis Fernando D'Haro, Rubén San Segundo, Ricardo de Córdoba, Juan Manuel Montero:

Implementation of dialog applications in an open-source voiceXML platform. 257-260 - Chun Wai Lau, Bin Ma, Helen Mei-Ling Meng, Yiu Sang Moon, Yeung Yam:

Fuzzy logic decision fusion in a multimodal biometric system. 261-264 - Peter Poller, Norbert Reithinger:

A state model for the realization of visual perceptive feedback in SmartKom. 265-268 - Akemi Iida, Yoshito Ueno, Ryohei Matsuura, Kiyoaki Aikawa:

A vector-based method for efficiently representing multivariate environmental information. 269-272 - Ioannis Toptsis, Shuyin Li, Britta Wrede, Gernot A. Fink:

A multi-modal dialog system for a mobile robot. 273-276 - Niels Ole Bernsen, Laila Dybkjær:

Structured interview-based evaluation of spoken multimodal conversation with H. C. Andersen. 277-280
Speech Recognition - Search
- Miroslav Novak, Vladimír Bergl:

Memory efficient decoding graph compilation with wide cross-word acoustic context. 281-284 - Dongbin Zhang, Limin Du:

Dynamic beam pruning strategy using adaptive control. 285-288 - Takaaki Hori, Chiori Hori, Yasuhiro Minami:

Fast on-the-fly composition for weighted finite-state transducers in 1.8 million-word vocabulary continuous speech recognition. 289-292 - Peng Yu, Frank Torsten Bernd Seide:

A hybrid word / phoneme-based approach for improved vocabulary-independent search in spontaneous speech. 293-296 - Lubos Smídl, Ludek Müller:

Keyword spotting for highly inflectional languages. 297-300 - Frédéric Tendeau:

Optimizing an engine network that allows dynamic masking. 301-304
Spoken Dialogue and Systems
- Katsutoshi Ohtsuki, Nobuaki Hiroshima, Yoshihiko Hayashi, Katsuji Bessho, Shoichi Matsunaga:

Topic structure extraction for meeting indexing. 305-308 - Sophie Rosset, Lori Lamel:

Automatic detection of dialog acts based on multilevel information. 309-312 - Gina-Anne Levow:

Identifying local corrections in human-computer dialogue. 313-316 - Peter Reichl, Florian Hammer:

Hot discussion or frosty dialogue? towards a temperature metric for conversational interactivity. 317-320 - Stephanie Seneff, Chao Wang, I. Lee Hetherington, Grace Chung:

A dynamic vocabulary spoken dialogue interface. 321-324 - Matthias Denecke, Kohji Dohsaka, Mikio Nakano:

Learning dialogue policies using state aggregation in reinforcement learning. 325-328
Speech Perception
- Keren B. Shatzman:

Segmenting ambiguous phrases using phoneme duration. 329-332 - Shuichi Sakamoto, Yôiti Suzuki, Shigeaki Amano, Tadahisa Kondo, Naoki Iwaoka:

A compensation method for word-familiarity difference with SNR control in intelligibility test. 333-336 - Takashi Otake, Yoko Sakamoto, Yasuyuki Konomi:

Phoneme-based word activation in spoken-word recognition: evidence from Japanese school children. 337-340 - Belynda Brahimi, Philippe Boula de Mareüil, Cédric Gendrot:

Role of segmental and suprasegmental cues in the perception of Maghrebian-accented French. 341-344 - Hiroaki Kato, Yoshinori Sagisaka, Minoru Tsuzaki, Makiko Muto:

Effect of speaking rate on the acceptability of change in segment duration. 345-348 - Kiyoko Yoneyama:

A cross-linguistic study of diphthongs in spoken word processing in Japanese and English. 349-352
Multi-Lingual Speech-to-Speech Translation
- Alex Waibel:

Speech translation: past, present and future. 353-356 - Gen-ichiro Kikui, Toshiyuki Takezawa, Seiichi Yamamoto:

Multilingual corpora for speech-to-speech translation research. 357-360 - Hermann Ney:

Statistical machine translation and its challenges. 361-364 - John Lee, Stephanie Seneff:

Translingual grammar induction. 365-368 - Youngjik Lee, Jun Park, Seung-Shin Oh:

Usability considerations of speech-to-speech translation system. 369-372 - Gianni Lazzari, Alex Waibel, Chengqing Zong:

Worldwide ongoing activities on multilingual speech to speech translation. 373-376
Speech Recognition - Large Vocabulary
- Dominique Fohr, Odile Mella, Christophe Cerisara, Irina Illina:

The automatic news transcription system: ANTS, some real time experiments. 377-380 - Bhuvana Ramabhadran, Olivier Siohan, Geoffrey Zweig:

Use of metadata to improve recognition of spontaneous speech and named entities. 381-384 - Janne Pylkkönen, Mikko Kurimo:

Duration modeling techniques for continuous speech recognition. 385-388 - Tanel Alumäe:

Large vocabulary continuous speech recognition for Estonian using morpheme classes. 389-392 - Zhaobing Han, Shuwu Zhang, Bo Xu:

Combining agglomerative and tree-based state clustering for high accuracy acoustic modeling. 393-396 - William S.-Y. Wang, Gang Peng:

Parallel tone score association method for tone language speech recognition. 397-400 - Jing Zheng, Horacio Franco, Andreas Stolcke:

Effective acoustic modeling for rate-of-speech variation in large vocabulary conversational speech recognition. 401-404 - L. Sarada Ghadiyaram, Hemalatha Nagarajan, Nagarajan Thangavelu, Hema A. Murthy:

Automatic transcription of continuous speech using unsupervised and incremental training. 405-408 - Jan Nouza, Dana Nejedlová, Jindrich Zdánský, Jan Kolorenc:

Very large vocabulary speech recognition system for automatic transcription of Czech broadcast programs. 409-412 - Olivier Siohan, Bhuvana Ramabhadran, Geoffrey Zweig:

Speech recognition error analysis on the English MALACH corpus. 413-416 - Rong Zhang, Alexander I. Rudnicky:

A frame level boosting training scheme for acoustic modeling. 417-420 - Rong Zhang, Alexander I. Rudnicky:

Optimizing boosting with discriminative criteria. 421-424 - Xianghua Xu, Qiang Guo, Jie Zhu:

Restructuring HMM states for speaker adaptation in Mandarin speech recognition. 425-428 - Mike Matton, Mathias De Wachter, Dirk Van Compernolle, Ronald Cools:

A discriminative locally weighted distance measure for speaker independent template based speech recognition. 429-432 - Yohei Itaya, Heiga Zen, Yoshihiko Nankaku, Chiyomi Miyajima, Keiichi Tokuda, Tadashi Kitamura:

Deterministic annealing EM algorithm in parameter estimation for acoustic model. 433-436 - Frantisek Grézl, Martin Karafiát, Jan Cernocký:

TRAP based features for LVCSR of meeting data. 437-440 - Frank K. Soong, Wai Kit Lo, Satoshi Nakamura:

Optimal acoustic and language model weights for minimizing word verification errors. 441-444 - Atsushi Sako, Yasuo Ariki:

Structuring of baseball live games based on speech recognition using task dependant knowledge. 445-448 - Zhengyu Zhou, Helen M. Meng:

A two-level schema for detecting recognition errors. 449-452 - In-Jeong Choi, Nam-Hoon Kim, Su Youn Yoon:

Large vocabulary continuous speech recognition based on cross-morpheme phonetic information. 453-456 - Changxue Ma:

Automatic phonetic base form generation based on maximum context tree. 457-460 - Gustavo Hernández Ábrego, Lex Olorenshaw, Raquel Tato, Thomas Schaaf:

Dictionary refinements based on phonetic consensus and non-uniform pronunciation reduction. 1697-1700 - Abdelkhalek Messaoudi, Lori Lamel, Jean-Luc Gauvain:

Transcription of Arabic broadcast news. 1701-1704 - Takahiro Shinozaki, Sadaoki Furui:

Spontaneous speech recognition using a massively parallel decoder. 1705-1708 - Tanja Schultz, Qin Jin, Kornel Laskowski, Yue Pan, Florian Metze, Christian Fügen:

Issues in meeting transcription - the ISL meeting transcription system. 1709-1712 - Katsutoshi Ohtsuki, Nobuaki Hiroshima, Shoichi Matsunaga, Yoshihiko Hayashi:

Multi-pass ASR using vocabulary expansion. 1713-1716 - Vlasios Doumpiotis, William Byrne:

Pinched lattice minimum Bayes risk discriminative training for large vocabulary continuous speech recognition. 1717-1720 - Izhak Shafran, William Byrne:

Task-specific minimum Bayes-risk decoding using learned edit distance. 1945-1948 - Rong Zhang, Alexander I. Rudnicky:

Apply n-best list re-ranking to acoustic model combinations of boosting training. 1949-1952 - Do Yeong Kim, Srinivasan Umesh, Mark J. F. Gales, Thomas Hain, Philip C. Woodland:

Using VTLN for broadcast news transcription. 1953-1956 - Andreas Stolcke, Chuck Wooters, Ivan Bulyko, Martin Graciarena, Scott Otterson, Barbara Peskin, Mari Ostendorf, David Gelbart, Nikki Mirghafori, Tuomo W. Pirinen:

From Switchboard to meetings: development of the 2004 ICSI-SRI-UW meeting recognition system. 1957-1960 - Anand Venkataraman, Andreas Stolcke, Wen Wang, Dimitra Vergyri, Jing Zheng, Venkata Ramana Rao Gadde:

An efficient repair procedure for quick transcriptions. 1961-1964 - Yao Qian, Tan Lee, Frank K. Soong:

Tone information as a confidence measure for improving Cantonese LVCSR. 1965-1968
Speech Science
- Danielle Duez:

Temporal variables in parkinsonian speech. 461-464 - Olov Engwall:

Speaker adaptation of a three-dimensional tongue model. 465-468 - Nicole Cooper, Anne Cutler:

Perception of non-native phonemes in noise. 469-472 - Hideki Kawahara, Hideki Banno, Toshio Irino, Jiang Jin:

Intelligibility of degraded speech from smeared STRAIGHT spectrum. 473-476 - Young-Ik Kim, Rhee Man Kil:

Sound source localization based on zero-crossing peak-amplitude coding. 477-480 - Sachiyo Kajikawa, Laurel Fais, Shigeaki Amano, Janet F. Werker:

Adult and infant sensitivity to phonotactic features in spoken Japanese. 481-484 - Phil D. Green, James Carmichael:

Revisiting dysarthria assessment intelligibility metrics. 485-488 - Valter Ciocca, Tara L. Whitehill, Joan K.-Y. Ma:

The effect of intonation on perception of Cantonese lexical tones. 489-492 - Toshiko Isei-Jaakkola:

Maximum short quantity in Japanese and Finnish in two perception tests with F0 and dB variants. 493-496 - Paavo Alku, Matti Airas, Brad H. Story:

Evaluation of an inverse filtering technique using physical modeling of voice production. 497-500 - Hui-ju Hsu, Janice Fon:

Positional and phonotactic effects on the realization of Taiwan Mandarin tone 2. 501-504 - Karl Schnell, Arild Lacroix:

Speech production based on lossy tube models: unit concatenation and sound transitions. 505-508 - Qin Yan, Saeed Vaseghi, Dimitrios Rentzos, Ching-Hsiang Ho:

Modelling and ranking of differences across formants of British, Australian and American accents. 509-512 - Tatsuya Kitamura, Satoru Fujita, Kiyoshi Honda, Hironori Nishimoto:

An experimental method for measuring transfer functions of acoustic tubes. 513-516 - Takuya Tsuji, Tokihiko Kaburagi, Kohei Wakamiya, Jiji Kim:

Estimation of the vocal tract spectrum from articulatory movements using phoneme-dependent neural networks. 517-520 - Kunitoshi Motoki, Hiroki Matsuzaki:

Computation of the acoustic characteristics of vocal-tract models with geometrical perturbation. 521-524 - P. Vijayalakshmi, M. Ramasubba Reddy:

Analysis of hypernasality by synthesis. 525-528 - Abdellah Kacha, Francis Grenez, Frédéric Bettens, Jean Schoentgen:

Adaptive long-term predictive analysis of disordered speech. 529-532 - Slobodan Jovicic, Sandra Antesevic, Zoran Saric:

Phoneme restoration in degraded speech communication. 533-536 - Maria Marinaki, Constantine Kotropoulos, Ioannis Pitas, Nikolaos Maglaveras:

Automatic detection of vocal fold paralysis and edema. 537-540
Novel Features in ASR
- Yasuhiro Minami, Erik McDermott, Atsushi Nakamura, Shigeru Katagiri:

A theoretical analysis of speech recognition based on feature trajectory models. 549-552 - Zhijian Ou, Zuoying Wang:

Discriminative combination of multiple linear predictions for speech recognition. 553-556 - Davood Gharavian, Seyed Mohammad Ahadi:

Use of formants in stressed and unstressed continuous speech recognition. 557-560 - Konstantin Markov, Satoshi Nakamura, Jianwu Dang:

Integration of articulatory dynamic parameters in HMM/BN based speech recognition system. 561-564 - Leigh David Alsteris, Kuldip K. Paliwal:

ASR on speech reconstructed from short-time fourier phase spectra. 565-568
Spoken and Natural Language Understanding
- Robert Lieb, Tibor Fábián, Günther Ruske, Matthias Thomae:

Estimation of semantic confidences on lattice hierarchies. 569-572 - Fumiyo Fukumoto, Yoshimi Suzuki:

Learning subject drift for topic tracking. 573-576 - Elizabeth Shriberg, Andreas Stolcke, Dustin Hillard, Mari Ostendorf, Barbara Peskin, Mary P. Harper, Yang Liu:

The ICSI-SRI-UW metadata extraction system. 577-580 - Mark Hasegawa-Johnson, Stephen E. Levinson, Tong Zhang:

Automatic detection of contrast for speech understanding. 581-584 - Nick Jui-Chang Wang, Jia-Lin Shen, Ching-Ho Tsai:

Integrating layer concept information into n-gram modeling for spoken language understanding. 585-588 - Junyan Chen, Ji Wu, Zuoying Wang:

A robust understanding model for spoken dialogues. 589-592 - Chai Wutiwiwatchai, Sadaoki Furui:

Belief-based nonlinear rescoring in Thai speech understanding. 2129-2132 - Toshihiko Itoh, Atsuhiko Kai, Yukihiro Itoh, Tatsuhiro Konishi:

An understanding strategy based on plausibility score in recognition history using CSR confidence measure. 2133-2136 - Sangkeun Jung, Minwoo Jeong, Gary Geunbae Lee:

Speech recognition error correction using maximum entropy language model. 2137-2140 - Xiang Li, Juan M. Huerta:

Discriminative training of compound-word based multinomial classifiers for speech routing. 2141-2144 - Jihyun Eun, Changki Lee, Gary Geunbae Lee:

An information extraction approach for spoken language understanding. 2145-2148 - David Horowitz, Partha Lal, Pierce Gerard Buckley:

A maximum entropy shallow functional parser for spoken language understanding. 2149-2152 - Qiang Huang, Stephen J. Cox:

Mixture language models for call routing. 2153-2156 - Chung-Hsien Wu, Jui-Feng Yeh, Ming-Jun Chen:

Speech act identification using an ontology-based partial pattern tree. 2157-2160 - Ye-Yi Wang, Yun-Cheng Ju:

Creating speech recognition grammars from regular expressions for alphanumeric concepts. 2161-2164 - Isabel Trancoso, Paulo Araújo, Céu Viana, Nuno J. Mamede:

Poetry assistant. 2165-2168 - Tasuku Kitade, Tatsuya Kawahara, Hiroaki Nanjo:

Automatic extraction of key sentences from oral presentations using statistical measure based on discourse markers. 2169-2172 - Tomohiro Ohno, Shigeki Matsubara, Nobuo Kawaguchi, Yasuyoshi Inagaki:

Robust dependency parsing of spontaneous Japanese speech and its evaluation. 2173-2176 - Wolfgang Minker, Dirk Bühler, Christiane Beuschel:

Strategies for optimizing a stochastic spoken natural language parser. 2177-2180 - Tzu-Lun Lee, Ya-Fang He, Yun-Ju Huang, Shu-Chuan Tseng, Robert Eklund:

Prolongation in spontaneous Mandarin. 2181-2184 - Yuki Irie, Shigeki Matsubara, Nobuo Kawaguchi, Yukiko Yamaguchi, Yasuyoshi Inagaki:

Speech intention understanding based on decision tree learning. 2185-2188 - Satanjeev Banerjee, Alexander I. Rudnicky:

Using simple speech-based features to detect the state of a meeting and the roles of the meeting participants. 2189-2192 - Serdar Yildirim, Murtaza Bulut, Chul Min Lee, Abe Kazemzadeh, Zhigang Deng, Sungbok Lee, Shrikanth S. Narayanan, Carlos Busso:

An acoustic study of emotions expressed in speech. 2193-2196 - Tatsuya Kawahara, Ian Richard Lane, Tomoko Matsui, Satoshi Nakamura:

Topic classification and verification modeling for out-of-domain utterance detection. 2197-2200 - So-Young Park, Yong-Jae Kwak, Joon-Ho Lim, Hae-Chang Rim, Soo-Hong Kim:

Partially lexicalized parsing model utilizing rich features. 2201-2204 - Yoshimi Suzuki, Fumiyo Fukumoto, Yoshihiro Sekiguchi:

Clustering similar nouns for selecting related news articles. 2205-2208 - Leonardo Badino:

Chinese text word-segmentation considering semantic links among sentences. 2209-2212 - Do-Gil Lee, Hae-Chang Rim:

Syllable-based probabilistic morphological analysis model of Korean. 2213-2216
Speaker Segmentation and Clustering
- Fabio Valente, Christian Wellekens:

Scoring unknown speaker clustering: VB vs. BIC. 593-596 - Qin Jin, Tanja Schultz:

Speaker segmentation and clustering in meetings. 597-600 - Lori Lamel, Jean-Luc Gauvain, Leonardo Canseco-Rodriguez:

Speaker diarization from speech transcripts. 601-604 - Xavier Anguera Miró, Javier Hernando Pericas:

Evolutive speaker segmentation using a repository system. 605-608 - Hagai Aronowitz, David Burshtein, Amihood Amir:

Speaker indexing in audio archives using test utterance Gaussian mixture modeling. 609-612 - Antoine Raux:

Automated lexical adaptation and speaker clustering based on pronunciation habits for non-native speech recognition. 613-616
Speech Processing in a Packet Network Environment
- Kuldip K. Paliwal, Stephen So:

Scalable distributed speech recognition using multi-frame GMM-based block quantization. 617-620 - Naveen Srinivasamurthy, Kyu Jeong Han, Shrikanth S. Narayanan:

Robust speech recognition over packet networks: an overview. 621-624 - Thomas Eriksson, Samuel Kim, Hong-Goo Kang, Chungyong Lee:

Theory for speaker recognition over IP. 625-628 - Wu Chou, Feng Liu:

Voice portal services in packet network and VoIP environment. 629-632 - Peter Kabal, Colm Elliott:

Synchronization of speaker selection for centralized tandem free VoIP conferencing. 633-636 - Akitoshi Kataoka, Yusuke Hiwasaki, Toru Morinaga, Jotaro Ikedo:

Measuring the perceived importance of time- and frequency-divided speech blocks for transmitting over packet networks. 637-640 - Moo Young Kim, W. Bastiaan Kleijn:

Comparison of transmitter-based packet-loss recovery techniques for voice transmission. 641-644
Acoustic Modeling
- Denis Jouvet, Ronaldo O. Messina:

Context dependent "long units" for speech recognition. 645-648 - Shinichi Yoshizawa, Kiyohiro Shikano:

Rapid EM training based on model-integration. 649-652 - Dominique Fohr, Odile Mella, Irina Illina, Christophe Cerisara:

Experiments on the accuracy of phone models and liaison processing in a French broadcast news transcription system. 653-656 - Jorge F. Silva, Shrikanth S. Narayanan:

A statistical discrimination measure for hidden Markov models based on divergence. 657-660 - Jan Stadermann, Gerhard Rigoll:

A hybrid SVM/HMM acoustic modeling approach to automatic speech recognition. 661-664 - Dirk Knoblauch:

Data driven number-of-states selection in HMM topologies. 665-668 - Youngkyu Cho, Sung-a Kim, Dongsuk Yook:

Hybrid model using subspace distribution clustering hidden Markov models and semi-continuous hidden Markov models for embedded speech recognizers. 669-672 - Peder A. Olsen, Karthik Visweswariah:

Fast clustering of Gaussians and the virtue of representing Gaussians in exponential model format. 673-676 - Karen Livescu, James R. Glass:

Feature-based pronunciation modeling with trainable asynchrony probabilities. 677-680 - Hong-Kwang Jeff Kuo, Yuqing Gao:

Maximum entropy direct model as a unified model for acoustic modeling in speech recognition. 681-684 - Yu Zhu, Tan Lee:

Explicit duration modeling for Cantonese connected-digit recognition. 685-688 - Arthur Chan, Mosur Ravishankar, Alexander I. Rudnicky, Jahanzeb Sherwani:

Four-layer categorization scheme of fast GMM computation techniques in large vocabulary continuous speech recognition systems. 689-692 - Junho Park, Hanseok Ko:

Compact acoustic model for embedded implementation. 693-696 - Takatoshi Jitsuhiro, Satoshi Nakamura:

Increasing the mixture components of non-uniform HMM structures based on a variational Bayesian approach. 697-700 - Panu Somervuo:

Comparison of ML, MAP, and VB based acoustic models in large vocabulary speech recognition. 701-704 - Wolfgang Macherey, Ralf Schlüter, Hermann Ney:

Discriminative training with tied covariance matrices. 705-708 - Frank Diehl, Asunción Moreno:

Acoustic phonetic modeling using local codebook features. 709-712 - Gue Jun Jung, Su-Hyun Kim, Yung-Hwan Oh:

An efficient codebook design in SDCHMM for mobile communication environments. 713-716 - Makoto Shozakai, Goshu Nagino:

Analysis of speaking styles by two-dimensional visualization of aggregate of acoustic models. 717-720 - Myoung-Wan Koo, Ho-Hyun Jeon, Sang-Hong Lee:

Context dependent phoneme duration modeling with tree-based state tying. 721-724 - John Scott Bridle:

Towards better understanding of the model implied by the use of dynamic features in HMMs. 725-728
Prosody Modeling and Generation
- Jianfeng Li, Guoping Hu, Ren-Hua Wang:

Chinese prosody phrase break prediction based on maximum entropy model. 729-732 - Krothapalli Sreenivasa Rao, Bayya Yegnanarayana:

Intonation modeling for Indian languages. 733-736 - Yu Zheng, Gary Geunbae Lee, Byeongchang Kim:

Using multiple linguistic features for Mandarin phrase break prediction in maximum-entropy classification framework. 737 - Ian Read, Stephen Cox:

Using part-of-speech for predicting phrase breaks. 741-744 - David Escudero Mancebo, Valentín Cardeñoso-Payo:

A proposal to quantitatively select the right intonation unit in data-driven intonation modeling. 745-748 - Jinfu Ni, Hisashi Kawai, Keikichi Hirose:

Formulating contextual tonal variations in Mandarin. 749-752 - Salma Mouline, Olivier Boëffard, Paul C. Bagshaw:

Automatic adaptation of the MOMEL F0 stylisation algorithm to new corpora. 753-756 - Pablo Daniel Agüero, Klaus Wimmer, Antonio Bonafonte:

Joint extraction and prediction of Fujisaki's intonation model parameters. 757-760 - Panagiotis Zervas, Nikos Fakotakis, George K. Kokkinakis, Georgios Kouroupetroglou, Gerasimos Xydas:

Evaluation of corpus based tone prediction in mismatched environments for Greek TTS synthesis. 761-764 - Ziyu Xiong, Juanwen Chen:

The duration of pitch transition phase and its relative factors. 765-768 - Yu Hu, Ren-Hua Wang, Lu Sun:

Polynomial regression model for duration prediction in Mandarin. 769-772 - Michelle Tooher, John G. McKenna:

Prediction of the glottal LF parameters using regression trees. 773-776 - Volker Dellwo, Bianca Aschenberner, Petra Wagner, Jana Dancovicova, Ingmar Steiner:

Bonntempo-corpus and bonntempo-tools: a database for the study of speech rhythm and rate. 777-780 - Wentao Gu, Keikichi Hirose, Hiroya Fujisaki:

Analysis of F0 contours of Cantonese utterances based on the command-response model. 781-784 - Marion Dohen, Hélène Loevenbruck:

Pre-focal rephrasing, focal enhancement and postfocal deaccentuation in French. 785-788 - Sridhar Krishna Nemala, Partha Pratim Talukdar, Kalika Bali, A. G. Ramakrishnan:

Duration modeling for Hindi text-to-speech synthesis system. 789-792 - Nemala Sridhar Krishna, Hema A. Murthy:

A new prosodic phrasing model for Indian language Telugu. 793-796 - Oliver Jokisch, Michael Hofmann:

Evolutionary optimization of an adaptive prosody model. 797-800 - Gerasimos Xydas, Georgios Kouroupetroglou:

An intonation model for embedded devices based on natural F0 samples. 801-804 - Katerina Vesela, Nino Peterek, Eva Hajicová:

Prosodic characteristics of Czech contrastive topic. 805-808
Multi-Sensor ASR
- Martin Graciarena, Federico Cesari, Horacio Franco, Gregory K. Myers, Cregg Cowan, Victor Abrash:

Combination of standard and throat microphones for robust speech recognition in highly noisy environments. 809-812 - Cenk Demiroglu, David V. Anderson:

Noise robust digit recognition using a glottal radar sensor for voicing detection. 813-816 - Dominik Raub, John W. McDonough, Matthias Wölfel:

A cepstral domain maximum likelihood beamformer for speech recognition. 817-820 - Naoya Mochiki, Tetsunori Kobayashi, Toshiyuki Sekiya, Tetsuji Ogawa:

Recognition of three simultaneous utterance of speech by four-line directivity microphone mounted on head of robot. 821-824 - Shigeki Sagayama, Okajima Takashi, Yutaka Kamamoto, Takuya Nishimoto:

Complex spectrum circle centroid for microphone-array-based noisy speech recognition. 825-828 - Larry P. Heck, Mark Z. Mao:

Automatic speech recognition of co-channel speech: integrated speaker and speech recognition approach. 829-832
Multi-Lingual Speech Processing
- José B. Mariño, Asunción Moreno, Albino Nogueiras:

A first experience on multilingual acoustic modeling of the languages spoken in Morocco. 833-836 - Mónica Caballero, Asunción Moreno, Albino Nogueiras:

Data driven multidialectal phone set for Spanish dialects. 837-840 - Daniela Oria, Akos Vetek:

Multilingual e-mail text processing for speech synthesis. 841-844 - Harald Romsdorfer, Beat Pfister:

Multi-context rules for phonological processing in polyglot TTS synthesis. 845-848 - Leonardo Badino, Claudia Barolo, Silvia Quazza:

A general approach to TTS reading of mixed-language texts. 849-852 - Panayiotis G. Georgiou, Shrikanth S. Narayanan, Hooman Shirani Mehr:

Context dependent statistical augmentation of Persian transcripts. 853-856
Speech Enhancement
- Cenk Demiroglu, David V. Anderson:

A soft decision MMSE amplitude estimator as a noise preprocessor to speech coders using a glottal sensor. 857-860 - Rongqiang Hu, David V. Anderson:

Single acoustic-channel speech enhancement based on glottal correlation using non-acoustic sensor. 861-864 - Xianxian Zhang, John H. L. Hansen, Kathryn Hoberg Arehart, Jessica Rossi-Katz:

In-vehicle based speech processing for hearing impaired subjects. 865-868 - Sriram Srinivasan, W. Bastiaan Kleijn:

Speech enhancement using adaptive time-domain segmentation. 869-872 - Tomohiro Nakatani, Keisuke Kinoshita, Masato Miyoshi, Parham Zolfaghari:

Harmonicity based monaural speech dereverberation with time warping and F0 adaptive window. 873-876 - Marc Delcroix, Takafumi Hikichi, Masato Miyoshi:

Dereverberation of speech signals based on linear prediction. 877-880
Speech and Affect
- Nick Campbell:

Perception of affect in speech - towards an automatic processing of paralinguistic information in spoken conversation. 881-884 - Noël Chateau, Valérie Maffiolo, Christophe Blouin:

Analysis of emotional speech in voice mail messages: the influence of speakers' gender. 885-888 - Chul Min Lee, Serdar Yildirim, Murtaza Bulut, Abe Kazemzadeh, Carlos Busso, Zhigang Deng, Sungbok Lee, Shrikanth S. Narayanan:

Emotion recognition based on phoneme classes. 889-892 - Peter Robinson, Tal Sobol Shikler:

Visualizing dynamic features of expressions in speech. 893-896 - Aijun Li, Haibo Wang:

Friendly speech analysis and perception in standard Chinese. 897-900 - Ailbhe Ní Chasaide, Christer Gobl:

Decomposing linguistic and affective components of phonatory quality. 901-904 - Dan-Ning Jiang, Lian-Hong Cai:

Classifying emotion in Chinese speech by decomposing prosodic features. 1325-1328 - Chen Yu, Paul M. Aoki, Allison Woodruff:

Detecting user engagement in everyday conversations. 1329-1332 - Takashi X. Fujisawa, Norman D. Cook:

Identifying emotion in speech prosody using acoustical cues of harmony. 1333-1336 - Jianhua Tao:

Context based emotion detection from text input. 1337-1340 - Atsushi Iwai, Yoshikazu Yano, Shigeru Okuma:

Complex emotion recognition system for a specific user using SOM based on prosodic features. 1341-1344 - Hoon-Young Cho, Kaisheng Yao, Te-Won Lee:

Emotion verification for emotion detection and unknown emotion rejection. 1345-1348 - Keikichi Hirose:

Improvement in corpus-based generation of F0 contours using generation process model for emotional speech synthesis. 1349-1352
Speech Features
- Rajesh Mahanand Hegde, Hema A. Murthy, Venkata Ramana Rao Gadde:

Continuous speech recognition using joint features derived from the modified group delay function and MFCC. 905-908 - Hua Yu:

Phase-space representation of speech. 909-912 - Hema A. Murthy, Rajesh Mahanand Hegde, Venkata Ramana Rao Gadde:

The modified group delay feature: a new spectral representation of speech. 913-916 - Oh-Wook Kwon, Te-Won Lee:

ICA-based feature extraction for phoneme recognition. 917-920 - Qifeng Zhu, Barry Y. Chen, Nelson Morgan, Andreas Stolcke:

On using MLP features in LVCSR. 921-924 - Barry Y. Chen, Qifeng Zhu, Nelson Morgan:

Learning long-term temporal features in LVCSR using neural networks. 925-928 - T. V. Sreenivas, G. V. Kiran, A. G. Krishna:

Neural "spike rate spectrum" as a noise robust, speaker invariant feature for automatic speech recognition. 929-932 - Yoshihisa Nakatoh, Makoto Nishizaki, Shinichi Yoshizawa, Maki Yamada:

An adaptive MEL-LPC analysis for speech recognition. 933-936 - Kentaro Ishizuka, Noboru Miyazaki, Tomohiro Nakatani, Yasuhiro Minami:

Improvement in robustness of speech feature extraction method using sub-band based periodicity and aperiodicity decomposition. 937-940 - Carlos Toshinori Ishi:

A new acoustic measure for aspiration noise detection. 941-944 - Kris Demuynck, Oscar Garcia, Dirk Van Compernolle:

Synthesizing speech from speech recognition parameters. 945-948 - Marios Athineos, Hynek Hermansky, Daniel P. W. Ellis:

LP-TRAP: linear predictive temporal patterns. 949-952 - Xiang Li, Richard M. Stern:

Parallel feature generation based on maximizing normalized acoustic likelihood. 953-956 - Kun-Ching Wang:

An adaptive band-partitioning spectral entropy based speech detection in realistic noisy environments. 957-960 - Javier Ramírez, José C. Segura, M. Carmen Benítez, Ángel de la Torre, Antonio J. Rubio:

Improved voice activity detection combining noise reduction and subband divergence measures. 961-964 - Kiyoung Park, Changkyu Choi, Jeongsu Kim:

Voice activity detection using global soft decision with mixture of Gaussian model. 965-968 - Thomas Kemp, Climent Nadeu, Yin Hay Lam, Josep Maria Sola i Caros:

Environmental robust features for speech detection. 969-972 - Kornel Laskowski, Qin Jin, Tanja Schultz:

Crosscorrelation-based multispeaker speech activity detection. 973-976 - Shang-nien Tsai:

Improved robustness of time-frequency principal components (TFPC) by synergy of methods in different domains. 977-980 - Li Deng, Yu Dong, Alex Acero:

A quantitative model for formant dynamics and contextually assimilated reduction in fluent speech. 981-984 - Gernot Kubin, Tuan Van Pham:

DWT-based classification of acoustic-phonetic classes and phonetic units. 985-988 - Yong-Choon Cho, Seungjin Choi:

Learning nonnegative features of spectro-temporal sounds for classification. 989-992
Language Modeling, Multimodal & Multilingual Speech Processing
- Sungyup Chung, Keikichi Hirose, Nobuaki Minematsu:

N-gram language modeling of Japanese using bunsetsu boundaries. 993-996 - Langzhou Chen, Lori Lamel, Jean-Luc Gauvain, Gilles Adda:

Dynamic language modeling for broadcast news. 997-1000 - Ren-Yuan Lyu, Dau-Cheng Lyu, Min-Siong Liang, Min-Hong Wang, Yuang-Chin Chiang, Chun-Nan Hsu:

A unified framework for large vocabulary speech recognition of mutually unintelligible Chinese "regionalects". 1001-1004 - Ielka van der Sluis, Emiel Krahmer:

The influence of target size and distance on the production of speech and gesture in multimodal referring expressions. 1005-1008 - Anurag Kumar Gupta, Tasos Anastasakos:

Dynamic time windows for multimodal input fusion. 1009-1012 - Raymond H. Lee, Anurag Kumar Gupta:

MICot : a tool for multimodal input data collection. 1013-1016 - Chakib Tadj, Hicham Djenidi, Madjid Haouani, Amar Ramdane-Cherif, Nicole Lévy:

Simulating multimodal applications. 1017-1020 - Jakob Schou Pedersen, Paul Dalsgaard, Børge Lindberg:

A multimodal communication aid for global aphasia patients. 1021-1024 - Hirofumi Yamamoto, Gen-ichiro Kikui, Yoshinori Sagisaka:

Mis-recognized utterance detection using hierarchical language model. 1025-1028 - Marko Moberg, Kimmo Pärssinen, Juha Iso-Sipilä:

Cross-lingual phoneme mapping for multilingual synthesis systems. 1029-1032 - Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno, Tsuyoshi Tasaki, Takeshi Yamaguchi:

Robot motion control using listener's back-channels and head gesture information. 1033-1036 - Sakriani Sakti, Arry Akhmad Arman, Satoshi Nakamura, Paulus Hutagaol:

Indonesian speech recognition for hearing and speaking impaired people. 1037-1040 - Mohsen A. Rashwan:

A two phase Arabic language model for speech recognition and other language applications. 1041-1044 - Yuya Akita, Tatsuya Kawahara:

Language model adaptation based on PLSA of topics and speakers. 1045-1048 - Hans J. G. A. Dolfing, Pierce Gerard Buckley, David Horowitz:

Unified language modeling using finite-state transducers with first applications. 1049-1052 - Katsunobu Itou, Atsushi Fujii, Tomoyosi Akiba:

Effects of language modeling on speech-driven question answering. 1053-1056 - Abhinav Sethy, Shrikanth S. Narayanan, Bhuvana Ramabhadran:

Measuring convergence in language model estimation using relative entropy. 1057-1060
Detection and Classification in ASR
- Rongqing Huang, John H. L. Hansen:

High-level feature weighted GMM network for audio stream classification. 1061-1064 - Jindrich Zdánský, Petr David, Jan Nouza:

An improved preprocessor for the automatic transcription of broadcast news audio stream. 1065-1068 - Yih-Ru Wang, Chi-Han Huang:

Speaker-and-environment change detection in broadcast news using the common component GMM-based divergence measure. 1069-1072 - Tommi Lahti:

Beginning of utterance detection algorithm for low complexity ASR engines. 1073-1076 - Somsak Sukittanon, Arun C. Surendran, John C. Platt, Christopher J. C. Burges:

Convolutional networks for speech detection. 1077-1080 - Suryakanth V. Gangashetty, Chellu Chandra Sekhar, B. Yegnanarayana:

Detection of vowel onset points in continuous speech using autoassociative neural network models. 1081-1084
Speech Analysis
- Toshiki Tamiya, Tetsuya Shimamura:

Reconstruction filter design for bone-conducted speech. 1085-1088 - Pedro J. Quintana-Morales, Juan L. Navarro-Mesa:

Frequency warped ARMA analysis of the closed and the open phase of voiced speech. 1089-1092 - Boris Doval, Baris Bozkurt, Christophe d'Alessandro, Thierry Dutoit:

Zeros of z-transform (ZZT) decomposition of speech for source-tract separation. 1093-1096 - Li Deng, Roberto Togneri:

Use of neural network mapping and extended Kalman filter to recover vocal tract resonances from the MFCC parameters of speech. 1097-1100 - Xiao Li, Jonathan Malkin, Jeff A. Bilmes:

Graphical model approach to pitch tracking. 1101-1104 - Bo Xu, Jianhua Tao, Yongguo Kang:

A new multicomponent AM-FM demodulation with predicting frequency boundaries and its application to formant estimation. 1105-1108 - Yves Laprie:

A concurrent curve strategy for formant tracking. 2405-2408 - Qin Yan, Esfandiar Zavarehei, Saeed Vaseghi, Dimitrios Rentzos:

A formant tracking LP model for speech processing. 2409-2412 - Hong You:

Application of long-term filtering to formant estimation. 2413-2416 - Baris Bozkurt, Thierry Dutoit, Boris Doval, Christophe d'Alessandro:

A method for glottal formant frequency estimation. 2417-2420 - Baris Bozkurt, Thierry Dutoit, Boris Doval, Christophe d'Alessandro:

Improved differential phase spectrum processing for formant tracking. 2421-2424 - Xu Shao, Ben P. Milner:

MAP prediction of pitch from MFCC vectors for speech reconstruction. 2425-2428 - An-Tze Yu, Hsiao-Chuan Wang:

New harmonicity measures for pitch estimation and voice activity detection. 2429-2432 - Takuya Nishimoto, Shigeki Sagayama, Hirokazu Kameoka:

Multi-pitch trajectory estimation of concurrent speech based on harmonic GMM and nonlinear Kalman filtering. 2433-2436 - Attila Ferencz, Jeongsu Kim, Yong-Beom Lee, Jae-Won Lee:

Automatic pitch marking and reconstruction of glottal closure instants from noisy and deformed electro-glotto-graph signals. 2437-2440 - Federico Flego, Luca Armani, Maurizio Omologo:

On the use of a weighted autocorrelation based fundamental frequency estimation for a multidimensional speech input. 2441-2444 - Aarthi M. Reddy, Bhiksha Raj:

A minimum mean squared error estimator for single channel speaker separation. 2445-2448 - Md. Khademul Islam Molla, Keikichi Hirose, Nobuaki Minematsu:

Audio source separation from the mixture using empirical mode decomposition with independent subspace analysis. 2449-2452 - In-Jung Oh, Hyun-Yeol Chung, Jae-Won Cho, Ho-Youl Jung, Rémy Prost:

Audio watermarking in sub-band signals using multiple echo kernels. 2453-2456 - Jie Zhang, Zhenyang Wu:

A piecewise interpolation method based on log-least square error criterion for HRTF. 2457-2460 - R. Muralishankar, A. G. Ramakrishnan, Lakshmish N. Kaushik:

Time-scaling of speech using independent subspace analysis. 2465-2468 - Laurent Girin, Mohammad Firouzmand, Sylvain Marchand:

Long term modeling of phase trajectories within the speech sinusoidal model framework. 2469-2472 - Tina Soltani, Dave Hermann, Etienne Cornu, Hamid Sheikhzadeh, Robert L. Brennan:

An acoustic shock limiting algorithm using time and frequency domain speech features. 2473-2476 - Jong Won Shin, Joon-Hyuk Chang, Nam Soo Kim:

Speech probability distribution based on generalized gamma distribution. 2477-2480 - Yanli Zheng, Mark Hasegawa-Johnson, Sarah Borys:

Stop consonant classification by dynamic formant trajectory. 2481-2484 - Yoshinori Shiga, Simon King:

Estimating detailed spectral envelopes using articulatory clustering. 2485-2488
Speech Production
- Olov Engwall:

From real-time MRI to 3d tongue movements. 1109-1112 - Mitsuhiro Nakamura:

Coarticulatory variability and directionality in [s, ..]: an EPG study. 1113-1116 - Yosuke Tanabe, Tokihiko Kaburagi:

Flow representation through the glottis having a polygonal boundary shape. 1117-1120 - Hannu Pulakka, Paavo Alku, Svante Granqvist, Stellan Hertegard, Hans Larsson, Anne-Maria Laukkanen, Per-Ake Lindestad, Erkki Vilkman:

Analysis of the voice source in different phonation types: simultaneous high-speed imaging of the vocal fold vibration and glottal inverse filtering. 1121-1124 - Peter Birkholz, Dietmar Jackèl:

Influence of temporal discretization schemes on formant frequencies and bandwidths in time domain simulations of the vocal tract system. 1125-1128 - Tomoki Toda, Alan W. Black, Keiichi Tokuda:

Acoustic-to-articulatory inversion mapping with Gaussian mixture model. 1129-1132
Audio-Visual Speech Processing
- Jinyoung Kim, Jeesun Kim, Chris Davis:

Audio-visual spoken language processing. 1133-1136 - Kaoru Sekiyama, Denis Burnham:

Issues in the development of auditory-visual speech perception: adults, infants, and children. 1137-1140 - Emiel Krahmer, Marc Swerts:

Signaling and detecting uncertainty in audiovisual speech by children and adults. 1141-1144 - Valérie Hazan, Anke Sennema, Andrew Faulkner:

Effect of intensive audiovisual perceptual training on the perception and production of the /l/-/r/ contrast for Japanese learners of English. 1145-1148 - Jean Vroomen, Sabine van Linden, Béatrice de Gelder, Paul Bertelson:

Visual recalibration of auditory speech versus selective speech adaptation: different build-up courses. 1149-1152 - Chris Davis, Jeesun Kim:

Of the top of the head: audio-visual speech perception from the nose up. 1153-1156 - J. Bruce Millar, Michael Wagner, Roland Goecke:

Aspects of speaking-face data corpus design methodology. 1157-1160 - Jean-Luc Schwartz, Marie-Agnès Cathiard:

Modeling audio-visual speech perception: back on fusion architectures and fusion control. 2017-2020 - Mikko Sams, Ville Ojanen, Jyrki Tuomainen, Vasily Klucharev:

Neurocognition of speech-specific audiovisual perception. 2021-2024 - Adriano Vilela Barbosa, Eric Vatikiotis-Bateson, Andreas Daffertshofer:

Target practice on talking faces. 2025-2028 - Matthias Odisio, Gérard Bailly:

Audiovisual perceptual evaluation of resynthesised speech movements. 2029-2032 - Sascha Fagel:

Video-realistic synthetic speech with a parametric visual speech synthesizer. 2033-2036 - Patricia Scanlon, Gerasimos Potamianos, Vit Libal, Stephen M. Chu:

Mutual information based visual feature selection for lipreading. 2037-2040 - Bowon Lee, Mark Hasegawa-Johnson, Camille Goudeseune, Suketu Kamdar, Sarah Borys, Ming Liu, Thomas S. Huang:

AVICAR: audio-visual speech corpus in a car environment. 2489-2492 - Engin Erzin, Yucel Yemez, A. Murat Tekalp:

Adaptive classifier cascade for multimodal speaker identification. 2493-2496 - Midori Iba, Anke Sennema, Valérie Hazan, Andrew Faulkner:

Use of visual cues in the perception of a labial/labiodental contrast by Spanish-L1 and Japanese-L1 learners of English. 2497-2500 - Xianxian Zhang, Kazuya Takeda, John H. L. Hansen, Toshiki Maeno:

Audio-visual speaker localization for car navigation systems. 2501-2504 - Josef Chaloupka:

Automatic lips reading for audio-visual speech processing and recognition. 2505-2508 - Michael Wagner, Girija Chetty:

"liveness" verification in audio-video authentication. 2509-2512 - Maria José Sanchez Martinez, Juan Pablo de la Cruz Gutiérrez

:
Speech recognition using motion based lipreading. 2513-2516 - Frédéric Berthommier:

Comparative study of linear and non-linear models for viseme inversion: modeling of a cortical associative function. 2517-2520 - Petr Císar, Zdenek Krnoul, Milos Zelezný:

3d lip-tracking for audio-visual speech recognition in real applications. 2521-2524 - J. Bruce Millar, Roland Goecke:

The audio-video Australian English speech data corpus AVOZES. 2525-2528 - Ki-Hyung Hong, Yong-Ju Lee, Jae-Young Suh, Kyong-Nim Lee:

Correcting Korean vowel speech recognition errors with limited lip features. 2529-2532 - Kuniko Y. Nielsen:

Segmental differences in the visual contribution to speech intelligibility. 2533-2536
Spoken Language Generation and Synthesis III
- Hui Ye, Steve J. Young:

Voice conversion for unknown speakers. 1161-1164 - Volker Fischer, Jaime Botella Ordinas, Siegfried Kunzmann:

Domain adaptation methods in the IBM trainable text-to-speech system. 1165-1168 - Yi Zhou, Yiqing Zu, Zhenli Yu, Dongjian Yue, Guilin Chen:

Applying pitch connection control in Mandarin speech synthesis. 1169-1172 - Hermann Ney, David Sündermann, Antonio Bonafonte, Harald Höge:

A first step towards text-independent voice conversion. 1173-1176 - Zhenli Yu, Kaizhi Wang, Yiqing Zu, Dongjian Yue, Guilin Chen:

Data pruning approach to unit selection for inventory generation of concatenative embeddable Chinese TTS systems. 1177-1180 - Jithendra Vepa, Simon King:

Subjective evaluation of join cost functions used in unit selection speech synthesis. 1181-1184 - Heiga Zen, Tadashi Kitamura, Murtaza Bulut, Shrikanth S. Narayanan, Ryosuke Tsuzuki, Keiichi Tokuda:

Constructing emotional speech synthesizers with limited speech database. 1185-1188 - Cheng-Yuan Lin, Jyh-Shing Roger Jang:

A two-phase pitch marking method for TD-PSOLA synthesis. 1189-1192 - Antonio Bonafonte, Alexander Kain, Jan P. H. van Santen, Helenca Duxans:

Including dynamic and phonetic information in voice conversion systems. 1193-1196 - Zixiang Wang, Ren-Hua Wang, Zhiwei Shuang, Zhen-Hua Ling:

A novel voice conversion system based on codebook mapping with phoneme-tied weighting. 1197-1200 - Zhen-Hua Ling, Yu Hu, Zhiwei Shuang, Ren-Hua Wang:

Compression of speech database by feature separation and pattern clustering using STRAIGHT. 1201-1204 - Shunsuke Kataoka, Nobuaki Mizutani, Keiichi Tokuda, Tadashi Kitamura:

Decision-tree backing-off in HMM-based speech synthesis. 1205-1208 - Nobuyuki Nishizawa, Hisashi Kawai:

Using a depth-restricted search to reduce delays in unit selection. 1209-1212 - Junichi Yamagishi, Takashi Masuko, Takao Kobayashi:

MLLR adaptation for hidden semi-Markov model based speech synthesis. 1213-1216 - Stefan Breuer, Julia Abresch:

Phoxsy: multi-phone segments for unit selection speech synthesis. 1217-1220 - Francesc Alías, Xavier Llorà, Ignasi Iriondo Sanz, Joan Claudi Socoró, Xavier Sevillano, Lluís Formiga:

Perception-guided and phonetic clustering weight tuning based on diphone pairs for unit selection TTS. 1221-1224 - Taoufik En-Najjary, Olivier Rosec, Thierry Chonavel:

A voice conversion method based on joint pitch and spectral envelope transformation. 1225-1228 - Taoufik En-Najjary, Olivier Rosec, Thierry Chonavel:

Fast GMM-based voice conversion for text-to-speech synthesis systems. 1229-1232 - Rohit Kumar:

A genetic algorithm for unit selection based speech synthesis. 1233-1236 - Jun Huang, Lex Olorenshaw, Gustavo Hernández Ábrego, Lei Duan:

A memory efficient grapheme-to-phoneme conversion system for speech processing. 1237-1240 - Rohit Kumar, S. Prahallad Kishore:

Automatic pruning of unit selection speech databases for synthesis without loss of naturalness. 1377-1380 - Tanya Lambert, Andrew P. Breen:

A database design for a TTS synthesis system using lexical diphones. 1381-1384 - John Kominek, Alan W. Black:

A family-of-models approach to HMM-based segmentation for unit selection speech synthesis. 1385-1388 - Wei Zhang, Ling Jin, Xijun Ma:

Mutual-information based segment pre-selection in concatenative text-to-speech. 1389-1392 - Heiga Zen, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura:

Hidden semi-Markov model based speech synthesis. 1393-1396 - Hartmut R. Pfitzinger:

DFW-based spectral smoothing for concatenative speech synthesis. 1397-1400 - Kyung-Joong Min, Un-Cheon Lim:

Korean prosody generation and artificial neural networks. 1869-1872 - Kyuchul Yoon:

A prosodic phrasing model for a Korean text-to-speech synthesis system. 1873-1876 - Qin Shi, Volker Fischer:

A comparison of statistical methods and features for the prediction of prosodic structures. 1877-1880 - Gui-Lin Chen, Ke-Song Han:

Letter-to-sound for small-footprint multilingual TTS engine. 1881-1884 - Jun Xu, Guohong Fu, Haizhou Li:

Grapheme-to-phoneme conversion for Chinese text-to-speech. 1885-1888 - Marc Schröder, Stefan Breuer:

XML representation languages as a way of interconnecting TTS modules. 1889-1892 - Wenjie Cao, Chengqing Zong, Bo Xu:
Approach to interchange-format based Chinese generation. 1893-1896 - Enrico Zovato, Stefano Sandri, Silvia Quazza, Leonardo Badino:

Prosodic analysis of a multi-style corpus in the perspective of emotional speech synthesis. 1897-1900 - Kyung-Joong Min, Chan-Goo Kang, Un-Cheon Lim:

Number of output nodes of artificial neural networks for Korean prosody generation. 1901-1904 - Sunhee Kim, Ju-Eun Ahn, Soon-Hyob Kim, Yang-Hee Lee:

A Korean grapheme-to-phoneme conversion system using selection procedure for exceptions. 1905-1908 - Thanate Khaorapapong, Montri Karnjanadecha, Keerati Inthavisas:

Synthesis of vowels and tones in Thai language by articulatory modeling. 1909-1912 - Yoshinori Shiga, Simon King:

Source-filter separation for articulation-to-speech synthesis. 1913-1916 - Hisako Asano, Hideharu Nakajima, Hideyuki Mizuno, Masahiro Oku:

Long vowel detection for letter-to-sound conversion for Japanese sourced words transliterated into the alphabet. 1917-1920 - Frantz Clermont, Thomas John Millhouse:

Inexactness and robustness in cepstral-to-formant transformation of spoken and sung vowels. 1921-1924 - Takeshi Saitou, Naoya Tsuji, Masashi Unoki, Masato Akagi:

Analysis of acoustic features affecting "singing-ness" and its application to singing-voice synthesis from speaking-voice. 1925-1928 - Vincent Pollet, Geert Coorman:

Statistical corpus-based speech segmentation. 1929-1932 - Jindrich Matousek, Jan Romportl, Daniel Tihelka, Zbynek Tychtl:

Recent improvements on ARTIC: Czech text-to-speech system. 1933-1936 - Youngim Jung, Donghun Lee, HyeonSook Nam, Ae-sun Yoon, Hyuk-Chul Kwon:

Learning for transliteration of Arabic-numeral expressions using decision tree for Korean TTS. 1937-1940 - Nicole Beringer:

How to integrate phonetic and linguistic knowledge in a text-to-phoneme conversion task: a syllabic TPC tool for French. 1941-1944 - Wael Hamza, Ellen Eide, Raimo Bakis:

Reconciling pronunciation differences between the front-end and the back-end in the IBM speech synthesis system. 2561-2564 - Juhong Ha, Yu Zheng, Gary Geunbae Lee, Yoon-Suk Seong, Byeongchang Kim:

High quality text-to-pinyin conversion using two-phase unknown word prediction. 2565-2568 - Yeon-Jun Kim, Ann K. Syrdal, Alistair Conkie:

Pronunciation lexicon adaptation for TTS voice building. 2569-2572 - Gabriel Webster:

Improving letter-to-pronunciation accuracy with automatic morphologically-based stress prediction. 2573-2576 - Wael Hamza, Ellen Eide, Raimo Bakis, Michael Picheny, John F. Pitrelli:

The IBM expressive speech synthesis system. 2577-2580 - Markus Schnell, Rüdiger Hoffmann:

What concept-to-speech can gain for prosody. 2581-2584
Speech Recognition - Language Model
- Tatsuya Kawahara, Kiyotaka Uchimoto, Hitoshi Isahara, Kazuya Shitaoka:

Dependency structure analysis and sentence boundary detection in spontaneous Japanese. 1353-1356 - Salma Jamoussi, David Langlois, Jean Paul Haton, Kamel Smaïli:

Statistical feature language model. 1357-1360 - Brigitte Bigi, Yan Huang, Renato de Mori:

Vocabulary and language model adaptation using information retrieval. 1361-1364 - Shinsuke Mori, Daisuke Takuma:

Word n-gram probability estimation from a Japanese raw corpus. 1365-1368 - Jen-Tzung Chien, Hung-Ying Chen:

Mining of association patterns for language modeling. 1369-1372 - Jen-Tzung Chien, Meng-Sung Wu, Hua-Jui Peng:

On latent semantic language modeling and smoothing. 1373-1376 - Vaibhava Goel:

Conditional maximum likelihood estimation for improving annotation performance of n-gram models incorporating stochastic finite state grammars. 2237-2241 - Edward James Schofield:

Fast parameter estimation for joint maximum entropy language models. 2241-2244 - Dimitra Vergyri, Katrin Kirchhoff, Kevin Duh, Andreas Stolcke:

Morphology-based language modeling for Arabic speech recognition. 2245-2248 - A. Nayeemulla Khan, B. Yegnanarayana:

Speech enhanced multi-Span language model. 2249-2252 - Holger Schwenk, Jean-Luc Gauvain:

Neural network language models for conversational speech recognition. 2253-2256 - David Mrva, Philip C. Woodland:

A PLSA-based language model for conversational telephone speech. 2257-2260
Speaker Recognition
- Jérôme Louradour, Régine André-Obrecht, Khalid Daoudi:

Segmentation and relevance measure for speaker verification. 1401-1404 - Mohamed Chetouani, Bruno Gas, Jean-Luc Zarader, Marcos Faúndez-Zanuy:

A new nonlinear feature extraction algorithm for speaker verification. 1405-1408 - Elizabeth Shriberg, Luciana Ferrer, Anand Venkataraman, Sachin S. Kajarekar:

SVM modeling of "SNERF-grams" for speaker recognition. 1409-1412 - Purdy Ho, Pedro J. Moreno:

SVM kernel adaptation in speaker classification and verification. 1413-1416 - Koji Iwano, Taichi Asami, Sadaoki Furui:

Noise-robust speaker verification using F0 features. 1417-1420 - Zi-He Chen, Yuan-Fu Liao, Yau-Tarng Juang:

Eigen-prosody analysis for robust speaker recognition under mismatch handset environment. 1421 - Aaron D. Lawson, Mark C. Huggins:

Triphone-based confidence system for speaker identification. 1745-1748 - Kenichi Yoshida, Kazuyuki Takagi, Kazuhiko Ozeki:

Improved model training and automatic weight adjustment for multi-SNR multi-band speaker identification system. 1749-1752 - Man-Wai Mak, Kwok-Kwong Yiu, Ming-Cheung Cheung, Sun-Yuan Kung:

A new approach to channel robust speaker verification via constrained stochastic feature transformation. 1753-1756 - Chakib Tadj, Christian S. Gargour, Nabil Badri:

Best speaker-based structure tree for speaker verification. 1757-1760 - David Chow, Waleed H. Abdulla:

Robust speaker identification based on perceptual log area ratio and Gaussian mixture models. 1761-1764 - Stanley J. Wenndt, Richard M. Floyd:

Channel frequency response correction for speaker recognition. 1765-1768 - Yh-Her Yang, Yuan-Fu Liao:

Unseen handset mismatch compensation based on a priori knowledge interpolation for robust speaker recognition. 1769-1772 - Michael T. Padilla, Thomas F. Quatieri:

A comparison of soft and hard spectral subtraction for speaker verification. 1773-1776 - Vlasta Radová, Ales Padrta:

Comparison of several speaker verification procedures based on GMM. 1777-1780 - Yong Guan, Wenju Liu, Hongwei Qi, Jue Wang:

Improving performance of text-independent speaker identification by utilizing contextual principal curves filtering. 1781-1784 - Jen-Tzung Chien, Chuan-Wei Ting:

Speaker identification using probabilistic PCA model selection. 1785-1788 - Hagai Aronowitz, David Burshtein, Amihood Amir:

Text independent speaker recognition using speaker dependent word spotting. 1789-1792 - Hsiao-Chuan Wang, Jyh-Min Cheng:

A study on model-based equal error rate estimation for automatic speaker verification. 1793-1796 - Tomoko Matsui, Kunio Tanabe:

Probabilistic speaker identification with dual penalized logistic regression machine. 1797-1800 - Javier R. Saeta, Javier Hernando:

Model quality evaluation during enrolment for speaker verification. 1801-1804 - Pasi Fränti, Evgeny Karpov, Tomi Kinnunen:

Real-time speaker identification. 1805-1808 - Mohamed Fathy Abu-ElYazeed, Nemat S. Abdel Kader, Mohammed El-Henawy:

Multi-codebook vector quantization algorithm for speaker identification. 1809-1812 - Ming-Cheung Cheung, Kwok-Kwong Yiu, Man-Wai Mak, Sun-Yuan Kung:

Multi-sample fusion with constrained feature transformation for robust speaker verification. 1813-1816 - Michael Betser, Frédéric Bimbot, Mathieu Ben, Guillaume Gravier:

Speaker diarization using bottom-up clustering based on a parameter-derived distance between adapted GMMs. 2329-2332 - Nengheng Zheng, P. C. Ching, Tan Lee:

Time-frequency analysis of vocal source signal for speaker recognition. 2333-2336 - Rashmi Gangadharaiah, Balakrishnan Narayanaswamy, Narayanaswamy Balakrishnan:

A novel method for two-speaker segmentation. 2337-2340 - Bayya Yegnanarayana, A. Shahina, M. R. Kesheorey:

Throat microphone signal for speaker recognition. 2341-2344 - Mohamed Faouzi BenZeghiba, Hervé Bourlard:

Posteriori probabilities and likelihoods combination for speech and speaker recognition. 2345-2348 - Mohamed Mihoubi, Douglas D. O'Shaughnessy, Pierre Dumouchel:

The use of typical sequences for robust speaker identification. 2349-2352 - KyungHwa Kim:

A forensic phonetic investigation into the duration and speech rate. 2353-2356 - T. V. Sreenivas, Sameer Badaskar:

Mixture Gaussian model training against impostor model parameters: an application to speaker identification. 2357-2360 - Jan Anguita, Javier Hernando, Alberto Abad:

Jacobian adaptation with improved noise reference for speaker verification. 2361-2364 - Mihalis Siafarikas, Todor Ganchev, Nikos Fakotakis:

Objective wavelet packet features for speaker verification. 2365-2368 - Upendra V. Chaudhari, Ganesh N. Ramaswamy:

Policy analysis framework for conversational biometrics. 2369-2372 - Woo-Yong Choi, Jung Gon Kim, Hyung Soon Kim, Sung Bum Pan:

A new score normalization method for speaker verification with virtual impostor model. 2373-2376 - Samuel Kim, Thomas Eriksson, Hong-Goo Kang:

On the time variability of vocal tract for speaker recognition. 2377-2380 - Veena Desai, Hema A. Murthy:

Distributed speaker recognition. 2381-2384 - Pongtep Angkititrakul, Sepideh Baghaii, John H. L. Hansen:

Cluster-dependent modeling and confidence measure processing for in-set/out-of-set speaker identification. 2385-2388 - Yoshiyuki Umeda, Shingo Kuroiwa, Satoru Tsuge, Fuji Ren:

Distributed speaker recognition using earth mover's distance. 2389-2392 - Michael Barlow, Mehrdad Khodai-Joopari, Frantz Clermont:

A forensically-motivated tool for selecting cepstrally-consistent steady-states from non-contemporaneous vowel utterances. 2393-2396 - Anil Alexander, Andrzej Drygajlo:

Scoring and direct methods for the interpretation of evidence in forensic speaker recognition. 2397-2400 - Tomi Kinnunen, Evgeny Karpov, Pasi Fränti:

Efficient online cohort selection method for speaker verification. 2401-2404 - A. Nayeemulla Khan, Bayya Yegnanarayana:

Latent semantic analysis for speaker recognition. 2589-2592 - Yang Shao, DeLiang Wang:

Model-based sequential organization for cochannel speaker identification. 2593-2596 - Ka-Yee Leung, Man-Wai Mak, Sun-Yuan Kung:

Articulatory feature-based conditional pronunciation modeling for speaker verification. 2597-2600 - Alex Park, Timothy J. Hazen:

A comparison of normalization and training approaches for ASR-dependent speaker identification. 2601-2604 - Dat Tran:

New background modeling for speaker verification. 2605-2608
Processing of Prosody by Humans and Machines
- Gérard Bailly, Bleicke Holm, Véronique Aubergé:

A trainable prosodic model: learning the contours implementing communicative functions within a superpositional model of intonation. 1425-1428 - Dung Tien Nguyen, Chi Mai Luong, Bang Kim Vu, Hansjörg Mixdorff, Huy Hoang Ngo:

Fujisaki model based F0 contours in Vietnamese TTS. 1429-1432 - Kazuyuki Ashimura, Hideki Kashioka, Nick Campbell:

Estimating speaking rate in spontaneous speech from z-scores of pattern durations. 1433-1436 - Takashi Masuko, Takao Kobayashi, Keisuke Miyanaga:

A style control technique for HMM-based speech synthesis. 1437-1440 - Mark Hasegawa-Johnson, Stephen E. Levinson, Tong Zhang:

Children's emotion recognition in an intelligent tutoring scenario. 1441-1444 - Keikichi Hirose, Nobuaki Minematsu:

Use of prosodic features for speech recognition. 1445-1448
Contemporary Issues in ASR
- Jochen Peters, Christina Drexel:

Transformation-based error correction for speech-to-text systems. 1449-1452 - Alexander Gutkin, Simon King:

Phone classification in pseudo-Euclidean vector spaces. 1453-1456 - Grace Chung, Chao Wang, Stephanie Seneff, Edward Filisko, Min Tang:

Combining linguistic knowledge and acoustic information in automatic pronunciation lexicon generation. 1457-1460 - Ken Chen, Mark Hasegawa-Johnson:

Modeling pronunciation variation using artificial neural networks for English spontaneous speech. 1461-1464 - Stefanie Aalburg, Harald Höge:

Foreign-accented speaker-independent speech recognition. 1465-1468 - Panikos Heracleous, Yoshitaka Nakajima, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano:

Non-audible murmur (NAM) speech recognition using a stethoscopic NAM microphone. 1469-1472 - Martin J. Russell, Shona D'Arcy, Lit Ping Wong:

Recognition of read and spontaneous children's speech using two new corpora. 1473-1476 - Joe Frankel, Mirjam Wester, Simon King:

Articulatory feature recognition using dynamic Bayesian networks. 1477-1480 - Gies Bouwman, Bert Cranen, Lou Boves:

Predicting word correct rate from acoustic and linguistic confusability. 1481-1484 - Kazushi Ishihara, Yuya Hattori, Tomohiro Nakatani, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:

Disambiguation in determining phonemes of sound-imitation words for environmental sound recognition. 1485-1488 - Jan Anguita, Stéphane Peillon, Javier Hernando, Alexandre Bramoulle:

Word confusability prediction in automatic speech recognition. 1489-1492 - Szu-Chen Stan Jou, Tanja Schultz, Alex Waibel:

Adaptation for soft whisper recognition using a throat microphone. 1493-1496 - Rainer Gruhn, Konstantin Markov, Satoshi Nakamura:

A statistical lexicon for non-native speech recognition. 1497-1500 - Mathew Magimai-Doss, Shajith Ikbal, Todd A. Stephenson, Hervé Bourlard:

Modeling auxiliary features in tandem systems. 1501-1504 - Louis ten Bosch, Lou Boves:

Survey of spontaneous speech phenomena in a multimodal dialogue system and some implications for ASR. 1505-1508 - Tobias Cincarek, Rainer Gruhn, Satoshi Nakamura:

Speech recognition for multiple non-native accent groups with speaker-group-dependent acoustic models. 1509-1512 - Frederik Stouten, Jean-Pierre Martens:

Coping with disfluencies in spontaneous speech recognition. 1513-1516 - Soonil Kwon, Shrikanth S. Narayanan:

Speaker model quantization for unsupervised speaker indexing. 1517-1520 - Matteo Gerosa, Diego Giuliani:

Investigating automatic recognition of non-native children's speech. 1521-1524 - Yang Liu, Elizabeth Shriberg, Andreas Stolcke, Mary P. Harper:

Using machine learning to cope with imbalanced classes in natural speech: evidence from sentence boundary and disfluency detection. 1525-1528 - Minho Jin, Gyucheol Jang, Sungrack Yun, Chang Dong Yoo:

Hybrid utterance verification based on n-best models and model derived from Kullback-Leibler divergence. 1529-1532 - Masataka Goto, Koji Kitayama, Katsunobu Itou, Tetsunori Kobayashi:

Speech spotter: on-demand speech recognition in human-human conversation on the telephone or in face-to-face situations. 1533-1536 - Kyong-Nim Lee, Minhwa Chung:

Pronunciation lexicon modeling and design for Korean large vocabulary continuous speech recognition. 1537-1540 - Sebastian Möller, Jan Felix Krebber, Alexander Raake:

Performance of speech recognition and synthesis in packet-based networks. 1541-1544 - Alastair Bruce James, Ben P. Milner, Angel Manuel Gomez:

A comparison of packet loss compensation methods and interleaving for speech recognition in burst-like packet loss. 1545-1548 - Ben P. Milner, Alastair Bruce James:

An analysis of packet loss models for distributed speech recognition. 1549-1552
Second Language Learning and Spoken Language Processing
- Nobuaki Minematsu:

Pronunciation assessment based upon the phonological distortions observed in language learners' utterances. 1669-1672 - Yasuo Suzuki, Yoshinori Sagisaka, Katsuhiko Shirai, Makiko Muto:

Analysis of the phone level contributions to objective evaluation of English speech by non-natives. 1673-1676 - Chao Wang, Mitchell Peabody, Stephanie Seneff, Jong-mi Kim:

An interactive English pronunciation dictionary for Korean learners. 1677-1680 - Seok-Chae Rhee, Jeon G. Park:

Development of the knowledge-based spoken English evaluation system and its application. 1681-1684 - Jared Bernstein, Isabella Barbier, Elizabeth Rosenfeld, John H. A. L. de Jong:

Theory and data in spoken language assessment. 1685-1688 - Tatsuya Kawahara, Masatake Dantsuji, Yasushi Tsubota:

Practical use of English pronunciation system for Japanese students in the CALL classroom. 1689-1692 - Jonas Beskow, Olov Engwall, Björn Granström, Preben Wik:

Design strategies for a virtual language tutor. 1693-1696
Emerging Research: Human Factors in Speech and Communication Systems
- Ellen Campana, Michael K. Tanenhaus, James F. Allen, Roger W. Remington:

Evaluating cognitive load in spoken language interfaces using a dual-task paradigm. 1721-1724 - Lesley-Ann Black, Norman D. Black, Roy Harper, Michelle Lemon, Michael F. McTear:

The voice-logbook: integrating human factors for a chronic care system. 1725-1728 - Kristiina Jokinen:

Communicative competence and adaptation in a spoken dialogue system. 1729-1732 - Zhan Fu, Lay Ling Pow, Fang Chen:

Evaluation of the difference between the driving behavior of a speech-based and a speech-visual-based task of an in-car computer. 1733-1736 - Sebastian Möller, Jan Felix Krebber, Paula M. T. Smeele:

Evaluating system metaphors via the speech output of a smart home system. 1737-1740 - Florian Hammer, Peter Reichl, Alexander Raake:

Elements of interactivity in telephone conversations. 1741-1744
Interdisciplinary Topics in Spoken Language Processing
- Rubén San Segundo, Juan Manuel Montero, Javier Macías Guarasa, Ricardo de Córdoba, Javier Ferreiros, José Manuel Pardo:

Generating gestures from speech. 1817-1820 - Noboru Kanedera, Asuka Sumida, Takao Ikehata, Tetsuo Funada:

Subtopic segmentation in the lecture speech. 1821-1824 - Donna Erickson, Caroline Menezes, Akinori Fujino:

Some articulatory measurements of real sadness. 1825-1828 - Chen-Long Lee, Wen-Whei Chang, Yuan-Chuan Chiang:

Application of voice conversion to hearing-impaired Mandarin speech enhancement. 1829-1832 - Oh Pyo Kweon, Akinori Ito, Motoyuki Suzuki, Shozo Makino:

A Japanese dialogue-based CALL system with mispronunciation and grammar error detection. 1833-1836 - Cheolwoo Jo, Ilsuh Bak:

Statistics-based direction finding for training vowels. 1837-1840 - Simona Montanari, Serdar Yildirim, Elaine Andersen, Shrikanth S. Narayanan:

Reference marking in children's computer-directed speech: an integrated analysis of discourse and gestures. 1841-1844 - Jong-mi Kim, Suzanne Flynn:

What makes a non-native accent?: a study of Korean English. 1845-1848 - Sang-Jin Kim, Kwang-Ki Kim, Minsoo Hahn:

Study on emotional speech features in Korean with its application to voice color conversion. 1849-1852 - Shigeaki Amano, Tomohiro Nakatani, Tadahisa Kondo:

Developmental changes in voiced-segment ratio for Japanese infants and parents. 1853-1856 - Kisun You, Hoyoun Kim, Wonyong Sung:

Implementation of an intonational quality assessment system for a handheld device. 1857-1860 - Denis Beautemps, Thomas Burger, Laurent Girin:

Characterizing and classifying cued speech vowels from labial parameters. 1861-1864 - Shinya Takahashi, Tsuyoshi Morimoto, Sakashi Maeda, Naoyuki Tsuruta:

Cough detection in spoken dialogue system for home health care. 1865-1868
Towards Adaptive Machines: Active and Unsupervised Learning
- Dong Yu, Mei-Yuh Hwang, Peter Mau, Alex Acero, Li Deng:

Unsupervised learning from users' error correction in speech dictation. 1969-1972 - Gerard G. L. Meyer, Teresa M. Kamm:

Robustness aspects of active learning for acoustic modeling. 1973-1976 - Karthik Visweswariah, Ramesh A. Gopinath, Vaibhava Goel:

Task adaptation of acoustic and language models based on large quantities of data. 1977-1980 - Luc Lussier, Edward W. D. Whittaker, Sadaoki Furui:

Unsupervised language model adaptation methods for spontaneous speech. 1981-1984 - Masafumi Nishida, Yoshitaka Mamiya, Yasuo Horiuchi, Akira Ichikawa:

On-line incremental adaptation based on reinforcement learning for robust speech recognition. 1985-1988 - Tomohiro Watanabe, Hiromitsu Nishizaki, Takehito Utsuro, Seiichi Nakagawa:

Unsupervised speaker adaptation using high confidence portion recognition results by multiple recognition systems. 1989-1992
Speech Coding
- Sorin Dusan, James L. Flanagan, Amod Karve, Mridul Balaraman:

Speech coding using trajectory compression and multiple sensors. 1993-1996 - Christian Feldbauer, Gernot Kubin:

How sparse can we make the auditory representation of speech? 1997-2000 - David Malah, Slava Shechtman:

Efficient sub-optimal temporal decomposition with dynamic weighting of speech signals for coding applications. 2001-2004 - Teddy Surya Gunawan, Eliathamby Ambikairajah, Julien Epps:

Perceptual wavelet packet audio coder. 2005-2008 - Sung-Kyo Jung, Hong-Goo Kang, Dae Hee Youn, Chang-Heon Lee:

Performance analysis of transcoding algorithms in packet-loss environments. 2009-2012 - Tiago H. Falk, Wai-Yip Chan, Peter Kabal:

Speech quality estimation using Gaussian mixture models. 2013-2016
Robust ASR
- Hong Kook Kim, Mazin G. Rahim:

Why speech recognizers make errors? A robustness view. 1645-1648 - Seyed Mohammad Ahadi, Hamid Sheikhzadeh, Robert L. Brennan, George H. Freeman:

An energy normalization scheme for improved robustness in speech recognition. 1649-1652 - Juan M. Huerta, Etienne Marcheret, Sreeram Balakrishnan:

Rapid on-line environment compensation for server-based speech recognition in noisy mobile environments. 1653-1656 - Leila Ansary, Seyyed Ali Seyyed Salehi:

Modeling phones coarticulation effects in a neural network based speech recognition system. 1657-1660 - Daniel Willett:

Error-weighted discriminative training for HMM parameter estimation. 1661-1664 - Wai Kit Lo, Frank K. Soong, Satoshi Nakamura:

Robust verification of recognized words in noise. 1665-1668 - Zili Li, Hesham Tolba, Douglas D. O'Shaughnessy:

Robust automatic speech recognition using an optimal spectral amplitude estimator algorithm in low-SNR car environments. 2041-2044 - Junhui Zhao, Jingming Kuang, Xiang Xie:

Robust speech recognition using data-driven temporal filters based on independent component analysis. 2045-2048 - Norihide Kitaoka, Longbiao Wang, Seiichi Nakagawa:

Robust distant speech recognition based on position dependent CMN. 2049-2052 - Sumitaka Sakauchi, Yoshikazu Yamaguchi, Satoshi Takahashi, Satoshi Kobashikawa:

Robust speech recognition based on HMM composition and modified Wiener filter. 2053-2056 - Ivan Brito, Néstor Becerra Yoma, Carlos Molina:

Feature-dependent compensation in speech recognition. 2057-2060 - Stephen Cox:

Using context to correct phone recognition errors. 2061-2064 - Yasunari Obuchi:

Improved histogram-based feature compensation for robust speech recognition and unsupervised speaker adaptation. 2065-2068 - Zhenyu Xiong, Thomas Fang Zheng, Wenhu Wu:

Weighting observation vectors for robust speech recognition in noisy environments. 2069-2072 - Masanori Tsujikawa, Ken-ichi Iso:

Hands-free speech recognition using blind source separation post-processed by two-stage spectral subtraction. 2073-2076 - Randy Gomez, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano:

Robust speech recognition with spectral subtraction in low SNR. 2077-2080 - Bert Cranen, Johan de Veth:

Active perception: using a priori knowledge from clean speech models to ignore non-target features. 2081-2084 - Haitian Xu, Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:

Spectral subtraction with full-wave rectification and likelihood controlled instantaneous noise estimation for robust speech recognition. 2085-2088 - Filip Korkmazsky, Dominique Fohr, Irina Illina:

Using linear interpolation to improve histogram equalization for speech recognition. 2089-2092 - Mark Hasegawa-Johnson, Ameya N. Deoras:

A factorial HMM approach to robust isolated digit recognition in background music. 2093-2096 - Yoonjae Lee, Hanseok Ko:

Multi-eigenspace normalization for robust speech recognition in noisy environments. 2097-2100 - Christophe Cerisara, Dominique Fohr, Odile Mella, Irina Illina:

Exploiting models intrinsic robustness for noisy speech recognition. 2101-2104 - Pere Pujol, Jaume Padrell, Climent Nadeu, Dusan Macho:

Speech recognition experiments with the SPEECON database using several robust front-ends. 2105-2108 - Shajith Ikbal, Mathew Magimai-Doss, Hemant Misra, Hervé Bourlard:

Spectro-temporal activity pattern (STAP) features for noise robust ASR. 2109-2112 - Byoung-Don Kim, Jin Young Kim, Seung Ho Choi, Young-Bum Lee, Kyoung-Rok Lee:

Improvement of confidence measure performance using background model set algorithm. 2113-2116 - Guillermo Aradilla, John Dines, Sunil Sivadas:

Using RASTA in task independent TANDEM feature extraction. 2117-2120 - Kyu Jeong Han, Shrikanth S. Narayanan, Naveen Srinivasamurthy:

A distributed speech recognition system in multi-user environments. 2121-2124 - Reinhold Haeb-Umbach, Valentin Ion:

Soft features for improved distributed speech recognition over wireless networks. 2125-2128
Emerging Research
- Rinzou Ebukuro:

Analysis on disappearing and thriving of speech applications for ergonomic design guidelines and recommendations. 2217-2220 - Paula M. T. Smeele, Sebastian Möller, Jan Felix Krebber:

Evaluation of the speech output of a smart-home system in a car environment. 2221-2225 - Ellen C. Haas:

How does the integration of speech recognition controls and spatialized auditory displays affect user workload? 2225-2228 - Fang Chen:

Speech interaction system - how to increase its usability? 2229-2232 - Nicole Beringer:

Human language acquisition methods in a machine learning task. 2233-2236
Spoken Language Resources and Technology Evaluation I
- Laila Dybkjær, Niels Ole Bernsen, Wolfgang Minker:

New challenges in usability evaluation - beyond task-oriented spoken dialogue systems. 2261-2264 - Owen Kimball, Chia-Lin Kao, Rukmini Iyer, Teodoro Arvizo, John Makhoul:

Using quick transcriptions to improve conversational speech models. 2265-2268 - Rohit Mishra, Elizabeth Shriberg, Sandra Upson, Joyce Chen, Fuliang Weng, Stanley Peters, Lawrence Cavedon, John Niekrasz, Hua Cheng, Harry Bratt:

A wizard of oz framework for collecting spoken human-computer dialogs. 2269-2272 - Mikko Hartikainen, Esa-Pekka Salonen, Markku Turunen:

Subjective evaluation of spoken dialogue systems using SERVQUAL method. 2273-2276 - Ioana Vasilescu, Laurence Devillers, Chloé Clavel, Thibaut Ehrette:

Fiction database for emotion detection in abnormal situations. 2277-2280 - Ruhi Sarikaya, Yuqing Gao, Paola Virga:

Fast semi-automatic semantic annotation for spoken dialog systems. 2281-2284 - Yi-Jian Wu, Hisashi Kawai, Jinfu Ni, Ren-Hua Wang:

A study on automatic detection of Japanese vowel devoicing for speech synthesis. 2721-2724 - Tolga Çiloglu, Dinc Acar, Ahmet Tokatli:

OrienTel-Turkish: telephone speech database description and notes on the experience. 2725-2728 - Taejin Yoon, Sandra Chavarria, Jennifer Cole, Mark Hasegawa-Johnson:

Intertranscriber reliability of prosodic labeling on telephone conversation using ToBI. 2729-2732 - Jilei Tian:

Efficient compression method for pronunciation dictionaries. 2733-2736 - Min-Siong Liang, Dau-Cheng Lyu, Yuang-Chin Chiang, Ren-Yuan Lyu:

Construct a multi-lingual speech corpus in Taiwan with extracting phonetically balanced articles. 2737-2740 - Per Olav Heggtveit, Jon Emil Natvig:

Automatic prosody labeling of read Norwegian. 2741-2744 - Eric Sanders, Andrea Diersen, Willy Jongenburger, Helmer Strik:

Towards automatic word segmentation of dialect speech. 2745-2748 - Petr Fousek, Frantisek Grézl, Hynek Hermansky, Petr Svojanovsky:

New nonsense syllables database - analyses and preliminary ASR experiments. 2749-2752 - Jan Felix Krebber, Sebastian Möller, Alexander Raake:

Speech input and output module assessment for remote access to a smart-home spoken dialog system. 2753-2756 - Dong-Hyun Kim, Yong-Wan Roh, Kwang-Seok Hong:

An implementation of a speech DB gathering system using VoiceXML. 2757-2760 - Farshad Almasganj:

Precise phone boundary detection using wavelet packet and recurrent neural networks. 2761-2764 - Andrew Cameron Morris, Viktoria Maier, Phil D. Green:

From WER and RIL to MER and WIL: improved evaluation measures for connected speech recognition. 2765-2768 - Seok-Chae Rhee, Sook-Hyang Lee, Young-Ju Lee, Seok-Keun Kang:

Design and construction of Korean-spoken English corpus. 2769-2772 - Folkert de Vriend, Giulio Maltese:

Exploring XML-based technologies and procedures for quality evaluation from a real-life case perspective. 2773-2776 - Kuansan Wang:

Spoken language interface in ECMA/ISO telecommunication standards. 2777-2780 - Marelie H. Davel, Etienne Barnard:

The efficient generation of pronunciation dictionaries: machine learning factors during bootstrapping. 2781-2784 - Anja Geumann:

Towards a new level of annotation detail of multilingual speech corpora. 2785-2788 - Nobuo Kawaguchi, Shigeki Matsubara, Yukiko Yamaguchi, Kazuya Takeda, Fumitada Itakura:

CIAIR in-car speech database. 2789-2792 - Christophe Van Bael, Henk van den Heuvel, Helmer Strik:

Investigating speech style specific pronunciation variation in large spoken language corpora. 2793-2796 - Marelie H. Davel, Etienne Barnard:

The efficient generation of pronunciation dictionaries: human factors during bootstrapping. 2797-2800
Multi-Modal / Multi-Media Processing
- Roger K. Moore:

Modeling data entry rates for ASR and alternative input methods. 2285-2288 - Hiromitsu Ban, Chiyomi Miyajima, Katsunobu Itou, Fumitada Itakura, Kazuya Takeda:

Speech recognition using synchronization between speech and finger tapping. 2289-2292 - Anurag Kumar Gupta, Tasos Anastasakos:

Integration patterns during multimodal interaction. 2293-2296 - Etienne Marcheret, Stephen M. Chu, Vaibhava Goel, Gerasimos Potamianos:

Efficient likelihood computation in multi-stream HMM based audio-visual speech recognition. 2297-2300 - Changkyu Choi, Donggeon Kong, Hyoung-Ki Lee, Sang Min Yoon:

Separation of multiple concurrent speeches using audio-visual speaker localization and minimum variance beam-forming. 2301-2304 - Tokitomo Ariyoshi, Kazuhiro Nakadai, Hiroshi Tsujino:

Multimodal expression for humanoid robots by integration of human speech mimicking and facial color. 2305-2308
Automatic Speech Recognition in the Context of Mobile Communications
- Miroslav Novak:

Towards large vocabulary ASR on embedded platforms. 2309-2312 - Hiroshi Fujimura, Katsunobu Itou, Kazuya Takeda, Fumitada Itakura:

Analysis of in-car speech recognition experiments using a large-scale multi-mode dialogue corpus. 2313-2316 - Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:

On the integration of speech recognition into personal networks. 2317-2320 - Richard C. Rose, Hong Kook Kim:

Robust speech recognition in client-server scenarios. 2321-2324 - Sangbae Jeong, Icksang Han, Eugene Jon, Jeongsu Kim:

Memory and computation reduction for embedded ASR systems. 2325-2328
Robust Features for ASR
- Takashi Fukuda, Tsuneo Nitta:

Canonicalization of feature parameters for automatic speech recognition. 2537-2540 - Soundararajan Srinivasan, Nicoleta Roman, DeLiang Wang:

On binary and ratio time-frequency masks for robust speech recognition. 2541-2544 - Alberto Sanchís, Alfons Juan, Enrique Vidal:

New features based on multiple word graphs for utterance verification. 2545-2548 - Lukás Burget:

Combination of speech features using smoothed heteroscedastic linear discriminant analysis. 2549-2552 - Shajith Ikbal, Hemant Misra, Sunil Sivadas, Hynek Hermansky, Hervé Bourlard:

Entropy based combination of tandem representations for noise robust ASR. 2553-2556 - Dongsuk Yook, Donghyun Kim:

Fast speech adaptation in linear spectral domain for additive and convolutional noise. 2557-2560
Towards Rapid Speech and Natural Language Application Development: Tooling, Architectures, Components and Standards
- I. Lee Hetherington:

The MIT finite-state transducer toolkit for speech and language processing. 2609-2612 - Junlan Feng, Srinivas Bangalore, Mazin G. Rahim:

Question-answering in webtalk: an evaluation study. 2613-2616 - Juan M. Huerta, Chaitanya Ekanadham:

Automatic network optimization of voice applications. 2617-2620 - Miguel Angel Rodriguez-Moreno, Heriberto Cuayáhuitl, Juventino Montiel-Hernández:

Voicebuilder: a framework for automatic speech application development. 2621-2624 - Andrea Facco, Daniele Falavigna, Roberto Gretter, Marcello Viganò:

On the development of telephone applications: some practical issues and evaluation. 2625-2628 - Stefan W. Hamerich, Volker Schless, Basilis Kladis, Volker Schubert, Otilia Kocsis, Stefan Igel, Ricardo de Córdoba, Luis Fernando D'Haro, José Manuel Pardo:

The GEMINI platform: semi-automatic generation of dialogue applications. 2629-2632
Speech Coding and Enhancement
- Kazuhiro Kondo, Kiyoshi Nakagawa:

A packet loss concealment method using recursive linear prediction. 2633-2636 - Minkyu Lee, Imed Zitouni, Qiru Zhou:

On an n-gram model approach for packet loss concealment. 2637-2640 - Stephen So, Kuldip K. Paliwal:

Efficient vector quantisation of line spectral frequencies using the switched split vector quantiser. 2641-2644 - M. Chaitanya, S. R. Mahadeva Prasanna, B. Yegnanarayana:

Enhancement of reverberant speech using excitation source information. 2645-2648 - Keisuke Kinoshita, Tomohiro Nakatani, Masato Miyoshi:

Improving automatic speech recognition performance and speech intelligibility with harmonicity based dereverberation. 2649-2652 - Seung Yeol Lee, Nam Soo Kim, Joon-Hyuk Chang:

Inner product based-multiband vector quantization for wideband speech coding at 16 kbps. 2653-2656 - Alberto Abad, Javier Hernando:

Speech enhancement and recognition by integrating adaptive beamforming and Wiener filtering. 2657-2660 - Kyung-Tae Kim, Sung-Kyo Jung, MiSuk Lee, Hong-Goo Kang, Dae Hee Youn:

Temporal normalization techniques for transform-type speech coding and application to split-band wideband coders. 2661-2664 - Tatsunori Asai, Shigeki Miyabe, Hiroshi Saruwatari, Kiyohiro Shikano:

Interface for barge-in free spoken dialogue system using adaptive sound field control. 2665-2668 - Jong-Hark Kim, Jae-Hyun Shin, InSung Lee:

Multi-mode harmonic transform excitation LPC coding for speech and music. 2669-2672 - Mital Gandhi, Mark Hasegawa-Johnson:

Source separation using particle filters. 2673-2676 - Anssi Rämö, Jani Nurminen, Sakari Himanen, Ari Heikkinen:

Segmental speech coding model for storage applications. 2677-2680 - Gwo-hwa Ju, Lin-Shan Lee:

Improved speech enhancement by applying time-shift property of DFT on Hankel matrices for signal subspace decomposition. 2681-2684 - Jari Juhani Turunen, Juha T. Tanttu, Frank Cameron:

Minimum phase compensation in speech coding using Hammerstein model. 2685-2688 - Weifeng Li, Fumitada Itakura, Kazuya Takeda:

Optimizing regression for in-car speech recognition using multiple distributed microphones. 2689-2692 - Weifeng Li, Kazuya Takeda, Fumitada Itakura, Tran Huy Dat:

Speech enhancement based on magnitude estimation using the gamma prior. 2693-2696 - Andrew Errity, John McKenna, Stephen Isard:

Unscented Kalman filtering of line spectral frequencies. 2697-2700 - Hyoung-Gook Kim, Thomas Sikora:

Speech enhancement based on smoothing of spectral noise floor. 2701-2704 - Junfeng Li, Masato Akagi:

Noise reduction using hybrid noise estimation technique and post-filtering. 2705-2708 - Marcel Gabrea:

An adaptive Kalman filter for the enhancement of speech signals. 2709-2712 - T. V. Sreenivas, K. Sharath Rao, A. Sreenivasa Murthy:

Improved iterative Wiener filtering for non-stationary noise speech enhancement. 2713-2716 - Yasheng Qian, Peter Kabal:

Highband spectrum envelope estimation of telephone speech using hard/soft-classification. 2717-2720
Acoustic Modeling for Robust ASR
- Filip Korkmazsky, Murat Deviren, Dominique Fohr, Irina Illina:

Hidden factor dynamic Bayesian networks for speech recognition. 2801-2804 - Mark Z. Mao, Vincent Vanhoucke:

Design of compact acoustic models through clustering of tied-covariance Gaussians. 2805-2808 - Chandra Kant Raut, Takuya Nishimoto, Shigeki Sagayama:

Model composition by Lagrange polynomial approximation for robust speech recognition in noisy environment. 2809-2812 - Jian Wu, Donglai Zhu, Qiang Huo:

A study of minimum classification error training for segmental switching linear Gaussian hidden Markov models. 2813-2816 - Shigeki Matsuda, Takatoshi Jitsuhiro, Konstantin Markov, Satoshi Nakamura:

Speech recognition system robust to noise and speaking styles. 2817-2820 - Néstor Becerra Yoma, Ivan Brito, Carlos Molina:

The stochastic weighted Viterbi algorithm: a framework to compensate additive noise and low-bit rate coding distortion. 2821-2824
Spoken Dialogue Technology and Systems
- Stefanie Tomko, Roni Rosenfeld:

Shaping spoken input in user-initiative systems. 2825-2828 - Christopher J. Pavlovski, Jennifer C. Lai, Stella Mitchell:

Etiology of user experience with natural language speech. 2829-2832 - Manny Rayner, Beth Ann Hockey:

Side effect free dialogue management in a voice enabled procedure browser. 2833-2836 - Ian Richard Lane, Tatsuya Kawahara, Shinichi Ueno:

Example-based training of dialogue planning incorporating user and situation models. 2837-2840 - Shinya Fujie, Tetsunori Kobayashi, Daizo Yagi, Hideaki Kikuchi:

Prosody based attitude recognition with feature selection and its application to spoken dialog system as para-linguistic information. 2841-2844 - David Ollason, Yun-Cheng Ju, Siddharth Bhatia, Daniel Herron, Jackie Liu:

MS connect: a fully featured auto-attendant: system design, implementation and performance. 2845-2848
Multi-Channel Speech Processing
- Reinhold Haeb-Umbach, Sven Peschke, Ernst Warsitz:

Adaptive beamforming combined with particle filtering for acoustic source localization. 2849-2852 - Hong-Seok Kwon, Siho Kim, Keun-Sung Bae:

Time delay estimation using weighted CPSP function. 2853-2856 - Ilyas Potamitis, Panagiotis Zervas, Nikos Fakotakis:

DOA estimation of speech signals using semi-blind source separation techniques. 2857-2860 - Sang-Gyun Kim, Chang D. Yoo:

Blind separation of speech and sub-Gaussian signals in underdetermined case. 2861-2864 - Gil-Jin Jang, Changkyu Choi, Yongbeom Lee, Yung-Hwan Oh:

Adaptive cross-channel interference cancellation on blind signal separation outputs using source absence/presence detection and spectral subtraction. 2865-2868 - Erik M. Visser, Kwokleung Chan, Stanley Kim, Te-Won Lee:

A comparison of simultaneous 3-channel blind source separation to selective separation on channel pairs using 2-channel BSS. 2869-2872
Intersection of Spoken Language Processing and Written Language Processing
- Hyun-Bok Lee:

Towards a harmonious coexistence of spoken and written language. 2873-2876 - Miyoko Sugito:

Towards a grammar of spoken language - prosody of ill-formed utterances and listener's understanding in discourse -. 2877-2880 - Tatsuya Kawahara, Kazuya Shitaoka, Hiroaki Nanjo:

Automatic transformation of lecture transcription into document style using statistical framework. 2881-2884 - Karunesh Arora, Sunita Arora, Kapil Verma, Shyam Sunder Agrawal:

Automatic extraction of phonetically rich sentences from large text corpus of Indian languages. 2885-2888 - Nicoletta Calzolari:

European initiatives to promote cooperation between speech and text communities. 2889-2892
Prosodic Recognition and Analysis
- Keiichi Takamaru:

Evaluation of a threshold for detecting local slower phrases in Japanese spontaneous conversational speech. 2969-2972 - Nazrul Effendy, Ekkarit Maneenoi, Patavee Charnvivit, Somchai Jitapunkul:

Intonation recognition for Indonesian speech based on Fujisaki model. 2973-2976 - Jinsong Zhang, Satoshi Nakamura, Keikichi Hirose:

Efficient tone classification of speaker independent continuous Chinese speech using anchoring based discriminating features. 2977-2980 - Michiko Watanabe, Yasuharu Den, Keikichi Hirose, Nobuaki Minematsu:

Clause types and filled pauses in Japanese spontaneous monologues. 2981-2984 - Yohei Yabuta, Yasuhiro Katagiri, Noriko Suzuki, Yugo Takeuchi:

Effect of voice prosody on the decision making process in human-computer interaction. 2985-2988 - Noriko Suzuki, Yasuhiro Katagiri:

Alignment of human prosodic patterns for spoken dialogue systems. 2989-2992 - Shinya Kiriyama, Shigeyoshi Kitazawa:

Evaluation of a prosodic labeling system utilizing linguistic information. 2993-2996 - Allison Blodgett:

Functions of intonation boundaries during spoken language comprehension in English. 2997-3000 - Marco Kühne, Matthias Wolff, Matthias Eichner, Rüdiger Hoffmann:

Voice activation using prosodic features. 3001-3004 - Sahyang Kim:

The role of prosodic cues in word segmentation of Korean. 3005-3008 - Sun-Ah Jun:

Default phrasing and attachment preference in Korean. 3009-3012 - Sarah Borys, Aaron Cohen, Mark Hasegawa-Johnson, Jennifer Cole:

Modeling and recognition of phonetic and prosodic factors for improvements to acoustic speech recognition models. 3013-3016 - Eunjong Kong:

The role of pitch range variation in the discourse structure and intonation structure of Korean. 3017-3020 - Kazuyuki Takagi, Kazuhiko Ozeki:

Dependency analysis of read Japanese sentences using pause and F0 information: a speaker independent case. 3021-3024 - Shari R. Speer, Soyoung Kang:

Effects of prosodic boundaries on ambiguous syntactic clause boundaries in Japanese. 3025-3028 - Yasuko Nagasaki, Takanori Komatsu:

The superior effectiveness of the F0 range for identifying the context from sounds without phonemes. 3029-3032 - Tan Li, Montri Karnjanadecha, Thanate Khaorapapong:

A study of tone classification for continuous Thai speech recognition. 3033-3036 - Key-Seop Kim, Un Lim, Dong-Il Shin:

An acoustic-analytic role for the deviation between the scansion and reading of poems. 3037-3040 - Tomoko Ohsuga, Masafumi Nishida, Yasuo Horiuchi, Akira Ichikawa:

Estimating syntactic structure from prosodic features in Japanese speech. 3041-3044 - Masahiko Komatsu, Tsutomu Sugawara, Takayuki Arai:

Perceptual discrimination of prosodic types and their preliminary acoustic analysis. 3045-3048
Towards Rapid Speech and Natural Language Application Development
- Johann L'Hour, Olivier Boëffard, Jacques Siroux, Laurent Miclet, Francis Charpentier, Thierry Moudenc:

DORIS, a multiagent/IP platform for multimodal dialogue applications. 3049-3052 - Yu Chen:

EVITA-RAD: an extensible enterprise voice portal - rapid application development tool. 3053-3056 - Luis Fernando D'Haro, Ricardo de Córdoba, Rubén San Segundo, Juan Manuel Montero, Javier Macías Guarasa, José Manuel Pardo:

Strategies to reduce design time in multimodal/multilingual dialog applications. 3057-3060 - Gregory Aist:

Three-way system-user-expert interactions help you expand the capabilities of an existing spoken dialogue system. 3061-3064 - Giuseppe Di Fabbrizio, Charles Lewis:

Florence: a dialogue manager framework for spoken dialogue systems. 3065-3068 - Tatsuya Kawahara, Akinobu Lee, Kazuya Takeda, Katsunobu Itou, Kiyohiro Shikano:

Recent progress of open-source LVCSR engine julius and Japanese model repository. 3069-3072 - Hiroya Murao, Nobuo Kawaguchi, Shigeki Matsubara, Yukiko Yamaguchi, Kazuya Takeda, Yasuyoshi Inagaki:

Example-based spoken dialogue system with online example augmentation. 3073-3076 - Dirk Bühler:

Enhancing existing form-based dialogue managers with reasoning capabilities. 3077-3080 - Markku Turunen, Esa-Pekka Salonen, Mikko Hartikainen, Jaakko Hakulinen:

Robust and adaptive architecture for multilingual spoken dialogue systems. 3081-3084 - Porfírio P. Filipe, Nuno J. Mamede:

Towards ubiquitous task management. 3085-3088
