Hynek Hermansky
Journal Articles
- 2020
- [j30]Ruizhi Li, Xiaofei Wang, Sri Harish Mallidi, Shinji Watanabe, Takaaki Hori, Hynek Hermansky:
Multi-Stream End-to-End Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 28: 646-655 (2020) - 2019
- [j29]Angel Mario Castro Martinez, Lukas Gerlach, Guillermo Payá Vayá, Hynek Hermansky, Jasper Ooster, Bernd T. Meyer:
DNN-based performance measures for predicting error rates in automatic speech recognition and optimizing hearing aid parameters. Speech Commun. 106: 44-56 (2019) - [j28]Hynek Hermansky:
Coding and decoding of messages in human speech communication: Implications for machine recognition of speech. Speech Commun. 106: 112-117 (2019) - 2014
- [j27]Sriram Ganapathy, Sri Harish Reddy Mallidi, Hynek Hermansky:
Robust Feature Extraction Using Modulation Filtering of Autoregressive Models. IEEE ACM Trans. Audio Speech Lang. Process. 22(8): 1285-1295 (2014) - 2013
- [j26]Hynek Hermansky:
Multistream Recognition of Speech: Dealing With Unknown Unknowns. Proc. IEEE 101(5): 1076-1088 (2013) - [j25]Hynek Hermansky, Jordan R. Cohen, Richard M. Stern:
Perceptual Properties of Current Speech Recognition Technology. Proc. IEEE 101(9): 1968-1985 (2013) - [j24]Sri Garimella, Hynek Hermansky:
Factor Analysis of Auto-Associative Neural Networks With Application in Speaker Verification. IEEE Trans. Neural Networks Learn. Syst. 24(4): 522-528 (2013) - 2012
- [j23]Daphna Weinshall, Alon Zweig, Hynek Hermansky, Stefan Kombrink, Frank W. Ohl, Jörn Anemüller, Jörg-Hendrik Bach, Luc Van Gool, Fabian Nater, Tomás Pajdla, Michal Havlena, Misha Pavel:
Beyond Novelty Detection: Incongruent Events, When General and Specific Classifiers Disagree. IEEE Trans. Pattern Anal. Mach. Intell. 34(10): 1886-1901 (2012) - [j22]Shajith Ikbal, Hemant Misra, Hynek Hermansky, Mathew Magimai-Doss:
Phase AutoCorrelation (PAC) features for noise robust speech recognition. Speech Commun. 54(7): 867-880 (2012) - [j21]Sri Garimella, Sri Harish Reddy Mallidi, Hynek Hermansky:
Regularized Auto-Associative Neural Networks for Speaker Verification. IEEE Signal Process. Lett. 19(12): 841-844 (2012) - [j20]Garimella S. V. S. Sivaram, Hynek Hermansky:
Sparse Multilayer Perceptron for Phoneme Recognition. IEEE Trans. Speech Audio Process. 20(1): 23-29 (2012) - 2011
- [j19]Joel Pinto, Garimella S. V. S. Sivaram, Mathew Magimai-Doss, Hynek Hermansky, Hervé Bourlard:
Analysis of MLP-Based Hierarchical Phoneme Posterior Probability Estimator. IEEE Trans. Speech Audio Process. 19(2): 225-241 (2011) - 2010
- [j18]Petr Motlícek, Sriram Ganapathy, Hynek Hermansky, Harinath Garudadri:
Wide-Band Audio Coding Based on Frequency-Domain Linear Prediction. EURASIP J. Audio Speech Music. Process. 2010 (2010) - [j17]Garimella S. V. S. Sivaram, Sridhar Krishna Nemala, Nima Mesgarani, Hynek Hermansky:
Data-Driven and Feedback Based Spectro-Temporal Features for Speech Recognition. IEEE Signal Process. Lett. 17(11): 957-960 (2010) - [j16]Sriram Ganapathy, Petr Motlícek, Hynek Hermansky:
Autoregressive Models of Amplitude Modulations in Audio Compression. IEEE Trans. Speech Audio Process. 18(6): 1624-1631 (2010) - 2008
- [j15]Samuel Thomas, Sriram Ganapathy, Hynek Hermansky:
Recognition of Reverberant Speech Using Frequency Domain Linear Prediction. IEEE Signal Process. Lett. 15: 681-684 (2008) - 2005
- [j14]Werner Verhelst, Jürgen Herre, Gernot Kubin, Hynek Hermansky, Søren Holdt Jensen:
Editorial. EURASIP J. Adv. Signal Process. 2005(9): 1289-1291 (2005) - [j13]Nelson Morgan, Qifeng Zhu, Andreas Stolcke, M. Kemal Sönmez, Sunil Sivadas, Takahiro Shinozaki, Mari Ostendorf, Pratibha Jain, Hynek Hermansky, Dan Ellis, George R. Doddington, Barry Y. Chen, Özgür Çetin, Hervé Bourlard, Marios Athineos:
Pushing the envelope - aside [speech recognition]. IEEE Signal Process. Mag. 22(5): 81-88 (2005) - 2003
- [j12]Naren Malayath, Hynek Hermansky:
Data-driven spectral basis functions for automatic speech recognition. Speech Commun. 40(4): 449-466 (2003) - 2000
- [j11]Naren Malayath, Hynek Hermansky, Sachin S. Kajarekar, B. Yegnanarayana:
Data-Driven Temporal Filters and Alternatives to GMM in Speaker Verification. Digit. Signal Process. 10(1-3): 55-74 (2000) - [j10]Howard Hua Yang, Sarel van Vuuren, Sangita Sharma, Hynek Hermansky:
Relevance of time-frequency features for phonetic and speaker-channel classification. Speech Commun. 31(1): 35-50 (2000) - 1999
- [j9]B. Yegnanarayana, Carlos Avendaño, Hynek Hermansky, P. Satyanarayana Murthy:
Speech enhancement using linear prediction residual. Speech Commun. 28(1): 25-42 (1999) - [j8]Noboru Kanedera, Takayuki Arai, Hynek Hermansky, Misha Pavel:
On the relative importance of various components of the modulation spectrum for automatic speech recognition. Speech Commun. 28(1): 43-55 (1999) - 1998
- [j7]Hynek Hermansky:
Should recognizers have ears? Speech Commun. 25(1-3): 3-27 (1998) - 1997
- [j6]Carlos Avendaño, Hynek Hermansky:
On the effects of short-term spectrum smoothing in channel normalization. IEEE Trans. Speech Audio Process. 5(4): 372-374 (1997) - 1996
- [j5]Hervé Bourlard, Hynek Hermansky, Nelson Morgan:
Towards increasing speech recognition error rates. Speech Commun. 18(3): 205-231 (1996) - 1995
- [j4]Ronald A. Cole, Lynette Hirschman, Les E. Atlas, Mary E. Beckman, Alan Biermann, Marcia A. Bush, Mark Clements, Jordan Cohen, Oscar Garcia, Brian A. Hanson, Hynek Hermansky, Steve Levinson, Kathy McKeown, Nelson Morgan, David G. Novick, Mari Ostendorf, Sharon L. Oviatt, Patti Price, Harvey F. Silverman, Judy Spitz, Alex Waibel, Clifford J. Weinstein, Stephen A. Zahorian, Victor Zue:
The challenge of spoken language systems: research directions for the nineties. IEEE Trans. Speech Audio Process. 3(1): 1-21 (1995) - 1994
- [j3]Hynek Hermansky, Nelson Morgan:
RASTA processing of speech. IEEE Trans. Speech Audio Process. 2(4): 578-589 (1994) - 1993
- [j2]Jean-Claude Junqua, Hisashi Wakita, Hynek Hermansky:
Evaluation and optimization of perceptually-based ASR front-end. IEEE Trans. Speech Audio Process. 1(1): 39-48 (1993) - 1985
- [j1]Hynek Hermansky, Brian A. Hanson, Hisashi Wakita:
Low-dimensional representation of vowels based on all-pole modeling in the psychophysical domain. Speech Commun. 4(1-3): 181-187 (1985)
Conference and Workshop Papers
- 2023
- [c207]Samik Sadhu, Hynek Hermansky:
Importance of Different Temporal Modulations of Speech: a Tale of two Perspectives. ICASSP 2023: 1-5 - 2022
- [c206]Martin Sustek, Samik Sadhu, Hynek Hermansky:
Dealing with Unknowns in Continual Learning for End-to-end Automatic Speech Recognition. INTERSPEECH 2022: 1046-1050 - [c205]Samik Sadhu, Hynek Hermansky:
Complex Frequency Domain Linear Prediction: A Tool to Compute Modulation Spectrum of Speech. INTERSPEECH 2022: 3208-3212 - 2021
- [c204]Samik Sadhu, Hynek Hermansky:
Radically Old Way of Computing Spectra: Applications in End-to-End ASR. Interspeech 2021: 1424-1428 - [c203]Ruizhi Li, Gregory Sell, Hynek Hermansky:
Two-Stage Augmentation and Adaptive CTC Fusion for Improved Robustness of Multi-Stream end-to-end ASR. SLT 2021: 229-235 - 2020
- [c202]Ruizhi Li, Gregory Sell, Xiaofei Wang, Shinji Watanabe, Hynek Hermansky:
A Practical Two-Stage Training Strategy for Multi-Stream End-to-End Speech Recognition. ICASSP 2020: 7014-7018 - [c201]Samik Sadhu, Hynek Hermansky:
Continual Learning in Automatic Speech Recognition. INTERSPEECH 2020: 1246-1250 - [c200]Pegah Ghahramani, Hossein Hadian, Daniel Povey, Hynek Hermansky, Sanjeev Khudanpur:
An Alternative to MFCCs for ASR. INTERSPEECH 2020: 1664-1667 - 2019
- [c199]Lucas Ondel, Ruizhi Li, Gregory Sell, Hynek Hermansky:
Deriving Spectro-temporal Properties of Hearing from Speech Data. ICASSP 2019: 411-415 - [c198]Jinyi Yang, Lucas Ondel, Vimal Manohar, Hynek Hermansky:
Towards Automatic Methods to Detect Errors in Transcriptions of Speech Recordings. ICASSP 2019: 3747-3751 - [c197]Samik Sadhu, Ruizhi Li, Hynek Hermansky:
M-vectors: Sub-band Based Energy Modulation Features for Multi-stream Automatic Speech Recognition. ICASSP 2019: 6545-6549 - [c196]Xiaofei Wang, Ruizhi Li, Sri Harish Mallidi, Takaaki Hori, Shinji Watanabe, Hynek Hermansky:
Stream Attention-based Multi-array End-to-end Speech Recognition. ICASSP 2019: 7105-7109 - [c195]Ruizhi Li, Gregory Sell, Hynek Hermansky:
Performance Monitoring for End-to-End Speech Recognition. INTERSPEECH 2019: 2245-2249 - [c194]Xiaofei Wang, Jinyi Yang, Ruizhi Li, Samik Sadhu, Hynek Hermansky:
Exploring Methods for the Automatic Detection of Errors in Manual Transcription. INTERSPEECH 2019: 3003-3007 - [c193]Samik Sadhu, Hynek Hermansky:
Modulation Vectors as Robust Feature Representation for ASR in Domain Mismatched Conditions. INTERSPEECH 2019: 3441-3445 - 2018
- [c192]Xiaofei Wang, Ruizhi Li, Hynek Hermansky:
Stream Attention for Distributed Multi-Microphone Speech Recognition. INTERSPEECH 2018: 3033-3037 - 2017
- [c191]Bernd T. Meyer, Sri Harish Reddy Mallidi, Hendrik Kayser, Hynek Hermansky:
Predicting error rates for unknown data in automatic speech recognition. ICASSP 2017: 5330-5334 - 2016
- [c190]Sri Harish Reddy Mallidi, Hynek Hermansky:
Novel neural network based fusion for multistream ASR. ICASSP 2016: 5680-5684 - [c189]Tetsuji Ogawa, Sri Harish Reddy Mallidi, Emmanuel Dupoux, Jordan Cohen, Naomi H. Feldman, Hynek Hermansky:
A new efficient measure for accuracy prediction and its application to multistream-based unsupervised adaptation. ICPR 2016: 2222-2227 - [c188]Constantin Spille, Hendrik Kayser, Hynek Hermansky, Bernd T. Meyer:
Assessing Speech Quality in Speech-Aware Hearing Aids Based on Phoneme Posteriorgrams. INTERSPEECH 2016: 1755-1759 - [c187]Sri Harish Reddy Mallidi, Hynek Hermansky:
A Framework for Practical Multistream ASR. INTERSPEECH 2016: 3474-3478 - [c186]Bernd T. Meyer, Sri Harish Reddy Mallidi, Angel Mario Castro Martinez, Guillermo Payá Vayá, Hendrik Kayser, Hynek Hermansky:
Performance monitoring for automatic speech recognition in noisy multi-channel environments. SLT 2016: 50-56 - 2015
- [c185]Sri Harish Reddy Mallidi, Tetsuji Ogawa, Hynek Hermansky:
Uncertainty estimation of DNN classifiers. ASRU 2015: 283-288 - [c184]Roger Hsiao, Jeff Z. Ma, William Hartmann, Martin Karafiát, Frantisek Grézl, Lukás Burget, Igor Szöke, Jan Cernocký, Shinji Watanabe, Zhuo Chen, Sri Harish Reddy Mallidi, Hynek Hermansky, Stavros Tsakalidis, Richard M. Schwartz:
Robust speech recognition in unknown reverberant and noisy conditions. ASRU 2015: 533-538 - [c183]Hynek Hermansky, Lukás Burget, Jordan Cohen, Emmanuel Dupoux, Naomi Feldman, John Godfrey, Sanjeev Khudanpur, Matthew Maciejewski, Sri Harish Reddy Mallidi, Anjali Menon, Tetsuji Ogawa, Vijayaditya Peddinti, Richard C. Rose, Richard M. Stern, Matthew Wiesner, Karel Veselý:
Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop. ICASSP 2015: 5009-5013 - [c182]Jan Pesán, Lukás Burget, Hynek Hermansky, Karel Veselý:
DNN derived filters for processing of modulation spectrum of speech. INTERSPEECH 2015: 1908-1911 - [c181]Sri Harish Reddy Mallidi, Tetsuji Ogawa, Karel Veselý, Phani S. Nidadavolu, Hynek Hermansky:
Autoencoder based multi-stream combination for noise robust speech recognition. INTERSPEECH 2015: 3551-3555 - 2014
- [c180]Keith Kintzley, Aren Jansen, Hynek Hermansky:
Featherweight phonetic keyword search for conversational speech. ICASSP 2014: 7859-7863 - [c179]Feipeng Li, Phani S. Nidadavolu, Hynek Hermansky:
A long, deep and wide artificial neural net for robust speech recognition in unknown noise. INTERSPEECH 2014: 358-362 - [c178]Thomas Schatz, Vijayaditya Peddinti, Xuan-Nga Cao, Francis R. Bach, Hynek Hermansky, Emmanuel Dupoux:
Evaluating speech features with the minimal-pair ABX task (II): resistance to noise. INTERSPEECH 2014: 915-919 - [c177]Nagaraj Mahajan, Nima Mesgarani, Hynek Hermansky:
Principal components of auditory spectro-temporal receptive fields. INTERSPEECH 2014: 1983-1987 - 2013
- [c176]Samuel Thomas, Michael L. Seltzer, Kenneth Church, Hynek Hermansky:
Deep neural network features and semi-supervised training for low resource speech recognition. ICASSP 2013: 6704-6708 - [c175]Oldrich Plchot, Spyros Matsoukas, Pavel Matejka, Najim Dehak, Jeff Z. Ma, Sandro Cumani, Ondrej Glembek, Hynek Hermansky, Sri Harish Reddy Mallidi, Nima Mesgarani, Richard M. Schwartz, Mehdi Soufifar, Zheng-Hua Tan, Samuel Thomas, Bing Zhang, Xinhui Zhou:
Developing a speaker identification system for the DARPA RATS project. ICASSP 2013: 6768-6772 - [c174]Pascal Clark, Sri Harish Reddy Mallidi, Aren Jansen, Hynek Hermansky:
Frequency offset correction in speech without detecting pitch. ICASSP 2013: 7020-7024 - [c173]Vijayaditya Peddinti, Hynek Hermansky:
Filter-bank optimization for Frequency Domain Linear Prediction. ICASSP 2013: 7102-7106 - [c172]Feipeng Li, Hynek Hermansky:
Effect of filter bandwidth and spectral sampling rate of analysis filterbank on automatic phoneme recognition. ICASSP 2013: 7121-7124 - [c171]Hynek Hermansky, Ehsan Variani, Vijayaditya Peddinti:
Mean temporal distance: Predicting ASR error from temporal properties of speech signal. ICASSP 2013: 7423-7426 - [c170]Aren Jansen, Samuel Thomas, Hynek Hermansky:
Weak top-down constraints for unsupervised acoustic model training. ICASSP 2013: 8091-8095 - [c169]Aren Jansen, Emmanuel Dupoux, Sharon Goldwater, Mark Johnson, Sanjeev Khudanpur, Kenneth Church, Naomi Feldman, Hynek Hermansky, Florian Metze, Richard C. Rose, Mike Seltzer, Pascal Clark, Ian McGraw, Balakrishnan Varadarajan, Erin Bennett, Benjamin Börschinger, Justin T. Chiu, Ewan Dunbar, Abdellah Fourtassi, David Harwath, Chia-ying Lee, Keith D. Levin, Atta Norouzian, Vijayaditya Peddinti, Rachael Richardson, Thomas Schatz, Samuel Thomas:
A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition. ICASSP 2013: 8111-8115 - [c168]Jeff Z. Ma, Bing Zhang, Spyros Matsoukas, Sri Harish Reddy Mallidi, Feipeng Li, Hynek Hermansky:
Improvements in language identification on the RATS noisy speech corpus. INTERSPEECH 2013: 69-73 - [c167]Keith Kintzley, Aren Jansen, Hynek Hermansky:
Text-to-speech inspired duration modeling for improved whole-word acoustic models. INTERSPEECH 2013: 1253-1257 - [c166]Thomas Schatz, Vijayaditya Peddinti, Francis R. Bach, Aren Jansen, Hynek Hermansky, Emmanuel Dupoux:
Evaluating speech features with the minimal-pair ABX task: analysis of the classical MFC/PLP pipeline. INTERSPEECH 2013: 1781-1785 - [c165]Ehsan Variani, Feipeng Li, Hynek Hermansky:
Multi-stream recognition of noisy speech with performance monitoring. INTERSPEECH 2013: 2978-2981 - [c164]Tetsuji Ogawa, Feipeng Li, Hynek Hermansky:
Stream selection and integration in multistream ASR using GMM-based performance monitoring. INTERSPEECH 2013: 3332-3336 - [c163]Sri Harish Reddy Mallidi, Sriram Ganapathy, Hynek Hermansky:
Robust speaker recognition using spectro-temporal autoregressive models. INTERSPEECH 2013: 3689-3693 - [c162]Hynek Hermansky:
Long, Deep and Wide Artificial Neural Nets for Dealing with Unexpected Noise in Machine Recognition of Speech. TSD 2013: 14-21 - 2012
- [c161]Hans-Günter Hirsch, Sriram Ganapathy, Hynek Hermansky:
Comparison of Different Approaches for Speech Recognition in Hands-free Mode. ITG Conference on Speech Communication 2012: 1-4 - [c160]Daniel Garcia-Romero, Xinhui Zhou, Dmitry N. Zotkin, Balaji Vasan Srinivasan, Yuancheng Luo, Sriram Ganapathy, Samuel Thomas, Sridhar Krishna Nemala, Garimella S. V. S. Sivaram, Majid Mirbagheri, Sri Harish Reddy Mallidi, Thomas Janu, Padmanabhan Rajan, Nima Mesgarani, Mounya Elhilali, Hynek Hermansky, Shihab A. Shamma, Ramani Duraiswami:
The UMD-JHU 2011 speaker recognition system. ICASSP 2012: 4229-4232 - [c159]Samuel Thomas, Sriram Ganapathy, Hynek Hermansky:
Multilingual MLP features for low-resource LVCSR systems. ICASSP 2012: 4269-4272 - [c158]Keith Kintzley, Aren Jansen, Hynek Hermansky:
MAP Estimation of Whole-Word Acoustic Models with Dictionary Priors. INTERSPEECH 2012: 787-790 - [c157]Samuel Thomas, Sriram Ganapathy, Aren Jansen, Hynek Hermansky:
Data-driven Posterior Features for Low Resource Speech Recognition Applications. INTERSPEECH 2012: 791-794 - [c156]Aren Jansen, Samuel Thomas, Hynek Hermansky:
Intrinsic Spectral Analysis for Zero and High Resource Speech Recognition. INTERSPEECH 2012: 879-882 - [c155]Ehsan Variani, Hynek Hermansky:
Estimating Classifier Performance in Unknown Noise. INTERSPEECH 2012: 1800-1803 - [c154]Feipeng Li, Sri Harish Reddy Mallidi, Hynek Hermansky:
Phone recognition in critical bands using sub-band temporal modulations. INTERSPEECH 2012: 1816-1819 - [c153]Sriram Ganapathy, Hynek Hermansky:
Analysis of Temporal Resolution in Frequency Domain Linear Prediction. INTERSPEECH 2012: 1828-1831 - [c152]Samuel Thomas, Sri Harish Reddy Mallidi, Thomas Janu, Hynek Hermansky, Nima Mesgarani, Xinhui Zhou, Shihab A. Shamma, Tim Ng, Bing Zhang, Long Nguyen, Spyros Matsoukas:
Acoustic and Data-driven Features for Robust Speech Activity Detection. INTERSPEECH 2012: 1985-1988 - [c151]Keith Kintzley, Aren Jansen, Kenneth Church, Hynek Hermansky:
Inverting the Point Process Model for Fast Phonetic Keyword Search. INTERSPEECH 2012: 2438-2441 - [c150]Sri Garimella, Hynek Hermansky:
Factor analysis of mixture of auto-associative neural networks for speaker verification. Odyssey 2012: 92-97 - [c149]Samuel Thomas, Sri Harish Reddy Mallidi, Sriram Ganapathy, Hynek Hermansky:
Adaptation transforms of auto-associative neural networks as features for speaker verification. Odyssey 2012: 98-104 - [c148]Sriram Ganapathy, Samuel Thomas, Hynek Hermansky:
Feature extraction using 2-d autoregressive models for speaker recognition. Odyssey 2012: 229-235 - 2011
- [c147]Samuel Thomas, Patrick Nguyen, Geoffrey Zweig, Hynek Hermansky:
MLP based phoneme detectors for Automatic Speech Recognition. ICASSP 2011: 5024-5027 - [c146]Geoffrey Zweig, Patrick Nguyen, Dirk Van Compernolle, Kris Demuynck, Les E. Atlas, Pascal Clark, Gregory Sell, Meihong Wang, Fei Sha, Hynek Hermansky, Damianos G. Karakos, Aren Jansen, Samuel Thomas, Sivaram G. S. V. S., Samuel R. Bowman, Justine T. Kao:
Speech recognition with segmental conditional random fields: A summary of the JHU CLSP 2010 Summer Workshop. ICASSP 2011: 5044-5047 - [c145]Garimella S. V. S. Sivaram, Hynek Hermansky:
Multilayer perceptron with sparse hidden outputs for phoneme recognition. ICASSP 2011: 5336-5339 - [c144]Sri Harish Reddy Mallidi, Sriram Ganapathy, Hynek Hermansky:
Modulation Spectrum Analysis for Recognition of Reverberant Speech. INTERSPEECH 2011: 189-192 - [c143]Michael A. Carlin, Samuel Thomas, Aren Jansen, Hynek Hermansky:
Rapid Evaluation of Speech Representations for Spoken Term Discovery. INTERSPEECH 2011: 821-824 - [c142]Keith Kintzley, Aren Jansen, Hynek Hermansky:
Event Selection from Phone Posteriorgrams Using Matched Filters. INTERSPEECH 2011: 1905-1908 - [c141]Nima Mesgarani, Samuel Thomas, Hynek Hermansky:
Adaptive Stream Fusion in Multistream Recognition of Speech. INTERSPEECH 2011: 2329-2332 - [c140]Garimella S. V. S. Sivaram, Samuel Thomas, Hynek Hermansky:
Mixture of Auto-Associative Neural Networks for Speaker Verification. INTERSPEECH 2011: 2381-2384 - [c139]Hynek Hermansky, Nima Mesgarani, Samuel Thomas:
Performance monitoring for robustness in automatic recognition of speech. MLSLP 2011: 31-34 - [c138]Hynek Hermansky:
Dealing with Unexpected Words in Automatic Recognition of Speech. TSD 2011: 1-15 - [c137]Sriram Ganapathy, Padmanabhan Rajan, Hynek Hermansky:
Multi-layer perceptron based speech activity detection for speaker verification. WASPAA 2011: 321-324 - 2010
- [c136]Sriram Ganapathy, Samuel Thomas, Hynek Hermansky:
Robust spectro-temporal features based on autoregressive models of Hilbert envelopes. ICASSP 2010: 4286-4289 - [c135]Garimella S. V. S. Sivaram, Sridhar Krishna Nemala, Mounya Elhilali, Trac D. Tran, Hynek Hermansky:
Sparse coding for speech recognition. ICASSP 2010: 4346-4349 - [c134]Sriram Ganapathy, Samuel Thomas, Hynek Hermansky:
Comparison of modulation features for phoneme recognition. ICASSP 2010: 5038-5041 - [c133]Hynek Hermansky:
History of modulation spectrum in ASR. ICASSP 2010: 5458-5461 - [c132]Nima Mesgarani, Samuel Thomas, Hynek Hermansky:
A multistream multiresolution framework for phoneme recognition. INTERSPEECH 2010: 318-321 - [c131]Samuel Thomas, Sriram Ganapathy, Hynek Hermansky:
Cross-lingual and multi-stream posterior features for low resource LVCSR systems. INTERSPEECH 2010: 877-880 - [c130]Aren Jansen, Kenneth Church, Hynek Hermansky:
Towards spoken term discovery at scale with zero resources. INTERSPEECH 2010: 1676-1679 - [c129]Garimella S. V. S. Sivaram, Sriram Ganapathy, Hynek Hermansky:
Sparse auto-associative neural networks: theory and application to speech recognition. INTERSPEECH 2010: 2270-2273 - [c128]Samuel Thomas, Kailash Patil, Sriram Ganapathy, Nima Mesgarani, Hynek Hermansky:
A phoneme recognition framework based on auditory spectro-temporal receptive fields. INTERSPEECH 2010: 2458-2461 - [c127]Shih-Chii Liu, Nima Mesgarani, John G. Harris, Hynek Hermansky:
The use of spike-based representations for hardware audition systems. ISCAS 2010: 505-508 - [c126]Tobi Delbrück, Thomas Koch, Raphael Berner, Hynek Hermansky:
Fully integrated 500uW speech detection wake-up circuit. ISCAS 2010: 2015-2018 - [c125]Stefan Kombrink, Mirko Hannemann, Lukás Burget, Hynek Hermansky:
Recovery of Rare Words in Lecture Speech. TSD 2010: 330-337 - 2009
- [c124]Sriram Ganapathy, Samuel Thomas, Hynek Hermansky:
Temporal envelope subtraction for robust speech recognition using modulation spectrum. ASRU 2009: 164-169 - [c123]Misha Pavel, Malcolm Slaney, Hynek Hermansky:
Reconciliation of human and machine speech recognition performance. ICASSP 2009: 1669-1672 - [c122]Joel Pinto, Garimella S. V. S. Sivaram, Hynek Hermansky, Mathew Magimai-Doss:
Volterra series for analyzing MLP based phoneme posterior estimator. ICASSP 2009: 1813-1816 - [c121]Samuel Thomas, Sriram Ganapathy, Hynek Hermansky:
Phoneme recognition using spectral envelope and modulation frequency features. ICASSP 2009: 4453-4456 - [c120]Stefan Kombrink, Lukás Burget, Pavel Matejka, Martin Karafiát, Hynek Hermansky:
Posterior-based out of vocabulary word detection in telephone speech. INTERSPEECH 2009: 80-83 - [c119]Petr Motlícek, Sriram Ganapathy, Hynek Hermansky:
Arithmetic coding of sub-band residuals in FDLP speech/audio codec. INTERSPEECH 2009: 2591-2594 - [c118]Sriram Ganapathy, Samuel Thomas, Hynek Hermansky:
Static and dynamic modulation spectrum for speech recognition. INTERSPEECH 2009: 2823-2826 - [c117]Samuel Thomas, Sriram Ganapathy, Hynek Hermansky:
Tandem representations of spectral envelope and modulation frequency features for ASR. INTERSPEECH 2009: 2955-2958 - [c116]Nima Mesgarani, Garimella S. V. S. Sivaram, Sridhar Krishna Nemala, Mounya Elhilali, Hynek Hermansky:
Discriminant spectrotemporal features for phoneme recognition. INTERSPEECH 2009: 2983-2986 - [c115]Sriram Ganapathy, Petr Motlícek, Hynek Hermansky:
Error Resilient Speech Coding Using Sub-band Hilbert Envelopes. TSD 2009: 355-362 - [c114]Sriram Ganapathy, Samuel Thomas, Petr Motlícek, Hynek Hermansky:
Applications of signal analysis using autoregressive models for amplitude modulation. WASPAA 2009: 341-344 - 2008
- [c113]Garimella S. V. S. Sivaram, Hynek Hermansky:
Emulating temporal receptive fields of auditory mid-brain neurons for automatic speech recognition. EUSIPCO 2008: 1-4 - [c112]Samuel Thomas, Sriram Ganapathy, Hynek Hermansky:
Spectro-temporal features for Automatic Speech Recognition using Linear Prediction in spectral domain. EUSIPCO 2008: 1-4 - [c111]Tamara Tosic, Mathew Magimai-Doss, Hynek Hermansky:
Using comparison of parallel phoneme probability streams for OOV word detection. EUSIPCO 2008: 1-5 - [c110]Lukás Burget, Petr Schwarz, Pavel Matejka, Mirko Hannemann, Ariya Rastrow, Christopher M. White, Sanjeev Khudanpur, Hynek Hermansky, Jan Cernocký:
Combination of strongly and weakly constrained recognizers for reliable detection of OOVS. ICASSP 2008: 4081-4084 - [c109]Christopher M. White, Geoffrey Zweig, Lukás Burget, Petr Schwarz, Hynek Hermansky:
Confidence estimation, OOV detection and language ID using phone-to-word transduction and phone-level alignments. ICASSP 2008: 4085-4088 - [c108]Fabio Valente, Hynek Hermansky:
Hierarchical and parallel processing of modulation spectrum for ASR applications. ICASSP 2008: 4165-4168 - [c107]Joel Pinto, B. Yegnanarayana, Hynek Hermansky, Mathew Magimai-Doss:
Exploiting contextual information for improved phoneme recognition. ICASSP 2008: 4449-4452 - [c106]Sriram Ganapathy, Petr Motlícek, Hynek Hermansky, Harinath Garudadri:
Temporal masking for bit-rate reduction in audio codec based on Frequency Domain Linear Prediction. ICASSP 2008: 4781-4784 - [c105]Jörn Anemüller, Jörg-Hendrik Bach, Barbara Caputo, Michal Havlena, Jie Luo, Hendrik Kayser, Bastian Leibe, Petr Motlícek, Tomás Pajdla, Misha Pavel, Akihiko Torii, Luc Van Gool, Alon Zweig, Hynek Hermansky:
The DIRAC AWEAR audio-visual platform for detection of unexpected and incongruent events. ICMI 2008: 289-292 - [c104]Sriram Ganapathy, Petr Motlícek, Hynek Hermansky, Harinath Garudadri:
Spectral noise shaping: improvements in speech/audio codec based on linear prediction in spectral domain. INTERSPEECH 2008: 675-678 - [c103]Garimella S. V. S. Sivaram, Hynek Hermansky:
Introducing temporal asymmetries in feature extraction for automatic speech recognition. INTERSPEECH 2008: 890-893 - [c102]Sriram Ganapathy, Samuel Thomas, Hynek Hermansky:
Front-end for far-field speech recognition based on frequency domain linear prediction. INTERSPEECH 2008: 984-987 - [c101]Samuel Thomas, Sriram Ganapathy, Hynek Hermansky:
Hilbert envelope based spectro-temporal features for phoneme recognition in telephone speech. INTERSPEECH 2008: 1521-1524 - [c100]Fabio Valente, Hynek Hermansky:
On the combination of auditory and modulation frequency channels for ASR applications. INTERSPEECH 2008: 2242-2245 - [c99]Joel Pinto, Hynek Hermansky:
Combining evidence from a generative and a discriminative model in phoneme recognition. INTERSPEECH 2008: 2414-2417 - [c98]Samuel Thomas, Sriram Ganapathy, Hynek Hermansky:
Hilbert Envelope Based Features for Far-Field Speech Recognition. MLMI 2008: 119-124 - [c97]Daphna Weinshall, Hynek Hermansky, Alon Zweig, Jie Luo, Holly Brügge Jimison, Frank W. Ohl, Misha Pavel:
Beyond Novelty Detection: Incongruent Events, when General and Specific Classifiers Disagree. NIPS 2008: 1745-1752 - [c96]Petr Motlícek, Sriram Ganapathy, Hynek Hermansky, Harinath Garudadri, Marios Athineos:
Perceptually Motivated Sub-band Decomposition for FDLP Audio Coding. TSD 2008: 435-442 - [c95]Sree Hari Krishnan Parthasarathi, Petr Motlícek, Hynek Hermansky:
Exploiting Contextual Information for Speech/Non-Speech Detection. TSD 2008: 451-459 - [c94]Joel Pinto, Garimella S. V. S. Sivaram, Hynek Hermansky:
Reverse Correlation for Analyzing MLP Posterior Features in ASR. TSD 2008: 469-476 - [c93]Garimella S. V. S. Sivaram, Hynek Hermansky:
Emulating Temporal Receptive Fields of Higher Level Auditory Neurons for ASR. TSD 2008: 509-516 - 2007
- [c92]Petr Motlícek, Vijay Ullal, Hynek Hermansky:
Wide-Band Perceptual Audio Coding Based on Frequency-Domain Linear Prediction. ICASSP (1) 2007: 265-268 - [c91]Fabio Valente, Hynek Hermansky:
Combination of Acoustic Classifiers Based on Dempster-Shafer Theory of Evidence. ICASSP (4) 2007: 1129-1132 - [c90]Fabio Valente, Jithendra Vepa, Christian Plahl, Christian Gollan, Hynek Hermansky, Ralf Schlüter:
Hierarchical neural networks feature extraction for LVCSR system. INTERSPEECH 2007: 42-45 - [c89]Fabio Valente, Jithendra Vepa, Hynek Hermansky:
Multi-stream features combination based on dempster-shafer rule for LVCSR system. INTERSPEECH 2007: 1154-1157 - [c88]S. R. Mahadeva Prasanna, Hynek Hermansky:
MRASTA and PLP in automatic speech recognition. INTERSPEECH 2007: 1166-1169 - [c87]Hamed Ketabdar, Mirko Hannemann, Hynek Hermansky:
Detection of out-of-vocabulary words in posterior based ASR. INTERSPEECH 2007: 1757-1760 - [c86]Joel Pinto, Andrew Lovitt, Hynek Hermansky:
Exploiting phoneme similarities in hybrid HMM-ANN keyword spotting. INTERSPEECH 2007: 1817-1820 - [c85]Petr Motlícek, Sriram Ganapathy, Hynek Hermansky, Harinath Garudadri:
Frequency Domain Linear Prediction for QMF Sub-bands and Applications to Audio Coding. MLMI 2007: 248-258 - [c84]Petr Motlícek, Hynek Hermansky, Sriram Ganapathy, Harinath Garudadri:
Non-uniform Speech/Audio Coding Exploiting Predictability of Temporal Evolution of Spectral Envelopes. TSD 2007: 350-357 - 2006
- [c83]Petr Fousek, Hynek Hermansky:
Towards ASR Based on Hierarchical Posterior-Based Keyword Recognition. ICASSP (1) 2006: 433-436 - [c82]Fabio Valente, Hynek Hermansky:
Discriminant linear processing of time-frequency plane. INTERSPEECH 2006 - [c81]Petr Motlícek, Hynek Hermansky, Harinath Garudadri, Naveen Srinivasamurthy:
Speech Coding Based on Spectral Dynamics. TSD 2006: 471-478 - 2005
- [c80]Hynek Hermansky, Petr Fousek:
Multi-resolution RASTA filtering for TANDEM-based ASR. INTERSPEECH 2005: 361-364 - [c79]Hynek Hermansky, Petr Fousek, Mikko Lehtonen:
The Role of Speech in Multimodal Human-Computer Interaction. TSD 2005: 2-8 - 2004
- [c78]Hemant Misra, Shajith Ikbal, Hervé Bourlard, Hynek Hermansky:
Spectral entropy based feature for robust ASR. ICASSP (1) 2004: 193-196 - [c77]Shajith Ikbal, Hemant Misra, Hervé Bourlard, Hynek Hermansky:
Phase autocorrelation (PAC) features in entropy based multi-stream for robust speech recognition. ICASSP (1) 2004: 205-208 - [c76]Sunil Sivadas, Hynek Hermansky:
On use of task independent training data in tandem feature extraction. ICASSP (1) 2004: 541-544 - [c75]Marios Athineos, Hynek Hermansky, Daniel P. W. Ellis:
PLP-squared: autoregressive modeling of auditory-like 2-d spectro-temporal patterns. SAPA@INTERSPEECH 2004: 129 - [c74]Hynek Hermansky:
Stochastic techniques in deriving perceptual knowledge. SAPA@INTERSPEECH 2004: 136 - [c73]Marios Athineos, Hynek Hermansky, Daniel P. W. Ellis:
LP-TRAP: linear predictive temporal patterns. INTERSPEECH 2004: 949-952 - [c72]Shajith Ikbal, Hemant Misra, Sunil Sivadas, Hynek Hermansky, Hervé Bourlard:
Entropy based combination of tandem representations for noise robust ASR. INTERSPEECH 2004: 2553-2556 - [c71]Petr Fousek, Frantisek Grézl, Hynek Hermansky, Petr Svojanovsky:
New nonsense syllables database - analyses and preliminary ASR experiments. INTERSPEECH 2004: 2749-2752 - 2003
- [c70]Sunil Sivadas, Hynek Hermansky:
Generalized tandem feature extraction. ICASSP (1) 2003: 56-59 - [c69]Pratibha Jain, Hynek Hermansky:
Beyond a single critical-band in TRAP based ASR. INTERSPEECH 2003: 437-440 - [c68]Sunil Sivadas, Hynek Hermansky:
In search of target class definition in tandem feature extraction. INTERSPEECH 2003: 837-840 - [c67]André Gustavo Adami, Hynek Hermansky:
Segmentation of speech for speaker and language recognition. INTERSPEECH 2003: 841-844 - [c66]Hynek Hermansky, Pratibha Jain:
Band-independent speech-event categories for TRAP based ASR. INTERSPEECH 2003: 1013-1016 - [c65]Frantisek Grézl, Hynek Hermansky:
Local averaging and differentiating of spectral plane for TRAP-based ASR. INTERSPEECH 2003: 1017-1020 - [c64]Sachin S. Kajarekar, André Gustavo Adami, Hynek Hermansky:
Novel approaches for one- and two-speaker detection. INTERSPEECH 2003: 2661-2664 - [c63]Pavel Matejka, Petr Schwarz, Hynek Hermansky, Jan Cernocký:
Phoneme Recognition Using Temporal Patterns. TSD 2003: 198-205 - 2002
- [c62]Sunil Sivadas, Hynek Hermansky:
Hierarchical tandem feature extraction. ICASSP 2002: 809-812 - [c61]André Gustavo Adami, Sachin S. Kajarekar, Hynek Hermansky:
A new speaker change detection method for two-speaker segmentation. ICASSP 2002: 3908-3911 - [c60]André Gustavo Adami, Lukás Burget, Stéphane Dupont, Harinath Garudadri, Frantisek Grézl, Hynek Hermansky, Pratibha Jain, Sachin S. Kajarekar, Nelson Morgan, Sunil Sivadas:
Qualcomm-ICSI-OGI features for ASR. INTERSPEECH 2002: 21-24 - [c59]Pratibha Jain, Hynek Hermansky, Brian Kingsbury:
Distributed speech recognition using noise-robust MFCC and traps-estimated manner features. INTERSPEECH 2002: 473-476 - [c58]Naren Malayath, Hynek Hermansky:
Bark resolution from speech data. INTERSPEECH 2002: 2169-2172 - [c57]Sachin S. Kajarekar, Hynek Hermansky:
Analysis of Information in Speech Based on MANOVA. NIPS 2002: 1189-1196 - 2001
- [c56]Sachin S. Kajarekar, Bayya Yegnanarayana, Hynek Hermansky:
A study of two dimensional linear discriminants for ASR. ICASSP 2001: 137-140 - [c55]M. Carmen Benítez, Lukás Burget, Barry Y. Chen, Stéphane Dupont, Harinath Garudadri, Hynek Hermansky, Pratibha Jain, Sachin S. Kajarekar, Nelson Morgan, Sunil Sivadas:
Robust ASR front-end using spectral-based and discriminant features: experiments on the Aurora tasks. INTERSPEECH 2001: 429-432 - [c54]Sachin S. Kajarekar, Hynek Hermansky:
Speaker verification based on broad phonetic categories. Odyssey 2001: 201-206 - [c53]Hynek Hermansky:
Human Speech Perception: Some Lessons from Automatic Speech Recognition. TSD 2001: 187-196 - [c52]Lukás Burget, Hynek Hermansky:
Data Driven Design of Filter Bank for Speech Recognition. TSD 2001: 299-304 - 2000
- [c51]Sangita Sharma, Dan Ellis, Sachin S. Kajarekar, Pratibha Jain, Hynek Hermansky:
Feature extraction using non-linear transformation for robust speech recognition on the Aurora database. ICASSP 2000: 1117-1120 - [c50]Hynek Hermansky, Daniel P. W. Ellis, Sangita Sharma:
Tandem connectionist feature extraction for conventional HMM systems. ICASSP 2000: 1635-1638 - [c49]Sunil Sivadas, Pratibha Jain, Hynek Hermansky:
Discriminative MLPs in HMM-based recognition of speech in cellular telephony. INTERSPEECH 2000: 153-156 - [c48]Pratibha Jain, Hynek Hermansky:
Temporal patterns of critical-band spectrum for text-to-speech. INTERSPEECH 2000: 439-441 - [c47]Sachin S. Kajarekar, Hynek Hermansky:
Optimization of units for continuous-digit recognition task. INTERSPEECH 2000: 539-542 - [c46]Sachin S. Kajarekar, Hynek Hermansky:
Analysis of Information in Speech and Its Application in Speech Recognition. TSD 2000: 283-288 - 1999
- [c45]Howard Hua Yang, Sarel van Vuuren, Hynek Hermansky:
Relevancy of time-frequency features for phonetic classification measured by mutual information. ICASSP 1999: 225-228 - [c44]Hynek Hermansky, Sangita Sharma:
Temporal patterns (TRAPs) in ASR of noisy speech. ICASSP 1999: 289-292 - [c43]Hynek Hermansky, Pratibha Jain:
Down-sampling speech representation in ASR. EUROSPEECH 1999: 73-76 - [c42]Sachin S. Kajarekar, Narendranath Malayath, Hynek Hermansky:
Analysis of sources of variability in speech. EUROSPEECH 1999: 343-346 - [c41]Sarel van Vuuren, Hynek Hermansky:
Speech variability in the modulation spectral domain - SANOVA technique -. EUROSPEECH 1999: 2195-2198 - [c40]Hynek Hermansky:
The purpose, history, current state, and some evolving trends in feature extraction for speech recognition. ISSPA 1999: 6 - [c39]Howard Hua Yang, Hynek Hermansky:
Search for Information Bearing Components in Speech. NIPS 1999: 803-812 - [c38]Hynek Hermansky:
Data-Driven Analysis of Speech. TSD 1999: 10-18 - 1998
- [c37]B. Yegnanarayana, P. Satyanarayana Murthy, Carlos Avendaño, Hynek Hermansky:
Enhancement of reverberant speech using LP residual. ICASSP 1998: 405-408 - [c36]Noboru Kanedera, Hynek Hermansky, Takayuki Arai:
On properties of modulation spectrum for robust automatic speech recognition. ICASSP 1998: 613-616 - [c35]Hynek Hermansky, Narendranath Malayath:
Spectral basis functions from discriminant analysis. ICSLP 1998 - [c34]Hynek Hermansky, Sangita Sharma:
TRAPS - classifiers of temporal patterns. ICSLP 1998 - [c33]Sarel van Vuuren, Hynek Hermansky:
On the importance of components of the modulation spectrum for speaker verification. ICSLP 1998 - 1997
- [c32]Sangita Tibrewala, Hynek Hermansky:
Sub-band based recognition of noisy speech. ICASSP 1997: 1255-1258 - [c31]Sarel van Vuuren, Hynek Hermansky:
Data-driven design of RASTA-like filters. EUROSPEECH 1997: 409-412 - [c30]Narendranath Malayath, Hynek Hermansky, Alexander Kain:
Towards decomposing the sources of variability in speech. EUROSPEECH 1997: 497-500 - [c29]Noboru Kanedera, Takayuki Arai, Hynek Hermansky, Misha Pavel:
On the importance of various modulation frequencies for speech recognition. EUROSPEECH 1997: 1079-1082 - [c28]Carlos Avendaño, Sangita Tibrewala, Hynek Hermansky:
Multiresolution channel normalization for ASR in reverberant environments. EUROSPEECH 1997: 1107-1110 - [c27]B. Yegnanarayana, Carlos Avendaño, Hynek Hermansky, P. Satyanarayana Murthy:
Processing linear prediction residual for speech enhancement. EUROSPEECH 1997: 1399-1402 - [c26]Sangita Tibrewala, Hynek Hermansky:
Multi-band and adaptation approaches to robust speech recognition. EUROSPEECH 1997: 2619-2622 - 1996
- [c25]Hervé Bourlard, Stéphane Dupont, Hynek Hermansky, Nelson Morgan:
Towards subband-based speech recognition. EUSIPCO 1996: 1-4 - [c24]Hynek Hermansky, Sangita Tibrewala, Misha Pavel:
Towards ASR on partially corrupted speech. ICSLP 1996: 462-465 - [c23]Carlos Avendaño, Hynek Hermansky:
Study on the dereverberation of speech based on temporal envelope filtering. ICSLP 1996: 889-892 - [c22]Carlos Avendaño, Sarel van Vuuren, Hynek Hermansky:
Data based filter design for RASTA-like channel normalization in ASR. ICSLP 1996: 2087-2090 - [c21]Takayuki Arai, Misha Pavel, Hynek Hermansky, Carlos Avendaño:
Intelligibility of speech with filtered time trajectories of spectral envelopes. ICSLP 1996: 2490-2493 - 1995
- [c20]Nelson Morgan, Hervé Bourlard, Steven Greenberg, Hynek Hermansky, Su-Lin Wu:
Stochastic perceptual models of speech. ICASSP 1995: 397-400 - [c19]Hynek Hermansky, Eric A. Wan, Carlos Avendaño:
Speech enhancement based on temporal processing. ICASSP 1995: 405-408 - [c18]Carlos Avendaño, Hynek Hermansky, Eric A. Wan:
Beyond NYQUIST: towards the recovery of broad-bandwidth speech from narrow-bandwidth speech. EUROSPEECH 1995: 165-168 - 1994
- [c17]Joachim Koehler, Nelson Morgan, Hynek Hermansky, Hans-Günter Hirsch, Grace Tong:
Integrating RASTA-PLP into speech recognition. ICASSP (1) 1994: 421-424 - [c16]Nelson Morgan, Hervé Bourlard, Steven Greenberg, Hynek Hermansky:
Stochastic perceptual auditory-event-based models for speech recognition. ICSLP 1994: 1943-1946 - 1993
- [c15]Hynek Hermansky, Nelson Morgan, Hans-Günter Hirsch:
Recognition of speech in additive and convolutional noise based on RASTA spectral processing. ICASSP (2) 1993: 83-86 - 1992
- [c14]Hynek Hermansky, Nelson Morgan, Aruna Bayya, Phil Kohn:
RASTA-PLP speech analysis technique. ICASSP 1992: 121-124 - [c13]Hynek Hermansky, Nelson Morgan:
Towards handling the acoustic environment in spoken language processing. ICSLP 1992: 85-88 - 1991
- [c12]Nelson Morgan, Hynek Hermansky, Hervé Bourlard, Phil Kohn, Chuck Wooters:
Continuous speech recognition using PLP analysis with multilayer perceptrons. ICASSP 1991: 49-52 - [c11]Hynek Hermansky, Louis Anthony Cox Jr.:
Perceptual linear predictive (PLP) analysis-resynthesis technique. EUROSPEECH 1991: 329-332 - [c10]Hynek Hermansky, Nelson Morgan, Aruna Bayya, Phil Kohn:
Compensation for the effect of the communication channel in auditory-like analysis of speech (RASTA-PLP). EUROSPEECH 1991: 1367-1370 - 1990
- [c9]Aruna Bayya, Hynek Hermansky:
Towards feature-based speech metric. ICASSP 1990: 781-784 - 1989
- [c8]Hynek Hermansky, David J. Broad:
The effective second formant F2' and the vocal tract front-cavity. ICASSP 1989: 480-483 - 1988
- [c7]Hynek Hermansky, Jean-Claude Junqua:
Optimization of perceptually-based ASR front-end [automatic speech recognition]. ICASSP 1988: 219-222 - 1987
- [c6]Hynek Hermansky:
An efficient speaker-independent automatic speech recognition by simulation of some properties of human auditory perception. ICASSP 1987: 1159-1162 - [c5]Hynek Hermansky:
Automatic speech recognition and human auditory perception. ECST 1987: 1079-1082 - 1986
- [c4]Hynek Hermansky, Kazuhiro Tsuga, Shozo Makino, Hisashi Wakita:
Perceptually based processing in automatic speech recognition. ICASSP 1986: 1971-1974 - 1985
- [c3]Hynek Hermansky, Brian A. Hanson, Hisashi Wakita:
Perceptually based linear predictive analysis of speech. ICASSP 1985: 509-512 - 1984
- [c2]Hynek Hermansky, Hiroya Fujisaki, Yasuo Sato:
Spectral envelope sampling and interpolation in linear predictive analysis of speech. ICASSP 1984: 53-56 - 1983
- [c1]Hynek Hermansky, Hiroya Fujisaki, Yasuo Sato:
Analysis and synthesis of speech based on spectral transform linear predictive method. ICASSP 1983: 777-780
Parts in Books or Collections
- 2012
- [p2]Jörn Anemüller, Barbara Caputo, Hynek Hermansky, Frank W. Ohl, Tomás Pajdla, Misha Pavel, Luc Van Gool, Rufin Vogels, Stefan Wabnik, Daphna Weinshall:
DIRAC: Detection and Identification of Rare Audio-Visual Events. Detection and Identification of Rare Audiovisual Cues 2012: 3-35 - 2009
- [p1]Claude Stricker, Jean-Frédéric Wagen, Guillermo Aradilla, Hervé Bourlard, Hynek Hermansky, Joel Pinto, Paul-Henri Rey, Jérôme Théraulaz:
Intelligent Multi-modal Interfaces for Mobile Applications in Hostile Environment (IM-HOST). Human Machine Interaction 2009: 71-102
Editorship
- 2021
- [e1]Hynek Hermansky, Honza Cernocký, Lukás Burget, Lori Lamel, Odette Scharenborg, Petr Motlícek:
22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30 - September 3, 2021. ISCA 2021
Informal and Other Publications
- 2023
- [i14]Martin Sustek, Samik Sadhu, Lukás Burget, Hynek Hermansky, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Stabilized training of joint energy-based models and their practical applications. CoRR abs/2303.04187 (2023) - [i13]Samik Sadhu, Hynek Hermansky:
Self-supervised Learning with Speech Modulation Dropout. CoRR abs/2303.12908 (2023) - 2022
- [i12]Samik Sadhu, Hynek Hermansky:
Complex Frequency Domain Linear Prediction: A Tool to Compute Modulation Spectrum of Speech. CoRR abs/2203.13216 (2022) - [i11]Samik Sadhu, Hynek Hermansky:
Importance of Different Temporal Modulations of Speech: A Tale of Two Perspectives. CoRR abs/2204.00065 (2022) - [i10]Samik Sadhu, Hynek Hermansky:
Blind Signal Dereverberation for Machine Speech Recognition. CoRR abs/2210.00117 (2022) - 2021
- [i9]Ruizhi Li, Gregory Sell, Hynek Hermansky:
Two-Stage Augmentation and Adaptive CTC Fusion for Improved Robustness of Multi-Stream End-to-End ASR. CoRR abs/2102.03055 (2021) - [i8]Samik Sadhu, Hynek Hermansky:
Radically Old Way of Computing Spectra: Applications in End-to-End ASR. CoRR abs/2103.14129 (2021) - 2019
- [i7]Xiaofei Wang, Jinyi Yang, Ruizhi Li, Samik Sadhu, Hynek Hermansky:
Exploring Methods for the Automatic Detection of Errors in Manual Transcription. CoRR abs/1904.04294 (2019) - [i6]Ruizhi Li, Gregory Sell, Hynek Hermansky:
Performance Monitoring for End-to-End Speech Recognition. CoRR abs/1904.04896 (2019) - [i5]Ruizhi Li, Xiaofei Wang, Sri Harish Mallidi, Shinji Watanabe, Takaaki Hori, Hynek Hermansky:
Multi-Stream End-to-End Speech Recognition. CoRR abs/1906.08041 (2019) - [i4]Ruizhi Li, Gregory Sell, Xiaofei Wang, Shinji Watanabe, Hynek Hermansky:
A practical two-stage training strategy for multi-stream end-to-end speech recognition. CoRR abs/1910.10671 (2019) - 2018
- [i3]Ruizhi Li, Xiaofei Wang, Sri Harish Reddy Mallidi, Takaaki Hori, Shinji Watanabe, Hynek Hermansky:
Multi-encoder multi-resolution framework for end-to-end speech recognition. CoRR abs/1811.04897 (2018) - [i2]Xiaofei Wang, Ruizhi Li, Sri Harish Mallidi, Takaaki Hori, Shinji Watanabe, Hynek Hermansky:
Stream attention-based multi-array end-to-end speech recognition. CoRR abs/1811.04903 (2018) - 2017
- [i1]Xiaofei Wang, Yonghong Yan, Hynek Hermansky:
Stream Attention for far-field multi-microphone ASR. CoRR abs/1711.11141 (2017)
last updated on 2024-09-19 23:45 CEST by the dblp team
all metadata released as open data under CC0 1.0 license