


default search action
Computer Speech & Language, Volume 46
Volume 46, November 2017
- Jindrich Matousek

, Daniel Tihelka
:
Anomaly-based annotation error detection in speech-synthesis corpora. 1-35 - Carmen Magariños

, Paula Lopez-Otero
, Laura Docío Fernández
, Eduardo Rodríguez Banga
, Daniel Erro, Carmen García-Mateo
:
Reversible speaker de-identification using pre-trained transformation functions. 36-52 - Hossein Zeinali

, Hossein Sameti, Lukás Burget
, Jan Cernocký
:
Text-dependent speaker verification based on i-vectors, Neural Networks and Hidden Markov Models. 53-71 - Brecht Desplanques, Kris Demuynck, Jean-Pierre Martens:

Adaptive speaker diarization of broadcast news based on factor analysis. 72-93 - Janez Starc, Dunja Mladenic

:
Constructing a Natural Language Inference dataset using generative neural networks. 94-112 - Scott Piao

, Fraser Dallachy, Alistair Baron
, Jane Demmen
, Steve Wattam, Philip Durkin, James McCracken, Paul Rayson
, Marc Alexander
:
A time-sensitive historical thesaurus-based semantic tagger for deep semantic annotation. 113-135 - Rachel G. Anushiya

, P. Vijayalakshmi
, T. Nagarajan:
Estimation of glottal closure instants from degraded speech using a phase-difference-based algorithm. 136-153 - Herman Kamper

, Aren Jansen, Sharon Goldwater:
A segmental framework for fully-unsupervised large-vocabulary speech recognition. 154-174 - Christoph Draxler, Jonathan Harrington, Florian Schiel:

Towards the next generation of speech tools and corpora. 175-178 - Michael Pucher, Bettina Zillinger, Markus Toman, Dietmar Schabus, Cassia Valentini-Botinhao, Junichi Yamagishi, Erich Schmid, Thomas Woltron:

Influence of speaker familiarity on blind and visually impaired children's and young adults' perception of synthetic voices. 179-195 - Milos Cernak, Juan Rafael Orozco-Arroyave

, Frank Rudzicz
, Heidi Christensen
, Juan Camilo Vásquez-Correa
, Elmar Nöth:
Characterisation of voice quality of Parkinson's disease using differential phonological posterior features. 196-208 - Taehwan Kim, Jonathan Keane, Weiran Wang, Hao Tang, Jason Riggle, Gregory Shakhnarovich, Diane Brentari, Karen Livescu

:
Lexicon-free fingerspelling recognition from video: Data, models, and signer adaptation. 209-232 - Shakti P. Rath:

Scalable algorithms for unsupervised clustering of acoustic data for speech recognition. 233-248 - Milica Gasic, Dilek Hakkani-Tür

, Asli Celikyilmaz
:
Spoken language understanding and interaction: machine learning for human-like conversational systems. 249-251 - Radek Fér, Pavel Matejka, Frantisek Grézl, Oldrich Plchot, Karel Veselý, Jan Honza Cernocký

:
Multilingually trained bottleneck features in spoken language recognition. 252-267 - Heysem Kaya

, Albert Ali Salah
, Alexey Karpov
, Olga V. Frolova
, Aleksey Grigorev
, Elena E. Lyakso
:
Emotion, age, and gender classification in children's speech by humans and machines. 268-283 - Ingrid Zukerman

, Andisheh Partovi:
Improving the understanding of spoken referring expressions through syntactic-semantic and contextual-phonetic error-correction. 284-310 - Young-Bum Kim, Karl Stratos, Ruhi Sarikaya:

A Framework for pre-training hidden-unit conditional random fields and its extension to long short term memory networks. 311-326 - Raymond W. M. Ng, Mauro Nicolao

, Thomas Hain
:
Unsupervised crosslingual adaptation of tokenisers for spoken language recognition. 327-342 - Harishchandra Dubey

, Abhijeet Sangwan, John H. L. Hansen:
Using speech technology for quantifying behavioral characteristics in peer-led team learning sessions. 343-366 - Marta R. Costa-jussà, Alexandre Allauzen, Loïc Barrault

, Kyunghyun Cho, Holger Schwenk:
Introduction to the special issue on deep learning approaches for machine translation. 367-373 - Jahn Heymann, Lukas Drude, Reinhold Haeb-Umbach

:
A generic neural acoustic beamforming architecture for robust multi-channel speech processing. 374-385 - Jon Barker, Ricard Marxer

, Emmanuel Vincent, Shinji Watanabe
:
Multi-microphone speech recognition in everyday environments. 386-387 - Hendrik Barfuss, Christian Huemmer, Andreas Schwarz, Walter Kellermann:

Robust coherence-based spectral enhancement for speech recognition in adverse real-world environments. 388-400 - Takaaki Hori, Zhuo Chen, Hakan Erdogan, John R. Hershey, Jonathan Le Roux, Vikramjit Mitra, Shinji Watanabe

:
Multi-microphone speech recognition integrating beamforming, robust feature extraction, and advanced DNN/RNN backend. 401-418 - Isidoros Rodomagoulakis, Athanasios Katsamanis

, Gerasimos Potamianos, Panagiotis Giannoulis, Antigoni Tsiami, Petros Maragos:
Room-localized spoken command recognition in multi-room, multi-microphone environments. 419-443 - Sunit Sivasankaran, Emmanuel Vincent, Irina Illina:

A combined evaluation of established and new approaches for speech recognition in varied reverberation conditions. 444-460 - Ryu Takeda

, Kazuhiro Nakadai, Kazunori Komatani:
Acoustic model training based on node-wise weight boundary model for fast and small-footprint deep neural networks. 461-480 - Payton Lin, Dau-Cheng Lyu, Fei Chen

, Syu-Siang Wang
, Yu Tsao
:
Multi-style learning with denoising autoencoders for acoustic modeling in the internet of things (IoT). 481-495 - Ji-Won Cho, Jong-Hyeon Park, Joon-Hyuk Chang, Hyung-Min Park

:
Bayesian feature enhancement using independent vector analysis and reverberation parameter re-estimation for noisy reverberant speech recognition. 496-516 - Yanhui Tu, Jun Du, Qing Wang, Xiao Bao, Li-Rong Dai, Chin-Hui Lee:

An information fusion framework with multi-channel feature concatenation and multi-perspective system combination for the deep-learning-based robust recognition of microphone array speech. 517-534 - Emmanuel Vincent, Shinji Watanabe

, Aditya Arie Nugraha
, Jon Barker, Ricard Marxer
:
An analysis of environment, microphone and data simulation mismatches in robust speech recognition. 535-557 - Niko Moritz, Kamil Adiloglu, Jörn Anemüller, Stefan Goetze

, Birger Kollmeier:
Multi-Channel Speech Enhancement and Amplitude Modulation Analysis for Noise Robust Automatic Speech Recognition. 558-573 - Alastair H. Moore

, Pablo Peso Parada, Patrick A. Naylor
:
Speech enhancement for robust automatic speech recognition: Evaluation using a baseline system and instrumental measures. 574-584 - Daniele Falavigna, Marco Matassoni, Shahab Jalalvand, Matteo Negri

, Marco Turchi
:
DNN adaptation by automatic quality estimation of ASR hypotheses. 585-604 - Jon Barker, Ricard Marxer

, Emmanuel Vincent, Shinji Watanabe
:
The third 'CHiME' speech separation and recognition challenge: Analysis and outcomes. 605-626

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














