Puming Zhan
2020 – today
- 2024
- [c22] Dario Albesano, Nicola Ferri, Felix Weninger, Puming Zhan: Improving Speed/Accuracy Tradeoff for Online Streaming ASR via Real-Valued and Trainable Strides. ICASSP 2024: 11716-11720
- 2022
- [c21] Felix Weninger, Marco Gaudesi, Md. Akmal Haidar, Nicola Ferri, Jesús Andrés-Ferrer, Puming Zhan: Conformer with dual-mode chunked attention for joint online and offline ASR. INTERSPEECH 2022: 2053-2057
- [c20] Dario Albesano, Jesús Andrés-Ferrer, Nicola Ferri, Puming Zhan: On the Prediction Network Architecture in RNN-T for ASR. INTERSPEECH 2022: 2093-2097
- [i6] Dario Albesano, Jesús Andrés-Ferrer, Nicola Ferri, Puming Zhan: On the Prediction Network Architecture in RNN-T for ASR. CoRR abs/2206.14618 (2022)
- [i5] Jesús Andrés-Ferrer, Dario Albesano, Puming Zhan, Paul Vozila: Contextual Density Ratio for Language Model Biasing of Sequence to Sequence ASR Systems. CoRR abs/2206.14623 (2022)
- 2021
- [c19] Felix Weninger, Marco Gaudesi, Ralf Leibold, Roberto Gemello, Puming Zhan: Dual-Encoder Architecture with Encoder Selection for Joint Close-Talk and Far-Talk Speech Recognition. ASRU 2021: 534-540
- [c18] Marco Gaudesi, Felix Weninger, Dushyant Sharma, Puming Zhan: ChannelAugment: Improving Generalization of Multi-Channel ASR by Training with Input Channel Randomization. ASRU 2021: 824-829
- [c17] Jesús Andrés-Ferrer, Dario Albesano, Puming Zhan, Paul Vozila: Contextual Density Ratio for Language Model Biasing of Sequence to Sequence ASR Systems. Interspeech 2021: 2007-2011
- [i4] Felix Weninger, Marco Gaudesi, Ralf Leibold, Roberto Gemello, Puming Zhan: Dual-Encoder Architecture with Encoder Selection for Joint Close-Talk and Far-Talk Speech Recognition. CoRR abs/2109.08744 (2021)
- [i3] Marco Gaudesi, Felix Weninger, Dushyant Sharma, Puming Zhan: ChannelAugment: Improving generalization of multi-channel ASR by training with input channel randomization. CoRR abs/2109.11225 (2021)
- 2020
- [c16] Felix Weninger, Franco Mana, Roberto Gemello, Jesús Andrés-Ferrer, Puming Zhan: Semi-Supervised Learning with Data Augmentation for End-to-End ASR. INTERSPEECH 2020: 2802-2806
- [i2] Felix Weninger, Franco Mana, Roberto Gemello, Jesús Andrés-Ferrer, Puming Zhan: Semi-Supervised Learning with Data Augmentation for End-to-End ASR. CoRR abs/2007.13876 (2020)
2010 – 2019
- 2019
- [c15] Franco Mana, Felix Weninger, Roberto Gemello, Puming Zhan: Online Batch Normalization Adaptation for Automatic Speech Recognition. ASRU 2019: 875-880
- [c14] Felix Weninger, Yang Sun, Junho Park, Daniel Willett, Puming Zhan: Deep Learning Based Mandarin Accent Identification for Accent Robust ASR. INTERSPEECH 2019: 510-514
- [c13] Felix Weninger, Jesús Andrés-Ferrer, Xinwei Li, Puming Zhan: Listen, Attend, Spell and Adapt: Speaker Adapted Sequence-to-Sequence ASR. INTERSPEECH 2019: 3805-3809
- [i1] Felix Weninger, Jesús Andrés-Ferrer, Xinwei Li, Puming Zhan: Listen, Attend, Spell and Adapt: Speaker Adapted Sequence-to-Sequence ASR. CoRR abs/1907.04916 (2019)
- 2017
- [c12] Matthew Gibson, Gary Cook, Puming Zhan: Semi-supervised training strategies for deep neural networks. ASRU 2017: 77-83
- 2012
- [c11] Takashi Fukuda, Ryuki Tachibana, Upendra V. Chaudhari, Bhuvana Ramabhadran, Puming Zhan: Constructing ensembles of dissimilar acoustic models using hidden attributes of training data. ICASSP 2012: 4141-4144
- 2011
- [c10] Ryuki Tachibana, Takashi Fukuda, Upendra V. Chaudhari, Bhuvana Ramabhadran, Puming Zhan: Frame-level AnyBoost for LVCSR with the MMI Criterion. ASRU 2011: 12-17
1990 – 1999
- 1999
- [c9] Steven Wegmann, Puming Zhan, Larry Gillick: Progress in Broadcast News transcription at Dragon Systems. ICASSP 1999: 33-36
- [c8] Steven Wegmann, Puming Zhan, Ira Carp, Michael Newman, Jon Yamron, Larry Gillick: Dragon systems' 1998 broadcast news transcription system. EUROSPEECH 1999
- 1997
- [c7] Alon Lavie, Alex Waibel, Lori S. Levin, Michael Finke, Donna Gates, Marsal Gavaldà, Torsten Zeppenfeld, Puming Zhan: Janus-III: speech-to-speech translation in multiple languages. ICASSP 1997: 99-102
- [c6] Puming Zhan, Martin Westphal: Speaker normalization based on frequency warping. ICASSP 1997: 1039-1042
- [c5] Puming Zhan, Martin Westphal, Michael Finke, Alex Waibel: Speaker normalization and speaker adaptation - a combination for conversational speech recognition. EUROSPEECH 1997: 2087-2090
- 1996
- [c4] Donna Gates, Alon Lavie, Lori S. Levin, Alex Waibel, Marsal Gavaldà, Laura Mayfield, Monika Woszczyna, Puming Zhan: End-to-End Evaluation in JANUS: A Speech-to-speech Translation System. ECAI Workshop on Dialogue Processing in Spoken Language Systems 1996: 195-206
- [c3] Alex Waibel, Michael Finke, Donna Gates, Marsal Gavaldà, Thomas Kemp, Alon Lavie, Lori S. Levin, Martin Maier, Laura Mayfield, Arthur E. McNair, Ivica Rogina, Kaori Shima, Tilo Sloboda, Monika Woszczyna, Torsten Zeppenfeld, Puming Zhan: JANUS-II-translation of spontaneous conversational speech. ICASSP 1996: 409-412
- [c2] Puming Zhan, Klaus Ries, Marsal Gavaldà, Donna Gates, Alon Lavie, Alex Waibel: JANUS-II: towards spontaneous Spanish speech recognition. ICSLP 1996: 2285-2288
- [c1] Alon Lavie, Alex Waibel, Lori S. Levin, Donna Gates, Marsal Gavaldà, Torsten Zeppenfeld, Puming Zhan, Oren Glickman: Translation of conversational speech with JANUS-II. ICSLP 1996: 2375-2378
last updated on 2024-09-19 23:42 CEST by the dblp team
all metadata released as open data under CC0 1.0 license