default search action

combined dblp search
author search
venue search
publication search

ask others

Puming Zhan

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/AlbesanoFWZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/AlbesanoFWZ24
Dario Albesano, Nicola Ferri, Felix Weninger, Puming Zhan:
Improving Speed/Accuracy Tradeoff for Online Streaming ASR via Real-Valued and Trainable Strides. ICASSP 2024: 11716-11720
2022
[c21]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WeningerGHFAZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WeningerGHFAZ22
Felix Weninger, Marco Gaudesi, Md. Akmal Haidar, Nicola Ferri, Jesús Andrés-Ferrer, Puming Zhan:
Conformer with dual-mode chunked attention for joint online and offline ASR. INTERSPEECH 2022: 2053-2057
[c20]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AlbesanoAFZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AlbesanoAFZ22
Dario Albesano, Jesús Andrés-Ferrer, Nicola Ferri, Puming Zhan:
On the Prediction Network Architecture in RNN-T for ASR. INTERSPEECH 2022: 2093-2097
[i6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-14618
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-14618
Dario Albesano, Jesús Andrés-Ferrer, Nicola Ferri, Puming Zhan:
On the Prediction Network Architecture in RNN-T for ASR. CoRR abs/2206.14618 (2022)
[i5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-14623
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-14623
Jesús Andrés-Ferrer, Dario Albesano, Puming Zhan, Paul Vozila:
Contextual Density Ratio for Language Model Biasing of Sequence to Sequence ASR Systems. CoRR abs/2206.14623 (2022)
2021
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/WeningerGLGZ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/WeningerGLGZ21
Felix Weninger, Marco Gaudesi, Ralf Leibold, Roberto Gemello, Puming Zhan:
Dual-Encoder Architecture with Encoder Selection for Joint Close-Talk and Far-Talk Speech Recognition. ASRU 2021: 534-540
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/GaudesiWSZ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/GaudesiWSZ21
Marco Gaudesi, Felix Weninger, Dushyant Sharma, Puming Zhan:
ChannelAugment: Improving Generalization of Multi-Channel ASR by Training with Input Channel Randomization. ASRU 2021: 824-829
[c17]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Andres-FerrerAZ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Andres-FerrerAZ21
Jesús Andrés-Ferrer, Dario Albesano, Puming Zhan, Paul Vozila:
Contextual Density Ratio for Language Model Biasing of Sequence to Sequence ASR Systems. Interspeech 2021: 2007-2011
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2109-08744
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-08744
Felix Weninger, Marco Gaudesi, Ralf Leibold, Roberto Gemello, Puming Zhan:
Dual-Encoder Architecture with Encoder Selection for Joint Close-Talk and Far-Talk Speech Recognition. CoRR abs/2109.08744 (2021)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2109-11225
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-11225
Marco Gaudesi, Felix Weninger, Dushyant Sharma, Puming Zhan:
ChannelAugment: Improving generalization of multi-channel ASR by training with input channel randomization. CoRR abs/2109.11225 (2021)
2020
[c16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WeningerMGAZ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WeningerMGAZ20
Felix Weninger, Franco Mana, Roberto Gemello, Jesús Andrés-Ferrer, Puming Zhan:
Semi-Supervised Learning with Data Augmentation for End-to-End ASR. INTERSPEECH 2020: 2802-2806
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2007-13876
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-13876
Felix Weninger, Franco Mana, Roberto Gemello, Jesús Andrés-Ferrer, Puming Zhan:
Semi-Supervised Learning with Data Augmentation for End-to-End ASR. CoRR abs/2007.13876 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/ManaWGZ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/ManaWGZ19
Franco Mana, Felix Weninger, Roberto Gemello, Puming Zhan:
Online Batch Normalization Adaptation for Automatic Speech Recognition. ASRU 2019: 875-880
[c14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WeningerSPWZ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WeningerSPWZ19
Felix Weninger, Yang Sun, Junho Park, Daniel Willett, Puming Zhan:
Deep Learning Based Mandarin Accent Identification for Accent Robust ASR. INTERSPEECH 2019: 510-514
[c13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WeningerALZ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WeningerALZ19
Felix Weninger, Jesús Andrés-Ferrer, Xinwei Li, Puming Zhan:
Listen, Attend, Spell and Adapt: Speaker Adapted Sequence-to-Sequence ASR. INTERSPEECH 2019: 3805-3809
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1907-04916
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-04916
Felix Weninger, Jesús Andrés-Ferrer, Xinwei Li, Puming Zhan:
Listen, Attend, Spell and Adapt: Speaker Adapted Sequence-to-Sequence ASR. CoRR abs/1907.04916 (2019)
2017
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/GibsonCZ17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/GibsonCZ17
Matthew Gibson, Gary Cook, Puming Zhan:
Semi-supervised training strategies for deep neural networks. ASRU 2017: 77-83
2012
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/FukudaTCRZ12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/FukudaTCRZ12
Takashi Fukuda, Ryuki Tachibana, Upendra V. Chaudhari, Bhuvana Ramabhadran, Puming Zhan:
Constructing ensembles of dissimilar acoustic models using hidden attributes of training data. ICASSP 2012: 4141-4144
2011
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/TachibanaFCRZ11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/TachibanaFCRZ11
Ryuki Tachibana, Takashi Fukuda, Upendra V. Chaudhari, Bhuvana Ramabhadran, Puming Zhan:
Frame-level AnyBoost for LVCSR with the MMI Criterion. ASRU 2011: 12-17

1990 – 1999

see FAQ

What is the meaning of the colors in the publication lists?

1999
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WegmannZG99
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WegmannZG99
Steven Wegmann, Puming Zhan, Larry Gillick:
Progress in Broadcast News transcription at Dragon Systems. ICASSP 1999: 33-36
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/interspeech/WegmannZCNYG99
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WegmannZCNYG99
Steven Wegmann, Puming Zhan, Ira Carp, Michael Newman, Jon Yamron, Larry Gillick:
Dragon systems' 1998 broadcast news transcription system. EUROSPEECH 1999
1997
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LavieWLFGGZZ97
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LavieWLFGGZZ97
Alon Lavie, Alex Waibel, Lori S. Levin, Michael Finke, Donna Gates, Marsal Gavaldà, Torsten Zeppenfeld, Puming Zhan:
Janus-III: speech-to-speech translation in multiple languages. ICASSP 1997: 99-102
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhanW97
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhanW97
Puming Zhan, Martin Westphal:
Speaker normalization based on frequency warping. ICASSP 1997: 1039-1042
[c5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhanWFW97
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhanWFW97
Puming Zhan, Martin Westphal, Michael Finke, Alex Waibel:
Speaker normalization and speaker adaptation - a combination for conversational speech recognition. EUROSPEECH 1997: 2087-2090
1996
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/ecai/GatesLLWGMWZ96
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ecai/GatesLLWGMWZ96
Donna Gates, Alon Lavie, Lori S. Levin, Alex Waibel, Marsal Gavaldà, Laura Mayfield, Monika Woszczyna, Puming Zhan:
End-to-End Evaluation in JANUS: A Speech-to-speech Translation System. ECAI Workshop on Dialogue Processing in Spoken Language Systems 1996: 195-206
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WaibelFGGKLLMMMRSSWZZ96
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WaibelFGGKLLMMMRSSWZZ96
Alex Waibel, Michael Finke, Donna Gates, Marsal Gavaldà, Thomas Kemp, Alon Lavie, Lori S. Levin, Martin Maier, Laura Mayfield, Arthur E. McNair, Ivica Rogina, Kaori Shima, Tilo Sloboda, Monika Woszczyna, Torsten Zeppenfeld, Puming Zhan:
JANUS-II-translation of spontaneous conversational speech. ICASSP 1996: 409-412
[c2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhanRGGLW96
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhanRGGLW96
Puming Zhan, Klaus Ries, Marsal Gavaldà, Donna Gates, Alon Lavie, Alex Waibel:
JANUS-II: towards spontaneous Spanish speech recognition. ICSLP 1996: 2285-2288
[c1]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LavieWLGGZZG96
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LavieWLGGZZG96
Alon Lavie, Alex Waibel, Lori S. Levin, Donna Gates, Marsal Gavaldà, Torsten Zeppenfeld, Puming Zhan, Oren Glickman:
Translation of conversational speech with JANUS-II. ICSLP 1996: 2375-2378

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.