default search action

combined dblp search
author search
venue search
publication search

ask others

Suyoun Kim

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/LeLKJLSLAS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/LeLKJLSLAS24
Trang Le, Daniel Lazar, Suyoun Kim, Shan Jiang, Duc Le, Adithya Sagar, Aleksandr Livshits, Ahmed Aly, Akshat Shrivastava:
PRoDeliberation: Parallel Robust Deliberation for End-to-End Spoken Language Understanding. EMNLP (Findings) 2024: 14027-14038
[c21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0011KK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0011KK24
Zhe Liu, Suyoun Kim, Ozlem Kalinli:
Evaluating Speech Recognition Performance Towards Large Language Model Based Voice Assistants. INTERSPEECH 2024
[i22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-07823
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-07823
Trang Le, Daniel Lazar, Suyoun Kim, Shan Jiang, Duc Le, Adithya Sagar, Aleksandr Livshits, Ahmed Aly, Akshat Shrivastava:
PRoDeliberation: Parallel Robust Deliberation for End-to-End Spoken Language Understanding. CoRR abs/2406.07823 (2024)
[i21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-01162
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-01162
Wonjune Kang, Junteng Jia, Chunyang Wu, Wei Zhou, Egor Lakomkin, Yashesh Gaur, Leda Sari, Suyoun Kim, Ke Li, Jay Mahadeokar, Ozlem Kalinli:
Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech. CoRR abs/2410.01162 (2024)
2023
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/XuDWKLLS0TLBLS023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/XuDWKLLS0TLBLS023
Derek Xu, Shuyan Dong, Changhan Wang, Suyoun Kim, Zhaojiang Lin, Bing Liu, Akshat Shrivastava, Shang-Wen Li, Liang-Hsuan Tseng, Guan-Ting Lin, Alexei Baevski, Hung-yi Lee, Yizhou Sun, Wei Wang:
Introducing Semantics into Speech Encoders. ACL (1) 2023: 11413-11429
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShrivastavaKTEL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShrivastavaKTEL23
Akshat Shrivastava, Suyoun Kim, Paden Tomasello, Ali Elkahky, Daniel Lazar, Trang Le, Shan Jiang, Duc Le, Aleksandr Livshits, Ahmed Aly:
ICASSP 2023 Spoken Language Understanding Grand Challenge. ICASSP 2023: 1-2
[c18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KimSLLKS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimSLLKS23
Suyoun Kim, Akshat Shrivastava, Duc Le, Ju Lin, Ozlem Kalinli, Michael L. Seltzer:
Modality Confidence Aware Training for Robust End-to-End Spoken Language Understanding. INTERSPEECH 2023: 1119-1123
[i20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-12134
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-12134
Suyoun Kim, Akshat Shrivastava, Duc Le, Ju Lin, Ozlem Kalinli, Michael L. Seltzer:
Modality Confidence Aware Training for Robust End-to-End Spoken Language Understanding. CoRR abs/2307.12134 (2023)
[i19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-09390
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-09390
Roshan Sharma, Suyoun Kim, Daniel Lazar, Trang Le, Akshat Shrivastava, Kwanghoon Ahn, Piyush Kansal, Leda Sari, Ozlem Kalinli, Michael L. Seltzer:
Augmenting text for spoken language understanding with Large Language Models. CoRR abs/2309.09390 (2023)
2022
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/KimLKHZKL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/KimLKHZKL22
Suyoun Kim, Ke Li, Lucas Kabela, Ron Huang, Jiedan Zhu, Ozlem Kalinli, Duc Le:
Joint Audio/Text Training for Transformer Rescorer of Streaming Speech Recognition. EMNLP (Findings) 2022: 5717-5722
[c16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeSTKLKS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeSTKLKS22
Duc Le, Akshat Shrivastava, Paden D. Tomasello, Suyoun Kim, Aleksandr Livshits, Ozlem Kalinli, Michael L. Seltzer:
Deliberation Model for On-Device Spoken Language Understanding. INTERSPEECH 2022: 3468-3472
[c15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KimLZSAZFKS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimLZSAZFKS22
Suyoun Kim, Duc Le, Weiyi Zheng, Tarun Singh, Abhinav Arora, Xiaoyu Zhai, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric. INTERSPEECH 2022: 3978-3982
[i18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-01893
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-01893
Duc Le, Akshat Shrivastava, Paden Tomasello, Suyoun Kim, Aleksandr Livshits, Ozlem Kalinli, Michael L. Seltzer:
Deliberation Model for On-Device Spoken Language Understanding. CoRR abs/2204.01893 (2022)
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-00174
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-00174
Suyoun Kim, Ke Li, Lucas Kabela, Rongqing Huang, Jiedan Zhu, Ozlem Kalinli, Duc Le:
Joint Audio/Text Training for Transformer Rescorer of Streaming Speech Recognition. CoRR abs/2211.00174 (2022)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-08402
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-08402
Derek Xu, Shuyan Dong, Changhan Wang, Suyoun Kim, Zhaojiang Lin, Akshat Shrivastava, Shang-Wen Li, Liang-Hsuan Tseng, Alexei Baevski, Guan-Ting Lin, Hung-yi Lee, Yizhou Sun, Wei Wang:
Introducing Semantics into Speech Encoders. CoRR abs/2211.08402 (2022)
2021
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KimSMBFSL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KimSMBFSL21
Suyoun Kim, Yuan Shangguan, Jay Mahadeokar, Antoine Bruguier, Christian Fuegen, Michael L. Seltzer, Duc Le:
Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer. ICASSP 2021: 7333-7337
[c13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeJKKSMCSFKSS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeJKKSMCSFKSS21
Duc Le, Mahaveer Jain, Gil Keren, Suyoun Kim, Yangyang Shi, Jay Mahadeokar, Julian Chan, Yuan Shangguan, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Michael L. Seltzer:
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion. Interspeech 2021: 1772-1776
[c12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KimALYFKS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimALYFKS21
Suyoun Kim, Abhinav Arora, Duc Le, Ching-Feng Yeh, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding. Interspeech 2021: 1977-1981
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/Liu0LKSZ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/Liu0LKSZ21
Chunxi Liu, Frank Zhang, Duc Le, Suyoun Kim, Yatharth Saraf, Geoffrey Zweig:
Improving RNN Transducer Based ASR with Auxiliary Tasks. SLT 2021: 172-179
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-02138
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-02138
Suyoun Kim, Abhinav Arora, Duc Le, Ching-Feng Yeh, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding. CoRR abs/2104.02138 (2021)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-02194
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-02194
Duc Le, Mahaveer Jain, Gil Keren, Suyoun Kim, Yangyang Shi, Jay Mahadeokar, Julian Chan, Yuan Shangguan, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Michael L. Seltzer:
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion. CoRR abs/2104.02194 (2021)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-05376
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-05376
Suyoun Kim, Duc Le, Weiyi Zheng, Tarun Singh, Abhinav Arora, Xiaoyu Zhai, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric. CoRR abs/2110.05376 (2021)
2020
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-13878
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-13878
Suyoun Kim, Yuan Shangguan, Jay Mahadeokar, Antoine Bruguier, Christian Fuegen, Michael L. Seltzer, Duc Le:
Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer. CoRR abs/2010.13878 (2020)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-03109
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-03109
Chunxi Liu, Frank Zhang, Duc Le, Suyoun Kim, Yatharth Saraf, Geoffrey Zweig:
Improving RNN Transducer Based ASR with Auxiliary Tasks. CoRR abs/2011.03109 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/KimDM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/KimDM19
Suyoun Kim, Siddharth Dalmia, Florian Metze:
Gated Embeddings in End-to-End Speech Recognition for Conversational-Context Fusion. ACL (1) 2019: 1131-1141
[c9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KimDM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimDM19
Suyoun Kim, Siddharth Dalmia, Florian Metze:
Cross-Attention End-to-End ASR for Two-Party Conversations. INTERSPEECH 2019: 4380-4384
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/naacl/KimM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/KimM19
Suyoun Kim, Florian Metze:
Acoustic-to-Word Models with Conversational Context Information. NAACL-HLT (1) 2019: 2766-2771
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1905-08796
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-08796
Suyoun Kim, Florian Metze:
Acoustic-to-Word Models with Conversational Context Information. CoRR abs/1905.08796 (2019)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-11604
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-11604
Suyoun Kim, Siddharth Dalmia, Florian Metze:
Gated Embeddings in End-to-End Speech Recognition for Conversational-Context Fusion. CoRR abs/1906.11604 (2019)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1907-10726
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-10726
Suyoun Kim, Siddharth Dalmia, Florian Metze:
Cross-Attention End-to-End ASR for Two-Party Conversations. CoRR abs/1907.10726 (2019)
2018
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KimS18a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KimS18a
Suyoun Kim, Michael L. Seltzer:
Towards Language-Universal End-to-End Speech Recognition. ICASSP 2018: 4914-4918
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KimSLZ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimSLZ18
Suyoun Kim, Michael L. Seltzer, Jinyu Li, Rui Zhao:
Improved Training for Online End-to-end Speech Recognition Systems. INTERSPEECH 2018: 2913-2917
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/KimM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/KimM18
Suyoun Kim, Florian Metze:
Dialog-Context Aware end-to-end Speech Recognition. SLT 2018: 434-440
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1808-02171
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1808-02171
Suyoun Kim, Florian Metze:
Dialog-context aware end-to-end speech recognition. CoRR abs/1808.02171 (2018)
2017
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/WatanabeHKHH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/WatanabeHKHH17
Shinji Watanabe, Takaaki Hori, Suyoun Kim, John R. Hershey, Tomoki Hayashi:
Hybrid CTC/Attention Architecture for End-to-End Speech Recognition. IEEE J. Sel. Top. Signal Process. 11(8): 1240-1253 (2017)
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KimHW17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KimHW17
Suyoun Kim, Takaaki Hori, Shinji Watanabe:
Joint CTC-attention based end-to-end speech recognition using multi-task learning. ICASSP 2017: 4835-4839
[c3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KimL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimL17
Suyoun Kim, Ian R. Lane:
End-to-End Speech Recognition with Auditory Attention for Multi-Microphone Distance Speech Recognition. INTERSPEECH 2017: 3867-3871
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1711-02207
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1711-02207
Suyoun Kim, Michael L. Seltzer:
Towards Language-Universal End-to-End Speech Recognition. CoRR abs/1711.02207 (2017)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1711-02212
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1711-02212
Suyoun Kim, Michael L. Seltzer, Jinyu Li, Rui Zhao:
Improved training for online end-to-end speech recognition systems. CoRR abs/1711.02212 (2017)
2016
[c2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KimL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimL16
Suyoun Kim, Ian R. Lane:
Recurrent Models for Auditory Attention in Multi-Microphone Distant Speech Recognition. INTERSPEECH 2016: 3838-3842
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/KimRL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/KimRL16
Suyoun Kim, Bhiksha Raj, Ian R. Lane:
Environmental Noise Embeddings for Robust Speech Recognition. CoRR abs/1601.02553 (2016)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/KimHW16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/KimHW16
Suyoun Kim, Takaaki Hori, Shinji Watanabe:
Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning. CoRR abs/1609.06773 (2016)
2015
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/KimL15b
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/KimL15b
Suyoun Kim, Ian R. Lane:
Recurrent Models for Auditory Attention in Multi-Microphone Distance Speech Recognition. CoRR abs/1511.06407 (2015)
2014
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/MoonKW14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/MoonKW14
Seungwhan Moon, Suyoun Kim, Haohan Wang:
Multimodal Transfer Deep Learning for Audio Visual Recognition. CoRR abs/1412.3121 (2014)
2011
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/slip/KimKL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slip/KimKL11
Daehyun Kim, Suyoun Kim, Sung Kyu Lim:
Impact of nano-scale through-silicon vias on the quality of today and future 3D IC designs. SLIP 2011: 1-8

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.