Yangyang Shi
This is just a disambiguation page, and is not intended to be the bibliography of an actual person. Any publication listed on this page has not been assigned to an actual author yet. If you know the true author of one of the publications listed below, you are welcome to contact us.
2020 – today
- 2024
- [j12]Yulin Chen, Lei Du, Qiao Sun, Jie Bai, Haitao Li, Yangyang Shi:
Self-Calibration Method of Displacement Sensor in AMB-Rotor System Based on Magnetic Bearing Current Control. IEEE Trans. Ind. Electron. 71(5): 5148-5156 (2024) - [c62]Zechun Liu, Barlas Oguz, Changsheng Zhao, Ernie Chang, Pierre Stock, Yashar Mehdad, Yangyang Shi, Raghuraman Krishnamoorthi, Vikas Chandra:
LLM-QAT: Data-Free Quantization Aware Training for Large Language Models. ACL (Findings) 2024: 467-484 - [c61]Wei Shao, Yangyang Shi, Daoqiang Zhang, Junjie Zhou, Peng Wan:
Tumor Micro-Environment Interactions Guided Graph Learning for Survival Analysis of Human Cancers from Whole-Slide Pathological Images. CVPR 2024: 11694-11703 - [c60]Yangyang Shi, Linan Tian, Liwei Chen, Yanqi Yang, Gang Shi:
Scheduled Execution-Based Binary Indirect Call Targets Refinement. ESORICS (3) 2024: 3-23 - [c59]Gaël Le Lan, Varun Nagaraja, Ernie Chang, David Kant, Zhaoheng Ni, Yangyang Shi, Forrest N. Iandola, Vikas Chandra:
Stack-and-Delay: A New Codebook Pattern for Music Generation. ICASSP 2024: 796-800 - [c58]Ernie Chang, Sidd Srinivasan, Mahi Luthra, Pin-Jie Lin, Varun Nagaraja, Forrest N. Iandola, Zechun Liu, Zhaoheng Ni, Changsheng Zhao, Yangyang Shi, Vikas Chandra:
On the Open Prompt Challenge in Conditional Audio Generation. ICASSP 2024: 5315-5319 - [c57]Ernie Chang, Pin-Jie Lin, Yang Li, Sidd Srinivasan, Gaël Le Lan, David Kant, Yangyang Shi, Forrest N. Iandola, Vikas Chandra:
In-Context Prompt Editing for Conditional Audio Generation. ICASSP 2024: 5320-5324 - [c56]Yang Li, Liangzhen Lai, Yuan Shangguan, Forrest N. Iandola, Zhaoheng Ni, Ernie Chang, Yangyang Shi, Vikas Chandra:
Folding Attention: Memory and Power Optimization for On-Device Transformer-Based Streaming Speech Recognition. ICASSP 2024: 11901-11905 - [c55]Zechun Liu, Changsheng Zhao, Forrest N. Iandola, Chen Lai, Yuandong Tian, Igor Fedorov, Yunyang Xiong, Ernie Chang, Yangyang Shi, Raghuraman Krishnamoorthi, Liangzhen Lai, Vikas Chandra:
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. ICML 2024 - [c54]Yangyang Shi, Qi Zhu, Yingli Zuo, Peng Wan, Daoqiang Zhang, Wei Shao:
Characterizing the Histology Spatial Intersections Between Tumor-Infiltrating Lymphocytes and Tumors for Survival Prediction of Cancers Via Graph Contrastive Learning. MLMI@MICCAI (2) 2024: 212-221 - [c53]Mark Richardson, Fadi Botros, Yangyang Shi, Bradford J. Snow, Pinhao Guo, Linguang Zhang, Jingming Dong, Keith Vertanen, Shugao Ma, Robert Wang:
StegoType: Surface Typing from Egocentric Cameras. UIST (Adjunct Volume) 2024: 12:1-12:14 - [c52]Mark Richardson, Fadi Botros, Yangyang Shi, Pinhao Guo, Bradford J. Snow, Linguang Zhang, Jingming Dong, Keith Vertanen, Shugao Ma, Robert Wang:
StegoType: Surface Typing from Egocentric Cameras. UIST 2024: 83:1-83:14 - [i41]Yang Liu, Li Wan, Yun Li, Yiteng Huang, Ming Sun, James Luan, Yangyang Shi, Xin Lei:
FADI-AEC: Fast Score Based Diffusion Model Guided by Far-end Signal for Acoustic Echo Cancellation. CoRR abs/2401.04283 (2024) - [i40]Yang Li, Yuan Shangguan, Yuhao Wang, Liangzhen Lai, Ernie Chang, Changsheng Zhao, Yangyang Shi, Vikas Chandra:
Not All Weights Are Created Equal: Enhancing Energy Efficiency in On-Device Streaming Speech Recognition. CoRR abs/2402.13076 (2024) - [i39]Zechun Liu, Changsheng Zhao, Forrest N. Iandola, Chen Lai, Yuandong Tian, Igor Fedorov, Yunyang Xiong, Ernie Chang, Yangyang Shi, Raghuraman Krishnamoorthi, Liangzhen Lai, Vikas Chandra:
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. CoRR abs/2402.14905 (2024) - [i38]Yang Li, Changsheng Zhao, Hyungtak Lee, Ernie Chang, Yangyang Shi, Vikas Chandra:
Basis Selection: Low-Rank Decomposition of Pretrained Large Language Models for Target Applications. CoRR abs/2405.15877 (2024) - [i37]Frank Seide, Morrie Doulaty, Yangyang Shi, Yashesh Gaur, Junteng Jia, Chunyang Wu:
Speech ReaLLM - Real-time Streaming Speech Recognition with Multimodal LLMs by Teaching the Flow of Time. CoRR abs/2406.09569 (2024) - [i36]Gaël Le Lan, Bowen Shi, Zhaoheng Ni, Sidd Srinivasan, Anurag Kumar, Brian Ellis, David Kant, Varun Nagaraja, Ernie Chang, Wei-Ning Hsu, Yangyang Shi, Vikas Chandra:
High Fidelity Text-Guided Music Generation and Editing via Single-Stage Flow Matching. CoRR abs/2407.03648 (2024) - [i35]Ernie Chang, Pin-Jie Lin, Yang Li, Changsheng Zhao, Daeil Kim, Rastislav Rabatin, Zechun Liu, Yangyang Shi, Vikas Chandra:
Target-Aware Language Modeling via Granular Data Sampling. CoRR abs/2409.14705 (2024) - 2023
- [j11]Ximing Liu, Xin Ma, Rui Feng, Yulin Chen, Yangyang Shi, Shiqiang Zheng:
Model Reference Adaptive Compensation and Robust Controller for Magnetic Bearing Systems With Strong Persistent Disturbances. IEEE Trans. Ind. Electron. 70(11): 10902-10911 (2023) - [j10]Wei Shao, Yingli Zuo, Yangyang Shi, Yawen Wu, Jiao Tang, Junyong Zhao, Liang Sun, Zixiao Lu, Jianpeng Sheng, Qi Zhu, Daoqiang Zhang:
Characterizing the Survival-Associated Interactions Between Tumor-Infiltrating Lymphocytes and Tumors From Pathological Images and Multi-Omics Data. IEEE Trans. Medical Imaging 42(10): 3025-3035 (2023) - [c51]Zechun Liu, Barlas Oguz, Aasish Pappu, Yangyang Shi, Raghuraman Krishnamoorthi:
Binary and Ternary Natural Language Generation. ACL (1) 2023: 65-77 - [c50]Ernie Chang, Muhammad Hassan Rashid, Pin-Jie Lin, Changsheng Zhao, Vera Demberg, Yangyang Shi, Vikas Chandra:
Revisiting Sample Size Determination in Natural Language Understanding. ACL (Findings) 2023: 6716-6724 - [c49]Ting-Wei Wu, Changsheng Zhao, Ernie Chang, Yangyang Shi, Pierce Chuang, Vikas Chandra, Biing-Hwang Juang:
Towards Zero-Shot Multilingual Transfer for Code-Switched Responses. ACL (1) 2023: 7551-7563 - [c48]Jeff Hwang, Moto Hira, Caroline Chen, Xiaohui Zhang, Zhaoheng Ni, Guangzhi Sun, Pingchuan Ma, Ruizhe Huang, Vineel Pratap, Yuekai Zhang, Anurag Kumar, Chin-Yun Yu, Chuang Zhu, Chunxi Liu, Jacob Kahn, Mirco Ravanelli, Peng Sun, Shinji Watanabe, Yangyang Shi, Yumeng Tao:
TorchAudio 2.1: Advancing Speech Recognition, Self-Supervised Learning, and Audio Processing Components for Pytorch. ASRU 2023: 1-9 - [c47]Ke Li, Jay Mahadeokar, Jinxi Guo, Yangyang Shi, Gil Keren, Ozlem Kalinli, Michael L. Seltzer, Duc Le:
Improving fast-slow Encoder based Transducer with Streaming Deliberation. ICASSP 2023: 1-5 - [c46]Yang Liu, Yangyang Shi, Yun Li, Kaustubh Kalgaonkar, Sriram Srinivasan, Xin Lei:
SCA: Streaming Cross-Attention Alignment For Echo Cancellation. ICASSP 2023: 1-5 - [c45]Yassir Fathullah, Chunyang Wu, Yuan Shangguan, Junteng Jia, Wenhan Xiong, Jay Mahadeokar, Chunxi Liu, Yangyang Shi, Ozlem Kalinli, Mike Seltzer, Mark J. F. Gales:
Multi-Head State Space Model for Speech Recognition. INTERSPEECH 2023: 241-245 - [c44]Florian L. Kreyssig, Yangyang Shi, Jinxi Guo, Leda Sari, Abdel-rahman Mohamed, Philip C. Woodland:
Biased Self-supervised Learning for ASR. INTERSPEECH 2023: 4948-4952 - [i34]Yassir Fathullah, Chunyang Wu, Yuan Shangguan, Junteng Jia, Wenhan Xiong, Jay Mahadeokar, Chunxi Liu, Yangyang Shi, Ozlem Kalinli, Mike Seltzer, Mark J. F. Gales:
Multi-Head State Space Model for Speech Recognition. CoRR abs/2305.12498 (2023) - [i33]Zechun Liu, Barlas Oguz, Changsheng Zhao, Ernie Chang, Pierre Stock, Yashar Mehdad, Yangyang Shi, Raghuraman Krishnamoorthi, Vikas Chandra:
LLM-QAT: Data-Free Quantization Aware Training for Large Language Models. CoRR abs/2305.17888 (2023) - [i32]Zechun Liu, Barlas Oguz, Aasish Pappu, Yangyang Shi, Raghuraman Krishnamoorthi:
Binary and Ternary Natural Language Generation. CoRR abs/2306.01841 (2023) - [i31]Ernie Chang, Muhammad Hassan Rashid, Pin-Jie Lin, Changsheng Zhao, Vera Demberg, Yangyang Shi, Vikas Chandra:
Revisiting Sample Size Determination in Natural Language Understanding. CoRR abs/2307.00374 (2023) - [i30]Mei-Yuh Hwang, Yangyang Shi, Ankit Ramchandani, Guan Pang, Praveen Krishnan, Lucas Kabela, Frank Seide, Samyak Datta, Jun Liu:
DISGO: Automatic End-to-End Evaluation for Scene Text OCR. CoRR abs/2308.13173 (2023) - [i29]Yang Li, Liangzhen Lai, Yuan Shangguan, Forrest N. Iandola, Ernie Chang, Yangyang Shi, Vikas Chandra:
Folding Attention: Memory and Power Optimization for On-Device Transformer-based Streaming Speech Recognition. CoRR abs/2309.07988 (2023) - [i28]Yangyang Shi, Gaël Le Lan, Varun Nagaraja, Zhaoheng Ni, Xinhao Mei, Ernie Chang, Forrest N. Iandola, Yang Liu, Vikas Chandra:
Enhance audio generation controllability through representation similarity regularization. CoRR abs/2309.08773 (2023) - [i27]Gaël Le Lan, Varun Nagaraja, Ernie Chang, David Kant, Zhaoheng Ni, Yangyang Shi, Forrest N. Iandola, Vikas Chandra:
Stack-and-Delay: a new codebook pattern for music generation. CoRR abs/2309.08804 (2023) - [i26]Xinhao Mei, Varun Nagaraja, Gaël Le Lan, Zhaoheng Ni, Ernie Chang, Yangyang Shi, Vikas Chandra:
FoleyGen: Visually-Guided Audio Generation. CoRR abs/2309.10537 (2023) - [i25]Jeff Hwang, Moto Hira, Caroline Chen, Xiaohui Zhang, Zhaoheng Ni, Guangzhi Sun, Pingchuan Ma, Ruizhe Huang, Vineel Pratap, Yuekai Zhang, Anurag Kumar, Chin-Yun Yu, Chuang Zhu, Chunxi Liu, Jacob Kahn, Mirco Ravanelli, Peng Sun, Shinji Watanabe, Yangyang Shi, Yumeng Tao, Robin Scheibler, Samuele Cornell, Sean Kim, Stavros Petridis:
TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch. CoRR abs/2310.17864 (2023) - [i24]Ernie Chang, Pin-Jie Lin, Yang Li, Sidd Srinivasan, Gaël Le Lan, David Kant, Yangyang Shi, Forrest N. Iandola, Vikas Chandra:
In-Context Prompt Editing For Conditional Audio Generation. CoRR abs/2311.00895 (2023) - [i23]Ernie Chang, Sidd Srinivasan, Mahi Luthra, Pin-Jie Lin, Varun Nagaraja, Forrest N. Iandola, Zechun Liu, Zhaoheng Ni, Changsheng Zhao, Yangyang Shi, Vikas Chandra:
On The Open Prompt Challenge In Conditional Audio Generation. CoRR abs/2311.00897 (2023) - 2022
- [j9]Yangyang Shi, Xuesong Deng, Yuqi Tong, Ruotong Li, Yanfang Zhang, Lijie Ren, Weixin Si:
Synergistic Digital Twin and Holographic Augmented-Reality-Guided Percutaneous Puncture of Respiratory Liver Tumor. IEEE Trans. Hum. Mach. Syst. 52(6): 1364-1374 (2022) - [j8]Yangyang Shi, Haitao Li, Bangcheng Han:
Position Extraction of Ultralow-Speed Gimbal Servo System With Linear Hall Sensors. IEEE Trans. Ind. Electron. 69(3): 2947-2955 (2022) - [c43]Linan Tian, Yangyang Shi, Liwei Chen, Yanqi Yang, Gang Shi:
Gadgets Splicing: Dynamic Binary Transformation for Precise Rewriting. CGO 2022: 155-167 - [c42]Yangyang Shi, Chunyang Wu, Dilin Wang, Alex Xiao, Jay Mahadeokar, Xiaohui Zhang, Chunxi Liu, Ke Li, Yuan Shangguan, Varun Nagaraja, Ozlem Kalinli, Mike Seltzer:
Streaming Transformer Transducer based Speech Recognition Using Non-Causal Convolution. ICASSP 2022: 8277-8281 - [c41]Jay Mahadeokar, Yangyang Shi, Ke Li, Duc Le, Jiedan Zhu, Vikas Chandra, Ozlem Kalinli, Michael L. Seltzer:
Streaming parallel transducer beam search with fast slow cascaded encoders. INTERSPEECH 2022: 2083-2087 - [c40]Chunxi Liu, Yuan Shangguan, Haichuan Yang, Yangyang Shi, Raghuraman Krishnamoorthi, Ozlem Kalinli:
Learning a Dual-Mode Speech Recognition Model VIA Self-Pruning. SLT 2022: 273-279 - [i22]Jay Mahadeokar, Yangyang Shi, Ke Li, Duc Le, Jiedan Zhu, Vikas Chandra, Ozlem Kalinli, Michael L. Seltzer:
Streaming parallel transducer beam search with fast-slow cascaded encoders. CoRR abs/2203.15773 (2022) - [i21]Chunxi Liu, Yuan Shangguan, Haichuan Yang, Yangyang Shi, Raghuraman Krishnamoorthi, Ozlem Kalinli:
Learning a Dual-Mode Speech Recognition Model via Self-Pruning. CoRR abs/2207.11906 (2022) - [i20]Yang Liu, Yangyang Shi, Yun Li, Kaustubh Kalgaonkar, Sriram Srinivasan, Xin Lei:
SCA: Streaming Cross-attention Alignment for Echo Cancellation. CoRR abs/2211.00589 (2022) - [i19]Florian L. Kreyssig, Yangyang Shi, Jinxi Guo, Leda Sari, Abdelrahman Mohamed, Philip C. Woodland:
Biased Self-supervised learning for ASR. CoRR abs/2211.02536 (2022) - [i18]Haichuan Yang, Zhaojun Yang, Li Wan, Biqiao Zhang, Yangyang Shi, Yiteng Huang, Ivaylo Enchev, Limin Tang, Raziel Alvarez, Ming Sun, Xin Lei, Raghuraman Krishnamoorthi, Vikas Chandra:
LiCo-Net: Linearized Convolution Network for Hardware-efficient Keyword Spotting. CoRR abs/2211.04635 (2022) - 2021
- [j7]Ruotong Li, Yangyang Shi, Weixin Si, Li Huang, Bowen Zhuang, Michael Weinmann, Reinhard Klein, Pheng-Ann Heng:
Versatile multi-constrained planning for thermal ablation of large liver tumors. Comput. Medical Imaging Graph. 94: 101993 (2021) - [c39]Xiaohui Zhang, Vimal Manohar, David Zhang, Frank Zhang, Yangyang Shi, Nayan Singhal, Julian Chan, Fuchun Peng, Yatharth Saraf, Mike Seltzer:
On Lattice-Free Boosted MMI Training of HMM and CTC-Based Full-Context ASR Models. ASRU 2021: 1026-1033 - [c38]Yongqiang Wang, Yangyang Shi, Frank Zhang, Chunyang Wu, Julian Chan, Ching-Feng Yeh, Alex Xiao:
Transformer in Action: A Comparative Study of Transformer-Based Acoustic Models for Large Scale Speech Recognition Applications. ICASSP 2021: 6778-6782 - [c37]Yangyang Shi, Yongqiang Wang, Chunyang Wu, Ching-Feng Yeh, Julian Chan, Frank Zhang, Duc Le, Mike Seltzer:
Emformer: Efficient Memory Transformer Based Acoustic Model for Low Latency Streaming Speech Recognition. ICASSP 2021: 6783-6787 - [c36]Chunyang Wu, Zhiping Xiu, Yangyang Shi, Ozlem Kalinli, Christian Fuegen, Thilo Köhler, Qing He:
Transformer-Based Acoustic Modeling for Streaming Speech Synthesis. Interspeech 2021: 146-150 - [c35]Duc Le, Mahaveer Jain, Gil Keren, Suyoun Kim, Yangyang Shi, Jay Mahadeokar, Julian Chan, Yuan Shangguan, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Michael L. Seltzer:
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion. Interspeech 2021: 1772-1776 - [c34]Yangyang Shi, Varun Nagaraja, Chunyang Wu, Jay Mahadeokar, Duc Le, Rohit Prabhavalkar, Alex Xiao, Ching-Feng Yeh, Julian Chan, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Dynamic Encoder Transducer: A Flexible Solution for Trading Off Accuracy for Latency. Interspeech 2021: 2042-2046 - [c33]Jay Mahadeokar, Yangyang Shi, Yuan Shangguan, Chunyang Wu, Alex Xiao, Hang Su, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios. Interspeech 2021: 2107-2111 - [c32]Yuan Shangguan, Rohit Prabhavalkar, Hang Su, Jay Mahadeokar, Yangyang Shi, Jiatong Zhou, Chunyang Wu, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition. Interspeech 2021: 4553-4557 - [c31]Varun Nagaraja, Yangyang Shi, Ganesh Venkatesh, Ozlem Kalinli, Michael L. Seltzer, Vikas Chandra:
Collaborative Training of Acoustic Encoders for Speech Recognition. Interspeech 2021: 4573-4577 - [c30]Yangyang Shi, Yuqi Tong, Ruotong Li, Weixin Si:
Internal Motion Estimation during Free-Breathing via External/Internal Correlation Model. RCAR 2021: 986-990 - [c29]Ching-Feng Yeh, Yongqiang Wang, Yangyang Shi, Chunyang Wu, Frank Zhang, Julian Chan, Michael L. Seltzer:
Streaming Attention-Based Models with Augmented Memory for End-To-End Speech Recognition. SLT 2021: 8-14 - [i17]Xiaowen Shan, Yangyang Shi, Xuhui Li:
A multiple-relaxation-time collision model by Hermite expansion. CoRR abs/2102.00817 (2021) - [i16]Yangyang Shi, Varun Nagaraja, Chunyang Wu, Jay Mahadeokar, Duc Le, Rohit Prabhavalkar, Alex Xiao, Ching-Feng Yeh, Julian Chan, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency. CoRR abs/2104.02176 (2021) - [i15]Duc Le, Mahaveer Jain, Gil Keren, Suyoun Kim, Yangyang Shi, Jay Mahadeokar, Julian Chan, Yuan Shangguan, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Michael L. Seltzer:
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion. CoRR abs/2104.02194 (2021) - [i14]Yuan Shangguan, Rohit Prabhavalkar, Hang Su, Jay Mahadeokar, Yangyang Shi, Jiatong Zhou, Chunyang Wu, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition. CoRR abs/2104.02207 (2021) - [i13]Jay Mahadeokar, Yangyang Shi, Yuan Shangguan, Chunyang Wu, Alex Xiao, Hang Su, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios. CoRR abs/2104.02232 (2021) - [i12]Varun Nagaraja, Yangyang Shi, Ganesh Venkatesh, Ozlem Kalinli, Michael L. Seltzer, Vikas Chandra:
Collaborative Training of Acoustic Encoders for Speech Recognition. CoRR abs/2106.08960 (2021) - [i11]Xiaohui Zhang, Vimal Manohar, David Zhang, Frank Zhang, Yangyang Shi, Nayan Singhal, Julian Chan, Fuchun Peng, Yatharth Saraf, Mike Seltzer:
On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models. CoRR abs/2107.04154 (2021) - [i10]Dawei Liang, Yangyang Shi, Yun Wang, Nayan Singhal, Alex Xiao, Jonathan Shaw, Edison Thomaz, Ozlem Kalinli, Mike Seltzer:
Transferring Voice Knowledge for Acoustic Event Detection: An Empirical Study. CoRR abs/2110.03174 (2021) - [i9]Yangyang Shi, Chunyang Wu, Dilin Wang, Alex Xiao, Jay Mahadeokar, Xiaohui Zhang, Chunxi Liu, Ke Li, Yuan Shangguan, Varun Nagaraja, Ozlem Kalinli, Mike Seltzer:
Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution. CoRR abs/2110.05241 (2021) - [i8]Yao-Yuan Yang, Moto Hira, Zhaoheng Ni, Anjali Chourdia, Artyom Astafurov, Caroline Chen, Ching-Feng Yeh, Christian Puhrsch, David Pollack, Dmitriy Genzel, Donny Greenberg, Edward Z. Yang, Jason Lian, Jay Mahadeokar, Jeff Hwang, Ji Chen, Peter Goldsborough, Prabhat Roy, Sean Narenthiran, Shinji Watanabe, Soumith Chintala, Vincent Quenneville-Bélair, Yangyang Shi:
TorchAudio: Building Blocks for Audio and Speech Processing. CoRR abs/2110.15018 (2021) - 2020
- [c28]Jingyong Hou, Yangyang Shi, Mari Ostendorf, Mei-Yuh Hwang, Lei Xie:
Mining Effective Negative Training Samples for Keyword Spotting. ICASSP 2020: 7444-7448 - [c27]Chunyang Wu, Yongqiang Wang, Yangyang Shi, Ching-Feng Yeh, Frank Zhang:
Streaming Transformer-Based Acoustic Models Using Self-Attention with Augmented Memory. INTERSPEECH 2020: 2132-2136 - [c26]Yangyang Shi, Yongqiang Wang, Chunyang Wu, Christian Fuegen, Frank Zhang, Duc Le, Ching-Feng Yeh, Michael L. Seltzer:
Weak-Attention Suppression for Transformer Based Speech Recognition. INTERSPEECH 2020: 4996-5000 - [c25]Chunrong Fang, Zixi Liu, Yangyang Shi, Jeff Huang, Qingkai Shi:
Functional code clone detection with syntax and semantics fusion learning. ISSTA 2020: 516-527 - [c24]Ai Gong, Yi Zhong, Weiqin Zou, Yangyang Shi, Chunrong Fang:
Incorporating Android Code Smells into Java Static Code Metrics for Security Risk Prediction of Android Applications. QRS 2020: 30-40 - [i7]Chunyang Wu, Yongqiang Wang, Yangyang Shi, Ching-Feng Yeh, Frank Zhang:
Streaming Transformer-based Acoustic Models Using Self-attention with Augmented Memory. CoRR abs/2005.08042 (2020) - [i6]Yangyang Shi, Yongqiang Wang, Chunyang Wu, Christian Fuegen, Frank Zhang, Duc Le, Ching-Feng Yeh, Michael L. Seltzer:
Weak-Attention Suppression For Transformer Based Speech Recognition. CoRR abs/2005.09137 (2020) - [i5]Yangyang Shi, Yongqiang Wang, Chunyang Wu, Ching-Feng Yeh, Julian Chan, Frank Zhang, Duc Le, Michael L. Seltzer:
Emformer: Efficient Memory Transformer Based Acoustic Model For Low Latency Streaming Speech Recognition. CoRR abs/2010.10759 (2020) - [i4]Yongqiang Wang, Yangyang Shi, Frank Zhang, Chunyang Wu, Julian Chan, Ching-Feng Yeh, Alex Xiao:
Transformer in action: a comparative study of transformer-based acoustic models for large scale speech recognition applications. CoRR abs/2010.14665 (2020) - [i3]Ching-Feng Yeh, Yongqiang Wang, Yangyang Shi, Chunyang Wu, Frank Zhang, Julian Chan, Michael L. Seltzer:
Streaming Attention-Based Models with Augmented Memory for End-to-End Speech Recognition. CoRR abs/2011.07120 (2020)
2010 – 2019
- 2019
- [j6]Jingyong Hou, Yangyang Shi, Mari Ostendorf, Mei-Yuh Hwang, Lei Xie:
Region Proposal Network Based Small-Footprint Keyword Spotting. IEEE Signal Process. Lett. 26(10): 1471-1475 (2019) - [c23]Yangyang Shi, Mei-Yuh Hwang, Xin Lei:
End-to-end Speech Recognition Using a High Rank LSTM-CTC Based Model. ICASSP 2019: 7080-7084 - [c22]Yangyang Shi, Mei-Yuh Hwang, Xin Lei, Haoyu Sheng:
Knowledge Distillation for Recurrent Neural Network Language Modeling with Trust Regularization. ICASSP 2019: 7230-7234 - [i2]Yangyang Shi, Mei-Yuh Hwang, Xin Lei:
End-To-End Speech Recognition Using A High Rank LSTM-CTC Based Model. CoRR abs/1903.05261 (2019) - [i1]Yangyang Shi, Mei-Yuh Hwang, Xin Lei, Haoyu Sheng:
Knowledge Distillation For Recurrent Neural Network Language Modeling With Trust Regularization. CoRR abs/1904.04163 (2019) - 2018
- [j5]Yangyang Shi, Lei-Hong Zhang, Wenxing Zhu:
A review of "linear programming computation" by Ping-Qi Pan. Eur. J. Oper. Res. 267(3): 1182-1183 (2018) - [c21]Bangcheng Han, Yulin Chen, Shiqiang Zheng, Mingxing Li, Yangyang Shi:
Robust Control for a Magnetically Suspended Control Moment Gyro with Strong Gyroscopic Effects. IECON 2018: 2440-2446 - 2017
- [j4]Yuhua Huang, Xuejun Dai, Yangyang Shi, Ningzhong Liu, Qingxi Zeng, Fei Su:
基于Feistel结构的超轻量级分组密码算法(PFP) (Ultra-lightweight Block Cipher Algorithm (PFP) Based on Feistel Structure). 计算机科学 44(3): 163-167 (2017) - 2016
- [c20]Yangyang Shi, Kaisheng Yao, Hu Chen, Dong Yu, Yi-Cheng Pan, Mei-Yuh Hwang:
Recurrent Support Vector Machines For Slot Tagging In Spoken Language Understanding. HLT-NAACL 2016: 393-399 - [c19]Yangyang Shi, Kaisheng Yao, Le Tian, Daxin Jiang:
Deep LSTM based Feature Mapping for Query Classification. HLT-NAACL 2016: 1501-1511 - 2015
- [j3]Yangyang Shi, Martha A. Larson, Catholijn M. Jonker:
Recurrent neural network language model adaptation with curriculum learning. Comput. Speech Lang. 33(1): 136-154 (2015) - [j2]Yangyang Shi, Martha A. Larson, Joris Pelemans, Catholijn M. Jonker, Patrick Wambacq, Pascal Wiggers, Kris Demuynck:
Integrating meta-information into recurrent neural network language models. Speech Commun. 73: 64-80 (2015) - [c18]Yangyang Shi, Kaisheng Yao, Hu Chen, Yi-Cheng Pan, Mei-Yuh Hwang:
Semi-supervised slot tagging in spoken language understanding using recurrent transductive support vector machines. ASRU 2015: 353-360 - [c17]Yangyang Shi, Kaisheng Yao, Hu Chen, Yi-Cheng Pan, Mei-Yuh Hwang, Baolin Peng:
Contextual spoken language understanding using recurrent neural networks. ICASSP 2015: 5271-5275 - [c16]Yangyang Shi, Yi-Cheng Pan, Mei-Yuh Hwang, Kaisheng Yao, Hu Chen, Yuanhang Zou, Baolin Peng:
A factorization network based method for multi-lingual domain classification. ICASSP 2015: 5276-5280 - [c15]Yik-Cheung Tam, Yangyang Shi, Hunk Chen, Mei-Yuh Hwang:
RNN-based labeled data generation for spoken language understanding. INTERSPEECH 2015: 125-129 - 2014
- [c14]Yangyang Shi, Yi-Cheng Pan, Mei-Yuh Hwang:
Cluster based Chinese abbreviation modeling. INTERSPEECH 2014: 273-277 - [c13]Kaisheng Yao, Baolin Peng, Yu Zhang, Dong Yu, Geoffrey Zweig, Yangyang Shi:
Spoken language understanding using long short-term memory neural networks. SLT 2014: 189-194 - 2013
- [j1]Yangyang Shi, Pascal Wiggers, Catholijn M. Jonker:
Classifying the socio-situational settings of transcripts of spoken discourses. Speech Commun. 55(10): 988-1002 (2013) - [c12]Yangyang Shi, Martha A. Larson, Catholijn M. Jonker:
K-component recurrent neural network language models using curriculum learning. ASRU 2013: 1-6 - [c11]Yangyang Shi, Martha A. Larson, Pascal Wiggers, Catholijn M. Jonker:
Exploiting the succeeding words in recurrent neural network language models. INTERSPEECH 2013: 632-636 - [c10]Yangyang Shi, Mei-Yuh Hwang, Kaisheng Yao, Martha A. Larson:
Speed up of recurrent neural network language models with sentence independent subsampling stochastic gradient descent. INTERSPEECH 2013: 1203-1207 - [c9]Kaisheng Yao, Geoffrey Zweig, Mei-Yuh Hwang, Yangyang Shi, Dong Yu:
Recurrent neural networks for language understanding. INTERSPEECH 2013: 2524-2528 - [c8]Yangyang Shi, Martha A. Larson, Pascal Wiggers, Catholijn M. Jonker:
K-Component Adaptive Recurrent Neural Network Language Models. TSD 2013: 311-318 - 2012
- [c7]Yangyang Shi, Pascal Wiggers, Catholijn M. Jonker:
Dynamic Bayesian socio-situational setting classification. ICASSP 2012: 5081-5084 - [c6]Yangyang Shi, Pascal Wiggers, Catholijn M. Jonker:
Towards Recurrent Neural Networks Language Models with Linguistic and Contextual Features. INTERSPEECH 2012: 1664-1667 - [c5]Yangyang Shi, Martha A. Larson, Pascal Wiggers, Catholijn M. Jonker:
MediaEval 2012 Tagging Task: Prediction based on One Best List and Confusion Networks. MediaEval 2012 - [c4]Peng Xu, Yangyang Shi, Martha A. Larson:
TUD at MediaEval 2012 genre tagging task: Multi-modality video categorization with one-vs-all classifiers. MediaEval 2012 - [c3]Yangyang Shi, Pascal Wiggers, Catholijn M. Jonker:
Adaptive Language Modeling with a Set of Domain Dependent Models. TSD 2012: 472-479 - 2011
- [c2]Yangyang Shi, Pascal Wiggers, Catholijn M. Jonker:
Socio-situational setting classification based on language use. ASRU 2011: 455-460 - [c1]Yangyang Shi, Pascal Wiggers, Catholijn M. Jonker:
Combining Topic Specific Language Models. TSD 2011: 99-106
last updated on 2024-10-31 20:13 CET by the dblp team
all metadata released as open data under CC0 1.0 license