Pingchuan Ma 0001
Person information
- affiliation: Meta AI
- affiliation (PhD 2022): Imperial College London, UK
Other persons with the same name
- Pingchuan Ma 0002 — Massachusetts Institute of Technology, CSAIL, Cambridge, USA (and 1 more)
- Pingchuan Ma 0003 — Beihang University, Beijing, China
- Pingchuan Ma 0004 — Hong Kong University of Science and Technology, Hong Kong (and 1 more)
- Pingchuan Ma 0005 — Chinese Academy of Sciences, Institute of Information Engineering, Beijing, China
- Pingchuan Ma 0006 — Heidelberg University, Heidelberg Collaboratory for Image Processing, IWR, Germany
- Pingchuan Ma 0007 — Shanghai University of Traditional Chinese Medicine, Longhua Hospital, China
- Pingchuan Ma 0008 — Chinese Academy of Sciences, Academy of Mathematics and Systems Science, KLMM, Beijing, China (and 1 more)
- Pingchuan Ma 0009 — East China University of Science and Technology, Department of Computer Science and Engineering, MoE Laboratory of Smart Manufacturing in Energy Chemical Process, Shanghai, China
- Pingchuan Ma 0011 — Dalian Maritime University, College of Artificial Intelligence, China
- Pingchuan Ma 0012 — Arizona State University, Tempe, AZ, USA
2020 – today
- 2024
- [i28] Adriana Fernandez-Lopez, Honglie Chen, Pingchuan Ma, Lu Yin, Qiao Xiao, Stavros Petridis, Shiwei Liu, Maja Pantic:
MSRS: Training Multimodal Speech Recognition Models from Scratch with Sparse Mask Optimization. CoRR abs/2406.17614 (2024)
- [i27] Qiao Xiao, Pingchuan Ma, Adriana Fernandez-Lopez, Boqian Wu, Lu Yin, Stavros Petridis, Mykola Pechenizkiy, Maja Pantic, Decebal Constantin Mocanu, Shiwei Liu:
Dynamic Data Pruning for Automatic Speech Recognition. CoRR abs/2406.18373 (2024)
- [i26] Umberto Cappellazzo, Minsu Kim, Honglie Chen, Pingchuan Ma, Stavros Petridis, Daniele Falavigna, Alessio Brutti, Maja Pantic:
Large Language Models Are Strong Audio-Visual Speech Recognition Learners. CoRR abs/2409.12319 (2024)
- 2023
- [j6] Triantafyllos Kefalas, Eftychia Fotiadou, Markos Georgopoulos, Yannis Panagakis, Pingchuan Ma, Stavros Petridis, Themos Stafylakis, Maja Pantic:
KAN-AV dataset for audio-visual face and speech analysis in the wild. Image Vis. Comput. 140: 104839 (2023)
- [j5] Yujiang Wang, Mingzhi Dong, Jie Shen, Yiming Luo, Yiming Lin, Pingchuan Ma, Stavros Petridis, Maja Pantic:
Self-Supervised Video-Centralised Transformer for Video Face Clustering. IEEE Trans. Pattern Anal. Mach. Intell. 45(11): 12944-12959 (2023)
- [j4] Lixiong Liu, Pingchuan Ma, Chongwen Wang, Dong Xu:
Omnidirectional Image Quality Assessment With Knowledge Distillation. IEEE Signal Process. Lett. 30: 1562-1566 (2023)
- [j3] Rodrigo Mira, Konstantinos Vougioukas, Pingchuan Ma, Stavros Petridis, Björn W. Schuller, Maja Pantic:
End-to-End Video-to-Speech Synthesis Using Generative Adversarial Networks. IEEE Trans. Cybern. 53(6): 3454-3466 (2023)
- [c20] Jeff Hwang, Moto Hira, Caroline Chen, Xiaohui Zhang, Zhaoheng Ni, Guangzhi Sun, Pingchuan Ma, Ruizhe Huang, Vineel Pratap, Yuekai Zhang, Anurag Kumar, Chin-Yun Yu, Chuang Zhu, Chunxi Liu, Jacob Kahn, Mirco Ravanelli, Peng Sun, Shinji Watanabe, Yangyang Shi, Yumeng Tao:
TorchAudio 2.1: Advancing Speech Recognition, Self-Supervised Learning, and Audio Processing Components for Pytorch. ASRU 2023: 1-9
- [c19] Xubo Liu, Egor Lakomkin, Konstantinos Vougioukas, Pingchuan Ma, Honglie Chen, Ruiming Xie, Morrie Doulaty, Niko Moritz, Jáchym Kolár, Stavros Petridis, Maja Pantic, Christian Fuegen:
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision. CVPR 2023: 18806-18815
- [c18] Pingchuan Ma, Alexandros Haliassos, Adriana Fernandez-Lopez, Honglie Chen, Stavros Petridis, Maja Pantic:
Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels. ICASSP 2023: 1-5
- [c17] Andreas Zinonos, Alexandros Haliassos, Pingchuan Ma, Stavros Petridis, Maja Pantic:
Learning Cross-Lingual Visual Speech Representations. ICASSP 2023: 1-5
- [c16] Alexandros Haliassos, Pingchuan Ma, Rodrigo Mira, Stavros Petridis, Maja Pantic:
Jointly Learning Visual and Auditory Speech Representations from Raw Data. ICLR 2023
- [c15] Pingchuan Ma, Niko Moritz, Stavros Petridis, Christian Fuegen, Maja Pantic:
Streaming Audio-Visual Speech Recognition with Alignment Regularization. INTERSPEECH 2023: 1598-1602
- [c14] Adriana Fernandez-Lopez, Honglie Chen, Pingchuan Ma, Alexandros Haliassos, Stavros Petridis, Maja Pantic:
SparseVSR: Lightweight and Noise Robust Visual Speech Recognition. INTERSPEECH 2023: 1603-1607
- [i25] Andreas Zinonos, Alexandros Haliassos, Pingchuan Ma, Stavros Petridis, Maja Pantic:
Learning Cross-lingual Visual Speech Representations. CoRR abs/2303.09455 (2023)
- [i24] Pingchuan Ma, Alexandros Haliassos, Adriana Fernandez-Lopez, Honglie Chen, Stavros Petridis, Maja Pantic:
Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels. CoRR abs/2303.14307 (2023)
- [i23] Xubo Liu, Egor Lakomkin, Konstantinos Vougioukas, Pingchuan Ma, Honglie Chen, Ruiming Xie, Morrie Doulaty, Niko Moritz, Jáchym Kolár, Stavros Petridis, Maja Pantic, Christian Fuegen:
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision. CoRR abs/2303.17200 (2023)
- [i22] Yujiang Wang, Anshul Thakur, Mingzhi Dong, Pingchuan Ma, Stavros Petridis, Li Shang, Tingting Zhu, David A. Clifton:
Is dataset condensation a silver bullet for healthcare data sharing? CoRR abs/2305.03711 (2023)
- [i21] Adriana Fernandez-Lopez, Honglie Chen, Pingchuan Ma, Alexandros Haliassos, Stavros Petridis, Maja Pantic:
SparseVSR: Lightweight and Noise Robust Visual Speech Recognition. CoRR abs/2307.04552 (2023)
- [i20] Jeff Hwang, Moto Hira, Caroline Chen, Xiaohui Zhang, Zhaoheng Ni, Guangzhi Sun, Pingchuan Ma, Ruizhe Huang, Vineel Pratap, Yuekai Zhang, Anurag Kumar, Chin-Yun Yu, Chuang Zhu, Chunxi Liu, Jacob Kahn, Mirco Ravanelli, Peng Sun, Shinji Watanabe, Yangyang Shi, Yumeng Tao, Robin Scheibler, Samuele Cornell, Sean Kim, Stavros Petridis:
TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch. CoRR abs/2310.17864 (2023)
- 2022
- [j2] Pingchuan Ma, Stavros Petridis, Maja Pantic:
Visual speech recognition for multiple languages in the wild. Nat. Mac. Intell. 4(11): 930-939 (2022)
- [c13] Pingchuan Ma, Yujiang Wang, Stavros Petridis, Jie Shen, Maja Pantic:
Training Strategies for Improved Lip-Reading. ICASSP 2022: 8472-8476
- [i19] Pingchuan Ma, Stavros Petridis, Maja Pantic:
Visual Speech Recognition for Multiple Languages in the Wild. CoRR abs/2202.13084 (2022)
- [i18] Yujiang Wang, Mingzhi Dong, Jie Shen, Yiming Luo, Yiming Lin, Pingchuan Ma, Stavros Petridis, Maja Pantic:
Self-supervised Video-centralised Transformer for Video Face Clustering. CoRR abs/2203.13166 (2022)
- [i17] Pingchuan Ma, Yujiang Wang, Stavros Petridis, Jie Shen, Maja Pantic:
Training Strategies for Improved Lip-reading. CoRR abs/2209.01383 (2022)
- [i16] Pingchuan Ma, Niko Moritz, Stavros Petridis, Christian Fuegen, Maja Pantic:
Streaming Audio-Visual Speech Recognition with Alignment Regularization. CoRR abs/2211.02133 (2022)
- [i15] Alexandros Haliassos, Pingchuan Ma, Rodrigo Mira, Stavros Petridis, Maja Pantic:
Jointly Learning Visual and Auditory Speech Representations from Raw Data. CoRR abs/2212.06246 (2022)
- 2021
- [c12] Pingchuan Ma, Stavros Petridis, Maja Pantic:
Detecting Adversarial Attacks on Audiovisual Speech Recognition. ICASSP 2021: 6403-6407
- [c11] Pingchuan Ma, Brais Martínez, Stavros Petridis, Maja Pantic:
Towards Practical Lipreading with Distilled and Efficient Models. ICASSP 2021: 7608-7612
- [c10] Pingchuan Ma, Stavros Petridis, Maja Pantic:
End-To-End Audio-Visual Speech Recognition with Conformers. ICASSP 2021: 7613-7617
- [c9] Pingchuan Ma, Rodrigo Mira, Stavros Petridis, Björn W. Schuller, Maja Pantic:
LiRA: Learning Visual Speech Representations from Audio Through Self-Supervision. Interspeech 2021: 3011-3015
- [c8] Pingchuan Ma, Yujiang Wang, Jie Shen, Stavros Petridis, Maja Pantic:
Lip-reading with Densely Connected Temporal Convolutional Networks. WACV 2021: 2856-2865
- [i14] Pingchuan Ma, Stavros Petridis, Maja Pantic:
End-to-end Audio-visual Speech Recognition with Conformers. CoRR abs/2102.06657 (2021)
- [i13] Rodrigo Mira, Konstantinos Vougioukas, Pingchuan Ma, Stavros Petridis, Björn W. Schuller, Maja Pantic:
End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks. CoRR abs/2104.13332 (2021)
- [i12] Pingchuan Ma, Rodrigo Mira, Stavros Petridis, Björn W. Schuller, Maja Pantic:
LiRA: Learning Visual Speech Representations from Audio through Self-supervision. CoRR abs/2106.09171 (2021)
- 2020
- [j1] Stavros Petridis, Yujiang Wang, Pingchuan Ma, Zuwei Li, Maja Pantic:
End-to-end visual speech recognition for small-scale datasets. Pattern Recognit. Lett. 131: 421-427 (2020)
- [c7] Shiyang Cheng, Pingchuan Ma, Georgios Tzimiropoulos, Stavros Petridis, Adrian Bulat, Jie Shen, Maja Pantic:
Towards Pose-Invariant Lip-Reading. ICASSP 2020: 4357-4361
- [c6] Abhinav Shukla, Konstantinos Vougioukas, Pingchuan Ma, Stavros Petridis, Maja Pantic:
Visually Guided Self Supervised Learning of Speech Representations. ICASSP 2020: 6299-6303
- [c5] Brais Martínez, Pingchuan Ma, Stavros Petridis, Maja Pantic:
Lipreading Using Temporal Convolutional Networks. ICASSP 2020: 6319-6323
- [i11] Abhinav Shukla, Konstantinos Vougioukas, Pingchuan Ma, Stavros Petridis, Maja Pantic:
Visually Guided Self Supervised Learning of Speech Representations. CoRR abs/2001.04316 (2020)
- [i10] Brais Martínez, Pingchuan Ma, Stavros Petridis, Maja Pantic:
Lipreading using Temporal Convolutional Networks. CoRR abs/2001.08702 (2020)
- [i9] Pingchuan Ma, Brais Martínez, Stavros Petridis, Maja Pantic:
Towards practical lipreading with distilled and efficient models. CoRR abs/2007.06504 (2020)
- [i8] Pingchuan Ma, Yujiang Wang, Jie Shen, Stavros Petridis, Maja Pantic:
Lip-reading with Densely Connected Temporal Convolutional Networks. CoRR abs/2009.14233 (2020)
2010 – 2019
- 2019
- [c4] Pingchuan Ma, Stavros Petridis, Maja Pantic:
Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition. INTERSPEECH 2019: 4090-4094
- [c3] Konstantinos Vougioukas, Pingchuan Ma, Stavros Petridis, Maja Pantic:
Video-Driven Speech Reconstruction Using Generative Adversarial Networks. INTERSPEECH 2019: 4125-4129
- [i7] Stavros Petridis, Yujiang Wang, Pingchuan Ma, Zuwei Li, Maja Pantic:
End-to-End Visual Speech Recognition for Small-Scale Datasets. CoRR abs/1904.01954 (2019)
- [i6] Pingchuan Ma, Stavros Petridis, Maja Pantic:
Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition. CoRR abs/1906.02112 (2019)
- [i5] Konstantinos Vougioukas, Pingchuan Ma, Stavros Petridis, Maja Pantic:
Video-Driven Speech Reconstruction using Generative Adversarial Networks. CoRR abs/1906.06301 (2019)
- [i4] Shiyang Cheng, Pingchuan Ma, Georgios Tzimiropoulos, Stavros Petridis, Adrian Bulat, Jie Shen, Maja Pantic:
Towards Pose-invariant Lip-Reading. CoRR abs/1911.06095 (2019)
- [i3] Pingchuan Ma, Stavros Petridis, Maja Pantic:
Detecting Adversarial Attacks On Audio-Visual Speech Recognition. CoRR abs/1912.08639 (2019)
- 2018
- [c2] Stavros Petridis, Themos Stafylakis, Pingchuan Ma, Feipeng Cai, Georgios Tzimiropoulos, Maja Pantic:
End-to-End Audiovisual Speech Recognition. ICASSP 2018: 6548-6552
- [c1] Stavros Petridis, Themos Stafylakis, Pingchuan Ma, Georgios Tzimiropoulos, Maja Pantic:
Audio-Visual Speech Recognition with a Hybrid CTC/Attention Architecture. SLT 2018: 513-520
- [i2] Stavros Petridis, Themos Stafylakis, Pingchuan Ma, Feipeng Cai, Georgios Tzimiropoulos, Maja Pantic:
End-to-end Audiovisual Speech Recognition. CoRR abs/1802.06424 (2018)
- [i1] Stavros Petridis, Themos Stafylakis, Pingchuan Ma, Georgios Tzimiropoulos, Maja Pantic:
Audio-Visual Speech Recognition With A Hybrid CTC/Attention Architecture. CoRR abs/1810.00108 (2018)
last updated on 2024-10-18 19:31 CEST by the dblp team
all metadata released as open data under CC0 1.0 license