Yerbolat Khassanov
Books and Theses
- 2020
- [b1]Yerbolat Khassanov:
Language model domain adaptation for automatic speech recognition systems. Nanyang Technological University, Singapore, 2020
Journal Articles
- 2021
- [j1]Madina Abdrakhmanova, Askat Kuzdeuov, Sheikh Jarju, Yerbolat Khassanov, Michael Lewis, Huseyin Atakan Varol:
SpeakingFaces: A Large-Scale Multimodal Dataset of Voice Commands with Visual and Thermal Video Streams. Sensors 21(10): 3465 (2021)
Conference and Workshop Papers
- 2024
- [c26]Yerbolat Khassanov, Zhipeng Chen, Tianfeng Chen, Tze Yuang Chong, Wei Li, Lu Lu, Zejun Ma:
Extending Multilingual ASR to New Languages Using Supplementary Encoder and Decoder Components. ICASSP 2024: 10586-10590
- 2023
- [c25]Yist Y. Lin, Tao Han, Haihua Xu, Van Tung Pham, Yerbolat Khassanov, Tze Yuang Chong, Yi He, Lu Lu, Zejun Ma:
Random Utterance Concatenation Based Data Augmentation for Improving Short-video Speech Recognition. INTERSPEECH 2023: 904-908
- [c24]Zhipeng Chen, Haihua Xu, Yerbolat Khassanov, Yi He, Lu Lu, Zejun Ma, Ji Wu:
Knowledge Distillation Approach for Efficient Internal Language Model Estimation. INTERSPEECH 2023: 1339-1343
- [c23]Rustem Yeshpanov, Saida Mussakhojayeva, Yerbolat Khassanov:
Multilingual Text-to-Speech Synthesis for Turkic Languages Using Transliteration. INTERSPEECH 2023: 5521-5525
- 2022
- [c22]Saida Mussakhojayeva, Yerbolat Khassanov, Huseyin Atakan Varol:
KSC2: An Industrial-Scale Open-Source Kazakh Speech Corpus. INTERSPEECH 2022: 1367-1371
- [c21]Rustem Yeshpanov, Yerbolat Khassanov, Huseyin Atakan Varol:
KazNERD: Kazakh Named Entity Recognition Dataset. LREC 2022: 417-426
- [c20]Saida Mussakhojayeva, Yerbolat Khassanov, Huseyin Atakan Varol:
KazakhTTS2: Extending the Open-Source Kazakh TTS Corpus With More Data, Speakers, and Topics. LREC 2022: 5404-5411
- [c19]Madina Abdrakhmanova, Saniya Abushakimova, Yerbolat Khassanov, Huseyin Atakan Varol:
A Study of Multimodal Person Verification Using Audio-Visual-Thermal Data. Odyssey 2022: 233-239
- [c18]Mukhamet Nurpeiissov, Askat Kuzdeuov, Aslan Assylkhanov, Yerbolat Khassanov, Huseyin Atakan Varol:
End-to-End Sequential Indoor Localization Using Smartphone Inertial Sensors and WiFi. SII 2022: 566-571
- 2021
- [c17]Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Aishan Wumaier, Eng Siong Chng:
Enriching Under-Represented Named Entities for Improved Speech Recognition. APSIPA ASC 2021: 1021-1025
- [c16]Yerbolat Khassanov, Saida Mussakhojayeva, Almas Mirzakhmetov, Alen Adiyev, Mukhamet Nurpeiissov, Huseyin Atakan Varol:
A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline. EACL 2021: 697-706
- [c15]Saida Mussakhojayeva, Aigerim Janaliyeva, Almas Mirzakhmetov, Yerbolat Khassanov, Huseyin Atakan Varol:
KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis Dataset. Interspeech 2021: 2786-2790
- [c14]Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng:
Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems. ISCSLP 2021: 1-5
- [c13]Zhiping Zeng, Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Eng Siong Chng, Chongjia Ni, Bin Ma:
Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning. ISCSLP 2021: 1-5
- [c12]Yerbolat Khassanov, Mukhamet Nurpeiissov, Azamat Sarkytbayev, Askat Kuzdeuov, Huseyin Atakan Varol:
Finer-level Sequential WiFi-based Indoor Localization. SII 2021: 163-169
- [c11]Muhammadjon Musaev, Saida Mussakhojayeva, Ilyos Khujayorov, Yerbolat Khassanov, Mannon Ochilov, Huseyin Atakan Varol:
USC: An Open-Source Uzbek Speech Corpus and Initial Speech Recognition Experiments. SPECOM 2021: 437-447
- [c10]Saida Mussakhojayeva, Yerbolat Khassanov, Huseyin Atakan Varol:
A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English. SPECOM 2021: 448-459
- 2020
- [c9]Togzhan Syrymova, Yerkebulan Massalim, Yerbolat Khassanov, Zhanat Kappassov:
Vibro-Tactile Foreign Body Detection in Granular Objects based on Squeeze-Induced Mechanical Vibrations. AIM 2020: 175-180
- [c8]Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma, Haizhou Li:
Independent Language Modeling Architecture for End-To-End ASR. ICASSP 2020: 7059-7063
- 2019
- [c7]Yerbolat Khassanov, Haihua Xu, Van Tung Pham, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma:
Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data. INTERSPEECH 2019: 2160-2164
- [c6]Zhiping Zeng, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Eng Siong Chng, Haizhou Li:
On the End-to-End Solution to Mandarin-English Code-Switching Speech Recognition. INTERSPEECH 2019: 2165-2169
- [c5]Yerbolat Khassanov, Zhiping Zeng, Van Tung Pham, Haihua Xu, Eng Siong Chng:
Enriching Rare Word Representations in Neural Language Models by Embedding Matrix Augmentation. INTERSPEECH 2019: 3505-3509
- 2018
- [c4]Yerbolat Khassanov, Eng Siong Chng:
Unsupervised and Efficient Vocabulary Expansion for Recurrent Neural Network Language Models in ASR. INTERSPEECH 2018: 3343-3347
- 2017
- [c3]Yerbolat Khassanov, Tze Yuang Chong, Benjamin Bigot, Eng Siong Chng:
Unsupervised Language Model Adaptation by Data Selection for Speech Recognition. ACIIDS (1) 2017: 508-517
- 2014
- [c2]Yerbolat Khassanov, Nursultan Imanberdiyev, Huseyin Atakan Varol:
Inertial motion capture based reference trajectory generation for a mobile manipulator. HRI 2014: 202-203
- [c1]Yerbolat Khassanov, Nursultan Imanberdiyev, Huseyin Atakan Varol:
Real-time gesture recognition for the high-level teleoperation interface of a mobile manipulator. HRI 2014: 204-205
Informal and Other Publications
- 2024
- [i19]Yerbolat Khassanov, Zhipeng Chen, Tianfeng Chen, Tze Yuang Chong, Wei Li, Jun Zhang, Lu Lu, Yuxuan Wang:
Dual-Pipeline with Low-Rank Adaptation for New Language Integration in Multilingual ASR. CoRR abs/2406.07842 (2024)
- 2023
- [i18]Rustem Yeshpanov, Saida Mussakhojayeva, Yerbolat Khassanov:
Multilingual Text-to-Speech Synthesis for Turkic Languages Using Transliteration. CoRR abs/2305.15749 (2023)
- 2022
- [i17]Saida Mussakhojayeva, Yerbolat Khassanov, Huseyin Atakan Varol:
KazakhTTS2: Extending the Open-Source Kazakh TTS Corpus With More Data, Speakers, and Topics. CoRR abs/2201.05771 (2022)
- [i16]Haihua Xu, Van Tung Pham, Yerbolat Khassanov, Yist Y. Lin, Tao Han, Tze Yuan Chong, Yi He, Zejun Ma:
Improving short-video speech recognition using random utterance concatenation. CoRR abs/2210.15876 (2022)
- 2021
- [i15]Saida Mussakhojayeva, Aigerim Janaliyeva, Almas Mirzakhmetov, Yerbolat Khassanov, Huseyin Atakan Varol:
KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis Dataset. CoRR abs/2104.08459 (2021)
- [i14]Muhammadjon Musaev, Saida Mussakhojayeva, Ilyos Khujayorov, Yerbolat Khassanov, Mannon Ochilov, Huseyin Atakan Varol:
USC: An Open-Source Uzbek Speech Corpus and Initial Speech Recognition Experiments. CoRR abs/2107.14419 (2021)
- [i13]Saida Mussakhojayeva, Yerbolat Khassanov, Huseyin Atakan Varol:
A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English. CoRR abs/2108.01280 (2021)
- [i12]Madina Abdrakhmanova, Saniya Abushakimova, Yerbolat Khassanov, Huseyin Atakan Varol:
A Study of Multimodal Person Verification Using Audio-Visual-Thermal Data. CoRR abs/2110.12136 (2021)
- [i11]Rustem Yeshpanov, Yerbolat Khassanov, Huseyin Atakan Varol:
KazNERD: Kazakh Named Entity Recognition Dataset. CoRR abs/2111.13419 (2021)
- 2020
- [i10]Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng:
Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems. CoRR abs/2005.08742 (2020)
- [i9]Zhiping Zeng, Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Eng Siong Chng, Chongjia Ni, Bin Ma:
Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning. CoRR abs/2005.10407 (2020)
- [i8]Yerbolat Khassanov, Saida Mussakhojayeva, Almas Mirzakhmetov, Alen Adiyev, Mukhamet Nurpeiissov, Huseyin Atakan Varol:
A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline. CoRR abs/2009.10334 (2020)
- [i7]Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Aishan Wumaier, Eng Siong Chng:
Enriching Under-Represented Named-Entities To Improve Speech Recognition Performance. CoRR abs/2010.12143 (2020)
- [i6]Madina Abdrakhmanova, Askat Kuzdeuov, Sheikh Jarju, Yerbolat Khassanov, Michael Lewis, Huseyin Atakan Varol:
SpeakingFaces: A Large-Scale Multimodal Dataset of Voice Commands with Visual and Thermal Video Streams. CoRR abs/2012.02961 (2020)
- 2019
- [i5]Yerbolat Khassanov, Zhiping Zeng, Van Tung Pham, Haihua Xu, Eng Siong Chng:
Enriching Rare Word Representations in Neural Language Models by Embedding Matrix Augmentation. CoRR abs/1904.03799 (2019)
- [i4]Yerbolat Khassanov, Haihua Xu, Van Tung Pham, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma:
Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data. CoRR abs/1904.03802 (2019)
- [i3]Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma, Haizhou Li:
Independent language modeling architecture for end-to-end ASR. CoRR abs/1912.00863 (2019)
- 2018
- [i2]Yerbolat Khassanov, Eng Siong Chng:
Unsupervised and Efficient Vocabulary Expansion for Recurrent Neural Network Language Models in ASR. CoRR abs/1806.10306 (2018)
- [i1]Zhiping Zeng, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Eng Siong Chng, Haizhou Li:
On the End-to-End Solution to Mandarin-English Code-switching Speech Recognition. CoRR abs/1811.00241 (2018)
last updated on 2024-08-07 21:32 CEST by the dblp team
all metadata released as open data under CC0 1.0 license