default search action
Takafumi Koshinaka
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [i11]Honori Udo, Takafumi Koshinaka:
Reading Is Believing: Revisiting Language Bottleneck Models for Image Classification. CoRR abs/2406.15816 (2024) - 2023
- [j5]Qiongqiong Wang, Koji Okabe, Kong Aik Lee, Takafumi Koshinaka:
Generalized Domain Adaptation Framework for Parametric Back-End in Speaker Recognition. IEEE Trans. Inf. Forensics Secur. 18: 3936-3947 (2023) - [i10]Honori Udo, Takafumi Koshinaka:
Image Captioners Sometimes Tell More Than Images They See. CoRR abs/2305.02932 (2023) - 2021
- [j4]Kong Aik Lee, Qiongqiong Wang, Takafumi Koshinaka:
Xi-Vector Embedding for Speaker Recognition. IEEE Signal Process. Lett. 28: 1385-1389 (2021) - [c25]Qiongqiong Wang, Kong Aik Lee, Takafumi Koshinaka, Koji Okabe, Hitoshi Yamamoto:
Task-aware Warping Factors in Mask-based Speech Enhancement. EUSIPCO 2021: 476-480 - [i9]Kong Aik Lee, Qiongqiong Wang, Takafumi Koshinaka:
Xi-Vector Embedding for Speaker Recognition. CoRR abs/2108.05679 (2021) - [i8]Qiongqiong Wang, Kong Aik Lee, Takafumi Koshinaka, Koji Okabe, Hitoshi Yamamoto:
Task-aware Warping Factors in Mask-based Speech Enhancement. CoRR abs/2108.12128 (2021) - 2020
- [j3]Kong Aik Lee, Hitoshi Yamamoto, Koji Okabe, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda:
NEC-TT System for Mixed-Bandwidth and Multi-Domain Speaker Recognition. Comput. Speech Lang. 61: 101033 (2020) - [c24]Qiongqiong Wang, Koji Okabe, Kong Aik Lee, Takafumi Koshinaka:
A Generalized Framework for Domain Adaptation of PLDA in Speaker Recognition. ICASSP 2020: 6619-6623 - [c23]Kong Aik Lee, Koji Okabe, Hitoshi Yamamoto, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Keisuke Ishikawa, Koichi Shinoda:
NEC-TT Speaker Verification System for SRE'19 CTS Challenge. INTERSPEECH 2020: 2227-2231 - [c22]Qiongqiong Wang, Kong Aik Lee, Takafumi Koshinaka:
Using Multi-Resolution Feature Maps with Convolutional Neural Networks for Anti-Spoofing in ASV. Odyssey 2020: 138-142 - [e1]Kong-Aik Lee, Takafumi Koshinaka, Koichi Shinoda:
Odyssey 2020: The Speaker and Language Recognition Workshop, 1-5 November 2020, Tokyo, Japan. ISCA 2020 [contents] - [i7]Qiongqiong Wang, Koji Okabe, Kong Aik Lee, Takafumi Koshinaka:
A Generalized Framework for Domain Adaptation of PLDA in Speaker Recognition. CoRR abs/2008.08815 (2020) - [i6]Qiongqiong Wang, Kong Aik Lee, Takafumi Koshinaka:
Using Multi-Resolution Feature Maps with Convolutional Neural Networks for Anti-Spoofing in ASV. CoRR abs/2008.08865 (2020)
2010 – 2019
- 2019
- [c21]Kong Aik Lee, Qiongqiong Wang, Takafumi Koshinaka:
The CORAL+ Algorithm for Unsupervised Domain Adaptation of PLDA. ICASSP 2019: 5821-5825 - [c20]Ville Vestman, Kong Aik Lee, Tomi H. Kinnunen, Takafumi Koshinaka:
Unleashing the Unused Potential of i-Vectors Enabled by GPU Acceleration. INTERSPEECH 2019: 351-355 - [c19]Hitoshi Yamamoto, Kong Aik Lee, Koji Okabe, Takafumi Koshinaka:
Speaker Augmentation and Bandwidth Extension for Deep Speaker Embedding. INTERSPEECH 2019: 406-410 - [c18]Kong Aik Lee, Hitoshi Yamamoto, Koji Okabe, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda:
The NEC-TT 2018 Speaker Verification System. INTERSPEECH 2019: 4355-4359 - [i5]Kong Aik Lee, Ville Hautamäki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Héctor Delgado, Jose Patino, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda, Trung Ngo Trong, Md. Sahidullah, Fan Lu, Yun Tang, Ming Tu, Kah Kuan Teh, Tran Huy Dat, Kuruvachan K. George, Ivan Kukanov, Florent Desnous, Jichen Yang, Emre Yilmaz, Longting Xu, Jean-François Bonastre, Chenglin Xu, Zhi Hao Lim, Eng Siong Chng, Shivesh Ranjan, John H. L. Hansen, Massimiliano Todisco, Nicholas W. D. Evans:
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences. CoRR abs/1904.07386 (2019) - [i4]Ville Vestman, Kong Aik Lee, Tomi H. Kinnunen, Takafumi Koshinaka:
Unleashing the Unused Potential of I-Vectors Enabled by GPU Acceleration. CoRR abs/1906.08556 (2019) - 2018
- [c17]Shivangi Mahto, Takayuki Arakawa, Takafumi Koshinaka:
Ear Acoustic Biometrics Using Inaudible Signals and Its Application to Continuous User Authentication. EUSIPCO 2018: 1407-1411 - [c16]Subhadeep Dey, Takafumi Koshinaka, Petr Motlícek, Srikanth R. Madikeri:
DNN Based Speaker Embedding Using Content Information for Text-Dependent Speaker Verification. ICASSP 2018: 5344-5348 - [c15]Koji Okabe, Takafumi Koshinaka, Koichi Shinoda:
Attentive Statistics Pooling for Deep Speaker Embedding. INTERSPEECH 2018: 2252-2256 - [c14]Qiongqiong Wang, Koji Okabe, Kong Aik Lee, Hitoshi Yamamoto, Takafumi Koshinaka:
Attention Mechanism in Speaker Recognition: What Does it Learn in Deep Speaker Embedding? SLT 2018: 1052-1059 - [i3]Koji Okabe, Takafumi Koshinaka, Koichi Shinoda:
Attentive Statistics Pooling for Deep Speaker Embedding. CoRR abs/1803.10963 (2018) - [i2]Qiongqiong Wang, Koji Okabe, Kong Aik Lee, Hitoshi Yamamoto, Takafumi Koshinaka:
Attention Mechanism in Speaker Recognition: What Does It Learn in Deep Speaker Embedding? CoRR abs/1809.09311 (2018) - [i1]Kong Aik Lee, Qiongqiong Wang, Takafumi Koshinaka:
The CORAL+ Algorithm for Unsupervised Domain Adaptation of PLDA. CoRR abs/1812.10260 (2018) - 2017
- [c13]Hitoshi Yamamoto, Koji Okabe, Takafumi Koshinaka:
Robust i-vector extraction tightly coupled with voice activity detection using deep neural networks. APSIPA 2017: 600-604 - [c12]Shivangi Mahto, Hitoshi Yamamoto, Takafumi Koshinaka:
i-Vector Transformation Using a Novel Discriminative Denoising Autoencoder for Noise-Robust Speaker Recognition. INTERSPEECH 2017: 3722-3726 - [c11]Qiongqiong Wang, Takafumi Koshinaka:
Unsupervised Discriminative Training of PLDA for Domain Adaptation in Speaker Verification. INTERSPEECH 2017: 3727-3731 - 2016
- [c10]Takayuki Arakawa, Takafumi Koshinaka, Shohei Yano, Hideki Irisawa, Ryoji Miyahara, Hitoshi Imaoka:
Fast and accurate personal authentication using ear acoustics. APSIPA 2016: 1-4 - [c9]Qiongqiong Wang, Hitoshi Yamamoto, Takafumi Koshinaka:
Domain adaptation using maximum likelihood linear transformation for PLDA-based speaker verification. ICASSP 2016: 5110-5114 - 2015
- [c8]Hitoshi Yamamoto, Takafumi Koshinaka:
Denoising autoencoder-based speaker feature restoration for utterances of short duration. INTERSPEECH 2015: 1052-1056 - 2013
- [c7]Yumi Ono, Yoshifumi Onishi, Takafumi Koshinaka, Soichiro Takata, Osamu Hoshuyama:
Anomaly detection of motors with feature emphasis using only normal sounds. ICASSP 2013: 2800-2804 - 2012
- [j2]Takafumi Koshinaka, Kentaro Nagatomo, Koichi Shinoda:
Online Speaker Clustering Using Incremental Learning of an Ergodic Hidden Markov Model. IEICE Trans. Inf. Syst. 95-D(10): 2469-2478 (2012) - [c6]Shuji Komeiji, Takayuki Arakawa, Takafumi Koshinaka:
A noise-robust speech recognition method composed of weak noise suppression and weak Vector Taylor Series Adaptation. SLT 2012: 103-106 - 2011
- [j1]Yuzo Hamanaka, Koichi Shinoda, Takuya Tsutaoka, Sadaoki Furui, Tadashi Emori, Takafumi Koshinaka:
Committee-Based Active Learning for Speech Recognition. IEICE Trans. Inf. Syst. 94-D(10): 2015-2023 (2011) - 2010
- [c5]Yuzo Hamanaka, Koichi Shinoda, Sadaoki Furui, Tadashi Emori, Takafumi Koshinaka:
Speech modeling based on committee-based active learning. ICASSP 2010: 4350-4353
2000 – 2009
- 2009
- [c4]Takafumi Koshinaka, Kentaro Nagatomo, Koichi Shinoda:
Online speaker clustering using incremental learning of an ergodic hidden Markov model. ICASSP 2009: 4093-4096 - 2008
- [c3]Makoto Terao, Takafumi Koshinaka, Shinichi Ando, Ryosuke Isotani, Akitoshi Okumura:
Open-vocabulary spoken-document retrieval based on query expansion using related web documents. INTERSPEECH 2008: 2171-2174 - 2005
- [c2]Takafumi Koshinaka, Ken-ichi Iso, Akitoshi Okumura:
An HMM-based Text Segmentation Method Using Variational Bayes Approach and Its Application to LVCSR for Broadcast News. ICASSP (1) 2005: 485-488 - 2001
- [c1]Takafumi Koshinaka, Daisuke Nishiwaki, Keiji Yamada:
A Stochastic Model for Handwritten Word Recognition Using Context Dependency Between Character Patterns. ICDAR 2001: 154-158
Coauthor Index
aka: Kong Aik Lee
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-07-31 20:46 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint