Yasuhiro Minami
Journal Articles
- 2017
- [j12] Toru Nakashika, Yasuhiro Minami:
Speaker-adaptive-trainable Boltzmann machine and its application to non-parallel voice conversion. EURASIP J. Audio Speech Music. Process. 2017: 16 (2017)
- 2016
- [j11] Toru Nakashika, Tetsuya Takiguchi, Yasuhiro Minami:
Non-Parallel Training in Voice Conversion Using an Adaptive Restricted Boltzmann Machine. IEEE ACM Trans. Audio Speech Lang. Process. 24(11): 2032-2045 (2016)
- 2014
- [j10] Kohji Dohsaka, Ryota Asai, Ryuichiro Higashinaka, Yasuhiro Minami, Eisaku Maeda:
Effects of Conversational Agents on Activation of Communication in Thought-Evoking Multi-Party Dialogues. IEICE Trans. Inf. Syst. 97-D(8): 2147-2156 (2014)
- 2013
- [j9] Toyomi Meguro, Yasuhiro Minami, Ryuichiro Higashinaka, Kohji Dohsaka:
Learning to control listening-oriented dialogue using partially observable markov decision processes. ACM Trans. Speech Lang. Process. 10(4): 15:1-15:20 (2013)
- 2007
- [j8] Takaaki Hori, Chiori Hori, Yasuhiro Minami, Atsushi Nakamura:
Efficient WFST-Based One-Pass Decoding With On-The-Fly Hypothesis Rescoring in Extremely Large Vocabulary Continuous Speech Recognition. IEEE Trans. Speech Audio Process. 15(4): 1352-1365 (2007)
- 2006
- [j7] Parham Zolfaghari, Hiroko Kato, Yasuhiro Minami, Atsushi Nakamura, Shigeru Katagiri, Roy D. Patterson:
Dynamic Assignment of Gaussian Components in Modelling Speech Spectra. J. VLSI Signal Process. 45(1-2): 7-19 (2006)
- 2005
- [j6] Shinji Watanabe, Yasuhiro Minami, Atsushi Nakamura, Naonori Ueda:
Selection of Shared-State Hidden Markov Model Structure Using Bayesian Criterion. IEICE Trans. Inf. Syst. 88-D(1): 1-9 (2005)
- 2004
- [j5] Shinji Watanabe, Yasuhiro Minami, Atsushi Nakamura, Naonori Ueda:
Variational bayesian estimation and clustering for speech recognition. IEEE Trans. Speech Audio Process. 12(4): 365-381 (2004)
- 1995
- [j4] Osamu Yoshioka, Yasuhiro Minami, Kiyohiro Shikano:
A Speech Dialogue System with Multimodal Interface for Telephone Directory Assistance. IEICE Trans. Inf. Syst. 78-D(6): 616-621 (1995)
- [j3] Satoshi Takahashi, Yasuhiro Minami, Kiyohiro Shikano:
An HMM State Duration Control Algorithm Applied to Large-Vocabulary Spontaneous Speech Recognition. IEICE Trans. Inf. Syst. 78-D(6): 648-653 (1995)
- 1994
- [j2] Yasuhiro Minami, Kiyohiro Shikano, Satoshi Takahashi, Tomokazu Yamada, Osamu Yoshioka, Sadaoki Furui:
Large-vocabulary continuous speech recognition algorithm applied to a multi-modal telephone directory assistance system. Speech Communication 15(3-4): 301-310 (1994)
- 1991
- [j1] Yasuhiro Minami, Hidefumi Sawai, Masanori Miyatake:
Large-vocabulary spoken word recognition using time-delay neural network phoneme spotting and predictive lr-parsing. Syst. Comput. Jpn. 22(1): 99-108 (1991)
Conference and Workshop Papers
- 2024
- [c61] Wen Shen Teo, Yasuhiro Minami:
CIF-RNNT: Streaming ASR Via Acoustic Word Embeddings with Continuous Integrate-and-Fire and RNN-Transducers. ICASSP 2024: 10561-10565
- 2022
- [c60] Keita Kobayashi, Kohei Koyama, Hiromi Narimatsu, Yasuhiro Minami:
Dataset Construction for Scientific-Document Writing Support by Extracting Related Work Section and Citations from PDF Papers. LREC 2022: 5673-5682
- 2018
- [c59] Yan Cao, Yasuhiro Minami, Yuko Okumura, Tessei Kobayashi:
Analyzing Vocabulary Commonality Index Using Large-scaled Database of Child Language Development. LREC 2018
- [c58] Yasuhiro Minami, Tessei Kobayashi, Yuko Okumura:
Infant Word Comprehension-to-Production Index Applied to Investigation of Noun Learning Predominance Using Cross-lingual CDI database. LREC 2018
- 2017
- [c57] Sotaro Takeshita, Ryuji Tamaki, Yasuhiro Minami, Takeru Kazama, Masato Nakamura:
Report for Japanese subtask for NTCIR-13 STC-2 from mnmlb. NTCIR 2017
- 2016
- [c56] Toru Nakashika, Yasuhiro Minami:
3WRBM-based speech factor modeling for arbitrary-source and non-parallel voice conversion. EUSIPCO 2016: 607-611
- [c55] Toru Nakashika, Yasuhiro Minami:
Speaker adaptive model based on Boltzmann machine for non-parallel training in voice conversion. ICASSP 2016: 5530-5534
- [c54] Toru Nakashika, Yasuhiro Minami:
Generative Acoustic-Phonemic-Speaker Model Based on Three-Way Restricted Boltzmann Machine. INTERSPEECH 2016: 1487-1491
- 2014
- [c53] Hiroaki Sugiyama, Toyomi Meguro, Ryuichiro Higashinaka, Yasuhiro Minami:
Large-scale Collection and Analysis of Personal Question-Answer Pairs for Conversational Agents. IVA 2014: 420-433
- [c52] Hiroaki Sugiyama, Toyomi Meguro, Ryuichiro Higashinaka, Yasuhiro Minami:
Open-domain utterance generation using phrase pairs based on dependency relations. SLT 2014: 60-65
- 2013
- [c51] Hiroaki Sugiyama, Toyomi Meguro, Ryuichiro Higashinaka, Yasuhiro Minami:
Open-domain Utterance Generation for Conversational Dialogue Systems using Web-scale Dependency Structures. SIGDIAL Conference 2013: 334-338
- 2012
- [c50] Hiroaki Sugiyama, Toyomi Meguro, Yasuhiro Minami:
Preference-learning based Inverse Reinforcement Learning for Dialog Control. INTERSPEECH 2012: 222-225
- 2011
- [c49] Yasuhiro Minami, Akira Mori, Toyomi Meguro, Ryuichiro Higashinaka, Kohji Dohsaka, Eisaku Maeda:
Dialogue Control by Pomdp Using Dialogue Data Statistics. IWSDS 2011: 163-186
- [c48] Toyomi Meguro, Yasuhiro Minami, Ryuichiro Higashinaka, Kohji Dohsaka:
Wizard of Oz evaluation of listening-oriented dialogue control using POMDP. ASRU 2011: 318-323
- [c47] Ryuichiro Higashinaka, Noriaki Kawamae, Kugatsu Sadamitsu, Yasuhiro Minami, Toyomi Meguro, Kohji Dohsaka, Hirohito Inagaki:
Building a conversational model from two-tweets. ASRU 2011: 330-335
- [c46] Hiroaki Sugiyama, Yasuhiro Minami:
Information provision-timing control for informational assistance robot. HRI 2011: 259-260
- [c45] Toyomi Meguro, Yasuhiro Minami, Ryuichiro Higashinaka, Kohji Dohsaka:
Evaluation of Listening-Oriented Dialogue Control Rules Based on the Analysis of HMMs. INTERSPEECH 2011: 809-812
- [c44] Ryuichiro Higashinaka, Noriaki Kawamae, Kugatsu Sadamitsu, Yasuhiro Minami, Toyomi Meguro, Kohji Dohsaka, Hirohito Inagaki:
Unsupervised Clustering of Utterances Using Non-Parametric Bayesian Methods. INTERSPEECH 2011: 2081-2084
- 2010
- [c43] Ryuichiro Higashinaka, Yasuhiro Minami, Hitoshi Nishikawa, Kohji Dohsaka, Toyomi Meguro, Satoshi Takahashi, Gen-ichiro Kikui:
Learning to Model Domain-Specific Utterance Sequences for Extractive Summarization of Contact Center Dialogues. COLING (Posters) 2010: 400-408
- [c42] Toyomi Meguro, Ryuichiro Higashinaka, Yasuhiro Minami, Kohji Dohsaka:
Controlling Listening-oriented Dialogue using Partially Observable Markov Decision Processes. COLING 2010: 761-769
- [c41] Kazuo Aoyama, Shinji Watanabe, Hiroshi Sawada, Yasuhiro Minami, Naonori Ueda, Kazumi Saito:
Fast similarity search on a large speech data set with neighborhood graph indexing. ICASSP 2010: 5358-5361
- [c40] Ryuichiro Higashinaka, Yasuhiro Minami, Kohji Dohsaka, Toyomi Meguro:
Issues in Predicting User Satisfaction Transitions in Dialogues: Individual Differences, Evaluation Criteria, and Prediction Models. IWSDS 2010: 48-60
- [c39] Ryuichiro Higashinaka, Yasuhiro Minami, Kohji Dohsaka, Toyomi Meguro:
Modeling User Satisfaction Transitions in Dialogues from Overall Ratings. SIGDIAL Conference 2010: 18-27
- [c38] Kohji Dohsaka, Atsushi Kanemoto, Ryuichiro Higashinaka, Yasuhiro Minami, Eisaku Maeda:
User-adaptive Coordination of Agent Communicative Behavior in Spoken Dialogue. SIGDIAL Conference 2010: 314-321
- [c37] Ryuichiro Higashinaka, Yasuhiro Minami, Hitoshi Nishikawa, Kohji Dohsaka, Toyomi Meguro, Satoshi Kobashikawa, Hirokazu Masataki, Osamu Yoshioka, Satoshi Takahashi, Gen-ichiro Kikui:
Improving hmm-based extractive summarization for multi-domain contact center dialogues. SLT 2010: 61-66
- [c36] Yasuhiro Minami, Ryuichiro Higashinaka, Kohji Dohsaka, Toyomi Meguro, Eisaku Maeda:
Trigram dialogue control using POMDPs. SLT 2010: 336-341
- 2009
- [c35] Toyomi Meguro, Ryuichiro Higashinaka, Kohji Dohsaka, Yasuhiro Minami, Hideki Isozaki:
Analysis of Listening-Oriented Dialogue for Building Listening Agents. SIGDIAL Conference 2009: 124-127
- [c34] Kohji Dohsaka, Ryota Asai, Ryuichiro Higashinaka, Yasuhiro Minami, Eisaku Maeda:
Effects of Conversational Agents on Human Communication in Thought-Evoking Multi-Party Dialogues. SIGDIAL Conference 2009: 217-224
- 2008
- [c33] Minako Sawaki, Yasuhiro Minami, Ryuichiro Higashinaka, Kohji Dohsaka, Eisaku Maeda:
"Who is this" quiz dialogue system and users' evaluation. SLT 2008: 149-152
- 2007
- [c32] Yasuhiro Minami:
Mixture Gaussian HMM-trajectory method using likelihood compensation. ASRU 2007: 296-299
- [c31] Yasuhiro Minami, Minako Sawaki, Kohji Dohsaka, Ryuichiro Higashinaka, Kentaro Ishizuka, Hideki Isozaki, Tatsushi Matsubayashi, Masato Miyoshi, Atsushi Nakamura, Takanobu Oba, Hiroshi Sawada, Takeshi Yamada, Eisaku Maeda:
The world of mushrooms: human-computer interaction prototype systems for ambient intelligence. ICMI 2007: 366-373
- 2004
- [c30] Takaaki Hori, Chiori Hori, Yasuhiro Minami:
Fast on-the-fly composition for weighted finite-state transducers in 1.8 million-word vocabulary continuous speech recognition. INTERSPEECH 2004: 289-292
- [c29] Yasuhiro Minami, Erik McDermott, Atsushi Nakamura, Shigeru Katagiri:
A theoretical analysis of speech recognition based on feature trajectory models. INTERSPEECH 2004: 549-552
- [c28] Kentaro Ishizuka, Noboru Miyazaki, Tomohiro Nakatani, Yasuhiro Minami:
Improvement in robustness of speech feature extraction method using sub-band based periodicity and aperiodicity decomposition. INTERSPEECH 2004: 937-940
- 2003
- [c27] Yasuhiro Minami, Erik McDermott, Atsushi Nakamura, Shigeru Katagiri:
Recognition method with parametric trajectory generated from mixture distribution HMMs. ICASSP (1) 2003: 124-127
- [c26] Takaaki Hori, Daniel Willett, Yasuhiro Minami:
Language model adaptation using WFST-based speaking-style translation. ICASSP (1) 2003: 228-231
- [c25] Daniel Willett, Thomas Niesler, Erik McDermott, Yasuhiro Minami, Shigeru Katagiri:
Pervasive unsupervised adaptation for lecture speech transcription. ICASSP (1) 2003: 292-295
- [c24] Shinji Watanabe, Yasuhiro Minami, Atsushi Nakamura, Naonori Ueda:
Application of variational Bayesian estimation and clustering to acoustic model adaptation. ICASSP (1) 2003: 568-571
- [c23] Takaaki Hori, Chiori Hori, Yasuhiro Minami:
Speech summarization using weighted finite-state transducers. INTERSPEECH 2003: 2817-2820
- 2002
- [c22] Yasuhiro Minami, Erik McDermott, Atsushi Nakamura, Shigeru Katagiri:
A recognition method with parametric trajectory synthesized using direct relations between static and dynamic feature vector time series. ICASSP 2002: 957-960
- [c21] Toshio Irino, Yasuhiro Minami, Tomohiro Nakatani, Minoru Tsuzaki, H. Tagawa:
Evaluation of a speech recognition / generation method based on HMM and straight. INTERSPEECH 2002: 2545-2548
- [c20] Shinji Watanabe, Yasuhiro Minami, Atsushi Nakamura, Naonori Ueda:
Constructing shared-state hidden Markov models based on a Bayesian approach. INTERSPEECH 2002: 2669-2672
- [c19] Shinji Watanabe, Yasuhiro Minami, Atsushi Nakamura, Naonori Ueda:
Application of Variational Bayesian Approach to Speech Recognition. NIPS 2002: 1237-1244
- 2001
- [c18] Daniel Willett, Erik McDermott, Yasuhiro Minami, Shigeru Katagiri:
Time and memory efficient viterbi decoding for LVCSR using a precompiled search network. INTERSPEECH 2001: 847-850
- [c17] Mikio Nakano, Yasuhiro Minami, Stephanie Seneff, Timothy J. Hazen, D. Scott Cyphers, James R. Glass, Joseph Polifroni, Victor Zue:
Mokusei: a telephone-based Japanese conversational system in the weather domain. INTERSPEECH 2001: 1331-1334
- 1998
- [c16] Franck Giron, Yasuhiro Minami, Masashi Tanaka, Ken'ichi Furuya:
Compensation of speaker directivity in speech recognition using HMM composition. ICASSP 1998: 253-256
- 1997
- [c15] Ken Hanazawa, Yasuhiro Minami, Sadaoki Furui:
An efficient search method for large-vocabulary continuous-speech recognition. ICASSP 1997: 1787-1790
- [c14] Etienne Bauche, Bojana Gajic, Yasuhiro Minami, Tatsuo Matsuoka, Sadaoki Furui:
Connected digit recognition in spontaneous speech. EUROSPEECH 1997: 923-926
- 1996
- [c13] Yasuhiro Minami, Sadaoki Furui:
Adaptation method based on HMM composition and EM algorithm. ICASSP 1996: 327-330
- [c12] Yasuhiro Minami, Sadaoki Furui:
Improved extended HMM composition by incorporating power variance. ICSLP 1996: 1109-1112
- 1995
- [c11] Yasuhiro Minami, Sadaoki Furui:
A maximum likelihood procedure for a universal adaptation method based on HMM composition. ICASSP 1995: 129-132
- 1994
- [c10] Yasuhiro Minami, Kiyohiro Shikano, Satoshi Takahashi, Tomokazu Yamada:
Search algorithm that merges candidates in meaning level for very large vocabulary spontaneous speech recognition. ICASSP (2) 1994: 141-144
- [c9] Satoshi Takahashi, Yasuhiro Minami, Kiyohiro Shikano:
An HMM duration control algorithm with a low computational cost. ICSLP 1994: 267-270
- [c8] Osamu Yoshioka, Yasuhiro Minami, Kiyohiro Shikano:
A multi-modal dialogue system for telephone directory assistance. ICSLP 1994: 887-890
- [c7] Yasuhiro Minami, Kiyohiro Shikano, Osamu Yoshioka, Satoshi Takahashi, Tomokazu Yamada, Sadaoki Furui:
A Large-Vocabulary Continuous Speech Recognition Algorithm and its Application to a Multi-Modal Telephone Directory Assistance System. HLT 1994
- 1993
- [c6] Satoshi Takahashi, Tatsuo Matsuoka, Yasuhiro Minami, Kiyohiro Shikano:
Phoneme HMMs constrained by frame correlations. ICASSP (2) 1993: 219-222
- [c5] Franck Martin, Kiyohiro Shikano, Yasuhiro Minami:
Recognition of noisy speech by composition of hidden Markov models. EUROSPEECH 1993: 1031-1034
- [c4] Yasuhiro Minami, Kiyohiro Shikano, Tomokazu Yamada, Tatsuo Matsuoka:
Very-large-vocabulary continuous speech recognition algorithm for telephone directory assistance. EUROSPEECH 1993: 2129-2132
- 1992
- [c3] Yasuhiro Minami, Tatsuo Matsuoka, Kiyohiro Shikano:
Phoneme HMM evaluation algorithm without phoneme labeling. ICSLP 1992: 1535-1538
- 1990
- [c2] Masanori Miyatake, Hidefumi Sawai, Yasuhiro Minami, Kiyohiro Shikano:
Integrated training for spotting Japanese phonemes using large phonemic time-delay neural networks. ICASSP 1990: 449-452
- [c1] Yasuhiro Minami, Toshiyuki Hanazawa, Hitoshi Iwamida, Erik McDermott, Kiyohiro Shikano, Shigeru Katagiri, Masaona Kagawa:
On the robustness of HMM and ANN speech recognition algorithms. ICSLP 1990: 1345-1348
last updated on 2024-08-07 21:30 CEST by the dblp team
all metadata released as open data under CC0 1.0 license