default search action
Satoshi Tamura
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Journal Articles
- 2021
- [j7]Shinnosuke Isobe, Satoshi Tamura, Satoru Hayamizu, Yuuto Gotoh, Masaki Nose:
Multi-Angle Lipreading with Angle Classification-Based Feature Extraction and Its Application to Audio-Visual Speech Recognition. Future Internet 13(7): 182 (2021) - 2019
- [j6]Frederico Soares Cabral, Hidekazu Fukai, Satoshi Tamura:
Feature Extraction Methods Proposed for Speech Recognition Are Effective on Road Condition Monitoring Using Smartphone Inertial Sensors. Sensors 19(16): 3481 (2019) - 2016
- [j5]Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda, Satoru Hayamizu:
Investigation of DNN-Based Audio-Visual Speech Recognition. IEICE Trans. Inf. Syst. 99-D(10): 2444-2451 (2016) - 2013
- [j4]Kentaro Minoura, Satoshi Tamura, Satoru Hayamizu:
Probabilistic expression of Polynomial Semantic Indexing and its application for classification. Pattern Recognit. Lett. 34(13): 1485-1489 (2013) - 2012
- [j3]Keiko Yamamoto, Satoshi Tamura, Satoru Hayamizu, Yasutomi Kinosada:
Visual Analysis of Health Checkup Data Using Multidimensional Scaling. J. Adv. Comput. Intell. Intell. Informatics 16(1): 26-32 (2012) - 2007
- [j2]Koji Iwano, Tomoaki Yoshinaga, Satoshi Tamura, Sadaoki Furui:
Audio-Visual Speech Recognition Using Lip Information Extracted from Side-Face Images. EURASIP J. Audio Speech Music. Process. 2007 (2007) - 2004
- [j1]Satoshi Tamura, Koji Iwano, Sadaoki Furui:
Multi-Modal Speech Recognition Using Optical-Flow Analysis for Lip Images. J. VLSI Signal Process. 36(2-3): 117-124 (2004)
Conference and Workshop Papers
- 2024
- [c65]Ryosuke Tanaka, Satoshi Tamura:
Few-Shot Anomalous Sound Detection Based on Anomaly Map Estimation Using Pseudo Abnormal Data. ICASSP 2024: 1391-1395 - [c64]Satoshi Tamura, Tomohiro Hattori, Yusuke Kato, Naoki Noguchi:
Speech Recognition for Indigenous Language Using Self-Supervised Learning and Natural Language Processing. ICPRAM 2024: 779-784 - 2023
- [c63]Tomohiro Hattori, Satoshi Tamura:
Speech Recognition for Minority Languages Using HuBERT and Model Adaptation. ICPRAM 2023: 350-355 - 2022
- [c62]Shinnosuke Isobe, Satoshi Tamura, Yuuto Gotoh, Masaki Nose:
Efficient Multi-angle Audio-visual Speech Recognition using Parallel WaveGAN based Scene Classifier. ICPRAM 2022: 449-460 - [c61]Keisuke Yamazaki, Satoshi Tamura, Yuuto Gotoh, Masaki Nose:
Visual-only Voice Activity Detection using Human Motion in Conference Video. ICPRAM 2022: 570-577 - 2021
- [c60]Tsubasa Maeda, Satoshi Tamura:
Multi-view Convolution for Lipreading. APSIPA ASC 2021: 1092-1096 - [c59]Hayato Mori, Satoshi Tamura, Satoru Hayamizu:
Anomalous Sound Detection Based On Attention Mechanism. EUSIPCO 2021: 581-585 - [c58]Shinnosuke Isobe, Satoshi Tamura, Satoru Hayamizu:
Speech Recognition using Deep Canonical Correlation Analysis in Noisy Environments. ICPRAM 2021: 63-70 - [c57]Tsubasa Maeda, Satoshi Tamura, Satoru Hayamizu, Keigo Kawaji:
Combination of temporal and spatial denoising methods for cine MRI. LifeTech 2021: 44-47 - [c56]Shinnosuke Isobe, Ryuichi Hirose, Takumi Nishiwaki, Tomohiro Hattori, Satoshi Tamura, Yuuto Gotoh, Masaki Nose:
GAMVA: A Japanese Audio-Visual Multi-Angle Speech Corpus. O-COCOSDA 2021: 134-139 - 2020
- [c55]Shinnosuke Isobe, Satoshi Tamura, Satoru Hayamizu, Yuuto Gotoh, Masaki Nose:
Multi-angle lipreading using angle classification and angle-specific feature integration. ICCSPA 2020: 1-5 - 2018
- [c54]Shota Asahi, Satoshi Tamura, Yuko Sugiyama, Satoru Hayamizu:
Toward a High Performance Piano Practice Support System for Beginners. APSIPA 2018: 73-79 - [c53]Satoshi Tamura, Kento Horio, Hajime Endo, Satoru Hayamizu, Tomoki Toda:
Audio-visual Voice Conversion Using Deep Canonical Correlation Analysis for Deep Bottleneck Features. INTERSPEECH 2018: 2469-2473 - [c52]Frederico Soares Cabral, Mateus Pinto, Fernao A. L. N. Mouzinho, Hidekazu Fukai, Satoshi Tamura:
An Automatic Survey System for Paved and Unpaved Road Classification and Road Anomaly Detection using Smartphone Sensor. SOLI 2018: 65-70 - [c51]Vosco Pereira, Satoshi Tamura, Satoru Hayamizu, Hidekazu Fukai:
A Deep Learning-Based Approach for Road Pothole Detection in Timor Leste. SOLI 2018: 279-284 - 2017
- [c50]Chisa Kodama, Kunihito Kato, Satoshi Tamura, Satoru Hayamizu:
Swallowing function evaluation using deep-learning-based acoustic signal processing. APSIPA 2017: 961-964 - [c49]Yudai Suzuki, Keigo Kawaji, Amit R. Patel, Satoshi Tamura, Satoru Hayamizu:
Toward effective noise reduction for sub-Nyquist high-frame-rate MRI techniques with deep learning. APSIPA 2017: 1136-1139 - [c48]Satoshi Tamura, Koichi Miyazaki, Satoru Hayamizu:
Lipreading using deep bottleneck features for optical and depth images. AVSP 2017: 76-77 - 2016
- [c47]Kodai Nakajima, Satoshi Tamura, Satoru Hayamizu, Takashi Ichinomiya, Yasutomi Kinosada:
Investigation of clinical process visualization using EMR data in clinics. AMIA 2016 - [c46]Kazuaki Ogawa, Tatsuaki Murahashi, Hiroaki Taguchi, Koudai Nakajima, Masanori Takehara, Satoshi Tamura, Satoru Hayamizu:
Spoken Document Retrieval Using Neighboring Documents and Extended Language Models for Query Likelihood Model. NTCIR 2016 - [c45]Shinji Ujita, Yusuke Kinoshita, Hidekazu Umeda, Tatsuo Morita, Kazuhiro Kaibara, Satoshi Tamura, Masahiro Ishida, Tetsuzo Ueda:
A fully integrated GaN-based power IC including gate drivers for high-efficiency DC-DC Converters. VLSI Circuits 2016: 1-2 - 2015
- [c44]Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda, Satoru Hayamizu:
Audio-visual speech recognition using deep bottleneck features and high-performance lipreading. APSIPA 2015: 575-582 - [c43]Kazuto Ukai, Satoshi Tamura, Satoru Hayamizu:
Stream weight estimation using higher order statistics in multi-modal speech recognition. AVSP 2015: 181-184 - [c42]Satoshi Tamura, Takuya Uno, Masanori Takehara, Satoru Hayamizu, Takeshi Kurata:
Multi-modal service operation estimation using DNN-based acoustic bag-of-features. EUSIPCO 2015: 2291-2295 - [c41]Hiroshi Ninomiya, Norihide Kitaoka, Satoshi Tamura, Yurie Iribe, Kazuya Takeda:
Integration of deep bottleneck features for audio-visual speech recognition. INTERSPEECH 2015: 563-567 - 2014
- [c40]Masanori Takehara, Hiroya Nojiri, Satoshi Tamura, Satoru Hayamizu, Takeshi Kurata:
Analysis of customer communication by employee in restaurant and lead time estimation. APSIPA 2014: 1-5 - [c39]Tetsuya Kawase, Masanori Takehara, Satoshi Tamura, Satoru Hayamizu, Ryuhei Tenmoku, Takeshi Kurata:
Improvement of utterance clustering by using employees' sound and area data. ICASSP 2014: 3047-3051 - [c38]Kohei Sawada, Masanori Takehara, Satoshi Tamura, Satoru Hayamizu:
Audio-visual voice conversion using noise-robust features. ICASSP 2014: 7899-7903 - [c37]Kensuke Hara, Hiroaki Taguchi, Koudai Nakajima, Masanori Takehara, Satoshi Tamura, Satoru Hayamizu:
Segmented Spoken Document Retrieval Using Word Co-occurrence Information. NTCIR 2014 - [c36]Satoshi Tamura, Seko Takumi, Satoru Hayamizu:
Data collection for mobile audio-visual speech recognition in various environments. O-COCOSDA 2014: 1-6 - 2013
- [c35]Takuya Kawasaki, Naoya Ukai, Seko Takumi, Satoshi Tamura, Satoru Hayamizu:
Improvement of Lip Reading Performance in Real Environments Using Speaker and Environmental Adaptation. ACPR 2013: 346-350 - [c34]Alwis Nazir, Ryouhei Kawamoto, Keiko Yamamoto, Satoshi Tamura, Takashi Ichinomiya, Satoru Hayamizu, Yasutomi Kinosada:
Time-series analysis of health checkup data using Hidden-Markov model. AMIA 2013 - [c33]Kensuke Hara, Hideki Sekiya, Tetsuya Kawase, Satoshi Tamura, Satoru Hayamizu:
Confidence estimation and keyword extraction from speech recognition result based on Web information. APSIPA 2013: 1-6 - [c32]Peng Shen, Satoshi Tamura, Satoru Hayamizu:
Audio-visual interaction in sparse representation features for noise robust audio-visual speech recognition. AVSP 2013: 43-48 - [c31]Seko Takumi, Naoya Ukai, Satoshi Tamura, Satoru Hayamizu:
Improvement of lipreading performance using discriminative feature and speaker adaptation. AVSP 2013: 221-226 - [c30]Ryouhei Kawamoto, Alwis Nazir, Atsuyuki Kameyama, Takashi Ichinomiya, Keiko Yamamoto, Satoshi Tamura, Mayumi Yamamoto, Satoru Hayamizu, Yasutomi Kinosada:
Hidden Markov Model for Analyzing Time-Series Health Checkup Data. MedInfo 2013: 491-495 - [c29]Kiichi Hasegawa, Masanori Takehara, Satoshi Tamura, Satoru Hayamizu:
Spoken Document Retrieval Using Extended Query Model and Web Documents. NTCIR 2013 - [c28]Masanori Takehara, Satoshi Tamura, Satoru Hayamizu, Ryuhei Tenmoku, Takashi Okuma, Tomohiro Fukuhara, Takeshi Kurata:
Measurement and analysis of speech data toward improving service in restaurant. O-COCOSDA/CASLRE 2013: 1-4 - 2012
- [c27]Mari Okamura, Masanori Takehara, Satoshi Tamura, Satoru Hayamizu:
Toward polyphonic musical instrument identification using example-based sparse representation. APSIPA 2012: 1-4 - [c26]Kohei Sawada, Yoji Tagami, Satoshi Tamura, Masanori Takehara, Satoru Hayamizu:
Statistical voice conversion using GA-based informative feature. APSIPA 2012: 1-4 - [c25]Peng Shen, Satoshi Tamura, Satoru Hayamizu:
Feature reconstruction using sparse imputation for noise robust audio-visual speech recognition. APSIPA 2012: 1-4 - [c24]Satoshi Tamura, Satoru Hayamizu:
Multi-stream acoustic model adaptation for noisy speech recognition. APSIPA 2012: 1-4 - [c23]Satoshi Tamura, Yoji Tagami, Satoru Hayamizu:
GIF-SP: GA-based informative feature for noisy speech recognition. APSIPA 2012: 1-4 - [c22]Naoya Ukai, Seko Takumi, Satoshi Tamura, Satoru Hayamizu:
GIF-LR: GA-based informative feature for lipreading. APSIPA 2012: 1-4 - [c21]Tatsuya Yamashita, Satoshi Tamura, Kenji Hayashi, Yutaka Nishimoto, Satoru Hayamizu:
Sparse representation of audio features for sputum detection from lung sounds. ICPR 2012: 2005-2008 - 2011
- [c20]Kiichi Hasegawa, Hideki Sekiya, Masanori Takehara, Taro Niinomi, Satoshi Tamura, Satoru Hayamizu:
Toward improvement of SDR accuracy using LDA and query expansion for SpokenDoc. NTCIR 2011 - 2010
- [c19]Shin'ichi Takeuchi, Takashi Hashiba, Satoshi Tamura, Satoru Hayamizu:
Decision fusion by boosting method for multi-modal voice activity detection. AVSP 2010: 1-4 - [c18]Peng Shen, Satoshi Tamura, Satoru Hayamizu:
Evaluation of real-time audio-visual speech recognition. AVSP 2010: 4 - [c17]Satoshi Tamura, Chiyomi Miyajima, Norihide Kitaoka, Takeshi Yamada, Satoru Tsuge, Tetsuya Takiguchi, Kazumasa Yamamoto, Takanobu Nishiura, Masato Nakayama, Yuki Denda, Masakiyo Fujimoto, Shigeki Matsuda, Tetsuji Ogawa, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura:
CENSREC-1-AV: an audio-visual corpus for noisy bimodal speech recognition. AVSP 2010: 6 - [c16]Satoshi Tamura, Eriko Hishikawa, Wataru Taguchi, Satoru Hayamizu:
Template-based spectral estimation using microphone array for speech recognition. INTERSPEECH 2010: 2050-2053 - [c15]Satoshi Tamura, Masato Ishikawa, Takashi Hashiba, Shin'ichi Takeuchi, Satoru Hayamizu:
A robust audio-visual speech recognition using audio-visual voice activity detection. INTERSPEECH 2010: 2694-2697 - 2009
- [c14]Shin'ichi Takeuchi, Takashi Hashiba, Satoshi Tamura, Satoru Hayamizu:
Voice activity detection based on fusion of audio and visual information. AVSP 2009: 151-154 - 2008
- [c13]Satoshi Tamura, Chiyomi Miyajima, Norihide Kitaoka, Satoru Hayamizu, Kazuya Takeda:
CENSREC-AV: evaluation frameworks for audio-visual speech recognition. AVSP 2008: 51-54 - [c12]Masato Nakayama, Takanobu Nishiura, Yuki Denda, Norihide Kitaoka, Kazumasa Yamamoto, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, Masakiyo Fujimoto, Tetsuya Takiguchi, Satoshi Tamura, Tetsuji Ogawa, Shigeki Matsuda, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura:
CENSREC-4: development of evaluation framework for distant-talking speech recognition under reverberant environments. INTERSPEECH 2008: 968-971 - [c11]Takanobu Nishiura, Masato Nakayama, Yuki Denda, Norihide Kitaoka, Kazumasa Yamamoto, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, Masakiyo Fujimoto, Tetsuya Takiguchi, Satoshi Tamura, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura:
Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments: newest Part of the CENSREC Series -. LREC 2008 - 2007
- [c10]Norihide Kitaoka, Kazumasa Yamamoto, Tomohiro Kusamizu, Seiichi Nakagawa, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, Takanobu Nishiura, Masato Nakayama, Yuki Denda, Masakiyo Fujimoto, Tetsuya Takiguchi, Satoshi Tamura, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura:
Development of VAD evaluation framework CENSREC-1-C and investigation of relationship between VAD and speech recognition performance. ASRU 2007: 607-612 - [c9]Satoshi Tamura, Kunihiko Takamatsu, Shinji Ogura, Satoru Hayamizu:
GEMSIS - a novel application of speech recognition to emergency and disaster medicine. INTERSPEECH 2007: 2569-2572 - 2006
- [c8]Satoshi Tamura, Koji Hashimoto, Jiong Zhu, Satoru Hayamizu, Hirotsugu Asai, Hideki Tanahashi, Makoto Kanagawa:
Automatic metadata generation and video editing based on speech and image recognition for medical education contents. INTERSPEECH 2006 - [c7]Yujiro Hayashi, Satoshi Tamura, Satoru Hayamizu, Yutaka Nishimoto:
Note-Taking Support for Nurses Using Digital Pen Character Recognition System. VSMM 2006: 428-436 - 2005
- [c6]Satoshi Tamura, Koji Iwano, Sadaoki Furui:
A Stream-Weight Optimization Method for Multi-Stream HMMS Based on Likelihood Value Normalization. ICASSP (1) 2005: 469-472 - 2004
- [c5]Satoshi Tamura, Koji Iwano, Sadaoki Furui:
A stream-weight optimization method for audio-visual speech recognition using multi-stream HMMs. ICASSP (1) 2004: 857-860 - 2003
- [c4]Tomoaki Yoshinaga, Satoshi Tamura, Koji Iwano, Sadaoki Furui:
Audio-visual speech recognition using lip movement extracted from side-face images. AVSP 2003: 117-120 - 2002
- [c3]Satoshi Nakamura, Ken'ichi Kumatani, Satoshi Tamura:
Robust bi-modal speech recognition based on state synchronous modeling and stream weight optimization. ICASSP 2002: 309-312 - [c2]Satoshi Nakamura, Ken'ichi Kumatani, Satoshi Tamura:
Multi-Modal Temporal Asynchronicity Modeling by Product HMMs for Robust. ICMI 2002: 305-312 - 2001
- [c1]Sadaoki Furui, Koji Iwano, Chiori Hori, Takahiro Shinozaki, Yohei Saito, Satoshi Tamura:
Ubiquitous speech processing. ICASSP 2001: 13-16
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-08-06 20:58 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint