
Sakriani Sakti
Sakriani Watiasri Sakti
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2020
- [j39]Seitaro Shinagawa
, Koichiro Yoshino, Seyed Hossein Alavi, Kallirroi Georgila, David R. Traum, Sakriani Sakti, Satoshi Nakamura:
An Interactive Image Editing System Using an Uncertainty-Based Confirmation Strategy. IEEE Access 8: 98471-98480 (2020) - [j38]The Tung Nguyen
, Koichiro Yoshino, Sakriani Sakti
, Satoshi Nakamura:
Policy Reuse for Dialog Management Using Action-Relation Probability. IEEE Access 8: 159639-159649 (2020) - [j37]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Recurrent Neural Network Compression Based on Low-Rank Tensor Representation. IEICE Trans. Inf. Syst. 103-D(2): 435-449 (2020) - [j36]Johanes Effendi, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
Leveraging Neural Caption Translation with Visually Grounded Paraphrase Augmentation. IEICE Trans. Inf. Syst. 103-D(3): 674-683 (2020) - [j35]Andros Tjandra
, Sakriani Sakti
, Satoshi Nakamura
:
Machine Speech Chain. IEEE ACM Trans. Audio Speech Lang. Process. 28: 976-989 (2020) - [j34]Takatomo Kano
, Sakriani Sakti
, Satoshi Nakamura
:
End-to-End Speech Translation With Transcoding by Multi-Task Learning for Distant Language Pairs. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1342-1355 (2020) - [j33]Andros Tjandra
, Sakriani Sakti
, Satoshi Nakamura
:
Corrections to "Machine Speech Chain". IEEE ACM Trans. Audio Speech Lang. Process. 28: 1706 (2020) - [c168]Fan Yang, Feiran Li, Yang Wu, Sakriani Sakti, Satoshi Nakamura:
Using Panoramic Videos for Multi-Person Localization and Tracking In A 3D Panoramic Coordinate. ICASSP 2020: 1863-1867 - [c167]Kazuki Tsunematsu, Johanes Effendi, Sakriani Sakti, Satoshi Nakamura:
Neural Speech Completion. INTERSPEECH 2020: 2742-2746 - [c166]Ivan Halim Parmonangan, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura:
Combining Audio and Brain Activity for Predicting Speech Quality. INTERSPEECH 2020: 2762-2766 - [c165]Sashi Novitasari, Andros Tjandra, Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura:
Incremental Machine Speech Chain Towards Enabling Listening While Speaking in Real-Time. INTERSPEECH 2020: 4372-4376 - [c164]Ewan Dunbar, Julien Karadayi, Mathieu Bernard, Xuan-Nga Cao, Robin Algayres, Lucas Ondel, Laurent Besacier, Sakriani Sakti, Emmanuel Dupoux:
The Zero Resource Speech Challenge 2020: Discovering Discrete Subword and Word Units. INTERSPEECH 2020: 4831-4835 - [c163]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge. INTERSPEECH 2020: 4851-4855 - [c162]Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Augmenting Images for ASR and TTS Through Single-Loop and Dual-Loop Multimodal Chain Framework. INTERSPEECH 2020: 4901-4905 - [c161]Sara Asai, Koichiro Yoshino, Seitaro Shinagawa, Sakriani Sakti, Satoshi Nakamura:
Emotional Speech Corpus for Persuasive Dialogue System. LREC 2020: 491-497 - [c160]Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Cross-Lingual Machine Speech Chain for Javanese, Sundanese, Balinese, and Bataks Speech Recognition and Synthesis. SLTU/CCURL@LREC 2020: 131-138 - [e2]Dorothee Beermann, Laurent Besacier, Sakriani Sakti, Claudia Soria:
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, SLTU/CCURL@LREC 2020, Marseille, France, May 2020. European Language Resources association 2020, ISBN 979-10-95546-35-1 [contents] - [i27]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge. CoRR abs/2005.11676 (2020) - [i26]Fan Yang, Xin Chang, Chenyu Dang, Ziqiang Zheng, Sakriani Sakti, Satoshi Nakamura, Yang Wu:
ReMOTS: Self-Supervised Refining Multi-Object Tracking and Segmentation. CoRR abs/2007.03200 (2020) - [i25]Ewan Dunbar, Julien Karadayi, Mathieu Bernard, Xuan-Nga Cao, Robin Algayres, Lucas Ondel, Laurent Besacier, Sakriani Sakti, Emmanuel Dupoux:
The Zero Resource Speech Challenge 2020: Discovering discrete subword and word units. CoRR abs/2010.05967 (2020) - [i24]Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework. CoRR abs/2011.02099 (2020) - [i23]Sashi Novitasari, Andros Tjandra, Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura:
Incremental Machine Speech Chain Towards Enabling Listening while Speaking in Real-time. CoRR abs/2011.02126 (2020) - [i22]Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition. CoRR abs/2011.02127 (2020) - [i21]Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Cross-Lingual Machine Speech Chain for Javanese, Sundanese, Balinese, and Bataks Speech Recognition and Synthesis. CoRR abs/2011.02128 (2020) - [i20]Katsuhito Sudoh, Takatomo Kano, Sashi Novitasari, Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura:
Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS. CoRR abs/2011.04845 (2020)
2010 – 2019
- 2019
- [j32]Andros Tjandra
, Sakriani Sakti, Satoshi Nakamura:
End-to-End Speech Recognition Sequence Training With Reinforcement Learning. IEEE Access 7: 79758-79769 (2019) - [j31]Fan Yang
, Sakriani Sakti, Yang Wu, Satoshi Nakamura:
A Framework for Knowing Who is Doing What in Aerial Surveillance Videos. IEEE Access 7: 93315-93325 (2019) - [j30]Hiroki Tanaka, Hiroki Watanabe, Hayato Maki, Sakriani Sakti, Satoshi Nakamura:
Electroencephalogram-Based Single-Trial Detection of Language Expectation Violations in Listening to Speech. Frontiers Comput. Neurosci. 13: 15 (2019) - [j29]Hiroki Watanabe, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura:
Neural Oscillation-Based Classification of Japanese Spoken Sentences During Speech Perception. IEICE Trans. Inf. Syst. 102-D(2): 383-391 (2019) - [j28]Nurul Lubis
, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura
:
Positive Emotion Elicitation in Chat-Based Dialogue Systems. IEEE ACM Trans. Audio Speech Lang. Process. 27(4): 866-877 (2019) - [c159]Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Listening While Speaking and Visualizing: Improving ASR Through Multimodal Chain. ASRU 2019: 471-478 - [c158]Takatomo Kano, Sakriani Sakti, Satoshi Nakamura:
Neural Machine Translation with Acoustic Embedding. ASRU 2019: 578-584 - [c157]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Speech-to-Speech Translation Between Untranscribed Unknown Languages. ASRU 2019: 593-600 - [c156]Sahoko Nakayama, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Zero-Shot Code-Switching ASR and TTS with Multilingual Machine Speech Chain. ASRU 2019: 964-971 - [c155]Holy Lovenia, Hiroki Tanaka, Sakriani Sakti, Ayu Purwarianti, Satoshi Nakamura:
Speech Artifact Removal from Eeg Recordings of Spoken Word Production with Tensor Decomposition. ICASSP 2019: 1115-1119 - [c154]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
End-to-end Feedback Loss in Speech Chain Framework via Straight-through Estimator. ICASSP 2019: 6281-6285 - [c153]Marco Vetter, Sakriani Sakti, Satoshi Nakamura:
Cross-lingual Speech-based Tobi Label Generation Using Bidirectional Lstm. ICASSP 2019: 6620-6624 - [c152]Ewan Dunbar, Robin Algayres, Julien Karadayi, Mathieu Bernard, Juan Benjumea, Xuan-Nga Cao, Lucie Miskic, Charlotte Dugrain, Lucas Ondel, Alan W. Black, Laurent Besacier, Sakriani Sakti, Emmanuel Dupoux:
The Zero Resource Speech Challenge 2019: TTS Without T. INTERSPEECH 2019: 1088-1092 - [c151]Andros Tjandra, Berrak Sisman, Mingyang Zhang, Sakriani Sakti, Haizhou Li
, Satoshi Nakamura:
VQVAE Unsupervised Unit Discovery and Multi-Scale Code2Spec Inverter for Zerospeech Challenge 2019. INTERSPEECH 2019: 1118-1122 - [c150]Ivan Halim Parmonangan, Hiroki Tanaka, Sakriani Sakti, Shinnosuke Takamichi, Satoshi Nakamura:
Speech Quality Evaluation of Synthesized Japanese Speech Using EEG. INTERSPEECH 2019: 1228-1232 - [c149]Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition. INTERSPEECH 2019: 3835-3839 - [c148]Fan Yang, Yang Wu, Sakriani Sakti, Satoshi Nakamura:
Make Skeleton-based Action Recognition Model Smaller, Faster and Better. MMAsia 2019: 31:1-31:6 - [c147]Sahoko Nakayama, Takatomo Kano, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Recognition and translation of code-switching speech utterances. O-COCOSDA 2019: 1-6 - [c146]Mayuko Okamato, Sakriani Sakti, Satoshi Nakamura:
Phoneme-level speaking rate variation on waveform generation using GAN-TTS. O-COCOSDA 2019: 1-7 - [i19]Ewan Dunbar, Robin Algayres, Julien Karadayi, Mathieu Bernard, Juan Benjumea, Xuan-Nga Cao, Lucie Miskic, Charlotte Dugrain, Lucas Ondel, Alan W. Black, Laurent Besacier, Sakriani Sakti, Emmanuel Dupoux:
The Zero Resource Speech Challenge 2019: TTS without T. CoRR abs/1904.11469 (2019) - [i18]Andros Tjandra, Berrak Sisman, Mingyang Zhang, Sakriani Sakti, Haizhou Li, Satoshi Nakamura:
VQVAE Unsupervised Unit Discovery and Multi-scale Code2Spec Inverter for Zerospeech Challenge 2019. CoRR abs/1905.11449 (2019) - [i17]Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
From Speech Chain to Multimodal Chain: Leveraging Cross-modal Data Augmentation for Semi-supervised Learning. CoRR abs/1906.00579 (2019) - [i16]Fan Yang, Sakriani Sakti, Yang Wu, Satoshi Nakamura:
Make Skeleton-based Action Recognition Model Smaller, Faster and Better. CoRR abs/1907.09658 (2019) - [i15]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Speech-to-speech Translation between Untranscribed Unknown Languages. CoRR abs/1910.00795 (2019) - [i14]Fan Yang, Feiran Li, Yang Wu, Sakriani Sakti, Satoshi Nakamura:
Using panoramic videos for multi-person localization and tracking in a 3D panoramic coordinate. CoRR abs/1911.10535 (2019) - 2018
- [j27]Michael Heck, Sakriani Sakti, Satoshi Nakamura:
Learning Supervised Feature Transformations on Zero Resources for Improved Acoustic Unit Discovery. IEICE Trans. Inf. Syst. 101-D(1): 205-214 (2018) - [j26]Nurul Lubis, Dessi Puji Lestari, Sakriani Sakti, Ayu Purwarianti, Satoshi Nakamura:
Construction of Spontaneous Emotion Corpus from Indonesian TV Talk Shows and Its Application on Multimodal Emotion Recognition. IEICE Trans. Inf. Syst. 101-D(8): 2092-2100 (2018) - [j25]Takatomo Kano
, Shinnosuke Takamichi, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
An end-to-end model for cross-lingual transformation of paralinguistic information. Mach. Transl. 32(4): 353-368 (2018) - [j24]Quoc Truong Do
, Sakriani Sakti, Satoshi Nakamura
:
Sequence-to-Sequence Models for Emphasis Speech Translation. IEEE ACM Trans. Audio Speech Lang. Process. 26(10): 1873-1883 (2018) - [j23]Michael Heck
, Sakriani Sakti, Satoshi Nakamura
:
Dirichlet Process Mixture of Mixtures Model for Unsupervised Subword Modeling. IEEE ACM Trans. Audio Speech Lang. Process. 26(11): 2027-2042 (2018) - [c145]Nurul Lubis, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura:
Eliciting Positive Emotion through Affect-Sensitive Dialogue Response Generation: A Neural Network Approach. AAAI 2018: 5293-5300 - [c144]Naoki Hosomi, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura:
Deception Detection and Analysis in Spoken Dialogues based on FastText. APSIPA 2018: 139-142 - [c143]Masahiro Honda, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura:
Detecting suppression of negative emotion by time series change of cerebral blood flow using fNIRS. BHI 2018: 398-401 - [c142]Hiroki Tanaka, Hiroki Watanabe, Hayato Maki, Sakriani Sakti, Satoshi Nakamura:
Single-Trial Detection of Semantic Anomalies From EEG During Listening to Spoken Sentences. EMBC 2018: 977-980 - [c141]Sashi Novitasari, Dessi Puji Lestari, Sakriani Sakti, Ayu Purwarianti:
Rude-Words Detection for Indonesian Speech Using Support Vector Machine. IALP 2018: 19-24 - [c140]Hayato Maki, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura:
Graph Regularized Tensor Factorization for Single-Trial EEG Analysis. ICASSP 2018: 846-850 - [c139]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Sequence-to-Sequence Asr Optimization Via Reinforcement Learning. ICASSP 2018: 5829-5833 - [c138]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Tensor Decomposition for Compressing Recurrent Neural Network. IJCNN 2018: 1-8 - [c137]Takuma Mori, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Compressing End-to-end ASR Networks by Tensor-Train Decomposition. INTERSPEECH 2018: 806-810 - [c136]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Machine Speech Chain with One-shot Speaker Adaptation. INTERSPEECH 2018: 887-891 - [c135]Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura:
Incremental TTS for Japanese Language. INTERSPEECH 2018: 902-906 - [c134]The Tung Nguyen, Koichiro Yoshino, Sakriani Sakti, Satoshi Nakamura:
Impact of Deception Information on Negotiation Dialog Management: A Case Study on Doctor-Patient Conversations. IWSDS 2018: 199-206 - [c133]Sashi Novitasari, Quoc Truong Do, Sakriani Sakti, Dessi Puji Lestari, Satoshi Nakamura:
Construction of English-French Multimodal Affective Conversational Corpus from TV Dramas. LREC 2018 - [c132]Koichiro Yoshino, Yoko Ishikawa, Masahiro Mizukami, Yu Suzuki, Sakriani Sakti, Satoshi Nakamura:
Dialogue Scenario Collection of Persuasive Dialogue with Emotional Expressions via Crowdsourcing. LREC 2018 - [c131]Sashi Novitasari, Quoc Truong Do, Sakriani Sakti, Dessi Puji Lestari, Satoshi Nakamura:
Multi-Modal Multi-Task Deep Learning For Speaker And Emotion Recognition Of TV-Series Data. O-COCOSDA 2018: 37-42 - [c130]Sahoko Nakayama, Takatomo Kano, Quoc Truong Do, Sakriani Sakti, Satoshi Nakamura:
Japanese-English Code-Switching Speech Data Construction. O-COCOSDA 2018: 67-71 - [c129]Nurul Lubis, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura:
Unsupervised Counselor Dialogue Clustering for Positive Emotion Elicitation in Neural Dialogue System. SIGDIAL Conference 2018: 161-170 - [c128]Sahoko Nakayama, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Speech Chain for Semi-Supervised Learning of Japanese-English Code-Switching ASR and TTS. SLT 2018: 182-189 - [c127]Berrak Sisman
, Mingyang Zhang, Sakriani Sakti, Haizhou Li
, Satoshi Nakamura:
Adaptive Wavenet Vocoder for Residual Compensation in GAN-Based Voice Conversion. SLT 2018: 282-289 - [c126]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Multi-Scale Alignment and Contextual History for Attention Mechanism in Sequence-to-Sequence Model. SLT 2018: 648-655 - [c125]Quoc Truong Do, Sakriani Sakti, Satoshi Nakamura:
Toward Multi-Features Emphasis Speech Translation: Assessment of Human Emphasis Production and Perception with Speech and Text Clues. SLT 2018: 700-706 - [c124]Nurul Lubis, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura:
Optimizing Neural Response Generator with Emotional Impact Information. SLT 2018: 876-883 - [c123]Bin Wu, Sakriani Sakti, Jinsong Zhang, Satoshi Nakamura:
Optimizing DPGMM Clustering in Zero Resource Setting Based on Functional Load. SLTU 2018: 1-5 - [c122]Khumaisa Nur'Aini, Johanes Effendi, Sakriani Sakti, Mirna Adriani, Satoshi Nakamura:
Corpus Construction and Semantic Analysis of Indonesian Image Description. SLTU 2018: 42-46 - [i13]Takatomo Kano, Sakriani Sakti, Satoshi Nakamura:
Structured-based Curriculum Learning for End-to-end English-Japanese Speech Translation. CoRR abs/1802.06003 (2018) - [i12]Seitaro Shinagawa, Koichiro Yoshino, Sakriani Sakti, Yu Suzuki, Satoshi Nakamura:
Interactive Image Manipulation with Natural Language Instruction Commands. CoRR abs/1802.08645 (2018) - [i11]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Tensor Decomposition for Compressing Recurrent Neural Network. CoRR abs/1802.10410 (2018) - [i10]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Machine Speech Chain with One-shot Speaker Adaptation. CoRR abs/1803.10525 (2018) - [i9]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Multi-scale Alignment and Contextual History for Attention Mechanism in Sequence-to-sequence Model. CoRR abs/1807.08280 (2018) - [i8]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
End-to-End Feedback Loss in Speech Chain Framework via Straight-Through Estimator. CoRR abs/1810.13107 (2018) - 2017
- [j22]Quoc Truong Do, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Preserving Word-Level Emphasis in Speech-to-Speech Translation. IEEE ACM Trans. Audio Speech Lang. Process. 25(3): 544-556 (2017) - [c121]Nurul Lubis, Michael Heck, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura:
Processing negative emotions through social communication: Multimodal database construction and analysis. ACII 2017: 79-85 - [c120]Kazutaka Kubo, Kazuhiro Kobayashi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
An investigation of how to design control parameters for statistical voice timbre control. APSIPA 2017: 1520-1523 - [c119]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Listening while speaking: Speech chain by deep learning. ASRU 2017: 301-308 - [c118]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Attention-based Wav2Text with feature transfer learning. ASRU 2017: 309-315 - [c117]Michael Heck, Sakriani Sakti, Satoshi Nakamura:
Feature optimized DPGMM clustering for unsupervised subword modeling: A contribution to zerospeech 2017. ASRU 2017: 740-746 - [c116]Naoto Terasawa, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura:
Tracking liking state in brain activity while watching multiple movies. ICMI 2017: 321-325 - [c115]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Local Monotonic Attention Mechanism for End-to-End Speech And Language Processing. IJCNLP(1) 2017: 431-440 - [c114]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Compressing recurrent neural network with tensor train. IJCNN 2017: 4451-4458 - [c113]Hiroki Watanabe, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura:
Subject-Independent Classification of Japanese Spoken Sentences by Multiple Frequency Bands Phase Pattern of EEG Response During Speech Perception. INTERSPEECH 2017: 2431-2435 - [c112]Takatomo Kano, Sakriani Sakti, Satoshi Nakamura:
Structured-Based Curriculum Learning for End-to-End English-Japanese Speech Translation. INTERSPEECH 2017: 2630-2634 - [c111]Quoc Truong Do, Sakriani Sakti, Satoshi Nakamura:
Toward Expressive Speech Translation: A Unified Sequence-to-Sequence LSTMs Approach for Translating Words and Emphasis. INTERSPEECH 2017: 2640-2644 - [c110]Nurul Lubis, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura:
Eliciting Positive Emotional Impact in Dialogue Response Selection. IWSDS 2017: 135-148 - [c109]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Speech recognition features based on deep latent Gaussian models. MLSP 2017: 1-6 - [c108]Johanes Effendi, Sakriani Sakti, Satoshi Nakamura:
Creation of a multi-paraphrase corpus based on various elementary operations. O-COCOSDA 2017: 1-6 - [c107]Kohei Mukaihara, Sakriani Sakti, Satoshi Nakamura:
Recognizing Emotionally Coloured Dialogue Speech Using Speaker-Adapted DNN-CNN Bottleneck Features. SPECOM 2017: 632-641 - [i7]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Compressing Recurrent Neural Network with Tensor Train. CoRR abs/1705.08052 (2017) - [i6]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Local Monotonic Attention Mechanism for End-to-End Speech Recognition. CoRR abs/1705.08091 (2017) - [i5]Andros Tjandra, Sakriani Sakti, Ruli Manurung, Mirna Adriani, Satoshi Nakamura:
Gated Recurrent Neural Tensor Network. CoRR abs/1706.02222 (2017) - [i4]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Listening while Speaking: Speech Chain by Deep Learning. CoRR abs/1707.04879 (2017) - [i3]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Attention-based Wav2Text with Feature Transfer Learning. CoRR abs/1709.07814 (2017) - [i2]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Sequence-to-Sequence ASR Optimization via Reinforcement Learning. CoRR abs/1710.10774 (2017) - 2016
- [j21]Hayato Maki, Tomoki Toda, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
Enhancing Event-Related Potentials Based on Maximum a Posteriori Estimation with a Spatial Correlation Prior. IEICE Trans. Inf. Syst. 99-D(6): 1437-1446 (2016) - [j20]Shinnosuke Takamichi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
A Statistical Sample-Based Approach to GMM-Based Voice Conversion Using Tied-Covariance Acoustic Models. IEICE Trans. Inf. Syst. 99-D(10): 2490-2498 (2016) - [j19]Lasguido Nio, Sakriani Sakti, Graham Neubig, Koichiro Yoshino, Satoshi Nakamura:
Neural Network Approaches to Dialog Response Retrieval and Generation. IEICE Trans. Inf. Syst. 99-D(10): 2508-2517 (2016) - [j18]Yuji Oshima, Shinnosuke Takamichi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Non-Native Text-to-Speech Preserving Speaker Individuality Based on Partial Correction of Prosodic and Phonetic Characteristics. IEICE Trans. Inf. Syst. 99-D(12): 3132-3139 (2016) - [j17]Takuya Hiraoka, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Learning cooperative persuasive dialogue policies using framing. Speech Commun. 84: 83-96 (2016) - [j16]Shinnosuke Takamichi, Tomoki Toda, Alan W. Black, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Postfilters to Modify the Modulation Spectrum for Statistical Parametric Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 24(4): 755-767 (2016) - [j15]Hiroki Tanaka, Sakriani Sakti, Graham Neubig, Tomoki Toda, Hideki Negoro, Hidemi Iwasaka, Satoshi Nakamura:
Teaching Social Communication Skills Through Human-Agent Interaction. ACM Trans. Interact. Intell. Syst. 6(2): 18:1-18:26 (2016) - [c106]Hiroki Tanaka, Sakriani Sakti, Graham Neubig, Hideki Negoro, Hidemi Iwasaka, Satoshi Nakamura:
Automated social skills training with audiovisual information. EMBC 2016: 2262-2265 - [c105]Hayato Maki, Tomoki Toda, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
Removing noise from event-related potentials using a probabilistic generative model with grouped covariance matrices. EMBC 2016: 3728-3731 - [c104]Rui Hiraoka, Hiroki Tanaka, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
Personalized unknown word detection in non-native language reading using eye gaze. ICMI 2016: 66-70 - [c103]Andros Tjandra, Sakriani Sakti, Ruli Manurung, Mirna Adriani, Satoshi Nakamura:
Gated Recurrent Neural Tensor Network. IJCNN 2016: 448-455 - [c102]Michael Heck, Sakriani Sakti, Satoshi Nakamura:
Supervised Learning of Acoustic Models in a Zero Resource Setting to Improve DPGMM Clustering. INTERSPEECH 2016: 1310-1314 - [c101]Quoc Truong Do, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
Transferring Emphasis in Speech Translation Using Hard-Attentional Neural Network Models. INTERSPEECH 2016: 2533-2537 - [c100]Satoshi Tsujioka, Sakriani Sakti, Koichiro Yoshino, Graham Neubig, Satoshi Nakamura:
Unsupervised Joint Estimation of Grapheme-to-Phoneme Conversion Systems and Acoustic Model Adaptation for Non-Native Speech Recognition. INTERSPEECH 2016: 3091-3095 - [c99]Quoc Truong Do, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
A Hybrid System for Continuous Word-Level Emphasis Modeling Based on HMM State Clustering and Adaptive Training. INTERSPEECH 2016: 3196-3200 - [c98]