


default search action
Sakriani Sakti
Sakriani Watiasri Sakti
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2025
- [j54]Luan Thanh Nguyen
, Sakriani Sakti
:
ZeST: A Zero-Resourced Speech-to-Speech Translation Approach for Unknown, Unpaired, and Untranscribed Languages. IEEE Access 13: 8638-8648 (2025) - 2024
- [j53]Bimasena Putra
, Kurniawati Azizah
, Candy Olivia Mawalim
, Ikhlasul Akmal Hanif
, Sakriani Sakti
, Chee Wee Leong, Shogo Okada
:
MAG-BERT-ARL for Fair Automated Video Interview Assessment. IEEE Access 12: 145188-145205 (2024) - [j52]Kei Furukawa
, Takeshi Kishiyama
, Satoshi Nakamura
, Sakriani Sakti
:
Applying Syntax-Prosody Mapping Hypothesis and Boundary-Driven Theory to Neural Sequence-to-Sequence Speech Synthesis. IEEE Access 12: 160896-160917 (2024) - [j51]Yuka Ko, Katsuhito Sudoh, Sakriani Sakti, Satoshi Nakamura:
Neural End-To-End Speech Translation Leveraged by ASR Posterior Distribution. IEICE Trans. Inf. Syst. 107(10): 1322-1331 (2024) - [c222]Mushaffa Rasyid Ridha, Sakriani Sakti:
Refining rtMRI Landmark-Based Vocal Tract Contour Labels with FCN-Based Smoothing and Point-to-Curve Projection. LREC/COLING 2024: 13796-13802 - [c221]Aulia Adila, Dessi Puji Lestari, Ayu Purwarianti, Dipta Tanaya, Kurniawati Azizah, Sakriani Sakti:
Enhancing Indonesian Automatic Speech Recognition: Evaluating Multilingual Models with Diverse Speech Variabilities. O-COCOSDA 2024: 1-6 - [c220]Iqbal Pahlevi Amin, Haotian Tan, Kurniawati Azizah, Sakriani Sakti:
Chunk Size Scheduling for Optimizing the Quality-Latency Trade-off in Simultaneous Speech Translation. O-COCOSDA 2024: 1-6 - [c219]Dhiya Dewangga, Dessi Puji Lestari, Ayu Purwarianti, Dipta Tanaya, Kurniawati Azizah, Sakriani Sakti:
An Evaluation of Neural Vocoder-Based Voice Cloning System for Dysphonia Speech Disorder. O-COCOSDA 2024: 1-7 - [c218]Ahmad Alfani Handoyo, Chung Tran, Dessi Puji Lestari, Sakriani Sakti:
Indonesian-English Code-Switching Speech Synthesizer Utilizing Multilingual STEN-TTS and Bert LID. O-COCOSDA 2024: 1-6 - [c217]Geoffrey Tyndall, Kurniawati Azizah, Dipta Tanaya, Ayu Purwarianti, Dessi Puji Lestari, Sakriani Sakti:
Continual Learning in Machine Speech Chain Using Gradient Episodic Memory. O-COCOSDA 2024: 1-6 - [c216]Zhanhang Zhang, Sakriani Sakti:
A Feedback-Driven Self-Improvement Strategy and Emotion-Aware Vocoder for Emotional Voice Conversion. O-COCOSDA 2024: 1-6 - [c215]Sakriani Sakti:
The Message of the O-COCOSDA Convenor. O-COCOSDA 2024: v-vi - [e4]Nicoletta Calzolari, Min-Yen Kan, Véronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue:
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC/COLING 2024, 20-25 May, 2024, Torino, Italy. ELRA and ICCL 2024, ISBN 978-2-493814-10-4 [contents] - [i40]Yuka Ko, Ryo Fukuda, Yuta Nishikawa, Yasumasa Kano, Tomoya Yanagita, Kosuke Doi, Mana Makinae, Haotian Tan, Makoto Sakai, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
NAIST Simultaneous Speech Translation System for IWSLT 2024. CoRR abs/2407.00826 (2024) - [i39]Haotian Tan, Sakriani Sakti:
Contrastive Feedback Mechanism for Simultaneous Speech Translation. CoRR abs/2407.20524 (2024) - [i38]Nick Rossenbach, Ralf Schlüter, Sakriani Sakti:
On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition. CoRR abs/2407.21476 (2024) - [i37]Aulia Adila, Dessi Puji Lestari, Ayu Purwarianti, Dipta Tanaya, Kurniawati Azizah, Sakriani Sakti:
Enhancing Indonesian Automatic Speech Recognition: Evaluating Multilingual Models with Diverse Speech Variabilities. CoRR abs/2410.08828 (2024) - [i36]Bin Wu, Sakriani Sakti, Shinnosuke Takamichi, Satoshi Nakamura:
A Neural Transformer Framework for Simultaneous Tasks of Segmentation, Classification, and Caller Identification of Marmoset Vocalization. CoRR abs/2410.23279 (2024) - [i35]Geoffrey Tyndall, Kurniawati Azizah, Dipta Tanaya, Ayu Purwarianti, Dessi Puji Lestari, Sakriani Sakti:
Continual Learning in Machine Speech Chain Using Gradient Episodic Memory. CoRR abs/2411.18320 (2024) - [i34]Ahmad Alfani Handoyo, Chung Tran, Dessi Puji Lestari, Sakriani Sakti:
Indonesian-English Code-Switching Speech Synthesizer Utilizing Multilingual STEN-TTS and Bert LID. CoRR abs/2412.19043 (2024) - 2023
- [j50]Tomoya Yanagita
, Sakriani Sakti
, Satoshi Nakamura
:
Japanese Neural Incremental Text-to-Speech Synthesis Framework With an Accent Phrase Input. IEEE Access 11: 22355-22363 (2023) - [c214]Samuel Cahyawijaya, Holy Lovenia, Alham Fikri Aji, Genta Indra Winata, Bryan Wilie, Fajri Koto, Rahmad Mahendra, Christian Wibisono, Ade Romadhony, Karissa Vincentio, Jennifer Santoso, David Moeljadi, Cahya Wirawan
, Frederikus Hudi, Muhammad Satrio Wicaksono, Ivan Halim Parmonangan, Ika Alfina, Ilham Firdausi Putra, Samsul Rahmadani, Yulianti Oenang, Ali Akbar Septiandri, James Jaya, Kaustubh D. Dhole, Arie Ardiyanti Suryani, Rifki Afina Putri
, Dan Su, Keith Stevens, Made Nindyatama Nityasya, Muhammad Farid Adilazuarda, Ryan Hadiwijaya, Ryandito Diandaru, Tiezheng Yu, Vito Ghifari, Wenliang Dai, Yan Xu, Dyah Damapuspita, Haryo Akbarianto Wibowo, Cuk Tho, Ichwanul Muslim Karo Karo, Tirana Fatyanosa, Ziwei Ji, Graham Neubig, Timothy Baldwin, Sebastian Ruder, Pascale Fung, Herry Sujaini, Sakriani Sakti, Ayu Purwarianti:
NusaCrowd: Open Source Initiative for Indonesian NLP Resources. ACL (Findings) 2023: 13745-13818 - [c213]Sakriani Sakti, Benita Angela Titalim:
Leveraging the Multilingual Indonesian Ethnic Languages Dataset In Self-Supervised Models for Low-Resource ASR Task. ASRU 2023: 1-8 - [c212]Ruhiyah Widiaputri, Ayu Purwarianti, Dessi Puji Lestari, Kurniawati Azizah, Dipta Tanaya, Sakriani Sakti:
Speech Recognition and Meaning Interpretation: Towards Disambiguation of Structurally Ambiguous Spoken Utterances in Indonesian. EMNLP 2023: 16813-16824 - [c211]Jianan Chen, Sakriani Sakti:
An Isotropy Analysis for Self-Supervised Acoustic Unit Embeddings on the Zero Resource Speech Challenge 2021 Framework. ICASSP 2023: 1-5 - [c210]Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura:
Self-Adaptive Incremental Machine Speech Chain for Lombard TTS with High-Granularity ASR Feedback in Dynamic Noise Condition. ICASSP 2023: 1-5 - [c209]Shun Takahashi, Sakriani Sakti:
Unsupervised Learning of Discrete Latent Representations with Data-Adaptive Dimensionality from Continuous Speech Streams. INTERSPEECH 2023: 416-420 - [c208]Chung Tran, Chi Mai Luong, Sakriani Sakti:
STEN-TTS: Improving Zero-shot Cross-Lingual Transfer for Multi-Lingual TTS with Style-Enhanced Normalization Diffusion Framework. INTERSPEECH 2023: 4464-4468 - [c207]Ryo Fukuda, Yuta Nishikawa, Yasumasa Kano, Yuka Ko, Tomoya Yanagita, Kosuke Doi, Mana Makinae, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
NAIST Simultaneous Speech-to-speech Translation System for IWSLT 2023. IWSLT@ACL 2023: 330-340 - [c206]Bella Septina Ika Hartanti, Dipta Tanaya, Kurniawati Azizah, Dessi Puji Lestari, Ayu Purwarianti, Sakriani Sakti:
Generating Speech with Prosodic Prominence based on SSL-Visually Grounded Models. O-COCOSDA 2023: 1-6 - [c205]Hang Xi, Sakriani Sakti:
Exploring Difficulties Encountered by Professional Interpreters in Japanese-to-English and English-to-Japanese Simultaneous Translation. O-COCOSDA 2023: 1-6 - [i33]Heli Qi
, Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
SpeeChain: A Speech Toolkit for Large-Scale Machine Speech Chain. CoRR abs/2301.02966 (2023) - 2022
- [j49]Fan Yang, Zheng Wang, Yang Wu, Sakriani Sakti, Satoshi Nakamura:
Tackling multiple object tracking with complicated motions - Re-designing the integration of motion and appearance. Image Vis. Comput. 124: 104514 (2022) - [j48]Bin Wu
, Sakriani Sakti
, Jinsong Zhang, Satoshi Nakamura
:
Modeling Unsupervised Empirical Adaptation by DPGMM and DPGMM-RNN Hybrid Model to Extract Perceptual Features for Low-Resource ASR. IEEE ACM Trans. Audio Speech Lang. Process. 30: 901-916 (2022) - [j47]Sashi Novitasari
, Sakriani Sakti
, Satoshi Nakamura
:
A Machine Speech Chain Approach for Dynamically Adaptive Lombard TTS in Static and Dynamic Noise Environments. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2673-2688 (2022) - [c204]Heli Qi
, Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura:
Improved Consistency Training for Semi-Supervised Sequence-to-Sequence ASR via Speech Chain Reconstruction and Self-Transcribing. INTERSPEECH 2022: 3413-3417 - [c203]Ryo Fukuda, Yuka Ko, Yasumasa Kano, Kosuke Doi, Hirotaka Tokuyama, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
NAIST Simultaneous Speech-to-Text Translation System for IWSLT 2022. IWSLT@ACL 2022: 286-292 - [c202]Rendi Chevi, Radityo Eko Prasojo, Alham Fikri Aji, Andros Tjandra, Sakriani Sakti:
NIX-TTS: Lightweight and End-to-End Text-to-Speech Via Module-Wise Distillation. SLT 2022: 970-976 - [i32]Heli Qi, Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura:
Improved Consistency Training for Semi-Supervised Sequence-to-Sequence ASR via Speech Chain Reconstruction and Self-Transcribing. CoRR abs/2205.06963 (2022) - [i31]Holy Lovenia, Hiroki Tanaka, Sakriani Sakti, Ayu Purwarianti, Satoshi Nakamura:
Speech Artifact Removal from EEG Recordings of Spoken Word Production with Tensor Decomposition. CoRR abs/2206.00635 (2022) - [i30]Fan Yang, Norimichi Ukita, Sakriani Sakti, Satoshi Nakamura:
Actor-identified Spatiotemporal Action Detection - Detecting Who Is Doing What in Videos. CoRR abs/2208.12940 (2022) - [i29]Fan Yang, Yang Wu, Zheng Wang, Xiang Li, Sakriani Sakti, Satoshi Nakamura:
Instance-level Heterogeneous Domain Adaptation for Limited-labeled Sketch-to-Photo Retrieval. CoRR abs/2211.14515 (2022) - [i28]Samuel Cahyawijaya, Holy Lovenia, Alham Fikri Aji, Genta Indra Winata, Bryan Wilie, Rahmad Mahendra, Christian Wibisono, Ade Romadhony, Karissa Vincentio, Fajri Koto, Jennifer Santoso, David Moeljadi, Cahya Wirawan, Frederikus Hudi, Ivan Halim Parmonangan, Ika Alfina, Muhammad Satrio Wicaksono, Ilham Firdausi Putra, Samsul Rahmadani, Yulianti Oenang, Ali Akbar Septiandri, James Jaya, Kaustubh D. Dhole, Arie Ardiyanti Suryani, Rifki Afina Putri
, Dan Su, Keith Stevens, Made Nindyatama Nityasya, Muhammad Farid Adilazuarda, Ryan Ignatius, Ryandito Diandaru, Tiezheng Yu, Vito Ghifari, Wenliang Dai, Yan Xu, Dyah Damapuspita, Cuk Tho, Ichwanul Muslim Karo Karo
, Tirana Noor Fatyanosa
, Ziwei Ji, Pascale Fung, Graham Neubig, Timothy Baldwin, Sebastian Ruder, Herry Sujaini, Sakriani Sakti, Ayu Purwarianti:
NusaCrowd: Open Source Initiative for Indonesian NLP Resources. CoRR abs/2212.09648 (2022) - 2021
- [j46]Johanes Effendi
, Sakriani Sakti
, Satoshi Nakamura
:
End-to-End Image-to-Speech Generation for Untranscribed Unknown Languages. IEEE Access 9: 55144-55154 (2021) - [j45]Johanes Effendi
, Andros Tjandra
, Sakriani Sakti
, Satoshi Nakamura
:
Multimodal Chain: Cross-Modal Collaboration Through Listening, Speaking, and Visualizing. IEEE Access 9: 70286-70299 (2021) - [j44]Sahoko Nakayama, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Code-Switching ASR and TTS Using Semisupervised Learning with Machine Speech Chain. IEICE Trans. Inf. Syst. 104-D(10): 1661-1677 (2021) - [j43]Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura:
Neural Incremental Speech Recognition Toward Real-Time Machine Speech Translation. IEICE Trans. Inf. Syst. 104-D(12): 2195-2208 (2021) - [j42]Fan Yang
, Xin Chang, Sakriani Sakti, Yang Wu, Satoshi Nakamura:
ReMOT: A model-agnostic refinement for multiple object tracking. Image Vis. Comput. 106: 104091 (2021) - [j41]Bin Wu
, Sakriani Sakti
, Jinsong Zhang, Satoshi Nakamura
:
Tackling Perception Bias in Unsupervised Phoneme Discovery Using DPGMM-RNN Hybrid Model and Functional Load. IEEE ACM Trans. Audio Speech Lang. Process. 29: 348-362 (2021) - [j40]Fan Yang
, Yang Wu, Zheng Wang
, Xiang Li, Sakriani Sakti
, Satoshi Nakamura
:
Instance-Level Heterogeneous Domain Adaptation for Limited-Labeled Sketch-to-Photo Retrieval. IEEE Trans. Multim. 23: 2347-2360 (2021) - [c201]Shun Takahashi, Sakriani Sakti, Satoshi Nakamura:
Unsupervised Neural-Based Graph Clustering for Variable-Length Speech Representation Discovery of Zero-Resource Languages. Interspeech 2021: 1559-1563 - [c200]Johanes Effendi, Sakriani Sakti, Satoshi Nakamura:
Weakly-Supervised Speech-to-Text Mapping with Visually Connected Non-Parallel Speech-Text Data Using Cyclic Partially-Aligned Transformer. Interspeech 2021: 2257-2261 - [c199]Hirotaka Tokuyama, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
Transcribing Paralinguistic Acoustic Cues to Target Language Text in Transformer-Based Speech-to-Text Translation. Interspeech 2021: 2262-2266 - [c198]Yuka Ko, Katsuhito Sudoh, Sakriani Sakti, Satoshi Nakamura:
ASR Posterior-Based Loss for Multi-Task End-to-End Speech Translation. Interspeech 2021: 2272-2276 - [c197]Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura:
Dynamically Adaptive Machine Speech Chain Inference for TTS in Noisy Environment: Listen and Speak Louder. Interspeech 2021: 4124-4128 - [c196]Sara Asai, Koichiro Yoshino, Seitaro Shinagawa, Sakriani Sakti, Satoshi Nakamura:
Eliciting Cooperative Persuasive Dialogue by Multimodal Emotional Robot. IWSDS 2021: 143-158 - [c195]Ryo Fukuda, Yui Oka, Yasumasa Kano, Yuki Yano, Yuka Ko, Hirotaka Tokuyama, Kosuke Doi, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
NAIST English-to-Japanese Simultaneous Translation System for IWSLT 2021 Simultaneous Text-to-text Task. IWSLT 2021: 39-45 - [c194]Nobuya Tachimori, Sakriani Sakti, Satoshi Nakamura:
Multi-Encoder Sequential Attention Network for Context-Aware Speech Recognition in Japanese Dialog Conversation. O-COCOSDA 2021: 1-6 - [c193]Ryo Fukuda, Sashi Novitasari, Yui Oka, Yasumasa Kano, Yuki Yano, Yuka Ko, Hirotaka Tokuyama, Kosuke Doi, Tomoya Yanagita, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
Simultaneous Speech-to-Speech Translation System with Transformer-Based Incremental ASR, MT, and TTS. O-COCOSDA 2021: 186-192 - [c192]Nobuyoshi Kaiki, Sakriani Sakti, Satoshi Nakamura:
Using Local Phrase Dependency Structure Information in Neural Sequence-to-Sequence Speech Synthesis. O-COCOSDA 2021: 206-211 - [c191]Bin Wu, Sakriani Sakti, Satoshi Nakamura:
Incorporating Discriminative DPGMM Posteriorgrams for Low-Resource ASR. SLT 2021: 201-208 - [c190]Takatomo Kano, Sakriani Sakti, Satoshi Nakamura:
Transformer-Based Direct Speech-To-Speech Translation with Transcoder. SLT 2021: 958-965 - 2020
- [j39]Seitaro Shinagawa
, Koichiro Yoshino, Seyed Hossein Alavi, Kallirroi Georgila, David R. Traum, Sakriani Sakti, Satoshi Nakamura:
An Interactive Image Editing System Using an Uncertainty-Based Confirmation Strategy. IEEE Access 8: 98471-98480 (2020) - [j38]The Tung Nguyen
, Koichiro Yoshino, Sakriani Sakti
, Satoshi Nakamura
:
Policy Reuse for Dialog Management Using Action-Relation Probability. IEEE Access 8: 159639-159649 (2020) - [j37]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Recurrent Neural Network Compression Based on Low-Rank Tensor Representation. IEICE Trans. Inf. Syst. 103-D(2): 435-449 (2020) - [j36]Johanes Effendi, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
Leveraging Neural Caption Translation with Visually Grounded Paraphrase Augmentation. IEICE Trans. Inf. Syst. 103-D(3): 674-683 (2020) - [j35]Andros Tjandra
, Sakriani Sakti
, Satoshi Nakamura
:
Machine Speech Chain. IEEE ACM Trans. Audio Speech Lang. Process. 28: 976-989 (2020) - [j34]Takatomo Kano
, Sakriani Sakti
, Satoshi Nakamura
:
End-to-End Speech Translation With Transcoding by Multi-Task Learning for Distant Language Pairs. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1342-1355 (2020) - [j33]Andros Tjandra
, Sakriani Sakti
, Satoshi Nakamura
:
Corrections to "Machine Speech Chain". IEEE ACM Trans. Audio Speech Lang. Process. 28: 1706 (2020) - [c189]Fan Yang, Feiran Li, Yang Wu, Sakriani Sakti, Satoshi Nakamura:
Using Panoramic Videos for Multi-Person Localization and Tracking In A 3D Panoramic Coordinate. ICASSP 2020: 1863-1867 - [c188]Kazuki Tsunematsu, Johanes Effendi, Sakriani Sakti, Satoshi Nakamura:
Neural Speech Completion. INTERSPEECH 2020: 2742-2746 - [c187]Ivan Halim Parmonangan, Hiroki Tanaka
, Sakriani Sakti, Satoshi Nakamura:
Combining Audio and Brain Activity for Predicting Speech Quality. INTERSPEECH 2020: 2762-2766 - [c186]Sashi Novitasari, Andros Tjandra, Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura:
Incremental Machine Speech Chain Towards Enabling Listening While Speaking in Real-Time. INTERSPEECH 2020: 4372-4376 - [c185]Ewan Dunbar, Julien Karadayi, Mathieu Bernard, Xuan-Nga Cao, Robin Algayres, Lucas Ondel, Laurent Besacier, Sakriani Sakti, Emmanuel Dupoux:
The Zero Resource Speech Challenge 2020: Discovering Discrete Subword and Word Units. INTERSPEECH 2020: 4831-4835 - [c184]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge. INTERSPEECH 2020: 4851-4855 - [c183]Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Augmenting Images for ASR and TTS Through Single-Loop and Dual-Loop Multimodal Chain Framework. INTERSPEECH 2020: 4901-4905 - [c182]Sara Asai, Koichiro Yoshino, Seitaro Shinagawa, Sakriani Sakti, Satoshi Nakamura:
Emotional Speech Corpus for Persuasive Dialogue System. LREC 2020: 491-497 - [c181]Mayuko Okamato, Sakriani Sakti, Satoshi Nakamura:
Towards Speech Entrainment: Considering ASR Information in Speaking Rate Variation of TTS Waveform Generation. O-COCOSDA 2020: 139-144 - [c180]Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Cross-Lingual Machine Speech Chain for Javanese, Sundanese, Balinese, and Bataks Speech Recognition and Synthesis. SLTU-CCURL@LREC 2020: 131-138 - [e3]Dorothee Beermann, Laurent Besacier, Sakriani Sakti, Claudia Soria:
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, SLTU-CCURL@LREC 2020, Marseille, France, May 2020. European Language Resources association 2020, ISBN 979-10-95546-35-1 [contents] - [i27]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge. CoRR abs/2005.11676 (2020) - [i26]Fan Yang, Xin Chang, Chenyu Dang, Ziqiang Zheng, Sakriani Sakti, Satoshi Nakamura, Yang Wu:
ReMOTS: Self-Supervised Refining Multi-Object Tracking and Segmentation. CoRR abs/2007.03200 (2020) - [i25]Ewan Dunbar, Julien Karadayi, Mathieu Bernard, Xuan-Nga Cao, Robin Algayres, Lucas Ondel, Laurent Besacier, Sakriani Sakti, Emmanuel Dupoux:
The Zero Resource Speech Challenge 2020: Discovering discrete subword and word units. CoRR abs/2010.05967 (2020) - [i24]Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework. CoRR abs/2011.02099 (2020) - [i23]Sashi Novitasari, Andros Tjandra, Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura:
Incremental Machine Speech Chain Towards Enabling Listening while Speaking in Real-time. CoRR abs/2011.02126 (2020) - [i22]Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition. CoRR abs/2011.02127 (2020) - [i21]Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Cross-Lingual Machine Speech Chain for Javanese, Sundanese, Balinese, and Bataks Speech Recognition and Synthesis. CoRR abs/2011.02128 (2020) - [i20]Katsuhito Sudoh, Takatomo Kano, Sashi Novitasari, Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura:
Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS. CoRR abs/2011.04845 (2020)
2010 – 2019
- 2019
- [j32]Andros Tjandra
, Sakriani Sakti, Satoshi Nakamura:
End-to-End Speech Recognition Sequence Training With Reinforcement Learning. IEEE Access 7: 79758-79769 (2019) - [j31]Fan Yang
, Sakriani Sakti, Yang Wu, Satoshi Nakamura:
A Framework for Knowing Who is Doing What in Aerial Surveillance Videos. IEEE Access 7: 93315-93325 (2019) - [j30]Hiroki Tanaka, Hiroki Watanabe, Hayato Maki, Sakriani Sakti, Satoshi Nakamura:
Electroencephalogram-Based Single-Trial Detection of Language Expectation Violations in Listening to Speech. Frontiers Comput. Neurosci. 13: 15 (2019) - [j29]Hiroki Watanabe, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura:
Neural Oscillation-Based Classification of Japanese Spoken Sentences During Speech Perception. IEICE Trans. Inf. Syst. 102-D(2): 383-391 (2019) - [j28]Nurul Lubis
, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura
:
Positive Emotion Elicitation in Chat-Based Dialogue Systems. IEEE ACM Trans. Audio Speech Lang. Process. 27(4): 866-877 (2019) - [c179]Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Listening While Speaking and Visualizing: Improving ASR Through Multimodal Chain. ASRU 2019: 471-478 - [c178]Takatomo Kano, Sakriani Sakti, Satoshi Nakamura:
Neural Machine Translation with Acoustic Embedding. ASRU 2019: 578-584 - [c177]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Speech-to-Speech Translation Between Untranscribed Unknown Languages. ASRU 2019: 593-600 - [c176]Sahoko Nakayama, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Zero-Shot Code-Switching ASR and TTS with Multilingual Machine Speech Chain. ASRU 2019: 964-971 - [c175]Holy Lovenia, Hiroki Tanaka
, Sakriani Sakti, Ayu Purwarianti, Satoshi Nakamura:
Speech Artifact Removal from Eeg Recordings of Spoken Word Production with Tensor Decomposition. ICASSP 2019: 1115-1119 - [c174]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
End-to-end Feedback Loss in Speech Chain Framework via Straight-through Estimator. ICASSP 2019: 6281-6285 - [c173]Marco Vetter, Sakriani Sakti, Satoshi Nakamura:
Cross-lingual Speech-based Tobi Label Generation Using Bidirectional Lstm. ICASSP 2019: 6620-6624 - [c172]Ewan Dunbar, Robin Algayres, Julien Karadayi, Mathieu Bernard, Juan Benjumea, Xuan-Nga Cao, Lucie Miskic, Charlotte Dugrain, Lucas Ondel, Alan W. Black, Laurent Besacier, Sakriani Sakti, Emmanuel Dupoux:
The Zero Resource Speech Challenge 2019: TTS Without T. INTERSPEECH 2019: 1088-1092 - [c171]Andros Tjandra, Berrak Sisman, Mingyang Zhang, Sakriani Sakti, Haizhou Li
, Satoshi Nakamura:
VQVAE Unsupervised Unit Discovery and Multi-Scale Code2Spec Inverter for Zerospeech Challenge 2019. INTERSPEECH 2019: 1118-1122 - [c170]Ivan Halim Parmonangan, Hiroki Tanaka
, Sakriani Sakti, Shinnosuke Takamichi, Satoshi Nakamura:
Speech Quality Evaluation of Synthesized Japanese Speech Using EEG. INTERSPEECH 2019: 1228-1232 - [c169]Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition. INTERSPEECH 2019: 3835-3839 - [c168]Koichiro Yoshino, Yukitoshi Murase, Nurul Lubis, Kyoshiro Sugiyama, Hiroki Tanaka
, Sakriani Sakti, Shinnosuke Takamichi, Satoshi Nakamura:
Spoken Dialogue Robot for Watching Daily Life of Elderly People. IWSDS 2019: 141-146 - [c167]Fan Yang
, Yang Wu, Sakriani Sakti, Satoshi Nakamura:
Make Skeleton-based Action Recognition Model Smaller, Faster and Better. MMAsia 2019: 31:1-31:6 - [c166]Sahoko Nakayama, Takatomo Kano, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Recognition and translation of code-switching speech utterances. O-COCOSDA 2019: 1-6 - [c165]Mayuko Okamato, Sakriani Sakti, Satoshi Nakamura:
Phoneme-level speaking rate variation on waveform generation using GAN-TTS. O-COCOSDA 2019: 1-7 - [c164]Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura:
Neural iTTS: Toward Synthesizing Speech in Real-time with End-to-end Neural Text-to-Speech Framework. SSW 2019: 183-188 - [i19]