Stop the war!

Остановите войну!

for scientists:

default search action

combined dblp search
author search
venue search
publication search

ask others

Akihiko Takashima

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2023
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/MasumuraMITTO23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/MasumuraMITTO23
Ryo Masumura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi:
Text-to-Text Pre-Training with Paraphrasing for Improving Transformer-Based Image Captioning. EUSIPCO 2023: 516-520
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/icip/UchidaOTYM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icip/UchidaOTYM23
Mihiro Uchida, Shota Orihashi, Akihiko Takashima, Yoshihiro Yamazaki, Ryo Masumura:
Open-Set Recognition for Facial-Expression Recognition. ICIP 2023: 780-784
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/icip/OrihashiYUTM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icip/OrihashiYUTM23
Shota Orihashi, Yoshihiro Yamazaki, Mihiro Uchida, Akihiko Takashima, Ryo Masumura:
Distilling Knowledge of Bidirectional Language Model for Scene Text Recognition. ICIP 2023: 2165-2169
[c23]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MasumuraMYYMIUS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MasumuraMYYMIUS23
Ryo Masumura, Naoki Makishima, Taiga Yamane, Yoshihiko Yamazaki, Saki Mizuno, Mana Ihori, Mihiro Uchida, Keita Suzuki, Hiroshi Sato, Tomohiro Tanaka, Akihiko Takashima, Satoshi Suzuki, Takafumi Moriya, Nobukatsu Hojo, Atsushi Ando:
End-to-End Joint Target and Non-Target Speakers ASR. INTERSPEECH 2023: 2903-2907
[i16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-02273
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-02273
Ryo Masumura, Naoki Makishima, Taiga Yamane, Yoshihiko Yamazaki, Saki Mizuno, Mana Ihori, Mihiro Uchida, Keita Suzuki, Hiroshi Sato, Tomohiro Tanaka, Akihiko Takashima, Satoshi Suzuki, Takafumi Moriya, Nobukatsu Hojo, Atsushi Ando:
End-to-End Joint Target and Non-Target Speakers ASR. CoRR abs/2306.02273 (2023)
2022
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/icip/OrihashiYUTM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icip/OrihashiYUTM22
Shota Orihashi, Yoshihiro Yamazaki, Mihiro Uchida, Akihiko Takashima, Ryo Masumura:
Fully Shareable Scene Text Recognition Modeling for Horizontal and Vertical Writing. ICIP 2022: 2636-2640
[c21]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MasumuraYMMIUST22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MasumuraYMMIUST22
Ryo Masumura, Yoshihiro Yamazaki, Saki Mizuno, Naoki Makishima, Mana Ihori, Mihiro Uchida, Hiroshi Sato, Tomohiro Tanaka, Akihiko Takashima, Satoshi Suzuki, Shota Orihashi, Takafumi Moriya, Nobukatsu Hojo, Atsushi Ando:
End-to-End Joint Modeling of Conversation History-Dependent and Independent ASR Systems with Multi-History Training. INTERSPEECH 2022: 3218-3222
[c20]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TakashimaMAYUO22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TakashimaMAYUO22
Akihiko Takashima, Ryo Masumura, Atsushi Ando, Yoshihiro Yamazaki, Mihiro Uchida, Shota Orihashi:
Interactive Co-Learning with Cross-Modal Transformer for Audio-Visual Emotion Recognition. INTERSPEECH 2022: 4740-4744
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/AndoMTSMSMAS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/AndoMTSMSMAS22
Atsushi Ando, Ryo Masumura, Akihiko Takashima, Satoshi Suzuki, Naoki Makishima, Keita Suzuki, Takafumi Moriya, Takanori Ashihara, Hiroshi Sato:
On the Use of Modality-Specific Large-Scale Pre-Trained Encoders for Multimodal Sentiment Analysis. SLT 2022: 739-746
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2202-09979
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-09979
Yoshihiro Yamazaki, Shota Orihashi, Ryo Masumura, Mihiro Uchida, Akihiko Takashima:
Audio Visual Scene-Aware Dialog Generation with Transformer-based Video Representations. CoRR abs/2202.09979 (2022)
[i14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-15937
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-15937
Atsushi Ando, Ryo Masumura, Akihiko Takashima, Satoshi Suzuki, Naoki Makishima, Keita Suzuki, Takafumi Moriya, Takanori Ashihara, Hiroshi Sato:
On the Use of Modality-Specific Large-Scale Pre-Trained Encoders for Multimodal Sentiment Analysis. CoRR abs/2210.15937 (2022)
2021
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/OrihashiYMITTM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/OrihashiYMITTM21
Shota Orihashi, Yoshihiro Yamazaki, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Ryo Masumura:
Hierarchical Knowledge Distillation for Dialogue Sequence Labeling. ASRU 2021: 433-440
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MasumuraMITTO21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MasumuraMITTO21
Ryo Masumura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi:
Hierarchical Transformer-Based Large-Context End-To-End ASR with Large-Context Knowledge Distillation. ICASSP 2021: 5879-5883
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MakishimaITTOM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MakishimaITTOM21
Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi, Ryo Masumura:
Audio-Visual Speech Separation Using Cross-Modal Correspondence Loss. ICASSP 2021: 6673-6677
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/IhoriMTTOM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/IhoriMTTOM21
Mana Ihori, Naoki Makishima, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi, Ryo Masumura:
MAPGN: Masked Pointer-Generator Network for Sequence-to-Sequence Pre-Training. ICASSP 2021: 7563-7567
[c14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MakishimaITTOM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MakishimaITTOM21
Naoki Makishima, Mana Ihori, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi, Ryo Masumura:
Enrollment-Less Training for Personalized Voice Activity Detection. Interspeech 2021: 346-350
[c13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/IhoriMTTOM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/IhoriMTTOM21
Mana Ihori, Naoki Makishima, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi, Ryo Masumura:
Zero-Shot Joint Modeling of Multiple Spoken-Text-Style Conversion Tasks Using Switching Tokens. Interspeech 2021: 776-780
[c12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MasumuraOMITTO21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MasumuraOMITTO21
Ryo Masumura, Daiki Okamura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi:
Unified Autoregressive Modeling for Joint End-to-End Multi-Talker Overlapped Speech Recognition and Speaker Attribute Estimation. Interspeech 2021: 2591-2595
[c11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TanakaMITMAOM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TanakaMITMAOM21
Tomohiro Tanaka, Ryo Masumura, Mana Ihori, Akihiko Takashima, Takafumi Moriya, Takanori Ashihara, Shota Orihashi, Naoki Makishima:
Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition. Interspeech 2021: 4059-4063
[c10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TanakaMITOM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TanakaMITOM21
Tomohiro Tanaka, Ryo Masumura, Mana Ihori, Akihiko Takashima, Shota Orihashi, Naoki Makishima:
End-to-End Rich Transcription-Style Automatic Speech Recognition with Semi-Supervised Learning. Interspeech 2021: 4458-4462
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/mmasia/OrihashiYMITTM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mmasia/OrihashiYMITTM21
Shota Orihashi, Yoshihiro Yamazaki, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Ryo Masumura:
Utilizing Resource-Rich Language Datasets for End-to-End Scene Text Recognition in Resource-Poor Languages. MMAsia 2021: 41:1-41:5
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/MasumuraMITTO21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/MasumuraMITTO21
Ryo Masumura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi:
Large-Context Conversational Representation Learning: Self-Supervised Learning For Conversational Documents. SLT 2021: 1012-1019
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2102-07380
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-07380
Mana Ihori, Naoki Makishima, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi, Ryo Masumura:
MAPGN: MAsked Pointer-Generator Network for sequence-to-sequence pre-training. CoRR abs/2102.07380 (2021)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2102-07935
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-07935
Ryo Masumura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi:
Hierarchical Transformer-based Large-Context End-to-end ASR with Large-Context Knowledge Distillation. CoRR abs/2102.07935 (2021)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2102-08147
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-08147
Ryo Masumura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi:
Large-Context Conversational Representation Learning: Self-Supervised Learning for Conversational Documents. CoRR abs/2102.08147 (2021)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2102-08154
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-08154
Ryo Masumura, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Takanori Ashihara:
End-to-End Automatic Speech Recognition with Deep Mutual Learning. CoRR abs/2102.08154 (2021)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2103-01463
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-01463
Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi, Ryo Masumura:
Audio-Visual Speech Separation Using Cross-Modal Correspondence Loss. CoRR abs/2103.01463 (2021)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2106-12131
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-12131
Mana Ihori, Naoki Makishima, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi, Ryo Masumura:
Zero-Shot Joint Modeling of Multiple Spoken-Text-Style Conversion Tasks using Switching Tokens. CoRR abs/2106.12131 (2021)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2106-12132
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-12132
Naoki Makishima, Mana Ihori, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi, Ryo Masumura:
Enrollment-less training for personalized voice activity detection. CoRR abs/2106.12132 (2021)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2107-01549
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-01549
Ryo Masumura, Daiki Okamura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi:
Unified Autoregressive Modeling for Joint End-to-End Multi-Talker Overlapped Speech Recognition and Speaker Attribute Estimation. CoRR abs/2107.01549 (2021)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2107-01569
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-01569
Tomohiro Tanaka, Ryo Masumura, Mana Ihori, Akihiko Takashima, Takafumi Moriya, Takanori Ashihara, Shota Orihashi, Naoki Makishima:
Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition. CoRR abs/2107.01569 (2021)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2107-05382
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-05382
Tomohiro Tanaka, Ryo Masumura, Mana Ihori, Akihiko Takashima, Shota Orihashi, Naoki Makishima:
End-to-End Rich Transcription-Style Automatic Speech Recognition with Semi-Supervised Learning. CoRR abs/2107.05382 (2021)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2111-10957
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-10957
Shota Orihashi, Yoshihiro Yamazaki, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Ryo Masumura:
Hierarchical Knowledge Distillation for Dialogue Sequence Labeling. CoRR abs/2111.10957 (2021)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2111-12276
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-12276
Shota Orihashi, Yoshihiro Yamazaki, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Ryo Masumura:
Utilizing Resource-Rich Language Datasets for End-to-End Scene Text Recognition in Resource-Poor Languages. CoRR abs/2111.12276 (2021)
2020
[c7]
- view
  - electronic edition @ ieee.org
  - no references & citations available
- export record
  dblp key:
  - conf/apsipa/MasumuraITTA20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/MasumuraITTA20
Ryo Masumura, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Takanori Ashihara:
End-to-End Automatic Speech Recognition with Deep Mutual Learning. APSIPA 2020: 632-637
[c6]
- view
  - electronic edition @ ieee.org
  - no references & citations available
- export record
  dblp key:
  - conf/apsipa/TakashimaMITOM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/TakashimaMITOM20
Akihiko Takashima, Naoki Makishima, Mana Ihori, Tomohiro Tanaka, Shota Orihashi, Ryo Masumura:
Unsupervised Domain Adversarial Training in Angular Space for Facial Expression Recognition. APSIPA 2020: 1054-1059
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MasumuraITMAS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MasumuraITMAS20
Ryo Masumura, Mana Ihori, Akihiko Takashima, Takafumi Moriya, Atsushi Ando, Yusuke Shinohara:
Sequence-Level Consistency Training for Semi-Supervised End-to-End Automatic Speech Recognition. ICASSP 2020: 7054-7058
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/IhoriTM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/IhoriTM20
Mana Ihori, Akihiko Takashima, Ryo Masumura:
Large-Context Pointer-Generator Networks for Spoken-to-Written Style Conversion. ICASSP 2020: 8189-8193
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/inlg/IhoriMMTTO20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/inlg/IhoriMMTTO20
Mana Ihori, Ryo Masumura, Naoki Makishima, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi:
Memory Attentive Fusion: External Language Model Integration for Transformer-based Sequence-to-Sequence Model. INLG 2020: 1-6
[c2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MasumuraMITTO20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MasumuraMITTO20
Ryo Masumura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi:
Phoneme-to-Grapheme Conversion Based Large-Scale Pre-Training for End-to-End Automatic Speech Recognition. INTERSPEECH 2020: 2822-2826
[c1]
- view
  - electronic edition @ aclanthology.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/lrec/IhoriTM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/lrec/IhoriTM20
Mana Ihori, Akihiko Takashima, Ryo Masumura:
Parallel Corpus for Japanese Spoken-to-Written Style Conversion. LREC 2020: 6346-6353
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-15437
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-15437
Mana Ihori, Ryo Masumura, Naoki Makishima, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi:
Memory Attentive Fusion: External Language Model Integration for Transformer-based Sequence-to-Sequence Model. CoRR abs/2010.15437 (2020)

Coauthor Index

see FAQ

a service of

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.