R. J. Skerry-Ryan

2020 – today

2024
[c14]
Eliya Nachmani, Alon Levkovitch, Roy Hirsch, Julian Salazar, Chulayuth Asawaroengchai, Soroosh Mariooryad, Ehud Rivlin, R. J. Skerry-Ryan, Michelle Tadmor Ramanovich:
Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM. ICLR 2024
2023
[i18]
Eliya Nachmani, Alon Levkovitch, Julian Salazar, Chulayuth Asawaroengchai, Soroosh Mariooryad, R. J. Skerry-Ryan, Michelle Tadmor Ramanovich:
LMs with a Voice: Spoken Language Modeling beyond Speech Tokens. CoRR abs/2305.15255 (2023)
2022
[c13]
Daisy Stanton, Matt Shannon, Soroosh Mariooryad, R. J. Skerry-Ryan, Eric Battenberg, Tom Bagby, David Kao:
Speaker Generation. ICASSP 2022: 7897-7901
[i17]
Soroosh Mariooryad, Matt Shannon, Siyuan Ma, Tom Bagby, David Kao, Daisy Stanton, Eric Battenberg, R. J. Skerry-Ryan:
Learning the joint distribution of two sequences using little or no paired data. CoRR abs/2212.03232 (2022)
2021
[c12]
Ron J. Weiss, R. J. Skerry-Ryan, Eric Battenberg, Soroosh Mariooryad, Diederik P. Kingma:
Wave-Tacotron: Spectrogram-Free End-to-End Text-to-Speech Synthesis. ICASSP 2021: 5679-5683
[c11]
Isaac Elias, Heiga Zen, Jonathan Shen, Yu Zhang, Ye Jia, R. J. Skerry-Ryan, Yonghui Wu:
Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling. Interspeech 2021: 141-145
[i16]
Isaac Elias, Heiga Zen, Jonathan Shen, Yu Zhang, Ye Jia, R. J. Skerry-Ryan, Yonghui Wu:
Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling. CoRR abs/2103.14574 (2021)
[i15]
Daisy Stanton, Matt Shannon, Soroosh Mariooryad, R. J. Skerry-Ryan, Eric Battenberg, Tom Bagby, David Kao:
Speaker Generation. CoRR abs/2111.05095 (2021)
2020
[c10]
Eric Battenberg, R. J. Skerry-Ryan, Soroosh Mariooryad, Daisy Stanton, David Kao, Matt Shannon, Tom Bagby:
Location-Relative Attention Mechanisms for Robust Long-Form Speech Synthesis. ICASSP 2020: 6194-6198
[c9]
Raza Habib, Soroosh Mariooryad, Matt Shannon, Eric Battenberg, R. J. Skerry-Ryan, Daisy Stanton, David Kao, Tom Bagby:
Semi-Supervised Generative Modeling for Controllable Speech Synthesis. ICLR 2020
[i14]
Matt Shannon, Ben Poole, Soroosh Mariooryad, Tom Bagby, Eric Battenberg, David Kao, Daisy Stanton, R. J. Skerry-Ryan:
Non-saturating GAN training as divergence minimization. CoRR abs/2010.08029 (2020)
[i13]
Ron J. Weiss, R. J. Skerry-Ryan, Eric Battenberg, Soroosh Mariooryad, Diederik P. Kingma:
Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis. CoRR abs/2011.03568 (2020)

2010 – 2019

2019
[c8]
Yu-An Chung, Yuxuan Wang, Wei-Ning Hsu, Yu Zhang, R. J. Skerry-Ryan:
Semi-supervised Training for Improving Data Efficiency in End-to-end Speech Synthesis. ICASSP 2019: 6940-6944
[c7]
Yu Zhang, Ron J. Weiss, Heiga Zen, Yonghui Wu, Zhifeng Chen, R. J. Skerry-Ryan, Ye Jia, Andrew Rosenberg, Bhuvana Ramabhadran:
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning. INTERSPEECH 2019: 2080-2084
[i12]
Izhak Shafran, Tom Bagby, R. J. Skerry-Ryan:
Complex Evolution Recurrent Neural Networks (ceRNNs). CoRR abs/1906.02246 (2019)
[i11]
Eric Battenberg, Soroosh Mariooryad, Daisy Stanton, R. J. Skerry-Ryan, Matt Shannon, David Kao, Tom Bagby:
Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis. CoRR abs/1906.03402 (2019)
[i10]
Yu Zhang, Ron J. Weiss, Heiga Zen, Yonghui Wu, Zhifeng Chen, R. J. Skerry-Ryan, Ye Jia, Andrew Rosenberg, Bhuvana Ramabhadran:
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning. CoRR abs/1907.04448 (2019)
[i9]
Raza Habib, Soroosh Mariooryad, Matt Shannon, Eric Battenberg, R. J. Skerry-Ryan, Daisy Stanton, David Kao, Tom Bagby:
Semi-Supervised Generative Modeling for Controllable Speech Synthesis. CoRR abs/1910.01709 (2019)
[i8]
Eric Battenberg, R. J. Skerry-Ryan, Soroosh Mariooryad, Daisy Stanton, David Kao, Matt Shannon, Tom Bagby:
Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis. CoRR abs/1910.10288 (2019)
2018
[c6]
Jonathan Shen, Ruoming Pang, Ron J. Weiss, Mike Schuster, Navdeep Jaitly, Zongheng Yang, Zhifeng Chen, Yu Zhang, Yuxuan Wang, R. J. Skerry-Ryan, Rif A. Saurous, Yannis Agiomyrgiannakis, Yonghui Wu:
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. ICASSP 2018: 4779-4783
[c5]
Izhak Shafran, Tom Bagby, R. J. Skerry-Ryan:
Complex Evolution Recurrent Neural Networks (ceRNNs). ICASSP 2018: 5854-5858
[c4]
R. J. Skerry-Ryan, Eric Battenberg, Ying Xiao, Yuxuan Wang, Daisy Stanton, Joel Shor, Ron J. Weiss, Rob Clark, Rif A. Saurous:
Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron. ICML 2018: 4700-4709
[c3]
Yuxuan Wang, Daisy Stanton, Yu Zhang, R. J. Skerry-Ryan, Eric Battenberg, Joel Shor, Ying Xiao, Ye Jia, Fei Ren, Rif A. Saurous:
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis. ICML 2018: 5167-5176
[c2]
Daisy Stanton, Yuxuan Wang, R. J. Skerry-Ryan:
Predicting Expressive Speaking Style from Text in End-To-End Speech Synthesis. SLT 2018: 595-602
[i7]
Yuxuan Wang, Daisy Stanton, Yu Zhang, R. J. Skerry-Ryan, Eric Battenberg, Joel Shor, Ying Xiao, Fei Ren, Ye Jia, Rif A. Saurous:
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis. CoRR abs/1803.09017 (2018)
[i6]
R. J. Skerry-Ryan, Eric Battenberg, Ying Xiao, Yuxuan Wang, Daisy Stanton, Joel Shor, Ron J. Weiss, Rob Clark, Rif A. Saurous:
Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron. CoRR abs/1803.09047 (2018)
[i5]
Daisy Stanton, Yuxuan Wang, R. J. Skerry-Ryan:
Predicting Expressive Speaking Style From Text In End-To-End Speech Synthesis. CoRR abs/1808.01410 (2018)
[i4]
Yu-An Chung, Yuxuan Wang, Wei-Ning Hsu, Yu Zhang, R. J. Skerry-Ryan:
Semi-Supervised Training for Improving Data Efficiency in End-to-End Speech Synthesis. CoRR abs/1808.10128 (2018)
2017
[c1]
Yuxuan Wang, R. J. Skerry-Ryan, Daisy Stanton, Yonghui Wu, Ron J. Weiss, Navdeep Jaitly, Zongheng Yang, Ying Xiao, Zhifeng Chen, Samy Bengio, Quoc V. Le, Yannis Agiomyrgiannakis, Rob Clark, Rif A. Saurous:
Tacotron: Towards End-to-End Speech Synthesis. INTERSPEECH 2017: 4006-4010
[i3]
Yuxuan Wang, R. J. Skerry-Ryan, Daisy Stanton, Yonghui Wu, Ron J. Weiss, Navdeep Jaitly, Zongheng Yang, Ying Xiao, Zhifeng Chen, Samy Bengio, Quoc V. Le, Yannis Agiomyrgiannakis, Rob Clark, Rif A. Saurous:
Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model. CoRR abs/1703.10135 (2017)
[i2]
Yuxuan Wang, R. J. Skerry-Ryan, Ying Xiao, Daisy Stanton, Joel Shor, Eric Battenberg, Rob Clark, Rif A. Saurous:
Uncovering Latent Style Factors for Expressive Speech Synthesis. CoRR abs/1711.00520 (2017)
[i1]
Jonathan Shen, Ruoming Pang, Ron J. Weiss, Mike Schuster, Navdeep Jaitly, Zongheng Yang, Zhifeng Chen, Yu Zhang, Yuxuan Wang, R. J. Skerry-Ryan, Rif A. Saurous, Yannis Agiomyrgiannakis, Yonghui Wu:
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. CoRR abs/1712.05884 (2017)
