default search action
Jordi Pons
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c31]Jordi Pons, Xiaoyu Liu, Santiago Pascual, Joan Serrà:
GASS: Generalizing Audio Source Separation with Large-Scale Data. ICASSP 2024: 546-550 - [c30]Zach Evans, CJ Carr, Josiah Taylor, Scott H. Hawley, Jordi Pons:
Fast Timing-Conditioned Latent Audio Diffusion. ICML 2024 - [i34]Zach Evans, CJ Carr, Josiah Taylor, Scott H. Hawley, Jordi Pons:
Fast Timing-Conditioned Latent Audio Diffusion. CoRR abs/2402.04825 (2024) - [i33]Zach Evans, Julian D. Parker, CJ Carr, Zack Zukowski, Josiah Taylor, Jordi Pons:
Long-form music generation with latent diffusion. CoRR abs/2404.10301 (2024) - [i32]Zach Evans, Julian D. Parker, CJ Carr, Zack Zukowski, Josiah Taylor, Jordi Pons:
Stable Audio Open. CoRR abs/2407.14358 (2024) - 2023
- [c29]Jordi Pons, Joan Serrà, Santiago Pascual, Giulio Cengarle, Daniel Arteaga, Davide Scaini:
Upsampling Layers for Music Source Separation. EUSIPCO 2023: 311-315 - [c28]Santiago Pascual, Gautam Bhattacharya, Chunghsin Yeh, Jordi Pons, Joan Serrà:
Full-Band General Audio Synthesis with Score-Based Diffusion. ICASSP 2023: 1-5 - [c27]Emilian Postolache, Jordi Pons, Santiago Pascual, Joan Serrà:
Adversarial Permutation Invariant Training for Universal Sound Separation. ICASSP 2023: 1-5 - [c26]Joan Serrà, Davide Scaini, Santiago Pascual, Daniel Arteaga, Jordi Pons, Jeroen Breebaart, Giulio Cengarle:
Mono-to-Stereo Through Parametric Stereo Generation. ISMIR 2023: 304-310 - [c25]Hao-Wen Dong, Xiaoyu Liu, Jordi Pons, Gautam Bhattacharya, Santiago Pascual, Joan Serrà, Taylor Berg-Kirkpatrick, Julian J. McAuley:
CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models. WASPAA 2023: 1-5 - [i31]Jaume Ros, Margarita Geleta, Jordi Pons, Xavier Giró-i-Nieto:
Towards Robust Image-in-Audio Deep Steganography. CoRR abs/2303.05007 (2023) - [i30]Hao-Wen Dong, Xiaoyu Liu, Jordi Pons, Gautam Bhattacharya, Santiago Pascual, Joan Serrà, Taylor Berg-Kirkpatrick, Julian J. McAuley:
CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models. CoRR abs/2306.09635 (2023) - [i29]Joan Serrà, Davide Scaini, Santiago Pascual, Daniel Arteaga, Jordi Pons, Jeroen Breebaart, Giulio Cengarle:
Mono-to-stereo through parametric stereo generation. CoRR abs/2306.14647 (2023) - [i28]Jordi Pons, Xiaoyu Liu, Santiago Pascual, Joan Serrà:
GASS: Generalizing Audio Source Separation with Large-scale Data. CoRR abs/2310.00140 (2023) - 2022
- [j2]Eduardo Fonseca, Xavier Favory, Jordi Pons, Frederic Font, Xavier Serra:
FSD50K: An Open Dataset of Human-Labeled Sound Events. IEEE ACM Trans. Audio Speech Lang. Process. 30: 829-852 (2022) - [c24]Enric Gusó, Jordi Pons, Santiago Pascual, Joan Serrà:
On Loss Functions and Evaluation Metrics for Music Source Separation. ICASSP 2022: 306-310 - [c23]Margarita Geleta, Cristina Punti, Kevin McGuinness, Jordi Pons, Cristian Canton, Xavier Giró-i-Nieto:
Pixinwav: Residual Steganography for Hiding Pixels in Audio. ICASSP 2022: 2485-2489 - [c22]Nicolás Schmidt, Jordi Pons, Marius Miron:
PodcastMix: A dataset for separating music and speech in podcasts. INTERSPEECH 2022: 231-235 - [i27]Enric Gusó, Jordi Pons, Santiago Pascual, Joan Serrà:
On loss functions and evaluation metrics for music source separation. CoRR abs/2202.07968 (2022) - [i26]Joan Serrà, Santiago Pascual, Jordi Pons, R. Oguz Araz, Davide Scaini:
Universal Speech Enhancement with Score-based Diffusion. CoRR abs/2206.03065 (2022) - [i25]Nicolás Schmidt, Jordi Pons, Marius Miron:
PodcastMix: A dataset for separating music and speech in podcasts. CoRR abs/2207.07403 (2022) - [i24]Emilian Postolache, Jordi Pons, Santiago Pascual, Joan Serrà:
Adversarial Permutation Invariant Training for Universal Sound Separation. CoRR abs/2210.12108 (2022) - [i23]Santiago Pascual, Gautam Bhattacharya, Chunghsin Yeh, Jordi Pons, Joan Serrà:
Full-band General Audio Synthesis with Score-based Diffusion. CoRR abs/2210.14661 (2022) - 2021
- [c21]Xiaoyu Liu, Jordi Pons:
On Permutation Invariant Training For Speech Source Separation. ICASSP 2021: 6-10 - [c20]Christian J. Steinmetz, Jordi Pons, Santiago Pascual, Joan Serrà:
Automatic Multitrack Mixing With A Differentiable Mixing Console Of Neural Audio Effects. ICASSP 2021: 71-75 - [c19]Daniel Arteaga, Jordi Pons:
Multichannel-based Learning for Audio Object Extraction. ICASSP 2021: 206-210 - [c18]Joan Serrà, Jordi Pons, Santiago Pascual:
SESQA: Semi-Supervised Learning for Speech Quality Assessment. ICASSP 2021: 381-385 - [c17]Jordi Pons, Santiago Pascual, Giulio Cengarle, Joan Serrà:
Upsampling Artifacts in Neural Audio Synthesis. ICASSP 2021: 3005-3009 - [c16]Santiago Pascual, Joan Serrà, Jordi Pons:
Adversarial Auto-Encoding for Packet Loss Concealment. WASPAA 2021: 71-75 - [i22]Xiaoyu Liu, Jordi Pons:
On permutation invariant training for speech source separation. CoRR abs/2102.04945 (2021) - [i21]Daniel Arteaga, Jordi Pons:
Multichannel-based learning for audio object extraction. CoRR abs/2102.06142 (2021) - [i20]Joan Serrà, Santiago Pascual, Jordi Pons:
On tuning consistent annealed sampling for denoising score matching. CoRR abs/2104.03725 (2021) - [i19]Margarita Geleta, Cristina Punti, Kevin McGuinness, Jordi Pons, Cristian Canton, Xavier Giró-i-Nieto:
PixInWav: Residual Steganography for Hiding Pixels in Audio. CoRR abs/2106.09814 (2021) - [i18]Santiago Pascual, Joan Serrà, Jordi Pons:
Adversarial Auto-Encoding for Packet Loss Concealment. CoRR abs/2107.03100 (2021) - [i17]Jordi Pons, Joan Serrà, Santiago Pascual, Giulio Cengarle, Daniel Arteaga, Davide Scaini:
Upsampling layers for music source separation. CoRR abs/2111.11773 (2021) - 2020
- [c15]Pablo Alonso-Jiménez, Dmitry Bogdanov, Jordi Pons, Xavier Serra:
Tensorflow Audio Models in Essentia. ICASSP 2020: 266-270 - [c14]Berkan Kadioglu, Michael Horgan, Xiaoyu Liu, Jordi Pons, Dan Darcy, Vivek Kumar:
An Empirical Study of Conv-Tasnet. ICASSP 2020: 7264-7268 - [d2]Eduardo Fonseca, Xavier Favory, Jordi Pons, Frederic Font, Xavier Serra:
FSD50K. Zenodo, 2020 - [i16]Berkan Kadioglu, Michael Horgan, Xiaoyu Liu, Jordi Pons, Dan Darcy, Vivek Kumar:
An empirical study of Conv-TasNet. CoRR abs/2002.08688 (2020) - [i15]Pablo Alonso-Jiménez, Dmitry Bogdanov, Jordi Pons, Xavier Serra:
TensorFlow Audio Models in Essentia. CoRR abs/2003.07393 (2020) - [i14]Joan Serrà, Jordi Pons, Santiago Pascual:
SESQA: semi-supervised learning for speech quality assessment. CoRR abs/2010.00368 (2020) - [i13]Eduardo Fonseca, Xavier Favory, Jordi Pons, Frederic Font, Xavier Serra:
FSD50K: an Open Dataset of Human-Labeled Sound Events. CoRR abs/2010.00475 (2020) - [i12]Christian J. Steinmetz, Jordi Pons, Santiago Pascual, Joan Serrà:
Automatic multitrack mixing with a differentiable mixing console of neural audio effects. CoRR abs/2010.10291 (2020) - [i11]Jordi Pons, Santiago Pascual, Giulio Cengarle, Joan Serrà:
Upsampling artifacts in neural audio synthesis. CoRR abs/2010.14356 (2020)
2010 – 2019
- 2019
- [b1]Jordi Pons:
Deep neural networks for music and audio tagging. Pompeu Fabra University, Spain, 2019 - [c13]Jordi Pons, Joan Serrà, Xavier Serra:
Training Neural Audio Classifiers with Few Data. ICASSP 2019: 16-20 - [c12]Jordi Pons, Xavier Serra:
Randomly Weighted CNNs for (Music) Audio Classification. ICASSP 2019: 336-340 - [c11]Francesc Lluís, Jordi Pons, Xavier Serra:
End-to-End Music Source Separation: Is it Possible in the Waveform Domain? INTERSPEECH 2019: 4619-4623 - [d1]Eduardo Fonseca, Xavier Favory, Jordi Pons, Frederic Font, Manoj Plakal, Daniel P. W. Ellis, Xavier Serra:
FSDKaggle2018. Zenodo, 2019 - [i10]Jordi Pons, Xavier Serra:
musicnn: Pre-trained convolutional neural networks for music audio tagging. CoRR abs/1909.06654 (2019) - 2018
- [c10]Eduardo Fonseca, Manoj Plakal, Frederic Font, Daniel P. W. Ellis, Xavier Favory, Jordi Pons, Xavier Serra:
General-purpose tagging of Freesound audio with AudioSet labels: task description, dataset, and baseline. DCASE 2018: 69-73 - [c9]Dario Rethage, Jordi Pons, Xavier Serra:
A Wavenet for Speech Denoising. ICASSP 2018: 5069-5073 - [c8]Jordi Pons, Oriol Nieto, Matthew Prockup, Erik M. Schmidt, Andreas F. Ehmann, Xavier Serra:
End-to-end Learning for Music Audio Tagging at Scale. ISMIR 2018: 637-644 - [i9]Jordi Pons, Xavier Serra:
Randomly weighted CNNs for (music) audio classification. CoRR abs/1805.00237 (2018) - [i8]Eduardo Fonseca, Manoj Plakal, Frederic Font, Daniel P. W. Ellis, Xavier Favory, Jordi Pons, Xavier Serra:
General-purpose Tagging of Freesound Audio with AudioSet Labels: Task Description, Dataset, and Baseline. CoRR abs/1807.09902 (2018) - [i7]Jordi Pons, Joan Serrà, Xavier Serra:
Training neural audio classifiers with few data. CoRR abs/1810.10274 (2018) - [i6]Francesc Lluís, Jordi Pons, Xavier Serra:
End-to-end music source separation: is it possible in the waveform domain? CoRR abs/1810.12187 (2018) - 2017
- [c7]Jordi Pons, Olga Slizovskaia, Rong Gong, Emilia Gómez, Xavier Serra:
Timbre analysis of music audio signals with convolutional neural networks. EUSIPCO 2017: 2744-2748 - [c6]Jordi Pons, Xavier Serra:
Designing efficient architectures for modeling temporal features with convolutional neural networks. ICASSP 2017: 2472-2476 - [c5]Jordi Pons, Rong Gong, Xavier Serra:
Score-Informed Syllable Segmentation for A Cappella Singing Voice with Convolutional Neural Networks. ISMIR 2017: 383-389 - [c4]Rong Gong, Jordi Pons, Xavier Serra:
Audio to Score Matching by Combining Phonetic and Duration Information. ISMIR 2017: 428-434 - [c3]Eduardo Fonseca, Jordi Pons, Xavier Favory, Frederic Font, Dmitry Bogdanov, Andres Ferraro, Sergio Oramas, Alastair Porter, Xavier Serra:
Freesound Datasets: A Platform for the Creation of Open Audio Datasets. ISMIR 2017: 486-493 - [i5]Jordi Pons, Olga Slizovskaia, Rong Gong, Emilia Gómez, Xavier Serra:
Timbre Analysis of Music Audio Signals with Convolutional Neural Networks. CoRR abs/1703.06697 (2017) - [i4]Dario Rethage, Jordi Pons, Xavier Serra:
A Wavenet for Speech Denoising. CoRR abs/1706.07162 (2017) - [i3]Jordi Pons, Rong Gong, Xavier Serra:
Score-informed syllable segmentation for a cappella singing voice with convolutional neural networks. CoRR abs/1707.03544 (2017) - [i2]Rong Gong, Jordi Pons, Xavier Serra:
Audio to score matching by combining phonetic and duration information. CoRR abs/1707.03547 (2017) - [i1]Jordi Pons, Oriol Nieto, Matthew Prockup, Erik M. Schmidt, Andreas F. Ehmann, Xavier Serra:
End-to-end learning for music audio tagging at scale. CoRR abs/1711.02520 (2017) - 2016
- [c2]Jordi Pons, Thomas Lidy, Xavier Serra:
Experimenting with musically motivated convolutional neural networks. CBMI 2016: 1-6 - 2015
- [c1]Axel Roebel, Jordi Pons, Marco Liuni, Mathieu Lagrange:
On automatic drum transcription using non-negative matrix deconvolution and itakura saito divergence. ICASSP 2015: 414-418 - 2010
- [j1]Elena Valderrama, Mercè Rullán-Ayza, Fermín Sánchez, Jordi Pons, Claudi Mans i Teixidó, Francesc Giné, Gonzalo Seco, Laureà Jiménez, Enric Peig, Julián Carrera, Asunción Moreno, Jordi García, Julio Pérez, Ramón Vilanova, Fernando Cores, Josep Maria Renau, Javier Tejero, Jesús Bisbal:
La Evaluación de Competencias en los Trabajos Fin de Estudios. Rev. Iberoam. de Tecnol. del Aprendiz. 5(3): 107-114 (2010)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 21:19 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint