default search action

combined dblp search
author search
venue search
publication search

ask others

Tuomas Virtanen

> Home > Persons

Person information

affiliation: Tampere University of Technology, Finland

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[j49]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/pieee/TriantafyllopoulosTGMVS25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pieee/TriantafyllopoulosTGMVS25
Andreas Triantafyllopoulos, Iosif Tsangko, Alexander Gebhard, Annamaria Mesaros, Tuomas Virtanen, Björn W. Schuller:
Computer Audition: From Task-Specific Machine Learning to Foundation Models. Proc. IEEE 113(4): 317-343 (2025)
[j48]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/XieKRV25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/XieKRV25
Huang Xie, Khazar Khorrami, Okko Räsänen, Tuomas Virtanen:
Text-Based Audio Retrieval by Learning From Similarities Between Audio Captions. IEEE Signal Process. Lett. 32: 221-225 (2025)
[j47]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/MartinssonVSM25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/MartinssonVSM25
John Martinsson, Tuomas Virtanen, Maria Sandsten, Olof Mogren:
The Accuracy Cost of Weakness: A Theoretical Analysis of Fixed-Segment Weak Labeling for Events in Time. Trans. Mach. Learn. Res. 2025 (2025)
[c191]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/SudarsanamMV25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/SudarsanamMV25
Parthasaarathy Sudarsanam, Irene Martín-Morató, Tuomas Virtanen:
Representation Learning for Semantic Alignment of Language, Audio, and Visual Modalities. EUSIPCO 2025: 51-55
[c190]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/ZhangV25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/ZhangV25
Shiqi Zhang, Tuomas Virtanen:
Hybrid Disagreement-Diversity Active Learning for Bioacoustic Sound Event Detection. EUSIPCO 2025: 131-135
[c189]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/TunturiDPV25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/TunturiDPV25
Eetu Tunturi, David Diaz-Guerra, Archontis Politis, Tuomas Virtanen:
Score-Informed Music Source Separation: Improving Synthetic-To-Real Generalization in Classical Music. EUSIPCO 2025: 1238-1242
[c188]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/NeriV25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/NeriV25
Michael Neri, Tuomas Virtanen:
Impact of Microphone Array Mismatches to Learning-Based Replay Speech Detection. EUSIPCO 2025: 1243-1247
[c187]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HeikkinenPDV25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HeikkinenPDV25
Mikko Heikkinen, Archontis Politis, Konstantinos Drossos, Tuomas Virtanen:
Gen-A: Generalizing Ambisonics Neural Encoding to Unseen Microphone Arrays. ICASSP 2025: 1-5
[c186]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MesarosSHVP25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MesarosSHVP25
Annamaria Mesaros, Romain Serizel, Toni Heittola, Tuomas Virtanen, Mark D. Plumbley:
A decade of DCASE: Achievements, practices, evaluations and future challenges. ICASSP 2025: 1-5
[c185]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DaiPV25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DaiPV25
Wang Dai, Archontis Politis, Tuomas Virtanen:
Inter-Speaker Relative Cues for Text-Guided Target Speech Extraction. INTERSPEECH 2025
[c184]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangPDV25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangPDV25
Yuzhu Wang, Archontis Politis, Konstantinos Drossos, Tuomas Virtanen:
Attractor-Based Speech Separation of Multiple Utterances by Unknown Number of Speakers. INTERSPEECH 2025
[c183]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/WangPDV25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/WangPDV25
Yuzhu Wang, Archontis Politis, Konstantinos Drossos, Tuomas Virtanen:
Multi-Utterance Speech Separation and Association Trained on Short Segments. WASPAA 2025: 1-5
[d25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - data/11/HeittolaMV25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/11/HeittolaMV25
Toni Kristian Heittola, Annamaria Mesaros, Tuomas Virtanen:
TAU Urban Acoustic Scenes 2025 Mobile, Evaluation dataset. Zenodo, 2025
[d24]
- view
  authority control:
- export record
  dblp key:
  - data/11/ShimadaPRSDPUKTSTVM25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/11/ShimadaPRSDPUKTSTVM25
Kazuki Shimada, Archontis Politis, Irán R. Román, Parthasaarathy Sudarsanam, David Diaz-Guerra, Ruchi Pandey, Kengo Uchida, Yuichiro Koyama, Naoya Takahashi, Takashi Shibuya, Shusuke Takahashi, Tuomas Virtanen, Yuki Mitsufuji:
DCASE2025 Task3 Stereo SELD Dataset. Version 1.0.0. Zenodo, 2025 [all versions]
[d23]
- view
  authority control:
- export record
  dblp key:
  - data/11/ShimadaPRSDPUKTSTVM25a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/11/ShimadaPRSDPUKTSTVM25a
Kazuki Shimada, Archontis Politis, Irán R. Román, Parthasaarathy Sudarsanam, David Diaz-Guerra, Ruchi Pandey, Kengo Uchida, Yuichiro Koyama, Naoya Takahashi, Takashi Shibuya, Shusuke Takahashi, Tuomas Virtanen, Yuki Mitsufuji:
DCASE2025 Task3 Stereo SELD Dataset. Version 1.1.0. Zenodo, 2025 [all versions]
[d22]
- view
  authority control:
- export record
  dblp key:
  - data/11/TunturiDPV25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/11/TunturiDPV25
Eetu Tunturi, David Diaz-Guerra, Archontis Politis, Tuomas Virtanen:
SynthSOD aligned scores. Version 1. Zenodo, 2025 [all versions]
[d21]
- view
  authority control:
- export record
  dblp key:
  - data/11/TunturiDPV25a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/11/TunturiDPV25a
Eetu Tunturi, David Diaz-Guerra, Archontis Politis, Tuomas Virtanen:
SynthSOD aligned scores. Version v2. Zenodo, 2025 [all versions]
[i97]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-08047
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-08047
Mikko Heikkinen, Archontis Politis, Konstantinos Drossos, Tuomas Virtanen:
Gen-A: Generalizing Ambisonics Neural Encoding to Unseen Microphone Arrays. CoRR abs/2501.08047 (2025)
[i96]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-08129
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-08129
Aapo Hakala, Trevor Kincy, Tuomas Virtanen:
Automatic Live Music Song Identification Using Multi-level Deep Sequence Similarity Learning. CoRR abs/2501.08129 (2025)
[i95]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-09363
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-09363
John Martinsson, Olof Mogren, Tuomas Virtanen, Maria Sandsten:
The Accuracy Cost of Weakness: A Theoretical Analysis of Fixed-Segment Weak Labeling for Events in Time. CoRR abs/2502.09363 (2025)
[i94]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-07352
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-07352
Eetu Tunturi, David Diaz-Guerra, Archontis Politis, Tuomas Virtanen:
Score-informed Music Source Separation: Improving Synthetic-to-real Generalization in Classical Music. CoRR abs/2503.07352 (2025)
[i93]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-03442
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-03442
Diep Luong, Mikko Heikkinen, Konstantinos Drossos, Tuomas Virtanen:
Knowledge Distillation for Speech Denoising by Latent Representation Alignment with Cosine Distance. CoRR abs/2505.03442 (2025)
[i92]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-14562
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-14562
Parthasaarathy Sudarsanam, Irene Martín-Morató, Tuomas Virtanen:
Representation Learning for Semantic Alignment of Language, Audio, and Visual Modalities. CoRR abs/2505.14562 (2025)
[i91]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-16607
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-16607
Yuzhu Wang, Archontis Politis, Konstantinos Drossos, Tuomas Virtanen:
Attractor-Based Speech Separation of Multiple Utterances by Unknown Number of Speakers. CoRR abs/2505.16607 (2025)
[i90]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-20956
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-20956
Shiqi Zhang, Tuomas Virtanen:
Hybrid Disagreement-Diversity Active Learning for Bioacoustic Sound Event Detection. CoRR abs/2505.20956 (2025)
[i89]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-01483
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-01483
Wang Dai, Archontis Politis, Tuomas Virtanen:
Inter-Speaker Relative Cues for Text-Guided Target Speech Extraction. CoRR abs/2506.01483 (2025)
[i88]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-02562
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-02562
Yuzhu Wang, Archontis Politis, Konstantinos Drossos, Tuomas Virtanen:
Multi-Utterance Speech Separation and Association Trained on Short Segments. CoRR abs/2507.02562 (2025)
[i87]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-12042
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-12042
Kazuki Shimada, Archontis Politis, Irán R. Román, Parthasaarathy Sudarsanam, David Diaz-Guerra, Ruchi Pandey, Kengo Uchida, Yuichiro Koyama, Naoya Takahashi, Takashi Shibuya, Shusuke Takahashi, Tuomas Virtanen, Yuki Mitsufuji:
Stereo Sound Event Localization and Detection with Onscreen/offscreen Classification. CoRR abs/2507.12042 (2025)
[i86]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-14789
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-14789
Michael Neri, Tuomas Virtanen:
Acoustic Simulation Framework for Multi-channel Replay Speech Detection. CoRR abs/2509.14789 (2025)
[i85]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2511-21247
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2511-21247
Jaime Garcia-Martinez, David Diaz-Guerra, John Anderson, Ricardo Falcon-Perez, Pablo Cabañas Molero, Tuomas Virtanen, Julio J. Carabias-Orti, Pedro Vera-Candeas:
The Spheres Dataset: Multitrack Orchestral Recordings for Music Source Separation and Information Retrieval. CoRR abs/2511.21247 (2025)
2024
[j46]
- view
  authority control:
- export record
  dblp key:
  - journals/lalc/HekanahoHV24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/lalc/HekanahoHV24
Laura Hekanaho, Maija Hirvonen, Tuomas Virtanen:
Language-based machine perception: linguistic perspectives on the compilation of captioning datasets. Digit. Scholarsh. Humanit. 39(3): 864-883 (2024)
[j45]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/DrgasBPNV24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/DrgasBPNV24
Szymon Drgas, Lars Bramsløw, Archontis Politis, Gaurav Naithani, Tuomas Virtanen:
Dynamic Processing Neural Network Architecture for Hearing Loss Compensation. IEEE ACM Trans. Audio Speech Lang. Process. 32: 203-214 (2024)
[j44]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/NeriPKCV24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/NeriPKCV24
Michael Neri, Archontis Politis, Daniel Aleksander Krause, Marco Carli, Tuomas Virtanen:
Speaker Distance Estimation in Enclosures From Single-Channel Audio. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2242-2254 (2024)
[c182]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/HakalaKV24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/HakalaKV24
Aapo Hakala, Trevor Kincy, Tuomas Virtanen:
Automatic Live Music Song Identification Using Multi-level Deep Sequence Similarity Learning. EUSIPCO 2024: 31-35
[c181]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/DaiLPV24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/DaiLPV24
Wang Dai, Xiaofei Li, Archontis Politis, Tuomas Virtanen:
Reference Channel Selection by Multi-Channel Masking for End-to-End Multi-Channel Speech Enhancement. EUSIPCO 2024: 241-245
[c180]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/MartinssonMSV24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/MartinssonMSV24
John Martinsson, Olof Mogren, Maria Sandsten, Tuomas Virtanen:
From Weak to Strong Sound Event Labels using Adaptive Change-Point Detection and Active Learning. EUSIPCO 2024: 902-906
[c179]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HeikkinenPV24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HeikkinenPV24
Mikko Heikkinen, Archontis Politis, Tuomas Virtanen:
Neural Ambisonics Encoding For Compact Irregular Microphone Arrays. ICASSP 2024: 701-705
[c178]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangPV24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangPV24
Yuzhu Wang, Archontis Politis, Tuomas Virtanen:
Attention-Driven Multichannel Speech Enhancement in Moving Sound Source Scenarios. ICASSP 2024: 11221-11225
[c177]
- view
  authority control:
- export record
  dblp key:
  - conf/is2/MoritzOV24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/is2/MoritzOV24
Martin Moritz, Toni Olán, Tuomas Virtanen:
Noise-to-Mask Ratio Loss for Deep Neural Network Based Audio Watermarking. IS2 2024: 1-6
[c176]
- view
  authority control:
- export record
  dblp key:
  - conf/iwaenc/DoganXHV24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwaenc/DoganXHV24
Duygu Dogan, Huang Xie, Toni Heittola, Tuomas Virtanen:
Multi-Label Zero-Shot Audio Classification with Temporal Attention. IWAENC 2024: 250-254
[d20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - data/11/GarciaMartinezDPVCV24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/11/GarciaMartinezDPVCV24
Jaime Garcia-Martinez, David Diaz-Guerra, Archontis Politis, Tuomas Virtanen, Julio J. Carabias-Orti, Pedro Vera-Candeas:
SynthSOD: Developing an Heterogeneous Dataset for Orchestra Music Source Separation. Zenodo, 2024
[d19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - data/11/SudarsanamMHV24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/11/SudarsanamMHV24
Parthasaarathy Sudarsanam, Irene Martín-Morató, Aapo Hakala, Tuomas Virtanen:
AVCaps: An audio-visual dataset with modality-specific captions. Zenodo, 2024
[i84]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-05916
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-05916
Mikko Heikkinen, Archontis Politis, Tuomas Virtanen:
Neural Ambisonics encoding for compact irregular microphone arrays. CoRR abs/2401.05916 (2024)
[i83]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-08525
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-08525
John Martinsson, Olof Mogren, Maria Sandsten, Tuomas Virtanen:
From Weak to Strong Sound Event Labels using Adaptive Change-Point Detection and Active Learning. CoRR abs/2403.08525 (2024)
[i82]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-17514
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-17514
Michael Neri, Archontis Politis, Daniel Krause, Marco Carli, Tuomas Virtanen:
Speaker Distance Estimation in Enclosures from Single-Channel Audio. CoRR abs/2403.17514 (2024)
[i81]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-15672
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-15672
Andreas Triantafyllopoulos, Iosif Tsangko, Alexander Gebhard, Annamaria Mesaros, Tuomas Virtanen, Björn W. Schuller:
Computer Audition: From Task-Specific Machine Learning to Foundation Models. CoRR abs/2407.15672 (2024)
[i80]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-15553
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-15553
Martin Moritz, Toni Olán, Tuomas Virtanen:
Noise-to-mask Ratio Loss for Deep Neural Network based Audio Watermarking. CoRR abs/2408.15553 (2024)
[i79]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-00408
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-00408
Duygu Dogan, Huang Xie, Toni Heittola, Tuomas Virtanen:
Multi-label Zero-Shot Audio Classification with Temporal Attention. CoRR abs/2409.00408 (2024)
[i78]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-10995
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-10995
Jaime Garcia-Martinez, David Diaz-Guerra, Archontis Politis, Tuomas Virtanen, Julio J. Carabias-Orti, Pedro Vera-Candeas:
SynthSOD: Developing an Heterogeneous Dataset for Orchestra Music Source Separation. CoRR abs/2409.10995 (2024)
[i77]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-04951
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-04951
Annamaria Mesaros, Romain Serizel, Toni Heittola, Tuomas Virtanen, Mark D. Plumbley:
A decade of DCASE: Achievements, practices, evaluations and future challenges. CoRR abs/2410.04951 (2024)
[i76]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-06892
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-06892
Esa Räsänen, Niko Gullsten, Otto Pulkkinen, Tuomas Virtanen:
Timing and Dynamics of the Rosanna Shuffle. CoRR abs/2411.06892 (2024)
2023
[c175]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/MagronV23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/MagronV23
Paul Magron, Tuomas Virtanen:
Spectrogram Inversion for Audio Source Separation via Consistency, Mixing, and Magnitude Constraints. EUSIPCO 2023: 36-40
[c174]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/Diaz-GuerraPV23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/Diaz-GuerraPV23
David Diaz-Guerra, Archontis Politis, Tuomas Virtanen:
Position Tracking of a Varying Number of Sound Sources with Sliding Permutation Invariant Training. EUSIPCO 2023: 251-255
[c173]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/KhorramiBVR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/KhorramiBVR23
Khazar Khorrami, María Andrea Cruz Blandón, Tuomas Virtanen, Okko Räsänen:
Simultaneous or Sequential Training? How Speech Representations Cooperate in a Multi-Task Self-Supervised Learning System. EUSIPCO 2023: 431-435
[c172]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/SudarsanamV23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/SudarsanamV23
Parthasaarathy Sudarsanam, Tuomas Virtanen:
Attention-Based Methods For Audio Question Answering. EUSIPCO 2023: 750-754
[c171]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XieRV23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XieRV23
Huang Xie, Okko Räsänen, Tuomas Virtanen:
On Negative Sampling for Contrastive Audio-Text Retrieval. ICASSP 2023: 1-5
[c170]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XieLHCV23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XieLHCV23
Wei Xie, Yanxiong Li, Qianhua He, Wenchang Cao, Tuomas Virtanen:
Few-shot Class-incremental Audio Classification Using Adaptively-refined Prototypes. INTERSPEECH 2023: 301-305
[c169]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ShimadaPS0UAHKT23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ShimadaPS0UAHKT23
Kazuki Shimada, Archontis Politis, Parthasaarathy Sudarsanam, Daniel Aleksander Krause, Kengo Uchida, Sharath Adavanne, Aapo Hakala, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen, Yuki Mitsufuji:
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events. NeurIPS 2023
[c168]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/LuongTGDV23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/LuongTGDV23
Diep Luong, Minh Tran, Shayan Gharib, Konstantinos Drossos, Tuomas Virtanen:
Representation Learning for Audio Privacy Preservation Using Source Separation and Robust Adversarial Learning. WASPAA 2023: 1-5
[c167]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/NeriPKCV23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/NeriPKCV23
Michael Neri, Archontis Politis, Daniel Krause, Marco Carli, Tuomas Virtanen:
Single-Channel Speaker Distance Estimation in Reverberant Environments. WASPAA 2023: 1-5
[d18]
- view
  authority control:
- export record
  dblp key:
  - data/10/PolitisSSHTKTAKUMV23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/10/PolitisSSHTKTAKUMV23
Archontis Politis, Kazuki Shimada, Parthasaarathy Sudarsanam, Aapo Hakala, Shusuke Takahashi, Daniel Aleksander Krause, Naoya Takahashi, Sharath Adavanne, Yuichiro Koyama, Kengo Uchida, Yuki Mitsufuji, Tuomas Virtanen:
STARSS23: Sony-TAu Realistic Spatial Soundscapes 2023. Version 1.0.0. Zenodo, 2023 [all versions]
[d17]
- view
  authority control:
- export record
  dblp key:
  - data/10/PolitisSSHTKTAKUMV23a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/10/PolitisSSHTKTAKUMV23a
Archontis Politis, Kazuki Shimada, Parthasaarathy Sudarsanam, Aapo Hakala, Shusuke Takahashi, Daniel Aleksander Krause, Naoya Takahashi, Sharath Adavanne, Yuichiro Koyama, Kengo Uchida, Yuki Mitsufuji, Tuomas Virtanen:
STARSS23: Sony-TAu Realistic Spatial Soundscapes 2023. Version 1.1.0. Zenodo, 2023 [all versions]
[i75]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-01864
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-01864
Paul Magron, Tuomas Virtanen:
Spectrogram Inversion for Audio Source Separation via Consistency, Mixing, and Magnitude Constraints. CoRR abs/2303.01864 (2023)
[i74]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-07816
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-07816
Wang Dai, Archontis Politis, Tuomas Virtanen:
Multi-Channel Masking with Learnable Filterbank for Sound Source Separation. CoRR abs/2303.07816 (2023)
[i73]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-00011
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-00011
Shayan Gharib, Minh Tran, Diep Luong, Konstantinos Drossos, Tuomas Virtanen:
Adversarial Representation Learning for Robust Privacy Preservation in Audio. CoRR abs/2305.00011 (2023)
[i72]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-18045
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-18045
Wei Xie, Yanxiong Li, Qianhua He, Wenchang Cao, Tuomas Virtanen:
Few-shot Class-incremental Audio Classification Using Adaptively-refined Prototypes. CoRR abs/2305.18045 (2023)
[i71]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-19769
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-19769
Parthasaarathy Sudarsanam, Tuomas Virtanen:
Attention-Based Methods For Audio Question Answering. CoRR abs/2305.19769 (2023)
[i70]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-02972
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-02972
Khazar Khorrami, María Andrea Cruz Blandón, Tuomas Virtanen, Okko Räsänen:
Simultaneous or Sequential Training? How Speech Representations Cooperate in a Multi-Task Self-Supervised Learning System. CoRR abs/2306.02972 (2023)
[i69]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-08510
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-08510
David Diaz-Guerra, Archontis Politis, Antonio Miguel, José Ramón Beltrán, Tuomas Virtanen:
Permutation Invariant Recurrent Neural Networks for Sound Source Tracking Applications. CoRR abs/2306.08510 (2023)
[i68]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-09126
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-09126
Kazuki Shimada, Archontis Politis, Parthasaarathy Sudarsanam, Daniel Krause, Kengo Uchida, Sharath Adavanne, Aapo Hakala, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen, Yuki Mitsufuji:
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events. CoRR abs/2306.09126 (2023)
[i67]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-09820
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-09820
Huang Xie, Khazar Khorrami, Okko Räsänen, Tuomas Virtanen:
Crowdsourcing and Evaluating Text-Based Audio Retrieval Relevances. CoRR abs/2306.09820 (2023)
[i66]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-04960
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-04960
Diep Luong, Minh Tran, Shayan Gharib, Konstantinos Drossos, Tuomas Virtanen:
Representation Learning for Audio Privacy Preservation using Source Separation and Robust Adversarial Learning. CoRR abs/2308.04960 (2023)
[i65]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-16550
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-16550
Szymon Drgas, Lars Bramsløw, Archontis Politis, Gaurav Naithani, Tuomas Virtanen:
Dynamic Processing Neural Network Architecture For Hearing Loss Compensation. CoRR abs/2310.16550 (2023)
[i64]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-10756
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-10756
Yuzhu Wang, Archontis Politis, Tuomas Virtanen:
Attention-Driven Multichannel Speech Enhancement in Moving Sound Source Scenarios. CoRR abs/2312.10756 (2023)
2022
[j43]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/SchullerEPNVT22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/SchullerEPNVT22
Björn W. Schuller, Yonina C. Eldar, Maja Pantic, Shrikanth Narayanan, Tuomas Virtanen, Jianhua Tao:
Editorial: Intelligent Signal Analysis for Contagious Virus Diseases. IEEE J. Sel. Top. Signal Process. 16(2): 159-163 (2022)
[j42]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/jstsp/WangPMV22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/WangPMV22
Shanshan Wang, Archontis Politis, Annamaria Mesaros, Tuomas Virtanen:
Self-Supervised Learning of Audio Representations From Audio-Visual Data Using Spatial Alignment. IEEE J. Sel. Top. Signal Process. 16(6): 1467-1479 (2022)
[c166]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/Martin-MoratoPA22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/Martin-MoratoPA22
Irene Martín-Morató, Francesco Paissan, Alberto Ancilotto, Toni Heittola, Annamaria Mesaros, Elisabetta Farella, Alessio Brutti, Tuomas Virtanen:
Low-Complexity Acoustic Scene Classification in DCASE 2022 Challenge. DCASE 2022
[c165]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/PolitisSSA0KTTM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/PolitisSSA0KTTM22
Archontis Politis, Kazuki Shimada, Parthasaarathy Sudarsanam, Sharath Adavanne, Daniel Krause, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Yuki Mitsufuji, Tuomas Virtanen:
STARSS22: A Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events. DCASE 2022
[c164]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/XieLV22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/XieLV22
Huang Xie, Samuel Lipping, Tuomas Virtanen:
Language-Based Audio Retrieval Task in DCASE 2022 Challenge. DCASE 2022
[c163]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/DoganXHV22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/DoganXHV22
Duygu Dogan, Huang Xie, Toni Heittola, Tuomas Virtanen:
Zero-Shot Audio Classification using Image Embeddings. EUSIPCO 2022: 1-5
[c162]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/EklundDV22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/EklundDV22
Ville-Veikko Eklund, Aleksandr Diment, Tuomas Virtanen:
Noise, Device and Room Robustness Methods for Pronunciation Error Detection. EUSIPCO 2022: 140-144
[c161]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/LippingSDV22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/LippingSDV22
Samuel Lipping, Parthasaarathy Sudarsanam, Konstantinos Drossos, Tuomas Virtanen:
Clotho-AQA: A Crowdsourced Dataset for Audio Question Answering. EUSIPCO 2022: 1140-1144
[c160]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XieRDV22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XieRDV22
Huang Xie, Okko Räsänen, Konstantinos Drossos, Tuomas Virtanen:
Unsupervised Audio-Caption Aligning Learns Correspondences Between Individual Sound Events and Textual Phrases. ICASSP 2022: 8867-8871
[c159]
- view
  authority control:
- export record
  dblp key:
  - conf/mmsp/LiCDV22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mmsp/LiCDV22
Yanxiong Li, Wenchang Cao, Konstantinos Drossos, Tuomas Virtanen:
Domestic Activity Clustering from Audio via Depthwise Separable Convolutional Autoencoder Network. MMSP 2022: 1-6
[c158]
- view
  authority control:
- export record
  dblp key:
  - conf/mmsp/NaithaniPNPTV22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mmsp/NaithaniPNPTV22
Gaurav Naithani, Kirsi Pietilä, Riitta Niemistö, Erkki Paajanen, Tero Takala, Tuomas Virtanen:
Subjective Evaluation of Deep Neural Network Based Speech Enhancement Systems in Real-World Conditions. MMSP 2022: 1-6
[d16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - data/10/LippingSDV22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/10/LippingSDV22
Samuel Lipping, Parthasaarathy Sudarsanam, Konstantinos Drossos, Tuomas Virtanen:
Clotho-AQA dataset. Zenodo, 2022
[d15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - data/10/PolitisAV22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/10/PolitisAV22
Archontis Politis, Sharath Adavanne, Tuomas Virtanen:
TAU Spatial Room Impulse Response Database (TAU-SRIR DB). Zenodo, 2022
[d14]
- view
  authority control:
- export record
  dblp key:
  - data/10/PolitisMSSAKKTTV22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/10/PolitisMSSAKKTTV22
Adavanne Politis, Yuki Mitsufuji, Parthasaarathy Sudarsanam, Kazuki Shimada, Sharath Adavanne, Yuichiro Koyama, Daniel Krause, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen:
STARSS22: Sony-TAu Realistic Spatial Soundscapes 2022 dataset. Version 1.0.0. Zenodo, 2022 [all versions]
[d13]
- view
  authority control:
- export record
  dblp key:
  - data/10/PolitisMSSAKKTTV22a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/10/PolitisMSSAKKTTV22a
Archontis Politis, Yuki Mitsufuji, Parthasaarathy Sudarsanam, Kazuki Shimada, Sharath Adavanne, Yuichiro Koyama, Daniel Aleksander Krause, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen:
STARSS22: Sony-TAu Realistic Spatial Soundscapes 2022 dataset. Version 1.1.0. Zenodo, 2022 [all versions]
[i63]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-09634
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-09634
Samuel Lipping, Parthasaarathy Sudarsanam, Konstantinos Drossos, Tuomas Virtanen:
Clotho-AQA: A Crowdsourced Dataset for Audio Question Answering. CoRR abs/2204.09634 (2022)
[i62]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-00970
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-00970
Shanshan Wang, Archontis Politis, Annamaria Mesaros, Tuomas Virtanen:
Self-supervised Learning of Audio Representations from Audio-Visual Data using Spatial Alignment. CoRR abs/2206.00970 (2022)
[i61]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-01948
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-01948
Archontis Politis, Kazuki Shimada, Parthasaarathy Sudarsanam, Sharath Adavanne, Daniel Krause, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Yuki Mitsufuji, Tuomas Virtanen:
STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events. CoRR abs/2206.01948 (2022)
[i60]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-04984
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-04984
Duygu Dogan, Huang Xie, Toni Heittola, Tuomas Virtanen:
Zero-Shot Audio Classification using Image Embeddings. CoRR abs/2206.04984 (2022)
[i59]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-02406
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-02406
Yanxiong Li, Wenchang Cao, Konstantinos Drossos, Tuomas Virtanen:
Domestic Activity Clustering from Audio via Depthwise Separable Convolutional Autoencoder Network. CoRR abs/2208.02406 (2022)
[i58]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-05057
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-05057
Gaurav Naithani, Kirsi Pietilä, Riitta Niemistö, Erkki Paajanen, Tero Takala, Tuomas Virtanen:
Subjective Evaluation of Deep Neural Network Based Speech Enhancement Systems in Real-World Conditions. CoRR abs/2208.05057 (2022)
[i57]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-14536
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-14536
David Diaz-Guerra, Archontis Politis, Tuomas Virtanen:
Position tracking of a varying number of sound sources with sliding permutation invariant training. CoRR abs/2210.14536 (2022)
[i56]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-04070
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-04070
Huang Xie, Okko Räsänen, Tuomas Virtanen:
On Negative Sampling for Contrastive Audio-Text Retrieval. CoRR abs/2211.04070 (2022)
2021
[j41]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/DrgasV21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/DrgasV21
Szymon Drgas, Tuomas Virtanen:
Joint speaker separation and recognition using non-negative matrix deconvolution with adaptive dictionary. Comput. Speech Lang. 70: 101223 (2021)
[j40]
- view
  authority control:
- export record
  dblp key:
  - journals/spm/MesarosHVP21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spm/MesarosHVP21
Annamaria Mesaros, Toni Heittola, Tuomas Virtanen, Mark D. Plumbley:
Sound Event Detection: A tutorial. IEEE Signal Process. Mag. 38(5): 67-83 (2021)
[j39]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/PolitisMAHV21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/PolitisMAHV21
Archontis Politis, Annamaria Mesaros, Sharath Adavanne, Toni Heittola, Tuomas Virtanen:
Overview and Evaluation of Sound Event Localization and Detection in DCASE 2019. IEEE ACM Trans. Audio Speech Lang. Process. 29: 684-698 (2021)
[j38]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/XieV21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/XieV21
Huang Xie, Tuomas Virtanen:
Zero-Shot Audio Classification Via Semantic Embeddings. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1233-1242 (2021)
[c157]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/WangMHV21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/WangMHV21
Shanshan Wang, Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
Audio-Visual Scene Classification: Analysis of DCASE 2021 Challenge Submissions. DCASE 2021: 45-49
[c156]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/Martin-MoratoHM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/Martin-MoratoHM21
Irene Martín-Morató, Toni Heittola, Annamaria Mesaros, Tuomas Virtanen:
Low-Complexity Acoustic Scene Classification for Multi-Device Audio: Analysis of DCASE 2021 Challenge Systems. DCASE 2021: 85-89
[c155]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/PolitisAKDSV21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/PolitisAKDSV21
Archontis Politis, Sharath Adavanne, Daniel Krause, Antoine Deleforge, Prerak Srivastava, Tuomas Virtanen:
A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection. DCASE 2021: 125-129
[c154]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/WangNPV21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/WangNPV21
Shanshan Wang, Gaurav Naithani, Archontis Politis, Tuomas Virtanen:
Deep Neural Network Based Low-Latency Speech Separation with Asymmetric Analysis-Synthesis Window Pair. EUSIPCO 2021: 301-305
[c153]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/PertilaCHFVPE21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/PertilaCHFVPE21
Pasi Pertilä, Emre Cakir, Aapo Hakala, Eemi Fagerlund, Tuomas Virtanen, Archontis Politis, Antti J. Eronen:
Mobile Microphone Array Speech Detection and Localization in Diverse Everyday Environments. EUSIPCO 2021: 406-410
[c152]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/DjukanovicPMV21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/DjukanovicPMV21
Slobodan Djukanovic, Yash Patel, Jirí Matas, Tuomas Virtanen:
Neural network-based acoustic vehicle counting. EUSIPCO 2021: 561-565
[c151]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/TranDV21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/TranDV21
An Tran, Konstantinos Drossos, Tuomas Virtanen:
WaveTransformer: An Architecture for Audio Captioning Based on Learning Temporal and Time-Frequency Information. EUSIPCO 2021: 576-580
[c150]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XieRV21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XieRV21
Huang Xie, Okko Räsänen, Tuomas Virtanen:
Zero-Shot Audio Classification with Factored Linear and Nonlinear Acoustic-Semantic Projections. ICASSP 2021: 326-330
[c149]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/FavoryDVS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/FavoryDVS21
Xavier Favory, Konstantinos Drossos, Tuomas Virtanen, Xavier Serra:
Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags. ICASSP 2021: 596-600
[c148]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangMHV21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangMHV21
Shanshan Wang, Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
A Curated Dataset of Urban Scenes for Audio-Visual Scene Analysis. ICASSP 2021: 626-630
[c147]
- view
  authority control:
- export record
  dblp key:
  - conf/icmi/SchullerVRR0MD21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmi/SchullerVRR0MD21
Björn W. Schuller, Tuomas Virtanen, Maria Riveiro, Georgios Rizos, Jing Han, Annamaria Mesaros, Konstantinos Drossos:
Towards Sonification in Multimodal and User-friendlyExplainable Artificial Intelligence. ICMI 2021: 788-792
[c146]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/AdavannePV21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/AdavannePV21
Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers. WASPAA 2021: 211-215
[d12]
- view
  authority control:
- export record
  dblp key:
  - data/10/DrossosLV21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/10/DrossosLV21
Konstantinos Drossos, Samuel Lipping, Tuomas Virtanen:
Clotho dataset. Version 2.0. Zenodo, 2021 [all versions]
[d11]
- view
  authority control:
- export record
  dblp key:
  - data/10/DrossosLV21a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/10/DrossosLV21a
Konstantinos Drossos, Samuel Lipping, Tuomas Virtanen:
Clotho dataset. Version 2.1. Zenodo, 2021 [all versions]
[d10]
- view
  authority control:
- export record
  dblp key:
  - data/10/PolitisAV21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/10/PolitisAV21
Archontis Politis, Sharath Adavanne, Tuomas Virtanen:
TAU-NIGENS Spatial Sound Events 2021. Version 1. Zenodo, 2021 [all versions]
[d9]
- view
  authority control:
- export record
  dblp key:
  - data/10/PolitisAV21a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/10/PolitisAV21a
Archontis Politis, Sharath Adavanne, Tuomas Virtanen:
TAU-NIGENS Spatial Sound Events 2021. Version 1.1.0. Zenodo, 2021 [all versions]
[d8]
- view
  authority control:
- export record
  dblp key:
  - data/10/PolitisAV21b
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/10/PolitisAV21b
Archontis Politis, Sharath Adavanne, Tuomas Virtanen:
TAU-NIGENS Spatial Sound Events 2021. Version 1.2.0. Zenodo, 2021 [all versions]
[i55]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2105-13675
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-13675
Shanshan Wang, Toni Heittola, Annamaria Mesaros, Tuomas Virtanen:
Audio-visual scene classification: analysis of DCASE 2021 Challenge submissions. CoRR abs/2105.13675 (2021)
[i54]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-06999
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-06999
Archontis Politis, Sharath Adavanne, Daniel Krause, Antoine Deleforge, Prerak Srivastava, Tuomas Virtanen:
A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection. CoRR abs/2106.06999 (2021)
[i53]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-11794
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-11794
Shanshan Wang, Gaurav Naithani, Archontis Politis, Tuomas Virtanen:
Deep neural network Based Low-latency Speech Separation with Asymmetric analysis-Synthesis Window Pair. CoRR abs/2106.11794 (2021)
[i52]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2111-00030
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-00030
Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers. CoRR abs/2111.00030 (2021)
2020
[j37]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/MagronV20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/MagronV20
Paul Magron, Tuomas Virtanen:
Online Spectrogram Inversion for Low-Latency Audio Source Separation. IEEE Signal Process. Lett. 27: 306-310 (2020)
[j36]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhaoHV20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhaoHV20
Shuyang Zhao, Toni Heittola, Tuomas Virtanen:
Active Learning for Sound Event Detection. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2895-2905 (2020)
[c145]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/CakirDV20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/CakirDV20
Emre Çakir, Konstantinos Drossos, Tuomas Virtanen:
Multi-Task Regularization Based on Infrequent Classes for Audio Captioning. DCASE 2020: 6-10
[c144]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/HeittolaMV20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/HeittolaMV20
Toni Heittola, Annamaria Mesaros, Tuomas Virtanen:
Acoustic Scene Classification in DCASE 2020 Challenge: Generalization Across Devices and Low Complexity Solutions. DCASE 2020: 56-60
[c143]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/NguyenDV20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/NguyenDV20
Khoa Nguyen, Konstantinos Drossos, Tuomas Virtanen:
Temporal Sub-Sampling of Audio Feature Sequences for Automated Audio Captioning. DCASE 2020: 110-114
[c142]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/PolitisAV20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/PolitisAV20
Archontis Politis, Sharath Adavanne, Tuomas Virtanen:
A Dataset of Reverberant Spatial Sound Scenes with Moving Sources for Sound Event Localization and Detection. DCASE 2020: 165-169
[c141]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/NicodemoNDVS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/NicodemoNDVS20
Niccolò Nicodemo, Gaurav Naithani, Konstantinos Drossos, Tuomas Virtanen, Roberto Saletti:
Memory Requirement Reduction of Deep Neural Networks for Field Programmable Gate Arrays Using Low-Bit Quantization of Parameters. EUSIPCO 2020: 466-470
[c140]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiLDV20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiLDV20
Yanxiong Li, Mingle Liu, Konstantinos Drossos, Tuomas Virtanen:
Sound Event Detection Via Dilated Convolutional Recurrent Neural Networks. ICASSP 2020: 286-290
[c139]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DrossosLV20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DrossosLV20
Konstantinos Drossos, Samuel Lipping, Tuomas Virtanen:
Clotho: an Audio Captioning Dataset. ICASSP 2020: 736-740
[c138]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/DrossosMGLV20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/DrossosMGLV20
Konstantinos Drossos, Stylianos I. Mimilakis, Shayan Gharib, Yanxiong Li, Tuomas Virtanen:
Sound Event Detection with Depthwise Separable and Dilated Convolutions. IJCNN 2020: 1-7
[c137]
- view
  authority control:
- export record
  dblp key:
  - conf/ivs/DjukanovicMV20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ivs/DjukanovicMV20
Slobodan Djukanovic, Jiri Matas, Tuomas Virtanen:
Robust Audio-Based Vehicle Counting in Low-to-Moderate Traffic Flow. IV 2020: 1608-1614
[c136]
- view
  authority control:
- export record
  dblp key:
  - conf/mmsp/PyykkonenMDV20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mmsp/PyykkonenMDV20
Pyry Pyykkönen, Stylianos I. Mimilakis, Konstantinos Drossos, Tuomas Virtanen:
Depthwise Separable Convolutions Versus Recurrent Neural Networks for Monaural Singing Voice Separation. MMSP 2020: 1-6
[d7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - data/10/DrossosLV20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/10/DrossosLV20
Konstantinos Drossos, Samuel Lipping, Tuomas Virtanen:
Audio captioning DCASE 2020 evaluation (testing) split. Zenodo, 2020
[d6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - data/10/FavoryDVS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/10/FavoryDVS20
Xavier Favory, Konstantinos Drossos, Tuomas Virtanen, Xavier Serra:
Dataset used in COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations. Zenodo, 2020
[d5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - data/10/GharibDFV20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/10/GharibDFV20
Shayan Gharib, Konstantinos Drossos, Eemi Fagerlund, Tuomas Virtanen:
VOICe Dataset. Zenodo, 2020
[i51]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-00476
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-00476
Konstantinos Drossos, Stylianos Ioannis Mimilakis, Shayan Gharib, Yanxiong Li, Tuomas Virtanen:
Sound Event Detection with Depthwise Separable and Dilated Convolutions. CoRR abs/2002.00476 (2020)
[i50]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-05033
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-05033
Shuyang Zhao, Toni Heittola, Tuomas Virtanen:
Active Learning for Sound Event Detection. CoRR abs/2002.05033 (2020)
[i49]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2006-01919
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-01919
Archontis Politis, Sharath Adavanne, Tuomas Virtanen:
A Dataset of Reverberant Spatial Sound Scenes with Moving Sources for Sound Event Localization and Detection. CoRR abs/2006.01919 (2020)
[i48]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2006-08386
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-08386
Xavier Favory, Konstantinos Drossos, Tuomas Virtanen, Xavier Serra:
COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations. CoRR abs/2006.08386 (2020)
[i47]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-02676
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-02676
Khoa Nguyen, Konstantinos Drossos, Tuomas Virtanen:
Temporal Sub-sampling of Audio Feature Sequences for Automated Audio Captioning. CoRR abs/2007.02676 (2020)
[i46]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-02683
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-02683
Pyry Pyykkönen, Stylianos Ioannis Mimilakis, Konstantinos Drossos, Tuomas Virtanen:
Depthwise Separable Convolutions Versus Recurrent Neural Networks for Monaural Singing Voice Separation. CoRR abs/2007.02683 (2020)
[i45]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-04660
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-04660
Emre Çakir, Konstantinos Drossos, Tuomas Virtanen:
Multi-task Regularization Based on Infrequent Classes for Audio Captioning. CoRR abs/2007.04660 (2020)
[i44]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-05183
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-05183
Konstantinos Drossos, Stylianos Ioannis Mimilakis, Tuomas Virtanen:
Conditioned Time-Dilated Convolutions for Sound Event Detection. CoRR abs/2007.05183 (2020)
[i43]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2009-02792
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-02792
Archontis Politis, Annamaria Mesaros, Sharath Adavanne, Toni Heittola, Tuomas Virtanen:
Overview and Evaluation of Sound Event Localization and Detection in DCASE 2019. CoRR abs/2009.02792 (2020)
[i42]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-11098
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-11098
An Tran, Konstantinos Drossos, Tuomas Virtanen:
WaveTransformer: A Novel Architecture for Audio Captioning Based on Learning Temporal and Time-Frequency Information. CoRR abs/2010.11098 (2020)
[i41]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-11659
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-11659
Slobodan Djukanovic, Yash Patel, Jiri Matas, Tuomas Virtanen:
Neural Network-based Acoustic Vehicle Counting. CoRR abs/2010.11659 (2020)
[i40]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-11716
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-11716
Slobodan Djukanovic, Jiri Matas, Tuomas Virtanen:
Robust Audio-Based Vehicle Counting in Low-to-Moderate Traffic Flow. CoRR abs/2010.11716 (2020)
[i39]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-14171
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-14171
Xavier Favory, Konstantinos Drossos, Tuomas Virtanen, Xavier Serra:
Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags. CoRR abs/2010.14171 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[j35]
- view
  authority control:
- export record
  dblp key:
  - journals/dsp/Garcia-MollaSVV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/dsp/Garcia-MollaSVV19
Víctor M. García-Molla, Pablo San Juan Sebastián, Tuomas Virtanen, Antonio M. Vidal, Pedro Alonso:
Generalization of the K-SVD algorithm for minimization of β-divergence. Digit. Signal Process. 92: 47-53 (2019)
[j34]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/AdavannePNV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/AdavannePNV19
Sharath Adavanne, Archontis Politis, Joonas Nikunen, Tuomas Virtanen:
Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks. IEEE J. Sel. Top. Signal Process. 13(1): 34-48 (2019)
[j33]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/PurwinsLVSCS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/PurwinsLVSCS19
Hendrik Purwins, Bo Li, Tuomas Virtanen, Jan Schlüter, Shuo-Yiin Chang, Tara N. Sainath:
Deep Learning for Audio Signal Processing. IEEE J. Sel. Top. Signal Process. 13(2): 206-219 (2019)
[j32]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/MagronV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/MagronV19
Paul Magron, Tuomas Virtanen:
Complex ISNMF: A Phase-Aware Model for Monaural Audio Source Separation. IEEE ACM Trans. Audio Speech Lang. Process. 27(1): 20-31 (2019)
[j31]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/MesarosDEHVRV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/MesarosDEHVRV19
Annamaria Mesaros, Aleksandr Diment, Benjamin Elizalde, Toni Heittola, Emmanuel Vincent, Bhiksha Raj, Tuomas Virtanen:
Sound Event Detection in the DCASE 2017 Challenge. IEEE ACM Trans. Audio Speech Lang. Process. 27(6): 992-1006 (2019)
[j30]
- view
  authority control:
- export record
  dblp key:
  - journals/tjs/SebastianVGV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tjs/SebastianVGV19
Pablo San Juan Sebastián, Tuomas Virtanen, Víctor M. García-Molla, Antonio M. Vidal:
Analysis of an efficient parallel implementation of active-set Newton algorithm. J. Supercomput. 75(3): 1298-1309 (2019)
[c135]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/AdavannePV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/AdavannePV19
Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
A Multi-room Reverberant Dataset for Sound Event Localization and Detection. DCASE 2019: 10-14
[c134]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/AdavannePV19a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/AdavannePV19a
Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network. DCASE 2019: 20-24
[c133]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/DrossosGMV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/DrossosGMV19
Konstantinos Drossos, Shayan Gharib, Paul Magron, Tuomas Virtanen:
Language Modelling for Sound Event Detection with Teacher Forcing and Scheduled Sampling. DCASE 2019: 59-63
[c132]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/LippingDV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/LippingDV19
Samuel Lipping, Konstantinos Drossos, Tuomas Virtanen:
Crowdsourcing a Dataset of Audio Captions. DCASE 2019: 139-143
[c131]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/MesarosHV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/MesarosHV19
Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
Acoustic Scene Classification in DCASE 2019 Challenge: Closed and Open Set Classification and Data Mismatch Setups. DCASE 2019: 164-168
[c130]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/AhsanKMHKV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/AhsanKMHKV19
M. N. Istiaq Ahsan, Csaba Kertész, Annamaria Mesaros, Toni Heittola, Andrew Knight, Tuomas Virtanen:
Audio-Based Epileptic Seizure Detection. EUSIPCO 2019: 1-5
[c129]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangNV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangNV19
Shanshan Wang, Gaurav Naithani, Tuomas Virtanen:
Low-latency Deep Clustering for Speech Separation. ICASSP 2019: 76-80
[c128]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Martin-MoratoMH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Martin-MoratoMH19
Irene Martín-Morató, Annamaria Mesaros, Toni Heittola, Tuomas Virtanen, Maximo Cobos, Francesc J. Ferri:
Sound Event Envelope Estimation in Polyphonic Mixtures. ICASSP 2019: 935-939
[c127]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/DimentFBV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/DimentFBV19
Aleksandr Diment, Eemi Fagerlund, Adrian Benfield, Tuomas Virtanen:
Detection of Typical Pronunciation Errors in Non-native English Speech Using Convolutional Recurrent Neural Networks. IJCNN 2019: 1-8
[c126]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/BearHMBV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/BearHMBV19
Helen L. Bear, Toni Heittola, Annamaria Mesaros, Emmanouil Benetos, Tuomas Virtanen:
City Classification from Multiple Real-World Sound Scenes. WASPAA 2019: 11-15
[c125]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/DrossosMV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/DrossosMV19
Konstantinos Drossos, Paul Magron, Tuomas Virtanen:
Unsupervised Adversarial Domain Adaptation Based on The Wasserstein Distance For Acoustic Scene Classification. WASPAA 2019: 259-263
[c124]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/XieV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/XieV19
Huang Xie, Tuomas Virtanen:
Zero-Shot Audio Classification Based On Class Label Embeddings. WASPAA 2019: 264-267
[c123]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/GreenAMV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/GreenAMV19
Marc C. Green, Sharath Adavanne, Damian T. Murphy, Tuomas Virtanen:
Acoustic Scene Classification Using Higher-Order Ambisonic Features. WASPAA 2019: 328-332
[c122]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/MesarosAPHV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/MesarosAPHV19
Annamaria Mesaros, Sharath Adavanne, Archontis Politis, Toni Heittola, Tuomas Virtanen:
Joint Measurement of Localization and Detection of Sound Events. WASPAA 2019: 333-337
[d4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - data/10/AdavannePMHV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/10/AdavannePMHV19
Sharath Adavanne, Archontis Politis, Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
Sound event localization and detection (SELDnet) results. Zenodo, 2019
[d3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - data/10/DrossosGMV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/10/DrossosGMV19
Konstantinos Drossos, Shayan Gharib, Paul Magron, Tuomas Virtanen:
Code of the method presented in the paper: Drossos et al, "Language Modelling for Sound Event Detection with Teacher Forcing and Scheduled Sampling," in proceedings of DCASE 2019. Zenodo, 2019
[d2]
- view
  authority control:
- export record
  dblp key:
  - data/10/DrossosLV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/10/DrossosLV19
Konstantinos Drossos, Samuel Lipping, Tuomas Virtanen:
Clotho dataset. Version 1.0. Zenodo, 2019 [all versions]
[i38]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-07033
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-07033
Shanshan Wang, Gaurav Naithani, Tuomas Virtanen:
Low-Latency Deep Clustering For Speech Separation. CoRR abs/1902.07033 (2019)
[i37]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1904-10678
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-10678
Konstantinos Drossos, Paul Magron, Tuomas Virtanen:
Unsupervised Adversarial Domain Adaptation Based On The Wasserstein Distance For Acoustic Scene Classification. CoRR abs/1904.10678 (2019)
[i36]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1904-12769
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-12769
Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network. CoRR abs/1904.12769 (2019)
[i35]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1905-00078
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-00078
Hendrik Purwins, Bo Li, Tuomas Virtanen, Jan Schlüter, Shuo-Yiin Chang, Tara N. Sainath:
Deep Learning for Audio Signal Processing. CoRR abs/1905.00078 (2019)
[i34]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1905-00979
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-00979
Helen L. Bear, Toni Heittola, Annamaria Mesaros, Emmanouil Benetos, Tuomas Virtanen:
City classification from multiple real-world sound scenes. CoRR abs/1905.00979 (2019)
[i33]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1905-01926
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-01926
Huang Xie, Tuomas Virtanen:
Zero-Shot Audio Classification Based on Class Label Embeddings. CoRR abs/1905.01926 (2019)
[i32]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1905-08546
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-08546
Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
A multi-room reverberant dataset for sound event localization and detection. CoRR abs/1905.08546 (2019)
[i31]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1907-08506
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-08506
Konstantinos Drossos, Shayan Gharib, Paul Magron, Tuomas Virtanen:
Language Modelling for Sound Event Detection with Teacher Forcing and Scheduled Sampling. CoRR abs/1907.08506 (2019)
[i30]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1907-09238
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-09238
Samuel Lipping, Konstantinos Drossos, Tuomas Virtanen:
Crowdsourcing a Dataset of Audio Captions. CoRR abs/1907.09238 (2019)
[i29]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-09387
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-09387
Konstantinos Drossos, Samuel Lipping, Tuomas Virtanen:
Clotho: An Audio Captioning Dataset. CoRR abs/1910.09387 (2019)
[i28]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1911-00527
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-00527
Niccolò Nicodemo, Gaurav Naithani, Konstantinos Drossos, Tuomas Virtanen, Roberto Saletti:
Memory Requirement Reduction of Deep Neural Networks Using Low-bit Quantization of Parameters. CoRR abs/1911.00527 (2019)
[i27]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1911-03128
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-03128
Paul Magron, Tuomas Virtanen:
Online Spectrogram Inversion for Low-Latency Audio Source Separation. CoRR abs/1911.03128 (2019)
[i26]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1911-07098
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-07098
Shayan Gharib, Konstantinos Drossos, Eemi Fagerlund, Tuomas Virtanen:
VOICe: A Sound Event Detection Dataset For Generalizable Domain Adaptation. CoRR abs/1911.07098 (2019)
2018
[j29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/NaithaniKVTPL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/NaithaniKVTPL18
Gaurav Naithani, Jaana Kivinummi, Tuomas Virtanen, Outi Tammela, Mikko J. Peltola, Jukka M. Leppänen:
Automatic segmentation of infant cry signals using hidden Markov models. EURASIP J. Audio Speech Music. Process. 2018: 1 (2018)
[j28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejivp/MahkonenVK18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejivp/MahkonenVK18
Katariina Mahkonen, Tuomas Virtanen, Joni-Kristian Kämäräinen:
Cascade of Boolean detector combinations. EURASIP J. Image Video Process. 2018: 61 (2018)
[j27]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/NikunenDV18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/NikunenDV18
Joonas Nikunen, Aleksandr Diment, Tuomas Virtanen:
Separation of Moving Sound Sources Using Multichannel NMF and Acoustic Tracking. IEEE ACM Trans. Audio Speech Lang. Process. 26(2): 281-295 (2018)
[j26]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/MesarosHBFLVP18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/MesarosHBFLVP18
Annamaria Mesaros, Toni Heittola, Emmanouil Benetos, Peter Foster, Mathieu Lagrange, Tuomas Virtanen, Mark D. Plumbley:
Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge. IEEE ACM Trans. Audio Speech Lang. Process. 26(2): 379-393 (2018)
[j25]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/Carabias-OrtiNV18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/Carabias-OrtiNV18
Julio J. Carabias-Orti, Joonas Nikunen, Tuomas Virtanen, Pedro Vera-Candeas:
Multichannel Blind Sound Source Separation Using Spatial Covariance Model With Level and Time Differences and Nonnegative Matrix Factorization. IEEE ACM Trans. Audio Speech Lang. Process. 26(9): 1512-1527 (2018)
[c121]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/MesarosHV18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/MesarosHV18
Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
A multi-device dataset for urban acoustic scene classification. DCASE 2018: 9-13
[c120]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/GharibDCSV18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/GharibDCSV18
Shayan Gharib, Konstantinos Drossos, Emre Cakir, Dmitriy Serdyuk, Tuomas Virtanen:
Unsupervised adversarial domain adaptation for acoustic scene classification. DCASE 2018: 138-142
[c119]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/AdavannePV18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/AdavannePV18
Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Direction of Arrival Estimation for Multiple Sound Sources Using Convolutional Recurrent Neural Network. EUSIPCO 2018: 1462-1466
[c118]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MagronV18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MagronV18
Paul Magron, Tuomas Virtanen:
Bayesian Anisotropic Gaussian Model for Audio Source Separation. ICASSP 2018: 166-170
[c117]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NikunenV18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NikunenV18
Joonas Nikunen, Tuomas Virtanen:
Estimation of Time-Varying Room Impulse Responses of Multiple Sound Sources from Observed Mixture and Isolated Source Signals. ICASSP 2018: 421-425
[c116]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MimilakisDSSVB18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MimilakisDSSVB18
Stylianos Ioannis Mimilakis, Konstantinos Drossos, João Felipe Santos, Gerald Schuller, Tuomas Virtanen, Yoshua Bengio:
Monaural Singing Voice Separation with Skip-Filtering Connections and Recurrent Inference of Time-Frequency Mask. ICASSP 2018: 721-725
[c115]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/AdavannePV18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/AdavannePV18
Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Multichannel Sound Event Detection Using 3D Convolutional Neural Networks for Learning Inter-channel Features. IJCNN 2018: 1-7
[c114]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/CakirV18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/CakirV18
Emre Cakir, Tuomas Virtanen:
End-to-End Polyphonic Sound Event Detection Using Convolutional Recurrent Neural Networks with Learned Time-Frequency Representation Input. IJCNN 2018: 1-7
[c113]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/DrossosMSSVB18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/DrossosMSSVB18
Konstantinos Drossos, Stylianos Ioannis Mimilakis, Dmitriy Serdyuk, Gerald Schuller, Tuomas Virtanen, Yoshua Bengio:
MaD TwinNet: Masker-Denoiser Architecture with Twin Networks for Monaural Sound Source Separation. IJCNN 2018: 1-8
[c112]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MagronDMV18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MagronDMV18
Paul Magron, Konstantinos Drossos, Stylianos Ioannis Mimilakis, Tuomas Virtanen:
Reducing Interference with Phase Recovery in DNN-based Monaural Singing Voice Separation. INTERSPEECH 2018: 332-336
[c111]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MagronV18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MagronV18
Paul Magron, Tuomas Virtanen:
Expectation-Maximization Algorithms for Itakura-Saito Nonnegative Matrix Factorization. INTERSPEECH 2018: 856-860
[c110]
- view
  authority control:
- export record
  dblp key:
  - conf/iwaenc/ParviainenPVG18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwaenc/ParviainenPVG18
Mikko Parviainen, Pasi Pertilä, Tuomas Virtanen, Peter Grosche:
Time-Frequency Masking Strategies for Single-Channel Low-Latency Speech Enhancement Using Neural Networks. IWAENC 2018: 51-55
[c109]
- view
  authority control:
- export record
  dblp key:
  - conf/iwaenc/ZhaoHV18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwaenc/ZhaoHV18
Shuyang Zhao, Toni Heittola, Tuomas Virtanen:
An Active Learning Method Using Clustering and Committee-Based Sample Selection for Sound Event Classification. IWAENC 2018: 116-120
[c108]
- view
  authority control:
- export record
  dblp key:
  - conf/iwaenc/MagronV18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwaenc/MagronV18
Paul Magron, Tuomas Virtanen:
Towards Complex Nonnegative Matrix Factorization with the Beta-Divergence. IWAENC 2018: 156-160
[c107]
- view
  authority control:
- export record
  dblp key:
  - conf/iwaenc/HuangHV18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwaenc/HuangHV18
Guangpu Huang, Toni Heittola, Tuomas Virtanen:
Using Sequential Information in Polyphonic Sound Event Detection. IWAENC 2018: 291-295
[c106]
- view
  authority control:
- export record
  dblp key:
  - conf/iwaenc/NaithaniNBV18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwaenc/NaithaniNBV18
Gaurav Naithani, Joonas Nikunen, Lars Bramslow, Tuomas Virtanen:
Deep Neural Network Based Speech Separation Optimizing an Objective Estimator of Intelligibility for Low Latency Applications. IWAENC 2018: 386-390
[c105]
- view
  authority control:
- export record
  dblp key:
  - conf/iwaenc/MesarosHV18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwaenc/MesarosHV18
Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
Acoustic Scene Classification: An Overview of Dcase 2017 Challenge Entries. IWAENC 2018: 411-415
[c104]
- view
  authority control:
- export record
  dblp key:
  - conf/iwaenc/DrossosMMV18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwaenc/DrossosMMV18
Konstantinos Drossos, Paul Magron, Stylianos Ioannis Mimilakis, Tuomas Virtanen:
Harmonic-Percussive Source Separation with Deep Neural Networks and Phase Recovery. IWAENC 2018: 421-425
[c103]
- view
  authority control:
- export record
  dblp key:
  - conf/iwaenc/MagronV18a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwaenc/MagronV18a
Paul Magron, Tuomas Virtanen:
On Modeling the STFT Phase of Audio Signals with the Von Mises Distribution. IWAENC 2018: 550-554
[c102]
- view
  authority control:
- export record
  dblp key:
  - conf/mlsp/GharibDNSTHVH18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsp/GharibDNSTHVH18
Shayan Gharib, Honain Derrar, Daisuke Niizumi, Tuukka Senttula, Janne Tommola, Toni Heittola, Tuomas Virtanen, Heikki Huttunen:
Acoustic Scene Classification: a Competition Review. MLSP 2018: 1-6
[i25]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1801-09522
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1801-09522
Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Multichannel Sound Event Detection Using 3D Convolutional Neural Networks for Learning Inter-channel Features. CoRR abs/1801.09522 (2018)
[i24]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1802-00300
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-00300
Konstantinos Drossos, Stylianos Ioannis Mimilakis, Dmitriy Serdyuk, Gerald Schuller, Tuomas Virtanen, Yoshua Bengio:
MaD TwinNet: Masker-Denoiser Architecture with Twin Networks for Monaural Sound Source Separation. CoRR abs/1802.00300 (2018)
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1802-03156
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-03156
Paul Magron, Tuomas Virtanen:
Complex ISNMF: a Phase-Aware Model for Monaural Audio Source Separation. CoRR abs/1802.03156 (2018)
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1802-05132
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-05132
Konstantinos Drossos, Stylianos Ioannis Mimilakis, Andreas Floros, Tuomas Virtanen, Gerald Schuller:
Close Miking Empirical Practice Verification: A Source Separation Approach. CoRR abs/1802.05132 (2018)
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1805-03647
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-03647
Emre Çakir, Tuomas Virtanen:
End-to-End Polyphonic Sound Event Detection Using Convolutional Recurrent Neural Networks with Learned Time-Frequency Representation Input. CoRR abs/1805.03647 (2018)
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1807-00129
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1807-00129
Sharath Adavanne, Archontis Politis, Joonas Nikunen, Tuomas Virtanen:
Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks. CoRR abs/1807.00129 (2018)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1807-06899
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1807-06899
Gaurav Naithani, Joonas Nikunen, Lars Bramsløw, Tuomas Virtanen:
Deep neural network based speech separation optimizing an objective estimator of intelligibility for low latency applications. CoRR abs/1807.06899 (2018)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1807-09840
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1807-09840
Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
A multi-device dataset for urban acoustic scene classification. CoRR abs/1807.09840 (2018)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1807-11298
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1807-11298
Konstantinos Drossos, Paul Magron, Stylianos Ioannis Mimilakis, Tuomas Virtanen:
Harmonic-Percussive Source Separation with Deep Neural Networks and Phase Recovery. CoRR abs/1807.11298 (2018)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1808-02357
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1808-02357
Shayan Gharib, Honain Derrar, Daisuke Niizumi, Tuukka Senttula, Janne Tommola, Toni Heittola, Tuomas Virtanen, Heikki Huttunen:
Acoustic Scene Classification: A Competition Review. CoRR abs/1808.02357 (2018)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1808-05777
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1808-05777
Shayan Gharib, Konstantinos Drossos, Emre Çakir, Dmitriy Serdyuk, Tuomas Virtanen:
Unsupervised adversarial domain adaptation for acoustic scene classification. CoRR abs/1808.05777 (2018)
2017
[j24]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/RichardVBOG17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/RichardVBOG17
Gaël Richard, Tuomas Virtanen, Juan Pablo Bello, Nobutaka Ono, Hervé Glotin:
Introduction to the Special Section on Sound Scene and Event Analysis. IEEE ACM Trans. Audio Speech Lang. Process. 25(6): 1169-1171 (2017)
[j23]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/CakirPHHV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/CakirPHHV17
Emre Çakir, Giambattista Parascandolo, Toni Heittola, Heikki Huttunen, Tuomas Virtanen:
Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection. IEEE ACM Trans. Audio Speech Lang. Process. 25(6): 1291-1303 (2017)
[j22]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/DrgasVLH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/DrgasVLH17
Szymon Drgas, Tuomas Virtanen, Jörg Lücke, Antti Hurmalainen:
Binary Non-Negative Matrix Deconvolution for Audio Dictionary Learning. IEEE ACM Trans. Audio Speech Lang. Process. 25(8): 1644-1656 (2017)
[c101]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/AdavanneV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/AdavanneV17
Sharath Adavanne, Tuomas Virtanen:
Sound Event Detection Using Weakly Labeled Dataset with Stacked Convolutional and Recurrent Neural Network. DCASE 2017: 12-16
[c100]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/CakirV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/CakirV17
Emre Cakir, Tuomas Virtanen:
Convolutional Recurrent Neural Networks for Rare Sound Event Detection. DCASE 2017: 27-31
[c99]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/MesarosHDESVRV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/MesarosHDESVRV17
Annamaria Mesaros, Toni Heittola, Aleksandr Diment, Benjamin Elizalde, Ankit Shah, Emmanuel Vincent, Bhiksha Raj, Tuomas Virtanen:
DCASE2017 Challenge Setup: Tasks, Datasets and Baseline System. DCASE 2017: 85-92
[c98]
- view
  authority control:
- export record
  dblp key:
  - conf/ectel/CaballeroAKVMLV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ectel/CaballeroAKVMLV17
Daniela Caballero, Roberto Araya, Hanna Kronholm, Jouni Viiri, André Mansikkaniemi, Sami Lehesvuori, Tuomas Virtanen, Mikko Kurimo:
ASR in Classroom Today: Automatic Visualization of Conceptual Network in Science Classrooms. EC-TEL 2017: 541-544
[c97]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/NikunenV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/NikunenV17
Joonas Nikunen, Tuomas Virtanen:
Time-difference of arrival model for spherical microphone arrays and application to direction of arrival estimation. EUSIPCO 2017: 1255-1259
[c96]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/AdavanneDCV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/AdavanneDCV17
Sharath Adavanne, Konstantinos Drossos, Emre Cakir, Tuomas Virtanen:
Stacked convolutional and recurrent neural networks for bird audio detection. EUSIPCO 2017: 1729-1733
[c95]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/CakirAPDV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/CakirAPDV17
Emre Cakir, Sharath Adavanne, Giambattista Parascandolo, Konstantinos Drossos, Tuomas Virtanen:
Convolutional recurrent neural networks for bird audio detection. EUSIPCO 2017: 1744-1748
[c94]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhaoHV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhaoHV17
Shuyang Zhao, Toni Heittola, Tuomas Virtanen:
Active learning for sound event classification by clustering unlabeled data. ICASSP 2017: 751-755
[c93]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/AdavannePV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/AdavannePV17
Sharath Adavanne, Pasi Pertilä, Tuomas Virtanen:
Sound event detection using spatial features and convolutional recurrent neural network. ICASSP 2017: 771-775
[c92]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/ValentiSDPV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/ValentiSDPV17
Michele Valenti, Stefano Squartini, Aleksandr Diment, Giambattista Parascandolo, Tuomas Virtanen:
A convolutional neural network approach for acoustic scene classification. IJCNN 2017: 1547-1554
[c91]
- view
  authority control:
- export record
  dblp key:
  - conf/mlsp/MimilakisDVS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsp/MimilakisDVS17
Stylianos Ioannis Mimilakis, Konstantinos Drossos, Tuomas Virtanen, Gerald Schuller:
A recurrent encoder-decoder approach with skip-filtering connections for monaural singing voice separation. MLSP 2017: 1-6
[c90]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/DimentV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/DimentV17
Aleksandr Diment, Tuomas Virtanen:
Transfer learning of weakly labelled audio. WASPAA 2017: 6-10
[c89]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/ZhaoHV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/ZhaoHV17
Shuyang Zhao, Toni Heittola, Tuomas Virtanen:
Learning vocal mode classifiers from heterogeneous data sources. WASPAA 2017: 16-20
[c88]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/NaithaniBPBPV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/NaithaniBPBPV17
Gaurav Naithani, Tom Barker, Giambattista Parascandolo, Lars Bramslow, Niels Henrik Pontoppidan, Tuomas Virtanen:
Low latency sound source separation using convolutional recurrent neural networks. WASPAA 2017: 71-75
[c87]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/MagronRV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/MagronRV17
Paul Magron, Jonathan Le Roux, Tuomas Virtanen:
Consistent anisotropic Wiener filtering for audio source separation. WASPAA 2017: 269-273
[c86]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/MesarosHV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/MesarosHV17
Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
Assessment of human and machine performance in acoustic scene classification: Dcase 2016 case study. WASPAA 2017: 319-323
[c85]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/DrossosAV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/DrossosAV17
Konstantinos Drossos, Sharath Adavanne, Tuomas Virtanen:
Automated audio captioning with recurrent neural networks. WASPAA 2017: 374-378
[e3]
- view
- export record
  dblp key:
  - conf/dcase/2017
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/2017
Tuomas Virtanen, Annamaria Mesaros, Toni Heittola, Aleksandr Diment, Emmanuel Vincent, Emmanouil Benetos, Benjamin Elizalde:
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, DCASE 2017, Munich, Germany, November 16-17, 2017. 2017, ISBN 978-952-15-4042-4 [contents]
[d1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - data/10/MesarosHVBLLFP17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/10/MesarosHVBLLFP17
Annamaria Mesaros, Toni Heittola, Tuomas Virtanen, Emmanouil Benetos, Mathieu Lagrange, Grégoire Lafay, Peter Foster, Mark D. Plumbley:
DCASE2016 Challenge Submissions Package. Zenodo, 2017
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/CakirPHHV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/CakirPHHV17
Emre Çakir, Giambattista Parascandolo, Toni Heittola, Heikki Huttunen, Tuomas Virtanen:
Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection. CoRR abs/1702.06286 (2017)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/CakirAPDV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/CakirAPDV17
Emre Çakir, Sharath Adavanne, Giambattista Parascandolo, Konstantinos Drossos, Tuomas Virtanen:
Convolutional Recurrent Neural Networks for Bird Audio Detection. CoRR abs/1703.02317 (2017)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/AdavanneDCV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/AdavanneDCV17
Sharath Adavanne, Konstantinos Drossos, Emre Çakir, Tuomas Virtanen:
Stacked Convolutional and Recurrent Neural Networks for Bird Audio Detection. CoRR abs/1706.02047 (2017)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/AdavannePV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/AdavannePV17
Sharath Adavanne, Pasi Pertilä, Tuomas Virtanen:
Sound Event Detection Using Spatial Features and Convolutional Recurrent Neural Network. CoRR abs/1706.02291 (2017)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/MalikADVTJ17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/MalikADVTJ17
Miroslav Malik, Sharath Adavanne, Konstantinos Drossos, Tuomas Virtanen, Dasa Ticha, Roman Jarina:
Stacked Convolutional and Recurrent Neural Networks for Music Emotion Recognition. CoRR abs/1706.02292 (2017)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/AdavannePPHV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/AdavannePPHV17
Sharath Adavanne, Giambattista Parascandolo, Pasi Pertilä, Toni Heittola, Tuomas Virtanen:
Sound Event Detection in Multichannel Audio Using Spatial and Harmonic Features. CoRR abs/1706.02293 (2017)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/DrossosAV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/DrossosAV17
Konstantinos Drossos, Sharath Adavanne, Tuomas Virtanen:
Automated Audio Captioning with Recurrent Neural Networks. CoRR abs/1706.10006 (2017)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1709-00611
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1709-00611
Stylianos Ioannis Mimilakis, Konstantinos Drossos, Tuomas Virtanen, Gerald Schuller:
A Recurrent Encoder-Decoder Approach with Skip-filtering Connections for Monaural Singing Voice Separation. CoRR abs/1709.00611 (2017)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1710-02997
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1710-02997
Sharath Adavanne, Tuomas Virtanen:
A report on sound event detection with different binaural features. CoRR abs/1710.02997 (2017)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1710-02998
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1710-02998
Sharath Adavanne, Tuomas Virtanen:
Sound event detection using weakly labeled dataset with stacked convolutional and recurrent neural network. CoRR abs/1710.02998 (2017)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1710-10005
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1710-10005
Joonas Nikunen, Aleksandr Diment, Tuomas Virtanen:
Separation of Moving Sound Sources Using Multichannel NMF and Acoustic Tracking. CoRR abs/1710.10005 (2017)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1710-10059
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1710-10059
Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Direction of arrival estimation for multiple sound sources using convolutional recurrent neural network. CoRR abs/1710.10059 (2017)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1711-01437
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1711-01437
Stylianos Ioannis Mimilakis, Konstantinos Drossos, João Felipe Santos, Gerald Schuller, Tuomas Virtanen, Yoshua Bengio:
Monaural Singing Voice Separation with Skip-Filtering Connections and Recurrent Inference of Time-Frequency Mask. CoRR abs/1711.01437 (2017)
2016
[j21]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/NikunenDVV16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/NikunenDVV16
Joonas Nikunen, Aleksandr Diment, Tuomas Virtanen, Miikka Vilermo:
Binaural rendering of microphone array captures based on source separation. Speech Commun. 76: 157-169 (2016)
[j20]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/BarkerV16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/BarkerV16
Tom Barker, Tuomas Virtanen:
Blind Separation of Audio Mixtures Through Nonnegative Tensor Factorization of Modulation Spectrograms. IEEE ACM Trans. Audio Speech Lang. Process. 24(12): 2377-2389 (2016)
[c84]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/AdavannePPHV16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/AdavannePPHV16
Sharath Adavanne, Giambattista Parascandolo, Pasi Pertilä, Toni Heittola, Tuomas Virtanen:
Sound Event Detection in Multichannel Audio Using Spatial and Harmonic Features. DCASE 2016: 6-10
[c83]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/ValentiDPSV16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/ValentiDPSV16
Michele Valenti, Aleksandr Diment, Giambattista Parascandolo, Stefano Squartini, Tuomas Virtanen:
DCASE 2016 Acoustic Scene Classification Using Convolutional Neural Networks. DCASE 2016: 95-99
[c82]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/MesarosHV16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/MesarosHV16
Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
TUT database for acoustic scene classification and sound event detection. EUSIPCO 2016: 1128-1132
[c81]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/MahkonenHVK16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/MahkonenHVK16
Katariina Mahkonen, Antti Hurmalainen, Tuomas Virtanen, Joni-Kristian Kamarainen:
Cascade processing for speeding up sliding window sparse classification. EUSIPCO 2016: 2305-2309
[c80]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/DimentPVZG16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/DimentPVZG16
Aleksandr Diment, Mikko Parviainen, Tuomas Virtanen, Roman Zelov, Alex Glasman:
Noise-robust detection of whispering in telephone calls using deep neural networks. EUSIPCO 2016: 2310-2314
[c79]
- view
  authority control:
- export record
  dblp key:
  - conf/globalsip/NaithaniPBPV16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/globalsip/NaithaniPBPV16
Gaurav Naithani, Giambattista Parascandolo, Tom Barker, Niels Henrik Pontoppidan, Tuomas Virtanen:
Low-latency sound source separation using deep neural networks. GlobalSIP 2016: 272-276
[c78]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ParascandoloHV16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ParascandoloHV16
Giambattista Parascandolo, Heikki Huttunen, Tuomas Virtanen:
Recurrent neural networks for polyphonic sound event detection in real life recordings. ICASSP 2016: 6440-6444
[c77]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/CakirOV16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/CakirOV16
Emre Cakir, Ezgi Can Ozan, Tuomas Virtanen:
Filterbank learning for deep neural network based polyphonic sound event detection. IJCNN 2016: 3399-3406
[e2]
- view
- export record
  dblp key:
  - conf/dcase/2016
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/2016
Tuomas Virtanen, Annamaria Mesaros, Toni Heittola, Mark D. Plumbley, Peter Foster, Emmanouil Benetos, Mathieu Lagrange:
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, DCASE 2016, Budapest, Hungary, September 3, 2016. 2016, ISBN 978-952-15-3807-0 [contents]
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/ParascandoloHV16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/ParascandoloHV16
Giambattista Parascandolo, Heikki Huttunen, Tuomas Virtanen:
Recurrent Neural Networks for Polyphonic Sound Event Detection in Real Life Recordings. CoRR abs/1604.00861 (2016)
2015
[j19]
- view
  authority control:
- export record
  dblp key:
  - journals/dsp/SimsekliVC15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/dsp/SimsekliVC15
Umut Simsekli, Tuomas Virtanen, Ali Taylan Cemgil:
Non-negative tensor factorization models for Bayesian audio processing. Digit. Signal Process. 47: 178-191 (2015)
[j18]
- view
  authority control:
- export record
  dblp key:
  - journals/spm/VirtanenGRS15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spm/VirtanenGRS15
Tuomas Virtanen, Jort Florent Gemmeke, Bhiksha Raj, Paris Smaragdis:
Compositional Models for Audio Processing: Uncovering the structure of sound mixtures. IEEE Signal Process. Mag. 32(2): 125-144 (2015)
[j17]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/BabyVGh15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/BabyVGh15
Deepak Baby, Tuomas Virtanen, Jort F. Gemmeke, Hugo Van hamme:
Coupled Dictionaries for Exemplar-Based Speech Enhancement and Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 23(11): 1788-1799 (2015)
[c76]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/DimentCHV15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/DimentCHV15
Aleksandr Diment, Emre Cakir, Toni Heittola, Tuomas Virtanen:
Automatic recognition of environmental sound events using all-pole group delay features. EUSIPCO 2015: 729-733
[c75]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/CakirHHV15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/CakirHHV15
Emre Cakir, Toni Heittola, Heikki Huttunen, Tuomas Virtanen:
Multi-label vs. combined single-label sound event detection with deep neural networks. EUSIPCO 2015: 2551-2555
[c74]
- view
  authority control:
- export record
  dblp key:
  - conf/ica/DrgasV15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ica/DrgasV15
Szymon Drgas, Tuomas Virtanen:
Speaker Verification Using Adaptive Dictionaries in Non-negative Spectrogram Deconvolution. LVA/ICA 2015: 462-469
[c73]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MesarosHDV15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MesarosHDV15
Annamaria Mesaros, Toni Heittola, Onur Dikmen, Tuomas Virtanen:
Sound event detection in real life recordings using coupled matrix factorization of spectral representations and class activity annotations. ICASSP 2015: 151-155
[c72]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BarkerVP15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BarkerVP15
Tom Barker, Tuomas Virtanen, Niels Henrik Pontoppidan:
Low-latency sound-source-separation using non-negative matrix factorisation with coupled analysis and synthesis dictionaries. ICASSP 2015: 241-245
[c71]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HurmalainenSV15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HurmalainenSV15
Antti Hurmalainen, Rahim Saeidi, Tuomas Virtanen:
Similarity induced group sparsity for non-negative matrix factorisation. ICASSP 2015: 4425-4429
[c70]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BabyGVh15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BabyGVh15
Deepak Baby, Jort F. Gemmeke, Tuomas Virtanen, Hugo Van hamme:
Exemplar-based speech enhancement for deep neural network based automatic speech recognition. ICASSP 2015: 4485-4489
[c69]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/CakirHHV15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/CakirHHV15
Emre Cakir, Toni Heittola, Heikki Huttunen, Tuomas Virtanen:
Polyphonic sound event detection using multi label deep neural networks. IJCNN 2015: 1-7
[c68]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HurmalainenSV15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HurmalainenSV15
Antti Hurmalainen, Rahim Saeidi, Tuomas Virtanen:
Noise robust speaker recognition with convolutive sparse coding. INTERSPEECH 2015: 244-248
[c67]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/DimentV15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/DimentV15
Aleksandr Diment, Tuomas Virtanen:
Archetypal analysis for audio dictionary learning. WASPAA 2015: 1-5
2014
[j16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/HeittolaMKEV14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/HeittolaMKEV14
Toni Heittola, Annamaria Mesaros, Dani Korpi, Antti J. Eronen, Tuomas Virtanen:
Method for creating location-specific audio textures. EURASIP J. Audio Speech Music. Process. 2014: 9 (2014)
[j15]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/NikunenV14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/NikunenV14
Joonas Nikunen, Tuomas Virtanen:
Direction of Arrival Based Spatial Covariance Model for Blind Sound Source Separation. IEEE ACM Trans. Audio Speech Lang. Process. 22(3): 727-739 (2014)
[j14]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/WuVCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WuVCL14
Zhizheng Wu, Tuomas Virtanen, Engsiong Chng, Haizhou Li:
Exemplar-Based Sparse Representation With Residual Compensation for Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 22(10): 1506-1521 (2014)
[c66]
- view
  authority control:
- export record
  dblp key:
  - conf/accv/MahkonenKV14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/accv/MahkonenKV14
Katariina Mahkonen, Joni-Kristian Kämäräinen, Tuomas Virtanen:
Lifelog Scene Change Detection Using Cascades of Audio and Video Detectors. ACCV Workshops (3) 2014: 434-444
[c65]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/GencogluVH14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/GencogluVH14
Oguzhan Gencoglu, Tuomas Virtanen, Heikki Huttunen:
Recognition of acoustic events using deep neural networks. EUSIPCO 2014: 506-510
[c64]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BarkerVD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BarkerVD14
Tom Barker, Tuomas Virtanen, Olivier Delhomme:
Ultrasound-coupled semi-supervised nonnegative matrix factorisation for speech enhancement. ICASSP 2014: 2129-2133
[c63]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BabyVBh14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BabyVBh14
Deepak Baby, Tuomas Virtanen, Tom Barker, Hugo Van hamme:
Coupled dictionary training for exemplar-based speech enhancement. ICASSP 2014: 2883-2887
[c62]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/VirtanenRGh14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/VirtanenRGh14
Tuomas Virtanen, Bhiksha Raj, Jort F. Gemmeke, Hugo Van hamme:
Active-set newton algorithm for non-negative sparse coding of audio. ICASSP 2014: 3092-3096
[c61]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NikunenV14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NikunenV14
Joonas Nikunen, Tuomas Virtanen:
Multichannel audio separation by direction of arrival based spatial covariance model and non-negative matrix factorization. ICASSP 2014: 6677-6681
[c60]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/BarkerV14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/BarkerV14
Tom Barker, Tuomas Virtanen:
Semi-supervised non-negative tensor factorisation of modulation spectrograms for monaural speech separation. IJCNN 2014: 3556-3561
[c59]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BarkerhV14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BarkerhV14
Tom Barker, Hugo Van hamme, Tuomas Virtanen:
Modelling primitive streaming of simple tone sequences through factorisation of modulation pattern tensors. INTERSPEECH 2014: 1371-1375
[c58]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/BabyVGBh14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/BabyVGBh14
Deepak Baby, Tuomas Virtanen, Jort F. Gemmeke, Tom Barker, Hugo Van hamme:
Exemplar-based noise robust automatic speech recognition using modulation spectrogram features. SLT 2014: 519-524
2013
[j13]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/HurmalainenGV13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/HurmalainenGV13
Antti Hurmalainen, Jort F. Gemmeke, Tuomas Virtanen:
Modelling non-stationary noise with spectral factorisation in automatic speech recognition. Comput. Speech Lang. 27(3): 763-779 (2013)
[j12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/HeittolaMEV13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/HeittolaMEV13
Toni Heittola, Annamaria Mesaros, Antti J. Eronen, Tuomas Virtanen:
Context-dependent sound event detection. EURASIP J. Audio Speech Music. Process. 2013: 1 (2013)
[j11]
- view
  authority control:
- export record
  dblp key:
  - journals/puc/KorpiHPEMV13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/puc/KorpiHPEMV13
Dani Korpi, Toni Heittola, Timo Partala, Antti J. Eronen, Annamaria Mesaros, Tuomas Virtanen:
On the human ability to discriminate audio ambiances from similar locations of an urban environment. Pers. Ubiquitous Comput. 17(4): 761-769 (2013)
[j10]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/VirtanenGR13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/VirtanenGR13
Tuomas Virtanen, Jort Florent Gemmeke, Bhiksha Raj:
Active-Set Newton Algorithm for Overcomplete Non-Negative Representations of Audio. IEEE Trans. Speech Audio Process. 21(11): 2277-2289 (2013)
[c57]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/HurmalainenV13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/HurmalainenV13
Antti Hurmalainen, Tuomas Virtanen:
Learning state labels for sparse classification of speech with matrix deconvolution. ASRU 2013: 168-173
[c56]
- view
  authority control:
- export record
  dblp key:
  - conf/cmmr/DimentRHV13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cmmr/DimentRHV13
Aleksandr Diment, Padmanabhan Rajan, Toni Heittola, Tuomas Virtanen:
Group Delay Function from All-Pole Models for Musical Instrument Recognition. CMMR 2013: 606-618
[c55]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/DimentHV13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/DimentHV13
Aleksandr Diment, Toni Heittola, Tuomas Virtanen:
Semi-supervised learning for musical instrument recognition. EUSIPCO 2013: 1-5
[c54]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/HurmalainenV13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/HurmalainenV13
Antti Hurmalainen, Tuomas Virtanen:
Acquiring variable length speech bases for factorisation-based noise robust speech recognition. EUSIPCO 2013: 1-5
[c53]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GemmekeVD13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GemmekeVD13
Jort F. Gemmeke, Tuomas Virtanen, Kris Demuynck:
Exemplar-based joint channel and noise compensation. ICASSP 2013: 868-872
[c52]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HeittolaMVG13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HeittolaMVG13
Toni Heittola, Annamaria Mesaros, Tuomas Virtanen, Moncef Gabbouj:
Supervised model training for overlapping sound events based on unsupervised source separation. ICASSP 2013: 8677-8681
[c51]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BarkerV13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BarkerV13
Tom Barker, Tuomas Virtanen:
Non-negative tensor factorisation of modulation spectrograms for monaural sound source separation. INTERSPEECH 2013: 827-831
[c50]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuVKCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuVKCL13
Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, Engsiong Chng, Haizhou Li:
Exemplar-based unit selection for voice conversion utilizing temporal information. INTERSPEECH 2013: 3057-3061
[c49]
- view
  authority control:
- export record
  dblp key:
  - conf/mlsp/BriggsHRELCHHBFINTFTNNHRMDVMDCHLM13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsp/BriggsHRELCHHBFINTFTNNHRMDVMDCHLM13
Forrest Briggs, Yonghong Huang, Raviv Raich, Konstantinos Eftaxias, Zhong Lei, William Cukierski, Sarah Frey Hadley, Adam Hadley, Matthew Betts, Xiaoli Z. Fern, Jed Irvine, Lawrence Neal, Anil Thomas, Gábor Fodor, Grigorios Tsoumakas, Hong Wei Ng, Thi Ngoc Tho Nguyen, Heikki Huttunen, Pekka Ruusuvuori, Tapio Manninen, Aleksandr Diment, Tuomas Virtanen, Julien Marzat, Joseph Defretin, Dave Callender, Chris Hurlburt, Ken Larrey, Maxim Milakov:
The 9th annual MLSP competition: New methods for acoustic classification of multiple simultaneous bird species in a noisy environment. MLSP 2013: 1-8
[c48]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/ssw/WuVKCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ssw/WuVKCL13
Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, Eng Siong Chng, Haizhou Li:
Exemplar-based voice conversion using non-negative spectrogram deconvolution. SSW 2013: 201-206
[c47]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/KauppinenKV13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/KauppinenKV13
Joonas Kauppinen, Anssi Klapuri, Tuomas Virtanen:
Music self-similarity modeling using augmented nonnegative matrix factorization of block and stripe patterns. WASPAA 2013: 1-4
2012
[j9]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/HelanderSVG12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/HelanderSVG12
Elina Helander, Hanna Silén, Tuomas Virtanen, Moncef Gabbouj:
Voice Conversion Using Dynamic Kernel Partial Least Squares Regression. IEEE Trans. Speech Audio Process. 20(3): 806-817 (2012)
[c46]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/NikunenVPV12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/NikunenVPV12
Joonas Nikunen, Tuomas Virtanen, Pasi Pertilä, Miikka Vilermo:
Permutation alignment of frequency-domain ICA by the maximization of intra-source envelope correlations. EUSIPCO 2012: 1489-1493
[c45]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/HurmalainenGV12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/HurmalainenGV12
Antti Hurmalainen, Jort F. Gemmeke, Tuomas Virtanen:
Detection, separation and recognition of speech from continuous signals using spectral factorisation. EUSIPCO 2012: 2649-2653
[c44]
- view
  authority control:
- export record
  dblp key:
  - conf/ica/Rodriguez-SerranoCVVR12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ica/Rodriguez-SerranoCVVR12
Francisco J. Rodríguez-Serrano, Julio J. Carabias-Orti, Pedro Vera-Candeas, Tuomas Virtanen, Nicolás Ruiz-Reyes:
Multiple Instrument Mixtures Source Separation Evaluation Using Instrument-Dependent NMF Models. LVA/ICA 2012: 380-387
[c43]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HurmalainenV12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HurmalainenV12
Antti Hurmalainen, Tuomas Virtanen:
Modelling spectro-temporal dynamics in factorisation-based noise-robust automatic speech recognition. ICASSP 2012: 4113-4116
[c42]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WeningerWGSGHVR12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WeningerWGSGHVR12
Felix Weninger, Martin Wöllmer, Jürgen T. Geiger, Björn W. Schuller, Jort F. Gemmeke, Antti Hurmalainen, Tuomas Virtanen, Gerhard Rigoll:
Non-negative matrix factorization for highly noise-robust ASR: To enhance or to recognize? ICASSP 2012: 4681-4684
[c41]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HurmalainenSV12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HurmalainenSV12
Antti Hurmalainen, Rahim Saeidi, Tuomas Virtanen:
Group Sparsity for Speaker Identity Discrimination in Factorisation-based Speech Recognition. INTERSPEECH 2012: 2138-2141
[c40]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/interspeech/Virtanen12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Virtanen12
Tuomas Virtanen:
Human sound perception - what can we learn from it when developing audio analysis algorithms? SAPA@INTERSPEECH 2012
[c39]
- view
  authority control:
- export record
  dblp key:
  - conf/isccsp/RadV12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/isccsp/RadV12
Ali Bahrami Rad, Tuomas Virtanen:
Phase spectrum prediction of audio signals. ISCCSP 2012: 1-5
[c38]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/odyssey/SaeidiHVL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/SaeidiHVL12
Rahim Saeidi, Antti Hurmalainen, Tuomas Virtanen, David A. van Leeuwen:
Exemplar-based sparse representation and sparse discrimination for noise robust speaker identification. Odyssey 2012: 248-255
[p3]
- view
  authority control:
- export record
  dblp key:
  - books/wi/12/VirtanenSR12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/books/wi/12/VirtanenSR12
Tuomas Virtanen, Rita Singh, Bhiksha Raj:
Introduction. Techniques for Noise Robustness in Automatic Speech Recognition 2012: 1-5
[p2]
- view
  authority control:
- export record
  dblp key:
  - books/wi/12/SinghRV12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/books/wi/12/SinghRV12
Rita Singh, Bhiksha Raj, Tuomas Virtanen:
The Basics of Automatic Speech Recognition. Techniques for Noise Robustness in Automatic Speech Recognition 2012: 7-30
[p1]
- view
  authority control:
- export record
  dblp key:
  - books/wi/12/RajVS12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/books/wi/12/RajVS12
Bhiksha Raj, Tuomas Virtanen, Rita Singh:
The Problem of Robustness in Automatic Speech Recognition. Techniques for Noise Robustness in Automatic Speech Recognition 2012: 31-50
[e1]
- view
  authority control:
- export record
  dblp key:
  - books/wi/12/VSR2012
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/books/wi/12/VSR2012
Tuomas Virtanen, Rita Singh, Bhiksha Raj:
Techniques for Noise Robustness in Automatic Speech Recognition. Wiley 2012, ISBN 978-1-119-97088-0 [contents]
2011
[j8]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/Carabias-OrtiVVRC11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/Carabias-OrtiVVRC11
Julio J. Carabias-Orti, Tuomas Virtanen, Pedro Vera-Candeas, Nicolás Ruiz-Reyes, Francisco J. Cañadas-Quesada:
Musical Instrument Sound Multi-Excitation Model for Non-Negative Spectrogram Factorization. IEEE J. Sel. Top. Signal Process. 5(6): 1144-1158 (2011)
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/GemmekeVH11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/GemmekeVH11
Jort F. Gemmeke, Tuomas Virtanen, Antti Hurmalainen:
Exemplar-Based Sparse Representations for Noise Robust Automatic Speech Recognition. IEEE Trans. Speech Audio Process. 19(7): 2067-2080 (2011)
[c37]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/GemmekeHVS11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/GemmekeHVS11
Jort F. Gemmeke, Antti Hurmalainen, Tuomas Virtanen, Yang Sun:
Toward a practical implementation of exemplar-based noise robust ASR. EUSIPCO 2011: 1490-1494
[c36]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HurmalainenGV11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HurmalainenGV11
Antti Hurmalainen, Jort F. Gemmeke, Tuomas Virtanen:
Non-negative matrix deconvolution in noise robust speech recognition. ICASSP 2011: 4588-4591
[c35]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MahkonenHVG11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MahkonenHVG11
Katariina Mahkonen, Antti Hurmalainen, Tuomas Virtanen, Jort F. Gemmeke:
Mapping Sparse Representation to State Likelihoods in Noise-Robust Automatic Speech Recognition. INTERSPEECH 2011: 465-468
[c34]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KallasjokiRGVP11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KallasjokiRGVP11
Heikki Kallasjoki, Ulpu Remes, Jort F. Gemmeke, Tuomas Virtanen, Kalle J. Palomäki:
Uncertainty Measures for Improving Exemplar-Based Source Separation. INTERSPEECH 2011: 469-472
[c33]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RajSV11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RajSV11
Bhiksha Raj, Rita Singh, Tuomas Virtanen:
Phoneme-Dependent NMF for Speech Enhancement in Monaural Mixtures. INTERSPEECH 2011: 1217-1220
[c32]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/NikunenVV11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/NikunenVV11
Joonas Nikunen, Tuomas Virtanen, Miikka Vilermo:
Multichannel audio upmixing based on non-negative tensor factorization representation. WASPAA 2011: 33-36
2010
[j6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/HelenV10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/HelenV10
Marko Leonard Helén, Tuomas Virtanen:
Audio Query by Example Using Similarity Measures between Probability Density Functions of Features. EURASIP J. Audio Speech Music. Process. 2010 (2010)
[j5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/MesarosV10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/MesarosV10
Annamaria Mesaros, Tuomas Virtanen:
Automatic Recognition of Lyrics in Singing. EURASIP J. Audio Speech Music. Process. 2010 (2010)
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/KlapuriV10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/KlapuriV10
Anssi Klapuri, Tuomas Virtanen:
Representing Musical Sounds With an Interpolating State Model. IEEE Trans. Speech Audio Process. 18(3): 613-624 (2010)
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/HelanderVNG10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/HelanderVNG10
Elina Helander, Tuomas Virtanen, Jani Nurminen, Moncef Gabbouj:
Voice Conversion Using Partial Least Squares Regression. IEEE Trans. Speech Audio Process. 18(5): 912-921 (2010)
[c31]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/MesarosHEV10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/MesarosHEV10
Annamaria Mesaros, Toni Heittola, Antti J. Eronen, Tuomas Virtanen:
Acoustic event detection in real life recordings. EUSIPCO 2010: 1267-1271
[c30]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/HeittolaMEV10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/HeittolaMEV10
Toni Heittola, Annamaria Mesaros, Antti J. Eronen, Tuomas Virtanen:
Audio context recognition using audio event histograms. EUSIPCO 2010: 1272-1276
[c29]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/KeronenRPVK10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/KeronenRPVK10
Sami Keronen, Ulpu Remes, Kalle J. Palomäki, Tuomas Virtanen, Mikko Kurimo:
Comparison of noise robust methods in large vocabulary speech recognition. EUSIPCO 2010: 1973-1977
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NikunenV10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NikunenV10
Joonas Nikunen, Tuomas Virtanen:
Noise-to-mask ratio minimization by weighted non-negative matrix factorization. ICASSP 2010: 25-28
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MesarosV10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MesarosV10
Annamaria Mesaros, Tuomas Virtanen:
Recognition of phonemes and words in singing. ICASSP 2010: 2146-2149
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GemmekeV10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GemmekeV10
Jort F. Gemmeke, Tuomas Virtanen:
Noise robust exemplar-based connected digit recognition. ICASSP 2010: 4546-4549
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KlapuriVH10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KlapuriVH10
Anssi Klapuri, Tuomas Virtanen, Toni Heittola:
Sound source separation in monaural music signals using excitation-filter model and em algorithm. ICASSP 2010: 5510-5513
[c24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RajVCS10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RajVCS10
Bhiksha Raj, Tuomas Virtanen, Sourish Chaudhuri, Rita Singh:
Non-negative matrix factorization based compensation of music for automatic speech recognition. INTERSPEECH 2010: 717-720
[c23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/VirtanenGH10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VirtanenGH10
Tuomas Virtanen, Jort F. Gemmeke, Antti Hurmalainen:
State-based labelling for a sparse representation of speech and its application to robust speech recognition. INTERSPEECH 2010: 893-896
[c22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GemmekeV10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GemmekeV10
Jort F. Gemmeke, Tuomas Virtanen:
Artificial and online acquired noise dictionaries for noise robust ASR. INTERSPEECH 2010: 2082-2085

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2009
[c21]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/MesarosV09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/MesarosV09
Annamaria Mesaros, Tuomas Virtanen:
Adaptation of a speech recognizer for singing voice. EUSIPCO 2009: 1779-1783
[c20]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/Virtanen09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/Virtanen09
Tuomas Virtanen:
Spectral covariance in prior distributions of non-negative matrix factorization based speech separation. EUSIPCO 2009: 1933-1937
[c19]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/MyllymakiV09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/MyllymakiV09
Mikko Myllymäki, Tuomas Virtanen:
Non-stationary noise model compensation in voice activity detection. EUSIPCO 2009: 2186-2190
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/VirtanenH09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/VirtanenH09
Tuomas Virtanen, Toni Heittola:
Interpolating hidden Markov model and its application to automatic instrument recognition. ICASSP 2009: 49-52
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/ida/VirtanenC09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ida/VirtanenC09
Tuomas Virtanen, Ali Taylan Cemgil:
Mixtures of Gamma Priors for Non-negative Matrix Factorization Based Speech Separation. ICA 2009: 646-653
[c16]
- view
  - electronic edition @ ismir.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/ismir/HeittolaKV09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ismir/HeittolaKV09
Toni Heittola, Anssi Klapuri, Tuomas Virtanen:
Musical Instrument Recognition in Polyphonic Audio Using Source-Filter Model for Sound Separation. ISMIR 2009: 327-332
2008
[c15]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/MyllymakiV08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/MyllymakiV08
Mikko Myllymäki, Tuomas Virtanen:
Voice activity detection in the presence of breathing noise using neural network and hidden Markov model. EUSIPCO 2008: 1-5
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/VirtanenCG08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/VirtanenCG08
Tuomas Virtanen, Ali Taylan Cemgil, Simon J. Godsill:
Bayesian extensions to non-negative matrix factorisation for audio signal modelling. ICASSP 2008: 1825-1828
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/RyynanenVPK08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/RyynanenVPK08
Matti Ryynänen, Tuomas Virtanen, Jouni Paulus, Anssi Klapuri:
Accompaniment separation and karaoke application based on automatic melody transcription. ICME 2008: 1417-1420
[c12]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/interspeech/VirtanenMR08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VirtanenMR08
Tuomas Virtanen, Annamaria Mesaros, Matti Ryynänen:
Combining pitch-based inference and non-negative spectrogram factorization in separating vocals from polyphonic music. SAPA@INTERSPEECH 2008: 17-22
2007
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/Virtanen07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/Virtanen07
Tuomas Virtanen:
Monaural Sound Source Separation by Nonnegative Matrix Factorization With Temporal Continuity and Sparseness Criteria. IEEE Trans. Speech Audio Process. 15(3): 1066-1074 (2007)
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HelenV07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HelenV07
Marko Leonard Helén, Tuomas Virtanen:
Query by Example of Audio Signals using Euclidean Distance Between Gaussian Mixture Models. ICASSP (1) 2007: 225-228
[c10]
- view
  - electronic edition @ ismir.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/ismir/MesarosVK07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ismir/MesarosVK07
Annamaria Mesaros, Tuomas Virtanen, Anssi Klapuri:
Singer Identification in Polyphonic Music Using Vocal Separation and Pattern Recognition Methods. ISMIR 2007: 375-378
2006
[c9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Virtanen06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Virtanen06
Tuomas Virtanen:
Speech recognition using factorial hidden Markov models for separation in the feature space. INTERSPEECH 2006
2005
[c8]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/HelenV05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/HelenV05
Marko Leonard Helén, Tuomas Virtanen:
Separation of drums from polyphonic music using non-negative matrix factorization and support vector machine. EUSIPCO 2005: 1-4
[c7]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/KlapuriVH05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/KlapuriVH05
Anssi Klapuri, Tuomas Virtanen, Marko Leonard Helén:
Modeling musical sounds with an interpolating state model. EUSIPCO 2005: 1-4
[c6]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/PaulusV05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/PaulusV05
Jouni Paulus, Tuomas Virtanen:
Drum transcription with non-negative spectrogram factorisation. EUSIPCO 2005: 1-4
2004
[c5]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/interspeech/Virtanen04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Virtanen04
Tuomas Virtanen:
Separation of sound sources by convolutive sparse coding. SAPA@INTERSPEECH 2004: 55
2003
[c4]
- view
  - electronic edition via handle.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icmc/Virtanen03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmc/Virtanen03
Tuomas Virtanen:
Sound Source Separation Using Sparse Coding with Temporal Continuity Objective. ICMC 2003
2002
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/VirtanenK02
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/VirtanenK02
Tuomas Virtanen, Anssi Klapuri:
Separation of harmonic sounds using linear models for the overtone series. ICASSP 2002: 1757-1760
2000
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/cmpb/JakobKRVKT00
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/cmpb/JakobKRVKT00
Stephan M. Jakob, Ilkka Korhonen, Esko Ruokonen, Tuomas Virtanen, Alex Kogan, Jukka Takala:
Detection of artifacts in monitored trends in intensive care. Comput. Methods Programs Biomed. 63(3): 203-209 (2000)
[c2]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/SillanpaaKSV00
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/SillanpaaKSV00
Jukka Sillanpaa, Anssi Klapuri, Jarno Seppänen, Tuomas Virtanen:
Recognition of acoustic noise mixtures by combined bottom-up and top-down processing. EUSIPCO 2000: 1-4
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/VirtanenK00
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/VirtanenK00
Tuomas Virtanen, Anssi Klapuri:
Separation of harmonic sound sources using sinusoidal modeling. ICASSP 2000: 765-768

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.