


default search action
26th ISMIR 2025: Daejeon, South Korea
- Juhan Nam, Dasaem Jeong, Keunwoo Choi, Li Su, Magdalena Fuentes, Tomoyasu Nakano, Xiao Hu, Hao-Wen (Herman) Dong:

Proceedings of the 26th International Society for Music Information Retrieval Conference, ISMIR 2025, Daejeon, South Korea, September 21-25, 2025. 2025, ISBN 978-1-7327299-5-7 - Harin Lee, Elif Celen, Peter M. C. Harrison, Manuel Anglada-Tort, Pol van Rijn, Minsu Park, Marc Schönwiesner, Nori Jacoby:

GlobalMood: A Cross-Cultural Benchmark for Music Emotion Recognition. 11-19 - Alexander Wang, Chris Donahue, Dhruv Jain:

RISE: Music Rearrangement for Realtime Intensity Synchronization With Exercise. 20-27 - Lidia J. Morris, Michele Newman, Xinya Tang, Renee Singh, Marcel A. Vélez Vásquez, Rebecca Leger, Jin Ha Lee:

Expanding the HAISP Dataset: AI's Impact on Songwriting Across Two AI Song Contests. 28-35 - Brian McFee:

Quantifying Regularity in Music Structure Analysis. 36-43 - Eunjin Choi, Hyerin Kim, Jiwoo Ryu, Juhan Nam, Dasaem Jeong:

On the De-Duplication of the Lakh MIDI Dataset. 44-51 - Matteo Pettenò, Alessandro Ilic Mezza, Alberto Bernardini:

Conditional Diffusion as Latent Constraints for Unconditional Symbolic Music Generation Models. 52-59 - Maziar Kanani, Seán O'Leary, James McDermott:

Radif Corpus; Symbolic Dataset for Non-Metric Iranian Classical Music. 60-67 - Yash Bhake, Ankit Anand, Preeti Rao:

Melodic and Metrical Elements of Expressiveness in Hindustani Vocal Music. 68-74 - Takayuki Nakatsuka, Masahiro Hamasaki, Masataka Goto:

Coloring Music: Bridging Music and Color Palettes for Graphic Design. 75-82 - Patricia Hu, Silvan Peter, Jan Schlüter, Gerhard Widmer:

Exploring Network Adaptations for Minimum Latency Real-Time Piano Transcription. 83-90 - Jiyun Park, Carlos Eduardo Cancino-Chacón, Suhit Chiruthapudi, Juhan Nam:

A Systematic Evaluation of Real-Time Audio Score Following for Piano Performance. 91-99 - Jaeran Choi, Taegyun Kwon, Juhan Nam:

Predicting Flutist Onset Timing in Duet Performance: A Multimodal Analysis of Gesture and Breath Cues. 100-106 - Markus Frohmann, Elena V. Epure, Gabriel Meseguer-Brocal, Markus Schedl, Romain Hennequin:

AI-Generated Song Detection via Lyrics Transcripts. 107-116 - Simon J. Schwär, Stefan Balke, Meinard Müller:

Measuring Sensory Dissonance In Multi-Track Music Recordings: A Case Study With Wind Quartets. 117-126 - Johannes Zeitler, Meinard Müller:

Reformulating Soft Dynamic Time Warping: Insights Into Target Artifacts and Prediction Quality. 127-133 - Junghyun Koo, Marco A. Martínez Ramírez, Wei-Hsiang Liao, Giorgio Fabbro, Michele Mancusi, Yuki Mitsufuji:

ITO-Master: Inference-Time Optimization for Audio Effects Modeling of Music Mastering Processors. 134-141 - Pascal Schmolenzky, Stephanie Klauk, Rainer Kleinertz, Christof Weiß, Meinard Müller:

A Multidimensional Approach to Opera Analysis: Harmony, Tempo, and Dramatic Interaction in Wagner's Siegfried Act III. 142-149 - Meng Yang, Jon McCormack, Maria Teresa Llano, Wanchao Su:

Exploring the Feasibility of LLMs for Automated Music Emotion Annotation. 150-157 - Yiwei Ding, Yannik Venohr, Christof Weiß:

An Evaluation Strategy for Local Key Estimation: Exploiting Cross-Version Consistency. 158-165 - Hans-Ulrich Berendes, Ben Maman, Meinard Müller:

Tuning Matters: Analyzing Musical Tuning Bias in Neural Vocoders. 166-173 - Yichen Huang, Zachary Novack, Koichi Saito, Jiatong Shi, Shinji Watanabe, Yuki Mitsufuji, John Thickstun, Chris Donahue:

Aligning Text-to-Music Evaluation With Human Preferences. 174-181 - Oleg Lesota, Anna Hausberger, Ivanna Pshenychna, Oleksandr Shvydanenko, Olha Yehorova, Markus Schedl:

Investigating Music Track Liking in the Halo of Album Covers. 182-189 - Hilda Romero-Velo, Gilberto Bernardes, Susana Ladra, José R. Paramá, Fernando Silva-Coira:

Phylo-Analysis of Folk Traditions: A Methodology for the Hierarchical Musical Similarity Analysis. 190-197 - Ching-Yu Chiu, Sebastian Strahl, Meinard Müller:

dPLP: A Differentiable Version of Predominant Local Pulse Estimation. 198-205 - Guillem Cortès-Sebastià, Benjamin Martin, Emilio Molina, Xavier Serra, Romain Hennequin:

PeakNetFP: Peak-Based Neural Audio Fingerprinting Robust to Extreme Time Stretching. 206-214 - Weihan Xu, Julian J. McAuley, Taylor Berg-Kirkpatrick, Shlomo Dubnov, Hao-Wen Dong:

Generating Symbolic Music From Natural Language Prompts Using an LLM-Enhanced Dataset. 215-222 - Zhaokai Wang, Chenxi Bao, Le Zhuo, Jingrui Han, Yang Yue, Yihong Tang, Victor Shea-Jay Huang, Yue Liao:

A Survey on Vision-to-Music Generation: Methods, Datasets, Evaluation, and Challenges. 223-234 - Yuexuan Kong, Gabriel Meseguer-Brocal, Vincent Lostanlen, Mathieu Lagrange, Romain Hennequin:

Emergent Musical Properties of a Transformer Under Contrastive Self-Supervised Learning. 235-246 - Yongyi Zang, Sean O'Brien, Taylor Berg-Kirkpatrick, Julian J. McAuley, Zachary Novack:

Are You Really Listening? Boosting Perceptual Awareness in Music-QA Benchmarks. 247-261 - Julien Guinot, Elio Quinton, George Fazekas:

GD-Retriever: Controllable Generative Text-Music Retrieval With Diffusion Models. 262-270 - Yannik Venohr, Yiwei Ding, Christof Weiß:

Towards Robust Automatic Music Transcription By Measuring Cross-Version Consistency. 271-278 - Roman B. Gebhardt, Arne Kuhle, Eylül Bektur:

Beyond Genre: Diagnosing Bias in Music Embeddings Using Concept Activation Vectors. 279-286 - Tom Baker, Javier Nistal:

LiLAC: A Lightweight Latent ControlNet for Musical Audio Generation. 287-295 - Zakaria Hassein-Bey, Yohann Abbou, Alexandre D'Hooge, Mathieu Giraud, Gilles Guillemain, Aurélien Jeanneau:

What Song Now? Personalized Rhythm Guitar Learning in Western Popular Music. 296-302 - Charilaos Papaioannou, Emmanouil Benetos, Alexandros Potamianos:

Universal Music Representations? Evaluating Foundation Models on World Music Corpora. 303-311 - Martin Rohrmeier:

A Theoretical Model of Musical Form. 312-319 - António Pinto:

Towards Human-in-the-Loop Onset Detection: A Transfer Learning Approach for Maracatu. 320-327 - Yixiao Zhang, Yukara Ikemiya, Woosung Choi, Naoki Murata, Marco A. Martínez Ramírez, Liwei Lin, Gus Xia, Wei-Hsiang Liao, Yuki Mitsufuji, Simon Dixon:

Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning. 328-336 - Qi He, Ziyu Wang, Gus Xia:

TOMI: Transforming and Organizing Music Ideas for Multi-Track Compositions With Full-Song Structure. 337-345 - Ziyu Wang, Yuxuan Wu, Roger B. Dannenberg, Gus Xia:

Automatic Melody Reduction via Shortest Path Finding. 346-353 - Fathinah Asma Izzati, Xinyue Li, Gus Xia:

Expotion: Facial Expression and Motion Control for Multimodal Music Generation. 354-362 - Patrice Thibaud, Mathieu Giraud, Yann Teytaut:

When Voices Interleave: Timing Deviations in Six Performances of Telemann's Fantasias for Solo Flute. 363-372 - Ben Hayes, Charalampos Saitis, György Fazekas:

Audio Synthesizer Inversion in Symmetric Parameter Spaces With Approximately Equivariant Flow Matching. 373-381 - Julien Guinot, Alain Riou, Elio Quinton, George Fazekas:

SLAP: Siamese Language-Audio Pretraining Without Negative Samples for Music Understanding. 382-390 - Hayeon Bang, Eunjin Choi, Seungheon Doh, Juhan Nam:

PianoBind: A Multi-Modal Joint Embedding Model for Pop-Piano Music. 391-398 - Recep Oguz Araz, Guillem Cortès-Sebastià, Emilio Molina, Joan Serrà, Xavier Serra, Yuki Mitsufuji, Dmitry Bogdanov:

Enhancing Neural Audio Fingerprint Robustness to Audio Degradation for Music Identification. 399-406 - Jonathan Myers, Dard Neuman:

Beyond Notation: A Digital Platform for Transcribing and Analyzing Oral Melodic Traditions. 407-415 - Yinghao Ma, Siyou Li, Juntao Yu, Emmanouil Benetos, Akira Maezawa:

CMI-Bench: A Comprehensive Benchmark for Evaluating Music Instruction Following. 416-425 - Qingyang Xi, Brian McFee:

Lose the Frames: Exact Metrics for More Responsible Music Structure Analysis Evaluations. 426-432 - Marco Pasini, Stefan Lattner, George Fazekas:

Unifying Continuous and Discrete Compressed Representations of Audio. 433-441 - Jun-You Wang, Li Su:

Improving BERT for Symbolic Music Understanding Using Token Denoising and Pianoroll Prediction. 442-450 - Louis Bradshaw, Alexander Spangher, Honglu Fan, Stella Biderman, Simon Colton:

Scaling Self-Supervised Representation Learning for Symbolic Piano Performance. 451-459 - Patrick O'Reilly, Julia Barnett, Hugo Flores García, Annie Chu, Nathan Pruyne, Prem Seetharaman, Bryan Pardo:

The Rhythm In Anything: Audio-Prompted Drums Generation With Masked Language Modeling. 460-468 - Jonathan Yaffe, Ben Maman, Meinard Müller, Amit Bermano:

Count the Notes: Histogram-Based Supervision for Automatic Music Transcription. 469-476 - Sebastian Murgul, Johannes Schimper, Michael Heizmann:

Joint Transcription of Acoustic Guitar Strumming Directions and Chords. 477-483 - Alia Morsi, Suhit Chiruthapudi, Silvan Peter, Ivan Pilkov, Laura Bishop, Akira Maezawa, Xavier Serra, Carlos Eduardo Cancino-Chacón:

Enabling Empirical Analysis of Piano Performance Rehearsal With the Rach3 MIDI Dataset. 484-491 - Andrea Poltronieri, Xavier Serra, Martín Rocamora:

From Discord to Harmony: Consonance-Based Smoothing for Improved Audio Chord Estimation. 492-502 - Peter van Kranenburg, Gerben Bisschop:

Keyboard Temperament Estimation From Symbolic Data: A Case Study on Bach's Well-Tempered Clavier. 503-510 - Aditya Bhattacharjee, Ivan Meresman Higgs, Mark Sandler, Emmanouil Benetos:

Refining Music Sample Identification With a Self-Supervised Graph Neural Network. 511-517 - Haven Kim, Zachary Novack, Weihan Xu, Julian J. McAuley, Hao-Wen Dong:

Video-Guided Text-to-Music Generation Using Public Domain Movie Collections. 518-527 - Yonghyun Kim, Junhyung Park, Joonhyung Bae, Kirak Kim, Taegyun Kwon, Alexander Lerch, Juhan Nam:

PianoVAM: A Multimodal Piano Performance Dataset. 528-535 - Davide Marincione, Giorgio Strano, Donato Crisostomi, Roberto Ribuoli, Emanuele Rodolà:

LoopGen: Training-Free Loopable Music Generation. 536-546 - Oleg Lesota, Veronica Clavijo, Attia Rizwani, Markus Schedl, Bruce Ferwerda:

Enhancing Music Recommender Systems With Multimedia Content: A Context-Aware Approach. 547-554 - Angelos-Nikolaos Kanatas, Charilaos Papaioannou, Alexandros Potamianos:

CultureMERT: Continual Pre-Training for Cross-Cultural Music Representation Learning. 555-564 - Xiaoxuan Wang, Martin Rohrmeier:

Adaptive Path of Prediction: An Unsupervised Method for Modeling Note-Level Informational Hierarchy of Polyphony. 565-572 - Junyan Jiang, Daniel Chin, Xuanjie Liu, Liwei Lin, Gus Xia:

Versatile Music-for-Music Modeling via Function Alignment. 573-581 - Philipp Weyers, Christian Uhle, Meinard Müller, Matthias Lang:

Understanding Performance Limitations in Automatic Drum Transcription. 582-588 - Hanwen Zhang, Kun Fang, Ziyu Wang, Ichiro Fujinaga:

High-Resolution Sustain Pedal Depth Estimation From Piano Audio Across Room Acoustics. 589-595 - Frank Cwitkowitz, Zhiyao Duan:

Investigating an Overfitting and Degeneration Phenomenon in Self-Supervised Multi-Pitch Estimation. 596-603 - Juan C. Martinez-Sevilla, Joan Cerveto-Serrano, Noelia N. Luna-Barahona, Greg Chapman, Craig Sapp, David Rizo, Jorge Calvo-Zaragoza:

Sheet Music Benchmark: Standardized Optical Music Recognition Evaluation. 604-611 - Yen-Tung Yeh, Junghyun Koo, Marco A. Martínez Ramírez, Wei-Hsiang Liao, Yi-Hsuan Yang, Yuki Mitsufuji:

Fx-Encoder++: Extracting Instrument-Wise Audio Effect Representations From Mixtures. 612-622 - Jingjing Tang, Xin Wang, Zhe Zhang, Junichi Yamagishi, Geraint A. Wiggins, George Fazekas:

MIDI-VALLE: Improving Expressive Piano Performance Synthesis Through Neural Codec Language Modelling. 623-630 - Manuel Müllerschön, Anssi Klapuri, Marcelo Rodriguez, Christian Cardin:

Playability Prediction in Digital Guitar Learning Using Interpretable Student and Song Representations. 631-637 - Vojtech Lanz, Jan Hajic jr.:

Gregorian Melody, Modality, and Memory: Segmenting Chant With Bayesian Nonparametrics. 638-646 - Hitoshi Suda, Junya Koguchi, Shunsuke Yoshida, Tomohiko Nakamura, Satoru Fukayama, Jun Ogata:

IdolSongsJp Corpus: A Multi-Singer Song Corpus in the Style of Japanese Idol Groups. 647-654 - Jackson Loth, Pedro Sarmento, Saurjya Sarkar, Zixun Guo, Mathieu Barthet, Mark Sandler:

GOAT: A Large Dataset of Paired Guitar Audio Recordings and Tablatures. 655-662 - Giorgio Strano, Chiara Ballanti, Donato Crisostomi, Michele Mancusi, Luca Cosmo, Emanuele Rodolà:

STAGE: Stemmed Accompaniment Generation Through Prefix-Based Conditioning. 663-670 - Richa Namballa, Agnieszka Roginska, Magdalena Fuentes:

Do Music Source Separation Models Preserve Spatial Information in Binaural Audio?. 671-678 - Mathias Rose Bjare, Stefan Lattner, Gerhard Widmer:

Estimating Musical Surprisal From Audio in Autoregressive Diffusion Model Noise Spaces. 679-687 - David Marttila, Joshua D. Reiss:

Improving Neural Pitch Estimation With SWIPE Kernels. 688-695 - Juan Carlos Martinez-Sevilla, Francesco Foscarin, Patricia García-Iasci, David Rizo, Jorge Calvo-Zaragoza, Gerhard Widmer:

Optical Music Recognition of Jazz Lead Sheets. 696-702 - Juan Pedro Martinez-Esteso, Alejandro Galán-Cuenca, Carlos Pérez-Sancho, Francisco J. Castellanos, Antonio Javier Gallego:

Human Vs. Machine: Comparing Selection Strategies in Active Learning for Optical Music Recognition. 703-709 - Haokun Tian, Stefan Lattner, Charalampos Saitis:

Assessing the Alignment of Audio Representations With Timbre Similarity Ratings. 710-718 - Filip Korzeniowski, Richard Vogl:

Simple and Effective Semantic Song Segmentation. 719-726 - Roser Batlle-Roca, Laura Ibáñez-Martínez, Xavier Serra, Emilia Gómez, Martín Rocamora:

MusGO: A Community-Driven Framework for Assessing Openness in Music-Generative AI. 727-738 - Darius Afchar, Gabriel Meseguer-Brocal, Kamil Akesbi, Romain Hennequin:

A Fourier Explanation of AI-Music Artifacts. 739-746 - Simon Librický, Jan Hajic jr.:

Modeling the Difficulty of Saxophone Music. 747-754 - Lancelot Blanchard, Perry Naseck, Stephen Brade, Kimaya Lecamwasam, Jordan Rudess, Cheng-Zhi Anna Huang, Joseph A. Paradiso:

The Jam_bot, a Real-Time System for Collaborative Free Improvisation With Music Language Models. 755-762 - Marcel A. Vélez Vásquez, Mariëlle Baelemans, Jonathan Driedger, John Ashley Burgoyne:

Fretboardflow: A Dual-Model Approach to Optimize Chord Voicings on the Guitar Fretboard. 763-770 - Tao-Tao He, Martin E. Malandro, Douglas Shadle:

The Florence Price Art Song Dataset and Piano Accompaniment Generator. 771-778 - Sarah Nabi, Nils Demerlé, Geoffroy Peeters, Frédéric Bevilacqua, Philippe Esling:

Adding Temporal Musical Controls on Top of Pretrained Generative Models. 779-786 - Jaehun Kim, Matthew C. McCallum, Andreas F. Ehmann:

Quantize & Factorize: A Fast Yet Effective Unsupervised Audio Representation Without Deep Learning. 787-796 - Parampreet Singh, Adwik Gupta, Aakarsh Mishra, Vipul Arora:

Identification and Clustering of Unseen Ragas in Indian Art Music. 797-804 - Yuxuan Liu, Peihong Zhang, Rui Sang, Zhixin Li, Shengchen Li:

MAIA: An Inpainting-Based Approach for Music Adversarial Attacks. 805-812 - Sunyoo Kim, Yunjeong Choi, Doyeon Lee, Seoyoung Lee, Eunyi Lyou, Seungju Kim, Junhyug Noh, Joonseok Lee:

Joint Object Detection and Sound Source Separation. 813-820 - Yutong Wen, Minje Kim, Paris Smaragdis:

User-Guided Generative Source Separation. 821-829 - Genís Plaja-Roglans, Xavier Serra, Martín Rocamora:

Singing Voice Separation From Carnatic Music Mixtures Using a Regression-Guided Latent Diffusion Model. 830-838 - Saurjya Sarkar, Victoria Moomijan, Basil Woods, Emmanouil Benetos, Mark Sandler:

Looking Beyond Averaged Metrics in Music Source Separation. 839-846 - Omar Eldeeb, Martin E. Malandro:

Barwise Section Boundary Detection in Symbolic Music Using Convolutional Neural Networks. 847-854

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














