default search action
22nd ISMIR 2021: Online
- Jin Ha Lee, Alexander Lerch, Zhiyao Duan, Juhan Nam, Preeti Rao, Peter van Kranenburg, Ajay Srinivasamurthy:
Proceedings of the 22nd International Society for Music Information Retrieval Conference, ISMIR 2021, Online, November 7-12, 2021. 2021, ISBN 978-1-7327299-0-2
Papers
- Rohit M. A., Amitrajit Bhattacharjee, Preeti Rao:
Four-way Classification of Tabla Strokes with Models Adapted from Automatic Drum Transcription. 19-26 - Taketo Akama:
A Contextual Latent Space Model: Subsequence Modulation in Melodic Sequence. 27-34 - María Alfaro-Contreras, David Rizo, José M. Iñesta, Jorge Calvo-Zaragoza:
OMR-assisted transcription: a case study with early prints. 35-41 - Stefan Andreas Baumann:
Deeper Convolutional Neural Networks and Broad Augmentation Policies Improve Performance in Musical Key Estimation. 42-49 - Axel Berndt:
The Music Performance Markup Format and Ecosystem. 50-57 - Louis Bigo, David Regnier, Nicolas Martin:
Identification of rhythm guitar sections in symbolic tablatures. 58-65 - Charles Brazier, Gerhard Widmer:
On-Line Audio-to-Lyrics Alignment Based on a Reference Performance. 66-73 - Aaron Carter-Enyi, Gilad Rabinovitch, Nathaniel Condit-Schultz:
Visualizing Intertextual Form with Arc Diagrams: Contour and Schema-based Methods. 74-80 - Francisco J. Castellanos, Antonio Javier Gallego, Jorge Calvo-Zaragoza:
Unsupervised Domain Adaptation for Document Analysis of Music Score Images. 81-87 - Rodrigo Castellon, Chris Donahue, Percy Liang:
Codified audio language modeling learns useful representations for music information retrieval. 88-96 - Chin-Jui Chang, Chun-Yi Lee, Yi-Hsuan Yang:
Variable-Length Music Score Infilling via XLNet and Musically Specialized Positional Encoding. 97-104 - Yi-Wei Chen, Hung-Shin Lee, Yen-Hsing Chen, Hsin-Min Wang:
SurpriseNet: Melody Harmonization Conditioning on User-controlled Surprise Contours. 105-112 - Vincent K. M. Cheung, Hsuan-Kai Kao, Li Su:
Semi-supervised violin fingering generation using variational autoencoders. 113-120 - Keunwoo Choi, Yuxuan Wang:
Listen, Read, and Identify: Multimodal Singing Language Identification of Music. 121-127 - Shreyan Chowdhury, Gerhard Widmer:
On Perceived Emotion in Expressive Piano Performance: Further Experimental Evidence for the Relevance of Mid-level Perceptual Features. 128-134 - Bas Cornelissen, Willem H. Zuidema, John Ashley Burgoyne:
Cosine Contours: a Multipurpose Representation for Melodies. 135-142 - Shuqi Dai, Zeyu Jin, Celso Gomes, Roger B. Dannenberg:
Controllable deep melody generation via hierarchical music structure representation. 143-150 - Emir Demirel, Sven Ahlbäck, Simon Dixon:
MSTRE-Net: Multistreaming Acoustic Modeling for Automatic Lyrics Transcription. 151-158 - Hao-Wen Dong, Chris Donahue, Taylor Berg-Kirkpatrick, Julian J. McAuley:
Towards Automatic Instrumentation by Learning to Separate Parts in Symbolic Multitrack Music. 159-166 - Sachinda Edirisooriya, Hao-Wen Dong, Julian J. McAuley, Taylor Berg-Kirkpatrick:
An Empirical Evaluation of End-to-End Polyphonic Optical Music Recognition. 167-173 - Anders Elowsson, Olivier Lartillot:
A Hardanger Fiddle Dataset with Performances Spanning Emotional Expressions and Annotations Aligned using Image Registration. 174-181 - Jeffrey Ens, Philippe Pasquier:
Building the MetaMIDI Dataset: Linking Symbolic and Audio Musical Data. 182-188 - Christoph Finkensiep, Martin Rohrmeier:
Modeling and Inferring Proto-Voice Structure in Free Polyphony. 189-196 - Francesco Foscarin, Nicolas Audebert, Raphaël Fournier-S'niehotta:
PKSpell: Data-Driven Pitch Spelling and Key Signature Estimation. 197-204 - Dave Foster, Simon Dixon:
Filosax: A Dataset of Annotated Jazz Saxophone Recordings. 205-212 - Giovanni Gabbolini, Derek Bridge:
An interpretable music similarity measure based on path interestingness. 213-219 - Hugo Flores García, Aldo Aguilar, Ethan Manilow, Bryan Pardo:
Leveraging Hierarchical Structures for Few-Shot Musical Instrument Recognition. 220-228 - Mark Gotham, Rainer Kleinertz, Christof Weiss, Meinard Müller, Stephanie Klauk:
What if the 'When' Implies the 'What'?: Human harmonic analysis datasets clarify the relative role of the separate steps in automatic tonal analysis. 229-236 - Juan Sebastián Gómez Cañón, Estefanía Cano, Yi-Hsuan Yang, Perfecto Herrera, Emilia Gómez:
Let's agree to disagree: Consensus Entropy Active Learning for Personalized Music Emotion Recognition. 237-245 - Curtis Hawthorne, Ian Simon, Rigel Swavely, Ethan Manilow, Jesse H. Engel:
Sequence-to-Sequence Piano Transcription with Transformers. 246-253 - Ben Hayes, Charalampos Saitis, György Fazekas:
Neural Waveshaping Synthesis. 254-261 - Johannes Hentschel, Fabian C. Moss, Markus Neuwirth, Martin Rohrmeier:
A semi-automated workflow paradigm for the distributed creation and curation of expert annotations. 262-269 - Mojtaba Heydari, Frank Cwitkowitz, Zhiyao Duan:
BeatNet: CRNN and Particle Filtering for Online Joint Beat, Downbeat and Meter Tracking. 270-277 - Yuki Hiramatsu, Eita Nakamura, Kazuyoshi Yoshii:
Joint Estimation of Note Values and Voices for Audio-to-Score Piano Transcription. 278-284 - Yo-Wei Hsiao, Li Su:
Learning note-to-note affinity for voice segregation and melody line identification of symbolic music data. 285-292 - Jui-Yang Hsu, Li Su:
VOCANO: A note transcription framework for singing voice in polyphonic music. 293-300 - Rujing Stacy Huang, Bob L. T. Sturm, Andre Holzapfel:
De-centering the West: East Asian Philosophies and the Ethics of Applying Artificial Intelligence to Music. 301-309 - Tun-Min Hung, Bo-Yu Chen, Yen-Tung Yeh, Yi-Hsuan Yang:
A Benchmarking Initiative for Audio-domain Music Generation using the FreeSound Loop Dataset. 310-317 - Hsiao-Tzu Hung, Joann Ching, Seungheon Doh, Nabin Kim, Juhan Nam, Yi-Hsuan Yang:
EMOPIA: A Multi-Modal Pop Piano Dataset For Emotion Recognition and Emotion-based Music Generation. 318-325 - Kevin Ji, Daniel Yang, Timothy Tsai:
Piano Sheet Music Identification Using Marketplace Fingerprinting. 326-333 - Keunhyoung Luke Kim, Jongpil Lee, Sangeun Kum, Juhan Nam:
Learning a cross-domain embedding space of vocal and mixed audio with a structure-preserving triplet loss. 334-341 - Qiuqiang Kong, Yin Cao, Haohe Liu, Keunwoo Choi, Yuxuan Wang:
Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation. 342-349 - Filip Korzeniowski, Sergio Oramas, Fabien Gouyon:
Artist Similarity Using Graph Neural Networks. 350-357 - Jin Ha Lee, Arpita Bhattacharya, Ria Antony, Nicole K. Santero, Anh Le:
â??Finding Homeâ?: Understanding How Music Supports Listenersâ?? Mental Health through a Case Study of BTS. 358-365 - Harin Lee, Frank Höger, Marc Schönwiesner, Minsu Park, Nori Jacoby:
Cross-cultural Mood Perception in Pop Songs and its Alignment with Mood Detection Algorithms. 366-373 - Jordan Lenchitz:
Reconsidering quantization in MIR. 374-380 - Liwei Lin, Gus Xia, Qiuqiang Kong, Junyan Jiang:
A unified model for zero-shot music source separation, transcription and synthesis. 381-388 - Carlos Lordelo, Emmanouil Benetos, Simon Dixon, Sven Ahlbäck:
Pitch-Informed Instrument Assignment using a Deep Convolutional Network with Multiple Kernel Shapes. 389-395 - Wei Tsung Lu, Ju-Chiang Wang, Minz Won, Keunwoo Choi, Xuchen Song:
SpecTNT: a Time-Frequency Transformer for Music Audio. 396-403 - Néstor Nápoles López, Mark Gotham, Ichiro Fujinaga:
AugmentedNet: A Roman Numeral Analysis Network with Synthetic Training Examples and Additional Tonal Tasks. 404-411 - Vincenzo Madaghiele, Pasquale Lisena, Raphaël Troncy:
MINGUS: Melodic Improvisation Neural Generator Using Seq2Seq. 412-419 - Ninon Lizé Masclef, Andrea Vaglio, Manuel Moussallam:
User-centered evaluation of lyrics-to-audio alignment. 420-427 - Naotake Masuda, Daisuke Saito:
Synthesizer Sound Matching with Differentiable DSP. 428-434 - Andrew McLeod, Martin Rohrmeier:
A Modular System for the Harmonic Analysis of Musical Scores using a Large Vocabulary. 435-442 - Gianluca Micchi, Katerina Kosta, Gabriele Medeot, Pierre Chanquion:
A deep learning method for enforcing coherence in Automatic Chord Recognition. 443-451 - Martin Miguel, Diego Fernández Slezak:
Modeling beat uncertainty as a 2D distribution of period and phase: a MIR task proposal. 452-459 - Olof Misgeld, Torbjörn Gulz, Jura Miniotaite, Andre Holzapfel:
A case study of deep enculturation and sensorimotor synchronization to real music. 460-467 - Gautam Mittal, Jesse H. Engel, Curtis Hawthorne, Ian Simon:
Symbolic Music Generation with Diffusion Models. 468-475 - Faraaz Nadeem:
Learning from Musical Feedback with Sonic the Hedgehog. 476-483 - Javier Nistal, Stefan Lattner, Gaël Richard:
DarkGAN: Exploiting Knowledge Distillation for Comprehensible Audio Synthesis With GANs. 484-492 - Takehisa Oyama, Ryoto Ishizuka, Kazuyoshi Yoshii:
Phase-Aware Joint Beat and Downbeat Estimation Based on Periodicity of Metrical Structure. 493-499 - Yuto Ozaki, John M. McBride, Emmanouil Benetos, Peter Q. Pfordresher, Joren Six, Adam Tierney, Polina Proutskova, Emi Sakai, Haruka Kondo, Haruno Fukatsu, Shinya Fujii, Patrick E. Savage:
Agreement Among Human and Automated Transcriptions of Global Songs. 500-508 - Emilia Parada-Cabaleiro, Maximilian Schmitt, Anton Batliner, Björn W. Schuller, Markus Schedl:
Automatic Recognition of Texture in Renaissance Music. 509-516 - Ashis Pati, Alexander Lerch:
Is Disentanglement enough? On Latent Representations for Controllable Music Generation. 517-524 - Nicolás Pironio, Diego Fernández Slezak, Martin Miguel:
Pulse clarity metrics developed from a deep learning beat tracking model. 525-530 - Verena Praher, Katharina Prinz, Arthur Flexer, Gerhard Widmer:
On the Veracity of Local, Model-agnostic Explanations in Audio Classification: Targeted Investigations with Adversarial Examples. 531-538 - Laure Prétet, Gaël Richard, Geoffroy Peeters:
Is there a "language of music-video clips" ? A qualitative and quantitative study. 539-546 - R. Gowriprasad, V. Venkatesh, Hema A. Murthy, R. Aravind, K. Sri Rama Murty:
Tabla Gharana Recognition from Audio music recordings of Tabla Solo performances. 547-554 - Lindsey Reymore, Emmanuelle Beauvais-Lacasse, Bennett Smith, Stephen McAdams:
Navigating noise: Modeling perceptual correlates of noise-related semantic timbre categories with audio features. 555-561 - Kyle Robinson, Dan Brown:
Quantitative User Perceptions of Music Recommendation List Diversity. 562-568 - Martin Rohrmeier, Fabian C. Moss:
A Formal Model of Extended Tonal Harmony. 569-578 - Simon Rouard, Gaëtan Hadjeres:
CRASH: Raw Audio Score-based Generative Modeling for Controllable High-resolution Drum Sound Synthesis. 579-585 - Luke O. Rowe, George Tzanetakis:
Curriculum Learning for Imbalanced Classification in Large Vocabulary Automatic Chord Recognition. 586-593 - Justin Salamon, Oriol Nieto, Nicholas J. Bryan:
Deep Embeddings and Section Fusion Improve Music Segmentation. 594-601 - Antonia Saravanou, Federico Tomasi, Rishabh Mehrotra, Mounia Lalmas:
Multi-Task Learning of Graph-based Inductive Representations of Music Content. 602-609 - Pedro Sarmento, Adarsh Kumar, CJ Carr, Zack Zukowski, Mathieu Barthet, Yi-Hsuan Yang:
DadaGP: A Dataset of Tokenized GuitarPro Songs for Sequence Models. 610-617 - Harald Victor Schweiger, Emilia Parada-Cabaleiro, Markus Schedl:
Does Track Sequence in User-generated Playlists Matter?. 618-625 - Simon J. Schwär, Sebastian Rosenzweig, Meinard Müller:
A Differentiable Cost Measure for Intonation Processing in Polyphonic Music. 626-633 - Pavan Seshadri, Alexander Lerch:
Improving Music Performance Assessment With Contrastive Learning. 634-641 - Dougal Shakespeare, Camille Roth:
Tracing Affordance and Item Adoption on Music Streaming Platforms. 642-649 - Zhengshan Shi:
Computational analysis and modeling of expressive timing in Chopin's Mazurkas. 650-656 - Nithya Nadig Shikarpur, Asawari Keskar, Preeti Rao:
Computational analysis of melodic mode switching in raga performance. 657-664 - Qingwei Song, Qiwei Sun, Dongsheng Guo, Haiyong Zheng:
SinTra: Learning an inspiration model from a single multi-track music segment. 665-672 - Janne Spijkervet, John Ashley Burgoyne:
Contrastive Learning of Musical Representations. 673-681 - Xiaoheng Sun, Qiqi He, Yongwei Gao, Wei Li:
Musical Tempo Estimation Using a Multi-scale Network. 682-689 - Pau Torras, Arnau Baró, Lei Kang, Alicia Fornés:
On the Integration of Language Models into Sequence to Sequence Architectures for Handwritten Music Recognition. 690-696 - Kosetsu Tsukuda, Keisuke Ishida, Masahiro Hamasaki, Masataka Goto:
Kiite Cafe: A Web Service for Getting Together Virtually to Listen to Music. 697-704 - Kosetsu Tsukuda, Masahiro Hamasaki, Masataka Goto:
Toward an Understanding of Lyrics-viewing Behavior While Listening to Music on a Smartphone. 705-713 - Andrea Vaglio, Romain Hennequin, Manuel Moussallam, Gaël Richard:
The Words Remain the Same: Cover Detection with Lyrics Transcription. 714-721 - Ziyu Wang, Gus Xia:
MuseBERT: Pre-training Music Representation for Music Understanding and Controllable Generation. 722-729 - Ju-Chiang Wang, Jordan B. L. Smith, Wei Tsung Lu, Xuchen Song:
Supervised Metric Learning For Music Structure Features. 730-737 - Shiqi Wei, Gus Xia:
Learning long-term music representations via hierarchical contextual constraints. 738-745 - Christof Weiss, Johannes Zeitler, Tim Zunner, Florian Schuberth, Meinard Müller:
Learning Pitch-Class Representations from Score-Audio Pairs of Classical Music. 746-753 - Christof Weiss, Geoffroy Peeters:
Training Deep Pitch-Class Representations With a Multi-Label CTC Loss. 754-761 - Daniel Wolff, Rémi Mignot, Axel Roebel:
Audio Defect Detection in Music with Deep Networks. 762-768 - Minz Won, Keunwoo Choi, Xavier Serra:
Semi-supervised Music Tagging Transformer. 769-776 - Minz Won, Justin Salamon, Nicholas J. Bryan, Gautham J. Mysore, Xavier Serra:
Emotion Embedding Spaces for Matching Music to Stories. 777-785 - Abudukelimu Wuerkaixi, Christodoulos Benetatos, Zhiyao Duan, Changshui Zhang:
CollageNet: Fusing arbitrary melody and accompaniment into a coherent song. 786-793 - Kazuhiko Yamamoto:
Human-in-the-Loop Adaptation for Interactive Musical Beat Tracking. 794-801 - Daniel Yang, Timothy Tsai:
Composer Classification With Cross-Modal Transfer Learning and Musically-Informed Augmentation. 802-809 - Daniel Yang, Kevin Ji, Timothy Tsai:
Aligning Unsynchronized Part Recordings to a Full Mix Using Iterative Subtractive Alignment. 810-817 - Mickaël Zehren, Marco Alunno, Paolo Bientinesi:
ADTOF: A large dataset of non-synthetic music for automatic drum transcription. 818-824 - Huan Zhang, Yiliang Jiang, Tao Jiang, Hu Peng:
Learn by Referencing: Towards Deep Metric Learning for Singing Assessment. 825-832 - Jingwei Zhao, Gus Xia:
AccoMontage: Accompaniment Arrangement via Phrase Selection and Style Transfer. 833-840
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.