Publications

Conference presentations, posters, journal articles

Tovstogan P., Serra X., and Bogdanov D. (2022). Similarity of Nearest-Neighbor Query Results in Deep Latent Spaces. 19th Sound & Music Computing Conference (SMC 2022), Saint-Etienne, France.

Tovstogan P., Serra X., and Bogdanov D. (2022). Visualization of Deep Audio Embeddings for Music Exploration and Rediscovery. 19th Sound & Music Computing Conference (SMC 2022), Saint-Etienne, France.

Delgado A., Demirel E., Subramanian V., Saitis C., and Sandler M. (2022). Deep Embeddings for Robust User-Based Amateur Vocal Percussion Classification. Sound and Music Computing conference 2022, Saint Etienne, France.

Delgado A., Saitis C., Benetos E., and Sandler M. (2022). Deep Conditional Representation Learning for Drum Sample Retrieval by Vocalisation. Under Review.

Yesiler F., Miron, M., Serrà, J., and Gómez E. (2022). Assessing Algorithmic Biases for Musical Version Identification. 15th ACM International Conference on Web Search and Data Mining (WSDM 2022), Online.

Schulze-Forster K., Doire C., Richard G., and Badeau R. (2022). Unsupervised Audio Source Separation Using Differentiable Parametric Source Models. arXiv.

Agrawal R., Wolff D. and Dixon S. (2021). A Covolutional-Atentional Famework for Sructure-Aware Performance-Score Synchronization. IEEE Signal Processing Letters, December 2021.

Li Y., Demirel E. Proutskova P., and Dixon S. (2021). Phoneme-Informed Note Segmentation of Monophonic Vocal Music. 2nd Workshop on NLP for Music and Audio (NLP4MusA 2021), Online.

Brazier C., and Widmer G. (2021). Improving Real-time Score Following in Opera by Combining Music with Lyrics Tracking. 2nd Workshop on NLP for Music and Audio (NLP4MusA 2021), Online.

Demirel E. Ahlback S., and Dixon S. (2021). MSTRE-Net: Multistreaming Acoustic Modeling for Automatic Lyrics Transcription. 22nd International Society for Music Information Retrieval (ISMIR 2021) Conference, Online.

Lordelo C., Benetos E., Dixon S., and Ahlbäck S. (2021). Pitch-Informed Instrument Assignment Using a Deep Convolutional Network with Multiple Kernel Shapes. 22nd International Society for Music Information Retrieval (ISMIR 2021) Conference, Online.

Brazier C., and Widmer G. (2021). On-Line Audio-to-Lyrics Alignment Based on a Reference Performance. 22nd International Society for Music Information Retrieval (ISMIR 2021) Conference, Online.

Yesiler F., Doras, G., Tralie, C. J., Bittner, R. M., and Serrà, J. (2021). Audio-based Musical Version Identification: Elements and Challenges. IEEE Signal Processing Magazine, Vol. 38, Issue 6, 2021.

Cantisani G., Ozerov A., Essid S., and Richard G. (2021). User-guided one-shot deep model adaptation for music source separation. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2021), New Paltz, NY, USA.

Delgado A., Saitis C., and Sandler M. (2021). Phoneme Mappings for Online Vocal Percussion Transcription. 151st Convention of the Audio Engineering Society, Online.

Demirel E. Ahlback S., and Dixon S. (2021). Computational Pronunciation Analysis in Sung Utterances. 29th European Signal Processing Conference (EUSIPCO 2021), Dublin, Ireland.

Carvalho L., and Widmer G. (2021). Exploiting Temporal Dependencies for Cross-Modal Music Piece Identification. 29th European Signal Processing Conference (EUSIPCO 2021), Dublin, Ireland.

Brazier C., and Widmer G. (2021). Handling Structural Mismatches in Real-time Opera Tracking. 29th European Signal Processing Conference (EUSIPCO 2021), Dublin, Ireland.

Liutkus A., Cífka O., Wu S.-L., Şimşekli U., Yang Y.-H., and Richard G. (2021). Relative Positional Encoding for Transformers with Linear Complexity. 38th International Conference on Machine Learning (ICML 2021), Virtual.

Roma G., Font, F., Ramires A. (2021). Floop Jam. Web Audio Conference 2021, Online.

Delgado A., McDonald S., Xu N., Saitis C., and Sandler M., (2021). Learning Models for Query by Vocal Percussion: A Comparative Study. Proceedings of the 46th International Computer Music Conference, ICMC, Santiago de Chile, Chile.

Schulze-Forster K., Doire C., Richard G., and Badeau R. (2021). Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation. IEEE/ACM Transactions on Audio, Speech, and Language Processing.

Chandna P.,Ramires A., Serra X., Gómez E. (2021). LoopNet: Musical Loop Synthesis Conditioned on Intuitive Musical Parameters. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, Canada.

Agrawal R., Wolff D., and Dixon S. (2021). Structure-Aware Audio-to-Score Alignment using Progressively Dilated Convolutional Neural Networks. 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021), Toronto, Canada.

Yesiler F., Molina, E., Serrà, J., and Gómez E. (2021). Investigating the Efficacy of Music Version Retrieval Systems for Setlist Identification. 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021), Toronto, Canada.

Cantisani G., Essid S., and Richard G. (2021). Neuro-steered music source separation with EEG-based auditory attention decoding and contrastive-NMF. 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021), Toronto, Canada.

Demirel E. Ahlback S., and Dixon S. (2021). Low Resource Audio-to-Lyrics Alignment From Polyphonic Music Recordings. 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021), Toronto, Canada.

Cífka O., Ozerov A., Şimşekli U., and Richard G. (2021). Self-Supervised VQ-VAE for One-Shot Music Style Transfer. 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021), Toronto, Canada.

Agrawal R., and Dixon S. (2021). Learning Frame Similarity using Siamese networks for Audio-to-Score Alignment. 28th European Signal Processing Conference (EUSIPCO 2020), Amsterdam, The Netherlands.

Nistal J., Lattner S., and Richard G. (2021). Comparing Representations for Audio Synthesis Using Generative Adversarial Networks. 28th European Signal Processing Conference (EUSIPCO 2020), Amsterdam, The Netherlands.

Lordelo C., Benetos E., Dixon S., Ahlbäck S. and Ohlsson P. (2020). Adversarial Unsupervised Domain Adaptation for Harmonic-Percussive Source Separation. IEEE Signal Processing Letters (Early Access).

Pankajakshan A., Bear H., Subramanian V. and Benetos E. (2020). Memory Controlled Sequential Self Attention for Sound Recognition. Interspeech 2020.

Demirel E. Ahlback S., and Dixon S. (2020). A Recursive Search Method for Lyrics Alignment. 21st International Society for Music Information Retrieval Conference (ISMIR 2020), Toronto, Canada.

Ching J., Ramires A., Yang Y. (2020). Instrument Role Classification: Auto-Tagging for Loop Based Music. The 2020 Joint Conference on AI Music Creativity, Stockholm, Sweden.

Ramires A., Font F., Bogdanov D., Smith J., Yang Y., Ching J., Chen B., Wu Y., Wei-Han H., Serra X. (2020). The Freesound Loop Dataset and Annotation Tool. 21st International Society for Music Information Retrieval Conference (ISMIR 2020), Toronto, Canada..

Yesiler F., Serrà, J., and Gómez E. (2020). Less is More: Faster and Better Music Version Identification with Embedding Distillation. 21th International Society for Music Information Retrieval Conference (ISMIR 2020), Montreal, Canada.

Doras, G., Yesiler F., Serrà, J., Gómez E., and Peeters, G. (2020). Combining Musical Features for Cover Detection. 21th International Society for Music Information Retrieval Conference (ISMIR 2020), Montreal, Canada.

Ibrahim K., Eepure E., Peeters G., and Richard G. (2020). Should We Consider the Users in Contextual Music Auto-Tagging Models?. 21th International Society for Music Information Retrieval Conference (ISMIR 2020), Montreal, Canada.

Brazier C., and Widmer G. (2020). Addressing the Recitative Problem in Real-Time Opera Tracking. 25th Frontiers of Research in Speech and Music (FRSM 2020), Silchar, India.

Ramires A., Bernardes G., Davies M., Serra X. (2020). TIV.Lib: An Open-source Library for the Tonal Description of Musical Audio. 23rd International Conference on Digital Audio Effects (DAFx-20), Vienna, Austria..

Delgado A., Saitis C., and Sandler M. (2020). Spectral and Temporal Timbral Cues of Vocal Imitations of Drum Sounds. Proceedings of the 2nd International Conference on Timbre (online).

Cífka O., Şimşekli U., and Richard G. (2020). Groove2Groove: One-Shot Music Style Transfer with Supervision from Synthetic Data. IEEE/ACM Transactions on Audio, Speech, and Language Processing.

Demirel E. Ahlback S., and Dixon S. (2020). Automatic Lyrics Transcription using Dilated Convolutional Neural Networks with Self-Attention. 2020 International Joint Conference on Neural Networks (IJCNN 2020), Glasgow, UK.

Tovstogan P., Serra X., and Bogdanov D. (2020). Web Interface for Exploration of Latent and Tag Spaces in Music Auto-Tagging. Machine Learning for Media Discovery Workshop (ML4MD), 37th International Conference on Machine Learning (ICML 2020), Vienna, Austria.

Brazier C., and Widmer G. (2020). Towards Reliable Real-Time Opera Tracking: Combining Alignment with Audio Event Detectors to Increase Robustness. 17th Sound & Music Computing Conference (SMC 2020), Torino, Italy.

Lartillot O., Cancino-Chacón C., and Brazier C.. (2020). Real-Time Visualisation of Fugue Played by a String Quartet. 17th Sound & Music Computing Conference (SMC 2020), Torino, Italy.

Ibrahim K., Eepure E., Peeters G., and Richard G. (2020). Confidence-based Weighted Loss for Multi-Label Classification with Missing Labels. 2020 International Conference on Multimedia Retrieval (ICMR 2020), Dublin, Ireland.

Yesiler F., Serrà, J., and Gómez E. (2020). Accurate and Scalable Version Identification Using Musically-motivated Embeddings. 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020), Barcelona, Spain.

Ramires A., Chandna P., Favory X., Gómez E., and Serra X. (2020). Neural Percussive Synthesis Parameterised by High-Level Timbral Features. 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020), Barcelona, Spain.

Ibrahim K., Royo-letelier J., Eepure E., Peeters G., and Richard G. (2020). Audio-Based Auto-Tagging with Contextual Tags for Music. 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020), Barcelona, Spain.

Schulze-Forster K., Doire C., Richard G., and Badeau R. (2020). Joint Phoneme Alignment and Text-Informed Speech Separation on Highly Corrupted Speech. 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020), Barcelona, Spain.

Subramanian V., Pankajakshan A., Benetos E., Xu N., McDonald S., and Sandler M. (2020). A Study on the Transferability of Adversarial Attacks in Sound Event Classification. 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020), Barcelona, Spain.

Demirel E. (2019). MIREX 2019 - Audio-to-Lyrics Alignment Challenge 2019. 20th International Society for Music Information Retrieval Conference (ISMIR 2019), Delft, The Netherlands.

Balke S., Dorfer M., Carvalho L., Arzt A., and Widmer G. (2019). Learning Soft-Attention Models for Tempo-Invariant Audio-Sheet Music Retrieval. 20th International Society for Music Information Retrieval Conference (ISMIR 2019), Delft, The Netherlands.

Cífka O., Şimşekli U., and Richard G. (2019). Supervised Symbolic Music Style Translation Using Synthetic Data. 20th International Society for Music Information Retrieval Conference (ISMIR 2019), Delft, The Netherlands.

Yesiler F., Tralie C., Correya A., Silva D. F., Tovstogan P., Gómez E., and Serra X. (2019). Da-TACOS: A Dataset for Cover Song Identification and Understanding. 20th International Society for Music Information Retrieval Conference (ISMIR 2019), Delft, The Netherlands.

Subramanian V., Benetos E., and Sandler M. (2019). Robustness of Adversarial Attacks in Sound Event Classification. Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2019), New York, USA.

Schulze-Forster K., Doire C., Richard G., and Badeau R. (2019). Weakly Informed Audio Source Separation. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2019), New Paltz, NY, USA.

Cantisani G., Essid S., and Richard G. (2019). EEG-based Decoding of Auditory Attention to a Target Instrument in Polyphonic Music. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2019), New Paltz, NY, USA.

Lordelo C., Benetos E., Dixon S., and Ahlbäck S. (2019). Investigating Kernel Shapes and Skip Connections for Deep Learning-Based Harmonic-Percussive Separation. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2019), New Paltz, NY, USA.

Delgado A., McDonald S., Xu N., and Sandler M. (2019). A New Dataset for Amateur Vocal Percussion Analysis. Audio Mostly 2019, Nottingham, UK.

Cantisani G., Trégoat G., Essid S., and Richard G. (2019). MAD-EEG: an EEG Dataset for Decoding Auditory Attention to a Target Instrument in Polyphonic Music. Workshop on Speech, Music and Mind 2019 (SMM 2019), Vienna, Austria.

Ramires A., and Serra X. (2019). Data Augmentation for Instrument Classification Robust to Audio Effects. 22nd International Conference on Digital Audio Effects (DAFx-19), Birmingham, UK.

Demirel E., Ahlback S. and Dixon S. (2019). Exploring Generalizability of Automatic Phoneme Recognition Models. UK Speech 2019, Birmingham, UK.

Agrawal R., and Dixon S. (2019). A Hybrid Approach to Audio-to-Score Alignment. 36th International Conference on Machine Learning (ICML 2019), Long Beach, California, USA.

Bogdanov, D., Won M., Tovstogan P., Porter A., and Serra X. (2019). The MTG-Jamendo Dataset for Automatic Music Tagging. Machine Learning for Music Discovery Workshop (ML4MD), 36th International Conference on Machine Learning (ICML 2019), Long Beach, California, USA.

Demirel E., Bozkurt B., and Serra X. (2019). Automatic Chord-Scale Recognition Using Harmonic Pitch Class Profiles. 16th Sound & Music Computing Conference (SMC 2019), Málaga, Spain.