Publications

Conference presentations, posters, journal articles

Lordelo C., Benetos E., Dixon S., and Ahlbäck S. (2021). Pitch-Informed Instrument Assignment Using a Deep Convolutional Network with Multiple Kernel Shapes. 22nd International Society for Music Information Retrieval (ISMIR 2021) Conference, Online.

Brazier C., and Widmer G. (2021). On-Line Audio-to-Lyrics Alignment Based on a Reference Performance. 22nd International Society for Music Information Retrieval (ISMIR 2021) Conference, Online.

Cantisani G., Ozerov A., Essid S., and Richard G. (2021). User-guided one-shot deep model adaptation for music source separation. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2021), New Paltz, NY, USA.

Demirel E. Ahlback S., and Dixon S. (2021). Computational Pronunciation Analysis in Sung Utterances. 29th European Signal Processing Conference (EUSIPCO 2021), Dublin, Ireland.

Carvalho L., and Widmer G. (2021). Exploiting Temporal Dependencies for Cross-Modal Music Piece Identification. 29th European Signal Processing Conference (EUSIPCO 2021), Dublin, Ireland.

Brazier C., and Widmer G. (2021). Handling Structural Mismatches in Real-time Opera Tracking. 29th European Signal Processing Conference (EUSIPCO 2021), Dublin, Ireland.

Liutkus A., Cífka O., Wu S.-L., Şimşekli U., Yang Y.-H., and Richard G. (2021). Relative Positional Encoding for Transformers with Linear Complexity. 38th International Conference on Machine Learning (ICML 2021), Virtual.

Delgado A., McDonald S., Xu N., Saitis C., and Sandler M., (2021). Learning Models for Query by Vocal Percussion: A Comparative Study. Proceedings of the 46th International Computer Music Conference, ICMC, Santiago de Chile, Chile.

Schulze-Forster K., Doire C., Richard G., and Badeau R. (2021). Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation. IEEE/ACM Transactions on Audio, Speech, and Language Processing.

Agrawal R., Wolff D., and Dixon S. (2021). Structure-Aware Audio-to-Score Alignment using Progressively Dilated Convolutional Neural Networks. 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021), Toronto, Canada.

Cantisani G., Essid S., and Richard G. (2021). Neuro-steered music source separation with EEG-based auditory attention decoding and contrastive-NMF. 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021), Toronto, Canada.

Demirel E. Ahlback S., and Dixon S. (2021). Low Resource Audio-to-Lyrics Alignment From Polyphonic Music Recordings. 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021), Toronto, Canada.

Cífka O., Ozerov A., Şimşekli U., and Richard G. (2021). Self-Supervised VQ-VAE for One-Shot Music Style Transfer. 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021), Toronto, Canada.

Agrawal R., and Dixon S. (2021). Learning Frame Similarity using Siamese networks for Audio-to-Score Alignment. 28th European Signal Processing Conference (EUSIPCO 2020), Amsterdam, The Netherlands.

Nistal J., Lattner S., and Richard G. (2021). Comparing Representations for Audio Synthesis Using Generative Adversarial Networks. 28th European Signal Processing Conference (EUSIPCO 2020), Amsterdam, The Netherlands.

Lordelo C., Benetos E., Dixon S., Ahlbäck S. and Ohlsson P. (2020). Adversarial Unsupervised Domain Adaptation for Harmonic-Percussive Source Separation. IEEE Signal Processing Letters (Early Access).

Pankajakshan A., Bear H., Subramanian V. and Benetos E. (2020). Memory Controlled Sequential Self Attention for Sound Recognition. Interspeech 2020.

Demirel E. Ahlback S., and Dixon S. (2020). A Recursive Search Method for Lyrics Alignment. 21st International Society for Music Information Retrieval Conference (ISMIR 2020), Toronto, Canada.

Ibrahim K., Eepure E., Peeters G., and Richard G. (2020). Should We Consider the Users in Contextual Music Auto-Tagging Models?. 21th International Society for Music Information Retrieval Conference (ISMIR 2020), Montreal, Canada.

Brazier C., and Widmer G. (2020). Addressing the Recitative Problem in Real-Time Opera Tracking. 25th Frontiers of Research in Speech and Music (FRSM 2020), Silchar, India.

Delgado A., Saitis C., and Sandler M. (2020). Spectral and Temporal Timbral Cues of Vocal Imitations of Drum Sounds. Proceedings of the 2nd International Conference on Timbre (online).

Cífka O., Şimşekli U., and Richard G. (2020). Groove2Groove: One-Shot Music Style Transfer with Supervision from Synthetic Data. IEEE/ACM Transactions on Audio, Speech, and Language Processing.

Demirel E. Ahlback S., and Dixon S. (2020). Automatic Lyrics Transcription using Dilated Convolutional Neural Networks with Self-Attention. 2020 International Joint Conference on Neural Networks (IJCNN 2020), Glasgow, UK.

Tovstogan P., Serra X., and Bogdanov D. (2020). Web Interface for Exploration of Latent and Tag Spaces in Music Auto-Tagging. Machine Learning for Media Discovery Workshop (ML4MD), 37th International Conference on Machine Learning (ICML 2020), Vienna, Austria.

Brazier C., and Widmer G. (2020). Towards Reliable Real-Time Opera Tracking: Combining Alignment with Audio Event Detectors to Increase Robustness. 17th Sound & Music Computing Conference (SMC 2020), Torino, Italy.

Lartillot O., Cancino-Chacón C., and Brazier C.. (2020). Real-Time Visualisation of Fugue Played by a String Quartet. 17th Sound & Music Computing Conference (SMC 2020), Torino, Italy.

Ibrahim K., Eepure E., Peeters G., and Richard G. (2020). Confidence-based Weighted Loss for Multi-Label Classification with Missing Labels. 2020 International Conference on Multimedia Retrieval (ICMR 2020), Dublin, Ireland.

Ramires A., Chandna P., Favory X., Gómez E., and Serra X. (2020). Neural Percussive Synthesis Parameterised by High-Level Timbral Features. 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020), Barcelona, Spain.

Ibrahim K., Royo-letelier J., Eepure E., Peeters G., and Richard G. (2020). Audio-Based Auto-Tagging with Contextual Tags for Music. 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020), Barcelona, Spain.

Schulze-Forster K., Doire C., Richard G., and Badeau R. (2020). Joint Phoneme Alignment and Text-Informed Speech Separation on Highly Corrupted Speech. 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020), Barcelona, Spain.

Subramanian V., Pankajakshan A., Benetos E., Xu N., McDonald S., and Sandler M. (2020). A Study on the Transferability of Adversarial Attacks in Sound Event Classification. 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020), Barcelona, Spain.

Demirel E. (2019). MIREX 2019 - Audio-to-Lyrics Alignment Challenge 2019. 20th International Society for Music Information Retrieval Conference (ISMIR 2019), Delft, The Netherlands.

Balke S., Dorfer M., Carvalho L., Arzt A., and Widmer G. (2019). Learning Soft-Attention Models for Tempo-Invariant Audio-Sheet Music Retrieval. 20th International Society for Music Information Retrieval Conference (ISMIR 2019), Delft, The Netherlands.

Cífka O., Şimşekli U., and Richard G. (2019). Supervised Symbolic Music Style Translation Using Synthetic Data. 20th International Society for Music Information Retrieval Conference (ISMIR 2019), Delft, The Netherlands.

Yesiler F., Tralie C., Correya A., Silva D. F., Tovstogan P., Gómez E., and Serra X. (2019). Da-TACOS: A Dataset for Cover Song Identification and Understanding. 20th International Society for Music Information Retrieval Conference (ISMIR 2019), Delft, The Netherlands.

Subramanian V., Benetos E., and Sandler M. (2019). Robustness of Adversarial Attacks in Sound Event Classification. Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2019), New York, USA.

Schulze-Forster K., Doire C., Richard G., and Badeau R. (2019). Weakly Informed Audio Source Separation. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2019), New Paltz, NY, USA.

Cantisani G., Essid S., and Richard G. (2019). EEG-based Decoding of Auditory Attention to a Target Instrument in Polyphonic Music. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2019), New Paltz, NY, USA.

Lordelo C., Benetos E., Dixon S., and Ahlbäck S. (2019). Investigating Kernel Shapes and Skip Connections for Deep Learning-Based Harmonic-Percussive Separation. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2019), New Paltz, NY, USA.

Delgado A., McDonald S., Xu N., and Sandler M. (2019). A New Dataset for Amateur Vocal Percussion Analysis. Audio Mostly 2019, Nottingham, UK.

Cantisani G., Trégoat G., Essid S., and Richard G. (2019). MAD-EEG: an EEG Dataset for Decoding Auditory Attention to a Target Instrument in Polyphonic Music. Workshop on Speech, Music and Mind 2019 (SMM 2019), Vienna, Austria.

Ramires A., and Serra X. (2019). Data Augmentation for Instrument Classification Robust to Audio Effects. 22nd International Conference on Digital Audio Effects (DAFx-19), Birmingham, UK.

Demirel E., Ahlback S. and Dixon S. (2019). Exploring Generalizability of Automatic Phoneme Recognition Models. UK Speech 2019, Birmingham, UK.

Agrawal R., and Dixon S. (2019). A Hybrid Approach to Audio-to-Score Alignment. 36th International Conference on Machine Learning (ICML 2019), Long Beach, California, USA.

Bogdanov, D., Won M., Tovstogan P., Porter A., and Serra X. (2019). The MTG-Jamendo Dataset for Automatic Music Tagging. Machine Learning for Music Discovery Workshop (ML4MD), 36th International Conference on Machine Learning (ICML 2019), Long Beach, California, USA.

Demirel E., Bozkurt B., and Serra X. (2019). Automatic Chord-Scale Recognition Using Harmonic Pitch Class Profiles. 16th Sound & Music Computing Conference (SMC 2019), Málaga, Spain.