Speech-music discrimination

Analyzing the potential of pre-trained embeddings for audio classification tasks

In the context of deep learning, the availability of large amounts of training data can play a critical role in a model's performance. Transfer learning has shown to be a powerful method in which models are first pre-trained for a task where abundant …

Singing Voice, Speech, or Something in Between

In the context of the ACMus research project, we are investigating automatic techniques for annotation and segmentation of digital archives of non-western music. In particular, we are focusing on a collection of traditional Colombian music compiled …