Beta Divergence for Clustering in Monaural Blind Source Separation
128th AES Convention
Conference homepageAbstract
General purpose audio blind source separation algorithms have to deal with a large dynamic range for the different sources to be separated. In our algorithm the mixture is separated into single notes. These notes are clustered to construct the melodies played by the active sources. The non-negative matrix factorization (NMF) leads to good results in clustering the notes according to spectral features. The cost function for the NMF is controlled by a parameter beta. The choice of beta depends on the dynamic difference of the sources. The novelty of this paper is to propose a simple classifier to adjust the parameter beta to current dynamic ranges for increasing the separation quality.
Paper:SpGn10.pdf
Poster: Poster_AES2010.pdf
 
Separation Examples
- Triangel - Vibraphone
- Speech - Piccolo
- Singing - Claves
- Guitar - Clarinet
- Bass drum - Contra bassoon
- Accordion - Flute
- All zip files contain mixtures at 7 different dynamic differences:
- DD1
- dynamic difference -18 dB
- DD2
- dynamic difference -12 dB
- ...
- ...
- DD7
- dynamic difference +18 dB
- For each dynamic differences all three clustering algorithms are marked with:
- c1
- clustering with beta=1
- c2
- clustering with beta=2
- c3
- clustering with adaptive beta
- For each clustering the separated sources are marked with s1 and s2
(C) by Martin Spiertz - 28. May 2010 - spiertz@ient.rwth-aachen.de