Beta Divergence for Clustering in Monaural Blind Source Separation

128th AES Convention

Conference homepage

Abstract

General purpose audio blind source separation algorithms have to deal with a large dynamic range for the different sources to be separated. In our algorithm the mixture is separated into single notes. These notes are clustered to construct the melodies played by the active sources. The non-negative matrix factorization (NMF) leads to good results in clustering the notes according to spectral features. The cost function for the NMF is controlled by a parameter beta. The choice of beta depends on the dynamic difference of the sources. The novelty of this paper is to propose a simple classifier to adjust the parameter beta to current dynamic ranges for increasing the separation quality.

Paper:SpGn10.pdf

Poster: Poster_AES2010.pdf

 

Separation Examples

Remarks:
  • All zip files contain mixtures at 7 different dynamic differences:
    DD1
    dynamic difference -18 dB
    DD2
    dynamic difference -12 dB
    ...
    ...
    DD7
    dynamic difference +18 dB
  • For each dynamic differences all three clustering algorithms are marked with:
    c1
    clustering with beta=1
    c2
    clustering with beta=2
    c3
    clustering with adaptive beta
  • For each clustering the separated sources are marked with s1 and s2

(C) by Martin Spiertz - 28. May 2010 - spiertz@ient.rwth-aachen.de