Music source separation is an important task for many applications in music information retrieval field. However, due to the complexity of the music signal t is still considered a challenging task. In this paper, we propose a novel approach extending Wasserstein generative adversarial networks to source separation task. We used the mixture signal as a condition to generate sources and applied the U-net style network for the stable training of the model. Experiments with the DSD100 dataset show the promising results with the potential of using the GANs for music source separation.
(a)Mixture
(b)True Vocal
(c)Estimated vocal using only generative adversarial loss
(d)Estimated vocal using only L1 loss
(e)Estimated vocal using both generative adversarial loss and L1 loss
(a)Mixture
(b)True Accompaniment
(c)Estimated accompaniment using only generative adversarial loss
(d)Estimated accompaniment using only L1 loss
(e)Estimated accompaniment using both generative adversarial loss and L1 loss
Vocal | Accompaniment | Original Song | |
---|---|---|---|
Cold Play - Paradise | Youtube Link | ||
Gallant - Episode | Youtube Link | ||
Michael Jackson - Beat it | Youtube Link | ||
Red Velvet – Red Flavor | Youtube Link |
Vocal | Accompaniment | Original Song | |
---|---|---|---|
Cold Play - Paradise | Youtube Link | ||
Gallant - Episode | Youtube Link | ||
Michael Jackson - Beat it | Youtube Link | ||
Red Velvet – Red Flavor | Youtube Link |
Vocal | Accompaniment | Original Song | |
---|---|---|---|
Cold Play - Paradise | Youtube Link | ||
Gallant - Episode | Youtube Link | ||
Michael Jackson - Beat it | Youtube Link | ||
Red Velvet – Red Flavor | Youtube Link |