Deep Karaoke: Extracting Vocals from Musical Mixtures Using a Convolutional Deep Neural Network Andrew J.R. Simpson #1 , Gerard Roma#2 , Mark D. Plumbley#3 # Centre for Vision, Speech and Signal Processing, University of Surrey Guildford, UK 1 andrew.simpson@surrey.ac.uk 2 g.roma@surrey.ac.uk 3 m.plumbley@surrey.ac.uk Abstract—Identification and extraction of singing voice from within musical mix