Deep Learning in Speech Synthesis
Heiga Zen
Google
August 31st, 2013

Outline
Background
Deep Learning
Deep Learning in Speech Synthesis
    Motivation
    Deep learning-based approaches
    DNN-based statistical parametric speech synthesis
    Experiments
Conclusion

Text-to-speech as sequence-to-sequence mapping
• Automatic speech recognition (ASR)
  Speech (continuous time series) → Text (discrete symbol sequence)