JSUT (Japanese speech corpus of Saruwatari-lab., University of Tokyo) The JSUT Collection is Japanese speech corpora connecting speech, song, and audio events. The JSUT corpus is a part of the JSUT Collection. JSUT コレクションは,声・歌・音声模倣をつなげるための音声コーパスです.このJSUT コーパスは,JSUT コレクションの一部です. This corpus consists of Japanese text (transcription) and reading-style audio. The audio data is sampled at 48kHz and rec