[B! 音][@Google] yamadarのブックマーク

yamadar id:yamadar

音と@Googleに関するyamadarのブックマーク (1)

SoundStorm
SoundStorm:Efficient Parallel Audio Generation [paper] Zalán Borsos, Matt Sharifi, Damien Vincent, Eugene Kharitonov, Neil Zeghidour, Marco Tagliasacchi Google Research Abstract. We present SoundStorm, a model for efficient, non-autoregressive audio generation. SoundStorm receives as input the semantic tokens of AudioLM, and relies on bidirectional attention and confidence-based parallel decoding
yamadar 2023/06/12
"SoundStorm"は、AudioLMのセマンティックトークンを用いて、高品質で一貫性のある音声を非自己回帰で高速生成するモデル。高速で生成可能で、注意して聞いても人間と判別がつかないレベルの品質。

AI

音

@Google
リンク
1

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx