3つの要点 ✔️ TransformerとCNNを組み合わせたモデル,Conformerを音声認識に応用 ✔️ 畳み込みモジュールがConformerにおいて最も重要であることがわかった ✔️ 既存の音声認識研究の中でも最高の精度を確認 Conformer: Convolution-augmented Transformer for Speech Recognition written by Anmol Gulati, James Qin, Chung-Cheng Chiu, Niki Parmar, Yu Zhang, Jiahui Yu, Wei Han, Shibo Wang, Zhengdong Zhang, Yonghui Wu, Ruoming Pang (Submitted on 16 May 2020) Comments: Accepted at Interspeech20
![Conformer:Transformerを音声認識に応用!? GoogleによるTransformer×CNNが凄すぎる!!](https://cdn-ak-scissors.b.st-hatena.com/image/square/39bdcce54afff5d7a5b2bcb91916fc2a3b660827/height=288;version=1;width=512/https%3A%2F%2Faisholar.s3.ap-northeast-1.amazonaws.com%2Fmedia%2FNovember2020%2FTransformer_%25C3%2597_CNN%253DConformer-min.png)