It's no secret that multi-head self-attention is expensive -- its \(O(n^2)\) complexity with respect to sequence length means that letting vanilla transformers attend to long sequences quickly becomes intractable. Over the past two years the NLP community has developed a veritable zoo of methods to combat this problematic complexity, but in this post we'll focus on a dozen of the most promising approaches. You