umiyoshのブックマーク / 2024年4月13日

umiyosh id:umiyosh

2024年4月13日のブックマーク (1件)

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
This work introduces an efficient method to scale Transf ormer-based Large Language Models (LLMs) to infinitely long inputs with bounded memory and computation. A key component in our proposed approach is a new attention technique dubbed Infini-attention. The Infini-attention incorporates a compressive memory into the vanilla attention mechanism and builds in both masked local attention and long-te
umiyosh 2024/04/13
リンク
- 2024年4月15日
- 2024年4月13日
- 2024年4月10日