In theory, Attention is All You Need. In practice, however, we also need optimized attention implementations like FlashAttention. Although these fused attention implementations have substantially improved performance and enabled long contexts, this efficiency has come with a loss of flexibility: you can no longer try out a new attention variant by writing a few PyTorch operators; you often need to write a new custom kernel.
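To make the trade-off concrete, here is a minimal sketch of what "a few PyTorch operators" looks like: a naive attention implementation where any variant (here, causal masking) is just an extra tensor operation on the score matrix. The function and parameter names (`naive_attention`, `score_mod`, `causal_mask`) are illustrative, not from any library. This is the flexibility that a monolithic fused kernel gives up.

```python
import torch

def naive_attention(q, k, v, score_mod=None):
    # Raw attention scores: (batch, heads, q_len, kv_len)
    scores = q @ k.transpose(-2, -1) / (q.size(-1) ** 0.5)
    if score_mod is not None:
        # Any variant (bias, masking, soft-capping, ...) is just more tensor ops
        scores = score_mod(scores)
    weights = torch.softmax(scores, dim=-1)
    return weights @ v

# Example variant: causal masking, expressed as a score modification
def causal_mask(scores):
    q_len, kv_len = scores.shape[-2:]
    keep = torch.tril(torch.ones(q_len, kv_len, dtype=torch.bool, device=scores.device))
    return scores.masked_fill(~keep, float("-inf"))

q = k = v = torch.randn(2, 8, 128, 64)
out = naive_attention(q, k, v, score_mod=causal_mask)  # (2, 8, 128, 64)
```

The catch is that this version materializes the full `q_len x kv_len` score matrix, which is exactly the memory and bandwidth cost that fused kernels like FlashAttention avoid.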