[B! performance][pytorch] dannのブックマーク

dann id:dann

performanceとpytorchに関するdannのブックマーク (21)

Pytorch Conference
dann 2024/09/19
pytorch

llm

performance
リンク
PytorchによるLLMの高速化
アドベントカレンダー「ほぼ横浜の民」の11日目の記事です。今年は LLM の高速化実装について書いています。私はLLMの専門家ではないですが前々から興味があったので少し勉強してみました。この記事を読んでわかること LLMが文章を生成する仕組み torch.compile によって LLM はどのように高速化されるのか？ Speculative Decoding とは？背景少し前に Accelerating Generative AI with Pytorch II: GPT, Fast という素晴らしいブログ記事を見かけました。この記事は Pytorch チームから出されたもので、素の Pytorch のみを用いて LLM の推論を 10 倍高速化できるというものでした。一体どのように 10 倍もの高速化を実現しているのか気になったので、個人的な勉強も兼ねてこの記事を書いています。
dann 2024/01/31
pytorch

performance
リンク
Accelerating Generative AI Part III: Diffusion, Fast
by Sayak Paul and Patrick von Platen (Hugging Face 🤗) This post is the third part of a multi-series blog focused on how to accelerate generative AI models with pure, native PyTorch. We are excited to share a breadth of newly released PyTorch performance features alongside practical examples to see how far we can push PyTorch native performance. In part one, we showed how to accelerate Segment Any
dann 2024/01/05
performance

inference

pytorch
リンク
GitHub - pytorch-labs/segment-anything-fast: A batched offline inference oriented version of segment-anything
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
dann 2023/12/03
pytorch

performance
リンク
Accelerating Generative AI with PyTorch: Segment Anything, Fast
by Team PyTorch This post is the first part of a multi-series blog focused on how to accelerate generative AI models with pure, native PyTorch. We are excited to share a breadth of newly released PyTorch performance features alongside practical examples of how these features can be combined to see how far we can push PyTorch native performance. As announced during the PyTorch Developer Conference
dann 2023/12/02
pytorch

performance
リンク
gpt-fast/eval.py at main · pytorch-labs/gpt-fast
dann 2023/12/02
pytorch

llm

performance
リンク
kineto/tb_plugin/README.md at main · pytorch/kineto
dann 2023/05/04
pytorch

profiler

performance
リンク
PyTorch 2.0 Live Q&A Series: PT2 Profiling and Debugging
dann 2023/02/07
pytorch

performance
リンク
Making Deep Learning go Brrrr From First Principles
Making Deep Learning Go Brrrr From First Principles So, you want to improve the performance of your deep learning model. How might you approach such a task? Often, folk fall back to a grab-bag of tricks that might've worked before or saw on a tweet. "Use in-place operations! Set gradients to None! Install PyTorch 1.10.0 but not 1.10.1!" It's understandable why users often take such an ad-hoc appro
dann 2023/02/05
gpu

performance

deeplearning

nvidia

pytorch
リンク
PyTorch Performance Tuning Guide - Szymon Migacz, NVIDIA
ECCV 2020 Tutorial on Accelerating Computer Vision with Mixed Precision Website: https://nvlabs.github.io/eccv2020-mixed-precision-tutorial/ Slides: https://nvlabs.github.io/eccv2020-mixed-precision-tutorial/files/szymon_migacz-pytorch-performance-tuning-guide.pdf
dann 2022/09/13
pytorch

performance
リンク
torch.nn.modules.module — PyTorch 2.3 documentation
dann 2022/09/13
performance

pytorch
リンク
torch.utils.bottleneck — PyTorch 2.2 documentation
dann 2022/09/13
pytorch

performance
リンク
torch.profiler — PyTorch 2.4 documentation
dann 2022/08/10
pytorch

performance
リンク
Performance Tips and Tricks | fastai
dann 2020/11/07
performance

pytorch
リンク
OLCF-Analytics / summit / Distributed Deep Learning Examples · GitLab
dann 2020/11/06
pytorch

horovod

performance
リンク
vision/torchvision/datasets/lsun.py at main · pytorch/vision
dann 2020/11/06
pytorch

lmdb

performance
リンク
https://nvlabs.github.io/eccv2020-mixed-precision-tutorial/files/szymon_migacz-pytorch-performance-tuning-guide.pdf
dann 2020/08/31
pytorch

performance
リンク
Writing a PyTorch custom layer in CUDA for Transformer
dann 2019/10/26
pytorch

performance
リンク
pytorch/torch/cuda/nvtx.py at main · pytorch/pytorch
dann 2019/08/28
pytorch

nvtx

performance
リンク
Profiling Deep Learning Networks
dann 2019/07/27
pytorch

performance

nvtx
リンク
1 2 次のページ

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx