“FlexGen aims to lower the resource requirements of LLM inference down to a single commodity GPU (e.g., T4, 3090) and to allow flexible deployment across a variety of hardware setups.”

Deep learning

Bookmarked by misshiki, 2023/02/21 14:34



GitHub - FMInference/FlexGen: Running large language models on a single GPU for throughput-oriented scenarios.

github.com/FMInference, 2023/02/21

In recent years, large language models (LLMs) have shown great performance across a wide range of tasks. Increasingly, LLMs have been applied not only to interactive applications (such as chat), bu...

36 users bookmarked this entry, with 2 comments
