GitHub - NVIDIA/Megatron-LM: Ongoing research training transformer models at scale

テクノロジーカテゴリーの変更を依頼記事元:

github.com/NVIDIA

3 usersがブックマークコメント

記事へのコメント2件

注目コメント
新着コメント

rawwell “we developed a simple and efficient two-dimensional model-parallel approach. To use tensor model parallelism (splitting execution of a single transformer module over multiple GPUs), add the --tensor-model-parallel-size flag to specify the number of GPUs among which to split the model, along with

2021/04/07 リンク

hnishi2509 これ動かせる環境持っている人ってよっぽどやなぁ。“We have provided an example of how to configure Megatron to run GPT-3 with 175 billion parameters on 1024 GPUs.”

2021/03/23 リンク

注目コメント算出アルゴリズムの一部にLINEヤフー株式会社の「建設的コメント順位付けモデルAPI」を使用しています

規約違反を報告

いまの話題をアプリでチェック！

バナー広告なし
ミュート機能あり
ダークモード搭載

アプリをダウンロード

GitHub - NVIDIA/Megatron-LM: Ongoing research training transformer models at scale

You signed in with another tab or window. Reload to refresh your session. You signed out in anoth... You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert