stealthinuのブックマーク - はてなブックマーク

stealthinu id:stealthinu

ブックマーク / lmsys.org (2)

Chatbot Arena Leaderboard Week 8: Introducing MT-Bench and Vicuna-33B | LMSYS Org
Welcome to try the Chatbot Arena voting demo. Keep in mind that each benchmark has its limitations. Please consider the results as guiding references. See our discussion below for more technical details. Evaluating Chatbots with MT-bench and Arena Motivation While several benchmarks exist for evaluating Large Language Model's (LLM) performance, such as MMLU, HellaSwag, and HumanEval, we noticed
stealthinu 2023/06/23
Vicuna-33BがだいぶGPT-3.5に迫りつつある。軽量化手法が進んだ今なら33Bだと3090x2くらいあれば動かせるのかな？3.5レベルの性能あるとやれることだいぶ違う。

deeplearning

LLM
リンク
Vicuna: An Open-Source Chatbot Impressing GPT-4 with 90%* ChatGPT Quality | LMSYS Org
Vicuna: An Open-Source Chatbot Impressing GPT-4 with 90%* ChatGPT Qualityby: The Vicuna Team, Mar 30, 2023 We introduce Vicuna-13B, an open-source chatbot trained by fine-tuning LLaMA on user-shared conversations collected from ShareGPT. Preliminary evaluation using GPT-4 as a judge shows Vicuna-13B achieves more than 90%* quality of OpenAI ChatGPT and Google Bard while outperforming other models
stealthinu 2023/03/31
LLaMAベースからのファインチューニングするのにたったの$300！でこんだけ性能が上がっている。SDで起きた画像生成の改良がLLMでも急激に起きてる。

LLM

deeplearning

ChatGPT
リンク
1

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx