arrowKatoのブックマーク / 2023年6月15日 - はてなブックマーク

arrowKato id:arrowKato

2023年6月15日のブックマーク (1件)

AlpacaEval Leaderboard
About AlpacaEval AlpacaEval an LLM-based automatic evaluation that is fast, cheap, and reliable. It is based on the AlpacaFarm evaluation set, which tests the ability of models to follow general user instructions. These responses are then compared to reference responses (Davinci003 for AlpacaEval, GPT-4 Preview for AlpacaEval 2.0) by the provided GPT-4 based auto-annotators, which results in the w
arrowKato 2023/06/15
Alpaca Evalというベンチマークでのランキング

LLM
リンク
- 2023年6月19日
- 2023年6月15日
- 2023年6月14日

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx