ktykogmのブックマーク - はてなブックマーク

ktykogm id:ktykogm

ブックマーク / github.com/confident-ai (1)

GitHub - confident-ai/deepeval: The LLM Evaluation Framework
DeepEval is a simple-to-use, open-source LLM evaluation framework. It is similar to Pytest but specialized for unit testing LLM outputs. DeepEval incorporates the latest research to evaluate LLM outputs based on metrics such as G-Eval, hallucination, answer relevancy, RAGAS, etc., which uses LLMs and various other NLP models that runs locally on your machine for evaluation. Whether your applicatio
ktykogm 2024/08/19
LLM

evaluation

RAG

NLP

AI

metrics

framework

LLMOps
リンク
1

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx