arrowKatoのブックマーク / 2024年4月26日 - はてなブックマーク

arrowKato id:arrowKato

2024年4月26日のブックマーク (3件)

AgentBench: Evaluating LLMs as Agents
Large Language Models (LLMs) are becoming increasingly smart and autonomous, targeting real-world pragmatic missions beyond traditional NLP tasks. As a result, there has been an urgent need to evaluate LLMs as agents on challenging tasks in interactive environments. We present AgentBench, a multi-dimensional evolving benchmark that currently consists of 8 distinct environments to assess LLM-as-Age
arrowKato 2024/04/26
ベンチマークの論文

LLM

Agent
リンク
AI Index Report 2024 – Artificial Intelligence Index
Welcome to the seventh edition of the AI Index report. The 2024 Index is our most comprehensive to date and arrives at an important moment when AI’s influence on society has never been more pronounced. This year, we have broadened our scope to more extensively cover essential trends such as technical advancements in AI, public perceptions of the techno logy, and the geopolitical dynamics surroundin
arrowKato 2024/04/26
State of AI 2024/4月版

ML
リンク
Amazon Bedrock: New innovations for building generative AI applications
arrowKato 2024/04/26
なんかでた。

AWS

llm
リンク
- 2024年4月30日
- 2024年4月26日
- 2024年4月25日

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx