yoavg’s gists: New Entries and Ratings - Hatena Bookmark

"yoavg’s gists"

What makes multi-agent LLM systems multi-agent?
3 users
gist.github.com/yoavg

multi-llm-agents.md: Are multi-LLM-agent systems a thing? Yes they are. But. Yoav Goldberg, Nov 24, 2024. This piece started with a pair of twitter and bluesky posts: let's talk about "agents" (in the LLM sense). there's a lot of buzz around "multi-agent" systems where agents collaborate but... i don't really get how it differs from a thinking of a single agent with multiple modes of operation. what
- Technology
- 2024/11/26 18:06

Reinforcement Learning for Language Models
19 users
gist.github.com/yoavg

rl-for-llms.md: Reinforcement Learning for Language Models. Yoav Goldberg, April 2023. Why RL? With the release of the ChatGPT model and followup large language models (LLMs), there was a lot of discussion of the importance of "RLHF training", that is, "reinforcement learning from human feedback". I was puzzled for a while as to why RL (Reinforcement Learning) is better than learning from demonstrat
- Technology
- 2023/04/23 19:29
- AI
- Read later
