サクサク読めて、アプリ限定の機能も多数!
トップへ戻る
WWDC25
gist.github.com/yoavg
multi-llm-agents.md Are multi-LLM-agent systems a thing? Yes they are. But. Yoav Goldberg, Nov 24, 2024 This piece started with a pair of twitter and bluesky posts: let's talk about "agents" (in the LLM sense). there's a lot of buzz around "multi-agent" systems where agents collaborate but... i don't really get how it differs from a thinking of a single agent with multiple modes of operation. what
rl-for-llms.md Reinforcement Learning for Language Models Yoav Goldberg, April 2023. Why RL? With the release of the ChatGPT model and followup large language models (LLMs), there was a lot of discussion of the importance of "RLHF training", that is, "reinforcement learning from human feedback". I was puzzled for a while as to why RL (Reinforcement Learning) is better than learning from demonstrat
このページを最初にブックマークしてみませんか?
『yoavg’s gists』の新着エントリーを見る
j次のブックマーク
k前のブックマーク
lあとで読む
eコメント一覧を開く
oページを開く