本文「measuring」を検索 - はてなブックマーク

1 - 1 件 / 1件

新着順人気順

絞り込み

検索対象
ブックマーク数
期間
セーフサーチ

measuringの検索結果1 - 1 件 / 1件

GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models
- 3 users
- arxiv.org
- テクノロジー
- 2024/10/13
Recent advancements in Large Language Models (LLMs) have sparked interest in their formal reasoning capabilities, particularly in mathematics. The GSM8K benchmark is widely used to assess the mathematical reasoning of models on grade-school-level questions. While the performance of LLMs on GSM8K has significantly improved in recent years, it remains unclear whether their mathematical reasoning cap
- 機械学習
- 数学
- AI

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx