Transformerの人気記事 2件 - はてなブックマーク

Transformerの検索結果1 - 2 件 / 2件

Transformer Explainer: LLM Transformer Model Visually Explained

What is a Transformer? Transformer is a neural network architecture that has fundamentally changed the approach to Artificial Intelligence. Transformer was first introduced in the seminal paper "Attention is All You Need" in 2017 and has since become the go-to architecture for deep learning models, powering text-generative models like OpenAI's GPT, Meta's Llama, and Google's Gemini. Beyond text, T

Attentionと類似度は異なるという話

はじめに「Transformerのattentionはトークン間の類似度をモデリングしている」という説明をよく聞くが、この表現は適切でないことを示す。なお、このような説明がよくされる背景としては、Transformerのdot-product attentionは内積で計算され、コサイン類似度も正規化されたベクトルの内積で計算される点によるものと思われる。しかしながら両者は正規化の有無に違いがあり、ベクトル空間に埋め込んだ時の数学的性質はかなり異なるということを本稿では指摘する。 TL; DR Attention(dot-product attention)は類似度とは異なる数学的性質を持つ類似度はトークン間の近接関係はモデリングできるが、それ以外の多様な関連をモデリングするには適さない。 dot-product attentionはトークン間の近接関係を含むさまざまな関連をモデリン

はてなブックマーク

検索対象

並び順

ブックマーク数

セーフサーチ

期間指定

絞り込み

ブックマーク数

期間

セーフサーチ

Transformerの検索結果1 - 2 件 / 2件

Transformer Explainer: LLM Transformer Model Visually Explained

Attentionと類似度は異なるという話

キーボードショートカット一覧

はてなブックマーク

公式Twitter

はてなのサービス