Published: 20 Dec 2019, Last Modified: 22 Oct 2023
ICLR 2020 Conference Blind Submission

Abstract: Masked language modeling (MLM) pre-training methods such as BERT corrupt the input by replacing some tokens with [MASK] and then train a model to reconstruct the original tokens. While they produce good results when transferred to downstream NLP tasks, they generally require large amounts of compute to be effective.
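To make the corruption step concrete, below is a minimal, illustrative sketch of MLM-style input corruption. The 15% masking rate follows BERT's published setting; the function name `mlm_corrupt` and all other identifiers are hypothetical and not taken from any actual codebase.

```python
import random

def mlm_corrupt(tokens, mask_prob=0.15, mask_token="[MASK]", seed=None):
    """Replace a random subset of tokens with [MASK] (illustrative sketch).

    mask_prob=0.15 follows BERT's setting; names here are hypothetical.
    """
    rng = random.Random(seed)
    corrupted, targets = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            corrupted.append(mask_token)
            targets.append(tok)   # the model is trained to reconstruct this token
        else:
            corrupted.append(tok)
            targets.append(None)  # no reconstruction loss at unmasked positions
    return corrupted, targets

tokens = "the quick brown fox jumps over the lazy dog".split()
corrupted, targets = mlm_corrupt(tokens, seed=0)
print(corrupted)  # e.g. ['the', 'quick', '[MASK]', 'fox', ...]
```

Because the reconstruction loss is computed only at the masked positions, the model learns from just a small fraction of each input sequence, which is one reason such pre-training demands large amounts of compute.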