[B! DeepLearning][NLP] wanchan-daisukiのブックマーク

Language-agnostic representation learning for product search on e-commerce platforms

wanchan-daisuki 2020/01/22

商品表現のための、クロスリンガルなTransformerベースの表現学習手法。単一言語で学習したときよりも高精度な検索ができるようになったらしい。

リンク

Single Headed Attention RNN: Stop Thinking With Your Head

The leading approaches in language modeling are all obsessed with TV shows of my youth - namely Transf ormers and Sesame Street. Transf ormers this, Transf ormers that, and over here a bonfire worth of GPU-TPU-neuromorphic wafer scale silicon. We opt for the lazy path of old and proven techniques with a fancy crypto inspired acronym: the Single Headed Attention RNN (SHA-RNN). The author's lone goal i

wanchan-daisuki 2020/01/15

LSTMベースの言語モデルSHA-RNNを提案。enwiki8データセットで、Transformerベースの手法に匹敵する性能を実現。1GPUで24時間前後に訓練可能らしい。今後もLSTMと変わらぬお付き合いよろしくな、って論文みたい。

リンク

SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization

wanchan-daisuki 2019/12/18

事前学習済みモデルを破壊せず丁寧にファインチューニングするための手法SMARTの提案。

リンク

はてなブックマーク

タグ

関連タグで絞り込む (0)

DeepLearningとNLPに関するwanchan-daisukiのブックマーク (3)

お知らせ

今週のはてなブックマーク数ランキング（2024年11月第2週）

今週のはてなブックマーク数ランキング（2024年11月第1週）

月間はてなブックマーク数ランキング（2024年10月）

公式Twitter

キーボードショートカット一覧

はてなブックマーク

公式Twitter

はてなのサービス