[B! multimodal] saitodevel01のブックマーク

saitodevel01 id:saitodevel01

multimodalに関するsaitodevel01のブックマーク (11)

Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond
saitodevel01 2023/08/29
multimodal
リンク
Grounded Language-Image Pre-training
saitodevel01 2023/08/03
image

multimodal
リンク
GLIPv2: Unifying Localization and Vision-Language Understanding
saitodevel01 2023/08/03
image

multimodal
リンク
Zero-shot Learning網羅的サーベイ：CLIPが切り開いたVision & Languageの新しい世界 - エクサウィザーズ Engineer Blog
こんにちは！　画像システムグループで機械学習エンジニアをやっている小島です。この記事では、今ホットな「Zero-shot Learning」と「Vision & Language」に関する最新情報を、CLIPという研究を起点として網羅的にサーベイをしていきます。このために論文1000本に目を通し、70本程度を記事にしました。 Zero-shotやVision & Languageは、Stable Diffusionに代表される画像生成AIとも密接に関連している技術です。この記事を通して、Vision & Languageの奥深い世界を体感できるでしょう。注意事項この記事は非常に長いため、全部読むのに1時間以上かかる可能性があるので、休憩を取りながら、または必要な部分だけ読んでください。各セクションを個別に読んでも問題ありません。また、文章中の画像は、特別な記載がない限り、引用元の論
saitodevel01 2023/08/01
LLM

multimodal
リンク
Learning Transferable Visual Models From Natural Language Supervision
State-of-the-art computer vision systems are trained to predict a fixed set of predetermined object categories. This restricted form of supervision limits their generality and usability since additional labeled data is needed to specify any other visual concept. Learning directly from raw text about images is a promising alternative which leverages a much broader source of supervision. We demonstr
saitodevel01 2023/07/23
multimodal
リンク
Flamingo: a Visual Language Model for Few-Shot Learning
saitodevel01 2023/07/23
LLM

multimodal
リンク
Visual Instruction Tuning
saitodevel01 2023/07/23
LLM

multimodal
リンク
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models
saitodevel01 2023/07/23
LLM

multimodal
リンク
Gradio
saitodevel01 2023/07/23
LLM

multimodal
リンク
LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model
saitodevel01 2023/07/23
multimodal

LLM
リンク
ImageBind: One Embedding Space To Bind Them All
saitodevel01 2023/07/23
multimodal

LLM
リンク
1

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx