1. RefEgo: Referring Expression Comprehension Dataset from First-Person Perception of Ego4D, Shuhei Kurita, Naoki Katsura, Eri Onami, (ICCV2023). 2. ScanQA: 3D Question Answering, Daichi Azuma(*), Taiki Miyanishi(*), Shuhei Kurita(*) and Motoaki Kawanabe. (CVPR2022). (*): eq. cont. 3. Generative Language-Grounded Policy in Vision-and-Language Navigation with Bayes’ Rule, Shuhei Kurita and Kyunghyu
![SSII2024 [OS2] 大規模言語モデルとVision & Languageのこれから](https://cdn-ak-scissors.b.st-hatena.com/image/square/b99dfd999543ee41fded2e0182b98a0132d56871/height=288;version=1;width=512/https%3A%2F%2Ffiles.speakerdeck.com%2Fpresentations%2F25fe2db5f9b2440c84a13090c42bacb6%2Fslide_0.jpg%3F30574730)