[B! computerVision][deeplearning] manboubird's bookmarks

manboubird id:manboubird

manboubird's bookmarks on computerVision and deeplearning (145)

Fundamentals and Latest Trends in NLP and Vision-and-Language (2) / DEIM Tutorial Part 2 Vision-and-Language
DEIM2023 (15th Forum on Data Engineering and Information Management) tutorial lecture materials, Part 2: Vision-and-Language
manboubird 2023/03/08
slide

deepLearning

nlp

computerVision
Link
Fundamentals and Latest Trends in NLP and Vision-and-Language (1) / DEIM Tutorial Part 1: NLP
DEIM2023 (15th Forum on Data Engineering and Information Management) tutorial lecture materials, Part 1: NLP
manboubird 2023/03/08
nlp

computerVision

slide

deepLearning
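Link
Natural Language Processing and Vision-and-Language / A Tutorial on NLP & Vision-and-Language
Tutorial lecture materials from the 2022 Annual Conference of the Japanese Society for Artificial Intelligence (36th)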
manboubird 2023/02/12
slide

ntt

vision

nlp

computerVision

deepLearning

clip
Link
https://dl.acm.org/doi/pdf/10.1145/3505244
manboubird 2023/01/22
Transformers in Vision: A Survey

paper

transformers

computerVision

survey

deepLearning
Link
GitHub - metauto-ai/Kaleido-BERT: 💐Kaleido-BERT: Vision-Language Pre-training on Fashion Domain
manboubird 2022/10/30
kaleidoBert

bert

deepLearning

fashion

computerVision

nlp
Link
GitHub - facebookresearch/vissl: VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
manboubird 2022/05/14
facebook

computerVision

instagram

classification

model

deepLearning
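Link
Cutting-Edge Deep Learning: I Built a Japanese Multimodal Model [Japanese VL-T5] - Qiita
Summary: A tutorial article for readers wondering what multimodal deep learning and Vision-Language Models are, giving a technical overview while actually running a pretrained model on Google Colab. The multimodal era has arrived: Over the past 10 years, with the advent of deep learning, tasks such as image classification and reading comprehension (QA in natural languages such as Japanese) have become automatable with high accuracy. However, development centered on methods specialized for each modality, images as images and natural language as natural language, and for a long time there were few attempts at multimedia (multimodal) problems that mix them. The importance of multimodality is clear if you recall the many situations in which humans make intelligent judgments. It is an important AI technology area for solving real-world problems. The tide of single-modal dominance has turned over the past year or so, and recently multimodal deep learning models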
manboubird 2021/12/03
multiModal

deepLearning

computerVision

textAnalysis
Link
Accelerating Queries over Unstructured Data with ML, Part 4 (Accelerating Aggregation Queries with Expensive Predicates) · Stanford DAWN
manboubird 2021/11/17
approximateQuery

computerVision

stanford

research

dawn

deepLearning
Link
GitHub - cs230-stanford/cs230-code-examples: Code examples in pyTorch and Tensorflow for CS230
manboubird 2021/11/07
course

standard

deeplearning

lecture

code

tutorial

tensorFlow

namedEntityRecognition

nlp

computerVision
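Link
Survey of the Latest Computer Vision Papers: 3D Human Pose Estimation Edition | BLOG - DeNA Engineering
Introduction: Hello, I'm Kato, and I work on computer vision research and development in the AI Systems Department. Our team continually surveys the latest computer vision papers and shares and discusses them within the department. Following the previous 2D Human Pose Estimation edition, Naoki Kato ( @nk35jk ) conducted the survey for this 3D Human Pose Estimation edition. This article introduces representative research on 3D Human Pose Estimation and the latest research trends in 3D Human Pose Estimation, focusing on papers accepted to ICCV 2019, a top computer vision conference. For past editions on other tasks, see the following. Human Recognition edition (2019/04/26) 3D Visio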
manboubird 2021/11/03
survey

computerVision

paper

poseEstimation

links

deeplearning

cvpr

eccv
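Link
Survey of the Latest Computer Vision Papers: 2D Human Pose Estimation Edition | BLOG - DeNA Engineering
Introduction: Hello, I'm Kato, and I work on computer vision research and development in the AI Systems Department. Our team continually surveys the latest computer vision papers and shares and discusses them within the department. Naoki Kato ( @nk35jk ) conducted the survey for this 2D Human Pose Estimation edition. This article introduces representative research on 2D Human Pose Estimation, along with the latest 2D Human Pose Estimation papers accepted to ICCV 2019, a top computer vision conference held from October to November 2019. For past editions on other tasks, see the following. Human Recognition edition (2019/04/26) 3D Vision edition (2019/06/04) Keypoint det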
manboubird 2021/11/03
survey

computerVision

paper

poseEstimation

links

deeplearning

cvpr

eccv
Link
Welcome to Hao Su's homepage
manboubird 2021/11/03
UCSD

research

lab

computerVision

robot

machineLearning

deepLearning
Link
Toward Fast and Accurate Neural Networks for Image Recognition
manboubird 2021/09/18
efficientNet

google

neuralArchitectureSearch

deepLearning

computerVision
Link
Computers Do Not Make Art, People Do – Communications of the ACM
We live in an age of amazing new visual art created with artificial intelligence (AI) technology. The recent wave began with neural stylization apps and the trippy, evocative DeepDream. Many fine artists now work with neural network algorithms, creating high-profile works appearing in major venues.1 Together with these new developments comes the hype: technologists who claim that their algorithms
manboubird 2021/05/30
computerVision

art

cacm

paper

deepLearning
Link
Image captioning with visual attention | Text | TensorFlow
Deploy ML on mobile, microcontrollers and other edge devices
manboubird 2021/05/15
tensorFlow

captionGenerator

computerVision

deepLearning
Link
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
Convolutional Neural Networks (ConvNets) are commonly developed at a fixed resource budget, and then scaled up for better accuracy if more resources are available. In this paper, we systematically study model scaling and identify that carefully balancing network depth, width, and resolution can lead to better performance. Based on this observation, we propose a new scaling method that uniformly sc
manboubird 2021/04/25
icml

paper

google

efficientNet

computerVision

deeplearning
Link
GitHub - facebookresearch/Detectron: FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.
manboubird 2021/04/02
facebook

cnn

deeplearning

computerVision
Link
Facebook
manboubird 2021/02/07
fashion

artificialIntelligence

facebook

research

ec

shopping

computerVision

clothing

objectDetection

deepLearning
Link
Powered by AI: Advancing product understanding and building new shopping experiences
Powered by AI: Advancing product understanding and building new shopping experiences Today we’re announcing: We’ve built and deployed GrokNet, a universal computer vision system designed for shopping. It can identify fine-grained product attributes across billions of photos — in different categories, such as fashion, auto, and home decor. GrokNet is powering new Marketplace features for buyers and
manboubird 2020/05/26
facebook

imageRecognition

product

paper

research

deepLearning
Link
OverFeat: Object Recognizer, Feature Extractor | CILVR Lab @ NYU
The CILVR Lab (Computational Intelligence, Learning, Vision, and Robotics) regroups faculty members, research scientists, postdocs, and students working on AI, machine learning, and a wide variety of applications, notably computer perception, natural language understanding, robotics, and healthcare. Follow us @CILVRatNYU on Twitter! CILVR News 05/03/25 – Congratulations to NYU Assistant Professor
manboubird 2020/03/28
nyu

dataScience

deeplearning

lab

facebookAI

computerVision

robot

nlp
Link