[B! meta] manboubird's Bookmarks

manboubird id:manboubird

manboubird's bookmarks about meta (42)

End-to-end object detection with Transformers | Research - AI at Meta
manboubird 2024/10/07
transformers

paper

meta

facebook

computerVision

llm

objectDetection

detr
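Transformer Adopted for Object Detection! A Detailed Explanation of the Much-Discussed DETR!
Introduction: "DETR (DEtection Transformer)," the first model to bring the Transformer to object detection, was announced by Facebook in May 2020. DETR greatly reduces the amount of manual human work required and is close to an end-to-end model, making it easy for anyone to use. It also makes it possible to detect objects by exploiting the relationships between objects within a single image, for example "if there is a swimsuit, the board-like object in the same picture is likely to be a surfboard." Below, we look at how this became possible. The explanation assumes a certain level of familiarity with the Transformer. We have also written an article on the Transformer, so please refer to it below. Official paper "End-to-End Object Detection with Trans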
manboubird 2024/10/05
objectDetection

meta

paper

detr

transformer
GitHub - IDEA-Research/DINO: [ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
[2023/7/10] We release Semantic-SAM, a universal image segmentation model to enable segment and recognize anything at any desired granularity. Code and checkpoint are available! [2023/4/28]: We release a strong open-set object detection and segmentation model OpenSeeD that achieves the best results on open-set object segmentation tasks. Code and checkpoints are available here. [2023/4/26]: DINO is
manboubird 2024/10/05
dino

paper

meta

facebook

computerVision

transformers

detr
GitHub - IDEA-Research/GroundingDINO: [ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
manboubird 2024/10/03
paper

objectDetection

imageSegmentation

meta

computerVision

llm

generativeAi

groundingDino
GitHub - IDEA-Research/Grounded-Segment-Anything: Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
We plan to create a very interesting demo by combining Grounding DINO and Segment Anything which aims to detect and segment anything with text inputs! And we will continue to improve it and create more interesting demos based on this foundation. And we have already released an overall technical report about our project on arXiv, please check Grounded SAM: Assembling Open-World Models for Diverse V
manboubird 2024/10/03
paper

objectDetection

imageSegmentation

meta

computerVision

llm

generativeAi

groundedSam
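[Keeper Edition] 50 Recipes for Learning Various AI Image Processing Techniques (August 2022 Edition) - Qiita
Introduction. Note: this article was updated on August 16, 2022, adding 20 recipes to bring the total to 50. I'm Matsuda, and I run AxrossRecipe. AxrossRecipe is a service launched through SoftBank's in-house entrepreneurship program. It focuses on the skill gap between engineers' "academic education" and "on-the-job work," with the aim of "reducing the number of people who have learned but cannot apply what they learned." Know-how from working engineers is turned into teaching material as "recipes," so you can follow the flow of AI development and data analysis while building something that actually runs. AxrossRecipe: https://axross-recipe.com Twitter: https://twitter.com/AxrossRecipe_SB What is image processing? Image processing is a general term for "a computer applying some kind of processing to video or image data," and covers "image recognition," "object detection," "image synthesis and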
manboubird 2024/10/01
computerVision

llm

generativeAi

links

methodology

yolo

meta

clip

openAi
Log in or sign up to view this page
See posts, photos, and more on Facebook.
manboubird 2024/09/29
meta

facebook

ad
Log in or sign up to view this page
See posts, photos, and more on Facebook.
manboubird 2024/09/29
facebook

meta

ad
GitHub - facebookresearch/mmf: A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
manboubird 2024/09/29
mmf

llm

artificialIntelligence

multimodal

meta

facebook

framework
New AI advancements drive Meta’s ads system performance and efficiency
AI has long been a crucial component of Meta’s ads system. We began with manual feature engineering for small models and progressed to building hundreds of deep neural network models with trillions of parameters. Each model is independently optimized for different goals — such as improving ad quality to provide better experienc
manboubird 2024/09/29
artificialIntelligence

meta

ad
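Meta Releases "Llama 3.2," a Free LLM Available for Commercial Use, Including Multimodal Models
On September 25 (local time), at its annual developer conference "Meta Connect 2024," US-based Meta announced the release of "Llama 3.2," the latest version of its LLM "Llama." Although the company had only just released "Llama 3.1" in July, this is a major update that includes the addition of its first multimodal models. Image recognition added: in Llama 3.2, two models, 11B (11 billion) and 90B (90 billion), support image recognition. This enables image reasoning use cases such as understanding tables and graphs, generating image captions, and visual grounding, in which objects in an image are pointed to in natural language. For example, when a user asks, based on a graph, which month of the previous year had the highest sales, Llama 3.2 is said to provide an answer quickly. Lightweight models for edge devices: the lightweight 1B and 3B models, summarization, instruction following, rewriting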
manboubird 2024/09/26
meta

llm
Meta Releases "Llama 3.1," an AI Model That Surpasses GPT-4
Meta released its large language model "Llama 3.1" on July 23, 2024. Llama 3.1 is published as open source and is said to offer performance equal to or better than state-of-the-art closed-source AI models such as GPT-4 and GPT-4o. Llama 3.1 https://llama.meta.com/ Introducing Llama 3.1: Our most capable models to date https://ai.meta.com/blog/meta-llama-3-1/ Llama 3.1 is available in models with 405 billion, 70 billion, and 8 billion parameters, and every model has a context window of 128,000. The benchmark results for the 405-billion-parameter "Llama 3.1 405B," "Nemotron 4 340B Instru
manboubird 2024/08/25
llama

meta

chatGpt
Meta Releases "Segment Anything Model 2 (SAM 2)," an AI Model That Can Accurately Identify Objects in Real Time in Videos as Well as Images
Meta has announced "Segment Anything Model 2 (SAM 2)," a unified AI model that can accurately identify which pixels in an image or video belong to which object. With SAM 2, any object can be segmented and tracked consistently in real time across every frame of a video, so it could become a groundbreaking tool in fields such as video editing and mixed reality. Our New AI Model Can Segment Anything – Even Video | Meta https://about.fb.com/news/2024/07/our-new-ai-model-can-segment-video/ Introducing SAM 2: The next generation of Meta Segment Anything Model f
manboubird 2024/08/25
meta

computerVision

segmentAnything
[Mini Review] Trying Out the "Ray-Ban Meta" Smart Sunglasses in Las Vegas
manboubird 2024/07/06
smartglass

rayban

meta

sunglass
Meta
manboubird 2024/07/06
rayban

meta

sunglass

smartglass
https://dl.acm.org/doi/abs/10.1145/3543507.3583310
manboubird 2024/06/26
taxonomy

paper

ucla

meta

www

ontology
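Meta CEO Zuckerberg Sets Up New Body to Receive Advice on AI Products
Mark Zuckerberg, chief executive officer (CEO) of US-based Meta Platforms, has set up a new advisory body on products. The group will meet regularly with Meta's management and advise on improving the company's artificial intelligence (AI) and technology. The body, called the "Meta Advisory Group," has four members: Stripe co-founder and CEO Patrick Collison, former GitHub CEO Nat Friedman, Shopify CEO Tobias Lütke, and investor and former Microsoft executive Charlie Songhurst. According to a Meta spokesperson, none of them will be paid. The spokesperson said the Meta Advisory Group differs from the board of directors: its members are not elected by shareholders and owe no fiduciary duty to Meta. Its role is to "provide insight and advice on technological advancement, innovation, and strategic growth opportunities." Original title: Meta’s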
manboubird 2024/05/24
meta

stripe

artificialIntelligence

advisory
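Yusaku Maezawa Sues US-Based Meta and Facebook Japan, Seeking "1 Yen" in Damages over Impersonation Scam Ads
ZOZO founder Yusaku Maezawa announced on May 15 that he had filed separate lawsuits against US-based Meta and Facebook Japan over fake ads that use celebrities and appear on social media and elsewhere. Maezawa published part of the complaint on his official X account (@yousuck2020) and revealed that he is seeking 1 yen in damages.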
manboubird 2024/05/15
meta

sue
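Meta Builds AI Infrastructure on an Entirely Different Scale, Toward "Full General Intelligence" - Nihon Keizai Shimbun (Nikkei)
US-based Meta has begun expanding its investment in IT (information technology) infrastructure for artificial intelligence (AI) once again. Because of restructuring, it cut capital expenditures (CAPEX), including equipment investment, in 2023, but in 2024 it will raise them again and spend $30 billion to $37 billion (about 4.4 trillion to 5.5 trillion yen). Chief executive officer (CEO) Mark Zuckerberg has stated clearly that the company "aims to achieve full general intelligence." Although lower than the year before, capital investment was still enormous in 2023. The company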
manboubird 2024/03/03
meta

generativeAi

ad

llm

nvidia

oss

model
KDD Tutorial: Assistant
manboubird 2024/02/06
Kidd

tutorial

meta

llm

chatbot
リンク