本文「transformers python library install」を検索

1 - 34 件 / 34件

新着順人気順

絞り込み

検索対象
ブックマーク数
期間
セーフサーチ

transformers python library installの検索結果1 - 34 件 / 34件

大規模言語モデルをフルスクラッチする練習 (環境構築ー前処理ー事前学習ーファインチューニングー評価まで)｜Kan Hatakeyama
- 112 users
- note.com/kan_hatakeyama
- テクノロジー
- 2024/03/06
はじめに以下のオープンなプロジェクトの一環で､大規模言語モデルをフルスクラッチで作る練習をします｡24年3月現在､協力者も募集中です｡リポジトリ当該プロジェクトの標準コードが公開※されたので､それを走らせてみます｡ ※24/3/5時点で､まだレポジトリ内に､工事中の箇所が多々､あります｡このリポ上では､事前学習ー事後学習ー評価まで､一気通貫(?)したパイプラインが提供されています※｡ 0. 環境構築プロジェクトの本番環境はクラウドですが､今回は手持ちのubuntuを使います｡ Dockerはお手軽な一方で､スパコン上で使うと､どうやら速度が落ちるらしいとの噂を聞いたので､condaで作ります(とはいえ､pipしか使わないので､pyenvでもいけると思います)｡必要なマシン適当なlinux: 例えばUbuntu 22.04.3 LTS GPU: 20 GBくらいは欲しいかも? ディスク
Google ColabとVSCodeを用いた分析環境運用方法〜kaggle Tipsを添えて〜 - ギークなエンジニアを目指す男
- 48 users
- www.takapy.work
- テクノロジー
- 2021/08/17
こんにちは。takapy（@takapy0210）です。本エントリは下記イベントでLTした内容の元に、補足事項やコードスニペットなどをまとめたものになります。 kaggle-friends.connpass.com ちなみに今回LTしようと思ったきっかけは以下のような出来事からだったので、みなさんのTipsなども教えていただけると嬉しいです！情報出回ってる感あるけど、colab pro × vscode ssh のオレオレ運用方法を晒すことにより、もっと良い方法のフィードバックもらえるのではドリブンでLTするのはありなのかもしれない・・・？— takapy | たかぱい (@takapy0210) 2021年8月1日 LT資料当日みなさんから頂いたコメント環境構築手順 ngrokアカウント作成と認証キーの取得 ColabにGoogleドライブを接続、ngrok、sshサーバー起動
- VSCode
- colab
- kaggle
- あとで読む
- python
- google
Mojo may be the biggest programming language advance in decades – fast.ai
- 46 users
- www.fast.ai
- テクノロジー
- 2023/05/04
I remember the first time I used the v1.0 of Visual Basic. Back then, it was a program for DOS. Before it, writing programs was extremely complex and I’d never managed to make much progress beyond the most basic toy applications. But with VB, I drew a button on the screen, typed in a single line of code that I wanted to run when that button was clicked, and I had a complete application I could now
- mojo
- Python
- プログラミング
- 興味深い
- Pocket
- 歴史
コンピュータビジョン分野における世界最高峰の国際会議CVPR2022の論文紹介（後編） - NTT docomo Business Engineers' Blog
- 38 users
- engineers.ntt.com
- テクノロジー
- 2022/08/01
目次目次はじめに論文紹介 The Norm Must Go On: Dynamic Unsupervised Domain Adaptation by Normalization OcclusionFusion: Occlusion-aware Motion Estimation for Real-time Dynamic 3D Reconstruction EPro-PnP: Generalized End-to-End Probabilistic Perspective-N-Points for Monocular Object Pose Estimation Cascade Transformers for End-to-End Person Search TrackFormer: Multi-Object Tracking With Transformers Global T
Optimizing your LLM in production
- 28 users
- huggingface.co
- テクノロジー
- 2023/09/16
Note: This blog post is also available as a documentation page on Transformers. Large Language Models (LLMs) such as GPT3/4, Falcon, and LLama are rapidly advancing in their ability to tackle human-centric tasks, establishing themselves as essential tools in modern knowledge-based industries. Deploying these models in real-world tasks remains challenging, however: To exhibit near-human text unders
- LLM
- あとで読む
GitHub - open-webui/open-webui: User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
- 19 users
- github.com/open-webui
- テクノロジー
- 2024/05/25
🚀 Effortless Setup: Install seamlessly using Docker or Kubernetes (kubectl, kustomize or helm) for a hassle-free experience with support for both :ollama and :cuda tagged images. 🤝 Ollama/OpenAI API Integration: Effortlessly integrate OpenAI-compatible APIs for versatile conversations alongside Ollama models. Customize the OpenAI API URL to link with LMStudio, GroqCloud, Mistral, OpenRouter, and
- llm
- OpenAI
- AI
大規模言語モデルをシングルGPUで動かせる!? FlexGenを触ってみた | DevelopersIO
- 15 users
- dev.classmethod.jp
- テクノロジー
- 2023/02/24
こんちには。データアナリティクス事業本部インテグレーション部機械学習チームの中村です。今回は大規模言語モデルをシングルGPUで動かせるという噂のFlexGenについて使ってみて紹介したいと思います。 FlexGenとは FlexGenは、大規模言語モデル（LLM: Large Language Model）をシングルGPU（例えば、16GBのT4や24GBのRTX3090）で実行可能な高スループットな生成エンジンです。以下がGitHubになります。 FlexGenは、Meta社が開発したOPT（Open Pre-trained Transformer）を動かすことができ、実際にAIアシスタントと会話することができます。参考までにOPTに関する論文は以下です。使用環境 Google ColaboratoryのPro環境を使います。モデルのアーキテクチャによって動作させるスペック
- 機械学習
- あとで読む
CUDA 12.8 における Checkpoint API の概要 - NTT docomo Business Engineers' Blog
- 14 users
- engineers.ntt.com
- テクノロジー
- 2025/03/12
こんにちは、イノベーションセンターの鈴ヶ嶺です。普段は AI/ML システムに関する業務に従事しています。本記事では、CUDA 12.8 から追加された Checkpoint API の概要について解説します。まず、Checkpoint のユースケースやこれまでの NVIDIA CUDA における Checkpoint の試みなどの背景を説明し、新たに追加された CUDA Checkpointing について解説します。さらに実際に実装し、torchvision や transformers などの CUDA アプリケーションに対して、Checkpoint の検証をしています。背景 CUDA Checkpointing 実装と検証 cu_check tool 検証 Pytorch Counter torchvision transformers まとめ背景 Checkpoint
- 人工知能
はじめての自然言語処理 spaCy 3.0 で Transformer を利用する | オブジェクトの広場
- 10 users
- www.ogis-ri.co.jp
- テクノロジー
- 2021/06/29
今更ですが今年の2月に spaCy 3.0 が公開されました。 3.0 で導入された新機能の中で目玉と言えるのは、やはり Hugging Face Transformers (以下、単にTransformers) のサポートや PyTorch, Tensorflow との連携になるでしょう。今回はその辺りを実際に学習を動かしながら紹介したいと思います。 1. はじめに今回は今年の2月に公開された spaCy 3.0 の話です。 spaCy は第4回でも紹介しましたが、研究者向けというよりは自然言語処理アプリ開発者向けのオープンソース自然言語処理ライブラリになります。日本語を含めた様々な言語の学習済みモデルが存在しており、 spaCy をインストールして、学習済みモデルをダウンロードするだけで、分かち書き、品詞や依存関係の推定、単語や文の類似度の判定など様々な機能を使用することができます。
- *あとで読む
- あとで読む
The State of Python 2025: Trends and Survey Insights | The PyCharm Blog
- 9 users
- blog.jetbrains.com
- テクノロジー
- 2025/08/20
This is a guest post from Michael Kennedy, the founder of Talk Python and a PSF Fellow. Welcome to the highlights, trends, and key actions from the eighth annual Python Developers Survey. This survey is conducted as a collaborative effort between the Python Software Foundation and JetBrains’ PyCharm team. The survey results provide a comprehensive look at Python usage statistics and popularity tre
- python
はじめての自然言語処理 Hugging Face Transformers で T5 を使ってみる | オブジェクトの広場
- 8 users
- www.ogis-ri.co.jp
- テクノロジー
- 2021/06/18
前回が分量的にやたらと重かったので、今回はその反省(反動?)を踏まえて軽い感じでいってみます。第7回で紹介した T5 ですが Hugging Face の Transformers でもサポートされてますので、その使用方法をご紹介したいと思います。 1. はじめに今回は久しぶりに T5 の話です。T5 に関しては第7回、第8回で一度紹介しているので、未読の方は記事に目を通してから戻ってきて頂けると、より理解がしやすいと思います。さて、 T5 ですが Google のオリジナルコード(以下 “t5"と記述)1は敷居が高いと感じる方もいらっしゃるのではないでしょうか。 Estimator API ベースのコードや gin による設定など慣れていないと、とっつきにくいのではないかと思います。そこで今回は Hugging Face の Transformers 2を使って T5 を動かす方法
- あとで読む
Welcome GPT OSS, the new open-source model family from OpenAI!
- 8 users
- huggingface.co
- テクノロジー
- 2025/08/06
GPT OSS is a hugely anticipated open-weights release by OpenAI, designed for powerful reasoning, agentic tasks, and versatile developer use cases. It comprises two models: a big one with 117B parameters (gpt-oss-120b), and a smaller one with 21B parameters (gpt-oss-20b). Both are mixture-of-experts (MoEs) and use a 4-bit quantization scheme (MXFP4), enabling fast inference (thanks to fewer active
- 人工知能
3分プロトタイピング: ベクトルデータベース超入門 - ROUTE06 Tech Blog
- 6 users
- tech.route06.co.jp
- テクノロジー
- 2024/01/10
連載「3分プロトタイピング」 Streamlitを用いたAIチャットアプリ RAGを使ってAIチャットアプリケーションに知識を与えるベクトルデータベース超入門(この記事です) 前回、前々回とAIアプリケーションのプロトタイプを作る時に便利な2つのフレームワーク: StreamlitとLlamaIndexを紹介しました。この記事では、本格的なAIアプリケーションを作成するときに必要になることの多い、ベクトルデータベースを紹介します。今回も説明が長くなりますが、コード部分は3分で試せることを目指しています！ベクトルデータベース、ベクトル検索とはベクトルデータベースとはどのような技術か、AWSのドキュメントがわかりやすく説明しているので引用します。ベクトルデータベースは、ベクトルを高次元の点として保存および取得する機能を提供します。これらには、N 次元空間の最も近い近傍を効率的かつ高
- あとで読む
Managing Python Environments in 2022 (for the 99%)
- 5 users
- aseifert.com
- テクノロジー
- 2022/04/30
TL;DR: Install mambaforge, use mamba to install packages, and pin direct dependencies inside your environment.yml. Pip, venv, virtualenv, pyenv, pipenv, micropipenv, pip-tools, conda, miniconda, mamba, micromamba, poetry, hatch, pdm, pyflow 🤯 These days even the most senior Python developer is confused about all the options to manage environments. Instead of untangling the whole convolutated mess
- Pocket
- あとで読む
Technology Trends for 2024
- 4 users
- www.oreilly.com
- テクノロジー
- 2024/01/26
This has been a strange year. While we like to talk about how fast technology moves, internet time, and all that, in reality the last major new idea in software architecture was microservices, which dates to roughly 2015. Before that, cloud computing itself took off in roughly 2010 (AWS was founded in 2006); and Agile goes back to 2000 (the Agile Manifesto dates back to 2001, Extreme Programming t
- software
- あとで読む
ControlNet in 🧨 Diffusers
- 4 users
- huggingface.co
- テクノロジー
- 2023/03/04
Ever since Stable Diffusion took the world by storm, people have been looking for ways to have more control over the results of the generation process. ControlNet provides a minimal interface allowing users to customize the generation process up to a great extent. With ControlNet, users can easily condition the generation with different spatial contexts such as a depth map, a segmentation map, a s
- 機械学習
- AI
はじめての自然言語処理 Transformer 系モデルの推論高速化の検証 | オブジェクトの広場
- 4 users
- www.ogis-ri.co.jp
- テクノロジー
- 2023/03/31
今回は Transformer 系のモデル、具体的には BERT, T5, GPT の推論を高速化してみます。高速化手法として FasterTransformer, Torch-TensorRT, AWS Neuron を用い、素の transfomers に比べ、どの程度速くなるか（ならないか）、利点・欠点を確認してみましょう。 1. はじめに今回は Transformer 系のモデル、具体的には BERT, T5, GPT の推論を様々な技術を使って高速化してみます。高速化の元ネタは Hugging Face の transformers1 縛りとして、素の transformers で推論する場合に比べ、どの程度速くなるか（ならないか）見てみましょう。推論を高速化する技術としては FasterTransfomer2, Torch-TensorRT3, AWS Neuron(
PyTorch 2.0: Our next generation release that is faster, more Pythonic and Dynamic as ever – PyTorch
- 4 users
- pytorch.org
- テクノロジー
- 2023/03/16
*To see a full list of public 2.0, 1.13 and 1.12 feature submissions click here. Stable Features [Stable] Accelerated PyTorch 2 Transformers The PyTorch 2.0 release includes a new high-performance implementation of the PyTorch Transformer API. In releasing Accelerated PT2 Transformers, our goal is to make training and deployment of state-of-the-art Transformer models affordable across the industry
- PyTorch
Open AIを敢えて使わない、セルフホストなAI Chat Bot(RAG)の意義と作り方〜スケーラビリティ/コスト最適化のアイデアを添えて〜 - Bering Note – formerly 流沙河鎮
- 4 users
- bering.hatenadiary.com
- テクノロジー
- 2024/01/05
はじめにオープンなモデルの急速な進化オープンなモデルとはオープンなモデルの活況オープンなモデルをセルフホストする利点コスト効率カスタマイズの自由度選択肢の多様性データのプライバシーとセキュリティチームのケイパビリティ向上スケーラブルでコスト最適なAI Chat Bot(RAG)の作り方を考える RAGアプローチの構成要素ベクトル検索とはベクトル検索の前準備ベクトル検索の流れ LLMのデプロイデプロイ方法必要なスペック End to Endなアプリケーションに仕立てる環境使い方実装上のポイント faiss_index検索時のtop_k intfloat/multilingual-e5-largeでのquery / passageの指定 LLMへのプロンプト回答のストリーム systemdによるプロセスのデーモン化スケーラビリティ、コスト最適化のアイデア
はじめての自然言語処理 ByT5 と Charformer の検証 | オブジェクトの広場
- 4 users
- www.ogis-ri.co.jp
- テクノロジー
- 2022/02/16
トークナイザを使わない自然言語処理モデルである ByT5 と Charformer のご紹介です。従来の自然言語処理では多くの場合で文章を単語（あるいはサブワード）単位に分かち書きして処理しましたが、今回のモデルは直接、生のテキストを処理します。それでは実際に動かして単語（サブワード）ベースのモデルと比較してみましょう。 1. はじめに今回は今年5月と6月に発表された ByT51 と Charformer2 の紹介をしたいと思います。一本の記事で 2 つのモデルを扱うのは、この連載では珍しいのですが、この二つはよく似ているというか、Charformer は「ByT5 にもう一工夫加えたもの」くらいの認識なので、一度にさばいてしまいましょうということで。さて、この二つのモデルの特徴ですが「分かち書きをしない」という点に尽きます。今まで、この連載では BERT や T5 等の Tran
deepseek-ai/DeepSeek-OCR · Hugging Face
- 4 users
- huggingface.co
- テクノロジー
- 2025/10/20
🌟 Github | 📥 Model Download | 📄 Paper Link | 📄 Arxiv Paper Link | DeepSeek-OCR: Contexts Optical Compression Explore the boundaries of visual-text compression. Usage Inference using Huggingface transformers on NVIDIA GPUs. Requirements tested on python 3.12.9 + CUDA11.8： torch==2.6.0 transformers==4.46.3 tokenizers==0.20.3 einops addict easydict pip install flash-attn==2.7.3 --no-build-isolati
- 人工知能
GitHub - taishi-i/awesome-ChatGPT-repositories: A curated list of resources dedicated to open source GitHub repositories related to ChatGPT and OpenAI API
- 3 users
- github.com/taishi-i
- テクノロジー
- 2023/04/03
awesome-chatgpt-api - Curated list of apps and tools that not only use the new ChatGPT API, but also allow users to configure their own API keys, enabling free and on-demand usage of their own quota. awesome-chatgpt-prompts - This repo includes ChatGPT prompt curation to use ChatGPT better. awesome-chatgpt - Curated list of awesome tools, demos, docs for ChatGPT and GPT-3 awesome-totally-open-chat
megagonlabs/roberta-long-japanese · Hugging Face
- 3 users
- huggingface.co
- テクノロジー
- 2022/10/05
YAML Metadata Error:"datasets[0]" with value "mC4 Japanese" is not valid. If possible, use a dataset id from https://hf.co/datasets. roberta-long-japanese (jumanpp + sentencepiece, mC4 Japanese) This is the longer input version of RoBERTa Japanese model pretrained on approximately 200M Japanese sentences. max_position_embeddings has been increased to 1282, allowing it to handle much longer inputs
Google ColabでLLM（llm-jp-3-instruct、HF形式）をGGUF形式に変換する｜まゆひらa
- 3 users
- note.com/mayu_hiraizumi
- テクノロジー
- 2024/10/03
Google ColabでLLM（llm-jp-3-instruct、HF形式）をGGUF形式に変換する ※ Last update 10-03-2024 ※ (10-3) rinna/gemma-2-baku-2b-it も利用できます。 ※ (10-3) 3-3.に、google/gemma-2-2b-jpn-it への対応方法を追加しました。 ※ 本記事はHugging Face形式のモデルの変換がメインです。GPUは使用しません。モデルの動作確認もできますが遅いので、特にllm-jp-3-instruct 13bは行わない方が良いでしょう。 ※ 記述がややこしくなるため、F16やf16の記述を省いています。こちらが必要な場合はBF16やbf16の部分を読み替えてください。 ※ Windows PC版については、別の記事を公開しています。 ■ 0. はじめに▼ 0-0. 本記事の内容
CTranslate2でrinna instructionをquantizeして動かす｜if001
- 3 users
- note.com/if001
- テクノロジー
- 2023/06/13
これまでquantizeはllama.cppを使っていたが、GPUを使えなかったのでCTranslate2を使ってみた。 CTranslate2はtransfomer modelのメモリを削減、高速化を行うライブラリ CTranslate2 is a C++ and Python library for efficient inference with Transformer models. https://github.com/OpenNMT/CTranslate2#ctranslate2C++で実装されており、pythonからC++を呼び出している。document通りinstallすると、GPUも利用してくれる。いろんなモデルが対応している Decoder-only models: GPT-2, GPT-J, GPT-NeoX, OPT, BLOOM, MPT, LLaMa, Co
You can now train a 70b language model at home – Answer.AI
- 3 users
- www.answer.ai
- テクノロジー
- 2024/03/08
We’re releasing an open source system, based on FSDP and QLoRA, that can train a 70b model on two 24GB GPUs. Summary Today, we’re releasing Answer.AI’s first project: a fully open source system that, for the first time, can efficiently train a 70b large language model on a regular desktop computer with two or more standard gaming GPUs (RTX 3090 or 4090). This system, which combines FSDP and QLoRA,
whitphx.info
- 3 users
- whitphx-info-ksv92081c-whitphxs-projects.vercel.app
- テクノロジー
- 2023/01/10
coi munje! Yuichiro Tachibana (whitphx) Software Artisan, Indie Dev, OSS Enthusiast Posts 2025年に行ったイベント・旅行 2025-12-31 2025年振り返り; 仕事・旅行・イベント・OSS開発など 2025-12-31 Open Source 101: Practice your first OSS contributions 2025-10-29 You share, you gain: OSS, community, and reward 2025-10-09 PyCon TW 2025 コラム: 台湾のITお守りスナック、乖乖 2025-10-05 Able Carry Max EDCに手を加えて理想のバックパックに近づける 2025-09-18 日常の中で一つのことをうまくやるツール
Topic Modelling With BERTopic
- 3 users
- www.vennify.ai
- テクノロジー
- 2022/07/03
Topic modelling is a common task in NLP. It's an unsupervised technique for determining what topics, which can be thought of as categories, are part of a set of documents and what topics each document is likely to be a part of. Since it's an unsupervised technique, no labels are required, meaning we do not need a predefined list of topics – rather just the text from the documents. In this article,
- 自然言語処理
- あとで読む
Stable diffusion infinity で、絵画の枠外を描き足す | cedro-blog
- 3 users
- cedro3.com
- テクノロジー
- 2022/10/26
1.はじめに今回ご紹介するのは、絵画の枠外をテキストに沿って描き足す Stable diffusion infinity という技術です。 2.Stable diffusion infinity とは？画像生成器（拡散モデル）と画像とテキストの類似度計算器（CLIP）を組み合わせると、空白を初期値としてテキストから画像生成出来ます（Stable diffusion）。このモデルを使って、絵画の一部＋枠外の空白を初期値としてテキストから画像生成すると、枠外の画像も生成出来るわけです（Stable diffusion infinity）。言葉だけで説明するのは難しいので、以下の操作の仕方を見て頂けると、分かりやすいと思います。 3.コード（2022.10.25更新）コードはGoogle Colabで動かす形にしてGithubに上げてありますので、それに沿って説明して行きます。自分で動
- 人工知能
SageMaker + vLLM で LLM API を構築 - asken テックブログ
- 3 users
- tech.asken.inc
- テクノロジー
- 2025/12/23
目次はじめにこの記事で学べること前提知識技術スタックの全体像各コンポーネントの詳細解説パフォーマンス設計構築手順まとめ参考リンクはじめにこの記事は、株式会社asken (あすけん) Advent Calendar 2025の12/23の記事です。こんにちは。AX推進部（テックリード・AIエンジニア）の山口です。今回は、SageMakerでvLLMを動かす方法を説明したいと思います。 LLM を API として提供する際の課題 ChatGPT のような LLM を自社サービスに組み込みたい場合、以下のような課題があります：レイテンシ（応答時間）: ユーザーがテキストを入力してから結果が返るまでの時間。長いとユーザー体験が悪化しますスループット（処理能力）: 同時に多くのリクエストを処理できるかコスト: GPU は高価なので、効率的に使いたい運用負荷: サーバ
GitHub - ComfyUI-Workflow/awesome-comfyui: A collection of awesome custom nodes for ComfyUI
- 3 users
- github.com/ComfyUI-Workflow
- テクノロジー
- 2025/01/08
ComfyUI-Gemini_Flash_2.0_Exp (⭐+172): A ComfyUI custom node that integrates Google's Gemini Flash 2.0 Experimental model, enabling multimodal analysis of text, images, video frames, and audio directly within ComfyUI workflows. ComfyUI-ACE_Plus (⭐+115): Custom nodes for various visual generation and editing tasks using ACE_Plus FFT Model. ComfyUI-Manager (⭐+113): ComfyUI-Manager itself is also a cu
Improving Diffusers Package for High-Quality Image Generation | Towards Data Science
- 3 users
- towardsdatascience.com
- テクノロジー
- 2023/04/06
Overcoming token size limitations, custom model loading, LoRa support, textual inversion support, and more Stable Diffusion WebUI from AUTOMATIC1111 has proven to be a powerful tool for generating high-quality images using the Diffusion model. However, while the WebUI is easy to use, data scientists, machine learning engineers, and researchers often require more control over the image generation p
Getting Started with Sentiment Analysis using Python
- 3 users
- huggingface.co
- テクノロジー
- 2022/02/08
Sentiment analysis is the automated process of tagging data according to their sentiment, such as positive, negative and neutral. Sentiment analysis allows companies to analyze data at scale, detect insights and automate processes. In the past, sentiment analysis used to be limited to researchers, machine learning engineers or data scientists with experience in natural language processing. However
はじめての自然言語処理 MixCSE による教師なし文章ベクトル生成 | オブジェクトの広場
- 3 users
- www.ogis-ri.co.jp
- テクノロジー
- 2022/10/08
今回は教師なしの文章ベクトル化手法である MixCSE の検証です。教師なし学習ですから教師ありの手法よりは精度的に不利でしょうが、局面によっては役に立つケースもあるのでは？と試してみることに。公開されているコードは transformers ベースなのですが、今回は Colab の TPU で動かしてみたので、その方法も紹介しますね。 1. はじめに今回は教師なしの文章ベクトル化手法である MixCSE1 の検証をしてみました。本連載では文章ベクトル化のモデルとして、 Sentence BERT を取り上げたこと(第9回, 第18回)がありますが、品質の良いベクトルを生成する為には大量かつ良質の教師データが必要でした。法律や特許のような特定領域に特化した文章を扱う局面では、対象領域の文書で学習したモデルを使いたいところですが、特定領域限定の都合良いデータはなかなか手に入りません。そ
- NLP