[B! cuda] masterq's Bookmarks

masterq id:masterq

masterq's bookmarks about cuda (18)

SCALE GPGPU Programming Language
SCALE is a GPGPU programming toolkit that allows CUDA applications to be natively compiled for AMD GPUs.
masterq 2024/07/16
"A toolkit that compiles CUDA programs so that they can run on AMD GPUs without any changes at all"

cuda

gpu

gpgpu

linux
CUDA 10.2 or CUDA 11.0 for RTX3060
masterq 2024/07/05
"RTX 3060 uses the Ampere architecture, which requires CUDA 11.x."

cuda

gpu

gpgpu

nvidia
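The question in this entry (CUDA 10.2 or CUDA 11.0 for an RTX 3060) can be worked through as a tiny filter. This is only a sketch under one assumption taken from the quoted answer, namely that Ampere needs CUDA 11.x; the helper name `supports_ampere` is hypothetical, and treating every major version of 11 or higher as acceptable is an extrapolation from that quote.

```python
def supports_ampere(cuda_version: str) -> bool:
    """Return True if the CUDA version string has a major version of 11 or more.

    Assumption (from the quoted answer): Ampere GPUs require CUDA 11.x.
    """
    major = int(cuda_version.split(".")[0])
    return major >= 11


# The two candidates named in the bookmarked question.
candidates = ["10.2", "11.0"]
usable = [v for v in candidates if supports_ampere(v)]
print(usable)
```

Of the two versions asked about, only the second passes the check, which matches the quoted answer.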
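[CUDA] I Wanted to Understand Compatibility Around NVIDIA GPUs and CUDA
I did not really understand it, so this is an attempt to look into it and sort it out. Compute Capability: This identifies the features that the GPU hardware supports, and it is determined uniquely by the hardware; for example, it is 8.6 for the RTX 3000 series. The number goes up as the architecture generation gets newer and features are added. The page linked below has a table mapping Compute Capability to features, and it shows that (at least at the time of writing) Tensor Cores are available with Compute Capability 7.x and above. Which value each model has can be checked on the following site. NVIDIA Driver version and Compute Capability: I could not find any stated relationship with Compute Capability for consumer products, but in the documentation for data centers, Maxwe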
masterq 2024/07/05
Confusing. "Tensor Cores are available with Compute Capability 7.x and above"

nvidia

gpu

cuda

driver

linux

gpgpu
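The Compute Capability rule quoted in this entry can be written as a small lookup. The following is a minimal Python sketch, not code from the bookmarked article: the function names `parse_compute_capability` and `has_tensor_cores` are hypothetical, and the only facts encoded are the ones quoted above (8.6 for the RTX 3000 series, and Tensor Cores at Compute Capability 7.x and above, at least at the time the article was written).

```python
from typing import Tuple


def parse_compute_capability(text: str) -> Tuple[int, int]:
    """Split a string such as "8.6" into (major, minor) integers."""
    major, minor = text.strip().split(".")
    return int(major), int(minor)


def has_tensor_cores(capability: str) -> bool:
    """Apply the rule quoted in the article: Tensor Cores at 7.x and above.

    This threshold is an assumption copied from the bookmarked text,
    not something queried from a driver or device.
    """
    major, _minor = parse_compute_capability(capability)
    return major >= 7


if __name__ == "__main__":
    # "8.6" is the RTX 3000 series value mentioned in the article;
    # the other two values are illustrative inputs.
    for cc in ("6.1", "7.5", "8.6"):
        print(cc, has_tensor_cores(cc))
```

Comparing only the major number keeps the check aligned with how the article states the rule ("7.x and above").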
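[2023] How to Build a Remote AI Development Environment on GCP at Blazing Speed 🔥 | TC3 Inc.｜GIG INNOVATED.
Introduction: Hello, this is @mumeco_ml from the TC3 Data Science team! Our company became a GCP Cloud Partner in 2022/10, and we are now pushing to make greater use of GCP (Google Cloud Platform) in our projects. This time, I will show how to build an AI development environment using Compute Engine, one of GCP's features. When building a development environment on a GCP VM, there are broadly two approaches: using an environment prepared in advance for ML, or using Docker to build everything including the OS environment. This article covers the former. If there is demand, I plan to write up the latter as well. Advantages and disadvantages of a cloud development environment. Advantages: expensive GPUs can be used efficiently on demand; the local machine does not slow down; any machine spec can be used. Disadvantages: you are billed according to usage time;
masterq 2024/07/02
"What I recommend for this OS is Deep Learning on Linux, which is preconfigured for deep learning. Choosing this OS makes it easy to get through tedious steps such as installing CUDA."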

google

gcp

nvidia

cuda

gpu

linux
https://users.dimi.uniud.it/~agostino.dovier/PAPERS/CUDAatSAT_JETAI_DRAFT.pdf
masterq 2024/06/30
sat

solver

gpu

cuda

read later
Welcome to Triton’s documentation! — Triton documentation
Triton is a language and compiler for parallel programming. It aims to provide a Python-based programming environment for productively writing custom DNN compute kernels capable of r
masterq 2024/05/22
Apparently, these days people use this instead of writing CUDA directly.

doc

gpu

ai

language

python

dsl

jit

cuda
GitHub - karpathy/llm.c: LLM training in simple, raw C/CUDA
LLM training in simple, pure C/CUDA. There is no need for 245MB of PyTorch or 107MB of cPython. For example, training GPT-2 (CPU, fp32) is ~1,000 lines of clean code in a single file. It compiles and runs instantly, and exactly matches the PyTorch reference implementation. I chose GPT-2 as the first working example because it is the grand-daddy of LLMs, the first time the modern stack was put toge
masterq 2024/04/12
ai

llm

training

c

simd

cuda
GitHub - ggerganov/llama.cpp: LLM inference in C/C++
The main goal of llama.cpp is to enable LLM inference with minimal setup and state-of-the-art performance on a wide variety of hardware - locally and in the cloud. Plain C/C++ implementation without any dependencies Apple silicon is a first-class citizen - optimized via ARM NEON, Accelerate and Metal frameworks AVX, AVX2 and AVX512 support for x86 architectures 1.5-bit, 2-bit, 3-bit, 4-bit, 5-bit,
masterq 2023/03/13
facebook

ai

c

c++

llm

cuda

simd

avx

opencl

gpu
NVIDIA's "Jetson Nano Developer Kit" Arrives in Stores, Priced at 12,800 Yen; a Starter Kit with a Power Adapter Is Also Available
masterq 2019/05/21
Only 12,800 yen... I want one...

nvidia

jetson

gpu

board

hardware

arm

cortexa

cuda

ai

deeplearning
GitHub - mrakgr/The-Spiral-Language: Functional language with intensional polymorphism and first-class staging.
masterq 2019/01/09
cuda

gpu

language

fsharp
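Zenji Nishikawa's 3DGE: Fully Understanding GeForce RTX 20. A Thorough Look at the Turing Architecture, Which Was Strengthened Beyond Ray Tracing as Well
Writer: Zenji Nishikawa. NVIDIA announced its new-generation GPU architecture "Turing" and the Turing-based GPU "Quadro RTX" around SIGGRAPH 2018, and immediately afterward, at gamescom 2018, it also announced "GeForce RTX 20", the Turing-generation GPU for gamers. Jonah M. Alben (SVP, GPU Engineering, NVIDIA). In this series I have so far offered my own analysis based on the information available at announcement time and explained additional information as it was abruptly disclosed, but the detailed information on the Turing architecture has finally been released, so this time I would like to cover it thoroughly. Note that
masterq 2018/09/15
"NVIDIA implemented the processing that decodes the tree-structured data of cuboids, the collision test between rays and cuboids, and the collision test between rays and polygons in dedicated hardware called the RT Core"/"the Turing generation that uses model numbers of 2060 and below"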

nvidia

gpu

cuda
http://algos.inesc-id.pt/~pff/projects/parsat/publications/Costa-TR2013Feb.pdf
masterq 2018/09/14
nvidia

cuda

gpu

sat

solver

verify
GitHub - ssvlab/esbmc-gpu: ESBMC-GPU is a context-bounded model checker based on the satisfiability modulo theories (SMT) to check for data race, deadlock, pointer safety, array bounds, arithmetic overflow, division by zero, and user-specified assertions
masterq 2018/09/14
So something like this exists... Updated until 2017.

smt

verify

cuda

cpu

lock

pointer

array

z3
https://www.semanticscholar.org/paper/PUG-%3A-A-Symbolic-Verifier-of-GPU-Programs-Li-Gopalakrishnan/bd89a99dd86a649daf0a0954eb49a63195091a47?p2df
masterq 2018/09/14
paper

cuda

cpu

verify
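Texts Distributed at the "Scalar Tuning Workshop" and the "Parallel Programming (MPI) Workshop"｜HPC High Performance Computing｜ACCC. RIKEN
(Updated July 21, 2009) Note: Because the manuscripts were scanned as image data and converted to PDF files, the files are large. There are two kinds of files: one containing all pages, and files split by chapter (or by several sections). The content is the same. The text will be revised and expanded at irregular intervals. The "20xx-xx-xx" at the end of a file name indicates the date on which the pages in that file were last updated. When binding the pages yourself, make sure that pages i, iii, v (v only for (2)) and the odd-numbered pages of the main text (for example 1-1, 1-3, ..., 2-1, 2-3, ...) fall on the right-hand side of each spread. For inquiries about the materials below, please get in touch. (1) "Introduction to Tuning Techniques" (formerly "The Tuning Techniques Handbook"), distributed at the "Scalar Tuning Workshop"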
masterq 2011/03/01
gpu

cuda
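GPGPU Study Group - CUDA Install
Currently being edited. Overview: Developing CUDA applications requires the following. A CUDA-capable NVIDIA GPU: GeForce (8, 9, 100, 200) series, Tesla series, or Quadro series (although a GPU is not needed just for development; device emulation is also available, so programs can run without a GPU as long as they stick to the Runtime API. Performance is a separate matter.) A CUDA-capable GPU driver. The CUDA Toolkit. The CUDA SDK (code samples). Installation steps by OS: Obtain the necessary files from the CUDA programming toolkit download page. Choose OS: select Linux 32-bit or 64-bit. Linux version: select Ubuntu 8.04. Download the CUDA driver, toolkit, and SDK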
masterq 2010/10/12
cuda

debian

gpu
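Using Cuda-gdb - CUDA Information Site
Starting with CUDA 2.3, the CUDA Debugger is officially included in the development environment. This page explains how to use the CUDA Debugger. Installation: The CUDA Debugger is currently built for Redhat Enterprise 5.x. As a result, on Fedora 9 the default environment is missing a library and the debugger cannot start. The following steps substitute another library for the missing one, which makes it possible to run the CUDA Debugger. # /usr/local/cuda/lib # ln -sf /lib/libncurses.so.5.7 libtermcap.so.2 Also, the debugger may not be usable if the GPU is old. The version is somewhat dated, but according to the CUDA 2.1 debugger manual, GeForce 8800 GTS, GeFor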
masterq 2010/04/23
cuda

debug

gpu
CUDA Information Site
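NVIDIA CUDA Information Site: unleash the power of GPU. The NVIDIA CUDA Information Site (hereafter "this site") is run by volunteer engineers at Fixstars, a provider of multicore solutions, as a place to publish and exchange information with the aim of spreading and promoting the use of NVIDIA CUDA. This site covers a range of GPU-related information, centered on CUDA application development on the Tesla C1060/S1070 (Tesla) GPU accelerator boards sold by NVIDIA.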
masterq 2010/04/23
cuda

gpu