[B! GPU][python] xiangzeのブックマーク

ABCIで回そう爆速深層学習 (基礎編)

チーム内で使用したABCI活用Tips集を公開します．免責｜本資料は坂東の個人的経験・実験に基づくTips集であり，ABCI公式とは無関係で不正確な場合があります．公式情報は https://abci.ai , https://docs.abci.ai をご参照下さい．

xiangze 2023/06/14

リンク

google/jax: GPU- and TPU-backed NumPy with differentiation and JIT compilation.

JAX is a Python library for accelerator-oriented array computation and program transf ormation, designed for high-performance numerical computing and large-scale machine learning. With its updated version of Autograd, JAX can automatically differentiate native Python and NumPy functions. It can differentiate through loops, branches, recursion, and closures, and it can take derivatives of derivative

xiangze 2019/01/15

リンク

NumPyの歴史とPythonの並行処理(PyDataTokyo Conference) - HackMD

# NumPyの歴史とPythonの並行処理(PyDataTokyo Conference) https://pydatatokyo.connpass.com/event/87511/ >Sess

xiangze 2018/10/20

リンク

ViennaCL - Linear Algebra Library using CUDA, OpenCL, and OpenMP

ViennaCL is a free open-source linear algebra library for computations on many-core architectures (GPUs, MIC) and multi-core CPUs. The library is written in C++ and supports CUDA, OpenCL, and OpenMP (including switches at runtime). The highlights of the latest 1.7.x release family are: Fast sparse matrix-matrix multiplications, outperforming CUBLAS and MKL.Fine-grained parallel algebraic multigrid

xiangze 2016/01/20

リンク

Chainer – A flexible framework of neural networks — Chainer 7.8.1 documentation

Chainer – A flexible framework of neural networks¶ Chainer is a powerful, flexible and intuitive deep learning framework. Chainer supports CUDA computation. It only requires a few lines of code to leverage a GPU. It also runs on multiple GPUs with little effort. Chainer supports various network architectures including feed-forward nets, convnets, recurrent nets and recursive nets. It also supports

xiangze 2015/06/15

リンク

GPU-Accelerated Graph Analytics in Python with Numba | NVIDIA Technical Blog

Numba is an open-source just-in-time (JIT) Python compiler that generates native machine code for X86 CPU and CUDA GPU from annotated Python Code. (Mark Harris introduced Numba in the post Numba: High-Performance Python with CUDA Acceleration.) Numba specializes in Python code that makes heavy use of NumPy arrays and loops. In addition to JIT compiling NumPy array code for the CPU or GPU, Numba ex

xiangze 2015/03/18

リンク

http://nbviewer.ipython.org/github/memelabo/JetsonMemo/blob/master/Jetson%20TK1%20IPython%20Notebook%20PyCUDA.ipynb

xiangze 2015/03/17

NVIDIA Jetson TK1でIPython NotebookとPyCUDAを使ってみる

リンク

TheanoをWindowsにインストール - 人工知能に関する断創録

Deep Learningを実装するのによく使われるTheanoというPythonライブラリをWindowsマシンにインストールしたのでそのときの記録。ただ使うだけだったらPythonとnumpy/scipyをインストールした後にpip installs Theanoで普通に使えていた。実際、GPUが貧弱なMacbook Airではこの方法でインストールしていた。今回、PCを買い替えた（2015/1/19）こともあって、NVIDIA社のGPUを使って高速演算できるようにしてみたというわけ。私のマシンは、 OS : Windows 8.1 64bit GPU: NVIDIA GeForce GTX 760 Ti OEM という環境。Theano TutorialにもWindowsへのインストール方法は載っているけどはっきり言ってよくわからない。いろいろ調べていたところ以下の記事が自信満々で手

xiangze 2015/01/28

pyCUDAもある。

gpu
python

リンク

GPU-Accelerated Computing with Python

GPU-Accelerated Computing with Python NVIDIA’s CUDA Python provides a driver and runtime API for existing toolkits and libraries to simplify GPU-based accelerated processing. Python is one of the most popular programming languages for science, engineering, data analytics, and deep learning applications. However, as an interpreted language, it’s been considered too slow for high-performance computi

xiangze 2014/06/09

python
gpu

リンク

PyCUDAの紹介

11. スペックIntel Xeon X5570 × 264bit 4コアNVIDIA Tesla M2050 × 2448GPUコア単精度ピーク時1.03Tflops倍精度対応Memory 22GB高速なネットワーク同一Placement Group内は高速通信可能CentOS 5.5同程度のマシンを買うと80〜100万円ぐらい？ 12. 費用On-Demand Instance$2.1 / hour (￥178.5 / hour)1年￥1,563,660，3年￥4,690,980，…Reserved Instance1年 $5,630 (￥478,550)$0.65 / hour (￥55 / hour)3年 $8,650 (￥735,250)$0.33 / hour (￥28 / hour)+EBS料金など(月$2〜)※$1 = ￥85で計算

xiangze 2014/06/08

python
gpu

リンク

PyMC3の計算でGPUを使っている気がしただけの話

たまには浮気させてください。PyMC3は内部でTheanoを使っており、自動微分（auto-diff）が計算可能でStanのサンプラーであるNUTSも実装済みです。またTheanoがGPUに対応しているため、これはMCMCの超高速化が簡単にできるのではッ！と試した記事になります。まずは環境設定から。Windows 7 64bitにVisual Studio Express 2012, CUDA 6, Anaconda 2.0.0 (python 2.7 + numpyとか), theano 0.6.0, PyMC3をインストールします。手順が少し多い場合は分かる範囲でDAG構造を描くのがオススメです。学術書の章案内もDAG構造があるといいですね。さて、今回の手順の依存性をDAG構造で表すと以下になります。 1. Visual Studio Express 2012のインストールここから

xiangze 2014/06/07

リンク

gnumpy

Note that Gnumpy is probably no longer relevant because it was last updated in 2013 and more advanced tools are available now. (There are translations of Gnumpy into various other languages; if you're looking for a translation I suggest a web search) Gnumpy is free software, but if you use it in scientific work that gets published, you should cite this tech report in your publication. Download: gn

xiangze 2014/06/04

gpu
python

リンク

はてなブックマーク

タグ

関連タグで絞り込む (12)

GPUとpythonに関するxiangzeのブックマーク (12)

お知らせ

今週のはてなブックマーク数ランキング（2024年11月第2週）

今週のはてなブックマーク数ランキング（2024年11月第1週）

月間はてなブックマーク数ランキング（2024年10月）

公式Twitter

キーボードショートカット一覧

はてなブックマーク

公式Twitter

はてなのサービス