[B! memory][performance] manboubirdのブックマーク

manboubird id:manboubird

memoryとperformanceに関するmanboubirdのブックマーク (3)

Analyzing Python Pandas' memory leak and the fix
manboubird 2019/10/15
pandas

memory

performance

tuning
リンク
PythonでCSVを高速＆省メモリに読みたい - tkm2261's blog
今日はPython (Pandas)で高速にCSVを読むことに挑戦したいと思います。 Kaggleに参加するたびに、イライラしていたので各実装の白黒はっきりさせようと思います。 R使いが羨ましいなぁと思う第一位がCSV読込が簡単に並列出来て速いことなので、なんとかGILのあるPythonでも高速に読み込みたいと思います。ただ、この検証ではコーディング量が多いものは検証しません。 CSV読込は頻出するので、フットワークの軽さが重要です。（オレオレライブラリ嫌い） Pickleは早いけど。。。結論はDask使おう！検証環境データ速度検証 pandas.read_csv() pandas.read_csv() (dtype指定) pandas.read_csv() (gzip圧縮) numpy.genfromtxt() pandas.read_csv() (chunksize指定 +
manboubird 2017/08/05
dask

pickle

csv

memory

tuning

performance

comparizon
リンク
Tuning - Spark 3.5.2 Documentation
Tuning Spark Data Serialization Memory Tuning Memory Management Overview Determining Memory Consumption Tuning Data Structures Serialized RDD Storage Garbage Collection Tuning Other Considerations Level of Parallelism Parallel Listing on Input Paths Memory Usage of Reduce Tasks Broadcasting Large Variables Data Locality Summary Because of the in-memory nature of most Spark computations, Spark prog
manboubird 2013/12/01
Spark

memory

config

doc

tuning

performance
リンク
1

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx