[B! multiprocessing] incep's bookmarks

Multiprocessing vs. Threading in Python: What you need to know.

incep 2021/08/18

The differences between multiprocessing and threading, and what each one covers

Two libraries that speed up pandas in just a few lines (pandarallel/swifter) - フリーランチ食べたい

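pandas is a very handy Python library for data analysis and data wrangling, but some of its operations are parallelized and some are not, so care is needed. For example, an API such as pd.Series.__add__ (that is, an operation like df['a'] + df['b']) delegates its work to numpy and is therefore parallelized without being affected by Python's GIL, whereas methods such as pandas.DataFrame.apply are implemented purely in Python and are not parallelized. Depending on the workload, that can become the bottleneck. This post introduces two libraries that can parallelize and speed up pandas' non-parallelized operations by "little more than importing them." I also benchmarked the two libraries to check their performance. pandarallel As for pandarallel, Python's m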

incep 2021/05/21

Communication Between Processes - Python Module of the Week

If you find this information useful, consider picking up a copy of my book, The Python Standard Library By Example. Page Contents Communication Between Processes Passing Messages to Processes Signaling between Processes Controlling Access to Resources Synchronizing Operations Controlling Concurrent Access to Resources Managing Shared State Shared Namespaces Process Pools Navigation Table of Conten

incep 2015/11/13

"Effective use of multiple processes usually requires some communication between them, so that work can be divided and results can be aggregated."
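The pattern in that quote (divide the work, then aggregate the results) can be sketched with two queues from the standard `multiprocessing` module. This is only an illustrative sketch, not code from the bookmarked page: the names `worker` and `square_all` are hypothetical, and the choice to prefer the "fork" start method where it exists is an assumption made to keep the example self-contained.

```python
import multiprocessing as mp


def worker(tasks, results):
    """Square numbers taken from `tasks` until the None sentinel arrives."""
    for n in iter(tasks.get, None):
        results.put((n, n * n))


def square_all(numbers, n_workers=2):
    """Divide `numbers` among worker processes and aggregate the results."""
    # Assumption: prefer the "fork" start method where it is available;
    # otherwise fall back to the platform default.
    methods = mp.get_all_start_methods()
    ctx = mp.get_context("fork" if "fork" in methods else None)

    tasks, results = ctx.Queue(), ctx.Queue()
    procs = [ctx.Process(target=worker, args=(tasks, results))
             for _ in range(n_workers)]
    for p in procs:
        p.start()

    # Divide the work: every number becomes one message on the task queue.
    for n in numbers:
        tasks.put(n)
    for _ in procs:
        tasks.put(None)  # one sentinel per worker so each one stops

    # Aggregate: drain the result queue before joining the workers.
    collected = dict(results.get() for _ in numbers)
    for p in procs:
        p.join()
    return [collected[n] for n in numbers]


if __name__ == "__main__":
    print(square_all([1, 2, 3, 4]))  # [1, 4, 9, 16]
```

Results arrive in whatever order the workers finish, so the sketch keys them by input value and restores the input order at the end.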

Python multiprocessing PicklingError: Can't pickle <type 'function'>

I am sorry that I can't reproduce the error with a simpler example, and my code is too complicated to post. If I run the program in IPython shell instead of the regular Python, things work out well. I looked up some previous notes on this problem. They were all caused by using pool to call function defined within a class function. But this is not the case for me. Exception in thread Thread-3: Trac

incep 2015/09/08

"In particular, functions are only picklable if they are defined at the top-level of a module."
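The rule in that quote can be checked directly with `pickle`, without starting any processes. The following is a minimal sketch that assumes Python 3 (the bookmarked question shows a Python 2 error message); `square`, `make_local` and `is_picklable` are hypothetical names used only for illustration.

```python
import pickle


def square(x):
    """Top-level function: pickled by reference to its module and name."""
    return x * x


def make_local():
    """Return a function that is defined inside another function."""
    def local_square(x):
        return x * x
    return local_square


def is_picklable(obj):
    """Return True if `obj` can be serialized with pickle.dumps."""
    try:
        pickle.dumps(obj)
        return True
    except (pickle.PicklingError, AttributeError):
        # Which of the two exceptions is raised for local functions and
        # lambdas differs between Python versions, so both are caught.
        return False


if __name__ == "__main__":
    print(is_picklable(square))            # top-level function
    print(is_picklable(make_local()))      # nested function
    print(is_picklable(lambda x: x * x))   # lambda
```

Because `multiprocessing` pickles the callables it sends to worker processes, a function passed to a pool generally has to pass this kind of check.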

Large-scale parallel processing: the bittersweet relationship between Python and Spark ~ PyData.Tokyo Meetup #3 event report

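Plans to produce logo stickers are also under way. We may be able to hand them out at event venues soon. Announcement of a tutorial and the next meetup: as a first for PyData.Tokyo, we will hold a tutorial for beginners on Saturday, March 7. The next meetup will take "speeding up" data analysis as its theme and will be held on Friday, April 3. See the end of the article for details. Introduction to distributed processing with Spark I am Akira Shibata (@madyagi), an organizer of PyData.Tokyo. Hadoop is already becoming the de facto standard as a platform for processing big data. At the same time, new technologies are emerging every day in pursuit of even faster and more stable data processing, and many technologies are competing and being weeded out. Amid all this, Apache Spark (hereafter Spark), as a new analytics platform, has since around last year been rapidly gaining users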

incep 2015/08/20

incep's bookmarks about multiprocessing (5)
