[B! distributed] [Page 8] yukimori

yukimori_726 id:yukimori_726

yukimori_726's bookmarks on distributed (293)

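Python Distributed Processing: Spartan - Qiita
This article is for people who want to do distributed processing in Python. Many people probably have the impression that Python is slow. Libraries such as Cython have appeared to dispel that image, but this time I would like to introduce distributed processing as one technique for speeding up Python. The representative distributed processing systems are Hadoop and Spark. This time I wanted to simply apply Spark to Python, but the article below notes that data structures are converted between the JVM and Python many times, which increases latency, so it does not get much faster. Looking at the structure in the figure above, there are many places where data is piped to and from the Spark Worker, and I get the impression that these could become the bottleneck in distributed processing. So, since data processing in Python can be sped up by using NumPy's matrix data structure, this time NumPy matrices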
yukimori_726 2016/01/26
python

distributed

spartan

hadoop
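Link
Part 15: Hadoop YARN, a Resource Management Platform for Computer Clusters | gihyo.jp
Introduction: Last time, we gave an overview of MapReduce and its implementation, Apache Hadoop. This time, we explain YARN, which manages computer cluster resources in Apache Hadoop. The emergence of diverse processing systems: With the arrival of Hadoop as one trigger, data processing on computer clusters built from multiple commodity machines is becoming widespread. For example, there are Apache Spark, Apache Tez, and Apache Flink, which allow more flexible application descriptions and more efficient execution than Hadoop MapReduce; Apache Impala, Facebook Presto, and Apache Drill, which can run with low latency; and data processing systems that can handle large volumes of stream data with low latency, such as Apache Storm, Twitter Heron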
yukimori_726 2016/01/26
hadoop

distributed

yarn
Link
Raft:Understandable Distributed Consensus
yukimori_726 2016/01/22
raft

distributed
Link
GitHub - hashicorp/serf: Service orchestration and management tool.
yukimori_726 2016/01/22
distributed

codereading

serf

provisioning
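Link
Part 1: Introduction to Serf: Easily Managing Dynamically Changing Environments | gihyo.jp
What is Serf? Serf is a cluster management tool developed and published as open source by HashiCorp. Simply by starting a lightweight agent you can easily form a cluster, which helps automate work that spans multiple servers. Development started in late 2013 and continues today on GitHub and IRC. Background and use cases: With the spread of systems built on cloud computing and of continuous development and operations styles, situations where infrastructure environments grow and shrink are becoming more common. Using the cloud, everything up to the OS layer can now be prepared in a short time. For middleware and application configuration as well, the use of configuration management tools such as Chef and Ansible has spread, shortening work time and improving accuracy. While it is becoming normal for the infrastructure to change dynamically in this way, from an operations perspective new challenges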
yukimori_726 2016/01/22
serf

distributed
Link
Distributed algorithms - Chapter 7 : Failure Detectors, Consensus and Self-Stabilization
yukimori_726 2016/01/22
distributed
Link
GitHub - bachmanm/failure-detectors: Agreement in Asynchronous Distributed Systems
yukimori_726 2016/01/22
test

distributed
Link
Automated Failure Testing
At Netflix, we have found that proactive failure testing is a great way to ensure that we have a reliable product for our members by helping us prepare our systems, and our teams, for the problems that arise in our production environment. Our various efforts in this space, some of which are manual, have helped us make it through the holiday season without incident (which is great if you’re on-call
yukimori_726 2016/01/22
distributed

test
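Link
What We Talk About When We Talk About Distributed Systems: Important Concepts Around Distributed Systems | POSTD
I have wanted to learn about distributed systems for quite a long time. But once you stick your head in, it is like wandering into a maze with no exit. It is like a rabbit hole that goes on forever. The literature on distributed systems is as numerous as the stars. Not only have many papers been published by various universities, there is also a vast number of books. A complete beginner like me has no idea which papers to read or which books to buy. Then I found that some bloggers recommend papers you should know if you are going to become a distributed systems engineer (whatever that means). Here are some of them. FLP , Zab , Time, Clocks and the Ordering of Events in a Distributed Systems , Viewstamped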
yukimori_726 2016/01/21
distributed
Link
Strong consistency models
Update, 2018-08-24: For a more complete, formal discussion of consistency models, see jepsen.io. Network partitions are going to happen. Switches, NICs, host hardware, operating systems, disks, virtualization layers, and language runtimes, not to mention program semantics themselves, all conspire to delay, drop, duplicate, or reorder our messages. In an uncertain world, we want our software to mai
yukimori_726 2016/01/18
consistency

distributed

linearizability
Link
No Compromises: Distributed Transactions with Consistency, Availability, and Performance | the morning paper
yukimori_726 2016/01/18
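Read later
Link
1. SparkStreaming Development with Spark 1.5 [Getting Started] - Qiita
Introduction: This post was started thanks to Advent Calendar 2015 .. NextGen DistributedComputing system! It is the article for day 1 of the Advent Calendar. About the approach: I want to write so that even people new to Spark and SparkStreaming can understand smoothly by following along step by step. Development is Scala-based. You do not need to have fully mastered Scala to write Spark processing, but I think the basics are necessary. To learn the Scala basics, I recommend referring to the link below. https://gist.github.com/scova0731/2c405ea55488d804b366 Introduction to SparkStreaming: What is SparkStreaming? An extension module of the Spark core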
yukimori_726 2016/01/15
spark

sparkstreaming

distributed

codereading
Link
GitHub - stacks-network/reading-list: Reading list of research papers on blockchains, P2P networks, consensus etc
yukimori_726 2016/01/08
distributed
Link
Multi-version Conflict Serializability - 急がば回れ、選ぶなら近道
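1. Purpose: In future distributed DBs, the underlying premise will shift from distributed nodes to distributed cores. The limits of Moore's law are pushing toward many-core designs and higher node density. With distributed nodes, Snapshot Isolation (SI below) was more or less assumed, because of the read-lock problem and its good fit with node distribution; but if hardware innovations such as RDMA improve latency, improved versions of (ostensibly) single-node S2PL such as SILO also become viable. In that case the theoretical background can no longer assume SI alone, and it becomes hard to follow the discussion without also keeping ordinary Conflict Serializability (CSR below) in mind. With SI "alone", you can somehow get by: follow the argument on the premise that, well, in theory it is about cycles in the graph of r-w dependencies, and review and look up r-w dependencies later. But when ordinary CSR also gets mixed in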
yukimori_726 2016/01/06
distributed
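Link
I Built a KVS-like Thing with Go and gRPC - 小野マトペの納豆ペペロンチーノ日記
Since I had time over the New Year holiday, I used gRPC, which I had wanted to try for a while, from Go and built something like a key-value store. I call it a KVS, but it just makes Get/Put/Delete/Scan on a Go map callable via gRPC. That alone is not very interesting, so, in gRPC fashion, I added a Watch feature that lets you monitor updates to the map. github.com Personally, I was curious about the benefits and differences of gRPC (HTTP/2 + ProtoBuf) compared with an HTTP/1.1 + JSON API, so I wrote it while paying attention to those points. Development steps: Service definition: First, define the KVS service in Protocol Buffers 3. I wrote it roughly while looking at the samples. grpc-kvs/grpc-kvs.proto at master · matope/grpc-kvs · G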
yukimori_726 2016/01/05
kvs

distributed

grpc

golang
Link
Trying Out etcd's Distributed Key-Values Store - SDN開発エンジニアを目指した活動ブログ
I casually tried out etcd, which is provided by CoreOS. etcd is apparently a mechanism that uses a distributed key-values store to share various settings among nodes. etcd is a distributed key value store that provides a reliable way to store data across a cluster of machines. It’s open-source and available on GitHub. etcd gracefully handles master elections during network partitions and will tolerate machine failure, including the master. ◼️ First, preparing the environment ... First, prepare a golang environment
yukimori_726 2015/12/28
etcd

CoreOS

kvs

distributed
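Link
HDFS Erasure Coding
Added 2017/5/19: I translated Cloudera's HDFS Erasure Coding blog post -> Introduction to HDFS Erasure Coding in Apache Hadoop. A blog post describing in detail the HDFS erasure coding I introduced previously in "Is HDFS changing? HDFS support for erasure coding" has been published by Cloudera. It appears to be developed targeting Hadoop 3.0. http://blog.cloudera.com/blog/2015/09/introduction-to-hdfs-erasure-coding-in-apache-hadoop/ It covers everything from background to design policy to evaluation broadly and in considerable detail, and is a substantial read. However, since I do not know whether a Japanese translation will appear, I summarized it for myself. Please point out any mistakes you find. Note: Erasure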
yukimori_726 2015/12/22
hdfs

distributed
Link
Distributed Application Architecture 2015
Slides from a talk at Developer Summit 2015 Autumn
yukimori_726 2015/12/22
Read later

distributed

Architecture

microservices
Link
Building Distributed Systems with Netflix OSS and Spring Cloud
Building Distributed Systems with Netflix OSS and Spring Cloud As presented at: http://www.meetup.com/Pivotal-Open-Source-Hub/events/219264521/ With the advent of microservice and cloud-native application architectures, building distributed systems is becoming increasingly common for the enterprise Java developer. Fortunately many of the innovators in the space, including Twitter, LinkedIn, and Ne
yukimori_726 2015/12/22
distributed

architecture

netflix
Link
Spark + Deep Learning: Distributed Deep Neural Network Training with SparkNet - KDnuggets
Spark + Deep Learning: Distributed Deep Neural Network Training with SparkNet Training deep neural nets can take precious time and resources. By leveraging an existing distributed batch processing framework, SparkNet can train neural nets quickly and efficiently. Deep learning is the hottest machine learning method there is, and it continues to achieve remarkable results. Deep neural networks have
yukimori_726 2015/12/21
Spark

distributed

deeplearning
Link