[B! MapReduce] wlbhiroのブックマーク

wlbhiro id:wlbhiro

MapReduceに関するwlbhiroのブックマーク (6)

[TEZ-2972] Avoid task rescheduling when a node turns unhealthy - ASF JIRA
wlbhiro 2017/03/31
TEZ

MapReduce

failure
リンク
SparkとHadoop MapReduceの違い
速度 MapReduceはHadoopクラスタのメモリを有効活用できていなかった。 SparkではRDD（Resilient Distributed Datasets）を使うことで、データをメモリに保存することができ、必要な場合にのみディスクへの保存を行うことができる。これにより、SparkはHadoopよりも格段に高速である。データ Hadoopはデータをディスクに保存するが、Sparkはメモリに保存する。 SparkはRDD（Resilient Distributed Datasets）とよばれるデータストレージモデルを用いる。RDDはnetwork IOを最小化するフォールトトレランスの機構を提供する。RDDの一部のデータが失われた場合、lineage（データに提供された処理の履歴）を元に再構築が行われる。このためフォールトトレランスのためのレプリケーションが不要となる。これに
wlbhiro 2017/03/08
Hadoop

MapReduce

Spark

Compare
リンク
Disabling Tez for Hive Queries - Hortonworks Data Platform
wlbhiro 2016/12/10
“set hive.execution.engine=mr; ” hiveでTezじゃなくてMapReduceを使う方法

Hive

MapReduce

HortonWorks

TEZ
リンク
MapReduce Tutorial
This document comprehensively describes all user-facing facets of the Hadoop MapReduce framework and serves as a tutorial. Ensure that Hadoop is installed, configured and is running. More details: Single Node Setup for first-time users. Cluster Setup for large, distributed clusters. Hadoop MapReduce is a software framework for easily writing applications which process vast amounts of data (multi-t
wlbhiro 2015/10/28
MRv1とMRv2との比較。

MapReduce

MRv2

MRv1
リンク
EMC XtremSF is a server based PCIe Flash hardware
Please be advised our License Portal will be undergoing maintenance between March 15 10:30pm PST - March 16th 9:00am PST during which time users may experience intermittent performance issues. We apologize for the inconvenience. Please be advised that the Broadcom ERP system will be undergoing maintenance between March 28 7pm PST - Apr 1 7pm PST which will impact all new customer accounts created
wlbhiro 2015/08/03
MapReduce

Hadoop
リンク
MongoでMapReduceする - Qiita
この記事はMongo DB Advent Calender2013の21日目です。 Mongo DBで手軽にMapReduceする方法について書かせていただきます。 #Mongo DBでMapReduce 世間的にMongoでM/Rするのは情弱、世間知らず、自殺行為などと色々disられてはおりますが、やはりスキーマレスで好きなデータを突っ込んでおいて、あとで集計をかけるというお手軽さからいうとMongoのM/Rも充分選択肢としてありだと思います。 #前準備実際に自分が今やっているプロジェクトのうちの１つに、apacheのアクセスログをfluentd経由でMongoに書き出しているものがあります。ちなみにデフォルトのfluentdのプラグインだとpathが１フィールドに登録されてM/Rしずらいので、クエリストリングスをパースして１クエリを１フィールドに入れるout_exec_filterを自作
wlbhiro 2015/07/16
MongoDB

MapReduce
リンク
1

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx