View the accompanying video on Vimeo: https://vimeo.com/47656333
More Information
  Overview of RHadoop from the Revolution Analytics blog
  Slides and Replay of 30-minute presentation about RHadoop, "Leveraging R in Hadoop Environments"
  R in a Nutshell, 2nd edition devotes a good part of the last chapter to RHadoop. "The most mature (and best integrated) project for R and Hadoop is RHadoop."
For developers
  Problems, Suggestions, Interesting Examples Post on the [R
RHIPE: An Interface to Hadoop and R for Large and Complex Data Analysis
Ron Fredericks writes: Dr. Saptarshi Guha created an open-source interface between R and Hadoop called the R and Hadoop Integrated Processing Environment, or RHIPE for short. LectureMaker was on the scene filming Saptarshi’s RHIPE presentation to the Bay Area’s useR Group, introduced by Michael
R is a language specialized for statistical analysis and data mining. The following introduces a MapReduce implementation in R using Hadoop Streaming. Hadoop Streaming passes data through standard input and output, and the files option distributes the R mapper and reducer to the nodes for execution. For example, WordCount can be written as a MapReduce implementation in R as follows.
mapper.r
#!/usr/bin/Rscript
con = file(description="stdin",open="r")
while (length(line <- readLines(con, n = 1, warn = FALSE)) > 0) {
  line <- unlist(strsplit(line, "\t"))
  for(word in line){
    cat(sprintf("%
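The excerpt above is cut off, so the following is only a minimal sketch of the word-count logic that a Hadoop Streaming mapper and reducer pair might implement, not the original author's code. The function names `map_words` and `reduce_counts` are hypothetical, and the sketch assumes tab-separated words on input and `word<TAB>count` records between the two steps. It runs in plain R without Hadoop, with `sort()` standing in for the shuffle phase.

```r
# Hypothetical helpers for illustration; not the original mapper.r / reducer.r.

# Map step: split each input line on tabs and emit one "word\t1" record per word.
map_words <- function(lines) {
  words <- unlist(strsplit(lines, "\t", fixed = TRUE))
  words <- words[nzchar(words)]          # drop empty fields
  sprintf("%s\t1", words)
}

# Reduce step: parse "word\tcount" records and sum the counts per word.
reduce_counts <- function(records) {
  parts  <- strsplit(records, "\t", fixed = TRUE)
  words  <- vapply(parts, `[`, character(1), 1)
  counts <- as.integer(vapply(parts, `[`, character(1), 2))
  totals <- tapply(counts, words, sum)
  unname(sprintf("%s\t%d", names(totals), as.integer(totals)))
}

# Local simulation of map -> sort (shuffle) -> reduce on two sample lines.
records <- map_words(c("a\tb\ta", "b\tc"))
result  <- reduce_counts(sort(records))
cat(result, sep = "\n")
```

In a real streaming job, each step would instead read from `stdin` and write to `stdout`, as the truncated `mapper.r` above begins to do.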