[B! mapreduce] dannのブックマーク

dann id:dann

mapreduceに関するdannのブックマーク (37)

並列データベースシステムの概念と原理
2014/01/30 筑波大学情報システム特別講義Dの講義資料です。 join関係はNAIST時代の宮崎先生のデータ工学Ⅱの内容を参考にしてます。 animation有効なビデオはこちら https://vimeo.com/85598907
dann 2014/02/16
mapreduce
リンク
DSN2012
dann 2012/03/26
mapreduce

performance
リンク
VLDB’11の読む予定の論文リスト - maropuのメモ墓場
VLDB'11から読む予定の論文リストをPick-up（citeulikeに登録予定リスト） DB系/モダンハードウェア系/分散システム系/グラフアルゴリズム系を中心に http://www.vldb.org/2011/?q=node/28 HYRISE - A Main Memory Hybrid Storage Engine Martin Grund (Hasso-Plattner-Institut), Jens Krueger (Hasso-Plattner-Institut), Hasso Plattner (Hasso-Plattner Institute), Alexander Zeier (Hasso-Plattner Institute), Philippe Cudre-Mauroux (MIT CSAIL), Samuel Madden (MIT) Fast Sparse
dann 2012/03/14
mapreduce

research
リンク
MapReduceのパターン、アルゴリズム、そしてユースケース - きしだのHatena
Ilya Katsov氏による「MapReduce Patterns, Algorithms, and Use Cases」の翻訳 http://highlyscala ble.wordpress.com/2012/02/01/mapreduce-patterns/ (下書きに入れて推敲するつもりが、なんか公開されてしまっていたので、あとでいろいろ修正すると思います) February 1, 2012 この記事では、Webや科学論文で見られる異なるテクニックの体系的な視点を与えるために、数々のMapReduceパターンとアルゴリズムをまとめた。いくつかの実用的なケーススタディも提供している。すべての説明とコードスニペットでは、Mapper、Reducer、Combiner、Partitionaer、ソーティングにおいてHadoopの標準的なMapReduceモデルを利用します。このフレー
dann 2012/02/27
mapreduce

hadoop
リンク
MapReduce Patterns, Algorithms, and Use Cases
In this article I digested a number of MapReduce patterns and algorithms to give a systematic view of the different techniques that can be found on the web or scientific articles. Several practical case studies are also provided. All descriptions and code snippets use the standard Hadoop’s MapReduce model with Mappers, Reduces, Combiners, Partitioners, and sorting. This framework is depicted in th
dann 2012/02/10
designpattern

mapreduce
リンク
../graphs/incoop_logProcessing_wordCount.eps
dann 2011/08/05
mapreduce
リンク
NUS Computing - Home
dann 2011/06/23
mapreduce

hadoop
リンク
Mining of Massive Datasets
The book has a new Web site www.mmds.org. This page will no longer be maintained. Your browser should be automatically redirected to the new site in 10 seconds. The book has now been published by Cambridge University Press. The publisher is offering a 20% discount to anyone who buys the hardcopy Here. By agreement with the publisher, you can still download it free from this page. Cambridge Press d
dann 2011/01/26
mapreduce
リンク
第2回,第3回MapReduce本読書会 - 科学と非科学の迷宮
第1回はこちら第2回日時 2010/09/26 19:30 - 21:00？場所都内某所挑戦者 marqs shiumachi 標的 Data-Intensive Text Processing with MapReduce 範囲 3章残り(marqs)4章途中まで(shiumachi) 第3回(take1) 10/3にやるはずだったが、marqs が会場に着いたとたんに(ピー)したので中止第3回(take2) 日時 2010/10/11 19:30 - 21:00？場所都内某所挑戦者 marqs shiumachi 標的 Data-Intensive Text Processing with MapReduce 範囲 4章残り(shiumachi)5章途中まで(marqs) Data-Intensive Text Processing with MapReduce ch
dann 2010/12/20
mapreduce

cool
リンク
MapReduce/Bigtable for Distributed Optimization
Neural Information Processing Systems Workshop on Leaning on Cores, Clusters, and Clouds (2010) For large data it can be very time consuming to run gradient based optimizat ion,for example to minimize the log-likelihood for maximum entropy models.Distributed methods are therefore appealing and a number of distributed gradientoptimization strategies have been proposed including: distributed gradien
dann 2010/12/17
mapreduce

distributed
リンク
機械学習 × MapReduce - ny23の日記
個人的な興味というより，雑用絡みで眺めた論文の紹介．機械学習アルゴリズムを並列分散化するという話が最近流行っているようだ．全然網羅的ではないけど，誰かの役に立つかも知れないので，幾つかメモしておく．まず古典的にはこれ， Map-reduce for machine learning on multicore (NIPS 2006) 古典的な機械学習アルゴリズム（バッチ学習）の多くは，Statistical Query Model で記述できて，それらは summation form で記述できる (から，MapReduce で並列化できる)．実装は Mahout．ただ最近は，バッチアルゴリズムで解ける問題には多くの場合対応するオンラインアルゴリズムが提案されていて，バッチアルゴリズムを並列化することのメリットはあまり無い．オンラインアルゴリズムだとパラメタが連続的に更新されるので，MapR
dann 2010/11/30
mapreduce
リンク
『Real-Time MapReduce』へのコメント
ブックマークしましたここにツイート内容が記載されます https://b.hatena.ne.jp/URLはspanで囲んでください Twitterで共有
dann 2010/11/13
s4

mapreduce
リンク
Designing algorithms for Map Reduce
Since the emerging of Hadoop implementation, I have been trying to morph existing algorithms from various areas into the map/reduce model. The result is pretty encouraging and I've found Map/Reduce is applicable in a wide spectrum of application scenarios. So I want to write down my findings but then found the scope is too broad and also I haven't spent enough time to explore different probl em dom
dann 2010/11/09
mapreduce
リンク
開発メモ: ローカルMapReduceの性能
Kyoto CabinetにMapReduceを実装したという話は前回書いたが、そのLuaバインディングでもMapReduceをサポートした。また、Kyoto Tycoonとそのスクリプト言語拡張でもMapReduceをサポートした。今回はその性能について解説する。ローカルMapReduceのツボ世に言うMapReduceは分散処理のフレームワークだけれども、KC/KTの「ローカルMapReduce」は分散処理を行わない。分散処理をしなかったらデータ処理能力が上がらないじゃないかと思うかもしれないけれども、そうとも限らないのだ。前回も書いたけども、MapReduceフレームワーク部分をうまく実装すると、時間効率と空間効率の双方を向上させることができる。特にキャッシュとソートの部分に工夫がある。 MapReduceは、リポジトリ内（KCではデータベースファイル内）の各レコードからキーと値
dann 2010/11/08
mapreduce
リンク
S4: Distributed Stream Computing Platform
- 55 users
- s4.io
- 暮らし
We've got your back )Buyer Protection ProgramWhen you buy a domain name at Dan.com, you’re automatically covered by our Buyer Protection Program. Our unique & carefully designed domain ownership transfer process is the best rated service in the market. Buyer Protection ProgramWhen you buy a domain name at Dan.com, you’re automatically covered by our unique Buyer Protection Program. Read more about
dann 2010/11/07
java

realtime

mapreduce
リンク
Hadoopを使いこなす(1)
まず、 1 の入力ファイルを分割する方法は、InputFormatクラスの、getSplits関数を上書きすることで、カスタマイズできます。また、 3 のInputSplitから、KeyとValueを抽出する処理も、InputFormatクラスを通じてカスタマイズできます。 InputFormatのgetRecordReader関数を通じて、RecordReaderクラスを生成するのですが、これに任意のRecordReaderクラスを指定すればOKです。 2 のMap処理ですが、ユーザが指定したMapperクラスの処理を実行します。 Mapperクラスは、MapRunnerクラスを通じて、初期化処理、map関数を繰り返す過程、終了処理といった一連の流れを実行します。 MapRunnerクラスをカスタマイズすれば、こうした流れを制御することができます。 0.20.0からの新しいMapRed
dann 2010/03/01
hadoop

mapreduce
リンク
Jimmy Lin » Data-Intensive Text Processing with MapReduce
dann 2010/03/01
mapreduce

algorithm

hadoop

cool
リンク
Googleの基盤クローン Hadoopについて
JSONでメール送信 | HTTP API Server ``Haineko''/YAPC::Asia Tokyo 2013 LT Day2azumakuniyuki 🐈
dann 2009/06/08
google

hadoop

mapreduce
リンク
Introduction to "Cloud Computing" (Fall 2008)
What are the lab sessions? The lectures focus on concepts and theory, but there's often quite a gap between that and actually getting your code to run. There are a lot of details that are best practiced in a hands-on/tutorial environment with peers. Rem ember to bring your laptops! The lab sessions will be loosely structured: I will discuss algorithms, share tips and tricks, answer any questi
dann 2009/05/19
cloud

mapreduce
リンク
Amazon Elastic MapReduceでperlを使った処理をしてみる（その3）
Amazon Elastic MapReduceの例で出てくるのは今まで見た限りでは、みんなs3n://で始まるS3 Native FileSystem上にファイルを置いている。 http://wiki.apache.org/hadoop/AmazonS3 にあるように、もう一つ s3://で始まるS3 Block FileSystemというのがある。これまでS3fsって言ってたけどこれはs3-fuseと紛らわしいし、名前として正しくないのでS3 Block FileSystemと呼ぶべきでした。で、これを使いたい。メリットは、以下のように理解してる。ファイルがブロックに分割されるので、通常5GBまでというS3のファイルサイズの制限を超えられるファイルがブロックに分割されるので、HDFSと同様Hadoopの各jobtaskに処理を効率よく分散できるデメリットは、たぶんこんな感じ
dann 2009/04/11
perl

mapreduce

aws

amazon
リンク
1 2 次のページ