[B! hbase] restartrのブックマーク

restartr id:restartr

hbaseに関するrestartrのブックマーク (13)

Powering Flickr’s Magic view by fusing bulk and real-time compute | code.flickr.com
restartr 2015/09/04
stream

lambda

architecture

hbase
リンク
HbaseとHadoopMR - 急がば回れ、選ぶなら近道
Hbase勉強会のまとめの延長として今後の考え方をまとめておきます。まずは前提として <一般論> Hbaseにかぎらず、NoSQL系一般に言えることではあるが Usecaseを意識して利用する事が必要だ、ということだと思う。最近の傾向としては、Googleでも顕著だけど、一定の用途をターゲットにして特定のミドルを開発するという方法が結構多い。 Hbaseもその流れはあるので、そのあたりは意識する必要はあるかもしれない。 Hbaseついては、注目するとすればFacebookになるかな。 http://www.cloudera.com/resource/hw10_hbase_in_production_at_facebook いずれにしても、割とうまくいっているUsecaseの情報の有用性は他の技術よりも高いと思う。基本的に単純に分散KVSを使いたいならHbaseにこだわる必要
restartr 2011/06/20
hadoop

hbase

mapreduce
リンク
Facebookの新しいリアルタイム解析システムとは？ - nokunoの日記
Facebookの新しいリアルタイム解析のシステムでは、HBaseで1日200億件のイベントを処理しているそうです。以下の記事の翻訳です。High Scalability - High Scalability - Facebook’s New Realtime Analytics System: HBase to Process 20 Billion Events Per DayFacebookがまたやってくれた。彼らは巨大なリアルタイムデータのストリームを処理するもう1つのシステムを構築したのだ。以前にもFacebookはリアルタイムなメッセージシステムをHBaseで構築している(http://highscalability.com/blog/2010/11/16/facebooks-new-real-time-messaging-system-hbase-to-store-135.ht
restartr 2011/03/25
analysis

realtime

facebook

hbase
リンク
第1回HBaseとCassandraの討論会のメモ - ひしだまの変更履歴
HBaseとCassandra討論会のつっこみー。 (豊月) 2010-11-08 10:51:55 >HBaseはキーが偏ると一部のノードだけに負荷がかかるこれは「Cassandraは、キーが偏ると一部のノードだけに負荷が掛かる」です。 HBaseの場合は、リージョンファイル毎に分散させているので、リージョンファイルの指定サイズを越えてまで大きくなったら自動で分割されて、別のノードへ移ります。 Cassandraの場合、キーのハッシュを元に担当を決めるので巧くキーの生成ルールを考えないと特定ノードに負荷が集中する事になります。 >「このトークンはこのリング」「Ring上で、このTokenはこのノード」という情報を管理している、が正しいです。 >Cassandraは構築は楽だが、故障時が面倒（リバランスに時間がかかる） Cassandraに於いて面倒なのは、故障時じゃないです。故障後
restartr 2010/11/08
*開発

hbase

cassandra

study
リンク
Cloudera Blog
We are excited to announce the acquisition of Octop ai, a leading data lineage and catalog platform that provides data discovery and governance for enterprises to enhance their data-driven decision making. Cloudera’s mission since its inception has been to empower organizations to transf orm all their data to deliver trusted, valuable, and predictive insights. With AI and […] Read blog post
restartr 2010/09/03
network,memory,disk,cpu

*サーバー

hadoop

hbase

容量計画
リンク
Facebook on Hadoop, Hive, HBase, and A/B Testing
Effective Practices for Coding with a Chat-Based AI In this article, we explore how AI agents are reshaping software development and the impact they have on a developer’s workflow. We introduce a practical approach to staying in control while working with these tools by adopting key best practices from the discipline of software architecture, including defining an implementation plan, splitting ta
restartr 2010/07/15
*サーバー

facebook

hadoop

hive

hbase
リンク
NoSQL Week in Review 25
restartr 2010/07/12
*サーバー

nosql

まとめ

cassandra

mongodb

couchdb

neo4j

graphDB

hbase

terrastore
リンク
Pig, Cascalog & HBase Among Highlights of May Hadoop Meet-Up (Hadoop and Distributed Computing at Yahoo!)
YDN Hadoop and Distributed Computing at Yahoo! Pig, Cascalog & HBase Among Highlights of May Hadoop Meet-Up Hi Hadoopers Thanks to close to 300 developers who came this week to Yahoo! for our monthly Hadoop User Group meeting. The energy in the packed room was phenomenal and conversations continued long after the formal sessions. Hundreds of Hadoop Fans Flock to Yahoo! for the May Hadoop User Grou
restartr 2010/05/24
*サーバー

hadoop

pig

hbase

cascalog

yahoo
リンク
Riak and Cassandra and HBase, oh my! « Blog of Data
We are marching along in our integration of HBase with the Socorro Crash Stats project, but I wanted to take a minute away from that to talk about a separate project the Metrics team has also been involved with. Mozilla Labs Test Pilot is a project to experiment and analyze data from real world Firefox users to discover quantifiable ways to improve our user experience. I was very interested and e
restartr 2010/05/19
*サーバー

cassandra

riak

hbase

nosql

database
リンク
1 Billion Reasons Why Adobe Chose HBase - High Scalability -
Cosmin Lehene wrote two excellent articles on Adobe's experiences with HBase: Why we’re using HBase: Part 1 and Why we’re using HBase: Part 2. Adobe needed a generic, real-time, structured data storage and processing system that could handle any data volume, with access times under 50ms, with no downtime and no data loss. The article goes into great detail about their experiences with HBase and t
restartr 2010/03/17
HBaseを選ぶ理由。構成が複雑だけど、それだけの価値があるようす。

*サーバー

hadoop

hbase

adobe
リンク
From Shared-All to Shared-Nothing(PDF)
-Patterns From Shared-All to Shared-Nothing Successfully used Patterns in application and table design with Hbase Bob Schulze, eCircle AG March 2010 @ Berlin Apache Hadoop Get Together -Patterns Audience ➲ You have Big Data ➲ Your Organization needs predictable scaling options ➲ You need to be flexible with your Data ➲ You are a Techie Person -Patterns Content ➲ What is shared? ➲ Recap RDBMS vs HB
restartr 2010/03/11
[filetype:pdf][media:document]HBaseのintroduction。DBMSとの違いや勘所も含む。

*サーバー

hadoop

hbase

keyvalue

ppt

db
リンク
Good night, Posterous
Posterous Spaces is no longer available Thanks to all of my @posterous peeps. Y'all made this a crazy ride and it was an honor and pleasure working with all of y'all. Thanks to all of the users. Thanks to the academy. Nobody will read this.
restartr 2010/02/26
分散処理系比較

*サーバー

cloud

hadoop

hbase

cassandra
リンク
Pig on Hadoop - kuangueの日記
Pigってのは，googleで言うところのsawzallに対応するようです．が，ちょっと見たところでは，Sawzallどころではなくて，もっと意欲的です．Sawzallは，MapReduce処理モデルに思い切り引っ張られているけど，Pigは，リレーショナル演算をHadoop::MapReduce上の処理に変換しようという割と壮大な試み．Hadoopは利用しているけども，完全に別プロジェクトでやっています．yahooで作られていたものをオープンソースにしましたということですね．たとえば，下のように書くことができるような言語になっています． VISITS = load '/visits' as (user, url, time); USER_VISITS = group VISITS by user; USER_COUNTS = foreach USER_VISITS generate gr
restartr 2010/02/16
pigとhbaseは違うものと。・pig=ロウベース&MapReduce ・hbase=カラムデータベース,HDFS(テキストファイル)

*サーバー

hadoop

pig

hbase
リンク
1

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx