Mingyuan Xia, McGill University; Mohit Saxena, Mario Blaum, and David A. Pease, IBM Research Almaden. Distributed storage systems are increasingly transitioning to erasure codes, since these offer higher reliability at significantly lower storage cost than data replication. However, these codes trade off recovery performance, as they require multiple disk reads and network transfers for reconstruction.
Riding the wave of the generative AI revolution, third-party large language model (LLM) services like ChatGPT and Bard have swiftly emerged as the talk of the town, converting AI skeptics to evangelists and transforming the way we interact with technology. For proof of this megatrend, look no further than the instant success of ChatGPT […]
Gluster blog stories provide high-level spotlights on our users all over the world. Apparently, someone in Hadoop-land is getting worried about alternatives to HDFS, and has decided to address that fear via social media instead of code. Two days ago we had Daniel Abadi casting aspersions on Hadoop adapters. Today we have Charles Zedlewski explaining why Cloudera uses HDFS. He mentions a recent Giga
I am interested in the design and analysis of large, complex software-intensive systems. I care not only about the technical aspects of design but also about the economic and social implications of design decisions. My research methods, tools, and books have been adopted and applied by governments and Fortune 500 companies around the world. According to Google Scholar and Microsoft Academic, my books a
The event name is written inconsistently, which is a bit awkward, but: "Hbase at FaceBook" on Zusaar. I attended this event. The Facebook folks seemed to know it as the "HBase Tokyo meetup". I won't summarize the content myself, so the following pages are worth a look instead: Tokyo HBase Meetup - Realtime Big Data at Facebook with Hadoop and HB…; a summary of "Hbase at FaceBook" on Togetter; "Why Facebook uses HBase for large-scale realtime processing" (part 1) - Publickey; "Why Facebook uses HBase for large-scale realtime processing" (part 2) - Publickey. Below I write down, all mixed together, the session content, my own thoughts, and things I discussed with people there.
Apache Hadoop Goes Realtime at Facebook. Dhruba Borthakur, Jonathan Gray, Joydeep Sen Sarma, Kannan Muthukkaruppan, Nicolas Spiegelberg, Hairong Kuang, Karthik Ranganathan, Dmytro Molkov, Aravind Menon, Samuel Rash, Rodrigo Schmidt, Amitanand Aiyer (Facebook). {dhruba,jssarma,jgray,kannan,nicolas,hairong,kranganathan,dms,aravind.menon,rash,rodrigo,amitanand.s}@fb.com. ABSTRACT: Facebook recently deployed Facebook Messages
Data Warehousing and Analytics Infrastructure at Facebook. Ashish Thusoo, Zheng Shao, Suresh Anthony, Dhruba Borthakur, Namit Jain, Joydeep Sen Sarma, Raghotham Murthy, Hao Liu (Facebook). The authors can be reached at {athusoo,dhruba,rmurthy,zshao,njain,hliu,suresh,jssarma}@facebook.com. ABSTRACT: Scalable analysis on large data sets has been core to the functions of a number of teams at Facebook
Abstract and Motivation. The only single point of failure (SPOF) in HDFS is its most important node, the NameNode. If it fails, ongoing operations fail and user data may be lost. In 2008, we (a team from the China Mobile Research Institute) implemented an initial version of NameNode Cluster (NNC) on Hadoop 0.17. NNC introduced a Synchronization Agent, which synchronizes the updates of F
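The NNC design described above can be sketched roughly as follows. This is my own illustration of the general primary/standby metadata-sync pattern; the class and operation names are hypothetical and not taken from the NNC code.

```python
# Illustrative primary/standby metadata sync: a synchronization agent
# forwards every namespace edit from the primary NameNode to the standbys,
# keeping them hot so one can take over if the primary fails.

class NameNode:
    def __init__(self):
        self.namespace = {}  # path -> metadata (simplified)

    def apply(self, edit):
        op, path, value = edit
        if op == "create":
            self.namespace[path] = value
        elif op == "delete":
            self.namespace.pop(path, None)

class SyncAgent:
    """Replicates each edit to all standby NameNodes."""
    def __init__(self, standbys):
        self.standbys = standbys

    def replicate(self, edit):
        for standby in self.standbys:
            standby.apply(edit)

primary, standby = NameNode(), NameNode()
agent = SyncAgent([standby])
for edit in [("create", "/a", 1), ("create", "/b", 2), ("delete", "/a", None)]:
    primary.apply(edit)   # primary applies the edit locally...
    agent.replicate(edit) # ...and the agent mirrors it to the standby
assert standby.namespace == primary.namespace == {"/b": 2}
```

A real implementation must also handle ordering, batching, and agent failure; this sketch only shows the data flow.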
So we will write a MapReduce program. It is similar to the popular word-count example, with a couple of differences: our input source is an HBase table, and the output is also sent to an HBase table. First, code access and HBase setup. The code is in a Git repository on GitHub: http://github.com/sujee/hbase-mapreduce You can get it with: git clone git://github.com/sujee/hbase-mapreduce.git This is an Eclipse project.
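The map/shuffle/reduce flow the excerpt describes can be shown with a small stand-in sketch. This is not the repository's code: the input "table" here is just a dict of row-key to cell value standing in for an HBase table, and the HBase-specific plumbing (TableMapper, TableReducer, job setup) is omitted.

```python
# Word-count over table rows: map emits (word, 1) per word in each row's
# value; the reduce step groups by word and sums, writing the counts into
# an output "table" (a dict standing in for an HBase table).
from collections import defaultdict

def map_phase(input_table):
    """Emit (word, 1) pairs for every word in every row's value."""
    for row_key, value in input_table.items():
        for word in value.split():
            yield word, 1

def reduce_phase(pairs):
    """Shuffle (group by key) and sum the counts per word."""
    groups = defaultdict(int)
    for word, count in pairs:
        groups[word] += count
    return dict(groups)

input_table = {"row1": "hello hbase", "row2": "hello hadoop"}
output_table = reduce_phase(map_phase(input_table))
assert output_table == {"hello": 2, "hbase": 1, "hadoop": 1}
```

In the real job, the framework performs the shuffle between the map and reduce tasks, and the reducer writes Put objects back to the output HBase table.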
Hadoop only makes sense deployed onto a cluster, which means that you have to:
- keep a whole set of machines up to date with code
- keep the Hadoop cluster configuration consistent across the cluster
- push out the cluster configuration to everyone who can submit jobs
- lock down the LAN to keep out untrusted people (there is no more security in the Hadoop filesystem than NFS: it is based on trust)
You
It is finally here: you can configure the open-source log aggregator, Scribe, to log data directly into the Hadoop Distributed File System. Many Web 2.0 companies have to deploy a bunch of costly filers to capture the weblogs generated by their applications. Currently, there is no option other than a costly filer, because the write rate for this stream is huge. The Hadoop-Scribe integration allows
Our Use-Case. The Hadoop Distributed File System's (HDFS) NameNode is a single point of failure. This has been a major stumbling block in using HDFS for a 24x7 type of deployment, and it has been a topic of discussion among a wide circle of engineers. I am part of a team that operates a cluster of 1200 nodes with a total size of 12 PB. This cluster is currently running Hadoop 0.20. The NameNode is co