[B! YARN][hadoop] kimutansk's bookmarks

kimutansk id:kimutansk

kimutansk's bookmarks on YARN and hadoop (14)

Introduction to Apache Apex
kimutansk 2016/05/29
Being optimized for YARN and able to persist its state to HDFS, Apex clearly gains a lot from being run on a Hadoop cluster.

hadoop

yarn

apex
Can't create directory /yarn/nm/usercache/urika/appcache/application_1 - Permission denied
Try to run a simple test and get permissioned denied errors; tried as both root and urika user. Just enabled kerberos... [root@skipper4 cloudera-scm-server]# hadoop jar /opt/cloudera/parcels/CDH/lib/hadoop-mapreduce/hadoop-mapreduce-examples.jar pi 10 100 Number of Maps = 10 Samples per Map = 100 Wrote input for Map #0 Wrote input for Map #1 Wrote input for Map #2 Wrote input for Map #3 Wrote inpu
kimutansk 2015/12/03
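This is the issue where directories can no longer be created when a job is submitted to YARN. The fix is known, but why it works is unclear, and Cloudera's Q&A does not seem to explain it either...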

hadoop

yarn
YARN Container Launch Details
kimutansk 2015/03/30
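In YARN, related files such as default_container_executor.sh are all generated on the spot and then executed, so it is hard to hook anything in afterwards. And this is what each of the individual files does.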

yarn

Hadoop
Running TeraSort MapReduce Benchmark – All Help & Support
kimutansk 2015/03/30
So the files YARN generates are written under the nm directories. That is worth knowing for tracing information later.

yarn

hadoop

MapReduce
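I went to a Hortonworks event - wyukawa's diary
I attended Hadoop and the Modern Data Architecture. It was a good event held in a fine hotel. Thank you to everyone at Hortonworks. Here are a few notes on the sessions and on conversations with people at the venue. First, there was a presentation on Ambari, which I use. According to it, a Hue-like feature for submitting queries is apparently coming. It seems they want all access to the Hadoop cluster to go through Ambari. Hmm, I am not sure about that... We have Presto, after all. Personally, I would rather they focus more on cluster management... for example, making it possible to send error notifications to HipChat or Slack instead of email. Nagios and Ganglia, which Ambari uses, will be deprecated, and metrics will be stored in HBase and queried with Phoenix ...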
kimutansk 2015/03/13
Spark on YARN seems to be a headache everywhere. How to handle cluster-management trouble when that is not your main job is also a real concern.

spark

hadoop

yarn
Apache Hadoop YARN: Avoiding 6 Time-Consuming "Gotchas" | Cloudera Developer Blog
kimutansk 2015/03/06
If the YARN NodeManager's memory check is disabled, can kills by the Container Killer be avoided, I wonder?

yarn

hadoop

spark
Taming YARN @ Hadoop conference Japan 2014
YARN (Yet Another Resource Negotiator) is a resource management framework for Hadoop clusters that improves on the scalability limitations of the original MapReduce framework. YARN separates resource management from job scheduling to allow multiple data processing engines like MapReduce, Spark, and Storm to share common cluster resources. It introduces a new architecture with a ResourceManager to
kimutansk 2015/03/02
Kills by the Container Killer are triggered without warning, which makes them quite troublesome... It is not as if they can be predicted that precisely in advance.

yarn

hadoop
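Resource management for distributed processing platforms on YARN - Qiita
This article explains how resource management works for distributed processing platforms on YARN, the situations in which it becomes a problem, and how Spark addresses them, using Spark as an example. YARN basics: In MapReduce v1, the TaskTracker managed resources in units of MapSlot/ReduceSlot, whereas YARN reserves resources in units called "containers" and runs processing inside them. CPU, memory, disk bandwidth, network bandwidth, and so on can be allocated to a container. As of December 2014, resource management is supported for CPU and memory. Control of disk I/O and network I/O may also arrive next year. An example of resource management in Spark on YARN: Apache Spark is a distributed processing framework that makes in-memory data processing easy ...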
kimutansk 2015/01/08
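So when running Spark on YARN, the number of Executors can now be scaled up and down according to resource usage. I should look into what happens to the RDDs held by Executors that are about to be removed.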

yarn

spark

hadoop
Cloudera Blog
kimutansk 2014/10/28
So the biggest challenge in integrating the two is still having YARN allocate the resources Kubernetes needs to run Docker.

hadoop

yarn

kubernetes
Taming YARN @ Hadoop Conference Japan 2014
The document discusses YARN (Yet Another Resource Negotiator), a resource management framework for Hadoop. It describes YARN components like the ResourceManager, NodeManager, and ApplicationMaster. It covers YARN configuration, capacity planning, health checks, thread tuning, and enabling high availability of the ResourceManager through ZooKeeper.
kimutansk 2014/07/11
It closely resembles Mesos, yet it is clearly different. Because YARN does more automatically than Mesos, it is hard in a different way.

yarn

hadoop
Cascading 2.5 Supports Hadoop 2
kimutansk 2013/12/10
"90% of problems can be reasonably expressed in SQL. Cascading gives you a choice of how to fight," they say.

hadoop

dsl

yarn
Samza
Samza allows you to build stateful applications that process data in real-time from multiple sources including Apache Kafka. Battle-tested at scale, it supports flexible deployment options to run on YARN or as a standalone library.
kimutansk 2013/08/25
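And then there is Samza, a distributed stream processing platform that relies on YARN for fault tolerance and pulls messages from Kafka. I should compare it with Storm sometime.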

stream

kafka

yarn

Hadoop
CDH4 LogPath and LogLevel Configurations | 外道父の匠
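Continuing the Hadoop in practice series, this post covers a low-key but important topic. For operating CDH4 + YARN, this is the "back scratcher" installment that reaches the itchy spots: which logs to look at and how to set log levels. Introduction: In connection with this post, the Gedo-style settings from the previous post have been slightly updated. Here the log level is set to WARN. These are the logs of the daemons on each node, split into HDFS and YARN. Log storage path /etc/hadoop/conf/hadoop-env.sh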
kimutansk 2012/09/08
For the sake of "innovative blogs" that deal with CDH4... impressive as always, lol.

yarn

hadoop

cdh4
CDH4 Configurations with YARN, Capacity Scheduler and Hive | 外道父の匠
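CDH4 Configurations with YARN, Capacity Scheduler and Hive. Continuing from last time, this post introduces the Gedo-style configuration files I use with CDH4. I put a great deal of time and effort into them and would like to take them to my grave, but due to pressure from a certain quarter and for the good of society, I am sheepishly releasing them. Introduction: Needless to say, refer to and use them at your own risk. Replace host names and file paths as appropriate. The Japanese in the description fields is my own notes, so search the web if anything looks doubtful. Entries with English only are either not yet verified or need no explanation. I have been running with these settings for more than two months, but they will likely keep changing, so I am noting the update date. Files are UTF-8. Hadoop settings: list of configuration changes from CDH3 to CDH4, core, official core-def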
kimutansk 2012/09/06
Th-this is... I will take a look. People normally do not publish this kind of thing. Amazing.

hadoop

cdh4

yarn

config