[B! hadoop] ICHIROのブックマーク

ICHIRO id:ICHIRO

hadoopに関するICHIROのブックマーク (143)

Hadoopは分散処理のプラットフォームになる～米Clouderaエンジニア
ICHIRO 2016/02/29
hadoop

Kudu
リンク
Hadoopが扱う機密データのマスキングについて - Qiita
CDH 5.4 から導入された、Sensitive Data Redaction (機密データのマスキング) 機能を紹介します。できること Hadoopクラスタのログファイル、Hive/Impalaクエリに含まれる任意の機密データのマスキングが可能です。必要なもの CDH 5.4 / Cloudera Manager 5.4 手順 Cloudera Managerにログインし、HDFSサービスを選択します。 2. HDFSの設定画面で、「redaction」で検索します。 3. デフォルトでは「クレジットカード情報」、「社会保障番号」、「ホスト名」、「メールアドレス」のマスキングテンプレートが用意されています。カスタムのマスキングを定義することも可能です。ここではクレジットカード情報をマスキングします。 4. 設定画面内で、マスキングがどのように動作するのか、テストすることができます。
ICHIRO 2015/07/23
hadoop

hive

security
リンク
Scala Data Pipelines for Music Recommendations
Are you still building data pipelines with Java and Python? Are you curious about the current buzz in the Big Data community surrounding Scala as a data processing environment? In this talk I'll discuss how Spotify migrated its music recommendations pipeline from Python to Scala. I'll dive into the language specific features that make Scala the ideal candidate for big data processing as well as hi
ICHIRO 2015/01/13
Scala

Recommendation

ML

hadoop

spark
リンク
Open Sourcing Cubert: A High Performance Computation Engine for Complex Big Data Analytics
Open Sourcing Cubert: A High Performance Computation Engine for Complex Big Data Analytics Authors: Maneesh Varshney, Srinivas Vemuri What do you do when your Hadoop ETL script is mercilessly killed because it is hogging too many resources on the cluster, or if it starts missing completion deadlines by hours? We encountered this exact same probl em more than a year ago while building the computatio
ICHIRO 2014/11/13
hadoop

middleware

database
リンク
Doesn't work with HA Namenodes (CDH 4.3) · Issue #845 · prestodb/presto
ICHIRO 2013/11/20
Presto with HA NN

hive

hadoop

facebook
リンク
Presto: Free, Open-Source SQL Query Engine for any Data
Calling our Presto community speakers – we want to hear from you! Fill out out community call for papers to speak at upcoming meetups and conferences. What is Presto?Presto is an open source SQL query engine that’s fast, reliable, and efficient at scale. Use Presto to run interactive/ad hoc queries at sub-second performance for your high volume apps.
ICHIRO 2013/11/13
hadoop

hive

facebook
リンク
Cloudera Standard のご案内 ~ 無償版大幅機能強化のお知らせ | Cloudera Japan
データを信頼し、AI を信頼する信頼できるデータ、信頼できるモデル、信頼できる AI を実現するために、これほど多くのクラウドのさまざまなデータタイプを管理でき、オープンデータのイノベーションと大規模展開に対応できるプラットフォームは他にありません。
ICHIRO 2013/11/05
cdh

hadoop
リンク
Cloudera Blog
The ongoing progress in Artificial Intelligence is constantly expanding the realms of possibility, revolutionizing industries and societies on a global scale. The release of LLMs surged by 136% in 2023 compared to 2022, and this upward trend is projected to continue in 2024. Today, 44% of organizations are experimenting with generative AI, with 10% having […] Read blog post
ICHIRO 2013/10/17
hive

hadoop
リンク
このページを見るには、ログインまたは登録してください
Facebookで投稿や写真などをチェックできます。
ICHIRO 2013/08/20
facebook

hadoop
リンク
Impala and Parquet | DBMS 2 : DataBase Management System Services
I visited Cloudera Friday for, among other things, a chat about Impala with Marcel Kornacker and colleagues. Highlights included: Impala is meant to someday be a competitive MPP (Massively Parallel Processing) analytic RDBMS. At the moment, it is not one. For example, Impala lacks any meaningful form of workload management or query optimization. While Impala will run against any HDFS (Hadoop Distr
ICHIRO 2013/06/25
hadoop

Impala
リンク
Parquet
Documentation Download Apache Parquet is an open source, column-oriented data file format designed for efficient data storage and retrieval. It provides high performance compression and encoding schemes to handle complex data in bulk and is supported in many programming language and analytics tools.
ICHIRO 2013/06/05
hadoop

Impala
リンク
HBase at Ameba
HBase×Impalaで作るアドテク�「GMOプライベートDMP」@HBaseMeetupTokyo2015SummerMichio Katano
ICHIRO 2013/03/14
HBase

hadoop

CA
リンク
RHEL6のマルチキューで効率的なネットワークの付加分散
TechCenterから移行されたテクニカルリソース 2018年8月時点で、アクティブなTechCenterのコンテンツが移行され、Dell.com のDellサポートの一部になり、フォーラムがDellコミュニティに移行されました。概要: 2018年8月時点で、アクティブなTechCenterのコンテンツが移行され、Dell.com のDellサポートの一部になり、フォーラムがDellコミュニティに移行されました。
ICHIRO 2013/03/04
hadoop

movie
リンク
Cloudera Blog
Riding the wave of the generative AI revolution, third party large language model (LLM) services like ChatGPT and Bard have swiftly emerged as the talk of the town, converting AI skeptics to evangelists and transf orming the way we interact with techno logy. For proof of this megatrend look no further than the instant success of ChatGPT, […] Read blog post
ICHIRO 2013/02/26
hadoop

R
リンク
Cloudera Blog
Enterprises see embracing AI as a strategic imperative that will enable them to stay relevant in increasingly competitive markets. However, it rem ains difficult to quickly build these capabilities given the challenges with finding readily available talent and resources to get started rapidly on the AI journey. Cloudera recently signed a strategic collaboration agreement with Amazon […] Read blog p
ICHIRO 2013/01/31
hadoop
リンク
Cloudera Blog
ICHIRO 2013/01/31
hadoop
リンク
GitHub - intel-hadoop/project-panthera: Project Panthera is our open source efforts to enable efficient support of standard SQL features for advacned analytics on Hadoop
ICHIRO 2012/12/30
HBase

hadoop
リンク
Performance evaluation of cloudera impala (with Comparison to Hive)
ICHIRO 2012/12/20
hadoop

Impala

hive
リンク
ネームノードHAにおける自動フェイルオーバー(フェンシング編)
Disclaimer: The opinions expressed here are my own and do not necessarily represent those of current or past employers.Twitter / Photos Disclaimer: The opinions expressed here are my own and do not necessarily represent those of current or past employers. Twitter / Photos Hadoopアドベントカレンダー2012 #hadoopAC12jpの、6日目のエントリです。前回は、CDH4.1で導入されたネームノードHAの自動フェイルオーバーについて紹介しました。本エントリでは、自動フェイルオーバー時のフェンシング機能について紹介
ICHIRO 2012/12/07
hadoop

cdh
リンク
ネームノードHAにおける自動フェイルオーバー(概要編)
Disclaimer: The opinions expressed here are my own and do not necessarily represent those of current or past employers.Twitter / Photos Disclaimer: The opinions expressed here are my own and do not necessarily represent those of current or past employers. Twitter / Photos Hadoopアドベントカレンダー2012 #hadoopAC12jpの4日目のエントリとして、CDH4.1で導入された高可用性(HA:High Availability)ネームノードの自動フェイルオーバーについて紹介します。 Introduction C
ICHIRO 2012/12/04
hadoop

cdh
リンク
1 2 3 4 5 6 7 8 次のページ

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx