kuenishiのブックマーク - はてなブックマーク

Best Practices Guide for Systems Security Services Daemon Configuration and Installation - Part 1 - Cloudera Blog

kuenishi 2021/05/11

リンク

Apache Ozone and Dense Data Nodes - Cloudera Blog

kuenishi 2021/04/28

High density node 400TB (in theory, up to 1TB) support and they ran TPC-DS over Impala!

リンク

Multi-Raft – Boost up write performance for Apache Hadoop-Ozone - Cloudera Blog

Multi-Raft – Boost up write performance for Apache Hadoop-Ozone This blog post was written by Guest Blogger Li Cheng, Software Engineer, Tencent Inc. Using Hadoop-Ozone in Prod Apache Hadoop-Ozone is a new-era object storage solution for Big Data platform. It is scala ble with strong consistency. Ozone uses Raft protocol, implemented by Apache Ratis (Incubating), to achieve high availability in it

kuenishi 2020/12/08

It's reallya good writeup

リンク

Small Files, Big Foils: Addressing the Associated Metadata and Application Challenges - Cloudera Blog

*Can go as high as 1.4KB/Column/Partition Example: If there are 1000 tables with 200 partitions each and 10 files per partitions, the Impala Catalog Size will be at least (excluding table stats and table width): #tables * 5KB + #partitions * 2kb + #files * 750B + #file_blocks * 300B = 5MB + 400MB + 1.5GB + 600MB = ~ 2.5GB The larger the Impala Catalog Size the higher its memory footprint. Large me

kuenishi 2019/05/17

リンク

How-to: Set Up a Hadoop Cluster with Network Encryption - Cloudera Blog

Hadoop network encryption is a feature introduced in Apache Hadoop 2.0.2-alpha and in CDH4.1. In this blog post, we’ll first cover Hadoop’s pre-existing security capabilities. Then, we’ll explain why network encryption may be required. We’ll also provide some details on how it has been implemented. At the end of this blog post, you’ll get step-by-step instructions to help you set up a Hadoop clust

kuenishi 2019/01/29

リンク

Untangling Apache Hadoop YARN, Part 4: Fair Scheduler Queue Basics - Cloudera Blog

Untangling Apache Hadoop YARN, Part 4: Fair Scheduler Queue Basics In this installment, we provide insight into how the Fair Scheduler works, and why it works the way it does. In Part 3 of this series, you got a quick introduction to Fair Scheduler, one of the scheduler choices in Apache Hadoop YARN (and the one recommended by Cloudera). In Part 4, we will cover most of the queue properties, some

kuenishi 2016/08/18

リンク

Untangling Apache Hadoop YARN, Part 3: Scheduler Concepts - Cloudera Blog

Untangling Apache Hadoop YARN, Part 3: Scheduler Concepts In Parts 1 and 2, we covered the basics of YARN resource allocation. In this installment, we’ll provide an overview of cluster scheduling and introduce the Fair Scheduler, one of the scheduler choices available in YARN. A standalone computer can have several CPU cores, each running a single process, but there can be as many as a few hundred

kuenishi 2016/01/24

リンク

Untangling Apache Hadoop YARN, Part 2: Global Configuration Basics - Cloudera Blog

Untangling Apache Hadoop YARN, Part 2: Global Configuration Basics A new installment in the series about the tangled ball of thread that is YARN In Part 1 of this series, we covered the fundamentals of clusters of YARN. In Part 2, you’ll learn about other components than can run on a cluster and how they affect YARN cluster configuration. Ideal YARN Allocation As shown in the previous post, a YARN

kuenishi 2016/01/24

"The amount of memory set aside can be fairly large"

リンク

Untangling Apache Hadoop YARN, Part 1: Cluster and YARN Basics - Cloudera Blog

Untangling Apache Hadoop YARN, Part 1: Cluster and YARN Basics In this multipart series, fully explore the tangled ball of thread that is YARN. YARN (Yet Another Resource Negotiator) is the resource management layer for the Apache Hadoop ecosystem. YARN has been available for several releases, but many users still have fundamental questions about what YARN is, what it’s for, and how it works. This

kuenishi 2016/01/24

リンク

Cloudera Blog

kuenishi 2016/01/24

リンク

Cloudera Blog

Enterprises see embracing AI as a strategic imperative that will enable them to stay relevant in increasingly competitive markets. However, it rem ains difficult to quickly build these capabilities given the challenges with finding readily available talent and resources to get started rapidly on the AI journey. Cloudera recently signed a strategic collaboration agreement with Amazon […] Read blog p

kuenishi 2016/01/24

リンク

Cloudera Blog

Enterprises see embracing AI as a strategic imperative that will enable them to stay relevant in increasingly competitive markets. However, it rem ains difficult to quickly build these capabilities given the challenges with finding readily available talent and resources to get started rapidly on the AI journey. Cloudera recently signed a strategic collaboration agreement with Amazon […] Read blog p

kuenishi 2016/01/24

リンク

Cloudera Blog

In an era where artificial intelligence (AI) is reshaping enterprises across the globe—be it in healthcare, finance, or manufacturing—it’s hard to overstate the transf ormation that AI has had on businesses, regardless of industry or size. At Cloudera, we recognize the urgent need for bold steps to harness this potential and dramatically accelerate the time to […] Read blog post

kuenishi 2016/01/21

リンク

How-to: Scan Salted Apache HBase Tables with Region-Specific Key Ranges in MapReduce - Cloudera Blog

kuenishi 2015/07/01

MapReduce is gonna scan instead of you?

リンク

Inside Apache HBase’s New Support for MOBs - Cloudera Blog

Learn about the design decisions behind HBase’s new support for MOBs. Apache HBase is a distributed, scala ble, performant, consistent key value database that can store a variety of binary data types. It excels at storing many relatively small values (<10K), and providing low-latency reads and writes. However, there is a growing demand for storing documents, images, and other moderate objects (MOBs

kuenishi 2015/07/01

database

リンク

Cloudera Blog

Enterprises see embracing AI as a strategic imperative that will enable them to stay relevant in increasingly competitive markets. However, it rem ains difficult to quickly build these capabilities given the challenges with finding readily available talent and resources to get started rapidly on the AI journey. Cloudera recently signed a strategic collaboration agreement with Amazon […] Read blog p

kuenishi 2014/05/20

リンク

Cloudera Blog

kuenishi 2013/08/23

リンク

Cloudera Blog

Enterprises see embracing AI as a strategic imperative that will enable them to stay relevant in increasingly competitive markets. However, it rem ains difficult to quickly build these capabilities given the challenges with finding readily available talent and resources to get started rapidly on the AI journey. Cloudera recently signed a strategic collaboration agreement with Amazon […] Read blog p

kuenishi 2013/07/31

そしてこれがCloudera Searchへと…？

hadoop

リンク

Cloudera Blog

The ongoing progress in Artificial Intelligence is constantly expanding the realms of possibility, revolutionizing industries and societies on a global scale. The release of LLMs surged by 136% in 2023 compared to 2022, and this upward trend is projected to continue in 2024. Today, 44% of organizations are experimenting with generative AI, with 10% having […] Read blog post

kuenishi 2012/10/07

ちょっとPigなめてた

リンク

Cloudera Blog

Riding the wave of the generative AI revolution, third party large language model (LLM) services like ChatGPT and Bard have swiftly emerged as the talk of the town, converting AI skeptics to evangelists and transf orming the way we interact with techno logy. For proof of this megatrend look no further than the instant success of ChatGPT, […] Read blog post

kuenishi 2012/09/01

hadoop

リンク

はてなブックマーク

タグ

ブックマーク / blog.cloudera.com (22)

お知らせ

今週のはてなブックマーク数ランキング（2024年7月第1週）

月間はてなブックマーク数ランキング（2024年6月）

今週のはてなブックマーク数ランキング（2024年6月第5週）

公式Twitter

キーボードショートカット一覧

はてなブックマーク

公式Twitter

はてなのサービス