[B! tdtech] uokada's Bookmarks

Journey of Migrating Millions of Queries on The Cloud

uokada 2024/03/15

tdtech

Incubating Apache Hivemall

Hivemall is an open source machine learning library built as a collection of Hive UDFs. It provides over 100 machine learning algorithms and functions for tasks like feature engineering, evaluation, and recommendation. Hivemall entered the Apache Incubator in 2016 and the first Apache release (v0.5.0) is upcoming. It supports platforms like Hive, Spark, and Pig for scalable parallel processing.

uokada 2018/02/22

tdtech

History of Event Collector in Treasure Data

The Event Collector system was one of the legacy systems at Treasure Data. Over time it faced several performance and scalability problems as usage increased. Engineers addressed these problems through optimizations like increasing socket backlogs, caching parsers, running processes in parallel, and moving deduplication to a separate thread to avoid blocking the input pipeline. These changes helpe

uokada 2018/02/22

Planet-scale Data Ingestion Pipeline: Bigdam

Bigdam is a planet-scale pipeline designed for large-scale data ingestion. It addresses issues with the traditional pipeline such as imperfectqueue throughput limitations, latency in queries from event collectors, difficulty maintaining event collector code, and many small temporary and imported files. The redesigned pipeline includes Bigdam-Gateway for HTTP endpoints, Bigdam-Pool for d

uokada 2018/02/22

User Defined Partitioning on PlazmaDB

User defined partitioning is a new partitioning strategy in Treasure Data that allows users to specify which column to use for partitioning, in addition to the default "time" column. This provides more flexible partitioning that better fits customer data platform workloads. The user can define partitioning rules through Presto or Hive to improve query performance by enabling colocated joins and fi

uokada 2018/02/21

uokada's bookmarks tagged tdtech (5)
