[B! parquet][uber] manboubirdのブックマーク

manboubird id:manboubird

parquetとuberに関するmanboubirdのブックマーク (1)

Spark Meetup at Uber
1) Uber uses Spark and Hadoop to process large amounts of transportation data in real-time and batch. This includes building pipelines to ingest trip data from databases into a data warehouse within 1-2 hours. 2) Paricon is Uber's first Spark application which infers schemas from raw JSON data, converts it to Parquet format for faster querying, and validates the results. It processes over 15TB of
manboubird 2015/10/25
Spark

uber

slide

Kafka

parquet

schemaEvolution

dataStitching

schemaManagement

metadata
リンク
1

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx