[B! columnar] msyktのブックマーク

Apache Carbondata: An Indexed Columnar File Format for Interactive Query with Spark SQL: Spark Summit East talk by Jacky Li and Jihong Ma

msykt 2018/09/16

リンク

How should you build a high-performance column store for the 2020s? – Daniel Lemire's blog

Though most relational databases (like MySQL) are “row oriented”, in that they keep rows stored together… experience has taught us that pivoting the data arrow so that columns, and not rows, are stored together can be beneficial. This is an old observation that most experienced programmers know about as the array-of-struct versus struct-of-array choice. There is even a wikipedia page on the topic.

msykt 2017/11/12

Arrowにdictionary encodingがあるの知らなかった…。カラムナストアのエンコーディングやその周辺の話。関連技術へのリンクが色々貼ってあって楽しい

columnar

リンク

疑似コードで、昨今のIn-Memoryとかカラム型とかを味わう

とうとう、JPOUG Advent Calendar 2014 も最終日となりました。今年もご参加頂いた皆様に感謝しつつも、去年に続き、オオトリを務めさせていただきます。 Oracleデータベースも12.1.0.2というバージョンでIn-Memoryかつカラム型で分析系ワークロード用を高速化するオプションが導入されていることはご存じの通りです。このIn-Memoryオプションという文脈で "ディスクは遅くメモリーは速い。だからIn-Memoryなデータベースは速い" とか "分析系ワークロードはカラム型といったデータフォーマット合っている。だからカラム型が速い" とか "データベースの処理をSIMD(シムディー)とかVector処理といった処理で行うと速い" とかなかなか、上記のキーワードがどのようにデータベース処理と関連しているか不明な状態で説明されることが多いのではないか。と思う今

msykt 2017/05/07

分岐予測ミスのオーバーヘッドがこんなにあるとは

リンク

https://ir.cwi.nl/pub/19953/19953B.pdf

msykt 2017/05/07

リンク

Positional Update Handling in Column Stores

msykt 2017/05/07

Positional Delta Tree(PDT)の話

リンク

列指向データベースのページのデータ構造 - ablog

行指向データベースは行単位でページ(Oracle Database でいうデータブロック)にデータを格納しているのに対して、列指向データベースは列ごとにページに格納している。クエリ実行時に結果セットを返す際に列別にバラバラのページに格納されているデータをどうやってタプル(レコード)に復元している*1のかと思ったがやはり行IDのようなものを持っているようだ。行ID は C-Store では pid、Monet DB では BAT(Binary Association Tables) の oid と呼ばれている。 The Design and Implementation of Modern Column-Oriented Database Systems NSM(N-ary Storage Model): 行方向でブロック（ページ）にデータを格納する方式 DSM(Decomposition

msykt 2017/05/07

読むべき資料がまとめられていて大変良い

リンク

On Column Stores. Interview with Shilpa Lawande | ODBMS Industry Watch

msykt 2014/07/17

columnar

リンク

カラムナストレージ - Yet Another HDIF?

Disclaimer: The opinions expressed here are my own and do not necessarily represent those of current or past employers.Twitter / Photos Disclaimer: The opinions expressed here are my own and do not necessarily represent those of current or past employers. Twitter / Photos Henry Robinsonによる、カラムナストレージの解説記事を翻訳しました。カラムナストレージは、Googleで開発されたデータ処理ツールであるDremelに使用されているファイルフォーマットであり、Clouderaが開発を進めるImpalaでも採用

msykt 2014/07/14

リンク

はてなブックマーク

タグ

関連タグで絞り込む (4)

columnarに関するmsyktのブックマーク (8)

お知らせ

今週のはてなブックマーク数ランキング（2024年7月第1週）

月間はてなブックマーク数ランキング（2024年6月）

今週のはてなブックマーク数ランキング（2024年6月第5週）

公式Twitter

キーボードショートカット一覧

はてなブックマーク

公式Twitter

はてなのサービス