[B! index] mogwaingのブックマーク

mogwaing id:mogwaing

indexに関するmogwaingのブックマーク (22)

About Indexes in Cassandra — Apache Cassandra 1.1.x documentation
mogwaing 2015/09/27
cassandra

kvs

index
リンク
Cassandra at Scale: The Problem with Secondary Indexes | Pantheon.io
Maybe you’re a seasoned Cassandra veteran, or maybe you’re someone who’s stepping out into the world of NoSQL for the first time—and Cassandra is your first step. Maybe you’re well versed in the probl ems that secondary indexes pose, or maybe you’re looking for best practices before you invest too much time and effort into including Cassandra in your stack. The truth is, if you’re using Cassandra o
mogwaing 2015/09/27
cassandra

index

kvs
リンク
Flexible In-Situ Indexing for Hadoop via Elephant Twin
This document discusses flexible indexing in Hadoop. It describes how Twitter uses Elephant-Twin, an open source library they developed, to create indexes at the block level or record level in Hadoop. Elephant-Twin allows minimal changes to jobs/scripts, indexes data without copying it, supports post-factum indexing, and indexes can be used to efficiently retrieve relevant data through an IndexedI
mogwaing 2012/06/24
hadoop

twitter

index

pig
リンク
GiST - Wikipedia
This article includes a list of general references, but it lacks sufficient corresponding inline citations. Please help to improve this article by introducing more precise citations. (June 2015) (Learn how and when to remove this message) In computing, GiST or Generalized Search Tree, is a data structure and API that can be used to build a variety of disk-based search trees. GiST is a generalizati
mogwaing 2011/12/28
database

gist

index
リンク
[HIVE-417] Implement Indexing in Hive - ASF JIRA
Public signup for this instance is disabled. Go to our Self serve sign up page to request an account.
mogwaing 2011/10/17
3) right now we use a map-reduce job to scan the whole index table to find hits offsets. But since the index table is sorted, we can leverage the sort property to avoid the map-reduce job in many cases. (easiest way is to do a binary search in client.)

hive

index
リンク
FilterPushdownDev - Apache Hive - Apache Software Foundation
IntroductionThis document explains how we are planning to add support in Hive's optimizer for pushing filters down into physical access methods. This is an important optimization for minimizing the amount of data scanned and processed by an access method (e.g. for an indexed key lookup), as well as reducing the amount of data passed into Hive for further query evaluation. Use CasesBelow are the ma
mogwaing 2011/10/17
hive

optimizer

index
リンク
Indexed Hive
This document summarizes a presentation on using indexes in Hive to accelerate query performance. It describes how indexes provide an alternative view of data to enable faster lookups compared to full data scans. Example queries demonstrating group by and aggregation are rewritten to use an index on the shipdate column. Performance tests on TPC-H data show the indexed queries outperforming the non
mogwaing 2011/10/17
hive

index

hadoop
リンク
DB2 Range Partitioning
mogwaing 2011/08/16
Partition local index is ... Default in 9.7 unless it does not meet local index criteria

database

db2

partitioning

index
リンク
IndexDev - Apache Hive - Apache Software Foundation
Indexing Is Removed since 3.0There are alternate options which might work similarily to indexing: Materialized views with automatic rewriting can result in very similar results. Hive 2.3.0 adds support for materialzed views.Using columnar file formats (Parquet, ORC) – they can do selective scanning; they may even skip entire files/blocks. IntroductionThis document explains the proposed design for
mogwaing 2011/08/07
hive

index

hadoop
リンク
Product Reviews - Watch and Diamond Reviews Here!
mogwaing 2011/06/23
hadoop

mapfile

index
リンク
Dense and Sparse Indices
mogwaing 2009/04/07
sparse index, dense index

index
リンク
Database index - Wikipedia
This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed. Find sources: "Database index" – news · newspapers · books · scholar · JSTOR (May 2024) (Learn how and when to remove this message) A database index is a data structure that improves the speed of data retrieval operati
mogwaing 2009/04/03
database

index
リンク
https://dl.acm.org/doi/10.1145/319628.319663
mogwaing 2009/01/14
myuiさんの http://www.slideshare.net/myui/blinktree-presentation/ がとても参考になる

b-tree

b*tree

b+tree

database

index

must
リンク
https://dl.acm.org/citation.cfm?id=1458174
- 1 user
- dl.acm.org
- 学び
mogwaing 2008/12/05
search

index

paper

acm
リンク
A Diary Which Heads for Convergence (Naist branch).(2007-08-13)
テキスト索引パトリシア木を作り終えて，今度はString B-Treeの実装していたんだけど，階層化パトリシア木でも良いのかなっとかsuffix木の階層化の話で良いのが提案されてるかも...と気になって，CPS-tree: A Compact Partitioned Suffix Tree for Disk-based Indexing on Large Genome Sequencesという論文を読んでた。結論としてはString B-Treeで良いや，なんだけど。 Suffix Treeを素直にページサイズごとに切って，少し工夫しましたというようなお話。ICDE2007なんだけど，2007年までsuffix treeの二次記憶への格納の話があんまりされていないことに驚き。索引構築と主記憶に収まるように極小表現を考えたり，主記憶上の話が主に研究課題であったようだ。悪くない論文なんだけど
mogwaing 2008/12/04
index

b-tree

patricia tree

research

to see
リンク
https://dl.acm.org/doi/10.1145/564691.564753
mogwaing 2008/12/04
database

index

b-tree

research

must

acm

paper
リンク
株式会社スタイルズ
AWSアドバンスドコンサルティングパートナーの一員として活動する株式会社スタイルズが、AWS導入、移行、開発、セキュリティ、運用保守など、すべてのご相談に乗らせていただきます。 AWSを導入したいが何から始めたらいいかわからない既存のベンダーが新技術に弱く、良い提案がもらえないクラウドの導入にセキュリティの不安がある AWSをとりあえず導入したが、さらに活用していきたい社内にAWSの知見を持っている人がいない AWSならではのシステム開発を詳しく知りたい
mogwaing 2008/11/04
innodb

mysql

database

b+tree

index
リンク
R-Tree - こども(てれび)
R-Tree を勉強します。参考 Rtrees: Theory and Applications この本のサンプル pdf がたぶんわかりやすい (chap.1, chap.2) R-Trees: A Dynamic Index Structure for Spatial Searching 原著論文目的与えられた矩形と交差する図形を探索する問題を考えます。window query と言うらしいです。これを効率的に実行するためのデータ構造が R-Tree です。 R-Tree の概要 R-Tree は B+-Tree の構造をしています。B+-Tree は、 leaf に要素が入っていて非 leaf の node は探索の為のインデックスのみを持っている B-Tree です、たぶん。R-Tree の leaf に入る要素は Minimum Bounding Rectangle (MB
mogwaing 2008/10/30
r-tree

index

b+tree

to see
リンク
MySQL :: MySQL 8.4 Reference Manual :: 17.6.2.2 The Physical Structure of an InnoDB Index
Enabling Automatic InnoDB Configuration for a Dedicated MySQL Server
mogwaing 2008/10/05
InnoDB tries to leave 1/16 of the page free for future insertions and updates of the index records.

innodb

index

b+tree
リンク
空間インデックス - ma38su.sourceforge.jp
Cell method データ領域を包含するセルのサイズを事前に決定する必要があるため、動的なデータベースには不利のようです。各セルは、そのセル領域と重なる領域の識別子を持ちます。セルを細かく区切れば検索精度は向上しますが、使用するデータ領域が増加します。 Digital MapはCell methodにより読み込む領域を検索していますが、すべてオンメモリでデータ構造を構築しているため、セルのサイズを大きくせざるを得ず、検索精度があまりよくありません。それに加えて数値地図25000には領域の外接長方形のデータしかないのでcell methodの恩恵をあまり受けることができず、地図データの読み込みが遅く、また表示領域以外の領域が読み込まれることが多々あります。 Quad Trees 領域を4分割（2次元）することで木構造を構築します．ディスク上に構築する際には、2分割するよりも高速です．ペ
mogwaing 2008/10/04
index

access method
リンク
1 2 次のページ

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx