ログイン
While indexing, Lucene periodically merges multiple segments in the index into a single larger segment. This keeps the number of segments relatively contained (important for search performance), and also reclaims disk space for any deleted docs on those segments. However, it has a well known problem: the merging process evicts pages from the OS's buffer cache. The eviction is ~2X the size of the m
What you get Detect Cassandra Performance Issues FasterSematext Monitoring makes life simpler by putting all Apache Cassandra metrics, logs, dashboards, and alerts at your fingertips. It’s an all-in-one solution with all the tools that you need to troubleshoot Cassandra node performance and health. Spot slow nodes that can degrade the responsiveness of the whole Cassandra clusterGet alerted on exc
Apache Tika - a content analysis toolkit The Apache Tika™ toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF). All of these file types can be parsed through a single interface, making Tika useful for search engine indexing, content analysis, translation, and much more. You can find the latest release on the download page. Please see
Meetup no longer supports file uploading. To upload photos, please use the Photos section of your Meetup group; for other file types, we recommend that you use another service, many of which are mobile-friendly and free, such as Dropbox or Google Drive. Files that have been previously uploaded are still accessible below. We recommend that you save any files that you want to keep to your computer.
by Patrick O'Leary (pjaol at pjaol.com) There are 3 components to how local lucene performs a geographical search. Create a bounding area. Query that areas document set for a text match. Reduce the document set to a precise distance from a center point. Geographical text searching can take multiple formats, inclusion or reductionism. Inclusion is applying text searching over a document set and ver
Apache Solrというのは、Javaベースの検索エンジンシステムです。 「ソーラ」と呼ぶそうです。どうしても覚えられません。 Solr - Wikipedia 実はモバツイッターにも、秘かにツイッターのログ検索なる機能が追加してありまして、モバツイのエゴサーチなどをして、不具合がないかを調べていたりします。 検索エンジンはmysql + sennaを使っているのですが、自分のマシンのスペックよりも、データ量が増えてしまった状態らしく、ヒット数が多い「tinyurl」などの文字列で検索すると、めっさ遅いという状態になってしまいました。 おそらくmysqlの設定などはまだまだ余地があるんでしょう、と、いろいろ工夫しようとしたのですが、どうせならsenna以外も使えるようになりたいなぁと思って、こちらのtwitter検索で使われているSolrってのがあるというお話を聞いたので、Java久々
<!> Solr1.4 This document describes the Java implementation of index replication that works over HTTP and was introduced in Solr1.4. For information on the ssh/rsync based replication available since Solr1.1 please consult CollectionDistribution. Note that for SolrCloud in Solr4.0, replication will be done push-style, and this way of replicating the index will not be necessary anymore. Features Re
リリース、障害情報などのサービスのお知らせ
最新の人気エントリーの配信
処理を実行中です
j次のブックマーク
k前のブックマーク
lあとで読む
eコメント一覧を開く
oページを開く