[B! Search] xefのブックマーク

BTrees, Inverted Indices, and a Model for Full Text Search

xef 2025/04/11

Rust
Search

リンク

PostgreSQL BM25 Full-Text Search: Speed Up Performance with These Tips

You might have come across discussions or blog posts suggesting that PostgreSQL's built-in full-text search (FTS) struggles with performance compared to dedicated search engines or specialized extensions. A notable recent example comes from Neon's blog post, "Performance Benchmark: pg_search on Neon" (link). In their benchmark, Neon compared query performance on their database platform with their

xef 2025/04/10

リンク

Building a search engine from scratch, in Rust: part 1

🦀 Building a search engine from scratch, in Rust: part 1 In the previous article, I introduced what project we're going to address in the following weeks: how to build a cross-platform search engine with encryption capabilities. Today, we'll have a look at the first technical challenge: how to store things on disk. You might be thinking that we start with a simple topic, to warm up and get ready

xef 2025/03/30

Rust
search

リンク

Hello

xef 2025/03/18

Search
Rust

リンク

OpenSearchで日本語全文検索をするためのドメイン知識を整理する - ドワンゴ教育サービス開発者ブログ

導入ドキュメントとインデックスドキュメントインデックスアナライザ Tokenizer n-gram 形態素解析 Character Filter Token Filter マッピングフィールド型文字列配列 null Multifields 検索クエリ Leaf Query match match_bool_prefix match_phrase multi_match query_string Compound Query Boolean Query あとがき We are hiring! 導入 ZEN Study の新しい教材基盤 (Kotlin) では、現在コンテンツ管理のための全文検索機能の導入中で、AWS OpenSearch Service を利用する予定です。 aws.amazon.com この記事は、OpenSearch導入にあたって各種概念モデルの概要を把握す

xef 2025/03/13

Search

リンク

改めて振り返る全文検索エンジンのアナライザー設定 - ZOZO TECH BLOG

はじめにデータシステム部検索技術ブロックの内田です。私たちはZOZOTOWNの検索精度改善や検索システムの運用効率化のためのメンテナンスなどに取り組んでいます。これまでテックブログでご紹介してきた通り、ZOZOの検索改善チームではランキング学習（Learning to Rank）やクエリの意図解釈、ベクトル検索の導入など、比較的モダンなアプローチでZOZOTOWNの検索改善に努めてきました。先進的な技術を調査し、サービスの開発に応用することはサービスの品質改善において重要な取り組みです。 tech blog.zozo.com しかし、モダンなアプローチをとる一方で、検索エンジンのベーシックな設定についてはメンテナンスする機会が徐々に減少していきました。設定内容や経緯を把握している開発メンバーの割合も減っていき、このままだと誰も触れない謎の設定になってしまうリスクがあったため、一度見直しを

xef 2025/02/20

Search

リンク

technomancy search

Did you learn to use the Internet in the 90s like me? There's a lot of nostalgia around those simpler times before the Web had been colonized by companies. Some of it is valid and some is seen thru rose-colored glasses, but anyone who was there at the time can attest to the fact that it was hard to find stuff. The web was wild, weird, and deeply chaotic. But then I rem ember back in 2002 or 2003 th

xef 2025/02/14

Search

リンク

Google Chromeサイト内検索10連発 - 大西ブログ

この記事は、はてなエンジニア Advent Calendar 2024 の27日目の記事です。昨日は、id:k1s1eee さんのAWSリザーブドインスタンスの購入時にチームメンバーのレビューを通すでした。RI購入も結構な額になるのでレビューがあって安心ですね！みなさん、ブラウザで検索してますか！検索エンジンの精度が下がったからといってAIに頼りすぎていませんか？私は検索が好きすぎて、Google Chrome でサイト内検索を大量に設定しています。今日はおすすめのサイト内検索を10個ご紹介します。最初は100個くらい紹介しようと思ったのですがネタが尽きました。サイト内検索とは既定の検索エンジンとサイト内検索のショートカットを設定する - パソコン - Google Chrome ヘルプアドレスバーにショートカットを入力して、特定のサイト内をすばやく検索したり、別の検索エンジン

xef 2024/12/27

Search

リンク

安定したElasticsearchバージョンアップを目指して - MonotaRO Tech Blog

(2024/12/10 13:35) Elastic Stack (Elasticsearch) Advent Calendar 2024のリンクを追加初めまして。ECシステムエンジニアリング部門 EC商品基盤グループサーチチーム松浦です。今回は、全文検索エンジンElasticsearch のバージョンアップの具体的な取り組みについて取り上げます。このブログ記事の内容はElasticsearch株式会社が主催するElasticsearch Community in Osaka - connpassで発表した内容を元に作成しました。また、Elastic Stack (Elasticsearch) - Qiita Advent Calendar 2024 - Qiita の10日目の記事です。所属チームとシステムの概要説明今回のバージョンアップの詳細と、これまでのバージョンアッ

xef 2024/12/10

リンク

Meilisearch Indexes Embeddings 7x Faster with Binary Quantization

xef 2024/11/30

Search

リンク

Understanding the BM25 full text search algorithm

Nov 19, 2024 BM25, or Best Match 25, is a widely used algorithm for full text search. It is the default in Lucene/Elasticsearch and SQLite, among others. Recently, it has become common to combine full text search and vector similarity search into "hybrid search". I wanted to understand how full text search works, and specifically BM25, so here is my attempt at understanding by re-explaining. Motiv

xef 2024/11/21

リンク

Meilisearch is too slow

xef 2024/08/21

Search

リンク

What's next for Kagi? | Kagi Blog

30 May, 2024 Two years ago, on June 1st, 2022, Kagi introduced a search engine that challenged the ad-supported version of the web. Kagi Search instead works for you, the user, and not an advertiser paying for your attention. At the time of launch, we did not know if anyone would pay for their search engine and web browser, but luckily, here we are two years later at the forefront of a movement to

xef 2024/05/31

Search

リンク

誰でもわかる全文検索入門

全文検索エンジンも、事前に検索対象のデータを調べてこのような索引を作っておくことで、高速な検索を実現しています。インターネット検索の例たとえばインターネット検索の場合、まず各サイトからデータを収集して、その中から索引に載せる単語を選んで索引を作ります。索引のデータは下の図の右側のような表になっています。各単語と、それがどのサイトのどこにあったのかを記録しておく形です。検索の時は、この索引データを調べます。例えば利用者が「理解」という言葉で検索したら、索引の「理解」のところを調べます。そうすると「サイトAの8文字目と18文字目、サイトGの……」と出現場所がわかるので、すぐに結果を返せます。なお、この図では索引の単語の並び方が適当ですが、実際には本と同じように「あいうえお順」などに並べておいて、すぐに単語を見つけられるようにしてあります。索引にない単語この仕組みの場合、索引にある単

xef 2024/05/21

Search
NLP

リンク

Meilisearch 1.8

xef 2024/05/13

Search

リンク

Why Full Text Search is Hard

It’s easy to find documents containing "large" and "elephant". It’s hard to find documents in German which have "large" and "elephant" together in a sentence, or words with similar meanings to large, and provide only the 10 most relevant documents. And the sense that full-text search should be easy often stems from fixating on the middle part of "What’s so hard about implementing an inverted index

xef 2024/05/05

Search

リンク

静的サイトに特化した全文検索ライブラリ「Pagefind」、さくらのレンタルサーバで動かしてみた

先日、静的サイトに特化した全文検索ライブラリとして「Pagefind」というソフトウェアがあることを、下記の記事が話題になったことで知りました。参考：静的サイトに特化した検索ライブラリ Pagefind を試す | grip on minds 実は、いまお読みのPublickeyはまさに、Movable TypeというCMSを用いて生成された静的なWebサイトです。現在、PublickeyではGoogleが提供している「Googleカスタム検索エンジン」を全文検索エンジンとして採用しています（右上の虫眼鏡アイコンから呼び出せます）。しかしPublickeyでは以前からGoogleに依存しない、自前の全文検索エンジンを持てないかと模索しており、まさにPagefindは私が探し求めていたソフトウェアだったと言えます。そこでさっそくPagefindがPublickeyに導入できるかどうか、

xef 2024/01/17

Search

リンク

静的サイト向けの全文検索エンジンと UI ライブラリの Pagefind

デモとして、このブログに Pagefind を導入してみました。ヘッダーの検索アイコンをクリックすると検索フォームが表示されるので、キーワードを入力して検索してみてください。使い方 Pagefind は構築済みの UI ライブラリと、CLI コマンドとしてインデックスを作成するためのツールから構成されています。まずは UI ライブラリの部分から見てみましょう。 UI ライブラリ Pagefind の UI ライブラリは、検索フォームと検索結果を表示するためのコンポーネントから構成されています。この UI は以下のコードを追加するだけで簡単に利用できます。 <link href="/pagefind/pagefind-ui.css" rel="stylesheet" /> <script src="/pagefind/pagefind-ui.js"></script> <div id="s

xef 2024/01/07

Search
SSG

リンク

クックパッドの検索反映時間を 1/288 にしたシステム改修 - クックパッド開発者ブログ

こんにちは。レシピ事業部の新井（@SpicyCoffee）です。クックパッドではこれまで、レシピを投稿してから検索結果に反映されるまで最長で 24 時間程度の時間がかかっていました。今回、この時間を 5 分程度、最長でも 10 分程度に短縮することに成功しました。本記事では、プロジェクトオーナーの立場で関わった私が代表してその開発について紹介します。プロジェクトの目的と数値目標本プロジェクトでは上記の「レシピを投稿してから検索結果に反映されるまでの時間短縮」が目的とされました。しかし、時間短縮といっても現状 24 時間であるものを "1 時間" にするのか、"1 分" にするのか、"1 秒" にするのかでは話が全然違います。この数値目標は設計を始めとした後の意思決定に大きく影響を与えるため、しっかりとした意図を持った状態で明確に定めておく必要がありました。そこで、私とプロダクトオー

xef 2023/10/06

Search
Solr

リンク

Release Pagefind v1.0.0 — Stable static search at scale ✨ 🚀 · CloudCannon/pagefind

Pagefind 1.0 is here! This release has been many months in the making, and we're thrilled to be bringing some great new features and improvements. This release also marks a point in time for Pagefind's stability and maturity. Thanks to everyone who has helped out with contributions and feedback in the last year, we're now more confident than ever that Pagefind is a perfect fit with nearly any stat

xef 2023/09/14

Search
Rust

リンク

はてなブックマーク

タグ

関連タグで絞り込む (17)

Searchに関するxefのブックマーク (40)

お知らせ

今週のはてなブックマーク数ランキング（2025年4月第3週）

今週のはてなブックマーク数ランキング（2025年4月第2週）

今週のはてなブックマーク数ランキング（2025年4月第1週）

公式Twitter

キーボードショートカット一覧

はてなブックマーク

公式Twitter

はてなのサービス