wate_wateのブックマーク - はてなブックマーク

ブックマーク / www.garysieling.com (1)

Full-Text Indexing PDFs in Javascript - Gary Sieling
I once worked for a company that sold access to legal and financial databases (as they call it, “intelligent information“). Most court records are PDFS available through PACER, a website developed specifically to distribute court records. Meaningful database products on this dataset require building a processing pipeline that can extract and index text from the 200+ million PDFs, representing 20+
wate_wate 2013/05/19
JavaScript
リンク
1

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx