サクサク読めて、アプリ限定の機能も多数!
トップへ戻る
体力トレーニング
okfnlabs.org
Extracting data from PDFs remains, unfortunately, a common data wrangling task. This post reviews various tools and services for doing this with a focus on free (and preferably) open source options. The tools we can consider fall into three categories: Extracting text from PDF Extracting tables from PDF Extracting data (text or otherwise) from PDFs where the content is not text but is images (for
ElasticSearch is a great open-source search tool that’s built on Lucene (like SOLR) but is natively JSON + RESTful. Its been used quite a bit at the Open Knowledge Foundation over the last few years. Plus, as its easy to setup locally its an attractive option for digging into data on your local machine. While its general interface is pretty natural, I must confess I’ve sometimes struggled to find
Bad Data is a site providing real-world examples of how not to prepare or provide data. It showcases the poorly structured, the mis-formatted, or the just plain ugly. Its primary purpose is to educate – though there may also be some aspect of entertainment. As a side-product it also provides a source of good practice material for budding data wranglers (the repo in fact began as a place to keep pr
Wikipedia.JS is a small Javascript library for accessing information in Wikipedia articles such as dates, places, abstracts and more ... Get the code You can grab the (incredibly lightweight) wikipedia.js library from here. Want to browse the annotated source? The library is the work of Open Knowledge Foundation Labs and Rufus Pollock in particular. It is, in essence, a small wrapper around the da
A simple but powerful library for building data applications in pure Javascript and HTML. Recline re-uses best-of-breed presentation libraries like SlickGrid, Leaflet, Flot and D3 to create data 'Views' and allows you to connect them with your data in seconds. Documentation » Tutorials » Demos » Get started fast // Load some data var dataset = recline.Model.Dataset({ records: [ { value: 1, date: '
The Annotator is an open-source JavaScript library and tool that can be added to any webpage to make it annotatable. Annotations can have comments, tags, users and more. Morever, the Annotator is designed for easy extensibility so its a cinch to add a new feature or behaviour. Check out the live demonstration or install it now. Adding annotation to your webpage using the Annotator is easy. Full in
このページを最初にブックマークしてみませんか?
『Open Knowledge Labs』の新着エントリーを見る
j次のブックマーク
k前のブックマーク
lあとで読む
eコメント一覧を開く
oページを開く