webdatacommons.org
A Series of Web Table Corpora extracted from the Common Crawl
A subset of the HTML tables on the Web contains relational data which can be useful for various applications. The Web Data Commons project has extracted two large corpora of relational Web tables from the Common Crawl and offers them for public download. This page provides an overview of the corpora as well as their use cases.
News 2017
The Web Data Commons project extracts structured data from the Common Crawl, the largest web corpus available to the public, and provides the extracted data for public download in order to support researchers and companies in exploiting the wealth of information that is available on the Web. News 2024-02-01: We have released the WDC Schema.org Table Corpus 2023 which contains ~5M tables and is bas