Towards a Large Corpus of Richly Annotated Web Tables
for Knowledge Base Population
Authors: Philipp Braukmann, Lorenzo Cazzoli, Basil Ell*, Sherzod Hakimov*, Fabian Kaupmann, Amerigo Mancino, Junaid Altaf Memon, Kai Rother, Abhishek Saini
Statistics for Language Detection Task
This Diagram shows the occurence of specific combinations of languages that were detected in tables.
This Diagram shows the occurence of different languages in a sample of 1,000,000 tables of the WebdataCommons corpus.