Michael Färber, Achim Rettinger

A Statistical Comparison of Current Knowledge Bases


Used data sets:

  • DBpedia 2014:
    article_categories_en.nt
    category_labels_en.nt
    disambiguations_en.nt
    external_links_en.nt
    homepages_en.nt
    instance_types_en.nt
    labels_en.nt
    long_abstracts_en.nt
    mappingbased_properties_en.nt
    page_ids_en.nt
    page_links_en.nt
    persondata_en.nt
    redirects_en.nt
    revision_ids_en.nt
    short_abstracts_en.nt
    skos_categories_en.nt
    specific_mappingbased_properties_en.nt
    wikipedia_links_en.nt
    freebase_links_en.nt
  • Wikidata (RDF dump):
    wikidata-simple-statements.nt.gz (23/02/2015)
  • YAGO3:
    yago3_tsv.7z
    Since the different YAGO3 dataset were not yet available in triple format, we transformed the available tsv files into the triple format using the turtle format.

Evaluation framework: Download








(c) 2015 Michael Färber, Institute AIFB, KIT



Written by Michael Färber , AIFB, 2015