DBpediaDQ: User-driven quality evaluation of DBpedia

  • screenshot

With the myriad of data sets and use cases available in the LOD cloud, data quality is one of the important concepts to be considered. The DBpedia Data Quality Curation project is aimed at evaluating the quality of the resources present in DBpedia.

Homepage

As we all know, DBpedia is an important dataset in Linked Data as it is not only connected to and from numerous other datasets, but it also is relied upon for useful information. However, quality problems are inherent in DBpedia be it in terms of incorrectly extracted values or datatype problems since it contains information extracted from crowd-sourced content.

However, not all the data quality problems are automatically detectable. Thus, we aim at crowd-sourcing the quality assessment of the dataset. In order to perform this assessment, we developed a tool whereby a user can evaluate a random resource by analyzing each triple individually and store the results. Here is the link to the tool: http://nl.dbpedia.org:8080/TripleCheckMate/.

If you have any questions or comments, please do not hesitate to contact us at dbpedia-data-quality@googlegroups.com.

Results

  • Results : http://goo.gl/lIKK7
  • Total no. of users : 58
  • Total no. of distinct resources evaluated : 521
  • Total no. of resources evaluated : 792
  • Total no. of distinct resources without problems : 86
  • Total no. of distinct resources with problems : 435
  • Total no. of distinct incorrect triples : 2928
  • Total no. of distinct incorrect triples in the dbprop namespace : 1745
  • Total no. of inter-evaluations : 268
  • No. of resources with evaluators having different opinions : 89
  • Resource-based inter-rater agreement (Cohen’s Kappa) : 0.34
  • Triple-based inter-rater agreement (Cohen’s Kappa) : 0.38
  • No. of triples evaluated for correctness : 700
  • No. of triples evaluated to be correct : 567
  • No. of triples evaluated incorrectly : 133
  • % of triples correctly evaluated : 81
  • Average no. of problems per resource : 5.69
  • Average no. of problems per resource in the dbprop namespace : 3.45
  • Average no. of triples per resource : 47.19
  • % of triples affected : 11.93
  • % of triples affected in the dbprop namespace : 7.11

Manuscript

Publications

by (Editors: ) [BibTex of ]

News

LDK conference @ University of Leipzig ( 2019-03-22T09:21:41+01:00 by Julia Holze)

2019-03-22T09:21:41+01:00 by Julia Holze

With the advent of digital technologies, an ever-increasing amount of language data is now available across various application areas and industry sectors, thus making language data more and more valuable. Read more about "LDK conference @ University of Leipzig"

13th DBpedia community meeting in Leipzig ( 2019-02-22T12:22:07+01:00 by Julia Holze)

2019-02-22T12:22:07+01:00 by Julia Holze

We are happy to invite you to join the 13th edition of the DBpedia Community Meeting, which will be held in Leipzig. Read more about "13th DBpedia community meeting in Leipzig"

DBpedia Day @ SEMANTiCS 2018 ( 2018-07-20T14:37:25+02:00 by Johannes Frey)

2018-07-20T14:37:25+02:00 by Johannes Frey

Don’t miss the 12th edition of the DBpedia Community Meeting in Vienna, the city with the highest quality of life in the world. Read more about "DBpedia Day @ SEMANTiCS 2018"

DBpedia Open Text Extraction Challenge – TextExt ( 2017-03-09T12:15:57+01:00 Christopher Schulz)

2017-03-09T12:15:57+01:00 Christopher Schulz

DBpedia, a community project affiliated with the Institute for Applied Informatics (InfAI) e.V., extract structured information from Wikipedia & Wikidata. Now DBpedia started the DBpedia Open Text Extraction Challenge – TextExt. Read more about "DBpedia Open Text Extraction Challenge – TextExt"

Accepted Papers of AKSW Members @ Semantics 2016 ( 2016-06-27T12:50:01+02:00 by Sandra Bartsch)

2016-06-27T12:50:01+02:00 by Sandra Bartsch

This year’s SEMANTiCS conference which is taking place between September 12 – 15, 2016 in Leipzig recently invited for the submission of research papers on semantic technologies. Read more about "Accepted Papers of AKSW Members @ Semantics 2016"