Properties of DBpediaDQ

  1. Properties
  2. History
  3. Source
hookline
  • User-driven quality evaluation of DBpedia
http://aksw.org/schema/publicationTag
  • dbpediadq
related project
content
  • As we all know, DBpedia is an important dataset in Linked Data as it is not only connected to and from numerous other datasets, but it also is relied upon for useful information. However, quality problems are inherent in DBpedia be it in terms of incorrectly extracted values or datatype problems since it contains information extracted from crowd-sourced content.

    However, not all the data quality problems are automatically detectable. Thus, we aim at crowd-sourcing the quality assessment of the dataset. In order to perform this assessment, we developed a tool whereby a user can evaluate a random resource by analyzing each triple individually and store the results. Here is the link to the tool: http://nl.dbpedia.org:8080/TripleCheckMate/.

    If you have any questions or comments, please do not hesitate to contact us at dbpedia-data-quality@googlegroups.com.

    Results

    • Results : http://goo.gl/lIKK7
    • Total no. of users : 58
    • Total no. of distinct resources evaluated : 521
    • Total no. of resources evaluated : 792
    • Total no. of distinct resources without problems : 86
    • Total no. of distinct resources with problems : 435
    • Total no. of distinct incorrect triples : 2928
    • Total no. of distinct incorrect triples in the dbprop namespace : 1745
    • Total no. of inter-evaluations : 268
    • No. of resources with evaluators having different opinions : 89
    • Resource-based inter-rater agreement (Cohen’s Kappa) : 0.34
    • Triple-based inter-rater agreement (Cohen’s Kappa) : 0.38
    • No. of triples evaluated for correctness : 700
    • No. of triples evaluated to be correct : 567
    • No. of triples evaluated incorrectly : 133
    • % of triples correctly evaluated : 81
    • Average no. of problems per resource : 5.69
    • Average no. of problems per resource in the dbprop namespace : 3.45
    • Average no. of triples per resource : 47.19
    • % of triples affected : 11.93
    • % of triples affected in the dbprop namespace : 7.11

    Manuscript

abstract
  • With the myriad of data sets and use cases available in the LOD cloud, data quality is one of the important concepts to be considered. The DBpedia Data Quality Curation project is aimed at evaluating the quality of the resources present in DBpedia.
feed
maintainer
type
label
  • DBpediaDQ
depiction
homepage

Feeds

DBpedia Day @ SEMANTiCS 2022

Aug 8, 2022 11:24:02 AM | Julia Holze

We are happy to announce that we are partnering again with the SEMANTiCS Conference which will host this year’s DBpedia Day on September 13, 2022. The SEMANTiCS is an established knowledge hub which brings together technology professionals, industry experts, and … Continue reading → ...

DBpedia Knowledge Engineering PhD Symposium

May 2, 2022 4:59:37 PM | Julia Holze

Dear all,  We are excited to invite you to the 1st DBpedia Knowledge Engineering PhD Symposium, organized on July 6th, 2022 in Leipzig, Germany. The DBpedia Knowledge Engineering PhD Symposium will be held on the third day of the week-long … Continue reading → ...

Tutorial @ Knowledge Graph Conference 2022

Apr 25, 2022 12:24:06 PM | Julia Holze

On May 2, 2022 we will organize a tutorial 2.0 at the Knowledge Graph Conference (KGC) 2022. The tutorial targets existing and potential new users of DBpedia, developers that wish to learn how to replicate DBpedia infrastructure, service providers interested … Continue reading → ...

DBpedia @ Google Summer of Code Program 2022

Mar 23, 2022 2:26:48 PM | Julia Holze

DBpedia, one of InfAI’s community projects, will be part of the 11th Google Summer of Code (GSoC) program. The GSoC program has the goal to bring students from all over the globe into open source software development. In this regard … Continue reading → ...

DBpedia Tutorial @ The Web Conference 2022

Mar 16, 2022 1:52:16 PM | Julia Holze

Dear all, We are proud to announce that we will organize an online tutorial at the Web Conference on 25th of April 2022. A particular focus will be put on the DBpedia Infrastructure, i.e. DBpedia’s Databus publishing platform and the … Continue reading → ...

OntoWiki

Knowledge Bases

Login

  1. Local
  2. OpenID