DEER: RDF Data Extraction and Enrichment Framework

Over the last years, the Linked Data principles have been used across academia and industry to publish and consume structured data. Thanks to the fourth Linked Data principle, many of the RDF datasets used within these applications contain implicit and explicit references to more data. For example, music datasets such as Jamendo include references to locations of record labels, places where artists were born or have been, etc. Datasets such as Drugbank contain references to drugs from DBpedia, were verbal description of the drugs and their usage is explicitly available. The goal of mapping component, dubbed DEER, is to retrieve this information, make it explicit and integrate it into data sources according to the specifications of the user. To this end, DEER relies on a simple yet powerful pipeline system that consists of two main components: enrichment functions and operators.

Download Issues

Enrichment functions and operators.

Enrichment functions implement functionality for processing the content of a dataset (e.g., applying named entity recognition to a particular property). Thus, they take a dataset as input and return a dataset as output. Enrichment operators work at a higher level of granularity and combine datasets. Thus, they take sets of datasets as input and return sets of datasets.

RDF specification paradigm

In the current version of DEER we introduce our new RDF based specification paradigm. The main idea behind this new paradigm is to enable the processing execution of specifications in an efficient way. To this end, we first decided to use RDF as language for the specification. This has the main advantage of allowing for creating specification repositories which can be queried easily with the aim of retrieving accurate specifications for the use cases at hand. Moreover, extensions of the specification language do not require a change of the specification language due to the intrinsic extensibility of ontologies. The third reason for choosing RDF as language for specifications is that we can easily check the specification for correctness by using a reasoner, as the specification ontology allows for specifying the restrictions that specifications must abide by.

Publications

by (Editors: ) [BibTex of ]

News

DBpedia Knowledge Engineering PhD Symposium ( 2022-05-02T16:59:37+02:00 by Julia Holze)

2022-05-02T16:59:37+02:00 by Julia Holze

Dear all,  We are excited to invite you to the 1st DBpedia Knowledge Engineering PhD Symposium, organized on July 6th, 2022 in Leipzig, Germany. Read more about "DBpedia Knowledge Engineering PhD Symposium"

Tutorial @ Knowledge Graph Conference 2022 ( 2022-04-25T12:24:06+02:00 by Julia Holze)

2022-04-25T12:24:06+02:00 by Julia Holze

On May 2, 2022 we will organize a tutorial 2.0 at the Knowledge Graph Conference (KGC) 2022. Read more about "Tutorial @ Knowledge Graph Conference 2022"

International Workshop on Data-driven Resilience Research 2022 ( 2022-04-21T14:43:27+02:00 by Julia Holze)

2022-04-21T14:43:27+02:00 by Julia Holze

In the face of continuously changing contextual conditions and ubiquitous disruptive crisis events, the concept of resilience refers to some of the most urgent, challenging, and interesting issues of nowadays society. Read more about "International Workshop on Data-driven Resilience Research 2022"

DBpedia @ Google Summer of Code Program 2022 ( 2022-03-23T14:26:48+01:00 by Julia Holze)

2022-03-23T14:26:48+01:00 by Julia Holze

DBpedia, one of InfAI’s community projects, will be part of the 11th Google Summer of Code (GSoC) program. The GSoC program has the goal to bring students from all over the globe into open source software development. Read more about "DBpedia @ Google Summer of Code Program 2022"

DBpedia Tutorial @ The Web Conference 2022 ( 2022-03-16T13:52:16+01:00 by Julia Holze)

2022-03-16T13:52:16+01:00 by Julia Holze

Dear all, We are proud to announce that we will organize an online tutorial at the Web Conference on 25th of April 2022. A particular focus will be put on the DBpedia Infrastructure, i.e. Read more about "DBpedia Tutorial @ The Web Conference 2022"