HAWK: Hybrid Question Answering over Linked Data

HAWK is going to drive forth the OKBQA vision of hybrid question answering using Linked Data and full-text information. Performance benchmarks are done on the QALD-4 task 3 hybrid.

Source Code Demo Issues

Introduction

Recent advances in question answering (QA) over Linked Data provide end users with more and more sophisticated tools for querying linked data by expressing their information need in natural language. This allows access to the wealth of structured data available on the Semantic Web also to non-experts. However, a lot of information is still available only in textual form, both on the Document Web and in the form of labels and abstracts in Linked Data sources. Therefore, a considerable number of questions can only be answered by using hybrid question answering approaches, which can find and combine information stored in both structured and textual data sources.

Architecture

The HAWK Architecture

We present HAWK, the (to best of our knowledge) first full-fledged hybrid QA framework for entity search over Linked Data and textual data.

Given an input query, HAWK implements an 8-step pipeline, which comprises 1) part-of-speech tagging, 2) detecting entities in the query, 3) dependency parsing and 4) applying linguistic pruning heuristics for an in-depth analysis of the natural language input. The results of these first four steps is a predicate-argument graph annotated with resources from the Linked Data Web. HAWK then 5) assign semantic meaning to nodes and 6) generates basic triple patterns for each component of the input query with respect to a multitude of features. This deductive linking of triples results in a set of SPARQL queries containing text operators as well as triple patterns. In order to reduce operational costs, 7) HAWK discards queries using several rules, e.g., by discarding not connected query graphs. Finally, 8) queries are ranked using extensible feature vectors and cosine similarity.

Supplementary material concerning the evaluation and implementation of HAWK can be found here

Project Team

Former Members

Publications

by (Editors: ) [BibTex of ]

News

DBpedia Day @ SEMANTiCS 2022 ( 2022-08-08T11:24:02+02:00 by Julia Holze)

2022-08-08T11:24:02+02:00 by Julia Holze

We are happy to announce that we are partnering again with the SEMANTiCS Conference which will host this year’s DBpedia Day on September 13, 2022. Read more about "DBpedia Day @ SEMANTiCS 2022"

DBpedia Knowledge Engineering PhD Symposium ( 2022-05-02T16:59:37+02:00 by Julia Holze)

2022-05-02T16:59:37+02:00 by Julia Holze

Dear all,  We are excited to invite you to the 1st DBpedia Knowledge Engineering PhD Symposium, organized on July 6th, 2022 in Leipzig, Germany. Read more about "DBpedia Knowledge Engineering PhD Symposium"

Tutorial @ Knowledge Graph Conference 2022 ( 2022-04-25T12:24:06+02:00 by Julia Holze)

2022-04-25T12:24:06+02:00 by Julia Holze

On May 2, 2022 we will organize a tutorial 2.0 at the Knowledge Graph Conference (KGC) 2022. Read more about "Tutorial @ Knowledge Graph Conference 2022"

International Workshop on Data-driven Resilience Research 2022 ( 2022-04-21T14:43:27+02:00 by Julia Holze)

2022-04-21T14:43:27+02:00 by Julia Holze

In the face of continuously changing contextual conditions and ubiquitous disruptive crisis events, the concept of resilience refers to some of the most urgent, challenging, and interesting issues of nowadays society. Read more about "International Workshop on Data-driven Resilience Research 2022"

DBpedia @ Google Summer of Code Program 2022 ( 2022-03-23T14:26:48+01:00 by Julia Holze)

2022-03-23T14:26:48+01:00 by Julia Holze

DBpedia, one of InfAI’s community projects, will be part of the 11th Google Summer of Code (GSoC) program. The GSoC program has the goal to bring students from all over the globe into open source software development. Read more about "DBpedia @ Google Summer of Code Program 2022"