CubeQA: Question Answering on Statistical Linked Data

As an increasing amount of statistical data is published as RDF, intuitive ways of satisfying information needs and getting new insights out of this type of data becomes increasingly important. Question answering systems provide intuitive access to data by translating natural language queries into SPARQL, which is the native query language of RDF knowledge bases. Existing approaches, however, perform poorly on statistical data because of the different structure. Based on a question corpus compiled in previous work, we created a benchmark for evaluating statistical questions answering systems and to stimulate further research. Building upon a previously established algorithm outline, we detail a Question Anwering algorithm for statistical Linked Data, which covers a wide range of question types, evaluate it using the benchmark and discuss future challenges in this field. To our knowledge, this is the first question answering approach for statistical RDF data and could open up a new research area.

Homepage Source Code Issues

Project Team



by (Editors: ) [BibTex of ]


Enable stopwords and use default stopword set. Improve comments. Add … ( 2020-11-11T10:59:16+00:00 by Dr. Konrad Höffner)

2020-11-11T10:59:16+00:00 by Dr. Konrad Höffner

Add missing URL import. ( 2020-11-02T09:07:30+00:00 by Dr. Konrad Höffner)

2020-11-02T09:07:30+00:00 by Dr. Konrad Höffner

Improve benchmark error logging. ( 2020-10-23T14:14:50+00:00 by Dr. Konrad Höffner)

2020-10-23T14:14:50+00:00 by Dr. Konrad Höffner

Update QALD6T3-Test version from 1.0 to 1.2. ( 2020-10-23T14:13:14+00:00 by Dr. Konrad Höffner)

2020-10-23T14:13:14+00:00 by Dr. Konrad Höffner

Update QALD links. ( 2020-10-23T14:12:50+00:00 by Dr. Konrad Höffner)

2020-10-23T14:12:50+00:00 by Dr. Konrad Höffner