GHO: Publishing and Interlinking the Global Health Observatory Dataset

  • screenshot

The improvement of public health is one of the main indicators for societal progress. Statistical data for monitoring public health is highly relevant for a number of sectors, such as research (e.g. in the life sciences or economy), policy making, health care, pharmaceutical industry, insurances etc. Such data is meanwhile available even on a global scale, e.g. in the Global Health Observatory (GHO) of the United Nations’s World Health Organization (WHO). GHO comprises more than 50 different datasets, it covers all 198 WHO member countries and is updated as more recent or revised data becomes available or when there are changes to the methodology being used. However, this data is only accessible via complex spreadsheets and, therefore, queries over the 50 different datasets as well as combinations with other datasets are very tedious and require a significant amount of manual work. By making the data available as RDF, we lower the barrier for data re-use and integration.

Homepage Download

The converted GHO data is now available at http://gho.aksw.org/. After converting the entire GHO data, an RDF dataset containing almost 8 million triples was obtained. Following is the example of a single statistical item, the death value of 1098, from the GHO dataset represented using the Data Cube vocabulary:

gho:o1     a           qb:Observation;
       gho:Country     Afghanistan;
       gho:stat_pop    24076;
       gho:Disease     All Causes;
       gho:incidence   18437.

gho:Country    a    qb:DimensionProperty;
                    rdfs:label    "Country".

gho:Disease    a    qb:DimensionProperty;
                    rdfs:label    "Disease".

Further Information

  • This is a short presentation describing the process of conversion of the CSV files to RDF using SCOVO (Statistical Core Vocabulary) in OntoWiki. SCOVO is an earlier version of the Data Cube Vocabulary and the conversion process is similar for both.
  • This is a position paper that was accepted for a presentation at the Ontologies in Biomedicine and Life Sciences workshop held at Mannheim (Germany) from September 9 – 10, 2010.
  • The dataset description has been published here.
  • This dataset is also part of the LODD datasets.

Project Team

Former Members

Publications

by (Editors: ) [BibTex of ]

News

Jekyll RDF Tutorial Screencast ( 2018-08-07T11:11:12+02:00 by Natanael Arndt)

2018-08-07T11:11:12+02:00 by Natanael Arndt

Since 2016 we are developing Jekyll-RDF a plugin for the famous Jekyll–static website generator. Read more about "Jekyll RDF Tutorial Screencast"

DBpedia Day @ SEMANTiCS 2018 ( 2018-07-20T14:37:25+02:00 by Johannes Frey)

2018-07-20T14:37:25+02:00 by Johannes Frey

Don’t miss the 12th edition of the DBpedia Community Meeting in Vienna, the city with the highest quality of life in the world. Read more about "DBpedia Day @ SEMANTiCS 2018"

SANSA 0.4 (Semantic Analytics Stack) Released ( 2018-06-26T18:33:38+02:00 by Prof. Dr. Jens Lehmann)

2018-06-26T18:33:38+02:00 by Prof. Dr. Jens Lehmann

We are happy to announce SANSA 0.4 – the fourth release of the Scalable Semantic Analytics Stack. Read more about "SANSA 0.4 (Semantic Analytics Stack) Released"

AKSW is organizing the 6th Leipzig Semantic Web Day (LSWT2018) ( 2018-04-17T14:14:17+02:00 by Natanael Arndt)

2018-04-17T14:14:17+02:00 by Natanael Arndt

On June 18th 2018 we will have the 6th Leipzig Semantic Web Day (LSWT2018). A platform for regional actors to get in touch with each other regarding Semantic Web topics. Read more about "AKSW is organizing the 6th Leipzig Semantic Web Day (LSWT2018)"

SANSA 0.3 (Semantic Analytics Stack) Released ( 2017-12-18T11:15:38+01:00 by Simon Bin)

2017-12-18T11:15:38+01:00 by Simon Bin

Dear all, We are happy to announce SANSA 0.3 – the third release of the Scalable Semantic Analytics Stack. Read more about "SANSA 0.3 (Semantic Analytics Stack) Released"