Machine Learning and Ontology Engineering

The MOLE group focuses on combining Semantic Web and supervised Machine Learning technologies. The goal is to improve both quality and quantity of available knowledge by extracting, analysing, enriching and linking existing data. To make obtained results readily available for use in other applications, the group also provides several established open source tools, frameworks and demonstrators.

Research Areas

  • Creating knowledge bases from weakly structured data
  • Quality assurance and enhancement in ontologies
  • Semi-automatic instance matching
  • Supervised Machine Learning in OWL/RDF knowledge bases


  • ALIGNEDAligned, Quality-centric Software and Data Engineering
  • AskNowAskNow is a Question Answering (QA) system for RDF datasets.
  • AskNowAskNow is a Question Answering (QA) system for RDF datasets.
  • ASSESSAutomatic Self Assessment
  • AutoSPARQLConvert a natural language expression to a SPARQL query
  • BDEBig Data Europe
  • BIGBig Data Public Private Forum
  • CubeQAQuestion Answering on Statistical Linked Data
  • DBpediaQuerying Wikipedia like a Semantic Database
  • DBpediaDQUser-driven quality evaluation of DBpedia
  • DBpediaDQCrowdCrowdsourcing DBpedia Quality Assessment
  • DEERRDF Data Extraction and Enrichment Framework
  • DeFactoDeep Fact Validation
  • DEQADeep Web Extraction for Question Answering
  • DL-Learnera tool for supervised Machine Learning in OWL and Description Logics
  • FaceteJavaScript SPARQL-based Faceted Search Library and Browsing Widgets
  • FTSRDF Version of the Financial Transparency System of the European Commission
  • GEISERVon Sensordaten zu internetbasierten Geo-Services
  • GeoKnowMaking the Web an Exploratory for Geospatial Knowledge
  • GeoLiftSpatial mapping framework for enriching RDF datasets with Geo-spatial information
  • GHOPublishing and Interlinking the Global Health Observatory Dataset
  • GOLDGenerating Ontologies from Linked Data
  • HAWKHybrid Question Answering over Linked Data
  • HOBBITHolistic Benchmarking of Big Linked Data
  • JassaJAvascript Suite for Sparql Access
  • jena-sparql-apiA Java library featuring tools for transparently boosting SPARQL query execution.
  • LATCLOD Around-the-Clock
  • LIMESLInk discovery framework for MEtric Spaces
  • LinkedGeoDataadds a spatial dimension to the Web of Data
  • LinkedIdiomsA Multilingual Linked Idioms Data Set
  • LinkedSpendinggovernment spendings from all over the world as Linked Data
  • LOD2Creating Knowledge out of Interlinked Data
  • LODStatsa statement-stream-based approach for gathering comprehensive statistics about RDF datasets
  • MEX VocabularyA Light-Weight Interchange Format for Machine Learning Experiments
  • NIF4OGGDNatural Language Interchange Format for Open German Governmental Data
  • NLP2RDFConverting NLP tool output to RDF
  • OREA tool for the enrichment, repair and validation of OWL based knowledge bases.
  • projects:SemMap
  • QAMELQuestion Answering on Mobil Devices
  • RDFUnitan RDF Unit-Testing suite
  • ReDD-ObservatoryUsing the Web of Data for Evaluating the Research-Disease Disparity
  • REXWeb-Scale Extension of RDF Knowledge Bases
  • SAIM(Semi-)Automatic Instance Matcher
  • SAKEWith RDF and Machine Learning Getting Results Faster
  • SemanticQurana Multilingual Resource for Natural-Language Processing
  • SML-BenchA Benchmark for Symbolic Supervised Machine Learning from Expressive Structured Data
  • SPARQL2NLconverting SPARQL queries to natural language
  • SparqlAnalyticsI Know What You Did Last Query
  • Sparqlifya SPARQL-SQL rewriter
  • SparqlMapis a SPARQL-to-SQL rewriter
  • TripleCheckMateCrowdsourcing the evaluation of Linked Data
  • USPatentsPublishing and Interlinking the USPTO Patent Data
  • VeriLinksverifying links in an arbitrary linkset



by (Editors: ) [BibTex of ]


AKSW Colloquium, 17.10.2016, Version Control for RDF Triple Stores + NEED4Tweet ( 2016-10-17T09:55:50+02:00 by Marvin Frommhold)

2016-10-17T09:55:50+02:00 by Marvin Frommhold

In the upcoming Colloquium, October the 17th at 3 PM, two papers will be presented: Version Control for RDF Triple Stores Marvin Frommhold will discuss the paper “Version Control for RDF Triple Stores” by Steve Cassidy and James Ballantine which forms the foundation … Continue reading → Read more about "AKSW Colloquium, 17.10.2016, Version Control for RDF Triple Stores + NEED4Tweet"

LIMES 1.0.0 Released ( 2016-10-14T11:38:31+02:00 by Kleanthi Georgala)

2016-10-14T11:38:31+02:00 by Kleanthi Georgala

Dear all, the LIMES Dev team is happy to announce LIMES 1.0.0. LIMES, the Link Discovery Framework for Metric Spaces, is a link discovery framework for the Web of Data. Read more about "LIMES 1.0.0 Released"

DL-Learner 1.3 (Supervised Structured Machine Learning Framework) Released ( 2016-10-11T21:41:00+02:00 by Dr. Jens Lehmann)

2016-10-11T21:41:00+02:00 by Dr. Jens Lehmann

Dear all, the Smart Data Analytics group at AKSW is happy to announce DL-Learner 1.3. DL-Learner is a framework containing algorithms for supervised machine learning in RDF and OWL. Read more about "DL-Learner 1.3 (Supervised Structured Machine Learning Framework) Released"

OntoWiki 1.0.0 released ( 2016-10-05T16:50:05+02:00 by Natanael Arndt)

2016-10-05T16:50:05+02:00 by Natanael Arndt

Dear Semantic Web and Linked Data Community, we are proud to finally announce the releases of OntoWiki 1.0.0 and the underlying Erfurt Framework in version 1.8.0. Read more about "OntoWiki 1.0.0 released"

AKSW Colloquium, 05.09.2016. LOD Cloud Statistics, OpenAccess at Leipzig University. ( 2016-08-31T11:23:10+02:00 by Ivan Ermilov)

2016-08-31T11:23:10+02:00 by Ivan Ermilov

On the upcoming Monday (05.09.2016), AKSW group will discuss topics related to Semantic Web and LOD Cloud Statistics. Also, we will have invited speaker from University of Leipzig Library (UBL) Dr. Astrid Vieler talking about OpenAccess at Leipzig University. Read more about "AKSW Colloquium, 05.09.2016. LOD Cloud Statistics, OpenAccess at Leipzig University."