LIMES: LInk discovery framework for MEtric Spaces

  • screenshot

LIMES is a link discovery framework for the Web of Data. It implements time-efficient approaches for large-scale link discovery based on the characteristics of metric spaces. It is easily configurable via a web interface. It can also be downloaded as standalone tool for carrying out link discovery locally.

Demo Issues Download

General Overview

LIMES implements novel time-efficient approaches for link discovery in metric spaces. Our approaches different approximation techniques to compute estimates of the similarity between instances. These estimates are then used to filter out a large amount of those instance pairs that do not suffice the mapping conditions. By these means, LIMES can reduce the number of comparisons needed during the mapping process by several orders of magnitude. The approaches implemented in LIMES include the original LIMES algorithm for edit distances, REEDED for weighted edit distances, HR3, HYPPO, and ORCHID. Moreover, LIMES implements supervised and unsupervised machine-learning algorithms for finding accurate link specifications. The algorithms implemented here include the supervised, active and unsupervised versions of EAGLE, COALA and EUCLID.

Architecture

The LIMES framework consists of seven main modules of which each can be extended to accommodate new or improved functionality. The central modules of LIMES are the controller module, which coordinates the matching process and the data module, which contains all the classes necessary to store data. The matching process is carried out as follows: First, the controller calls the I/O-module, which reads the configuration file and extracts all the information necessary to carry out the comparison of instances, including the URL of the SPARQL-endpoints of the knowledge bases, the restrictions on the instances to map (e.g., their type), the expression of the metric to be used and the threshold to be used. Examples of configuration files can be found in the distribution.

Given that the configuration file is valid w.r.t. the LIMES Specification Language (LSL), the query module is called. This module uses the configuration for the target and source knowledge bases to retrieve instances and properties from the SPARQL-endpoints of the source and target knowledge bases that adhere to the restrictions specified in the configuration file. The query module writes its output into a cache, which can be a file (for large number of instances, not implemented yet) or main memory. Once all instances have been stored in the cache, the controller calls the LIMES engine which runs through the specification and computes the results. The results are finally returned as RDF or TSV files.

Evaluation Results

The algorithms implemented in LIMES were published in several papers. Below are links to evaluation results.

Running LIMES

Running LIMES can be carried in one of three ways.

Current Team

Publications

by (Editors: ) [BibTex of ]

News

Five AKSW Papers at ESWC 2014 ( 2014-03-14T01:37:52+01:00 by Dr. Axel-C. Ngonga Ngomo)

2014-03-14T01:37:52+01:00 by Dr. Axel-C. Ngonga Ngomo

Hello World! We are very pleased to announce that five of our papers were accepted for presentation at ESWC 2014. These papers range from natural-language processing to the acquisition of temporal data. Read more about "Five AKSW Papers at ESWC 2014"

Two AKSW Papers at ESWC ( 2013-02-25T17:28:02+01:00 by Dr. Axel-C. Ngonga Ngomo)

2013-02-25T17:28:02+01:00 by Dr. Axel-C. Ngonga Ngomo

Greetings World. We are happy to announce that the AKSW papers “COALA – Correlation-Aware Active Learning of Link Specifications” and “When to Reach for the Cloud: Using Parallel Hardware for Link Discovery” were selected for presentation at ESWC 2013. Each of the papers deals with one of the two main hurdles to scalable Link Discovery. Read more about "Two AKSW Papers at ESWC"

More than 20 European Union Datasets Converted to RDF by LATC Project ( 2012-07-09T15:13:52+02:00 by Dr. Jens Lehmann)

2012-07-09T15:13:52+02:00 by Dr. Jens Lehmann

Over the past two years, the LATC project (Linked Open Data Around-The-Clock) has worked on converting more than 20 EU datasets to RDF, make them available as Linked Data and SPARQL, and link them to other datasets. The datasets have gone through internal quality assurance against a publication checklist. Read more about "More than 20 European Union Datasets Converted to RDF by LATC Project"

LOD2 Webinar Series: LIMES – Discovery of Links across Knowledge Bases ( 2012-03-26T15:03:48+02:00 Nadine Jaenicke)

2012-03-26T15:03:48+02:00 Nadine Jaenicke

The 1st version of the LOD2 Stack has been published in September 2011 in the form of an LOD2 Stack demo and the downloadable LOD2 Stack virtual machine image – additional details and the instructions on installing the LOD2 Stack from scratch are available in the How-To-Start document. Read more about "LOD2 Webinar Series: LIMES – Discovery of Links across Knowledge Bases"

AKSW tools prominently featured in TÜBİTAK ( 2011-12-01T21:57:26+01:00 by Prof. Dr. Sören Auer)

2011-12-01T21:57:26+01:00 by Prof. Dr. Sören Auer

Börtecin Ege wrote an article on the Semantic Web in the December issue of TÜBİTAK (The Scientific and Technological Research Council of Turkey) prominently featuring DBpedia, Relfinder, LIMES, SPARQL Benchmark and other AKSW related research projects. See Semantik Web Tübitak Bilim Teknik 12 2011. Read more about "AKSW tools prominently featured in TÜBİTAK"