LIMES: LInk discovery framework for MEtric Spaces

LIMES is a link discovery framework for the Web of Data. It implements time-efficient approaches for large-scale link discovery based on the characteristics of metric spaces. It is easily configurable via a web interface. It can also be downloaded as a standalone tool for carrying out link discovery locally.

General Overview

LIMES implements novel time-efficient approaches for link discovery in metric spaces. Our approaches use different approximation techniques to compute estimates of the similarity between instances. These estimates are then used to filter out a large number of the instance pairs that cannot satisfy the mapping conditions. By these means, LIMES can reduce the number of comparisons needed during the mapping process by several orders of magnitude. The approaches implemented in LIMES include the original LIMES algorithm for edit distances, REEDED for weighted edit distances, HR3, HYPPO, and ORCHID. Moreover, LIMES implements supervised and unsupervised machine-learning algorithms for finding accurate link specifications. The algorithms implemented here include the supervised, active and unsupervised versions of EAGLE, COALA and EUCLID.
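One way to see how an estimate in a metric space can prune comparisons is the triangle inequality: for any reference point e, |d(s,e) - d(e,t)| <= d(s,t), so a pair whose lower bound already exceeds the distance threshold cannot be a match. The Python sketch below illustrates only this general idea. The function names, the single reference string (`exemplar`), and the use of plain Levenshtein distance are assumptions made for illustration and are not taken from the LIMES code base.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (a metric on strings), computed row by row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def link(sources, targets, max_distance, exemplar):
    """Return (matches, number of full source-target comparisons).

    `exemplar` is a hypothetical reference string; distances to it are
    computed once per instance and reused for every pair.
    """
    d_src = {s: levenshtein(s, exemplar) for s in sources}
    d_tgt = {t: levenshtein(t, exemplar) for t in targets}
    matches, comparisons = [], 0
    for s in sources:
        for t in targets:
            # Triangle inequality: |d(s,e) - d(e,t)| <= d(s,t).
            if abs(d_src[s] - d_tgt[t]) > max_distance:
                continue  # lower bound already too large; skip the pair
            comparisons += 1
            if levenshtein(s, t) <= max_distance:
                matches.append((s, t))
    return matches, comparisons
```

Because the bound never overestimates the true distance, the filter discards only pairs that could not have matched; how many pairs it discards depends on the choice of reference point.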

Architecture

The LIMES framework consists of seven main modules, each of which can be extended to accommodate new or improved functionality. The central modules of LIMES are the controller module, which coordinates the matching process, and the data module, which contains all the classes necessary to store data. The matching process is carried out as follows: First, the controller calls the I/O module, which reads the configuration file and extracts all the information necessary to carry out the comparison of instances, including the URLs of the SPARQL endpoints of the knowledge bases, the restrictions on the instances to map (e.g., their type), the expression of the metric to be used, and the threshold to be used. Examples of configuration files can be found in the distribution.
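The information extracted from the configuration file can be pictured as a small record. The Python sketch below is not the LIMES configuration format or API: the class names, field names, example endpoint URLs, restriction strings, and the assumption that the threshold lies in [0, 1] are all hypothetical and serve only to illustrate the four pieces of information listed above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class KnowledgeBase:
    endpoint: str               # URL of the SPARQL endpoint
    restrictions: tuple = ()    # restrictions on the instances to map


@dataclass(frozen=True)
class LinkConfig:
    source: KnowledgeBase
    target: KnowledgeBase
    metric: str                 # metric expression, kept as an opaque string
    threshold: float            # assumed here to lie in [0, 1]

    def problems(self):
        """Return a list of human-readable problems; empty means usable."""
        found = []
        for role, kb in (("source", self.source), ("target", self.target)):
            if not kb.endpoint.startswith(("http://", "https://")):
                found.append(f"{role} endpoint is not an HTTP(S) URL")
        if not self.metric.strip():
            found.append("metric expression is empty")
        if not 0.0 <= self.threshold <= 1.0:
            found.append("threshold is outside [0, 1]")
        return found


# Hypothetical values; the metric string is a placeholder, not real syntax.
config = LinkConfig(
    source=KnowledgeBase("http://example.org/source/sparql", ("?x rdf:type ex:City",)),
    target=KnowledgeBase("http://example.org/target/sparql", ("?y rdf:type ex:City",)),
    metric="<metric expression>",
    threshold=0.9,
)
```

Collecting all problems in a list, rather than raising on the first one, lets a caller report every issue in a configuration at once.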

Given that the configuration file is valid w.r.t. the LIMES Specification Language (LSL), the query module is called. This module uses the configuration for the source and target knowledge bases to retrieve, from their SPARQL endpoints, the instances and properties that adhere to the restrictions specified in the configuration file. The query module writes its output into a cache, which can be a file (intended for large numbers of instances; not implemented yet) or main memory. Once all instances have been stored in the cache, the controller calls the LIMES engine, which runs through the specification and computes the results. The results are finally returned as RDF or TSV files.
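The flow described above (query, cache, engine, output) can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions, not the LIMES implementation: `fill_cache`, `run_engine`, `to_tsv`, and the `fetch_instances` callable are hypothetical names, the cache is a plain in-memory dictionary, the engine compares every pair on a single property without any filtering, and the column layout of the TSV output is assumed.

```python
import csv
import io


def fill_cache(fetch_instances, restriction):
    """Store every (uri, properties) pair from a hypothetical fetch callable.

    In a real setting the callable would query a SPARQL endpoint; here it
    can be any function that returns an iterable of (uri, dict) pairs.
    """
    return {uri: props for uri, props in fetch_instances(restriction)}


def run_engine(source_cache, target_cache, similarity, threshold, prop):
    """Yield (source, target, score) for pairs at or above the threshold."""
    for s_uri, s_props in source_cache.items():
        for t_uri, t_props in target_cache.items():
            score = similarity(s_props[prop], t_props[prop])
            if score >= threshold:
                yield s_uri, t_uri, score


def to_tsv(links):
    """Serialize links as tab-separated lines: source, target, score."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter="\t", lineterminator="\n")
    for row in links:
        writer.writerow(row)
    return buffer.getvalue()


def same_label(a, b):
    """Toy similarity: 1.0 for case-insensitive equality, else 0.0."""
    return 1.0 if a.lower() == b.lower() else 0.0
```

The query step is passed in as a callable so that the sketch runs without network access; any function returning `(uri, properties)` pairs can stand in for it.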

Evaluation Results

The algorithms implemented in LIMES were published in several papers. Below are links to evaluation results.

Running LIMES

LIMES can be run in one of three ways.

News

AKSW at #ISWC2014. Come and join, talk and discuss with us! (2014-10-16T14:00:30+02:00 by Ricardo Usbeck)

Hello AKSW Follower! We are very pleased to announce that nine of our papers were accepted for presentation at ISWC 2014.

LIMES Version 0.6 RC4 (2014-10-07T00:49:53+02:00 by Dr. Axel-C. Ngonga Ngomo)

It has been a while, but that moment has arrived again. We are happy to announce a new release of the LIMES framework. This version implements novel geo-spatial measures (e.g., geographic mean) as well as string similarity measures (Jaro, Jaro-Winkler, etc.). Moreover, we fixed some minor bugs (thanks for the bug reports).

Additional contributions to SEMANTiCS 2014 (2014-08-05T10:31:12+02:00 by Ricardo Usbeck)

Hello again! Unfortunately, we missed the opportunity to inform you about other contributions of AKSW to SEMANTiCS 2014. First, we did not tell you about another accepted paper: Towards Question Answering on Statistical Linked Data (Konrad Höffner and Jens Lehmann). Second, there is also another excellent and interesting series of workshops.

Five AKSW Papers at SEMANTiCS 2014 (2014-08-01T16:54:00+02:00 by Ricardo Usbeck)

Hello Community! We are very pleased to announce that five of our papers were accepted for presentation at SEMANTiCS 2014. The papers cover architectures for Big Data Search Engines, Linked Data Visualisations, Machine Learning and Dataset Descriptions.

Five AKSW Papers at ESWC 2014 (2014-03-14T01:37:52+01:00 by Dr. Axel-C. Ngonga Ngomo)

Hello World! We are very pleased to announce that five of our papers were accepted for presentation at ESWC 2014. These papers range from natural-language processing to the acquisition of temporal data.