Semantic Abstraction
SIMBA's focus in supporting the transition of non-semantic applications to knowledge-driven applications. Hence we support all major steps from legacy data to rich semantic applications. This includes but is not limited to knowledge storage (triple stores, federated queries), knowledge extraction (RDF extraction from text, structured data, etc.), knowledge integration (link discovery, data fusion), knowledge access (keyword-based search, question answering and rich interfaces) and knowledge consumption within semantic applications . For this purpose, SIMBA develops novel and scalable approaches for data ranging from small to Big Data. In addition, SIMBA provides tools and frameworks that implement these approaches and allow for their swift integration into industry projects.
Research Areas
- Knowledge Access, e.g., keyword-based search, question answering, and interfaces
- Knowledge Extraction, e.g., extraction of RDF and OWL from unstructured data
- Knowledge Integration, e.g., link discovery and linked data fusion
- Knowledge Storage, e.g., federated queries, triple stores
- Knowledge-Driven applications, e.g., industry 4.0, big data, benchmarking
Members
- Alexander Bigerl
- Simon Bordewisch
- Jonathan Eberle
- Jonathan Huthmann
- Paul Spooren
- Adnan Akhter
- Simon Bin
- Lixi Conrads
- Kevin Dressler
- Ivan Ermilov
- Dr.-Ing. Timofey Ermilov
- Diego Esteves
- Kleanthi Georgala
- Michael Hoffmann
- Klaus Lyko
- Edgard Marx
- Diego Moussallem
- Prof. Dr. Axel-C. Ngonga Ngomo
- Daniel Obraczka
- Prof. Dr. Sandro Rautenberg
- Michael Röder
- Dr. Muhammad Saleem
- Dr. Mohamed Ahmed Sherif
- Tommaso Soru
- René Speck
- Dr. Ricardo Usbeck
- Dr. André Valdestilhas
- Dr. Matthias Wauer
Projects
- AGDISTIS – Agnostic Disambiguation of Named Entities Using Linked Open Data
- AgriNepalData – Ontology Based Data Access and Integration for Improving the Effectiveness of Farming in Nepal
- ALOE – Assisted Linked Data Consumption Engine
- ART-e-FACT – Media continuity artefact management
- ASSESS – Automatic Self Assessment
- AutoSPARQL – Convert a natural language expression to a SPARQL query
- BDE – Big Data Europe
- BIG – Big Data Public Private Forum
- BioASQ – a challenge on large-scale biomedical semantic indexing and question answering
- BOA – BOotstrapping linked datA
- BorderFlow – a general-purpose graph clustering tool
- conTEXT – Lightweight Text Analytics using Linked Data
- CSVImport – Representing multi-dimensional statistical data as RDF using the RDF Data Cube Vocabulary
- CubeViz – The RDF DataCube Browser.
- DBtrends – Evaluating Ranking functions on RDF data sets
- DEER – RDF Data Extraction and Enrichment Framework
- DeFacto – Deep Fact Validation
- DEQA – Deep Web Extraction for Question Answering
- DIESEL – Distributed Search in Large Enterprise Data
- DL-Learner – a tool for supervised Machine Learning in OWL and Description Logics
- FEASIBLE – A Featured-Based SPARQL Benchmarks Generation Framework.
- FOX – Federated knOwledge eXtraction Framework
- GEISER – Von Sensordaten zu internetbasierten Geo-Services
- GeoKnow – Making the Web an Exploratory for Geospatial Knowledge
- GeoLift – Spatial mapping framework for enriching RDF datasets with Geo-spatial information
- GERBIL – General Entity Annotation Benchmark Framework
- HAWK – Hybrid Question Answering over Linked Data
- HOBBIT – Holistic Benchmarking of Big Linked Data
- IGUANA – Intelligent Suite for Benchmarking SPARQL with Updates
- KBox – Distributing Ready-to-Query RDF Knowledge Graphs
- LDWPO – the Linked Data Workflow Project ontology
- LIMES – LInk discovery framework for MEtric Spaces
- LinkedGeoData – adds a spatial dimension to the Web of Data
- LinkedIdioms – A Multilingual Linked Idioms Data Set
- LinkingLOD – interlinking knowledge bases
- LION – Induction of Link Specifications using Refinement Operators
- LODStats – a statement-stream-based approach for gathering comprehensive statistics about RDF datasets
- LSQ – Linked SPARQL Queries Dataset
- MEX Vocabulary – A Light-Weight Interchange Format for Machine Learning Experiments
- N3 - Collection – N3 - A Collection of Datasets for Named Entity Recognition and Disambiguation in the NLP Interchange Format
- Neural SPARQL Machines – Translating natural language into machine language for data access.
- NIF4OGGD – Natural Language Interchange Format for Open German Governmental Data
- OntoWiki – a tool providing support for agile, distributed knowledge engineering scenarios
- OntoWiki Mobile – Knowledge Management in your Pocket
- openQA – Open Question Answering Framework
- Palmetto – Palmetto is a quality measuring tool for topics
- PCP on Web – Professorial Career Patterns of the Early Modern History
- QAMEL – Question Answering on Mobil Devices
- QROWD – The power of the Qrowd combines with RDF
- QualisBrasil – Linked Open Data for supporting scientometric studies
- QUETSAL – A Query Federation Suite for SPARQL
- RDFSlice – Large-scale RDF Dataset Slicing
- Relation Annotation in GENIA
- REX – Web-Scale Extension of RDF Knowledge Bases
- Rocker – A Refinement Operator for Key Discovery
- SAGE – Semantic Geospatial Analytics
- SAIM – (Semi-)Automatic Instance Matcher
- SAKE – With RDF and Machine Learning Getting Results Faster
- SANSA-Stack – Open source platform for distributed data processing for RDF large-scale datasets
- SCARO – Scalable RDF compression using rule subsumption hierachies
- SCMS – Semantic Content Management Systems
- SemanticQuran – a Multilingual Resource for Natural-Language Processing
- SLIPO – Scalable Linking and Integration of Big POI data
- SMART – A Semantic Search Engine
- SML-Bench – A Benchmark for Symbolic Supervised Machine Learning from Expressive Structured Data
- SPARQL2NL – converting SPARQL queries to natural language
- Tapioca – Tapioca is a search engine for topically similar RDF datasets.