Semantic Abstraction

SIMBA's focus in supporting the transition of non-semantic applications to knowledge-driven applications. Hence we support all major steps from legacy data to rich semantic applications. This includes but is not limited to knowledge storage (triple stores, federated queries), knowledge extraction (RDF extraction from text, structured data, etc.), knowledge integration (link discovery, data fusion), knowledge access (keyword-based search, question answering and rich interfaces) and knowledge consumption within semantic applications . For this purpose, SIMBA develops novel and scalable approaches for data ranging from small to Big Data. In addition, SIMBA provides tools and frameworks that implement these approaches and allow for their swift integration into industry projects.

Research Areas

  • Knowledge Access, e.g., keyword-based search, question answering, and interfaces
  • Knowledge Extraction, e.g., extraction of RDF and OWL from unstructured data
  • Knowledge Integration, e.g., link discovery and linked data fusion
  • Knowledge Storage, e.g., federated queries, triple stores
  • Knowledge-Driven applications, e.g., industry 4.0, big data, benchmarking

Projects

  • AGDISTISAgnostic Disambiguation of Named Entities Using Linked Open Data
  • ALIGNEDAligned, Quality-centric Software and Data Engineering
  • ALOEAssisted Linked Data Consumption Engine
  • ART-e-FACTMedia continuity artefact management
  • ASSESSAutomatic Self Assessment
  • AutoSPARQLConvert a natural language expression to a SPARQL query
  • BDEBig Data Europe
  • BIGBig Data Public Private Forum
  • BioASQa challenge on large-scale biomedical semantic indexing and question answering
  • BOABOotstrapping linked datA
  • BorderFlowa general-purpose graph clustering tool
  • conTEXTLightweight Text Analytics using Linked Data
  • CSVImportRepresenting multi-dimensional statistical data as RDF using the RDF Data Cube Vocabulary
  • CubeVizThe RDF DataCube Browser.
  • DEERRDF Data Extraction and Enrichment Framework
  • DeFactoDeep Fact Validation
  • DEQADeep Web Extraction for Question Answering
  • DIESELDistributed Search in Large Enterprise Data
  • FEASIBLEA Featured-Based SPARQL Benchmarks Generation Framework.
  • FOXFederated knOwledge eXtraction Framework
  • GEISERVon Sensordaten zu internetbasierten Geo-Services
  • GeoKnowMaking the Web an Exploratory for Geospatial Knowledge
  • GeoLiftSpatial mapping framework for enriching RDF datasets with Geo-spatial information
  • GERBILGeneral Entity Annotation Benchmark Framework
  • HAWKHybrid Question Answering over Linked Data
  • HOBBITHolistic Benchmarking of Big Linked Data
  • IGUANAIntelligent Suite for Benchmarking SPARQL with Updates
  • LDWPOthe Linked Data Workflow Project ontology
  • LIMESLInk discovery framework for MEtric Spaces
  • LinkedGeoDataadds a spatial dimension to the Web of Data
  • LinkedIdiomsA Multilingual Linked Idioms Data Set
  • LinkingLODinterlinking knowledge bases
  • LIONInduction of Link Specifications using Refinement Operators
  • LODStatsa statement-stream-based approach for gathering comprehensive statistics about RDF datasets
  • LSQLinked SPARQL Queries Dataset
  • MEX VocabularyA Light-Weight Interchange Format for Machine Learning Experiments
  • N3 - CollectionN3 - A Collection of Datasets for Named Entity Recognition and Disambiguation in the NLP Interchange Format
  • NIF4OGGDNatural Language Interchange Format for Open German Governmental Data
  • NLP Interchange Format (NIF)an RDF/OWL-based format that allows to combine and chain several NLP tools in a flexible, light-weight way
  • OntoWikia tool providing support for agile, distributed knowledge engineering scenarios
  • OntoWiki MobileKnowledge Management in your Pocket
  • openQAOpen Question Answering Framework
  • PalmettoPalmetto is a quality measuring tool for topics
  • PCP on WebProfessorial Career Patterns of the Early Modern History
  • QAMELQuestion Answering on Mobil Devices
  • QualisBrasilLinked Open Data for supporting scientometric studies
  • QUETSALA Query Federation Suite for SPARQL
  • RDFSliceLarge-scale RDF Dataset Slicing
  • Relation Annotation in GENIA
  • REXWeb-Scale Extension of RDF Knowledge Bases
  • RockerA Refinement Operator for Key Discovery
  • SAIM(Semi-)Automatic Instance Matcher
  • SAKEWith RDF and Machine Learning Getting Results Faster
  • SANSA-StackOpen source platform for distributed data processing for RDF large-scale datasets
  • SCAROScalable RDF compression using rule subsumption hierachies
  • SCMSSemantic Content Management Systems
  • SemanticQurana Multilingual Resource for Natural-Language Processing
  • SlideWikihelps communities to create great presentations collaboratively
  • SLIPOScalable Linking and Integration of Big POI data
  • SMARTA Semantic Search Engine
  • SPARQL2NLconverting SPARQL queries to natural language
  • TapiocaTapioca is a search engine for topically similar RDF datasets.

Publications

Filters

by (Editors: ) [BibTex of ]

News

DBpedia @ Google Summer of Code – GSoC 2017 ( 2017-03-13T11:12:50+01:00 Christopher Schulz)

2017-03-13T11:12:50+01:00 Christopher Schulz

DBpedia, one of InfAI’s community projects, will be part of the 5th Google Summer of Code program. The GsoC has the goal to bring students from all over the globe into open source software development. Read more about "DBpedia @ Google Summer of Code – GSoC 2017"

New GERBIL release v1.2.5 – Benchmarking entity annotation systems ( 2017-03-10T11:49:51+01:00 by Ricardo Usbeck)

2017-03-10T11:49:51+01:00 by Ricardo Usbeck

Dear all, the Smart Data Management competence center at AKSW is happy to announce GERBIL 1.2.5. Read more about "New GERBIL release v1.2.5 – Benchmarking entity annotation systems"

DBpedia Open Text Extraction Challenge – TextExt ( 2017-03-09T12:15:57+01:00 Christopher Schulz)

2017-03-09T12:15:57+01:00 Christopher Schulz

DBpedia, a community project affiliated with the Institute for Applied Informatics (InfAI) e.V., extract structured information from Wikipedia & Wikidata. Now DBpedia started the DBpedia Open Text Extraction Challenge – TextExt. Read more about "DBpedia Open Text Extraction Challenge – TextExt"

The USPTO Linked Patent Dataset release ( 2017-02-24T17:18:51+01:00 by Mofeed Hassan)

2017-02-24T17:18:51+01:00 by Mofeed Hassan

Dear all, We are happy to announce USPTO Linked Patent Dataset release. Patents are widely used to protect intellectual property and a measure of innovation output. Read more about "The USPTO Linked Patent Dataset release"

Two accepted papers in ESWC 2017 ( 2017-02-22T17:43:38+01:00 by Dr. Mohamed Ahmed Sherif)

2017-02-22T17:43:38+01:00 by Dr. Mohamed Ahmed Sherif

Hello Community! We are very pleased to announce the acceptance of two papers in ESWC 2017 research track. The ESWC 2017 is to be held in Portoroz, Slovenia from 28th of May to the 1st of June. Read more about "Two accepted papers in ESWC 2017"