I am a PhD student and research associate at the Knowledge Integration and Language Technology (KILT) research group at AKSW. Having finished my master’s degree in linguistics in 2013, I am primarily working in the interdisciplinary area of linguistic Linked Data which connects linguistic and language data with Semantic Web technologies. My main research fields are:
- Development of ontological models for linguistic data description
- The creation and usage of morphological language data
- Automatic language data generation
- Lexical and morphological text annotation and processing
- Linking and providing interoperable language resources
- Language documentation and typology
If you are interested in one of these or related research areas and like to write your Bachelor or Master thesis, feel free to contact me!
Since 2013 I am actively participating in the Open Linguistics Working group. This community group is an initiative of experts from different fields concerned with linguistic data, ranging from academic linguistics (e.g. typology, corpus linguistics) and applied linguistics (e.g. computational linguistics, lexicography, language documentation) to data from the Natural Language Processing and Semantic Web communities.
Being a member of the organization committee, I will be directly involved in the realization of the 5th Workshop on Linked Data in Linguistics “Managing, Building and Using Linked Language Resources” which will be co-located with the 10th edition of the Language Resources and Evaluation Conference.
The project’s mission is to provide the basis for the creation of a Linguistic Linked Data cloud that can support content analytics tasks of unstructured multilingual cross-media content. Next to being actively involved in various LIDER events, I contributed to four deliverables. My main contribution is the investigation and comparison of the major existing linguistic data repositories on the Web (cf. D2.1.1 p. 32ff.).
In June 2015 I joined the 1st Summer Datathon on Linguistic Linked Open Data as a tutor. In that role I was able to share my practical knowledge with people from industry and academia in the field of Linked Data applied to Linguistics.
During my one year internship (2014/2015) at K Dictionaries Ltd I was able to receive valuable insights into the company’s procedures in creating and exploiting multilingual language data. I learned from their longstanding experiences and success in creating high quality lexicographic resources and contributed to the transition from K Dictionaries’ elaborate XML data structures to semantic data formats by providing a lemon-RDF conversion for a German sample data set.
Being in the organizing committee, I was part of the Multilingual Linked Open Data for Enterprises workshop which brought together developers, data producers, academia and enterprises from various fields of linguistics, natural language processing, and information technology (Linked Data) to present and discuss principles, case studies, and best practices for representing, publishing and linking linguistic data collections, including corpora, dictionaries, lexical networks, thesauri, etc.
- LIDER – Linked Data as an enabler of cross-media and multilingual content analytics for enterprises across Europe
- MMoOn – The Multilingual Morpheme Ontology and Language Inventories
- NLP2RDF – Converting NLP tool output to RDF