KorAP: Architecture & Development
The KorAP project was launched in 2011 at the Leibniz Institute for the German Language (IDS) in Mannheim, Germany. KorAP is a Corpus Analysis Platform, with an extensible and scalable architecture. It consists of multiple components, that are exchangable due to well defined APIs.
- Kalamar
- Kalamar is the user frontend. Most users will access the search and analysis capabilities of KorAP through the user frontend. The user will formulate queries in one of several query languages that are passed to Kustvakt.
- API
- As an alternative to the user frontend, queries can be initiated using a REST web interface directly with Kustvakt.
- Kustvakt
- Kustvakt is the central user and policy management service. It takes a query and rewrites it to restrict the scope of a search to documents the user is allowed to. It may also inject further properties the user has set up, like preferred annotation layers. To transfer a query written in a certain query language to the internal general query protocol KoralQuery, Kustvakt uses Koral.
- Koral
- Koral is a library that translates queries written in one of several implemented query languages into a general query protocol, called KoralQuery.
- Krill
- Krill is a Lucene based backend engine that can search large corpora for occurrences of search patterns formulated in KoralQuery.
- Karang
- Karang is a Neo4j based backend engine that can traverse large corpora represented as graphs.
Team
- Head
- Marc Kupietz, Andreas Witt
- Scientific Staff
- Franck Bodmer, Nils Diewald, Peter Harders, Eliza Margaretha, Helge Stallkamp
- Translations
- Susanne Feix
- Past Project Members
- Piotr Bański, Joachim Bingel, Elena Frick, Michael Hanl, Carsten Schnober
- Consulting
- Cyril Belica, Piotr Pęzik, Oliver Schonefeld