Extending NLP functionality for Germanic Languages
Eleftheria
NLP is severely lacking in meaningful functionalities for Germanic languages. Normalization, POS tagging and stemming modules (all significant parts...
The Road to CDLI’s Corpora Integration into CLTK: an Undertaking
Andrew Deloucas
This project focuses on integrating Cuneiform Digital Library Initiative (CDLI) corpora into the Classical Language Toolkit (CLTK). Currently, CLTK...
Expanding the CLTK with Synonyms, Translations and Word Embeddings
James Gawley
The CLTK features the most sophisticated algorithm available for lemmatizing classical Latin. Lemmatization is the process by which inflected...