LEMMATIZER AND POS TAGGER FOR THE GREEK LANGUAGE
The lemmatizer for the Greek language is a tool that, given a Greek word as input,
analyzes the word and returns its dictionary citation form (its lemma).
The lemmatizer has been used as the basis for a tool that counts the occurrences of
words in a Greek corpus across all their inflected forms. Given a number of texts in
Greek, this tool produces a list with the total frequency of each word in the texts,
regardless of the inflected form in which the word appears.
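
The counting step can be illustrated with a minimal sketch. It assumes a tiny hand-written lookup table (TOY_LEMMAS) in place of the actual lemmatizer, whose interface is not described here; the function names and the sample texts are likewise hypothetical and serve only to show how inflected forms collapse into a single entry.

```python
import re
import unicodedata
from collections import Counter


def nfc(text: str) -> str:
    """Normalize to NFC so accented Greek letters compare consistently."""
    return unicodedata.normalize("NFC", text)


# Hypothetical toy table standing in for the real lemmatizer:
# inflected form -> dictionary citation form.
TOY_LEMMAS = {
    nfc(form): nfc(lemma)
    for form, lemma in {
        "παιδιά": "παιδί",
        "παιδιού": "παιδί",
        "παιδί": "παιδί",
        "παίζουν": "παίζω",
        "παίζει": "παίζω",
    }.items()
}


def lemmatize(word: str) -> str:
    """Return the citation form; unknown words fall back to themselves."""
    return TOY_LEMMAS.get(word, word)


def lemma_frequencies(texts):
    """Count occurrences per lemma across all texts."""
    counts = Counter()
    for text in texts:
        # \w+ matches runs of letters, including Greek, in Python 3.
        for token in re.findall(r"\w+", nfc(text).lower()):
            counts[lemmatize(token)] += 1
    return counts


texts = ["Τα παιδιά παίζουν.", "Το παιδί παίζει."]
freqs = lemma_frequencies(texts)
for lemma, count in freqs.most_common():
    print(lemma, count)
```

In this toy input the forms "παιδιά" and "παιδί" are counted under the single lemma "παιδί", and "παίζουν" and "παίζει" under "παίζω", so each of those lemmas has a frequency of 2.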
The Part of Speech Tagger uses a Brill-based tagger adapted for the Greek language. It takes as input a sentence in Greek and tags each word in that sentence with its corresponding Part of Speech.
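
As a rough illustration of the general Brill approach (not of this tool's actual lexicon, tagset, or rules, none of which are given here), the sketch below first assigns each word its most likely tag from a lexicon and then applies an ordered list of contextual transformation rules. The lexicon entries, the tag names, and the single rule are all hypothetical.

```python
import unicodedata
from typing import List, Tuple


def nfc(text: str) -> str:
    """Normalize to NFC so accented Greek letters compare consistently."""
    return unicodedata.normalize("NFC", text)


# Hypothetical lexicon: the most likely tag for each known word.
LEXICON = {
    nfc(word): tag
    for word, tag in {
        "το": "DET",
        "παιδί": "NOUN",
        "βλέπει": "VERB",
    }.items()
}
DEFAULT_TAG = "NOUN"  # assumed fallback for unknown words

# Hypothetical transformation rules, applied in order:
# (from_tag, to_tag, next_tag) means "change from_tag to to_tag
# when the following word is tagged next_tag".
RULES = [
    ("DET", "PRON", "VERB"),  # "το" directly before a verb is a clitic pronoun
]


def tag(sentence: str) -> List[Tuple[str, str]]:
    """Tag a whitespace-tokenized sentence in two Brill-style stages."""
    words = nfc(sentence).lower().split()
    # Stage 1: initial tagging with the most likely tag per word.
    tags = [LEXICON.get(w, DEFAULT_TAG) for w in words]
    # Stage 2: apply each transformation rule over the whole sentence.
    for from_tag, to_tag, next_tag in RULES:
        snapshot = list(tags)  # test contexts against the tags before this rule
        for i in range(len(words) - 1):
            if snapshot[i] == from_tag and snapshot[i + 1] == next_tag:
                tags[i] = to_tag
    return list(zip(words, tags))


tagged = tag("Το παιδί το βλέπει")
for word, pos in tagged:
    print(word, pos)
```

In a real Brill tagger the rules are learned from an annotated corpus rather than written by hand; the single rule here only shows how a contextual transformation corrects the initial tag of the second "το" from DET to PRON.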