LEMMATIZER


:: WebSite >>

LEMMATIZER AND POS TAGGER FOR THE GREEK LANGUAGE

 

DESCRIPTION

The lemmatizer for the Greek language is a tool whose function is, when given as input a word in Greek, to analyze the word and to find its dictionary citation form. The lemmatizer has been used as the basis for the development of a tool that counts the occurrences of words in a greek corpus, in all their inflected forms. This tool given a number of texts in Greek creates a list giving the frequency of total occurrences of each word in the texts, regardless of the inflection type in which this word appears.
The Part of Speech Tagger is using a Brill-based tagger adapted for the greek language. It takes as input a sentence in Greek and tags each word in that sentence with its corresponding Part of Speech.

  Back
Up  
UNIVERSITY OF ATHENS - DEPARTMENT OF INFORMATICS & TELECOMMUNICATIONS - COMPUTER SYSTEMS & APPLICATIONS