RSS

TRmorph: a relatively complete morphological analyzer for Turkish (under GPL)

25 Aug

TRmorph is a relatively complete morphological analyzer for Turkish. It is implemented using SFST, and uses a lexicon based on (but heavily modified) the word list from Zemberek spell checker. The morphological analyzer is distributed under the GPL.

To use the analyzer you need SFST. As well as the full source code, a compiled fsa, suitable to be used with SFST’s fst-mor or fst-infl is included. A UNIX makefile is provided for easy compilation from the sources (see the included README file for details. The analyzer is fairly complete, however, it may not be easy on unaccustomed eyes. Documentation and cleanup work is going on, you may want to visit soon to get a newer version.”

For details and live demo see http://www.let.rug.nl/~coltekin/trmorph/ and http://www.let.rug.nl/~coltekin/papers/coltekin-lrec2010.pdf

For some relevant natural language processing resources please see Resources for Turkish morphological processing, Morphological Disambiguation of Turkish Text with Perceptron Algorithm and http://denizyuret.blogspot.com/2006/11/turkish-resources.html.

 
Leave a comment

Posted by on August 25, 2010 in Linguistics, Programlama

 

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

 
%d bloggers like this: