wordxtr

Created 7 years ago
Maintained by pnemade
The wordxtr project aims to create a dictionary of words from text in a given language
                     wordxtr

The wordxtr project aims to create a dictionary of words from text in a given language. The dictionary uses the hunspell format, so the output is a .dic file and an .aff file.
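To make the output format concrete, here is a minimal Python sketch of the hunspell file pair. It is not wordxtr's actual code: the function name write_minimal_dictionary is hypothetical, and it assumes a bare word list with no affix rules. It relies only on two general properties of the hunspell format: a .dic file begins with an entry count followed by one word per line, and an .aff file can declare its encoding with a SET line.

```python
from pathlib import Path


def write_minimal_dictionary(words, out_dir, locale_code):
    """Write <locale_code>.dic and <locale_code>.aff into out_dir.

    Assumption: a bare word list with no affix rules. This illustrates
    the file layout only; it is not what wordxtr itself produces.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    unique_words = sorted(set(words))

    # A .dic file starts with an entry count, then lists one word per line.
    dic_path = out_dir / f"{locale_code}.dic"
    dic_lines = [str(len(unique_words))] + unique_words
    dic_path.write_text("\n".join(dic_lines) + "\n", encoding="utf-8")

    # A minimal .aff file only declares the character encoding.
    aff_path = out_dir / f"{locale_code}.aff"
    aff_path.write_text("SET UTF-8\n", encoding="utf-8")
    return dic_path, aff_path
```

A real dictionary would normally carry affix rules in the .aff file as well; the sketch leaves them out on purpose.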

This is a command-line tool that takes two inputs:

1) A language code combined with a country code. Language codes are listed in /usr/share/iso-codes/iso_639.tab. For example, use hi_IN for Hindi or ne_NP for Nepali.

2) The full path to a directory that contains only text data files.
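The first input follows a language_COUNTRY pattern. As a rough illustration, a script wrapping the tool could check that shape before running it. The helper parse_locale_code below is hypothetical, and the exact rule it applies (a two- or three-letter lowercase language code, an underscore, then a two-letter uppercase country code) is an assumption inferred from the hi_IN and ne_NP examples, not something wordxtr is known to enforce.

```python
import re

# Assumed shape: lowercase language code, underscore, uppercase country code.
LOCALE_PATTERN = re.compile(r"([a-z]{2,3})_([A-Z]{2})")


def parse_locale_code(code):
    """Split a code such as 'hi_IN' into (language, country).

    Raises ValueError when the code does not match the assumed shape.
    """
    match = LOCALE_PATTERN.fullmatch(code)
    if match is None:
        raise ValueError(f"expected a code like hi_IN, got {code!r}")
    return match.group(1), match.group(2)
```

This only checks the shape of the code; it does not confirm that the language is actually listed in iso_639.tab.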

Assumptions: This tool assumes that the directory contains only plain-text Unicode files.
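The sketch below shows why that assumption matters. It is a hypothetical Python approximation of the word-collection step, not the tool's own implementation. It assumes the files are UTF-8 encoded, that words are separated by whitespace, and that punctuation only needs stripping from the ends of each token. The names collect_words and strip_punctuation are made up for this example.

```python
import unicodedata
from pathlib import Path


def strip_punctuation(token):
    """Remove punctuation characters from both ends of a token."""
    start, end = 0, len(token)
    while start < end and unicodedata.category(token[start]).startswith("P"):
        start += 1
    while end > start and unicodedata.category(token[end - 1]).startswith("P"):
        end -= 1
    return token[start:end]


def collect_words(text_dir):
    """Return the sorted unique words found in every file of text_dir."""
    words = set()
    for path in sorted(Path(text_dir).iterdir()):
        if not path.is_file():
            continue
        # Assumption: UTF-8. A binary or differently encoded file would
        # raise UnicodeDecodeError here, hence the plain-text requirement.
        text = path.read_text(encoding="utf-8")
        for token in text.split():
            word = strip_punctuation(token)
            if word:
                words.add(word)
    return sorted(words)
```

Splitting on whitespace, rather than matching \w+ with a regular expression, keeps Devanagari words intact, because Python's \w does not match combining vowel signs.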

Example: To create a Hindi dictionary, one can use the following command:

    wordxtr hi_IN sample_text_directory

Requirements:
* The wordlist2hunspell script, available at http://people.redhat.com/caolanm/hunspell/wordlist2hunspell