MyThes is a simple thesaurus that uses a structured
text data file and an index file with binary search
to lookup words and phrases and return information
on part of speech, meanings, and synonyms
MyThes was originall written to provide a thesaurus
for the OpenOffice.org project
The Main features of MyThes are:
1. written in C++ to make it easier to interface with
LibreOffice, OpenOffice, AbiWord, Pspell, etc
2. it is stateless, uses no static variables and
should be completely reentrant with no ifdefs
3. it compiles with -ansi and -pedantic and -Wall
with no warnigns so it shouldbe quite portable
4. it uses a simple perl program to read the structured
text file and create the index needed for binary
5. it is very simple with *lots* of comments.
The main "smarts" are in the structure of the
text file that makes up the thesaurus data
6. It comes with a ready-to-go structured thesaurus
data file for en_US extracted from the WordNet-2.0 data.
Please see WordNet_license.txt and WordNet_readme.txt
for more information on the very useful project!
See http://www.danielnaber.de/wn2ooo/ for utilities to
regenerate an up to date English thesaurus from the most
recent WordNet data.
7. The source code has a BSD license (and no advertising clause)
MyThes comes with a simple example program that looks up some words and returns
meanings and synonyms.
To build it simply do the following:
To run the example program:
./example th_en_US_new.idx th_en_US_new.dat checkme.lst
To run the example program with stemming and morphological generation:
e.g. to check mouse, mice, rodents, eats, eaten, ate, eating etc. words
./example morph.idx morph.dat morph.lst morph.aff morph.dic
NOTE: this is only an example and test environment for dictionary developers,
full English stemming and morphological generation needs an improved
English Hunspell dictionary.
László Németh <nemeth at OO.o>
Kevin Hendricks <firstname.lastname@example.org>