MyThes is a simple thesaurus that uses a structured
text data file and an index file with binary search
to look up words and phrases and return information
on part of speech, meanings, and synonyms.

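As a rough illustration of the lookup idea described above, here is a
minimal C++11 sketch of a binary search over a sorted, in-memory index of
word/offset pairs. The names IndexEntry and find_offset are hypothetical
and invented for this example; they are not part of the MyThes API, and
the real index file layout may differ.

```cpp
#include <algorithm>
#include <string>
#include <vector>

// Hypothetical index entry: a word and the byte offset of its record
// in the thesaurus data file. This is NOT the actual MyThes layout.
struct IndexEntry {
    std::string word;
    long offset;
};

// Binary search over entries that are sorted by word (byte-wise order).
// Returns the data-file offset, or -1 if the word is not in the index.
long find_offset(const std::vector<IndexEntry>& index, const std::string& word) {
    auto it = std::lower_bound(
        index.begin(), index.end(), word,
        [](const IndexEntry& e, const std::string& w) { return e.word < w; });
    if (it != index.end() && it->word == word) {
        return it->offset;
    }
    return -1;
}
```

The function only reads its arguments and keeps no state between calls,
so it can be called from several threads on the same index.
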
MyThes was originally written to provide a thesaurus
for the OpenOffice.org project.

The main features of MyThes are:

1. It is written in C++ to make it easier to interface with
   LibreOffice, OpenOffice, AbiWord, Pspell, etc.

2. It is stateless, uses no static variables, and
   should be completely reentrant with no ifdefs.

3. It compiles with -ansi, -pedantic, and -Wall
   with no warnings, so it should be quite portable.

4. It uses a simple Perl program to read the structured
   text file and create the index needed for binary
   searching.

5. It is very simple, with *lots* of comments.
   The main "smarts" are in the structure of the
   text file that makes up the thesaurus data.

6. It comes with a ready-to-go structured thesaurus
   data file for en_US extracted from the WordNet-2.0 data.

   Please see WordNet_license.txt and WordNet_readme.txt
   for more information on this very useful project!

   See http://www.danielnaber.de/wn2ooo/ for utilities to
   regenerate an up-to-date English thesaurus from the most
   recent WordNet data.

7. The source code has a BSD license (with no advertising clause).


MyThes comes with a simple example program that looks up some words and returns
meanings and synonyms.

To build it, simply do the following:

unzip mythes.zip
cd mythes
./configure
make

To run the example program:
./example th_en_US_new.idx th_en_US_new.dat checkme.lst

To run the example program with stemming and morphological generation
(e.g. to check words such as mouse, mice, rodents, eats, eaten, ate, eating):
./example morph.idx morph.dat morph.lst morph.aff morph.dic

NOTE: this is only an example and test environment for dictionary developers;
full English stemming and morphological generation need an improved
English Hunspell dictionary.

László Németh <nemeth at OO.o>
Kevin Hendricks <kevin.hendricks@sympatico.ca>