Thanks to Mojca Miklavec
+
+2010-03-19: Version 0.5.2
+ * Do not break before cons+virama in Tamil- Thanks to Kevin for patch.
+ * Do not break before the Vowel in Tamil and Malayalam (eg: Transliterated Arabic words) - Thanks to Kevin for patch.
+ * We can break after virama in Tamil- Thanks Kevin for the patch
+ * Many cosmetic fixes- Thanks to Mojca Miklavec, texhyphens
+
+2010-02-04: Version 0.5.1
+ * Fixed the confusing license headers, thanks to Parag for reporting.
+
+2009-08-22: Version 0.5
+ * Fixed the bugs reported by Kevin for Malayalam
+ * For all patterns the problems with incompatibility between tex patterns and libhyphen of Openoffice fixed
+ * Added testcases for all languages.
+
+2009-08-18: Version 0.41
+ * Assamese hyphenation patters added. Copied from Bengali.
+
+2009-08-13: Version 0.4
+ * The patters for all languages made compatible with tex rules for hyphenation
+ * Don't break on either side of zwj/zwnj for all languages
+ * LEFTHYPHENMIN and RIGHTHYPHENMIN properties removed. It can be configured from applications
+
+2009-08-12: Version 0.3
+ * Removed unnecessary zwj/zwnj rules
+ * Added LEFTHYPHENMIN and RIGHTHYPHENMIN for rules
+ * Added Marathi Rules
+
+2009-05-12: Version 0.2
+ * Vowel sign + [anuswaram|visargam|chandrabindu] fix for Bengali as reported by Runa B.
+ * Vowel sign + [visargam] fix for Malayalam
+
+2008-12-14: Version 0.1
+ * Initial version
+ * Hyphenation Patterns for 9 Languages
diff --git a/Makefile b/Makefile
new file mode 100644
index 0000000..47c56a5
--- /dev/null
+++ b/Makefile
@@ -0,0 +1,63 @@
+# Hyphenation Makefile
+# Copyright (C) 2009-2011 Shravan Aras <>
+# Copyright (C) 2008-2010 Santhosh Thottingal
+#
+# This library is free software; you can redistribute it and/or
+# modify it under the terms of the GNU Lesser General Public
+# License as published by the Free Software Foundation;
+# version 3 or later version of the License.
+#
+# This library is distributed in the hope that it will be useful,
+# but WITHOUT ANY WARRANTY; without even the implied warranty of
+# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
+# Lesser General Public License for more details.
+#
+# You should have received a copy of the GNU Lesser General Public
+# License along with this library; if not, write to the Free Software
+# Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
+
+DESTDIR =
+hyphdir = /usr/share/hyphen/
+dictdir = /usr/share/myspell/dicts/
+ooo_info_dir = /usr/share/myspell/infos/ooo/
+docdir = /usr/share/doc/hyphen-te/
+
+
+dict_files = hyph_te_IN.dic
+doc_files = README ChangeLog
+extra_files = Makefile
+ooo_info_file =
+version = 0.7.0
+
+distdir=hyphen-te-${version}
+
+all: ${doc_files} ${dict_files}
+
+install: all
+	mkdir -p ${DESTDIR}${ooo_info_dir}
+	cp ${ooo_info_file} ${DESTDIR}${ooo_info_dir}
+	mkdir -p ${DESTDIR}${docdir}
+	cp ${doc_files} ${DESTDIR}${docdir}
+	mkdir -p ${DESTDIR}${hyphdir}
+	cp ${dict_files} ${DESTDIR}${hyphdir}
+	chmod 644 ${DESTDIR}${hyphdir}${dict_files}
+	mkdir -p ${DESTDIR}${dictdir}
+	ln -s ${hyphdir}${dict_files} ${DESTDIR}${dictdir}${dict_files}
+
+uninstall:
+	-cd ${DESTDIR}${dictdir}/ && rm ${dict_files}
+	-cd ${DESTDIR}${docdir}/ && rm ${doc_files}
+	-cd ${DESTDIR}${ooo_info_dir}/ && rm ${ooo_info_file}
+	-cd ${DESTDIR}${hyphdir}/ && rm ${dict_files}
+
+clean:
+	-rm -rf ${distdir}.tar.bz2 ${distdir}
+
+dist:
+	-rm -rf ${distdir}.tar.bz2 ${distdir}
+	mkdir ${distdir}
+	cp -p ${dict_files} ${doc_files} ${ooo_info_file} ${extra_files} ${distdir}/
+	tar cf ${distdir}.tar ${distdir}/
+	bzip2 -9 ${distdir}.tar
+	rm -rf ${distdir}/
+
diff --git a/README b/README
new file mode 100644
index 0000000..92f2964
--- /dev/null
+++ b/README
@@ -0,0 +1,11 @@
+Hyphenation Patterns for Indian Languages
+-----------------------------------------
+
+Author:
+Santhosh Thottingal
+
+Description:
+This is a set of hyphenation rules as per the tex hyphenation pattern. Can be used with openoffice hyphenation.
+
+License:
+LGPL v3+
diff --git a/hyph_te_IN.dic b/hyph_te_IN.dic
new file mode 100755
index 0000000..5193cfe
--- /dev/null
+++ b/hyph_te_IN.dic
@@ -0,0 +1,99 @@
+UTF-8
+% Hyphenation for Telugu
+% Copyright (C) 2008-2010 Santhosh Thottingal
+%
+% This library is free software; you can redistribute it and/or
+% modify it under the terms of the GNU Lesser General Public
+% License as published by the Free Software Foundation;
+% version 3 or later version of the License.
+%
+% This library is distributed in the hope that it will be useful,
+% but WITHOUT ANY WARRANTY; without even the implied warranty of
+% MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
+% Lesser General Public License for more details.
+%
+% You should have received a copy of the GNU Lesser General Public
+% License along with this library; if not, write to the Free Software
+% Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
+%
+% GENERAL RULE
+% Do not break either side of ZERO-WIDTH JOINER (U+200D)
+2‍2
+% Break on both sides of ZERO-WIDTH NON JOINER (U+200C)
+1‌1
+% Break before or after any independent vowel.
+అ1
+ఆ1
+ఇ1
+ఈ1
+ఉ1
+ఊ1
+ఋ1
+ౠ1
+ఌ1
+ౡ1
+ఎ1
+ఏ1
+ఐ1
+ఒ1
+ఓ1
+ఔ1
+% Break after any dependent vowel, but not before.
+ా1
+ి1
+ీ1
+ు1
+ూ1
+ృ1
+ౄ1
+ె1
+ే1
+ై1
+ొ1
+ో1
+ౌ1
+% Break before or after any consonant.
+1క
+1ఖ
+1గ
+1ఘ
+1ఙ
+1చ
+1ఛ
+1జ
+1ఝ
+1ఞ
+1ట
+1ఠ
+1డ
+1ఢ
+1ణ
+1త
+1థ
+1ద
+1ధ
+1న
+1ప
+1ఫ
+1బ
+1భ
+1మ
+1య
+1ర
+1ఱ
+1ల
+1ళ
+1వ
+1శ
+1ష
+1స
+1హ
+% Do not break before chandrabindu, anusvara, visarga,
+% length mark and ai length mark.
+2ఁ1
+2ం1
+2ః1
+2ౕ1
+2ౖ1
+% Do not break either side of virama (may be within conjunct).
+2్2
diff --git a/ b/
new file mode 100644
index 0000000..e992a32
--- /dev/null
+++ b/
@@ -0,0 +1 @@
+HYPH te IN hyph_te_IN
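
Note (not part of the patch): the entries in hyph_te_IN.dic follow the TeX pattern convention, where a digit between letters scores that gap, the highest score at each gap wins, and an odd score allows a break while an even one forbids it. The sketch below is a minimal illustration of that scoring, assuming a small hand-picked subset of the patterns above. The helper names `parse_pattern` and `hyphenate` are hypothetical and are not the libhyphen API. It also ignores word-boundary markers and the left/right hyphen minimums that applications configure.

```python
DIGITS = "0123456789"

# A few patterns copied from hyph_te_IN.dic, written with escapes:
# U+0C05 (a), U+0C2E (ma), U+0C4D (virama), U+0C46 (sign e),
# U+0C41 (sign u), U+0C24 (ta), U+0C32 (la), U+0C17 (ga).
PATTERNS = [
    "\u0c051", "1\u0c2e", "2\u0c4d2",
    "\u0c461", "\u0c411", "1\u0c24", "1\u0c32", "1\u0c17",
]


def parse_pattern(pattern):
    """Split a pattern into its letters and the score of each gap."""
    letters = []
    weights = [0]  # weights[i] is the gap before letters[i]
    for ch in pattern:
        if ch in DIGITS:
            weights[-1] = int(ch)
        else:
            letters.append(ch)
            weights.append(0)
    return "".join(letters), weights


def hyphenate(word, patterns, sep="-"):
    """Insert sep at every inner gap whose winning score is odd."""
    scores = [0] * (len(word) + 1)  # scores[i] is the gap before word[i]
    for letters, weights in map(parse_pattern, patterns):
        start = word.find(letters)
        while start != -1:
            for offset, weight in enumerate(weights):
                pos = start + offset
                scores[pos] = max(scores[pos], weight)
            start = word.find(letters, start + 1)
    pieces, last = [], 0
    for i in range(1, len(word)):  # never break at the word edges
        if scores[i] % 2 == 1:
            pieces.append(word[last:i])
            last = i
    pieces.append(word[last:])
    return sep.join(pieces)


# "telugu": breaks after each vowel sign, before the next consonant.
print(hyphenate("\u0c24\u0c46\u0c32\u0c41\u0c17\u0c41", PATTERNS))
# "amma": the even scores around the virama keep the conjunct together.
print(hyphenate("\u0c05\u0c2e\u0c4d\u0c2e", PATTERNS))
```

In the second word, the `1` in front of the second ma loses to the `2` after the virama, so the conjunct stays on one line.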