$Id: README,v 1.1.1.1 2001/05/24 15:57:40 sano Exp $ EntityMap -- Entity Mapping Tables Version 0.1 Maintained by Ken MacLeod ken@bitsko.slc.ut.us INTRODUCTION EntityMap is a set of look-up tables for translating SGML character entity names into output formats. This release of EntityMap includes mappings for the ISO 8879:1986 character entity sets to ASCII, Latin 1, TeX, Texinfo, and RTF. EntityMap includes a Perl module for reading and querying the entity mapping tables. Documentation is in PerlDoc in `EntityMap' and will also be installed as a man page as `Text::EntityMap(3)'. STATUS The mapping tables in this release come directly from GF (General Formatter) by Gary Houston. Upcoming releases will merge mappings from SGML Tools and Jade. ACKNOWLEDGEMENTS These are Gary Houston's acknowledgements for the initial `sdata' files: * The tables for the conversion of `ISOlat1' to ``best'' ASCII follow a system developed by Markus Kuhn. * `ISOlat1.2tex' is based on a `latin1' to TeX table by (I think) Peter Flynn. * Other TeX symbols were grabbed individually from numerous sources. INSTALLATION If you are not using the Perl module you can copy the files in the `sdata' directory to wherever you need them. If you are using the Perl module, the following commands will install the Perl module into your standard Perl library and install the `sdata' files into `$PREFIX/lib/entity-map-0.1'. zcat entity-map-0.1.tar.gz | tar xvf - cd entity-map-0.1 ./configure make make install FORMAT Each file contains one character entity per line. Each line is the entity name, followed by a tab, followed by the replacement text for that entity. The replacement text should be already escaped properly for it's output format. If there is no equivalent output format for an entity, the convention is use the entity name within braces (`{name}') so that the braces appear in the output. NOTE: The file format may change in the future. Other output formats may also require a new file format. FILE NAMES The current convention is `ENTITY-SET.2FORMAT' where ENTITY-SET is the source entity set name (like `ISOpub') and FORMAT is an identifier for the output format: .2ab ASCII (best approximation) .2as ASCII .2l1b Latin 1 (best approximation) .2l1s Latin 1 .2tex TeX .2texi Texinfo .2rtf RTF .2tr TROFF