Blame TW/TW.pm

Packit d0f5c2
package Encode::TW;
Packit d0f5c2
BEGIN {
Packit d0f5c2
    if ( ord("A") == 193 ) {
Packit d0f5c2
        die "Encode::TW not supported on EBCDIC\n";
Packit d0f5c2
    }
Packit d0f5c2
}
Packit d0f5c2
use strict;
Packit d0f5c2
use warnings;
Packit d0f5c2
use Encode;
Packit d0f5c2
our $VERSION = do { my @r = ( q$Revision: 2.3 $ =~ /\d+/g ); sprintf "%d." . "%02d" x $#r, @r };
Packit d0f5c2
use XSLoader;
Packit d0f5c2
XSLoader::load( __PACKAGE__, $VERSION );
Packit d0f5c2
Packit d0f5c2
1;
Packit d0f5c2
__END__
Packit d0f5c2
Packit d0f5c2
=head1 NAME
Packit d0f5c2
Packit d0f5c2
Encode::TW - Taiwan-based Chinese Encodings
Packit d0f5c2
Packit d0f5c2
=head1 SYNOPSIS
Packit d0f5c2
Packit d0f5c2
    use Encode qw/encode decode/; 
Packit d0f5c2
    $big5 = encode("big5", $utf8); # loads Encode::TW implicitly
Packit d0f5c2
    $utf8 = decode("big5", $big5); # ditto
Packit d0f5c2
Packit d0f5c2
=head1 DESCRIPTION
Packit d0f5c2
Packit d0f5c2
This module implements tradition Chinese charset encodings as used
Packit d0f5c2
in Taiwan and Hong Kong.
Packit d0f5c2
Encodings supported are as follows.
Packit d0f5c2
Packit d0f5c2
  Canonical   Alias		Description
Packit d0f5c2
  --------------------------------------------------------------------
Packit d0f5c2
  big5-eten   /\bbig-?5$/i	Big5 encoding (with ETen extensions)
Packit d0f5c2
          /\bbig5-?et(en)?$/i
Packit d0f5c2
          /\btca-?big5$/i
Packit d0f5c2
  big5-hkscs  /\bbig5-?hk(scs)?$/i
Packit d0f5c2
              /\bhk(scs)?-?big5$/i
Packit d0f5c2
                                Big5 + Cantonese characters in Hong Kong
Packit d0f5c2
  MacChineseTrad		Big5 + Apple Vendor Mappings
Packit d0f5c2
  cp950		                Code Page 950 
Packit d0f5c2
                                = Big5 + Microsoft vendor mappings
Packit d0f5c2
  --------------------------------------------------------------------
Packit d0f5c2
Packit d0f5c2
To find out how to use this module in detail, see L<Encode>.
Packit d0f5c2
Packit d0f5c2
=head1 NOTES
Packit d0f5c2
Packit d0f5c2
Due to size concerns, C<EUC-TW> (Extended Unix Character), C<CCCII>
Packit d0f5c2
(Chinese Character Code for Information Interchange), C<BIG5PLUS>
Packit d0f5c2
(CMEX's Big5+) and C<BIG5EXT> (CMEX's Big5e) are distributed separately
Packit d0f5c2
on CPAN, under the name L<Encode::HanExtra>. That module also contains
Packit d0f5c2
extra China-based encodings.
Packit d0f5c2
Packit d0f5c2
=head1 BUGS
Packit d0f5c2
Packit d0f5c2
Since the original C<big5> encoding (1984) is not supported anywhere
Packit d0f5c2
(glibc and DOS-based systems uses C<big5> to mean C<big5-eten>; Microsoft
Packit d0f5c2
uses C<big5> to mean C<cp950>), a conscious decision was made to alias
Packit d0f5c2
C<big5> to C<big5-eten>, which is the de facto superset of the original
Packit d0f5c2
big5.
Packit d0f5c2
Packit d0f5c2
The C<CNS11643> encoding files are not complete. For common C<CNS11643>
Packit d0f5c2
manipulation, please use C<EUC-TW> in L<Encode::HanExtra>, which contains
Packit d0f5c2
planes 1-7.
Packit d0f5c2
Packit d0f5c2
The ASCII region (0x00-0x7f) is preserved for all encodings, even
Packit d0f5c2
though this conflicts with mappings by the Unicode Consortium.
Packit d0f5c2
Packit d0f5c2
=head1 SEE ALSO
Packit d0f5c2
Packit d0f5c2
L<Encode>
Packit d0f5c2
Packit d0f5c2
=cut