diff --git a/man3/nl_langinfo.3 b/man3/nl_langinfo.3 index e37f07b..63de5c2 100644 --- a/man3/nl_langinfo.3 +++ b/man3/nl_langinfo.3 @@ -167,6 +167,12 @@ or .PP POSIX specifies that the application may not modify the string returned by these functions. +.PP +Codeset for en_US defaults to ISO-8859-1 (Latin-1). +The Latin-1 default has historical reasons, +since all Unix systems originally used only 8-bit character encoding. +For more information about ISO-8859-1 see +.BR charsets (7). .SH ATTRIBUTES For an explanation of the terms used in this section, see .BR attributes (7). diff --git a/man7/charsets.7 b/man7/charsets.7 index 5f91784..9de88fe 100644 --- a/man7/charsets.7 +++ b/man7/charsets.7 @@ -29,6 +29,8 @@ ASCII, GB 2312, ISO 8859, JIS, KOI8-R, KS, and Unicode. The primary emphasis is on character sets that were actually used by locale character sets, not the myriad others that could be found in data from other systems. +.LP +The recommended encoding in all settings and locales is UTF-8. .SS ASCII ASCII (American Standard Code For Information Interchange) is the original 7-bit character set, originally designed for American English.