pkgsrc/converters/recode/DESCR
kleink d519f87dbc Better late than never, update to recode-3.6. Changes since 3.5 include:
.* General changes
. + The recode manual is now indexed, by charset, by concept, etc.
. + Program messages are also available in Greek, Gallicean and Italian.
. + Bruno Haible's nice portable iconv library has been integrated.
. + RFC 1345 tables and French character names have been updated.
. + The Texinfo charset has been refreshed, and made reversible.

.* New charsets (most from libiconv)

. + Japanese
    EUC-JP (csEUCPkdFmtJapanese, EUC_JP,
      Extended_UNIX_Code_Packed_Format_for_Japanese);
    ISO-2022-JP (csISO2022JP); ISO-2022-JP-1; ISO-2022-JP-2 (csISO2022JP2);
    JIS_C6220-1969-ro (csISO14JISC6220ro, ISO646-JP, iso-ir-14, jp);
    JIS_X0201 (csHalfWidthKatakana, JIS0201, JISX0201-1976, JISX0201.1976-0,
       X0201);
    JIS_X0208 (csISO87JISX0208, ISO-IR-87, JIS0208, JIS_X0208.1983-0,
       JIS_X0208.1983-1, JIS_X0208-1990-0, JIS_X0208.1983-1, X0208);
    JIS_X0212 (csISO159JISX02121990, ISO-IR-159, JIS0212, JIS_X0212.1990-0,
      JIS_X0212-1990, X0212);
    SJIS (csShiftJIS, MS_KANJI, SHIFT-JIS).

. + Chinese
    BIG5 (BIG-5, BIG-FIVE, BIGFIVE, CN-BIG5 csBig5); BIG5HKSCS;
    EUC-CN (CN-GB, csGB2312, EUC_CN, GB2312); EUC-TW (csEUCTW, EUC_TW);
    GB18030; HZ (HZ-GB-2312); ISO-2022-CN (csISO2022CN); ISO-2022-CN-EXT;
    GB_1988-80 (cn, csISO57GB1988, ISO646-CN, iso-ir-57);
    GB_2312-80 (CHINESE, csISO58GB231280, GB2312.1980-0, ISO-IR-58);
    ISO-IR-165 (CN-GB-ISOIR165).

. + Korean
    JOHAB (CP1361); EUC-KR (csEUCKR, EUC_KR); GBK (CP936);
    ISO-2022-KR (csISO2022KR);
    KSC_5601 (CP949, csKSC56011987, ISO-IR-149, KOREAN, KSC5601.1987-0,
      KS_C_5601-1987, KS_C_5601-1989, KSX1001:1992).

. + Vietnamese (independently of libiconv)
    TCVN; VIQR; VISCII; VNI; VPS.

. + Other languages
    ARMSCII-8; Georgian-Academy; Georgian-PS; WINDOWS-874 (CP874);
    MuleLao-1; CP1133 (IBM-CP1133); CP1258 (WINDOWS-1258);
    TIS-620 (ISO-IR-166, TIS620, TIS620.2529-1, TIS620-0, TIS620.2533-0,
      TIS620.2533-1).

. + Apple specifics
    MacArabic; MacCentralEurope; MacCroatian; MacCyrillic; MacGreek;
    MacHebrew; MacIceland; MacRomania; MacThai; MacTurkish; MacUkraine

. + Unicode
    JAVA; UCS-2-INTERNAL; UCS-2LE (UnicodeLITTLE); UCS-2-SWAPPED; UCS-4BE;
    UCS-4-INTERNAL; UCS-4LE; UCS-4-SWAPPED; UTF-16BE; UTF-16LE.

. + Others
    CP932; CP949 (UHC); CP950; CP866 (866, csIBM866, IBM866).
    ISO-8859-16 (ISO-IR-226, ISO_8859-16:2000).

. + Recode internal
    :libiconv: (:)   [so option -x: avoids going through libiconv]

.* New aliases (from libiconv) [list to be revised]
   csASCII (for ANSI_X3.4-1968); csHPRoman8 (for hp-roman8);
   csISOLatin1 (for ISO-8859-1); csISOLatin2 (for ISO-8859-2);
   csISOLatin3 (for ISO-8859-3); csISOLatin4 (for ISO-8859-4);
   csISOLatin5 (for ISO-8859-9);
   csISOLatin6 and ISO_8859-10:1992 (for ISO-8859-10);
   csISOLatinArabic (for ISO-8859-6); csISOLatinCyrillic (for ISO-8859-5);
   csISOLatinGreek (for ISO-8859-7); csISOLatinHebrew (for ISO-8859-8);
   csKOI8R (for KOI8-R); csPC850Multilingual (for IBM850);
   csUCS4 (for ISO-10646-UCS-4);
   csUnicode, csUnicode11, UCS-2BE, UnicodeBIG (for ISO-10646-UCS-2);
   csUnicode11UTF7 (for UNICODE-1-1-UTF-7);
   csVISCII and VISCII1.1-1 (for VISCII);
   ISO-IR-179 (for ISO-8859-13); csMacintosh and MacRoman (for macintosh);
   TCVN5712-1, TCVN5712-1:1993 and TCVN-5712 (for TCVN).

.* New surfaces
   tree (experimental).
2002-01-11 13:20:22 +00:00

6 lines
360 B
Text

The `recode' library converts files between character sets and usages.
It recognises or produces more than 300 different character sets and
transliterates files between almost any pair. When exact transliteration
are not possible, it gets rid of offending characters or falls back
on approximations. The `recode' program is a handy front-end to
the library.