From 1ff370b0edc323dc8743c458fb10932bc120abf3 Mon Sep 17 00:00:00 2001 From: =?utf8?q?Fran=C3=A7ois=20Pinard?= Date: Fri, 14 Mar 2008 18:48:13 -0400 Subject: [PATCH] Manual: omitted charsets. THANKS adjusted --- THANKS | 3 + doc/ChangeLog | 5 ++ doc/recode.info | 201 +++++++++++++++++++++++++----------------------- doc/recode.texi | 21 +++-- src/ChangeLog | 12 +-- 5 files changed, 134 insertions(+), 108 deletions(-) diff --git a/THANKS b/THANKS index 55aa2a8..cfbf0e0 100644 --- a/THANKS +++ b/THANKS @@ -17,6 +17,7 @@ Alexandre Oliva oliva@dcc.unicamp.br http://www.dcc.unicamp.br/~oliva Andreas Dietrich quasi@mathematik.uni-marburg.de Andreas Frick afr@aifb.uni-karlsruhe.de +Andreas Kunert akunert@main-rheiner.de Andreas Mueller afm@othello.ch http://www.othello.ch Andreas Schwab schwab@suse.de @@ -185,7 +186,9 @@ Noèl Köthe noel@debian.org Olivier Lecarme Olivier.Lecarme@unice.fr Olivier Tharan olive@laria.u-picardie.fr http://www.laria.u-picardie.fr/~olive/ +Pascal Gourdel gourdel@univ-paris1.fr Patrick Andries pandries@iti.qc.ca +Pavel Forsite forsite@spamtest.ru Pawel Krawczyk kravietz@ceti.pl http://ceti.pl/~kravietz/ Per Andersson perand@stacken.kth.se diff --git a/doc/ChangeLog b/doc/ChangeLog index cac0c65..77c1313 100644 --- a/doc/ChangeLog +++ b/doc/ChangeLog @@ -1,3 +1,8 @@ +2008-03-14 François Pinard + + * recode.texi: More about omitted charsets. + Reported by Pascal Gourdel. + 2008-03-13 François Pinard * recode.texi: Clarify include files and configuration matters. diff --git a/doc/recode.info b/doc/recode.info index 1bf881c..ebda372 100644 --- a/doc/recode.info +++ b/doc/recode.info @@ -182,9 +182,13 @@ resort to many other email conversions, yet more rarely than the frequent cases above. It _seems_ like this should be doable using Recode. However, when - I try something like `grecode mac macfile.txt' I get nothing + I try something like `recode mac macfile.txt' I get nothing out--no error, no output, nothing. + *Note_* For the following discussion to be true, you should have +something like `export LANG=fr_FR.ISO-8859-1' in your environment, the +important bit here being the specification of an preferred charset. + Presuming you are using some recent version of Recode, the command: recode mac macfile.txt @@ -652,10 +656,13 @@ BEFORE charset.(2) When a charset name is omitted or left empty, the value of the `DEFAULT_CHARSET' variable in the environment is used instead. If this -variable is not defined, the Recode library uses the current locale's -encoding. On POSIX compliant systems, this depends on the first -non-empty value among the environment variables LC_ALL, LC_CTYPE, LANG, -and can be determined through the command `locale charmap'. +variable is not defined, the Recode library (from version 3.6 and up) +uses the current locale's encoding. On POSIX compliant systems, this +depends on the first non-empty value among the environment variables +LC_ALL, LC_CTYPE, LANG, and can be determined through the command +`locale charmap'. If the current locale's encoding may not be +resolved, then Recode presumes `ASCII' (or, for Recode 3.6 only: +`CHAR'). If the charset name is omitted but followed by surfaces, the surfaces then qualify the usual or default charset. For example, the request @@ -5457,7 +5464,8 @@ by Recode, and their aliases. * Apple-Mac: Apple-Mac. (line 6) * arabic: Tabular. (line 464) * arabic7: Tabular. (line 74) -* ASCII: Tabular. (line 69) +* ASCII <1>: Tabular. (line 69) +* ASCII: Requests. (line 104) * ASCII, an alias for the ANSI_X3.4-1968 charset: ASCII. (line 6) * ASCII-BS, and its aliases: ASCII-BS. (line 6) * ASMO-708: Tabular. (line 464) @@ -5475,6 +5483,7 @@ by Recode, and their aliases. * BS_viewdata, aliases and source: Tabular. (line 82) * ca: Tabular. (line 118) * CDC-NOS, and its aliases: CDC-NOS. (line 6) +* CHAR: Requests. (line 104) * cl: End lines. (line 37) * cn: Tabular. (line 211) * combined-UCS-2: UCS-2. (line 34) @@ -6114,95 +6123,95 @@ by Recode, and their aliases. Tag Table: Node: Top1148 Node: Tutorial5608 -Node: Introduction9836 -Node: Charset overview14070 -Node: Surface overview15875 -Node: Contributing17343 -Ref: Contributing-Footnote-119577 -Node: Invoking recode19711 -Node: Synopsis20666 -Ref: Synopsis-Footnote-123106 -Node: Requests23403 -Ref: Requests-Footnote-129293 -Ref: Requests-Footnote-229360 -Ref: Requests-Footnote-329538 -Node: Listings29997 -Ref: Listings-Footnote-141280 -Node: Recoding41603 -Node: Reversibility44424 -Ref: Reversibility-Footnote-152879 -Node: Sequencing53016 -Node: Mixed55460 -Node: Emacs58828 -Node: Debugging59862 -Node: Library64132 -Node: Outer level67092 -Node: Request level72441 -Node: Task level83433 -Node: Charset level94063 -Node: Errors94905 -Ref: Errors-Footnote-199751 -Ref: Errors-Footnote-299865 -Node: Universal100226 -Ref: Universal-Footnote-1103338 -Ref: Universal-Footnote-2103404 -Node: UCS-2103617 -Node: UCS-4106143 -Node: UTF-7106683 -Node: UTF-8107278 -Node: UTF-16111583 -Node: count-characters112731 -Node: dump-with-names113402 -Node: iconv115951 -Node: Tabular119382 -Node: ASCII misc141595 -Node: ASCII141961 -Node: ISO 8859142777 -Node: ASCII-BS145071 -Node: flat146908 -Node: IBM and MS147579 -Node: EBCDIC148123 -Node: IBM-PC150219 -Ref: IBM-PC-Footnote-1152333 -Node: Icon-QNX152492 -Node: CDC152917 -Node: Display Code154598 -Ref: Display Code-Footnote-1156879 -Node: CDC-NOS157084 -Node: Bang-Bang159046 -Node: Micros160975 -Node: Apple-Mac161358 -Node: AtariST163392 -Node: Miscellaneous164378 -Node: HTML165297 -Node: LaTeX171286 -Node: Texinfo172060 -Node: Vietnamese172832 -Node: African173808 -Node: Others175158 -Node: Java176611 -Node: Texte177278 -Ref: Texte-Footnote-1181826 -Ref: Texte-Footnote-2181906 -Ref: Texte-Footnote-3182381 -Node: Mule182478 -Ref: Mule-Footnote-1184259 -Node: Surfaces184778 -Ref: Surfaces-Footnote-1188197 -Node: Permutations188301 -Node: End lines189142 -Node: MIME191343 -Node: Dump192530 -Node: Test196700 -Node: Internals199178 -Node: Main flow200406 -Node: New charsets203509 -Node: New surfaces208047 -Node: Design208773 -Ref: Design-Footnote-1217939 -Node: Concept Index218043 -Node: Option Index233516 -Node: Library Index236369 -Node: Charset and Surface Index240944 +Node: Introduction10043 +Node: Charset overview14277 +Node: Surface overview16082 +Node: Contributing17550 +Ref: Contributing-Footnote-119784 +Node: Invoking recode19918 +Node: Synopsis20873 +Ref: Synopsis-Footnote-123313 +Node: Requests23610 +Ref: Requests-Footnote-129645 +Ref: Requests-Footnote-229712 +Ref: Requests-Footnote-329890 +Node: Listings30349 +Ref: Listings-Footnote-141632 +Node: Recoding41955 +Node: Reversibility44776 +Ref: Reversibility-Footnote-153231 +Node: Sequencing53368 +Node: Mixed55812 +Node: Emacs59180 +Node: Debugging60214 +Node: Library64484 +Node: Outer level67444 +Node: Request level72793 +Node: Task level83785 +Node: Charset level94415 +Node: Errors95257 +Ref: Errors-Footnote-1100103 +Ref: Errors-Footnote-2100217 +Node: Universal100578 +Ref: Universal-Footnote-1103690 +Ref: Universal-Footnote-2103756 +Node: UCS-2103969 +Node: UCS-4106495 +Node: UTF-7107035 +Node: UTF-8107630 +Node: UTF-16111935 +Node: count-characters113083 +Node: dump-with-names113754 +Node: iconv116303 +Node: Tabular119734 +Node: ASCII misc141947 +Node: ASCII142313 +Node: ISO 8859143129 +Node: ASCII-BS145423 +Node: flat147260 +Node: IBM and MS147931 +Node: EBCDIC148475 +Node: IBM-PC150571 +Ref: IBM-PC-Footnote-1152685 +Node: Icon-QNX152844 +Node: CDC153269 +Node: Display Code154950 +Ref: Display Code-Footnote-1157231 +Node: CDC-NOS157436 +Node: Bang-Bang159398 +Node: Micros161327 +Node: Apple-Mac161710 +Node: AtariST163744 +Node: Miscellaneous164730 +Node: HTML165649 +Node: LaTeX171638 +Node: Texinfo172412 +Node: Vietnamese173184 +Node: African174160 +Node: Others175510 +Node: Java176963 +Node: Texte177630 +Ref: Texte-Footnote-1182178 +Ref: Texte-Footnote-2182258 +Ref: Texte-Footnote-3182733 +Node: Mule182830 +Ref: Mule-Footnote-1184611 +Node: Surfaces185130 +Ref: Surfaces-Footnote-1188549 +Node: Permutations188653 +Node: End lines189494 +Node: MIME191695 +Node: Dump192882 +Node: Test197052 +Node: Internals199530 +Node: Main flow200758 +Node: New charsets203861 +Node: New surfaces208399 +Node: Design209125 +Ref: Design-Footnote-1218291 +Node: Concept Index218395 +Node: Option Index233868 +Node: Library Index236721 +Node: Charset and Surface Index241296  End Tag Table diff --git a/doc/recode.texi b/doc/recode.texi index c410293..48ce615 100644 --- a/doc/recode.texi +++ b/doc/recode.texi @@ -231,10 +231,14 @@ other email conversions, yet more rarely than the frequent cases above. @quotation It @emph{seems} like this should be doable using Recode. However, -when I try something like @samp{grecode mac macfile.txt} I get nothing +when I try something like @samp{recode mac macfile.txt} I get nothing out---no error, no output, nothing. @end quotation +@strong{Note:} For the following discussion to be true, you should have +something like @samp{export LANG=fr_FR.ISO-8859-1} in your environment, +the important bit here being the specification of an preferred charset. + Presuming you are using some recent version of Recode, the command: @example @@ -758,12 +762,17 @@ not very useful, as the recoding reduces to a mere copy in that case.} @cindex default charset @cindex charset, default @vindex DEFAULT_CHARSET +@tindex ASCII +@tindex CHAR When a charset name is omitted or left empty, the value of the -@code{DEFAULT_CHARSET} variable in the environment is used instead. If this -variable is not defined, the Recode library uses the current locale's -encoding. On POSIX compliant systems, this depends on the first non-empty -value among the environment variables LC_ALL, LC_CTYPE, LANG, and can be -determined through the command @samp{locale charmap}. +@code{DEFAULT_CHARSET} variable in the environment is used instead. +If this variable is not defined, the Recode library (from version +3.6 and up) uses the current locale's encoding. On POSIX compliant +systems, this depends on the first non-empty value among the environment +variables LC_ALL, LC_CTYPE, LANG, and can be determined through the +command @samp{locale charmap}. If the current locale's encoding may not +be resolved, then Recode presumes @code{ASCII} (or, for Recode +3.6 only: @code{CHAR}). If the charset name is omitted but followed by surfaces, the surfaces then qualify the usual or default charset. For example, the request diff --git a/src/ChangeLog b/src/ChangeLog index b5ffa92..3ca2770 100644 --- a/src/ChangeLog +++ b/src/ChangeLog @@ -41,12 +41,12 @@ Solve an old bug about loosing a few characters every 2048. * iconv.c (wrapped_transform): Rewritten. (transform_with_iconv): Simplified, use only one iconv_t. - Reported by Alexander Kurz, Andrew Green, Daniel Pfeiffer, - David Necas, Éric Bischoff, Frederik Fouvry, Guillaume Rumeau, - Jean-Christophe Guillain, John Gorman, Luc Maisonobe, - Marcin Owsiany, Martin Trautmann, Mikko Harjula, Peter Much, - Petr Pajas, Régis Martin, Stephen Walkauskas, Urban Mueller - dand Volker Wysk. + Reported by Alexander Kurz, Andreas Kunert, Andrew Green, Daniel + Pfeiffer, David Necas, Éric Bischoff, Frederik Fouvry, Guillaume + Rumeau, Jean-Christophe Guillain, John Gorman, Luc Maisonobe, + Marcin Owsiany, Martin Trautmann, Mikko Harjula, Pavel Forsite, + Peter Much, Petr Pajas, Régis Martin, Stephen Walkauskas, + Urban Mueller and Volker Wysk. * Makefile.am (EXTRA_DIST): Add stamp-strip. -- 2.40.0