From b683fd341e27a953a7c8ab8d101eebdd53823ff5 Mon Sep 17 00:00:00 2001 From: Bob Stayton Date: Sun, 14 May 2006 02:15:53 +0000 Subject: [PATCH] Support multiple indexing methods for different languages. --- xsl/params/index.method.xml | 134 ++++++++++++++++++++++++++++++++++++ 1 file changed, 134 insertions(+) create mode 100644 xsl/params/index.method.xml diff --git a/xsl/params/index.method.xml b/xsl/params/index.method.xml new file mode 100644 index 000000000..ab32727e8 --- /dev/null +++ b/xsl/params/index.method.xml @@ -0,0 +1,134 @@ + + +index.method +string + + +index.method +Select method used to group index entries in an index + + + + + + + + +Description + +This parameter lets you select which method should be +used to sort and group index entries in an index. +Indexes in languages that have accented characters typically +sort together accented words and unaccented words. +So Á (A acute) would sort together +with A, so both would appear in the A +section of the index. + + +The default indexing method does not sort accented characters this way. +Words that start with an accented character will instead appear in the +Symbols section of the index. +As such, the default method is only suitable for +English and other unaccented languages. +The other indexing methods require extensions of one type or +another, which is why there are not used by default. + +The three choices for indexing method are: + + +english + + +(default) Sort and groups words based only on the Latin alphabet. +Accented words and words in non-Latin alphabets will be +put in the Symbols section of the index. + + + + +kosek + + +Sort and groups words based on letter groups configured in +the DocBook locale file for the given language. +For example, the French locale file is common/fr.xml. +This method requires support for EXSLT extensions in +the XSL processor. It also requires support for using +user-defined functions in xsl:key (xsltproc does not). + +This method is suitable for any language for which you can +list all the individual characters that should appear +in each letter group in an index. +It is probably not practical to use it for ideographic languages +such as Chinese that have hundreds or thousands of characters. + +To use this method, your customization must set this +parameter and import an additional stylesheet module: + +kosek +]]> + +The stylesheet module defines functions and adds an xsl:key +used by the method. + + + + +kimber + + +This method uses extensions to the Saxon processor to implement +sophisticated indexing processes. It uses its own +configuration file, which can include information for any number of +languages. Each language's configuration can group +words using either an +enumerated method similar to the kosek method, or it can +designate the first character in each group when viewed in sort order. +The latter configuration is useful for ideographic languages +such as Chinese, Japanese, and Korean. +You can also define your own collation algorithms and where you +want Latin-alphabet words sorted. + + +For a whitepaper describing the extensions, see: +. + + + +To download the extension library, see +. + + + + +To use this method, you must: + + + +Use Saxon as your XSLT processor. + + + +Install and configure the Innodata Isogen library, using +the documentation that comes with it. + + + +Set this parameter's value to kimber. + + + +Import the index extensions stylesheet module +common/autoidx-ng.xsl into your +customization. + + + + + + + + + + + -- 2.40.0