From: Fred Drake Date: Tue, 12 Nov 2002 23:12:54 +0000 (+0000) Subject: Clarified meaning of \w and \W with respect to the UNICODE and LOCALE flags. X-Git-Tag: v2.3c1~3465 X-Git-Url: https://granicus.if.org/sourcecode?a=commitdiff_plain;h=3d03968c75f834dcd9d58b4aaf7475d9d2d72d43;p=python Clarified meaning of \w and \W with respect to the UNICODE and LOCALE flags. Closes SF bug #635595. --- diff --git a/Doc/lib/libre.tex b/Doc/lib/libre.tex index b8aeb40c8b..ddeee2afed 100644 --- a/Doc/lib/libre.tex +++ b/Doc/lib/libre.tex @@ -347,10 +347,10 @@ equivalent to the set \regexp{[ \e t\e n\e r\e f\e v]}. equivalent to the set \regexp{[\textasciicircum\ \e t\e n\e r\e f\e v]}. \item[\code{\e w}]When the \constant{LOCALE} and \constant{UNICODE} -flags are not specified, -matches any alphanumeric character; this is equivalent to the set +flags are not specified, matches any alphanumeric character and the +underscore; this is equivalent to the set \regexp{[a-zA-Z0-9_]}. With \constant{LOCALE}, it will match the set -\regexp{[0-9_]} plus whatever characters are defined as letters for +\regexp{[0-9_]} plus whatever characters are defined as alphanumeric for the current locale. If \constant{UNICODE} is set, this will match the characters \regexp{[0-9_]} plus whatever is classified as alphanumeric in the Unicode character properties database. @@ -359,9 +359,9 @@ in the Unicode character properties database. flags are not specified, matches any non-alphanumeric character; this is equivalent to the set \regexp{[{\textasciicircum}a-zA-Z0-9_]}. With \constant{LOCALE}, it will match any character not in the set -\regexp{[0-9_]}, and not defined as a letter for the current locale. +\regexp{[0-9_]}, and not defined as alphanumeric for the current locale. If \constant{UNICODE} is set, this will match anything other than -\regexp{[0-9_]} and characters marked at alphanumeric in the Unicode +\regexp{[0-9_]} and characters marked as alphanumeric in the Unicode character properties database. \item[\code{\e Z}]Matches only at the end of the string.