]> granicus.if.org Git - python/commit
This patch changes the behaviour of the UTF-16 codec family. Only the
authorMarc-André Lemburg <mal@egenix.com>
Mon, 21 May 2001 20:30:15 +0000 (20:30 +0000)
committerMarc-André Lemburg <mal@egenix.com>
Mon, 21 May 2001 20:30:15 +0000 (20:30 +0000)
commit489b56e04480b8ca3f2d1676265e67c65bae788d
treea148a1f74890d004f6434a77eb14185b76c73c77
parentf52d27e52d289b99837b4555fb3f757f2c89f4ad
This patch changes the behaviour of the UTF-16 codec family. Only the
UTF-16 codec will now interpret and remove a *leading* BOM mark. Sub-
sequent BOM characters are no longer interpreted and removed.
UTF-16-LE and -BE pass through all BOM mark characters.

These changes should get the UTF-16 codec more in line with what
the Unicode FAQ recommends w/r to BOM marks.
Include/unicodeobject.h
Objects/unicodeobject.c