]> granicus.if.org Git - icu/log
icu
3 years agoICU-21710 Additional clean up after removing BOYER_MOORE code from usearch.cpp
Jeff Genovy [Fri, 27 Aug 2021 02:23:16 +0000 (19:23 -0700)]
ICU-21710 Additional clean up after removing BOYER_MOORE code from usearch.cpp

Changes:
- We can completely remove the shift tables and related fields from
  data structs.

- Creation of UStringSearch objects should be faster now,
  as it doesn't waste time computing the unused shift tables.

- The sizeof(UStringSearch) is decreased from 5240 to 3192, so
  this should help to reduce memory for applications that create many string search objects.

Note regarding the comments on initialize(). It actually does not set illegal argument error if pattern is all ignoreables. Added a test case for it.

3 years agoICU-21555 Fix typos in repo from letter d to i
Erik Torres [Fri, 27 Aug 2021 01:16:53 +0000 (01:16 +0000)]
ICU-21555 Fix typos in repo from letter d to i

Fix typos in repo from letter d to i

3 years agoICU-21617 Remove work around of uninstalling GCC11 in CI build
Erik Torres [Fri, 27 Aug 2021 00:20:33 +0000 (00:20 +0000)]
ICU-21617 Remove work around of uninstalling GCC11 in CI build

Remove work around of uninstalling GCC11 in CI build

3 years agoICU-21130 Update cpp14 to cpp17 in CI builds
Erik Torres [Thu, 26 Aug 2021 21:46:57 +0000 (21:46 +0000)]
ICU-21130 Update cpp14 to cpp17 in CI builds

See #1827

3 years agoICU-21598 Add CI builds that exercises the data filtering on windows
Erik Torres [Thu, 26 Aug 2021 20:52:58 +0000 (20:52 +0000)]
ICU-21598 Add CI builds that exercises the data filtering on windows

See #1820

3 years agoICU-20973 Use standard keywords true & false to initialize type bool.
Fredrik Roubert [Thu, 26 Aug 2021 13:15:30 +0000 (15:15 +0200)]
ICU-20973 Use standard keywords true & false to initialize type bool.

Now when all equality operators return standard bool (commit 633438f),
it no longer makes any sense to use the ICU4C constants TRUE & FALSE
or local variables of type UBool for their return value.

3 years agoICU-21710 Remove BOYER_MOORE dead code from usearch.cpp
Jeff Genovy [Thu, 12 Aug 2021 20:17:52 +0000 (13:17 -0700)]
ICU-21710 Remove BOYER_MOORE dead code from usearch.cpp

3 years agoICU-21705 ICU-21706 Fix crash if ICU's default locale has BCP47 Unicode Extensions...
Jeff Genovy [Tue, 10 Aug 2021 01:37:22 +0000 (18:37 -0700)]
ICU-21705 ICU-21706 Fix crash if ICU's default locale has BCP47 Unicode Extensions, and fix ures_openDirect crash with NULL locale ID.

Add test case for ures_openDirect with NULL locale ID.

3 years agoICU-20973 Rewrite polymorphic CacheKeyBase equality operators for C++20.
Fredrik Roubert [Wed, 25 Aug 2021 16:11:20 +0000 (18:11 +0200)]
ICU-20973 Rewrite polymorphic CacheKeyBase equality operators for C++20.

The existing polymorphic equality operators that use different types for
the `this` and `other` objects are ambiguous with C++20 resolution rules
that require equality for reversed arguments.

In order to resolve that, while also possibly making the implementation
somewhat simpler overall, the implementation classes (LocaleCacheKey
and DateFmtBestPatternKey) now get normal (non-polymorphic) equality
operators that are trivially non-ambiguous (and as a bonus also don't
need any type casts), while the dynamic type checking logic is moved
into protected helper functions, which in the end are invoked
(without any ambiguity) by friend operators in the base class.

This way, all equality testing of cache key objects ends up taking one
of these two possible paths:

1. Both sides of the equality operator are of the same implementation
   type (ie. LocaleCacheKey or DateFmtBestPatternKey):

   The type specific equality operator is called directly, comparing the
   relevant attributes of the two objects directly.

2. The two sides of the equality operator are either of different types
   or of some base class type:

   The friend equality operators of CacheKeyBase call the virtual helper
   function to figure out whether the two objects are actually of the
   same type and if they are and this type is an implementation type
   then does the necessary type cast to get to 1.

3 years agoICU-21215 Add default constructor to FormattedNumberRange cldr/2021-08-25
Shane F. Carr [Fri, 20 Aug 2021 20:04:37 +0000 (15:04 -0500)]
ICU-21215 Add default constructor to FormattedNumberRange

3 years agoICU-21590 Add skeleton syntax for integer-width-trunc
Shane F. Carr [Fri, 20 Aug 2021 23:01:26 +0000 (18:01 -0500)]
ICU-21590 Add skeleton syntax for integer-width-trunc

3 years agoICU-21648 limit backslash-uhhhh escapes to ASCII hex digits
Markus Scherer [Tue, 24 Aug 2021 19:42:40 +0000 (12:42 -0700)]
ICU-21648 limit backslash-uhhhh escapes to ASCII hex digits

3 years agoICU-21663 UnicodeSetIterator: make protected members private
Markus Scherer [Tue, 24 Aug 2021 21:12:55 +0000 (14:12 -0700)]
ICU-21663 UnicodeSetIterator: make protected members private

3 years agoICU-21708 Update ant version in cldr-to-icu tool
Yoshito Umaoka [Mon, 23 Aug 2021 22:24:49 +0000 (18:24 -0400)]
ICU-21708 Update ant version in cldr-to-icu tool

dependabot reported ant 1.10.9 has a security problem and suggested to update to 1.10.11.

3 years agoICU-21655 widen Java character APIs from char to int arguments
Markus Scherer [Tue, 24 Aug 2021 01:36:57 +0000 (18:36 -0700)]
ICU-21655 widen Java character APIs from char to int arguments

3 years agoICU-21581 Update ICU4J LICENSE file
Yoshito Umaoka [Wed, 18 Aug 2021 16:34:58 +0000 (12:34 -0400)]
ICU-21581 Update ICU4J LICENSE file

3 years agoICU-21693 Fix FormattedNumber::toDecimalNumber on 0
Shane F. Carr [Sat, 21 Aug 2021 03:30:31 +0000 (22:30 -0500)]
ICU-21693 Fix FormattedNumber::toDecimalNumber on 0

3 years agoICU-21635 Unicode 14 data files 20210820, line break LB30b.2
Markus Scherer [Mon, 23 Aug 2021 03:24:02 +0000 (03:24 +0000)]
ICU-21635 Unicode 14 data files 20210820, line break LB30b.2

See #1807

3 years agoICU-21584 Fixed Java7 build error in earlier fix.
Rich Gillam [Fri, 20 Aug 2021 17:28:02 +0000 (10:28 -0700)]
ICU-21584 Fixed Java7 build error in earlier fix.

3 years agoICU-21581 lingering exhaustive fail, need to separately skip sd_Deva month name exemp...
Peter Edberg [Thu, 19 Aug 2021 23:00:54 +0000 (16:00 -0700)]
ICU-21581 lingering exhaustive fail, need to separately skip sd_Deva month name exemplar test

3 years agoICU-21581 integrate CLDR release-40-alpha1 to ICU trunk
Peter Edberg [Thu, 19 Aug 2021 20:28:43 +0000 (13:28 -0700)]
ICU-21581 integrate CLDR release-40-alpha1 to ICU trunk

3 years agoICU-21358 Use sign position to format approximate numbers
Shane F. Carr [Wed, 17 Mar 2021 17:45:58 +0000 (17:45 +0000)]
ICU-21358 Use sign position to format approximate numbers

See #1635

3 years agoICU-21581 integrate CLDR 40a0 to ICU trunk
Peter Edberg [Wed, 18 Aug 2021 23:51:53 +0000 (16:51 -0700)]
ICU-21581 integrate CLDR 40a0 to ICU trunk

3 years agoICU-21244 Fix the Valgrind status badge in the top-level Readme.
Jeff Genovy [Wed, 18 Aug 2021 20:07:55 +0000 (13:07 -0700)]
ICU-21244 Fix the Valgrind status badge in the top-level Readme.

3 years agoICU-20973 Delete unnecessary deprecated TimeUnitFormat::operator!=().
Fredrik Roubert [Tue, 17 Aug 2021 13:28:19 +0000 (15:28 +0200)]
ICU-20973 Delete unnecessary deprecated TimeUnitFormat::operator!=().

An operator!=() is already defined by a base class so this class doesn't
need to define it again and not doing so avoids a C++20 ambiguity.

This simplifies the code.

3 years agoICU-21729 Fix setLanguageTag doc matching Java
Frank Tang [Wed, 18 Aug 2021 19:00:03 +0000 (12:00 -0700)]
ICU-21729 Fix setLanguageTag doc matching Java

3 years agoICU-21202 Fixed error in alias resolution in uresbund.cpp.
Rich Gillam [Mon, 19 Jul 2021 23:39:34 +0000 (16:39 -0700)]
ICU-21202 Fixed error in alias resolution in uresbund.cpp.

3 years agoICU-20973 Add C++20 build bots to Azure CI: Clang-13, GCC-11, MSVC with std:c++latest
Jeff Genovy [Tue, 17 Aug 2021 00:31:44 +0000 (17:31 -0700)]
ICU-20973 Add C++20 build bots to Azure CI: Clang-13, GCC-11, MSVC with std:c++latest

3 years agoICU-21581 BRS#18 Promote @draft ICU4J API elements from version 68 to @stable
Craig Cornelius [Tue, 17 Aug 2021 00:10:46 +0000 (00:10 +0000)]
ICU-21581 BRS#18 Promote @draft ICU4J API elements from version 68 to @stable

See #1788

3 years agoICU-20973 Update the Coding Guidelines with bool equality operators.
Fredrik Roubert [Tue, 10 Aug 2021 20:30:07 +0000 (22:30 +0200)]
ICU-20973 Update the Coding Guidelines with bool equality operators.

3 years agoICU-20973 Update configure files from configure.ac using autoreconf.
Fredrik Roubert [Wed, 4 Aug 2021 16:10:54 +0000 (18:10 +0200)]
ICU-20973 Update configure files from configure.ac using autoreconf.

3 years agoICU-20973 Use the Clang -Wno-ambiguous-reversed-operator flag.
Fredrik Roubert [Wed, 4 Aug 2021 15:27:52 +0000 (17:27 +0200)]
ICU-20973 Use the Clang -Wno-ambiguous-reversed-operator flag.

3 years agoICU-20973 Manually resolve C++20 reversed argument order ambiguity.
Fredrik Roubert [Wed, 4 Aug 2021 14:25:34 +0000 (16:25 +0200)]
ICU-20973 Manually resolve C++20 reversed argument order ambiguity.

3 years agoICU-20973 Change all equality operator return types from UBool to bool.
Fredrik Roubert [Wed, 16 Sep 2020 14:50:15 +0000 (16:50 +0200)]
ICU-20973 Change all equality operator return types from UBool to bool.

3 years agoICU-21064 Fix declspec macro conflict on clang.
Gregorio Litenstein [Thu, 5 Aug 2021 06:02:32 +0000 (02:02 -0400)]
ICU-21064 Fix declspec macro conflict on clang.

3 years agoICU-21707 Fix LocaleBuilder assumption that the default locale doesn't have any BCP47...
Jeff Genovy [Wed, 11 Aug 2021 00:35:34 +0000 (17:35 -0700)]
ICU-21707 Fix LocaleBuilder assumption that the default locale doesn't have any BCP47 extension tags.

Add test case for LocaleBuilder with default locale with extensions.

Use Locale::getRoot().clone() instead of new Locale();

Add CI build bot with LANG that has extension tags

3 years agoICU-21694 ubidi.h sample code would not compile cldr/2021-08-11
Myles C. Maxfield [Fri, 30 Jul 2021 04:49:55 +0000 (21:49 -0700)]
ICU-21694 ubidi.h sample code would not compile

3 years agoICU-21639 Added an internal utility class to streamline preflighting and heap-allocat...
Rich Gillam [Thu, 8 Jul 2021 00:35:00 +0000 (17:35 -0700)]
ICU-21639 Added an internal utility class to streamline preflighting and heap-allocating a char buffer for a locale ID
and changed several internal methods in ULocale to use it, so that they work correctly on locale IDs that are longer
than ULOC_FULLNAME_CAPACITY.

3 years agoICU-21581 BRS 70 front-load task scrub closed issues: replace ticket# ICU-11234,
gnrunge [Tue, 27 Jul 2021 20:43:37 +0000 (13:43 -0700)]
ICU-21581 BRS 70 front-load task scrub closed issues: replace ticket# ICU-11234,
which was closed as invalid, with its successor ticket ICU-21257.

3 years agoICU-21424 Migrate bldicures.py to Python 3
gnrunge [Thu, 29 Jul 2021 22:21:26 +0000 (15:21 -0700)]
ICU-21424 Migrate bldicures.py to Python 3

3 years agoICU-21581 BRS 70 front-load task: scrub logKnownIssue for ticket CLDR-14477,
gnrunge [Tue, 27 Jul 2021 20:18:47 +0000 (13:18 -0700)]
ICU-21581 BRS 70 front-load task: scrub logKnownIssue for ticket CLDR-14477,
which is completed.

3 years agoICU-21662 Rename UVector::addElement().
Andy Heninger [Tue, 27 Jul 2021 22:51:34 +0000 (15:51 -0700)]
ICU-21662 Rename UVector::addElement().

This is the first step towards improving the error handling and out-of-memory
behavior of UVector::addElement(). A followup PR will add back a new addElement()
with corrected error handling, then additional followups will switch call sites
from the original (renamed) function to the new addElement().

This commit includes no logic or behavior changes; it only renames the existing functions.

3 years agoICU-21688 escape angle bracket in non-HTML content
DungSaga [Thu, 22 Jul 2021 02:42:02 +0000 (09:42 +0700)]
ICU-21688 escape angle bracket in non-HTML content

`<name>` should be escaped with backslash to prevent it from being rendered as a HTML tag

3 years agoICU-21647 Added support for narrow quarter names to DateFormatSymbols and SimpleDateF...
Rich Gillam [Thu, 8 Jul 2021 00:03:03 +0000 (17:03 -0700)]
ICU-21647 Added support for narrow quarter names to DateFormatSymbols and SimpleDateFormat.

3 years agoICU-21680 Add -std flag when generating dependency information
David Haney [Sun, 18 Jul 2021 15:01:50 +0000 (15:01 +0000)]
ICU-21680 Add -std flag when generating dependency information

3 years agoICU-21638 re-enable some of the ICU4J performance tests
Craig Cornelius [Fri, 23 Jul 2021 17:19:42 +0000 (17:19 +0000)]
ICU-21638 re-enable some of the ICU4J performance tests

See #1773

3 years agoICU-21622 Add UYW to list of currencies from ucurr_openISOCurrencies()
Peter Edberg [Thu, 22 Jul 2021 04:21:54 +0000 (21:21 -0700)]
ICU-21622 Add UYW to list of currencies from ucurr_openISOCurrencies()

3 years agoICU-21624 Fixed it so that a DecimalFormat no longer owns two separate DecimalFormatS...
Rich Gillam [Thu, 8 Jul 2021 01:23:45 +0000 (18:23 -0700)]
ICU-21624 Fixed it so that a DecimalFormat no longer owns two separate DecimalFormatSymbols objects.

3 years agoICU-21584 Added code to allow regions of type "grouping" to return their children.
Rich Gillam [Fri, 9 Jul 2021 00:06:11 +0000 (17:06 -0700)]
ICU-21584 Added code to allow regions of type "grouping" to return their children.

3 years agoICU-21581 BRS 70rc, update urename.h pass 1
Peter Edberg [Wed, 21 Jul 2021 21:20:17 +0000 (14:20 -0700)]
ICU-21581 BRS 70rc, update urename.h pass 1

3 years agoICU-21581 Promoted @draft 68 APIs to stable
Rich Gillam [Fri, 2 Jul 2021 21:43:29 +0000 (14:43 -0700)]
ICU-21581 Promoted @draft 68 APIs to stable

3 years agoICU-21580 Fix typos in icu4j/
luz paz [Wed, 14 Jul 2021 21:03:25 +0000 (17:03 -0400)]
ICU-21580 Fix typos in icu4j/

Found via `codespell -q 3 -L ans,anumber,atleast,ba,bre,hace,nd,nin,ois,rsource,som,sur,tht -S icu4c/source/data/zone,icu4c/source/data/lang`

3 years agoICU-21580 Fix typos in icu4c/
luz paz [Wed, 14 Jul 2021 22:45:49 +0000 (18:45 -0400)]
ICU-21580 Fix typos in icu4c/

Found via `codespell -q 3 -L ans,anumber,atleast,ba,bre,hace,nd,nin,ois,rsource,som,sur,tht -S icu4c/source/data/zone,icu4c/source/data/lang`
ICU-21580 Fix source (related) typos
ICU-21580 Revert extraneous auto-encoding
ICU-21580 Re-add previous reverted fix without auto-encoding

3 years agoICU-21650 Modified DateTimePatternGenerator to handle skeletons with "e" and "c"...
Rich Gillam [Tue, 6 Jul 2021 20:56:15 +0000 (13:56 -0700)]
ICU-21650 Modified DateTimePatternGenerator to handle skeletons with "e" and "c" better (allowing support for numeric
day-of-week abbreviations).

3 years agoICU-21676 Fix strcpy buffer override in ultag_parse
Frank Yung-Fong Tang [Wed, 14 Jul 2021 00:47:01 +0000 (00:47 +0000)]
ICU-21676 Fix strcpy buffer override in ultag_parse

See #1767

3 years agoICU-21641 Fix "emoji" and "eor" for Collator::getKeywordValues
Frank Tang [Fri, 9 Jul 2021 00:40:23 +0000 (00:40 +0000)]
ICU-21641 Fix "emoji" and "eor" for Collator::getKeywordValues

See #1743

3 years agoICU-21664 Fix set but unused warning in locid.cpp
Jeff Genovy [Wed, 30 Jun 2021 19:20:46 +0000 (12:20 -0700)]
ICU-21664 Fix set but unused warning in locid.cpp

3 years agoICU-21659 Adding test for exabyte
Shane F. Carr [Wed, 30 Jun 2021 21:08:46 +0000 (16:08 -0500)]
ICU-21659 Adding test for exabyte

3 years agoICU-21525 UnicodeSet.hasString(), UnicodeSetIterator.skipToStrings() & C API
Markus Scherer [Tue, 29 Jun 2021 17:27:09 +0000 (17:27 +0000)]
ICU-21525 UnicodeSet.hasString(), UnicodeSetIterator.skipToStrings() & C API

3 years agoICU-21657 Update Azure CI build bots to use Ubuntu 18.04
Jeff Genovy [Wed, 30 Jun 2021 18:08:22 +0000 (11:08 -0700)]
ICU-21657 Update Azure CI build bots to use Ubuntu 18.04

3 years agoICU-21644 Use LocalPointer in ICUService::getKey to prevent leaks.
Jeff Genovy [Thu, 24 Jun 2021 04:57:04 +0000 (21:57 -0700)]
ICU-21644 Use LocalPointer in ICUService::getKey to prevent leaks.

3 years agoICU-21580 fix TimeZoneFormat doc typo, extra open brace
Markus Scherer [Thu, 24 Jun 2021 23:48:46 +0000 (23:48 +0000)]
ICU-21580 fix TimeZoneFormat doc typo, extra open brace

3 years agoICU-21580 fix RuleBasedCollator doc typo, extra open brace
Markus Scherer [Thu, 24 Jun 2021 23:41:59 +0000 (23:41 +0000)]
ICU-21580 fix RuleBasedCollator doc typo, extra open brace

3 years agoICU-21491 Remove internal usage of @internal BasicTimeZone::getOffsetFromLocal
Frank Tang [Wed, 16 Jun 2021 21:53:22 +0000 (21:53 +0000)]
ICU-21491 Remove internal usage of @internal BasicTimeZone::getOffsetFromLocal

See #1681

3 years agoICU-21635 Unicode 14 beta
Markus Scherer [Thu, 10 Jun 2021 00:56:15 +0000 (00:56 +0000)]
ICU-21635 Unicode 14 beta

3 years agoICU-21645 reduce heap allocations in unescape() and other parsing
Markus Scherer [Sat, 19 Jun 2021 20:25:33 +0000 (20:25 +0000)]
ICU-21645 reduce heap allocations in unescape() and other parsing

3 years agoICU-21645 test & fix unescapeAt(2 supp escapes)
Markus Scherer [Sat, 19 Jun 2021 02:01:58 +0000 (02:01 +0000)]
ICU-21645 test & fix unescapeAt(2 supp escapes)

3 years agoICU-21580 fix unicode.org/unicode/ URLs
Markus Scherer [Fri, 18 Jun 2021 22:19:21 +0000 (22:19 +0000)]
ICU-21580 fix unicode.org/unicode/ URLs

3 years agoICU-21581 integrate CLDR release-40-m1 to ICU trunk cldr/2021-06-15
Peter Edberg [Tue, 15 Jun 2021 07:10:06 +0000 (00:10 -0700)]
ICU-21581 integrate CLDR release-40-m1 to ICU trunk

3 years agoICU-21404 Migrate various documents from Google Sites pages to Markdown.
Hugo van der Merwe [Wed, 27 Jan 2021 15:33:46 +0000 (16:33 +0100)]
ICU-21404 Migrate various documents from Google Sites pages to Markdown.

Includes:
- https://sites.google.com/site/icusite/processes/release/tasks/*
- https://sites.google.com/site/icusite/processes/uni-up

Some links fixed, copyright notices added, filenames improved, Sidebar
navigation links to the new pages, but needs some further
improvements. Updated /trac/ticket/ links, and /trac/changeset/ links
where I could find the corresponding git commit. Also tweaked
userguide/dev/editing.md to clarify 'root directory'. Apply branch
rename: s/master/main/.

3 years agoICU-21632 Fix links in readme.html files to respective readme pages in User Guide
Elango Cheran [Wed, 9 Jun 2021 19:07:44 +0000 (12:07 -0700)]
ICU-21632 Fix links in readme.html files to respective readme pages in User Guide

3 years agoICU-21555 Fix typos from G to L
Erik Torres [Fri, 4 Jun 2021 16:38:42 +0000 (16:38 +0000)]
ICU-21555 Fix typos from G to L

See #1737

3 years agoICU-21581 BRSRC 70.0.1 Version update and regenerate configure for v70.0.1
Erik Torres [Fri, 4 Jun 2021 18:45:05 +0000 (18:45 +0000)]
ICU-21581 BRSRC 70.0.1 Version update and regenerate configure for v70.0.1

See #1738

3 years agoICU-21569 Fix Java7 breakage in LSTM & add Java7 CI
Frank Yung-Fong Tang [Thu, 27 May 2021 21:57:01 +0000 (21:57 +0000)]
ICU-21569 Fix Java7 breakage in LSTM & add Java7 CI

See #1734

3 years agoICU-21626 Add CIs to build/test Java 16.
Frank Tang [Tue, 18 May 2021 21:12:34 +0000 (14:12 -0700)]
ICU-21626 Add CIs to build/test Java 16.

3 years agoICU-21579 Compiler warning fixes.
Andy Heninger [Fri, 21 May 2021 01:34:39 +0000 (18:34 -0700)]
ICU-21579 Compiler warning fixes.

Fix typos in hard-coded test data, uncovered by a compiler upgrade to
clang version 12.0.0-1ubuntu1

3 years agoICU-21623 Removed linker flag -dexpall for AIX xlclang build.
yumaoka [Wed, 19 May 2021 20:35:04 +0000 (20:35 +0000)]
ICU-21623 Removed linker flag -dexpall for AIX xlclang build.

See #1727

3 years agoICU-21621 Fixing TestCharset/TestInvalidInput failure on Java 16 runtime.
Yoshito Umaoka [Wed, 19 May 2021 20:19:06 +0000 (16:19 -0400)]
ICU-21621 Fixing TestCharset/TestInvalidInput failure on Java 16 runtime.

3 years agoICU-21600 Increase timeout and separate build/test in Cygwin CI
Daniel Ju [Fri, 14 May 2021 23:46:34 +0000 (16:46 -0700)]
ICU-21600 Increase timeout and separate build/test in Cygwin CI

3 years agoICU-21569 LSTM fix resource issue.
Frank Tang [Thu, 13 May 2021 21:01:20 +0000 (21:01 +0000)]
ICU-21569 LSTM fix resource issue.

See #1718

3 years agoICU-21569 LSTM: Remove Java 8 dependency
Frank Tang [Fri, 14 May 2021 21:58:10 +0000 (14:58 -0700)]
ICU-21569 LSTM: Remove Java 8 dependency

3 years agoICU-21615 Work-around Azure CI build bot issue on Ubuntu 18.04
Jeff Genovy [Fri, 14 May 2021 23:28:39 +0000 (16:28 -0700)]
ICU-21615 Work-around Azure CI build bot issue on Ubuntu 18.04

3 years agoICU-21569 LSTM Add test to test long string.
Frank Tang [Wed, 12 May 2021 03:18:00 +0000 (20:18 -0700)]
ICU-21569 LSTM Add test to test long string.

3 years agoICU-21569 Remove extra LocalUResourceBundlePointer
Frank Tang [Tue, 11 May 2021 19:18:51 +0000 (12:18 -0700)]
ICU-21569 Remove extra LocalUResourceBundlePointer

Remove LocalUResourceBundlePointer fDictRes which shadowed the real
UUResourceBundle*.

ICU-21569 fix assert

3 years agoICU-21555 fix typos for D, E and F found in the repo
Erik Torres [Mon, 10 May 2021 16:49:16 +0000 (16:49 +0000)]
ICU-21555 fix typos for D, E and F found in the repo

See #1714

3 years agoICU-21569 LSTM Part 3 Add Java implementation
Frank Yung-Fong Tang [Sat, 8 May 2021 22:02:03 +0000 (22:02 +0000)]
ICU-21569 LSTM Part 3 Add Java implementation

See #1706

3 years agoICU-21569 LSTM Optimize memory usage.
Frank Tang [Fri, 7 May 2021 22:23:54 +0000 (22:23 +0000)]
ICU-21569 LSTM Optimize memory usage.

See #1712

3 years agoICU-21244 Make Valgrind CI build/test check run on every Pull Request
Erik Torres Aguilar [Thu, 6 May 2021 20:40:27 +0000 (20:40 +0000)]
ICU-21244 Make Valgrind CI build/test check run on every Pull Request

See #1713

3 years agoICU-21588 Fix strncpy warning
Frank Tang [Tue, 4 May 2021 01:24:40 +0000 (18:24 -0700)]
ICU-21588 Fix strncpy warning

3 years agoICU-21599 Move LanguageBreakEngine classes to com.ibm.icu.impl.breakiter
Frank Tang [Mon, 3 May 2021 16:17:25 +0000 (09:17 -0700)]
ICU-21599 Move LanguageBreakEngine classes to com.ibm.icu.impl.breakiter

3 years agoICU-21577 doc/le: update harfbuzz link; + icu-le-hb link
Mingye Wang [Fri, 19 Mar 2021 06:42:46 +0000 (14:42 +0800)]
ICU-21577 doc/le: update harfbuzz link; + icu-le-hb link

The fd.o harfbuzz link currently goes to the github repo, which is a bit less documentation-ish than the github.io page.

Add link to icu-le-hb.

3 years agoICU-21569 Propagate LSTM memory allocation issue
Frank Tang [Sat, 1 May 2021 02:19:24 +0000 (19:19 -0700)]
ICU-21569 Propagate LSTM memory allocation issue

3 years agoICU-21585 Remove not used var.
Frank Tang [Tue, 4 May 2021 00:00:04 +0000 (17:00 -0700)]
ICU-21585 Remove not used var.

3 years agoICU-21569 Add GA to test LSTM configuration
Frank Tang [Thu, 29 Apr 2021 07:07:45 +0000 (00:07 -0700)]
ICU-21569 Add GA to test LSTM configuration

1. Add GA to test BreakIterator under LSTM configuration (remove Thai
and Burmese dictionary and include Thai and Burmese LSTM)
2. Add LSTMDataName for the purpose of testing.
3. Add file base test code to test BreakIterator match results from test
file generated by pythong code in
https://github.com/unicode-org/lstm_word_segmentation/blob/master/segment_text.py
4. Fix a LSTMBreakEngine::divideUpDictionaryRange bug when the return value
should only contains the number of words found when the passed in foundBreaks
already contains some data.
5. Change the cintltest TestSwapData from testing thaidict to laodict so
it will not break while we filter out thaidict under the LSTM
configuration.

3 years agoICU-21569 Add BOM to the beginning of the lstm files (minor fix of Part 2 C++)
Frank Tang [Wed, 28 Apr 2021 23:12:47 +0000 (23:12 +0000)]
ICU-21569 Add BOM to the beginning of the lstm files (minor fix of Part 2 C++)

See #1702

3 years agoICU-21567 Avoid using regex in ULocale.getName()
Victor Chang [Mon, 29 Mar 2021 22:52:43 +0000 (23:52 +0100)]
ICU-21567 Avoid using regex in ULocale.getName()

No behavior change is expected

3 years agoICU-21597 Fix Null-deref W in canonicalizeLocale
Frank Tang [Wed, 28 Apr 2021 19:39:57 +0000 (12:39 -0700)]
ICU-21597 Fix Null-deref W in canonicalizeLocale

3 years agoICU-21569 LSTM Part 2- include C++ implementation
Frank Tang [Wed, 28 Apr 2021 01:06:06 +0000 (01:06 +0000)]
ICU-21569 LSTM Part 2- include C++ implementation

See #1689

3 years agoICU-21591 Release lock in SimpleDateFormat::tzFormat in case of failure
Andy Heninger [Sun, 25 Apr 2021 22:45:18 +0000 (15:45 -0700)]
ICU-21591 Release lock in SimpleDateFormat::tzFormat in case of failure

Also remove the use of the unsafe double-checked lock idiom in the same
function, SimpleDateFormat::tzFormat(). Synchronization now always uses a
mutex, which is slower, but in the context of format or parse operations,
shouldn't be significant.

Added synchronization to one more unsafe direct reference to a const
SimpleDateFormat::fTimeZoneFormat. In the assignment operator.

3 years agoICU-21569 LSTM Part 1- data file and build tool
Frank Yung-Fong Tang [Fri, 23 Apr 2021 21:28:30 +0000 (21:28 +0000)]
ICU-21569 LSTM Part 1- data file and build tool

See #1688