author | Olaf Weber <olaf@sgi.com> | 2019-04-25 20:49:18 +0300 |
---|---|---|
committer | Theodore Ts'o <tytso@mit.edu> | 2019-04-25 20:49:18 +0300 |
commit | a8384c68797ee022f5fd7bcef5f4cc57863d4042 (patch) | |
tree | 5cab2a7a1e9aa4a284354236868aed6eccfb5d5c /fs/unicode/README.utf8data | |
parent | 44594c2fbf42528001dfb1597d26adb40ba6d178 (diff) | |
unicode: reduce the size of utf8data[]
Remove the Hangul decompositions from the utf8data trie, and do
algorithmic decomposition to calculate them on the fly. To store the
decomposition the caller of utf8lookup()/utf8nlookup() must provide a
12-byte buffer, which is used to synthesize a leaf with the
decomposition. This significantly reduces the size of the utf8data[]
array.
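The on-the-fly decomposition described above can be sketched with the standard Hangul syllable arithmetic from the Unicode Standard. This is a minimal standalone illustration, not the kernel's code: the names `hangul_decompose`, `put_utf8_3` and `HANGUL_BUFSIZE` are hypothetical, the 12-byte size is simply the figure quoted in this commit message, and the way the kernel lays out the synthesized leaf inside that buffer is not reproduced here. The sketch writes only the UTF-8 bytes of the decomposition (at most nine) followed by a NUL.

```c
#include <stddef.h>
#include <stdint.h>

/* Constants from the Unicode Standard's Hangul syllable decomposition. */
#define HANGUL_SBASE  0xAC00u	/* first precomposed syllable */
#define HANGUL_LBASE  0x1100u	/* first leading consonant */
#define HANGUL_VBASE  0x1161u	/* first vowel */
#define HANGUL_TBASE  0x11A7u	/* one before the first trailing consonant */
#define HANGUL_VCOUNT 21u
#define HANGUL_TCOUNT 28u
#define HANGUL_NCOUNT (HANGUL_VCOUNT * HANGUL_TCOUNT)	/* 588 */
#define HANGUL_SCOUNT (19u * HANGUL_NCOUNT)		/* 11172 */

/* Assumption: buffer size taken from the commit message. */
#define HANGUL_BUFSIZE 12

/* Encode a code point in U+0800..U+FFFF as three UTF-8 bytes. */
static unsigned char *put_utf8_3(unsigned char *p, uint32_t cp)
{
	*p++ = (unsigned char)(0xE0 | (cp >> 12));
	*p++ = (unsigned char)(0x80 | ((cp >> 6) & 0x3F));
	*p++ = (unsigned char)(0x80 | (cp & 0x3F));
	return p;
}

/*
 * Hypothetical helper: write the NUL-terminated UTF-8 decomposition of
 * the Hangul syllable cp into buf. Returns the number of bytes written,
 * excluding the NUL, or 0 if cp is not a precomposed Hangul syllable.
 */
static size_t hangul_decompose(uint32_t cp, unsigned char buf[HANGUL_BUFSIZE])
{
	unsigned char *p = buf;
	uint32_t si, li, vi, ti;

	if (cp < HANGUL_SBASE || cp >= HANGUL_SBASE + HANGUL_SCOUNT)
		return 0;

	si = cp - HANGUL_SBASE;
	li = si / HANGUL_NCOUNT;			/* leading consonant index */
	vi = (si % HANGUL_NCOUNT) / HANGUL_TCOUNT;	/* vowel index */
	ti = si % HANGUL_TCOUNT;			/* trailing index, 0 = none */

	p = put_utf8_3(p, HANGUL_LBASE + li);
	p = put_utf8_3(p, HANGUL_VBASE + vi);
	if (ti)
		p = put_utf8_3(p, HANGUL_TBASE + ti);
	*p = '\0';

	return (size_t)(p - buf);
}
```

Because every one of the 11,172 syllables can be derived from three small indices this way, none of them needs its own stored decomposition, which is consistent with the size reduction this commit reports.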
Changes made by Gabriel:
  Rebase to mainline
  Fix checkpatch errors
  Extract robustness fixes and merge back to original mkutf8data.c patch
  Regenerate utf8data.h
Signed-off-by: Olaf Weber <olaf@sgi.com>
Signed-off-by: Gabriel Krisman Bertazi <krisman@collabora.co.uk>
Signed-off-by: Theodore Ts'o <tytso@mit.edu>
Diffstat (limited to 'fs/unicode/README.utf8data')
-rw-r--r-- | fs/unicode/README.utf8data | 4 |
1 files changed, 2 insertions, 2 deletions
diff --git a/fs/unicode/README.utf8data b/fs/unicode/README.utf8data
index 7b18dca1146f..4af398a9fb31 100644
--- a/fs/unicode/README.utf8data
+++ b/fs/unicode/README.utf8data
@@ -46,8 +46,8 @@ cd to this directory (fs/unicode) and run this command:
 	make C=../.. objdir=../.. utf8data.h.new
 
 After sanity checking the newly generated utf8data.h.new file (the
-version generated from the 11.0.0 UCD should be 13,834 lines long, and
-have a total size of 1104k) and/or comparing it with the older version
+version generated from the 11.0.0 UCD should be 4,061 lines long, and
+have a total size of 320k) and/or comparing it with the older version
 of utf8data.h, rename it to utf8data.h.
 
 If you are a kernel developer updating to a newer version of the