65 Commits

Author SHA1 Message Date
35eb472e6b Backport gcc changes to MSVC
wchar_t is not char16_t on MSVC, requiring a lot of typecasting when
interfacing ZRCola database strings with GUI.

Signed-off-by: Simon Rozman <simon@rozman.si>
2022-09-15 15:33:08 +02:00
ba4ff3cd42 libZRCola: Add test
Signed-off-by: Simon Rozman <simon@rozman.si>
2022-09-15 14:36:44 +02:00
9709cc2845 libZRCola: Make UTF-16 explicit
ZRCola is using UTF-16LE strings internally (thanks to Windows).
However, wchar_t and std::wstring are UTF-32 on other platforms.

Signed-off-by: Simon Rozman <simon@rozman.si>
2022-09-15 14:36:44 +02:00
eedab7da56 libZRCola: Initial stab at compiling with gcc
Signed-off-by: Simon Rozman <simon@rozman.si>
2022-09-15 14:36:44 +02:00
63fda12c99 Switch integer datatypes to C99
This makes code more portable.

Signed-off-by: Simon Rozman <simon@rozman.si>
2022-09-15 14:36:44 +02:00
47d3884af3 Update Copyright and build year
Signed-off-by: Simon Rozman <simon@rozman.si>
2022-01-07 11:38:32 +01:00
7fb29aed80 Introduce bad ZRCola Unicode Composition highlighting
Signed-off-by: Simon Rozman <simon@rozman.si>
2021-12-21 14:21:28 +01:00
0d112cd90b Switch to SPDX license notice
Signed-off-by: Simon Rozman <simon@rozman.si>
2021-12-01 08:48:49 +01:00
12c2889ea3 Copyright: Bump year
Signed-off-by: Simon Rozman <simon@rozman.si>
2021-03-25 08:36:00 +01:00
c7e29b1862 Add UTF-8 BOM markers
MSVC needs them to use correct charset when Language for non-Unicode
programs is set to Windows-1252 or anything different than UTF-8.

Signed-off-by: Simon Rozman <simon@rozman.si>
2020-11-19 10:59:54 +01:00
ebdbb84a29 Enable recommended code analysis and address warnings
Signed-off-by: Simon Rozman <simon@rozman.si>
2020-02-13 13:18:11 +01:00
b3702ed237 Extend copyright year
Signed-off-by: Simon Rozman <simon@rozman.si>
2020-02-11 15:38:57 +01:00
0800f813ff Rename stdafx.h to pch.h
Signed-off-by: Simon Rozman <simon@rozman.si>
2020-02-10 14:10:29 +01:00
893dbf5783 Update year to 2019
Signed-off-by: Simon Rozman <simon@rozman.si>
2019-04-01 14:50:04 +02:00
ddae64ca6d Extend copyright year 2018-09-07 23:57:59 +02:00
59d06ae3dc Make explicit int to unsigned __int6 conversion 2018-09-07 19:36:10 +02:00
3831012af9 Distinguish variables with same names
...to resolve C4457 warnings.
2018-09-07 19:32:54 +02:00
9ca82f02f8 Decomposition (inverse-translation) fixed 2017-07-13 14:50:27 +02:00
84fa35c9be Clean-up 2017-05-10 09:44:06 +02:00
b7bef14746 Copyright extended to 2017 2017-04-18 14:05:45 +02:00
acbae76737 Clean-up 2017-03-30 10:57:31 +02:00
cc4a150501 Translation set ID data-type introduced 2017-03-30 10:23:08 +02:00
186dbee443 Multiple translation sets support 2017-03-30 09:49:22 +02:00
155642a3f9 "Composition" and "Decomposition" renamed to more general terms "Translation" and "Inverse translation" to extend its use for transliteration 2017-03-27 14:10:43 +02:00
806aa550a5 Characters are represented as UTF-16 sequences instead of a single UTF-16 character 2017-03-21 09:09:07 +01:00
ca306345c2 Composed and decomposed strings of the ZRCola::translation_db::translation protected 2017-03-16 10:21:29 +01:00
03ff056898 ZRCola::translation_db::translation::com_start marked as const now 2017-03-16 09:44:43 +01:00
9f083bb521 Character-Language table extended to support multi-UTF-16 characters 2017-03-14 14:14:39 +01:00
bd0fdba435 Composed characters extended to support multiple UTF-16 character 2017-03-14 12:54:46 +01:00
87814981db ZRCola::translation_db::Compose fixed to honor inputMax==-1 correctly 2017-03-14 10:51:56 +01:00
4ce3213b1a Searching by character tags honors category selection now 2016-10-13 12:30:55 +02:00
0ed0cf8c49 Support for searchable character tags added 2016-10-13 11:13:46 +02:00
f51c4dbbec ZRCola::tagname_db::Search() added 2016-10-12 18:45:59 +02:00
f53779dbad Character tagging support added 2016-10-12 14:30:24 +02:00
641d178248 include/zrcola.h >> include/version.h 2016-10-11 12:49:01 +02:00
dddc2d5794 chrcatid_t_blank and langid_t_blank are static members of their respective structures now 2016-10-06 23:17:24 +02:00
ba8254c097 Auto C++ typing introduced where appropriate 2016-10-06 11:52:17 +02:00
e5b246a4f0 Character search ranking and display is more sophisticated now 2016-05-31 11:50:12 +02:00
155fb03c5a Character search moved to separate thread for smoother experience 2016-05-13 12:11:38 +02:00
710937f8df Language ID type redeclaration for easier and safer work 2016-05-13 10:16:29 +02:00
4ec7dc3ca5 Search optimizations 2016-05-13 09:32:36 +02:00
9c3c1585d5 Character Select finished
(closes #11)
2016-05-13 03:44:28 +02:00
4aa0d9183e Support for Unicode character database added 2016-05-11 14:39:20 +02:00
4a27d62b4e ZRCola::LangConvert() function added 2016-04-29 12:35:59 +02:00
8c51f9c2a6 Ambiguous decomposition sequences now decompose to first ranked decomposition
(resolves #18)
2016-04-22 11:30:42 +02:00
2e89edb62c Source and destination index mapping on the left side of the first transformation issue fixed 2016-04-22 10:59:29 +02:00
a021dd31f7 ZRCola::mapping is generic now 2016-04-20 12:24:23 +02:00
f735bd5bee Decomposition can omit language specific characters now 2016-04-11 13:28:19 +02:00
177edd19e8 Languages and language-specific characters added to the database 2016-04-11 12:41:22 +02:00
fc93474b9a Characters with decompositions starting with # do not decompose any more. 2016-04-11 10:21:11 +02:00