wxWidgets

Author	SHA1	Message	Date
Vadim Zeitlin	46ea3cb8c0	Refactor: merge decode_utf16() into wxDecodeSurrogate() No real changes, but just get rid of two functions doing the same thing but using (semantically) different API, this was just too confusing. Change all the code to use wxDecodeSurrogate() that encapsulates decoding the surrogate and advancing the input pointer as needed and so is less error-prone. More generally, change the code to use end pointers instead of decrementing the length to check for the end condition: this is more clear, simpler and probably even more efficient.	2017-11-09 23:49:53 +01:00
Vadim Zeitlin	d82e3d4429	Never read uninitialized memory when decoding UTF-16 again Pass length value to decode_utf16() and end pointer to wxDecodeSurrogate() to ensure that we never read beyond the end of the buffer when decoding UTF-16 when the last (complete) 16 bit value in the buffer is the first half of a surrogate. This had been previously partially addressed by ad hoc changes, e.g. `f72aa7b1c9` did it for wxMBConvUTF16swap, but the problem still remained for wxMBConvUTF16straight. Ensure that this bug is fixed everywhere now but making it impossible to even try decoding a surrogate without providing the buffer length.	2017-11-09 23:48:20 +01:00
Vadim Zeitlin	2ee199acac	Change decode_utf16() to take wxChar16 instead of wxUint16 Under Unix systems, this is the same thing, but under MSW, where sizeof(wchar_t) == 2, this allows to pass wchar_t pointers to this function without casts. It also makes it consistent with wxDecodeSurrogate() and allows to get rid of another ugly cast there. No real changes.	2017-11-09 23:47:22 +01:00
Vadim Zeitlin	666ff421bb	Fix an out of bounds read in UTF-7 decoding code Calling wxMBConvUTF7::ToWChar(..., "+", 1) resulted in reading uninitialized memory as the decoding code didn't check that there were any bytes left when switching to the "shifted" mode. Fix this by explicitly checking for this and returning an error if nothing is left.	2017-11-09 23:38:00 +01:00
Vadim Zeitlin	c47acbeb52	Fix wxMBConv::cWC2MB() and cMB2WC() returned buffer length This commit refactors the overloads of cMB2WC() and cWC2MB() methods taking raw pointers and buffers to reuse the same code and fixes the wrong length of the buffer returned by cWC2MB(wchar_t) overload for conversions using multiple bytes to represent the NUL terminator character (it previously was wrong for UTF-16 and UTF-32 conversions due to wrongly subtracting 1 from the length when creating it instead of correctly subtracting GetMBNulLen()) and the wrong length of the buffer returned from cMB2WC(char) overload where no adjustment for the trailing NUL was done at all. Also return simple default-constructed buffers from these methods in case of failure instead of using wxScopedCharBuffer::CreateNonOwned() which is less obvious and less efficient (even if the latter probably doesn't matter here because it's only done in case of an error). Finally, add tests checking that using WC2MB() or either of cWC2MB() overloads returns the buffers of the same length and with the same contents.	2017-11-03 23:26:39 +01:00
Vadim Zeitlin	8bf239f8e4	Make wxMBConv dtor inline The Darwin linking problem mentioned in the comment doesn't exist in any of the still supported macOS versions, so it doesn't make sense to continue working around it.	2017-11-03 18:00:04 +01:00
Vadim Zeitlin	b3fe07942f	Remove top level "const" from wxMBConv methods return values This "const" is useless and doesn't actually do anything, remove it to avoid confusion.	2017-11-02 01:57:22 +01:00
Tobias Taschner	73a22766ee	Always enable wxMBConv::IsUTF8() These where previously guarded by wxUSE_UNICODE_UTF8 but may be useful in other configurations too.	2017-10-27 20:13:04 +02:00
ARATA Mizuki	8a29c5c09f	Use the added wxUniChar functions in the existing code	2017-05-01 18:18:45 +09:00
Paul Cornett	64f1d760c6	Improve handling of wxUSE_FONTMAP==0 case with wxMSW wxEncodingToCodepage() can be used when wxUSE_FONTMAP==0 Also avoid unreachable code warning with MSVC when using whole program optimization	2016-12-22 22:09:41 -08:00
Lauri Nurmi	902130f64e	Use the new wxSysErrorMsgStr() instead of wxSysErrorMsg()	2016-11-21 19:15:20 +02:00
Maarten	977a826639	use more wxOVERRIDE (#329 )	2016-09-25 13:21:28 -07:00
ARATA Mizuki	daf944909c	Use wxChar16 instead of wxDecodeSurrogate_t for UTF-16 unit type in wxDecodeSurrogate	2016-03-01 16:01:39 +09:00
ARATA Mizuki	5a92181ac1	Fix the length returned by UTF-32 conversion for non-BMP input Don't optimize the required length as this is a tiny gain resulting in big problems with the strings containing surrogates for which the actual result is shorter than the length returned, resulting in extra NUL bytes at the end of the converted buffer. This is similar to `3410aa372f` (see #16298) but for UTF-32 and not UTF-16. Closes #17070.	2016-02-21 14:38:17 +01:00
Vadim Zeitlin	956edbb309	Reimplement wxSafeConvertXXX() functions using wxWhateverWorksConv These functions were almost but not quite identical to it: wxSafeConvertMB2WX() tried the current locale encoding before UTF-8 while wxConvWhateverWorks tries UTF-8 first and then the current locale encoding. The latter behaviour is more correct as valid UTF-8 could be misinterpreted as some legacy multibyte encoding otherwise, so get rid of this difference and just forward these functions to wxConvWhateverWorks.	2016-02-19 02:57:20 +01:00
Vadim Zeitlin	a11456c078	Add wxWhateverWorksConv and use it for file names under Unix This ensures that we can create output files with Unicode names even when they're not representable in the current locale encoding, notably when the current locale has never been changed and is still the default "C" one, not supporting anything else other than 7 bit ASCII. Credits for the new class name go to Woody Allen.	2016-02-19 02:52:43 +01:00
Vadim Zeitlin	90eae99cd6	Use strict UTF-8 conversion in wxSafeConvertXXX() functions It doesn't make sense to use any fallbacks when converting to/from UTF-8 and this wasn't even done consistently as only wxSafeConvertWX2MB() used MAP_INVALID_UTF8_TO_OCTAL, but not wxSafeConvertMB2WX(). More importantly, UTF-8 conversion can never fail for a valid Unicode string, so there is no need for any fall backs.	2016-02-13 17:03:47 +01:00
ARATA Mizuki	e570e8b6ac	Fix conversion from wchar_t string with surrogates to UTF-8 Correctly account for the second half of the surrogate in wxMBConvUTF8::FromWChar() implementation, this makes it actually work for the strings containing surrogates on the platforms using UTF-16 encoding for wchar_t (such as MSW). See #17070.	2015-11-13 19:36:34 +01:00
Vadim Zeitlin	5cff8c1232	Fix return value of wxMBConvUTF32::cWC2MB() in presence of surrogates UTF-32 conversions only estimate, from above, the size of the output buffer needed, so the value returned from the first call to FromWChar(NULL) in cWC2MB() can be inexact for them and we need to return the value returned by the second call to FromWChar() doing the real conversion from cWC2MB() itself to ensure that we return the correct output length. See #17070.	2015-11-13 19:36:33 +01:00
Vadim Zeitlin	048ba4b509	Fail to convert wide string with incomplete surrogates to UTF-8 Correctly fail if the wide string being converted is UTF-16 encoded (which can only happen on platforms using 16 bit wchar_t, i.e. MSW) and ends in the middle of a surrogate pair. Notice that other conversions still wrongly encode invalid wchar_t sequences such as 0xd800 not followed by anything, this will need to be fixed in the future, but for now at least make it work for the most commonly used conversion. See #17070.	2015-11-13 19:36:32 +01:00
Tobias Taschner	8282c1be0f	Remove Windows CE support Windows CE doesn't seem to be supported by Microsoft any longer. Last CE release was in early 2013 and the PocketPC and Smartphone targets supported by wxWidgets are long gone. The build files where already removed in an earlier cleanup this commit removes all files, every #ifdef and all documentation regarding the Windows CE support. Closes https://github.com/wxWidgets/wxWidgets/pull/81	2015-09-23 00:52:30 +02:00
Tobias Taschner	f1abb351af	Remove MicroWindows support. MicroWindows (aka Nano-X) support hasn’t been updated since 2010 and last work for it in wxWidgets happened more than 10 years ago.	2015-08-27 11:00:16 +02:00
Vadim Zeitlin	f72aa7b1c9	Fix reading beyond end of buffer in UTF-16 decoding code. Verify that incrementing the input pointer doesn't take us outside the buffer. Still accept a single trailing NUL as the string terminator.	2015-06-21 02:24:49 +02:00
Tim Kosse	d9d6247f37	In wxMBConvStrictUTF8::ToWChar the length of a multibyte UTF-8 sequence is obtained from a table, with the leading byte as offset. Later in that function, the prefix of the leading byte is compared against the expected prefix for the given length. Unless the table is faulty, this comparison can never fail. It is thus redundant and not needed. As optimizing compilers aren't smart enough yet to detect this, this commit removes the redundant check. git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@78264 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2014-12-11 20:31:21 +00:00
Vadim Zeitlin	f99ff49e29	Don't end converting prematurely on encountering consecutive NULs. Don't stop converting subsequent chunks just because the length of one of them was 0: this can happen if the first character of a string is a NUL or if there are two (or more) NULs in it later. Simply remove the check for this and continue as usual even in this case. Also add a unit test verifying that we do translate NULs in input into NULs in output. Closes #16620. git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@78021 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2014-10-14 19:36:46 +00:00
Vadim Zeitlin	3410aa372f	Fix wrong resulting string length in UTF-16 to wchar_t conversion. Don't optimize the returned length for surrogate-less case, this does save a pass of the string but at the price of returning a wrong result, which is not worth it, just compute the really required length exactly. Closes #16298. git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@76622 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2014-05-29 23:48:40 +00:00
Vadim Zeitlin	0d4ad161d5	Remove support for Win9x from wxMSW. Most importantly, this allows us to remove all MSLU-related stuff. Some functions which were previously loaded dynamically can now be just used directly, too. git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@76535 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2014-05-16 02:33:40 +00:00
Vadim Zeitlin	761e0c12a0	Remove Windows version check from wxMBConv_win32. We don't support systems predating Windows 2000 SP4 any more, so there is no need to check for them. This also allows to get rid of the code checking for conversion correctness. Also remove the broken URLs from the comments, they didn't contain any particularly useful information anyhow. git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@76406 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2014-04-27 22:39:28 +00:00
Paul Cornett	8b4ae731d3	use wxOVERRIDE git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@76220 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2014-03-30 00:02:23 +00:00
Vadim Zeitlin	33ad33d447	Add wxOVERRIDE and use it in common and wxOSX code. Make overriding virtual methods more explicit and enable additional checks provided by C++11 compilers when "override" is used. Closes #16100. git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@76173 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2014-03-20 13:26:28 +00:00
Vadim Zeitlin	d94f3f5aba	Fix bug with non-NUL-terminaed inputs in wxMBConvUTF8. We read beyond the provided maximal length as we didn't update the remaining length while parsing the remaining bytes of an UTF-8-encoded code point. Fix this and add a test for it. Closes #15901. git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@75733 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2014-01-29 22:25:14 +00:00
Vadim Zeitlin	3f66f6a5b3	Remove all lines containing cvs/svn "$Id$" keyword. This keyword is not expanded by Git which means it's not replaced with the correct revision value in the releases made using git-based scripts and it's confusing to have lines with unexpanded "$Id$" in the released files. As expanding them with Git is not that simple (it could be done with git archive and export-subst attribute) and there are not many benefits in having them in the first place, just remove all these lines. If nothing else, this will make an eventual transition to Git simpler. Closes #14487. git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@74602 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2013-07-26 16:02:46 +00:00
Vadim Zeitlin	935693c484	Don't unnecessarily NUL-terminate wxCharBuffer contents. wxCharBuffer already initializes the last byte of the buffer it allocates to 0 so there is no need to do it explicitly. Also don't allocate an extra byte, wxCharBuffer already adds one to the length passed to it for the trailing NUL. See #13885. git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@73141 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2012-12-08 00:37:17 +00:00
Vadim Zeitlin	2ba61518f4	Fix return value of wxMBConvUTF8::FromWChar(). Apply the same fix as was done in r68694 for ToWChar() to FromWChar(): it also returned an off by 1 value when not using MAP_INVALID_UTF8_NOT. Closes #13400. git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@70462 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2012-01-25 00:10:44 +00:00
Vadim Zeitlin	041e6050fd	Don't read beyond the end of input buffer when decoding UTF-16. wxMBConvStrictUTF8::FromWChar() didn't update the input length correctly when encountering a surrogate while decoding UTF-16 and could read beyond the end of the input buffer in this case. Fix this by simply adjusting the input length when a surrogate is read. Closes #13614. git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@69676 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2011-11-05 11:23:44 +00:00
Vadim Zeitlin	f4cb7c58da	Fix return value of wxMBConvUTF8::ToWChar() when not using MAP_INVALID_UTF8_NOT. wxMBConvUTF8::ToWChar() was off by 1 when the input length was explicitly specified, the extra NUL should only be added in the implicit length case. This bug didn't occur for the default wxMBConvUTF8 object as it simply forwarded to the base class wxMBConvStrictUTF8 implementation but it happened when MAP_INVALID_UTF8_TO_OCTAL or MAP_INVALID_UTF8_TO_PUA was used. git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@68694 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2011-08-14 19:39:31 +00:00
Vadim Zeitlin	d883acaaa0	Initialize variable in UTF{16,32} conversion code to avoid warnings. In optimized build g++ warned about the second element of two-element array passed to encode_utf16() being possibly uninitialized. This wasn't really the case but change the code just to avoid the warnings. git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@68112 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2011-06-30 12:20:54 +00:00
Stefan Csomor	0dcbb107ee	fix clang warning (? having two different operand types) git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@67336 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2011-03-29 18:40:18 +00:00
Paul Cornett	da2f117200	fix preprocessor expression, closes #12822 git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@66523 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2011-01-02 16:59:15 +00:00
Vadim Zeitlin	6c4d607e60	Initialize wxCSConv immediately instead of deferring it. Deferred initialization code was not MT-safe and just wasn't that useful anyhow because it is rare to create a wxCSConv object and not use it afterwards. Remove the deferred initialization logic and create the real conversion used by wxCSConv immediately in its ctor. Also improve the comments by clearly explaining the possible values of wxCSConv::m_name and m_encoding. Closes #12630. git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@66119 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2010-11-11 12:09:22 +00:00
Vadim Zeitlin	8244507f68	Ensure that strings returned by wxMBConv_cf are in NFC form. Normalize all Unicode strings used internally even though the Darwin kernel gives them to us in decomposed (NFD) form. Closes #11730. git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@66033 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2010-11-05 21:40:09 +00:00
Vadim Zeitlin	b64f93b67f	Make wxMBConv_iconv MT-safe by not using wxString in it. Use just "char *" for wxMBConv_iconv::m_name to avoid MT-safety problems related to using a wxString (which is not always MT-safe) from multiple threads. See #12630. git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@65968 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2010-10-31 13:41:03 +00:00
Vadim Zeitlin	f48a115976	No real changes, just use const_cast<> instead of C casts. Replace many comments indicating that the C cast used was really a const_cast<> with the proper cast itself. There is no reason to not use it any longer, all the supported compilers understand it. git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@65861 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2010-10-22 14:17:30 +00:00
Vadim Zeitlin	cfcfada96e	Return valid buffer from wxMBConv::c{MB,WC}2{WC,MB} for empty input. Returning invalid buffer for empty input is unexpected and resulted in e.g. wxString::utf8_str() returning NULL and not "" in ANSI build for empty strings (which, in turn, resulted in crashes in the test suite and undoubtedly not only) as well as crashes when calling GTK+ functions (see #12432). Other uses of cMB2WC() also show that NULL return value from it is unexpected as it is often passed to CRT functions not accepting NULL. So return empty buffer instead for empty input to avoid all these problems. git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@65836 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2010-10-17 13:59:42 +00:00
Vadim Zeitlin	5276b0a53c	Use wxDELETE() and wxDELETEA() when possible. Use wxDELETE[A]() functions which automatically NULL out their arguments after deleting them instead of doing it manually. Closes #9685. git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@64656 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2010-06-20 18:18:23 +00:00
Václav Slavík	8d94819c43	Remove wxUSE_WCHAR_T checks. wxWidgets requires wchar_t for some time now; wx/chartype.h has a check to fail complation without it. Simplify code by removing now-dead code for the !wxUSE_WCHAR_T case. git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@63991 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2010-04-16 10:43:18 +00:00
Vadim Zeitlin	2730723380	Correct two bugs in wxMBConv::FromWChar() with non NUL-terminated strings. The variable "lenChunk" was incorrectly used as the length of the wide string chunk which could result in wrong output. Worse, the output buffer could be overflown for the final chunk because it didn't have to have enough space for the trailing NUL(s) in it. Fix both bugs and added unit tests for them. Based on patch by Kuang-che Wu. Closes #11486. git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@62793 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2009-12-06 02:30:05 +00:00
Paul Cornett	a243da29c8	make array data fully const git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@62764 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2009-12-02 17:28:45 +00:00
Vadim Zeitlin	bbb0ff36db	Fix another off-by-1 bug in wxMBConv::ToWChar(). When converting a fixed number of characters we need to take any NULs inside the buffer being converted into account for our return value -- but this wasn't done and converting 2 characters 'x' and '\0' returned only 1, even if the length 2 was explicitly specified. Fix this bug and add a unit test checking for it. git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@62141 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2009-09-26 13:31:27 +00:00
Vadim Zeitlin	40ac5040ce	Add convenient wxMBConv::cMB2WC/WC2MB overloads taking buffers. These overloads allow not to worry about buffer lengths and just convert between wxCharBuffer and wxWCharBuffer directly in a convenient way. git-svn-id: https://svn.wxwidgets.org/svn/wx/wxWidgets/trunk@61896 c3d73ce0-8a6f-49c7-b76d-6d57e0e08775	2009-09-12 22:40:25 +00:00

1 2 3 4 5 ...

332 Commits