From noreply at sourceforge.net Wed May 12 18:03:31 2010
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 12 May 2010 16:03:31 +0000
Subject: [Expat-bugs] [ expat-Bugs-3000537 ] expat 2.0.1 unpacking fails on ubuntu 10.04, fedora 12/13
Message-ID:

Bugs item #3000537, was opened at 2010-05-12 18:03
Message generated for change (Tracker Item Submitted) made by playya
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=110127&aid=3000537&group_id=10127

Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update.

Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Private: No
Submitted By: playya (playya)
Assigned to: Nobody/Anonymous (nobody)
Summary: expat 2.0.1 unpacking fails on ubuntu 10.04, fedora 12/13

Initial Comment:
If you try to unpack the tar.gz you get the following error:

gzip: expat-2.0.1.tar.gz: invalid compressed data--crc error

Does anyone know which tool/version was used to generate the archive, so the problem can be isolated?
Would it be possible to repack the archive?
Additional information:
* This bug only affects 32-bit systems
* launchpad bug: http://bugs.launchpad.net/fedora/+source/gzip/+bug/524366

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=110127&aid=3000537&group_id=10127

From noreply at sourceforge.net Wed May 12 18:11:07 2010
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 12 May 2010 16:11:07 +0000
Subject: [Expat-bugs] [ expat-Bugs-3000537 ] expat 2.0.1 unpacking fails on ubuntu 10.04, fedora 12/13
Message-ID:

Bugs item #3000537, was opened at 2010-05-12 18:03
Message generated for change (Settings changed) made by playya
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=110127&aid=3000537&group_id=10127

Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update.

Category: None
Group: None
Status: Open
Resolution: None
>Priority: 2
Private: No
Submitted By: playya (playya)
Assigned to: Nobody/Anonymous (nobody)
Summary: expat 2.0.1 unpacking fails on ubuntu 10.04, fedora 12/13

Initial Comment:
If you try to unpack the tar.gz you get the following error:

gzip: expat-2.0.1.tar.gz: invalid compressed data--crc error

Does anyone know which tool/version was used to generate the archive, so the problem can be isolated?
Would it be possible to repack the archive?
Additional information:
* This bug only affects 32-bit systems
* launchpad bug: http://bugs.launchpad.net/fedora/+source/gzip/+bug/524366

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=110127&aid=3000537&group_id=10127

From noreply at sourceforge.net Wed May 12 19:55:22 2010
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 12 May 2010 17:55:22 +0000
Subject: [Expat-bugs] [ expat-Bugs-3000537 ] expat 2.0.1 unpacking fails on ubuntu 10.04, fedora 12/13
Message-ID:

Bugs item #3000537, was opened at 2010-05-12 12:03
Message generated for change (Comment added) made by kwaclaw
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=110127&aid=3000537&group_id=10127

Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update.

>Category: Build control
Group: None
Status: Open
Resolution: None
Priority: 2
Private: No
Submitted By: playya (playya)
Assigned to: Nobody/Anonymous (nobody)
Summary: expat 2.0.1 unpacking fails on ubuntu 10.04, fedora 12/13

Initial Comment:
If you try to unpack the tar.gz you get the following error:

gzip: expat-2.0.1.tar.gz: invalid compressed data--crc error

Does anyone know which tool/version was used to generate the archive, so the problem can be isolated?
Would it be possible to repack the archive?

Additional information:
* This bug only affects 32-bit systems
* launchpad bug: http://bugs.launchpad.net/fedora/+source/gzip/+bug/524366

----------------------------------------------------------------------

>Comment By: Karl Waclawek (kwaclaw)
Date: 2010-05-12 13:55

Message:
Have you tried re-downloading the file?

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=110127&aid=3000537&group_id=10127

From noreply at sourceforge.net Thu May 13 21:37:27 2010
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 13 May 2010 19:37:27 +0000
Subject: [Expat-bugs] [ expat-Bugs-3000537 ] expat 2.0.1 unpacking fails on ubuntu 10.04, fedora 12/13
Message-ID:

Bugs item #3000537, was opened at 2010-05-12 18:03
Message generated for change (Comment added) made by carcinoma
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=110127&aid=3000537&group_id=10127

Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update.

Category: Build control
Group: None
Status: Open
Resolution: None
Priority: 2
Private: No
Submitted By: playya (playya)
Assigned to: Nobody/Anonymous (nobody)
Summary: expat 2.0.1 unpacking fails on ubuntu 10.04, fedora 12/13

Initial Comment:
If you try to unpack the tar.gz you get the following error:

gzip: expat-2.0.1.tar.gz: invalid compressed data--crc error

Does anyone know which tool/version was used to generate the archive, so the problem can be isolated?
Would it be possible to repack the archive?

Additional information:
* This bug only affects 32-bit systems
* launchpad bug: http://bugs.launchpad.net/fedora/+source/gzip/+bug/524366

----------------------------------------------------------------------

Comment By: Christian Anke (carcinoma)
Date: 2010-05-13 21:37

Message:
Have the same problem. Re-downloaded it, tried another mirror, same thing.

----------------------------------------------------------------------

Comment By: Karl Waclawek (kwaclaw)
Date: 2010-05-12 19:55

Message:
Have you tried re-downloading the file?

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=110127&aid=3000537&group_id=10127

From noreply at sourceforge.net Thu May 13 22:47:58 2010
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 13 May 2010 20:47:58 +0000
Subject: [Expat-bugs] [ expat-Bugs-3000537 ] expat 2.0.1 unpacking fails on ubuntu 10.04, fedora 12/13
Message-ID:

Bugs item #3000537, was opened at 2010-05-12 12:03
Message generated for change (Comment added) made by kwaclaw
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=110127&aid=3000537&group_id=10127

Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update.

Category: Build control
Group: None
Status: Open
Resolution: None
Priority: 2
Private: No
Submitted By: playya (playya)
Assigned to: Nobody/Anonymous (nobody)
Summary: expat 2.0.1 unpacking fails on ubuntu 10.04, fedora 12/13

Initial Comment:
If you try to unpack the tar.gz you get the following error:

gzip: expat-2.0.1.tar.gz: invalid compressed data--crc error

Does anyone know which tool/version was used to generate the archive, so the problem can be isolated?
Would it be possible to repack the archive?

Additional information:
* This bug only affects 32-bit systems
* launchpad bug: http://bugs.launchpad.net/fedora/+source/gzip/+bug/524366

----------------------------------------------------------------------

>Comment By: Karl Waclawek (kwaclaw)
Date: 2010-05-13 16:47

Message:
Following the link in the original post, this does not seem to be an issue with the archive, but rather with a specific build of gzip.

----------------------------------------------------------------------

Comment By: Christian Anke (carcinoma)
Date: 2010-05-13 15:37

Message:
Have the same problem. Re-downloaded it, tried another mirror, same thing.

----------------------------------------------------------------------

Comment By: Karl Waclawek (kwaclaw)
Date: 2010-05-12 13:55

Message:
Have you tried re-downloading the file?

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=110127&aid=3000537&group_id=10127

From noreply at sourceforge.net Thu May 13 22:48:16 2010
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 13 May 2010 20:48:16 +0000
Subject: [Expat-bugs] [ expat-Bugs-3000537 ] expat 2.0.1 unpacking fails on ubuntu 10.04, fedora 12/13
Message-ID:

Bugs item #3000537, was opened at 2010-05-12 12:03
Message generated for change (Settings changed) made by kwaclaw
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=110127&aid=3000537&group_id=10127

Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update.

Category: Build control
Group: None
Status: Open
Resolution: None
>Priority: 5
Private: No
Submitted By: playya (playya)
Assigned to: Nobody/Anonymous (nobody)
Summary: expat 2.0.1 unpacking fails on ubuntu 10.04, fedora 12/13

Initial Comment:
If you try to unpack the tar.gz you get the following error:

gzip: expat-2.0.1.tar.gz: invalid compressed data--crc error

Does anyone know which tool/version was used to generate the archive, so the problem can be isolated?
Would it be possible to repack the archive?
Additional information:
* This bug only affects 32-bit systems
* launchpad bug: http://bugs.launchpad.net/fedora/+source/gzip/+bug/524366

----------------------------------------------------------------------

Comment By: Karl Waclawek (kwaclaw)
Date: 2010-05-13 16:47

Message:
Following the link in the original post, this does not seem to be an issue with the archive, but rather with a specific build of gzip.

----------------------------------------------------------------------

Comment By: Christian Anke (carcinoma)
Date: 2010-05-13 15:37

Message:
Have the same problem. Re-downloaded it, tried another mirror, same thing.

----------------------------------------------------------------------

Comment By: Karl Waclawek (kwaclaw)
Date: 2010-05-12 13:55

Message:
Have you tried re-downloading the file?

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=110127&aid=3000537&group_id=10127

From juraj.ivancic at gmail.com Sat May 15 13:30:54 2010
From: juraj.ivancic at gmail.com (Juraj Ivančić)
Date: Sat, 15 May 2010 13:30:54 +0200
Subject: [Expat-bugs] Expat occasionaly cropping multibyte character strings
Message-ID:

Expat does not handle multibyte characters correctly.
Steps to reproduce this behaviour:

1) You need an input XML in e.g. UTF-8 encoding which
contains some multibyte characters (e.g. Cyrillic characters)

2) Create an XML parser and feed it the input file, but ensure
that the buffer breaks somewhere in the middle of a multibyte string.
(To make sure, feed the parser one byte at a time.)

Say the input file contains:
'... ???? ????? ...'

and it gets buffered like this:

Buffer1: '... ???? '
Buffer2: '????? ...'

The Expat parser will, when it finishes parsing Buffer1, invoke the character
data handler with only partial ('???? ') data, instead of waiting
for the rest of the input. I think this is a bug as it only manifests
when multibyte characters appear.

I managed to correct this behaviour. Patch attached. This made my input
file parse correctly, but note that my use case only had multibyte
characters as element values, not as element names or attribute
names/values. I'd imagine that some of these might also suffer from a
similar issue.

I also created a test case which demonstrates this behaviour.

Please let me know if more information is needed.

HTH,
Juraj Ivančić

P.S. The patch is made against the wxWidgets fork of the Expat project. I did
however verify with Expat CVS: the issue is still there.

P.P.S. I also filed this issue in the wxWidgets Trac:
http://trac.wxwidgets.org/ticket/12053
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: expat_mb_fix.diff
URL:
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: expatbug.cpp
URL:

From karl at waclawek.net Sat May 15 17:43:24 2010
From: karl at waclawek.net (Karl Waclawek)
Date: Sat, 15 May 2010 11:43:24 -0400
Subject: [Expat-bugs] Expat occasionaly cropping multibyte character strings
In-Reply-To:
References:
Message-ID: <4BEEC11C.7090101@waclawek.net>

On 15/05/2010 7:30 AM, Juraj Ivančić wrote:
> Expat does not handle multibyte characters correctly.
> Steps to reproduce this behaviour:
>
> 1) You need an input XML in e.g. UTF-8 encoding which
> contains some multibyte characters (e.g. Cyrillic characters)
>
> 2) Create an XML parser and feed it the input file, but ensure
> that the buffer breaks somewhere in the middle of a multibyte string.
> (To make sure, feed the parser one byte at a time.)
>
> Say the input file contains:
> '... ???? ????? ...'
>
> and it gets buffered like this:
>
> Buffer1: '... ???? '
> Buffer2: '????? ...'
>
> The Expat parser will, when it finishes parsing Buffer1, invoke the character
> data handler with only partial ('???? ') data, instead of waiting
> for the rest of the input. I think this is a bug as it only manifests
> when multibyte characters appear.

The way you describe it, it is not a bug. Expat does not guarantee
reporting the text between element tags as one string.
It would be a bug if Expat broke a single multi-byte character up into
two or more parts.

Karl
-------------- next part --------------
A non-text attachment was scrubbed...
Name: karl.vcf
Type: text/x-vcard
Size: 179 bytes
Desc: not available
URL:

From juraj.ivancic at gmail.com Mon May 17 23:13:37 2010
From: juraj.ivancic at gmail.com (Juraj Ivančić)
Date: Mon, 17 May 2010 23:13:37 +0200
Subject: [Expat-bugs] Expat occasionaly cropping multibyte character strings
In-Reply-To: <4BEEC11C.7090101@waclawek.net>
References: <4BEEC11C.7090101@waclawek.net>
Message-ID:

On 15.5.2010 17:43, Karl Waclawek wrote:
> The way you describe it, it is not a bug. Expat does not guarantee
> reporting the text between element tags as one string.
> It would be a bug if Expat broke a single multi-byte character up into
> two or more parts.

Yes, after further digging I found that the character data handler must be
prepared to receive strings piece by piece. I (incorrectly) expected
that values would arrive complete, as they do for element names
and attributes.

Sorry for the noise.

From noreply at sourceforge.net Tue May 18 01:07:56 2010
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 17 May 2010 23:07:56 +0000
Subject: [Expat-bugs] [ expat-Bugs-3000537 ] expat 2.0.1 unpacking fails on ubuntu 10.04, fedora 12/13
Message-ID:

Bugs item #3000537, was opened at 2010-05-12 18:03
Message generated for change (Comment added) made by playya
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=110127&aid=3000537&group_id=10127

Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update.

Category: Build control
Group: None
>Status: Closed
Resolution: None
Priority: 5
Private: No
Submitted By: playya (playya)
Assigned to: Nobody/Anonymous (nobody)
Summary: expat 2.0.1 unpacking fails on ubuntu 10.04, fedora 12/13

Initial Comment:
If you try to unpack the tar.gz you get the following error:

gzip: expat-2.0.1.tar.gz: invalid compressed data--crc error

Does anyone know which tool/version was used to generate the archive, so the problem can be isolated?
Would it be possible to repack the archive?

Additional information:
* This bug only affects 32-bit systems
* launchpad bug: http://bugs.launchpad.net/fedora/+source/gzip/+bug/524366

----------------------------------------------------------------------

>Comment By: playya (playya)
Date: 2010-05-18 01:07

Message:
Another malformed archive has been detected.

----------------------------------------------------------------------

Comment By: Karl Waclawek (kwaclaw)
Date: 2010-05-13 22:47

Message:
Following the link in the original post, this does not seem to be an issue with the archive, but rather with a specific build of gzip.

----------------------------------------------------------------------

Comment By: Christian Anke (carcinoma)
Date: 2010-05-13 21:37

Message:
Have the same problem. Re-downloaded it, tried another mirror, same thing.

----------------------------------------------------------------------

Comment By: Karl Waclawek (kwaclaw)
Date: 2010-05-12 19:55

Message:
Have you tried re-downloading the file?

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=110127&aid=3000537&group_id=10127
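
Editorial note on the "cropping multibyte character strings" thread above: since Expat may report the text between two tags in several calls, a character data handler usually has to accumulate the pieces itself. Below is a minimal sketch of such an accumulator, not code from the thread or from the attached patch. The names TextBuf, textbuf_reset, textbuf_free and on_char_data are hypothetical. The handler has the same shape as Expat's XML_CharacterDataHandler, under the assumption that XML_Char is char (the usual UTF-8 build), and it deliberately does not include expat.h so that it compiles on its own.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical accumulator for character data that arrives in pieces. */
typedef struct {
    char  *data;    /* NUL-terminated accumulated text, NULL until first use */
    size_t len;     /* bytes stored, not counting the terminator */
    size_t cap;     /* bytes allocated for data */
    int    failed;  /* set to 1 if an allocation failed */
} TextBuf;

/* Forget the accumulated text but keep the allocation for reuse. */
void textbuf_reset(TextBuf *b)
{
    b->len = 0;
    if (b->data != NULL)
        b->data[0] = '\0';
}

/* Release the allocation and return the buffer to its initial state. */
void textbuf_free(TextBuf *b)
{
    free(b->data);
    b->data = NULL;
    b->len = 0;
    b->cap = 0;
    b->failed = 0;
}

/* Same shape as an Expat character data handler (assuming XML_Char is
 * char): s is NOT NUL-terminated and len is a byte count. Each call
 * appends one piece to the buffer passed as userData. */
void on_char_data(void *userData, const char *s, int len)
{
    TextBuf *b = (TextBuf *)userData;
    size_t need;

    assert(len >= 0);
    if (b->failed)
        return;

    need = b->len + (size_t)len + 1;      /* +1 for the terminator */
    if (need > b->cap) {
        size_t cap = b->cap ? b->cap : 64;
        char *p;
        while (cap < need)
            cap *= 2;
        p = realloc(b->data, cap);
        if (p == NULL) {
            b->failed = 1;                /* keep the old data intact */
            return;
        }
        b->data = p;
        b->cap = cap;
    }
    memcpy(b->data + b->len, s, (size_t)len);
    b->len += (size_t)len;
    b->data[b->len] = '\0';
}
```

In a real program one would presumably register the handler with XML_SetCharacterDataHandler, pass the TextBuf through XML_SetUserData, and read and reset the buffer in the end-element handler, where the text of the element is known to be complete. As Karl notes, Expat should not split a single multi-byte character across two calls, so plain byte-wise concatenation is enough here.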