From mohammed_rasmy at hotmail.com Thu Mar 2 16:57:11 2006 From: mohammed_rasmy at hotmail.com (Mohammed Rasmy) Date: Thu, 02 Mar 2006 17:57:11 +0200 Subject: [Expat-discuss] piecewise parsing Message-ID: Hello, I understand that expat can do piece-wise parsing with different parts of an XML document but I can't seem to get it to work. I read XML data from a socket using HTTP and the moment I can a response I call XML_Parse() with the data and the size of data received. This part of the document is parsed successfully but if I call XML_Parse() once more on the next part received from the socket (which definitely has a part of it attached to the first part), parsing fails... Am I missing something? Should I call some other function to make expat aware of my usage of piece-wise parsing? Please advise. Thank you. _________________________________________________________________ Express yourself instantly with MSN Messenger! Download today it's FREE! http://messenger.msn.click-url.com/go/onm00200471ave/direct/01/ From weigelt at metux.de Fri Mar 3 11:31:05 2006 From: weigelt at metux.de (Enrico Weigelt) Date: Fri, 3 Mar 2006 11:31:05 +0100 Subject: [Expat-discuss] Expat 2.0 released In-Reply-To: References: Message-ID: <20060303103105.GA17673@nibiru.local> hi folks, I'm still very unhappy with this version numbering ... calling an bugfixed release the next major version seems very irritating to me. People normally expect something quite different/new with another major release, at least some major improvements. I still don't see, why we couldn't stay in the 1.9.x line and simply fix things as problems show up ... cu -- --------------------------------------------------------------------- Enrico Weigelt == metux IT service phone: +49 36207 519931 www: http://www.metux.de/ fax: +49 36207 519932 email: contact at metux.de cellphone: +49 174 7066481 --------------------------------------------------------------------- -- DSL ab 0 Euro. -- statische IP -- UUCP -- Hosting -- Webshops -- --------------------------------------------------------------------- From karl at waclawek.net Fri Mar 3 13:54:44 2006 From: karl at waclawek.net (Karl Waclawek) Date: Fri, 03 Mar 2006 07:54:44 -0500 Subject: [Expat-discuss] Expat 2.0 released In-Reply-To: <20060303103105.GA17673@nibiru.local> References: <20060303103105.GA17673@nibiru.local> Message-ID: <44083C94.9080309@waclawek.net> Enrico Weigelt wrote: > hi folks, > > > I'm still very unhappy with this version numbering ... > calling an bugfixed release the next major version seems > very irritating to me. > > People normally expect something quite different/new with > another major release, at least some major improvements. > > I still don't see, why we couldn't stay in the 1.9.x line > and simply fix things as problems show up ... > > > Although I didn't come up with it, it was always clear to me that the 1.95.XX line of releases was the 2.0 beta cycle, more or less. That is already obvious from reading James Clark's original web site: http://www.jclark.com/xml/expat.html Karl From xcross at us.ibm.com Sat Mar 4 18:00:48 2006 From: xcross at us.ibm.com (Chris Cross) Date: Sat, 4 Mar 2006 10:00:48 -0700 Subject: [Expat-discuss] Chris Cross/West Palm Beach/IBM is out of the office. Message-ID: I will be out of the office starting 02/27/2006 and will not return until 03/07/2006. I'll be on vacation and have limited access to email during this time. From jzhang at ximpleware.com Mon Mar 6 03:11:29 2006 From: jzhang at ximpleware.com (jzhang at ximpleware.com) Date: Sun, 05 Mar 2006 18:11:29 -0800 Subject: [Expat-discuss] [ANN] VTD-XML Version 1.5 Released Message-ID: The VTD-XML project team is proud to announce the availability of both C and Java version 1.5 of VTD-XML, the next generation open-source XML parser that goes beyond DOM and SAX in terms of performance, memory usage and ease of use. The technical highlights of VTD-XML are: * Performance: the world's fastest XML parser, between 5x~10x faster than DOM * Memory Usage: 3x to 5x less than DOM, 1.3x~1.5x XML document size * Random access with built-in XPath support * A simple and intuitive API Other advanced features include: * Buffer reuse * Large document support (2GByte) * Incremental update * Hardware acceleration * Native XML indexing. For demos, latest benchmarks, related articles and software downloads, please visit http://vtd-xml.sf.net. Also let us know your thoughts and suggestions and help us improve VTD-XML. From nickmacd at gmail.com Mon Mar 6 17:18:46 2006 From: nickmacd at gmail.com (Nick MacDonald) Date: Mon, 6 Mar 2006 11:18:46 -0500 Subject: [Expat-discuss] [ANN] VTD-XML Version 1.5 Released In-Reply-To: References: Message-ID: This package is probably a waste of time for anyone who knows what they are doing. If you need DOM, get DOM. If you only need forward access to the content, and understand why SAX is the right way to do it, then you don't need a compressed binary representation of DOM, which is what this appears to be. And despite what they claim, there is NO way it can be faster than SAX (i.e. expat) as you need to read the file at least once to compress it into memory... and all the memory allocation can't be good for performance. Now, compared to DOM... maybe there is a need for this... but being that DOM is built into a lot of development environments, and doesn't come attached to a GPL license like this package... I think smell someone trying to get free advertising for their "consulting business" by broadcasting their unrelated project to uninterested individuals... On 3/5/06, jzhang at ximpleware.com wrote: > * Performance: the world's fastest XML parser, > between 5x~10x faster than DOM > * Memory Usage: 3x to 5x less than DOM, 1.3x~1.5x > XML document size > * Random access with built-in XPath support > * A simple and intuitive API > > Other advanced features include: > * Buffer reuse > * Large document support (2GByte) > * Incremental update > * Hardware acceleration > * Native XML indexing. From karl at waclawek.net Mon Mar 6 18:00:20 2006 From: karl at waclawek.net (Karl Waclawek) Date: Mon, 06 Mar 2006 12:00:20 -0500 Subject: [Expat-discuss] [ANN] VTD-XML Version 1.5 Released In-Reply-To: References: Message-ID: <440C6AA4.2090405@waclawek.net> jzhang at ximpleware.com wrote: > The VTD-XML project team is proud to announce the > availability of both C and Java version 1.5 of VTD-XML, > the next generation open-source XML parser that goes > beyond DOM and SAX in terms of performance, memory > usage and ease of use. > > The technical highlights of VTD-XML are: > > * Performance: the world's fastest XML parser, > between 5x~10x faster than DOM > * Memory Usage: 3x to 5x less than DOM, 1.3x~1.5x > XML document size > * Random access with built-in XPath support > * A simple and intuitive API > On the surface, this seems intriguing. It appars what VTD-XML is doing, is to build a kind of "pointer structure" consisting of "VTD-records" that allows navigating the document structure without having copied out anything. This may be an efficient alternative to DOM. I do not quite understand why it should be faster than SAX, or SAX-like parsers, especially in the C-world (Expat). Yes, maybe the initial creation of the VTD records is faster, but there is some memory allocation involved as well, and Expat already is very good on memory allocations. Now, consider that the input document is encoded in UTF-16, but your application needs UTF-8. You still need to convert, which involves memory allocations and copying, more so in Java then in C, as string objects in Java are immutable (no re-use of memory). So, for any practical SAX work, I doubt that there are advantages, even for small documents, unless one can use the original encoding. Karl From jzhang at ximpleware.com Mon Mar 6 18:17:01 2006 From: jzhang at ximpleware.com (Jimmy Zhang) Date: Mon, 6 Mar 2006 09:17:01 -0800 Subject: [Expat-discuss] [ANN] VTD-XML Version 1.5 Released References: Message-ID: <00b401c64141$ca51dc80$0d02a8c0@ximpleware> VTD is not DOM compressed... VTD-XML significantly outperforms Expat with NULL content handler, and offers random access... try it the performance will speak for itself... but it won't matter to who ever *enjoys* SAX style of programming... ----- Original Message ----- From: "Nick MacDonald" To: Cc: Sent: Monday, March 06, 2006 8:18 AM Subject: Re: [Expat-discuss] [ANN] VTD-XML Version 1.5 Released This package is probably a waste of time for anyone who knows what they are doing. If you need DOM, get DOM. If you only need forward access to the content, and understand why SAX is the right way to do it, then you don't need a compressed binary representation of DOM, which is what this appears to be. And despite what they claim, there is NO way it can be faster than SAX (i.e. expat) as you need to read the file at least once to compress it into memory... and all the memory allocation can't be good for performance. Now, compared to DOM... maybe there is a need for this... but being that DOM is built into a lot of development environments, and doesn't come attached to a GPL license like this package... I think smell someone trying to get free advertising for their "consulting business" by broadcasting their unrelated project to uninterested individuals... On 3/5/06, jzhang at ximpleware.com wrote: > * Performance: the world's fastest XML parser, > between 5x~10x faster than DOM > * Memory Usage: 3x to 5x less than DOM, 1.3x~1.5x > XML document size > * Random access with built-in XPath support > * A simple and intuitive API > > Other advanced features include: > * Buffer reuse > * Large document support (2GByte) > * Incremental update > * Hardware acceleration > * Native XML indexing. From karl at waclawek.net Mon Mar 6 21:41:06 2006 From: karl at waclawek.net (Karl Waclawek) Date: Mon, 06 Mar 2006 15:41:06 -0500 Subject: [Expat-discuss] [ANN] VTD-XML Version 1.5 Released In-Reply-To: <00b401c64141$ca51dc80$0d02a8c0@ximpleware> References: <00b401c64141$ca51dc80$0d02a8c0@ximpleware> Message-ID: <440C9E62.9020708@waclawek.net> Jimmy Zhang wrote: > VTD is not DOM compressed... > VTD-XML significantly outperforms Expat with NULL content > handler, Which benchmark shows that result? I ran 3 test files through benchmark(_BR).java, and Expat's benchmark (no handlers, 256k working buffer). Environment: Windows XP, jre 1.5.0_04 (client, don't have server), Athlon 2500, 1GB RAM. File 1: 1.1 MB, UTF-8, lots of markup: benchmark.java: 67ms benchmark_BR.java: 51ms Expat: 53ms (54ms with namespace processing on) File 2: 1.7MB, UTF-16, lots of PCDATA: benchmark.java: 33ms benchmark_BR.java: 32ms Expat: 17.5ms (17.8ms with namespace processing on) File 3: 198KB, ISO-8859-1 (REC-xml-20001006.xml, the XML specification as XML file) VTD-XML failed parsing the XML specification (which is an XML document) with error: Not wellformed -->com.ximpleware.ParseException: Other Error: Unrecognized ch after I recommend that you choose a wider array of documents If you put run the test using *server* JVM, VTD-XML blows away Expat.... For some reason I wasn't able to receive you email, but instead a daily digest... Cheers, Jz From jzhang at ximpleware.com Tue Mar 7 19:11:01 2006 From: jzhang at ximpleware.com (Jimmy Zhang) Date: Tue, 7 Mar 2006 10:11:01 -0800 Subject: [Expat-discuss] C compilation flag Message-ID: <00c001c64212$7fbcc500$0d02a8c0@ximpleware> Can you make the comparison open a bit... we have benchmarked expat extensively, VTD-XML in general leads by a big margin... for cases where there are a large number of text, such as RSS file, Expat occasionally outperforms VTD-XML, which is quite a feat... VTD-XML's fundamental improvements came from its memory allocation strategy: 1. use large memory blocks... (64k for many VTD records all at once) 2. reuse memory blocks ... ( because VTD records are constant in length) Cheers, Jimmy From karl at waclawek.net Tue Mar 7 19:23:50 2006 From: karl at waclawek.net (Karl Waclawek) Date: Tue, 07 Mar 2006 13:23:50 -0500 Subject: [Expat-discuss] vtd-xml 1.5 In-Reply-To: <005901c64210$c1ff45c0$0d02a8c0@ximpleware> References: <005901c64210$c1ff45c0$0d02a8c0@ximpleware> Message-ID: <440DCFB6.70306@waclawek.net> Jimmy Zhang wrote: > I recommend that you choose a wider array of documents Just tested with bioinfo.xml (one of your sample files): benchmark.java: 1.12ms benchmark_BR.java: 1.06ms Expat (average of 700 iterations): 0.94ms > > If you put run the test using *server* JVM, Don't have that. I have only J2SE. I get Error: no `server' JVM at `C:\Program Files\Java\jre1.5.0_04\bin\server\jvm.dll'. > > VTD-XML blows away Expat.... Well, not so far. > > For some reason I wasn't able to receive you email, but instead a > daily digest... I only mail to the list. Regards, Karl From karl at waclawek.net Tue Mar 7 19:43:15 2006 From: karl at waclawek.net (Karl Waclawek) Date: Tue, 07 Mar 2006 13:43:15 -0500 Subject: [Expat-discuss] C compilation flag In-Reply-To: <00c001c64212$7fbcc500$0d02a8c0@ximpleware> References: <00c001c64212$7fbcc500$0d02a8c0@ximpleware> Message-ID: <440DD443.3030204@waclawek.net> Jimmy Zhang wrote: > Can you make the comparison open a bit... > What do you mean with open? The Expat benchmark application is part of the Expat distribution, open for anyone to inspect. > we have benchmarked expat extensively, VTD-XML in general > leads by a big margin... > for cases where there are a large number of text, such as RSS file, > Expat occasionally outperforms VTD-XML, which is quite a feat... > Then show me the Expat benchmarks you have done. > VTD-XML's fundamental improvements came from its memory > allocation strategy: > > 1. use large memory blocks... (64k for many VTD records all at once) > 2. reuse memory blocks ... ( because VTD records are constant in length) > Expat reuses memory a lot. You would be surprised to see how seldom Expat actually allocates memory. Regards, Karl From chris at marrin.com Wed Mar 8 00:01:20 2006 From: chris at marrin.com (Chris Marrin) Date: Tue, 07 Mar 2006 15:01:20 -0800 Subject: [Expat-discuss] Attributes do not parse as CDATA??? Message-ID: <440E10C0.1020300@marrin.com> If I make this XML: Expat complains that the '<' in the string is illegal. Looking at the scanAtts() function it is clear why. When parsing the attribute value, it scans for '<' and '&', and of course the closing quote. In other words, it is parsing the attribute value. But according to the XML 1.0 spec: "All attributes for which no declaration has been read SHOULD be treated by a non-validating processor as if declared CDATA" Now, I know that SHOULD does not mean MUST, but it seems that expat should at least be able to consider attributes as being CDATA. Yet I can see no way to disable to parsing of '<' and '&'. Am I missing something, either in the way expat works, or in the XML spec? -- chris marrin ,""$, chris at marrin.com b` $ ,,. mP b' , 1$' ,.` ,b` ,` :$$' ,|` mP ,` ,mm ,b" b" ,` ,mm m$$ ,m ,`P$$ m$` ,b` .` ,mm ,'|$P ,|"1$` ,b$P ,` :$1 b$` ,$: :,`` |$$ ,` $$` ,|` ,$$,,`"$$ .` :$| b$| _m$`,:` :$1 ,` ,$Pm|` ` :$$,..;"' |$: P$b, _;b$$b$1" |$$ ,` ,$$" ``' $$ ```"```'" `"` `""` ""` ,P` "As a general rule,don't solve puzzles that open portals to Hell"' From jzhang at ximpleware.com Wed Mar 8 18:34:04 2006 From: jzhang at ximpleware.com (Jimmy Zhang) Date: Wed, 8 Mar 2006 09:34:04 -0800 Subject: [Expat-discuss] Server JVM Message-ID: <007401c642d6$8137f340$0d02a8c0@ximpleware> I can help you set up server JVM, just a dll file you need to put in the right place... let me know if you need any help Why VTD-XML outperforms SAX is pretty thouroughly answered by FAQ in VTD-XML's web site http://vtd-xml.sf.net/faq.html Message: 3 Date: Tue, 07 Mar 2006 13:23:50 -0500 From: Karl Waclawek Subject: Re: [Expat-discuss] vtd-xml 1.5 To: expat-discuss at libexpat.org Message-ID: <440DCFB6.70306 at waclawek.net> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Jimmy Zhang wrote: > I recommend that you choose a wider array of documents Just tested with bioinfo.xml (one of your sample files): benchmark.java: 1.12ms benchmark_BR.java: 1.06ms Expat (average of 700 iterations): 0.94ms > > If you put run the test using *server* JVM, Don't have that. I have only J2SE. I get Error: no `server' JVM at `C:\Program Files\Java\jre1.5.0_04\bin\server\jvm.dll'. > > VTD-XML blows away Expat.... Well, not so far. > > For some reason I wasn't able to receive you email, but instead a > daily digest... I only mail to the list. Regards, Karl From DOREL at fr.ibm.com Wed Mar 8 15:50:40 2006 From: DOREL at fr.ibm.com (Alain Dorel) Date: Wed, 8 Mar 2006 15:50:40 +0100 Subject: [Expat-discuss] ld: 0711-317 ERROR: Undefined symbol: .XML_Parse error Message-ID: Hello, We are using the expat module and XML_Parser module from 1 year without problem. During the installation of a new machine, the XML-Parser-2.34 make command fails during the link. We know that these externals symbols are in the libexpat.a generated by the expat-1.95.8 installation. The libexpat.a has been successfully generated and is present on the directory "/usr/opt/lib". Do you already know if there is a known problem on AIX 5.3 PowerPC 64 bits processor installation? LD_RUN_PATH="/usr/opt/lib" ld -b64 -bhalt:4 -bM:SRE -bI:/usr/opt/perl5/lib64/5.8.2/aix-thread-multi-64all/CORE/perl.exp -bE:Expat.exp -bnoentry -lpthreads -lc_r Expat.o -o ../blib/arch/auto/XML/Parser/Expat/Expat.so -L/usr/opt/lib -lexpat ld: 0711-317 ERROR: Undefined symbol: .XML_Parse ld: 0711-317 ERROR: Undefined symbol: .XML_SetNamespaceDeclHandler Regards, Alain Dorel SCR & URT & SCM & SCREEN IBM Global Services - Strategic Solutions Enablement (SSE) Technical Solutions Development (TSD) - Security & Compliance TSO Tel: +33 4-92-11-5969 (internal 36-5969) IBM La Gaude, B1 1N32, BP 14 mailto:dorel at fr.ibm.com From xcross at us.ibm.com Wed Mar 8 18:02:02 2006 From: xcross at us.ibm.com (Chris Cross) Date: Wed, 8 Mar 2006 10:02:02 -0700 Subject: [Expat-discuss] Chris Cross/West Palm Beach/IBM is out of the office. Message-ID: I will be out of the office starting 03/08/2006 and will not return until 03/13/2006. I'm traveling due to family emergency and will have limited access to email. From karl at waclawek.net Thu Mar 9 00:28:41 2006 From: karl at waclawek.net (Karl Waclawek) Date: Wed, 08 Mar 2006 18:28:41 -0500 Subject: [Expat-discuss] Server JVM In-Reply-To: <007401c642d6$8137f340$0d02a8c0@ximpleware> References: <007401c642d6$8137f340$0d02a8c0@ximpleware> Message-ID: <440F68A9.2010309@waclawek.net> Jimmy Zhang wrote: > I can help you set up server JVM, just a dll file you need to put in > the right place... let me know if you need any help I installed the JDK and tested with the server JVM. It does indeed increase performance significantly, beating Expat on files with lots of markup vs character data (27ms for benchmark_BR vs 49ms for Expat). Question, why should one not always use the "buffer-reuse" version? Expat was still faster on the file with lots of character data (13ms vs 17ms for benchmark_BR). I recompiled the Expat library with all optimizations for speed. Now given that vtd-xml is quite fast, you still have to prove two things: 1) It does everything a conforming non-validating parser must do. How many of the tests in the XML-Test-Suite does it pass? Expat passes all the tests for a non-validating parser except a handful that are optional or in doubt. Example: vtd-xml failed to detect duplicate attributes when they had different prefixes pointing to the same namespace. Completing vtd-xml to conform as well as Expat may well add more overhead. 2) To which degree does it pay off to delay the work of encoding transformation to the point when the data is actually needed, as in a real-world application? If the document is encoded in UTF-8 and your application requires UTF-16, then this is already done by Expat, but for vtd-xml this work still has to be performed. Whether it will be preferable over Expat for documents where memory usage is not an issue, will depend on the answers to these questions. Overall I do like your approach, and I think it is excellent for random access with an in-memory document. It may also do very well on SOAP processing for smaller messages. Karl From markfhaigh at sbcglobal.net Thu Mar 9 23:52:39 2006 From: markfhaigh at sbcglobal.net (Mark F. Haigh) Date: Thu, 09 Mar 2006 14:52:39 -0800 Subject: [Expat-discuss] ld: 0711-317 ERROR: Undefined symbol: .XML_Parse error Message-ID: <4410B1B7.8050707@sbcglobal.net> > > >Hello, > >We are using the expat module and XML_Parser module from 1 year without >problem. > >During the installation of a new machine, the XML-Parser-2.34 make command >fails during the link. >We know that these externals symbols are in the libexpat.a generated by the >expat-1.95.8 installation. > >The libexpat.a has been successfully generated and is present on the >directory "/usr/opt/lib". > > > You're going to have to be more specific if you want help. In particular, I need the exact commands you use to build both expat and XML-Parser, and anything else you're doing to the filesystem (ie moving/copying files around, etc). Also include version numbers for your working configuration, and whether or not there's a pre-existing system expat installation lying around (it looks like it). >Do you already know if there is a known problem on AIX 5.3 PowerPC 64 bits >processor installation? > > > It's known to work, at least using gcc. The AIX box I have access to does not have an IBM compiler license. Mark F. Haigh markfhaigh at sbcglobal.net From jzhang at ximpleware.com Fri Mar 10 03:00:10 2006 From: jzhang at ximpleware.com (Jimmy Zhang) Date: Thu, 9 Mar 2006 18:00:10 -0800 Subject: [Expat-discuss] Expat-discuss Digest, Vol 72, Issue 7 References: Message-ID: <040201c643e6$5fa70c00$0d02a8c0@ximpleware> When VTD-XML project got officially started, the DTD part and external references were already deprecated somewhat... no major vocabularies seem to use the external refernces anymore... Buffer reuse is introduced in the latest releases, maybe it should always be used, the performance improves starts from the second time VTDGen parses XML documents.. We are adding more documents on this feature right now... External references not withstanding, VTD-XML conforms to, and passes, every XML test suite, VTD-XML handles namespace problem a little different than DOM or SAX, the error checking is delayed until during navigation, the prefix induced attribute duplicate problem is quite unlikely to concern anyone, and is in fact part of the problems of XML namespace spec... The cost of encoding transformation ranges from zero to negligible, most are ascii anyway. One can argue that, to process XML, SAX parsers need to be used at least twice... first time is to scan the document from start to end, just to check wellformedness, the second pass is to perform the application processing... otherwise, what happens if the application perform 10 transactions but then discover that the last angle bracket of the XMl file is missing.?? roll back those 10 transactions ?? So should we reduce the SAX perform by 50% just to be fair comparison with VTD-XML?? and VTD-XML is still forward only and unpleasant to use... I don't see any comparison... Maybe the world has moved forward... maybe it is time to say good bye to SAX?? > Date: Wed, 08 Mar 2006 18:28:41 -0500 > From: Karl Waclawek > Subject: Re: [Expat-discuss] Server JVM > To: expat-discuss at libexpat.org > Message-ID: <440F68A9.2010309 at waclawek.net> > Content-Type: text/plain; charset=ISO-8859-1; format=flowed > > Jimmy Zhang wrote: >> I can help you set up server JVM, just a dll file you need to put in >> the right place... let me know if you need any help > > I installed the JDK and tested with the server JVM. > It does indeed increase performance significantly, beating Expat on files > with lots of markup vs character data (27ms for benchmark_BR vs 49ms for > Expat). > Question, why should one not always use the "buffer-reuse" version? > > Expat was still faster on the file with lots of character data (13ms vs > 17ms for benchmark_BR). > I recompiled the Expat library with all optimizations for speed. > > Now given that vtd-xml is quite fast, you still have to prove two things: > > 1) It does everything a conforming non-validating parser must do. How > many of > the tests in the XML-Test-Suite does it pass? Expat passes all > the tests for a non-validating parser except a handful that are optional > or in doubt. > Example: vtd-xml failed to detect duplicate attributes when they had > different prefixes pointing to the same namespace. Completing vtd-xml > to conform as well as Expat may well add more overhead. > > 2) To which degree does it pay off to delay the work of encoding > transformation > to the point when the data is actually needed, as in a real-world > application? > If the document is encoded in UTF-8 and your application requires UTF-16, > then this is already done by Expat, but for vtd-xml this work still has > to be performed. > > Whether it will be preferable over Expat for documents where memory > usage is not an issue, will depend on the answers to these questions. > > Overall I do like your approach, and I think it is excellent for random > access with an > in-memory document. It may also do very well on SOAP processing for > smaller messages. > > Karl > > > > ------------------------------ > > _______________________________________________ > Expat-discuss mailing list > Expat-discuss at libexpat.org > http://mail.libexpat.org/mailman/listinfo/expat-discuss > > > End of Expat-discuss Digest, Vol 72, Issue 7 > ******************************************** > From nickmacd at gmail.com Fri Mar 10 04:26:21 2006 From: nickmacd at gmail.com (Nick MacDonald) Date: Thu, 9 Mar 2006 22:26:21 -0500 Subject: [Expat-discuss] Expat-discuss Digest, Vol 72, Issue 7 In-Reply-To: <040201c643e6$5fa70c00$0d02a8c0@ximpleware> References: <040201c643e6$5fa70c00$0d02a8c0@ximpleware> Message-ID: > Maybe the world has moved forward... maybe it is time to say good bye to > SAX?? Maybe Jimmy should say goodbye to posting on this list?? Please? Go make your own list and annoy people there. > and VTD-XML is still forward only and unpleasant to use... Quite probably! > One can argue that, to process XML, SAX parsers need to be used at least > twice... > first time is to scan the document from start to end, just to check > wellformedness, the > second pass is to perform the application processing... otherwise, what > happens > if the application perform 10 transactions but then discover that the last > angle > bracket of the XMl file is missing.?? roll back those 10 transactions ?? So > should we > reduce the SAX perform by 50% just to be fair comparison with VTD-XML?? You have a valid point, but then you would because apparently you designed your product for transaction oriented processing... This is not how I use expat and I find it absolutely perfect for my needs. What are those needs you ask... two fold: - configuration files - code generation In both cases, I only need one pass through the XML to achieve my goals... and memory efficiency is way more important to me (in my embedded systems and designs) than any possible benefit a "compressed DOM representation" can give me. Please let it rest and go write some more code for your project and leave the rest of us in peace to use SAX for what it is best at: memory and time efficient XML parsing. From karl at waclawek.net Fri Mar 10 04:33:03 2006 From: karl at waclawek.net (Karl Waclawek) Date: Thu, 09 Mar 2006 22:33:03 -0500 Subject: [Expat-discuss] Expat-discuss Digest, Vol 72, Issue 7 In-Reply-To: <040201c643e6$5fa70c00$0d02a8c0@ximpleware> References: <040201c643e6$5fa70c00$0d02a8c0@ximpleware> Message-ID: <4410F36F.7070801@waclawek.net> Jimmy Zhang wrote: > Buffer reuse is introduced in the latest releases, maybe it should always be > used, the > performance improves starts from the second time VTDGen parses XML > documents.. > The same document or any? > We are adding more documents on this feature right now... > > External references not withstanding, VTD-XML conforms to, and passes, every > XML > test suite, I recommend to put a script or small program on your web-site that runs vtd-xml through the official xml test suite. This will make potential users gain confidence with it. The SAX for .NET project has such an application, you can use it as a blue-print. The mere fact that after 10 minutes of trying I already discovered a problem does not bode well for your claim that VTD-XML conforms to, and passes, *every* XML test suite. > VTD-XML handles namespace problem a little different than DOM or > SAX, > the error checking is delayed until during navigation, the prefix induced > attribute duplicate > problem is quite unlikely to concern anyone, and is in fact part of the > problems of XML > namespace spec... > Well, being 99% correct is not good enough. > The cost of encoding transformation ranges from zero to negligible, most are > ascii anyway. > Depends on the amount of character data the application retrieves. > One can argue that, to process XML, SAX parsers need to be used at least > twice... > first time is to scan the document from start to end, just to check > wellformedness, the > second pass is to perform the application processing... otherwise, what > happens > if the application perform 10 transactions but then discover that the last > angle > bracket of the XMl file is missing.?? roll back those 10 transactions ?? Yes, that is what transaction processing allows you to do. Or do you suggest not to use transactions just because you know the XML is well-formed? > So > should we > reduce the SAX perform by 50% just to be fair comparison with VTD-XML?? > That argument makes no sense to me. > and VTD-XML is still forward only and unpleasant to use... > But especially good at processing large giga-byte sized data dumps. Expat is routinely used for such purposes. > I don't see any comparison... > > Maybe the world has moved forward... maybe it is time to say good bye to > SAX?? > > For some applications, I agree. Karl From jzhang at ximpleware.com Fri Mar 10 05:19:20 2006 From: jzhang at ximpleware.com (Jimmy Zhang) Date: Thu, 9 Mar 2006 20:19:20 -0800 Subject: [Expat-discuss] Expat-discuss Digest, Vol 72, Issue 7 References: <040201c643e6$5fa70c00$0d02a8c0@ximpleware> Message-ID: <007b01c643f9$d0259740$0d02a8c0@ximpleware> I doubt you speak for everyone on the list... I actually find a lot of the comments intellectually challenging... ----- Original Message ----- From: "Nick MacDonald" To: "Jimmy Zhang" Cc: Sent: Thursday, March 09, 2006 7:26 PM Subject: Re: [Expat-discuss] Expat-discuss Digest, Vol 72, Issue 7 > Maybe the world has moved forward... maybe it is time to say good bye to > SAX?? Maybe Jimmy should say goodbye to posting on this list?? Please? Go make your own list and annoy people there. > and VTD-XML is still forward only and unpleasant to use... Quite probably! > One can argue that, to process XML, SAX parsers need to be used at least > twice... > first time is to scan the document from start to end, just to check > wellformedness, the > second pass is to perform the application processing... otherwise, what > happens > if the application perform 10 transactions but then discover that the > last > angle > bracket of the XMl file is missing.?? roll back those 10 transactions ?? > So > should we > reduce the SAX perform by 50% just to be fair comparison with VTD-XML?? You have a valid point, but then you would because apparently you designed your product for transaction oriented processing... This is not how I use expat and I find it absolutely perfect for my needs. What are those needs you ask... two fold: - configuration files - code generation In both cases, I only need one pass through the XML to achieve my goals... and memory efficiency is way more important to me (in my embedded systems and designs) than any possible benefit a "compressed DOM representation" can give me. Please let it rest and go write some more code for your project and leave the rest of us in peace to use SAX for what it is best at: memory and time efficient XML parsing. From regis.st-gelais at laubrass.com Fri Mar 10 13:18:25 2006 From: regis.st-gelais at laubrass.com (=?iso-8859-1?Q?R=E9gis_St-Gelais_=28Laubrass=29?=) Date: Fri, 10 Mar 2006 07:18:25 -0500 Subject: [Expat-discuss] vtd-xml 1.5 References: <005901c64210$c1ff45c0$0d02a8c0@ximpleware> Message-ID: <002f01c6443c$bbf267b0$6400a8c0@laubrasssag1> What is the point in discussing vtd-xml on that list? is'n it the Expat list? Regis From mark.serlin at ashgaming.com Fri Mar 10 16:49:43 2006 From: mark.serlin at ashgaming.com (Mark Serlin) Date: Fri, 10 Mar 2006 15:49:43 -0000 Subject: [Expat-discuss] Comments are a killer? Message-ID: Hi all My version of expat 1.95.8 falls foul of any comments like this but not like this Is this expected? ___________________________________ Mark Serlin Senior Developer - Ash Gaming Ltd Profitable Games For Interactive Platforms * 59 Charlotte Road, London, EC2A 3QW * +44 (0)207 729 5328 * ashgaming.com -- No virus found in this outgoing message. Checked by AVG Free Edition. Version: 7.1.375 / Virus Database: 268.2.1/278 - Release Date: 09/03/2006 From regis.st-gelais at laubrass.com Fri Mar 10 17:27:46 2006 From: regis.st-gelais at laubrass.com (=?iso-8859-1?Q?R=E9gis_St-Gelais_=28Laubrass=29?=) Date: Fri, 10 Mar 2006 11:27:46 -0500 Subject: [Expat-discuss] Comments are a killer? References: Message-ID: <065301c6445f$9178e900$6400a8c0@laubrasssag1> ----- Original Message ----- From: Mark Serlin To: expat-discuss at libexpat.org Sent: Friday, March 10, 2006 10:49 AM Subject: [Expat-discuss] Comments are a killer? > >Hi all > >My version of expat 1.95.8 falls foul of any comments like this > > > >but not like this > > > >Is this expected? Mark, I don't know if this is normal beavior for an XML parser but I can tell you that my version Expat 2.0.0 also gives me a not well formed error message. Regis St-Gelais www.laubrass.com From lists at appinf.com Fri Mar 10 20:10:23 2006 From: lists at appinf.com (Guenter Obiltschnig) Date: Fri, 10 Mar 2006 20:10:23 +0100 Subject: [Expat-discuss] Comments are a killer? In-Reply-To: <065301c6445f$9178e900$6400a8c0@laubrasssag1> References: <065301c6445f$9178e900$6400a8c0@laubrasssag1> Message-ID: <5F878E3F-6CA7-4245-9C01-AC56ED281D29@appinf.com> Hi, from the XML 1.0 recommendation: 2.5 Comments [Definition: Comments may appear anywhere in a document outside other markup; in addition, they may appear within the document type declaration at places allowed by the grammar. They are not part of the document's character data; an XML processor may, but need not, make it possible for an application to retrieve the text of comments. For compatibility, the string "--" (double-hyphen) must not occur within comments.] Parameter entity references must not be recognized within comments. There you have it. Best regards, G?nter On Mar 10, 2006, at 17:27 , R?gis St-Gelais (Laubrass) wrote: > ----- Original Message ----- > From: Mark Serlin > To: expat-discuss at libexpat.org > Sent: Friday, March 10, 2006 10:49 AM > Subject: [Expat-discuss] Comments are a killer? >> >> Hi all >> >> My version of expat 1.95.8 falls foul of any comments like this >> >> >> >> but not like this >> >> >> >> Is this expected? > > Mark, > > I don't know if this is normal beavior for an XML parser but I can > tell you > that my version Expat 2.0.0 also gives me a not well formed error > message. > > Regis St-Gelais > www.laubrass.com > > > _______________________________________________ > Expat-discuss mailing list > Expat-discuss at libexpat.org > http://mail.libexpat.org/mailman/listinfo/expat-discuss -- G?nter Obiltschnig Applied Informatics guenter.obiltschnig at appinf.com http://www.appinf.com P: +43 4253 32596 M: +43 676 5166737 F: +43 4253 32096 -------------------------------------------------------- The C++ Portable Components: http://poco.appinf.com From weigelt at metux.de Mon Mar 13 05:54:52 2006 From: weigelt at metux.de (Enrico Weigelt) Date: Mon, 13 Mar 2006 05:54:52 +0100 Subject: [Expat-discuss] Expat 2.0 released In-Reply-To: <44083C94.9080309@waclawek.net> References: <20060303103105.GA17673@nibiru.local> <44083C94.9080309@waclawek.net> Message-ID: <20060313045452.GA12951@nibiru.local> * Karl Waclawek wrote: > Although I didn't come up with it, it was always clear to me that the > 1.95.XX line of releases was the 2.0 beta cycle, more or less. > That is already obvious from reading James Clark's original web site: > http://www.jclark.com/xml/expat.html Maybe, but that's quite unusual and not what a common user would expect. cu -- --------------------------------------------------------------------- Enrico Weigelt == metux IT service phone: +49 36207 519931 www: http://www.metux.de/ fax: +49 36207 519932 email: contact at metux.de cellphone: +49 174 7066481 --------------------------------------------------------------------- -- DSL ab 0 Euro. -- statische IP -- UUCP -- Hosting -- Webshops -- --------------------------------------------------------------------- From mark.serlin at ashgaming.com Mon Mar 13 10:58:48 2006 From: mark.serlin at ashgaming.com (Mark Serlin) Date: Mon, 13 Mar 2006 09:58:48 -0000 Subject: [Expat-discuss] Comments are a killer? Message-ID: Gosh who'd have thought? Thanks for taking the trouble to point that somewhat newbie error out to me! Mark -----Original Message----- From: expat-discuss-bounces at libexpat.org [mailto:expat-discuss-bounces at libexpat.org]On Behalf Of Guenter Obiltschnig Sent: Friday, March 10, 2006 7:10 PM To: expat-discuss at libexpat.org Subject: Re: [Expat-discuss] Comments are a killer? Hi, from the XML 1.0 recommendation: 2.5 Comments [Definition: Comments may appear anywhere in a document outside other markup; in addition, they may appear within the document type declaration at places allowed by the grammar. They are not part of the document's character data; an XML processor may, but need not, make it possible for an application to retrieve the text of comments. For compatibility, the string "--" (double-hyphen) must not occur within comments.] Parameter entity references must not be recognized within comments. There you have it. Best regards, G?nter On Mar 10, 2006, at 17:27 , R?gis St-Gelais (Laubrass) wrote: > ----- Original Message ----- > From: Mark Serlin > To: expat-discuss at libexpat.org > Sent: Friday, March 10, 2006 10:49 AM > Subject: [Expat-discuss] Comments are a killer? >> >> Hi all >> >> My version of expat 1.95.8 falls foul of any comments like this >> >> >> >> but not like this >> >> >> >> Is this expected? > > Mark, > > I don't know if this is normal beavior for an XML parser but I can > tell you > that my version Expat 2.0.0 also gives me a not well formed error > message. > > Regis St-Gelais > www.laubrass.com > > > _______________________________________________ > Expat-discuss mailing list > Expat-discuss at libexpat.org > http://mail.libexpat.org/mailman/listinfo/expat-discuss -- G?nter Obiltschnig Applied Informatics guenter.obiltschnig at appinf.com http://www.appinf.com P: +43 4253 32596 M: +43 676 5166737 F: +43 4253 32096 -------------------------------------------------------- The C++ Portable Components: http://poco.appinf.com _______________________________________________ Expat-discuss mailing list Expat-discuss at libexpat.org http://mail.libexpat.org/mailman/listinfo/expat-discuss -- No virus found in this incoming message. Checked by AVG Free Edition. Version: 7.1.375 / Virus Database: 268.2.1/279 - Release Date: 10/03/2006 -- No virus found in this outgoing message. Checked by AVG Free Edition. Version: 7.1.375 / Virus Database: 268.2.1/279 - Release Date: 10/03/2006 From srinivas.cheruku at gmail.com Wed Mar 15 11:28:27 2006 From: srinivas.cheruku at gmail.com (Srinivas Cheruku) Date: Wed, 15 Mar 2006 15:58:27 +0530 Subject: [Expat-discuss] XML Manipulation Message-ID: <4417EC4B.5000007@gmail.com> Hi, I understand that expat is used for XML Parsing. I need to manipulate the XML data read from a xml file by changing certain node values. Can i use expat for this type of XML manipulation? If so, can anyone direct me to the sample code for XML manipulation? Thanks and Regards, Srini From devyani.sapre at wipro.com Wed Mar 15 14:17:57 2006 From: devyani.sapre at wipro.com (devyani.sapre at wipro.com) Date: Wed, 15 Mar 2006 18:47:57 +0530 Subject: [Expat-discuss] Older version of Expat Message-ID: <21227C6FF0A66E459AC07C4DC1CE7F760148F00C@BLR-EC-MBX01.wipro.com> Hi All, Due to some reasons I have to use a very old version on EXPAT :-(, I suppose it is 0.9(that what I have been told). Can anyone tell me from where can I download this package or get some information about it. I am aleardy using that version but unfortunately there are not sample programs available with me. Hence I tried executing sample programs of EXPAT version 2.0 and was able to successfully compile by merely changing the header file name from expat.h to cmexpat.h But I faced a problem with XML_Parse() method. This method returns and int value which is for example programs is "XML_STATUS_ERROR". I tried searching in the list of error defined on James Clark site but unfortunately I could not find any info. Since I am using an older version of expat (0.9) it is not able to recognize this error code. Can I use another error code in place of this ?? I know there would be major changes from version 0.9 to 2.0 but can't help it. I have no go. Please help. Thanks in Advance Devyani The information contained in this electronic message and any attachments to this message are intended for the exclusive use of the addressee(s) and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you should not disseminate, distribute or copy this e-mail. Please notify the sender immediately and destroy all copies of this message and any attachments. WARNING: Computer viruses can be transmitted via email. The recipient should check this email and any attachments for the presence of viruses. The company accepts no liability for any damage caused by any virus transmitted by this email. www.wipro.com From nickmacd at gmail.com Wed Mar 15 16:33:45 2006 From: nickmacd at gmail.com (Nick MacDonald) Date: Wed, 15 Mar 2006 10:33:45 -0500 Subject: [Expat-discuss] XML Manipulation In-Reply-To: <4417EC4B.5000007@gmail.com> References: <4417EC4B.5000007@gmail.com> Message-ID: Note that eXpat is a SAX parser, which means it does not load the entire XML document into memory, and you basically get to see the document in one pass. If you need to have the whole document in memory for your processing, you'll probably want to look into another type of XML processor, such as a DOM based one that loads the whole XML document into memory, and provides the means to "walk" through it in memory. Assuming you can get by with the SAX limitations, there is some sample code included with the eXpat package, but there will probably not be enough there to get you near the finish line. You will need to write a bit of a state machine to track the incoming XML so that you know when you have found the things you're looking for. You'll probably also need to use a stack data structure to know the "path" at any given time as you are parsing your file. Some day someone should write a sample program to translate an XML file and include it with the basic eXpat package as an example... if such a thing were to be done that would be exactly where you'd want to start. Good luck, Nick On 3/15/06, Srinivas Cheruku wrote: > I understand that expat is used for XML Parsing. > > I need to manipulate the XML data read from a xml file by changing certain node values. > Can i use expat for this type of XML manipulation? > If so, can anyone direct me to the sample code for XML manipulation? From jzhang at ximpleware.com Wed Mar 15 19:35:18 2006 From: jzhang at ximpleware.com (Jimmy Zhang) Date: Wed, 15 Mar 2006 10:35:18 -0800 Subject: [Expat-discuss] XML Manipulation (Srinivas Cheruku) Message-ID: <009301c6485f$38aa0f80$0d02a8c0@ximpleware> VTD-XML is very good at doing what you described... Hi, I understand that expat is used for XML Parsing. I need to manipulate the XML data read from a xml file by changing certain node values. Can i use expat for this type of XML manipulation? If so, can anyone direct me to the sample code for XML manipulation? Thanks and Regards, Srini From m_biswas at mailinator.com Wed Mar 15 20:24:16 2006 From: m_biswas at mailinator.com (Mohun Biswas) Date: Wed, 15 Mar 2006 14:24:16 -0500 Subject: [Expat-discuss] XML Manipulation In-Reply-To: References: <4417EC4B.5000007@gmail.com> Message-ID: Nick MacDonald wrote: > Note that eXpat is a SAX parser, which means it does not load the > entire XML document into memory, and you basically get to see the > document in one pass. If you need to have the whole document in > memory for your processing, you'll probably want to look into another > type of XML processor, such as a DOM based one that loads the whole > XML document into memory, and provides the means to "walk" through it > in memory. > > Some day someone should write a sample program to translate an XML > file and include it with the basic eXpat package as an example... if > such a thing were to be done that would be exactly where you'd want to > start. There's an open source C library out there called "DOMC" which, as I understand it, uses eXpat to form a DOM. I haven't used it but it looks interesting. From nickmacd at gmail.com Wed Mar 15 20:31:05 2006 From: nickmacd at gmail.com (Nick MacDonald) Date: Wed, 15 Mar 2006 14:31:05 -0500 Subject: [Expat-discuss] XML Manipulation (Srinivas Cheruku) In-Reply-To: <009301c6485f$38aa0f80$0d02a8c0@ximpleware> References: <009301c6485f$38aa0f80$0d02a8c0@ximpleware> Message-ID: > VTD-XML is very good at doing what you described... Actually, I'm sure Jimmy may be amazed, but I would possibly agree with him on this count... although it would really depend on what you need to do... whether or not you need DOM functionality. Despite what he will say, VTD-XML is no much more than a compressed DOM representation. The thing to be wary of here is motive... the licensing of VTD-XML is not quite the same as the completely open license of eXpat. Nick > I understand that expat is used for XML Parsing. > I need to manipulate the XML data read from a xml file by changing certain node values. > Can i use expat for this type of XML manipulation? > If so, can anyone direct me to the sample code for XML manipulation? From Fang.Xu at innocon.com Wed Mar 15 21:03:51 2006 From: Fang.Xu at innocon.com (Xu, Fang) Date: Wed, 15 Mar 2006 15:03:51 -0500 Subject: [Expat-discuss] Small Program Message-ID: <3E8081396F6B524BA2854E7FA3F1643802428498@mail.innocon.com> Hi I just need to use SAX to parse a configuration file under LynxOS. Is it possible that I can link the library only necessary to my program so that it can keep the size of my program as small as possible? Thanks. Fang Xu From devyani.sapre at wipro.com Thu Mar 16 10:40:16 2006 From: devyani.sapre at wipro.com (devyani.sapre at wipro.com) Date: Thu, 16 Mar 2006 15:10:16 +0530 Subject: [Expat-discuss] XML Manipulation Message-ID: <21227C6FF0A66E459AC07C4DC1CE7F760148F30D@BLR-EC-MBX01.wipro.com> There is an open source library in c called SIMPLE C EXPAT WRAPPER which provides a DOM like interface and structure over expat -----Original Message----- From: expat-discuss-bounces at libexpat.org [mailto:expat-discuss-bounces at libexpat.org] On Behalf Of Nick MacDonald Sent: Wednesday, March 15, 2006 9:04 PM To: Srinivas Cheruku Cc: expat-discuss at libexpat.org Subject: Re: [Expat-discuss] XML Manipulation Note that eXpat is a SAX parser, which means it does not load the entire XML document into memory, and you basically get to see the document in one pass. If you need to have the whole document in memory for your processing, you'll probably want to look into another type of XML processor, such as a DOM based one that loads the whole XML document into memory, and provides the means to "walk" through it in memory. Assuming you can get by with the SAX limitations, there is some sample code included with the eXpat package, but there will probably not be enough there to get you near the finish line. You will need to write a bit of a state machine to track the incoming XML so that you know when you have found the things you're looking for. You'll probably also need to use a stack data structure to know the "path" at any given time as you are parsing your file. Some day someone should write a sample program to translate an XML file and include it with the basic eXpat package as an example... if such a thing were to be done that would be exactly where you'd want to start. Good luck, Nick On 3/15/06, Srinivas Cheruku wrote: > I understand that expat is used for XML Parsing. > > I need to manipulate the XML data read from a xml file by changing certain node values. > Can i use expat for this type of XML manipulation? > If so, can anyone direct me to the sample code for XML manipulation? _______________________________________________ Expat-discuss mailing list Expat-discuss at libexpat.org http://mail.libexpat.org/mailman/listinfo/expat-discuss The information contained in this electronic message and any attachments to this message are intended for the exclusive use of the addressee(s) and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you should not disseminate, distribute or copy this e-mail. Please notify the sender immediately and destroy all copies of this message and any attachments. WARNING: Computer viruses can be transmitted via email. The recipient should check this email and any attachments for the presence of viruses. The company accepts no liability for any damage caused by any virus transmitted by this email. www.wipro.com From feraudyh at yahoo.com Mon Mar 20 18:55:28 2006 From: feraudyh at yahoo.com (Henri De Feraudy) Date: Mon, 20 Mar 2006 09:55:28 -0800 (PST) Subject: [Expat-discuss] Novice blues with UTF-8, Expat and Python Message-ID: <20060320175528.29954.qmail@web50107.mail.yahoo.com> Hi folks, I've been trying to parse an XML file that contained some French accented characters and the program raised an error. Although you may have a quick answer, may I suggest that an example in the Python Expat documentation could have avoided my taking up your time. Here's the script: import codecs from xml.parsers.expat import ParserCreate ifh = codecs.open("FrenchText.xml","r","utf-8") text = ifh.read() ifh.close() parser = ParserCreate("utf-8") parser.Parse(text) Here's the content of the XML file: Un ??t?? torride Here's the error: UnicodeEncodeError: 'ascii' codec can't encode character u'\xe9' in position 58: ordinal not in range(128) To boldly go photographing where everyone has been before. From karl at waclawek.net Mon Mar 20 21:03:54 2006 From: karl at waclawek.net (Karl Waclawek) Date: Mon, 20 Mar 2006 15:03:54 -0500 Subject: [Expat-discuss] Novice blues with UTF-8, Expat and Python In-Reply-To: <20060320175528.29954.qmail@web50107.mail.yahoo.com> References: <20060320175528.29954.qmail@web50107.mail.yahoo.com> Message-ID: <441F0AAA.2010100@waclawek.net> Henri De Feraudy wrote: > Hi folks, > I've been trying to parse an XML file that contained some French accented characters and the program raised an error. Although you may have a quick answer, > may I suggest that an example in the Python Expat documentation could have avoided my taking up your time. > > We do not maintain the Python bindings for Expat here. Karl From weigelt at metux.de Fri Mar 24 01:07:13 2006 From: weigelt at metux.de (Enrico Weigelt) Date: Fri, 24 Mar 2006 01:07:13 +0100 Subject: [Expat-discuss] test - mail.libexpat.org seems offline Message-ID: <20060324000713.GA9893@nibiru.local> test -- --------------------------------------------------------------------- Enrico Weigelt == metux IT service phone: +49 36207 519931 www: http://www.metux.de/ fax: +49 36207 519932 email: contact at metux.de cellphone: +49 174 7066481 --------------------------------------------------------------------- -- DSL ab 0 Euro. -- statische IP -- UUCP -- Hosting -- Webshops -- --------------------------------------------------------------------- From weigelt at metux.de Fri Mar 24 14:28:45 2006 From: weigelt at metux.de (Enrico 'nekrad' Weigelt) Date: Fri, 24 Mar 2006 14:28:45 +0100 Subject: [Expat-discuss] [article] trimming down autoconf's configure scripts by using pkg-config Message-ID: <200603241328.k2ODSj3n020509@metux.de> Hi folks, here's a little article I wrote about an approach for encapsulating platform dependent stdc-lib fixes (ie. some platforms have broken string handling, etc) behind pkg-config. It is mostly directed on autoconf (It is the one I have most of my daily trouble with), but of course the approach works with other buildsystems, even plain make, too. The permanent URL of the article: * http://www.metux.de/articles/coding/trimming-down-autoconf-with-pkg-config I'm also currently working on concrete wrapper packages: * http://wiki.metux.de/public/OSS_QM_Taskforce/projects/stdc-pkgconfig Comments and suggestions please to the Maillist: oss-qm-discuss at metux.de Thanks. -- --------------------------------------------------------------------- Enrico Weigelt == metux IT service phone: +49 36207 519931 www: http://www.metux.de/ fax: +49 36207 519932 email: info at metux.de cellphone: +49 174 7066481 --------------------------------------------------------------------- From hymedia at videotron.ca Fri Mar 24 23:28:37 2006 From: hymedia at videotron.ca (hymedia) Date: Fri, 24 Mar 2006 17:28:37 -0500 Subject: [Expat-discuss] test - mail.libexpat.org seems offline In-Reply-To: <20060324000713.GA9893@nibiru.local> Message-ID: <0IWN00HHML3WWQA0@VL-MO-MR003.ip.videotron.ca> Hi all! Im new here so im not sure im at the right place, I got some questions regarding Expat, maybe u guys can help me out: Is there any web based forum on Expat? I didn't find any working tutorials/samples on Expat and I m getting worried I ll find any. Is there any tutorials/samples on Expat? What function/callback can retrieve element data... data in between a start and an end tag: Administrator I need to get Administrator as 'char' I do know how to get attributes with XML_SetElementHandler, my first guess was to get it by attribute[0]... but in this case, seams that there is no attributes. How do I retrieve this data? THX From fdrake at acm.org Sun Mar 26 08:50:45 2006 From: fdrake at acm.org (Fred L. Drake, Jr.) Date: Sun, 26 Mar 2006 01:50:45 -0500 Subject: [Expat-discuss] test - mail.libexpat.org seems offline In-Reply-To: <20060324000713.GA9893@nibiru.local> References: <20060324000713.GA9893@nibiru.local> Message-ID: <200603260150.45592.fdrake@acm.org> On Thursday 23 March 2006 19:07, Enrico Weigelt wrote: > test Hey everyone! I know I've been incommunicado for a long time now, but life can be that way sometimes. (Mostly just been really busy with work and family.) I've not had time to contribute for a while, and don't know that I do now, either, but I've tried to keep mail held for the list from backing up. Enrico's test message was held, so I noticed the subject line. The DNS servers that serve libexpat.org have been suffering severe DDoS attacks lately; the administrators for those servers are still working against those attacks. This has affected other domains I have registered there as well. Hopefully the problems will dissipate soon; if not, I'll look into alternate DNS arrangements. Karl: Congratulations on Expat 2! -Fred -- Fred L. Drake, Jr. "Don't let schooling interfere with your education." -- Mark Twain From mail2shivam at gmail.com Wed Mar 29 07:12:10 2006 From: mail2shivam at gmail.com (shivam sinha) Date: Wed, 29 Mar 2006 10:42:10 +0530 Subject: [Expat-discuss] SyExpat Parser sending single character Message-ID: <9dfa95ba0603282112ufedfcb7iaaf108d00eada70d@mail.gmail.com> Hi, I am using SyExpat(Symbian-Expat) 1.95.8 to parse a XML document on symbian. However parser sends only the first character of "TagName" to the "StartElement" call back.Does Anybody know , whats the issue? Plz help me out.Its urgent. Cheers Shivam From 11mjazbdg02 at sneakemail.com Wed Mar 29 11:25:40 2006 From: 11mjazbdg02 at sneakemail.com (Mark) Date: 29 Mar 2006 09:25:40 -0000 Subject: [Expat-discuss] Internal operation of expat Message-ID: <15388-31864@sneakemail.com> Hi All, I wounder if anyone could clarify something about the internal operation of expat? When I was trying to speed up the operation of a program that parses large files I noticed that, in my CharData callback, the address of the data pointer is always one of two values, and does not appear to point into the buffer allocated by XML_GetBuffer(). This suggests to me that the data must be copied internally. Can someone tell me whether this is correct and, if so, how expat performs so well? TIA, Mark From karl at waclawek.net Thu Mar 30 20:45:49 2006 From: karl at waclawek.net (Karl Waclawek) Date: Thu, 30 Mar 2006 13:45:49 -0500 Subject: [Expat-discuss] Internal operation of expat In-Reply-To: <15388-31864@sneakemail.com> References: <15388-31864@sneakemail.com> Message-ID: <442C275D.2020003@waclawek.net> Mark wrote: > Hi All, > > I wounder if anyone could clarify something about the internal operation of expat? > > When I was trying to speed up the operation of a program that parses large files I noticed that, in my CharData callback, the address of the data pointer is always one of two values, and does not appear to point into the buffer allocated by XML_GetBuffer(). > > This suggests to me that the data must be copied internally. > > Can someone tell me whether this is correct and, if so, how expat performs so well? > > Expat has to convert encodings, normalize line-breaks, expand entities, etc. So copying is normally required. Why it performs so well? Because James Clark is a hell of a coder. Karl From weigelt at metux.de Fri Mar 31 03:00:26 2006 From: weigelt at metux.de (Enrico Weigelt) Date: Fri, 31 Mar 2006 03:00:26 +0200 Subject: [Expat-discuss] test - mail.libexpat.org seems offline In-Reply-To: <200603260150.45592.fdrake@acm.org> References: <20060324000713.GA9893@nibiru.local> <200603260150.45592.fdrake@acm.org> Message-ID: <20060331010025.GA9379@nibiru.local> * Fred L. Drake, Jr. wrote: Hi, > The DNS servers that serve libexpat.org have been suffering severe DDoS > attacks lately; the administrators for those servers are still working > against those attacks. This has affected other domains I have registered > there as well. if you like, I'd offer 3 additional backup servers. cu -- --------------------------------------------------------------------- Enrico Weigelt == metux IT service - http://www.metux.de/ --------------------------------------------------------------------- Please visit the OpenSource QM Taskforce: http://wiki.metux.de/public/OpenSource_QM_Taskforce Patches / Fixes for a lot dozens of packages in dozens of versions: rsync://sources.metux.de/metux-patches ---------------------------------------------------------------------