From noreply at sourceforge.net Wed Aug 2 01:26:59 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 01 Aug 2006 16:26:59 -0700
Subject: [XML-SIG] [ pyxml-Bugs-1532817 ] --msie in xbel_parse calls undefined dump_msie
Message-ID:

Bugs item #1532817, was opened at 2006-08-02 01:26
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=106473&aid=1532817&group_id=6473

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: XBEL
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Jerémy Bobbio (jbobbio)
Assigned to: Nobody/Anonymous (nobody)
Summary: --msie in xbel_parse calls undefined dump_msie

Initial Comment:
xbel_parse contains support for a --msie option which is
not implemented in bookmark.py where the logic lives.

This issue is already pretty old, see
http://bugs.debian.org/111438

Maybe the --msie option could just be removed from
xbel_parse...

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=106473&aid=1532817&group_id=6473

From noreply at sourceforge.net Wed Aug 2 01:58:32 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 01 Aug 2006 16:58:32 -0700
Subject: [XML-SIG] [ pyxml-Bugs-1532824 ] xbel utilities should print usage on stderr
Message-ID:

Bugs item #1532824, was opened at 2006-08-02 01:58
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=106473&aid=1532824&group_id=6473

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: XBEL
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Jerémy Bobbio (jbobbio)
Assigned to: Nobody/Anonymous (nobody)
Summary: xbel utilities should print usage on stderr

Initial Comment:
Error and usage messages for XBEL utilities should be
printed on stderr instead of stdout, as their output
will often be redirected.

Attached is a patch fixing this issue.

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=106473&aid=1532824&group_id=6473

From noreply at sourceforge.net Wed Aug 2 02:16:09 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 01 Aug 2006 17:16:09 -0700
Subject: [XML-SIG] [ pyxml-Patches-1532827 ] Fix handling of line terminators in msie_parse
Message-ID:

Patches item #1532827, was opened at 2006-08-02 02:16
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=306473&aid=1532827&group_id=6473

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: XBEL
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Jerémy Bobbio (jbobbio)
Assigned to: Nobody/Anonymous (nobody)
Summary: Fix handling of line terminators in msie_parse

Initial Comment:
msie_parse does not work correctly on *nix because
Windows line terminators are not transformed from
'\r\n' to '\n' by the libc as they are on win32.

The attached patch uses strip() to make it work on both
platforms.
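
The patch itself is not reproduced in this archive, so the following is only a minimal sketch of the idea described in the comment: calling strip() on each line removes both '\n' and '\r\n', so the same parsing code behaves identically for files written on either platform. The function name parse_shortcut_lines and the KEY=VALUE sample data are assumptions made for illustration; they are not taken from msie_parse.

```python
def parse_shortcut_lines(lines):
    """Collect KEY=VALUE pairs from lines ending in LF or CRLF."""
    fields = {}
    for line in lines:
        # strip() drops a trailing CR and LF alike, plus other surrounding
        # whitespace, so no platform-specific handling is needed.
        line = line.strip()
        if not line or line.startswith("[") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        fields[key] = value
    return fields


# Hypothetical input: the same content with Windows and Unix terminators.
windows_lines = ["[InternetShortcut]\r\n", "URL=http://www.python.org/\r\n"]
unix_lines = ["[InternetShortcut]\n", "URL=http://www.python.org/\n"]

windows_fields = parse_shortcut_lines(windows_lines)
unix_fields = parse_shortcut_lines(unix_lines)

# For contrast: chopping only the last character leaves a stray '\r'.
naive_value = windows_lines[1][:-1].split("=", 1)[1]
```

With strip(), both inputs produce the same dictionary, whereas the naive slice keeps the carriage return at the end of the value.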

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=306473&aid=1532827&group_id=6473

From evdo.hsdpa at gmail.com Wed Aug 16 15:31:04 2006
From: evdo.hsdpa at gmail.com (Robert Kim Wireless Internet Advisor)
Date: Wed, 16 Aug 2006 06:31:04 -0700
Subject: [XML-SIG] Remove obsolete link
In-Reply-To: <20060719194541.GA4446@oranjetip.leverkruid.eu>
References: <20060719194541.GA4446@oranjetip.leverkruid.eu>
Message-ID: <1ec620e90608160631o7a20a5beq902ccb136f026e4f@mail.gmail.com>

is there a newer page with updated information?

On 7/19/06, Simon Pepping wrote:
> Hi,
>
> For a long time, on your web page
> http://pyxml.sourceforge.net/topics/docs.html you have carried a link
> to my page "Writing an application for a SAX-compliant XML parser", at
> http://www.hobby.nl/~scaprea/XML/. This site is going away. Because
> the document is so old, I do not want to republish it on my new
> website. Could you remove the link?
>
> Regards, Simon Pepping
>
> --
> Simon Pepping
> email: spepping at leverkruid.eu
> home page: http://www.leverkruid.eu
> IM (jabber): spepping at nedlinux.nl
> public key: http://www.leverkruid.eu/personal/sp.asc
> fingerprint: E3BF 7295 9AA8 8B8A C01A 219D FAAC 088C 6B28 F549
> _______________________________________________
> XML-SIG maillist - XML-SIG at python.org
> http://mail.python.org/mailman/listinfo/xml-sig
>

--
Robert Q Kim, Wireless Internet Advisor
http://wireless-internet-coverage.blogspot.com
http://evdo-coverage.com/wireless-computer-network-consultant.html
2611 S. Pacific Coast Highway 101
Suite 203
Cardiff by the Sea, CA 92007
206 984 0880

From MAnstey at csu.edu.au Mon Aug 21 13:16:37 2006
From: MAnstey at csu.edu.au (Anstey, Matthew)
Date: Mon, 21 Aug 2006 21:16:37 +1000
Subject: [XML-SIG] Large xml databases and python
Message-ID:

Hi,

I'm a newbie at python but we are using it daily in our research, so I'm
getting the hang of it.

My RA and I are developing about 30 large unicode xml databases,
averaging about 100 Mb each. They have to retain their XML format as
they are used in a specialised program designed to read them in as they
are.

We use Python to do our programming logic, which so far is mainly data
manipulation. For example, each day we read through the data many times,
processing various values to modify existing values and create new ones.
*For the time being* we save the data in Python dictionaries (and thus
they are about 10Mb each!).

Our question is this: when we finish porting our 300Mb "python" data
into 3Gb of XML data, how can we continue to read it from disk in its
xml format and manipulate it?

We are looking at Berkeley XML with the Python API, but are concerned
this is not the best solution. We have also dabbled with Amara and
ElementTree, but the size of our XML is giving us problems.

We want to focus on the programming logic, which is all we do in Python,
and working with Python structures is great, but this is not a viable
option (as far as we can tell) once we move to XML. Or is there a way to
read large XML files into Python structures and then write them out
again?

Our ideal solution (we think!) would be to have an xml database whose
elements we could directly modify using Python scripts and where the
disk and memory handling is done for us somehow.

Any advice would be appreciated.

Regards,
Matthew

Postdoctoral Fellow, Charles Sturt University
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/xml-sig/attachments/20060821/7aad9fe1/attachment.html

From heuer at semagia.com Mon Aug 21 14:25:15 2006
From: heuer at semagia.com (Lars Heuer)
Date: Mon, 21 Aug 2006 14:25:15 +0200
Subject: [XML-SIG] Large xml databases and python
In-Reply-To:
References:
Message-ID: <852318472.20060821142515@semagia.com>

Hi Matthew,

[...]
> We are looking at Berkeley XML with the Python API, but are
> concerned this is not the best solution. we have also dabbled with
[...]

You can also try [exist-db] together with its [REST-API]. The dev.guide
gives an example.

[exist-db] http://exist.sourceforge.net/
[REST-API] http://exist.sourceforge.net/devguide.html#N10216

Best regards,
Lars
--
http://www.semagia.com

From fredrik at pythonware.com Mon Aug 21 14:38:24 2006
From: fredrik at pythonware.com (Fredrik Lundh)
Date: Mon, 21 Aug 2006 14:38:24 +0200
Subject: [XML-SIG] Large xml databases and python
In-Reply-To:
References:
Message-ID:

Anstey, Matthew wrote:

> Our question is this: when we finish porting our 300Mb "python" data
> into 3Gb of XML data, how can we continue to read it from disk in its
> xml format and manipulate it?
>
> We are looking at Berkeley XML with the Python API, but are concerned
> this is not the best solution. we have also dabbled with Amara and
> ElementTree, but the size our our XML is giving us problems.

if the Python version of the data fits in memory, you can use iterparse
and the "incremental decoding" approach outlined here:

    http://effbot.org/zone/element-iterparse.htm

to save the data, you can build subtrees (e.g. on a record level) and
write each tree out by itself.

    f = open("out.xml", "w")
    # string literal is empty in the archived message; presumably the
    # opening tag of the enclosing root element
    f.write("")
    for record in data:
        # build one subtree per record and write it out on its own
        tree = make_record_tree(record)
        tree.write(f)
    # empty in the archived message; presumably the matching closing tag
    f.write("")
    f.close()

From michael.perkonigg at liland.at Fri Aug 25 11:00:26 2006
From: michael.perkonigg at liland.at (Michael Perkonigg)
Date: Fri, 25 Aug 2006 11:00:26 +0200
Subject: [XML-SIG] Wrong link in XML-howto
Message-ID: <44EEBC2A.2030403@liland.at>

Hello,

there is a wrong link in your XML howto at section 3 (3 Installing the
XML Toolkit ).
Link to that section:
http://pyxml.sourceforge.net/topics/howto/section-install.html

"
If you have difficulty installing this software, send a problem report
to the XML-SIG mailing list describing the problem, or submit a bug
report at http://sourceforget.net/projects/pyxml.
"

It's "sourceforge.net/projects/pyxml", not
"sourceforget.net/projects/pyxml".
The existing link routes one to a default ad and selling site.

Regards,
Mike

From uche at ogbuji.net Mon Aug 28 18:57:44 2006
From: uche at ogbuji.net (Uche Ogbuji)
Date: Mon, 28 Aug 2006 10:57:44 -0600
Subject: [XML-SIG] ANN: 4Suite XML 1.0rc3
Message-ID: <44F32088.6030001@ogbuji.net>

Today we release 4Suite XML 1.0 release candidate 3, now available from
Sourceforge and ftp.4suite.org. Please help us test 4Suite to ensure a
high-quality 1.0 final release.

4Suite is split into three separate packages:

4Suite XML - XML, XPath, XSLT, related technologies and support libraries
4Suite RDF - RDF processing libraries and stand-alone DBMS
4Suite Repository - XML and RDF repository

This is a release of only the first component.

Highlights of changes in 4Suite XML since the last release --

* Python 2.5 support (tested on Python 2.5b3)
* Improved packages, including support for generation of eggs
* Integrated PEP 302 import hooks (zipimports) to better support
  third-party applications (setuptools, py2exe, cx_Freeze)
* Add output and params arguments to Ft.Xml.Xslt.Transform
* Implement all of EXSLT Dynamic module (dyn:map, dyn:max, dyn:min,
  dyn:sum and dyn:closure were missing)
* Subclassed gettext support so that all Python versions support plural
  forms and loading of message catalogs as embedded data files.
* Domlette node classes can now be subclassed
* Add support for EntityResolver interface to Saxlette (primarily to
  satisfy Python's XMLFilterBase for SAX filters)
* Add Ft.Xml.CreateInputSource convenience function
* Add Ft.Xml.Sax.SaxPrinter, a SAX handler that uses a 4Suite printer to
  re-serialize from SAX events
* Add C14N and exclusive C14N writers
* Added support for xml:id
* Add native getElementById() method to Domlette documents to allow for
  XPath's id() and XPointer's short-hand expressions.
* Bug fixes, improvements to performance and documentation

4Suite XML is a comprehensive library for XML processing. It is
implemented in Python and C and supports XML (SAX-like and DOM-like),
XPath, XSLT, RELAX NG, XUpdate, XInclude, XPointer, and more.

The manual is included in the documents, or you can browse it online at
http://uche.ogbuji.net/tech/4suite/etc/CoreManual.html

For general information, see:

http://4suite.org
http://uche.ogbuji.net/tech/4Suite/

For the files, see:

ftp://ftp.4suite.org/pub/4Suite/

Mirrored on www.fourthought.com:

http://www.fourthought.com/files/4Suite/4Suite_XML-1.0rc3-py2.3-win32.egg
http://www.fourthought.com/files/4Suite/4Suite_XML-1.0rc3-py2.4-win32.egg
http://www.fourthought.com/files/4Suite/4Suite_XML-1.0rc3-py2.5-win32.egg
http://www.fourthought.com/files/4Suite/4Suite-XML-1.0rc3.tar.bz2
http://www.fourthought.com/files/4Suite/4Suite-XML-1.0rc3.tar.gz
http://www.fourthought.com/files/4Suite/4Suite-XML-1.0rc3.zip
http://www.fourthought.com/files/4Suite/4Suite-XML-1.0rc3.win32-py2.2.exe
http://www.fourthought.com/files/4Suite/4Suite-XML-1.0rc3.win32-py2.3.exe
http://www.fourthought.com/files/4Suite/4Suite-XML-1.0rc3.win32-py2.4.exe
http://www.fourthought.com/files/4Suite/4Suite-XML-1.0rc3.win32-py2.5.exe
http://www.fourthought.com/files/4Suite/4Suite-XML-docs-1.0rc3.tar.bz2
http://www.fourthought.com/files/4Suite/4Suite-XML-docs-1.0rc3.tar.gz
http://www.fourthought.com/files/4Suite/4Suite-XML-docs-1.0rc3.zip

You can also get the
files on Python CheeseShop:

http://cheeseshop.python.org/pypi/4Suite-XML/

We usually upload to Sourceforge, and will do so as soon as we can, but
server errors have made it impossible for us to complete the file
release so far.

https://sourceforge.net/projects/foursuite/
https://sourceforge.net/project/showfiles.php?group_id=39954

Documentation:

In the locations specified above, with filenames of the form
4Suite-XML-docs-1.0rc3.*

Release notes --

The current installation directory layout document tells where package
files are installed:

http://4suite.org/docs/installation-locations.xhtml

From charlie.groves at gmail.com Wed Aug 30 04:12:57 2006
From: charlie.groves at gmail.com (Charlie Groves)
Date: Tue, 29 Aug 2006 22:12:57 -0400
Subject: [XML-SIG] Updates to PyXML for Jython 2.2
Message-ID: <96c4692d0608291912g10e96132sb097a2239e50e6f5@mail.gmail.com>

Hi,

I'm working on getting the xml module back into a working state in
Jython for the 2.2 release. I've gotten to the point where all of the
pieces that are distributed with CPython are working, so I'm starting
to think about how to actually put the xml module into Jython's
release system. How does CPython pull stuff from pyxml? Do they
maintain a separate version or do they pull over the current version
of pyxml when they're preparing for a release?

I could just submit a patch to get my changes into pyxml proper and
pull the version for Jython from cvs. Does that seem reasonable?

Thanks,
Charlie

From jeremy.kloth at 4suite.org Wed Aug 30 04:32:57 2006
From: jeremy.kloth at 4suite.org (Jeremy Kloth)
Date: Tue, 29 Aug 2006 20:32:57 -0600
Subject: [XML-SIG] Updates to PyXML for Jython 2.2
In-Reply-To: <96c4692d0608291912g10e96132sb097a2239e50e6f5@mail.gmail.com>
References: <96c4692d0608291912g10e96132sb097a2239e50e6f5@mail.gmail.com>
Message-ID: <200608292032.57478.jeremy.kloth@4suite.org>

On Tuesday 29 August 2006 8:12 pm, Charlie Groves wrote:
> Hi,
>
> I'm working on getting the xml module back into a working state in
> Jython for the 2.2 release. I've gotten to the point where all of the
> pieces that are distributed with CPython are working, so I'm starting
> to think about how to actually put the xml module into Jython's
> release system. How does CPython pull stuff from pyxml? Do they
> maintain a separate version or do they pull over the current version
> of pyxml when they're preparing for a release?

The stuff in CPython's xml package is a select set of components from
the PyXML whole. The PyXML maintainer would manually sync up the
components for each CPython release.

Note: PyXML is currently in need of a maintainer; Martin v. Loewis has
stepped down from that role.

> I could just submit a patch to get my changes into pyxml proper and
> pull the version for Jython from cvs. Does that seem reasonable?

That seems fine, but you may want to consider including in Jython only
the set of components that is included in CPython (for consistency),
although I am by no means versed in Jython packaging. Feel free to
include whatever parts you desire.

--
Jeremy Kloth
http://4suite.org/

From tom.dalton at EDS.COM Thu Aug 31 11:38:17 2006
From: tom.dalton at EDS.COM (Dalton, Tom)
Date: Thu, 31 Aug 2006 10:38:17 +0100
Subject: [XML-SIG] XML-HOWTO
Message-ID:

Hi,

I would find it very useful if you could make some/all of the
documentation given at http://pyxml.sourceforge.net/topics/docs.html
available in downloadable format (I appreciate some of it already is,
i.e. the PDF files). In particular, the XML HOWTO tutorial style
introduction would be very useful to me, but there's no easy way of
taking a copy of it since it's all separate html files.

The main python documentation is available this way, e.g.
http://docs.python.org/ (top link) and I've found that very useful.

I hope my comments are useful to you :-)

Regards,

Tom Dalton
EDS Ltd
DIS
1-3 Bartley Wood Business Park
Bartley Way
Hook
Hampshire
RG27 9XA
(Tel: 01256 742481)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/xml-sig/attachments/20060831/6dc8d72a/attachment.html

From fredrik at pythonware.com Thu Aug 31 12:09:41 2006
From: fredrik at pythonware.com (Fredrik Lundh)
Date: Thu, 31 Aug 2006 12:09:41 +0200
Subject: [XML-SIG] XML-HOWTO
In-Reply-To:
References:
Message-ID:

Dalton, Tom wrote:

> I would find it very useful if you could make some/all of the
> documentation given at _http://pyxml.sourceforge.net/topics/docs.html_
> available in downloadable format (I appreciate some of it already is,
> i.e. the PDF files). In particular, the XML HOWTO tutorial style
> introduction would be very useful to me, but there's no easy way of
> taking a copy of it since it's all separate html files.

well, I guess you could always use the "websucker" tool to get your own
local copy:

    http://effbot.org/zone/websucker.htm

From dkgunter at lbl.gov Thu Aug 31 22:27:00 2006
From: dkgunter at lbl.gov (Dan Gunter)
Date: Thu, 31 Aug 2006 13:27:00 -0700
Subject: [XML-SIG] XML-HOWTO
In-Reply-To:
References:
Message-ID: <44F74614.1090800@lbl.gov>

wget -r http://pyxml.sourceforge.net/topics/howto/

-Dan

Dalton, Tom wrote:
>
> Hi,
>
> I would find it very useful if you could make some/all of the
> documentation given at _http://pyxml.sourceforge.net/topics/docs.html_
> available in downloadable format (I appreciate some of it already is,
> i.e. the PDF files). In particular, the XML HOWTO tutorial style
> introduction would be very useful to me, but there's no easy way of
> taking a copy of it since it's all separate html files.
>
> The main python documentation is available this way, e.g.
> _http://docs.python.org/_ (top link) and I've found that very useful.
>
> I hope my comments are useful to you :-)
>
> Regards,
>
> Tom Dalton
> EDS Ltd
> DIS
> 1-3 Bartley Wood Business Park
> Bartley Way
> Hook
> Hampshire
> RG27 9XA
> (Tel: 01256 742481)
>
> ------------------------------------------------------------------------
>
> _______________________________________________
> XML-SIG maillist - XML-SIG at python.org
> http://mail.python.org/mailman/listinfo/xml-sig
>

--
Dan Gunter. voice:510-495-2504 fax:510-486-6363 dsd.lbl.gov/~dang
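
Returning to the "Large xml databases and python" thread earlier in this archive: the incremental read-and-write approach suggested there can be sketched roughly as follows. This is a minimal illustration, not the code from the thread. It uses xml.etree.ElementTree from the standard library, and the helper names (iter_records, write_records, mark_seen) as well as the element and attribute names in the sample data are assumptions made up for the example.

```python
import io
import xml.etree.ElementTree as ET


def iter_records(source, tag):
    """Yield each finished <tag> element, then clear it to save memory."""
    for _event, elem in ET.iterparse(source, events=("end",)):
        if elem.tag == tag:
            yield elem
            elem.clear()  # runs once the consumer asks for the next record


def mark_seen(records):
    """Stand-in for the real processing logic: tag every record."""
    for record in records:
        record.set("seen", "yes")
        yield record


def write_records(out, records, root_tag):
    """Write a root element around records, serializing one at a time."""
    out.write(("<%s>" % root_tag).encode("ascii"))
    for record in records:
        out.write(ET.tostring(record))
    out.write(("</%s>" % root_tag).encode("ascii"))


# Hypothetical sample data standing in for a large file on disk.
source = io.BytesIO(
    b'<db><record id="1">a</record><record id="2">b</record></db>'
)
result = io.BytesIO()
write_records(result, mark_seen(iter_records(source, "record")), "db")
```

Because the three functions are chained as generators, only one record subtree is fully populated at a time. The root element still keeps a reference to each cleared (now empty) child, so memory use is reduced rather than strictly constant.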