From noreply at sourceforge.net  Wed Aug  2 01:26:59 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 01 Aug 2006 16:26:59 -0700
Subject: [XML-SIG] [ pyxml-Bugs-1532817 ] --msie in xbel_parse calls
	undefined dump_msie
Message-ID: <E1G83db-0002rd-QU@sc8-sf-web5.sourceforge.net>

Bugs item #1532817, was opened at 2006-08-02 01:26
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=106473&aid=1532817&group_id=6473

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: XBEL
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Jerémy Bobbio (jbobbio)
Assigned to: Nobody/Anonymous (nobody)
Summary: --msie in xbel_parse calls undefined dump_msie

Initial Comment:
xbel_parse contains support for a --msie option which 
is not implemented in bookmark.py where the logic 
lives.

This issue is already pretty old, see 
http://bugs.debian.org/111438

Maybe the --msie option could just be removed from 
xbel_parse...

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=106473&aid=1532817&group_id=6473

From noreply at sourceforge.net  Wed Aug  2 01:58:32 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 01 Aug 2006 16:58:32 -0700
Subject: [XML-SIG] [ pyxml-Bugs-1532824 ] xbel utilities should print usage
	on stderr
Message-ID: <E1G8488-0002o6-Aa@sc8-sf-web3.sourceforge.net>

Bugs item #1532824, was opened at 2006-08-02 01:58
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=106473&aid=1532824&group_id=6473

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: XBEL
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Jerémy Bobbio (jbobbio)
Assigned to: Nobody/Anonymous (nobody)
Summary: xbel utilities should print usage on stderr

Initial Comment:
Error and usage messages for XBEL utilities should be 
printed on stderr instead of stdout, as their output 
will often be redirected.

Attached is a patch fixing this issue.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=106473&aid=1532824&group_id=6473

From noreply at sourceforge.net  Wed Aug  2 02:16:09 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 01 Aug 2006 17:16:09 -0700
Subject: [XML-SIG] [ pyxml-Patches-1532827 ] Fix handling of line
	terminators in msie_parse
Message-ID: <E1G84PB-0000Ew-H6@sc8-sf-web5.sourceforge.net>

Patches item #1532827, was opened at 2006-08-02 02:16
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=306473&aid=1532827&group_id=6473

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: XBEL
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Jerémy Bobbio (jbobbio)
Assigned to: Nobody/Anonymous (nobody)
Summary: Fix handling of line terminators in msie_parse

Initial Comment:
msie_parse does not work correctly on *nix because 
Windows line terminators are not transformed 
from '\r\n' to '\n' by the libc as they are on win32.

The attached patch uses strip() to make it work on 
both plateforms.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=306473&aid=1532827&group_id=6473

From evdo.hsdpa at gmail.com  Wed Aug 16 15:31:04 2006
From: evdo.hsdpa at gmail.com (Robert Kim Wireless Internet Advisor)
Date: Wed, 16 Aug 2006 06:31:04 -0700
Subject: [XML-SIG] Remove obsolete link
In-Reply-To: <20060719194541.GA4446@oranjetip.leverkruid.eu>
References: <20060719194541.GA4446@oranjetip.leverkruid.eu>
Message-ID: <1ec620e90608160631o7a20a5beq902ccb136f026e4f@mail.gmail.com>

is there a newer page with updated information?

On 7/19/06, Simon Pepping <spepping at leverkruid.eu> wrote:
> Hi,
>
> For a long time, on your web page
> http://pyxml.sourceforge.net/topics/docs.html you have carried a link
> to my page "Writing an application for a SAX-compliant XML parser", at
> http://www.hobby.nl/~scaprea/XML/. This site is going away. Because
> the document is so old, I do not want to republish it on my new
> website. Could you remove the link?
>
> Regards, Simon Pepping
>
> --
> Simon Pepping
> email: spepping at leverkruid.eu
> home page: http://www.leverkruid.eu
> IM (jabber): spepping at nedlinux.nl
> public key: http://www.leverkruid.eu/personal/sp.asc
> fingerprint: E3BF 7295 9AA8 8B8A C01A  219D FAAC 088C 6B28 F549
> _______________________________________________
> XML-SIG maillist  -  XML-SIG at python.org
> http://mail.python.org/mailman/listinfo/xml-sig
>


-- 
Robert Q Kim, Wireless Internet Advisor
http://wireless-internet-coverage.blogspot.com
http://evdo-coverage.com/wireless-computer-network-consultant.html
2611 S. Pacific Coast Highway 101
Suite 203
Cardiff by the Sea, CA 92007
206 984 0880

From MAnstey at csu.edu.au  Mon Aug 21 13:16:37 2006
From: MAnstey at csu.edu.au (Anstey, Matthew)
Date: Mon, 21 Aug 2006 21:16:37 +1000
Subject: [XML-SIG] Large xml databases and python
Message-ID: <CA03C29F-3F75-4ADF-8D46-5DD85A23AC39@mimectl>

Hi,

I'm a newbie at python but we are using it daily in our research, so I'm getting the hang of it.

My RA and I are developing about 30 large unicode xml databases, averaging about 100 Mb each. They have to retain their XML format as they are used in a specialised program designed to read them in as they are.

We use Python to do our programming logic, which so far is mainly data manipulation. For example, each day we read through the data, which we save *for the time being* in python dictionaries (and thus they are about 10Mb each!) many times, processing various values to modify existing and create new values.

Our question is this: when we finish porting our 300Mb "python" data into 3Gb of XML data, how can we continue to read it from disk in its xml format and manipulate it?

We are looking at Berkeley XML with the Python API, but are concerned this is not the best solution. we have also dabbled with Amara and ElementTree, but the size our our XML is giving us problems.

We want to focus on the programming logic, which is all we do in python, and working with python structures is great, but this is not a viable option (as far as we can tell) once we move to XML. 

Or is there a way to read large XML files into python structures and then write them out again? 

Our ideal solution (we think!) would be to have an xml database whose elements we could directly modify using python scripts and where the disk and memory handling is done for us somehow.

Any advice would be appreciated. 

Regards,
Matthew
Postdoctoral Fellow, Charles Sturt University
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/xml-sig/attachments/20060821/7aad9fe1/attachment.html 

From heuer at semagia.com  Mon Aug 21 14:25:15 2006
From: heuer at semagia.com (Lars Heuer)
Date: Mon, 21 Aug 2006 14:25:15 +0200
Subject: [XML-SIG] Large xml databases and python
In-Reply-To: <CA03C29F-3F75-4ADF-8D46-5DD85A23AC39@mimectl>
References: <CA03C29F-3F75-4ADF-8D46-5DD85A23AC39@mimectl>
Message-ID: <852318472.20060821142515@semagia.com>

Hi Matthew,

[...]
> We are looking at Berkeley XML with the Python API, but are
> concerned this is not the best solution. we have also dabbled with
[...]

You can also try [exist-db] together with its [REST-API]. The
dev.guide gives an example.

[exist-db] http://exist.sourceforge.net/
[REST-API] http://exist.sourceforge.net/devguide.html#N10216

Best regards,
Lars
-- 
http://www.semagia.com


From fredrik at pythonware.com  Mon Aug 21 14:38:24 2006
From: fredrik at pythonware.com (Fredrik Lundh)
Date: Mon, 21 Aug 2006 14:38:24 +0200
Subject: [XML-SIG] Large xml databases and python
In-Reply-To: <CA03C29F-3F75-4ADF-8D46-5DD85A23AC39@mimectl>
References: <CA03C29F-3F75-4ADF-8D46-5DD85A23AC39@mimectl>
Message-ID: <ecc9g0$nps$1@sea.gmane.org>

Anstey, Matthew wrote:

> Our question is this: when we finish porting our 300Mb "python" data 
> into 3Gb of XML data, how can we continue to read it from disk in its 
> xml format and manipulate it?
>  
> We are looking at Berkeley XML with the Python API, but are concerned 
> this is not the best solution. we have also dabbled with Amara and 
> ElementTree, but the size our our XML is giving us problems.

if the Python version of the data fits in memory, you can use iterparse 
and the "incremental decoding" approach outlined here:

     http://effbot.org/zone/element-iterparse.htm

to save the data, you can build subtrees (e.g. on a record level) and 
write each tree out by itself.

     f = open("out.xml", "w")
     f.write("<data>")
     for record in data:
         tree = make_record_tree(record)
         tree.write(f)
     f.write("</data>")
     f.close()

</F>


From michael.perkonigg at liland.at  Fri Aug 25 11:00:26 2006
From: michael.perkonigg at liland.at (Michael Perkonigg)
Date: Fri, 25 Aug 2006 11:00:26 +0200
Subject: [XML-SIG] Wrong link in XML-howto
Message-ID: <44EEBC2A.2030403@liland.at>

Hello,

there is a wrong link in your XML howto at section 3 (3 Installing the 
XML Toolkit 
<http://pyxml.sourceforge.net/topics/howto/section-install.html>).
Link to that section: 
http://pyxml.sourceforge.net/topics/howto/section-install.html

"

If you have difficulty installing this software, send a problem report 
to the XML-SIG mailing list describing the problem, or submit a bug 
report at http://sourceforget.net/projects/pyxml.


"

It's "sourceforge.net/projects/pyxml", not 
"sourceforget.net/projects/pyxml".
The existing link routes one to a default ad and selling site.

Regards,
Mike


From uche at ogbuji.net  Mon Aug 28 18:57:44 2006
From: uche at ogbuji.net (Uche Ogbuji)
Date: Mon, 28 Aug 2006 10:57:44 -0600
Subject: [XML-SIG] ANN: 4Suite XML 1.0rc3
Message-ID: <44F32088.6030001@ogbuji.net>

Today we release 4Suite XML 1.0 release candidate 3, now available
from Sourceforge and ftp.4suite.org.

Please help us test 4Suite to ensure a high-quality 1.0 final release.

4Suite is split into three separate packages:

4Suite XML - XML, XPath, XSLT, related technologies and support libraries
4Suite RDF - RDF processing libraries and stand-alone DBMS
4Suite Repository - XML and RDF repository

This is a release of only the first component.

Highlights of changes in 4Suite XML since the last release
--

* Python 2.5 support (tested on Python 2.5b3)
* Improved packages, including support for generation of eggs
* Integrated PEP 302 import hooks (zipimports) to better support
  third-party applications (setuptools, py2exe, cx_Freeze)
* Add output and params arguments to Ft.Xml.Xslt.Transform
* Implement all of EXSLT Dynamic module (dyn:map, dyn:max, dyn:min,
  dyn:sum and dyn:closure were missing)
* Subclassed gettext support so that all Python versions support
  plural forms and loading of message catalogs as
  embedded data files.
* Domlette node classes can now be subclassed
* Add support for EntityResolver interface to Saxlette (primarily to satisfy
  Python's XMLFilterBase for SAX filters)
* Add Ft.Xml.CreateInputSource convenience function
* Add Ft.Xml.Sax.SaxPrinter, a SAX handler that uses a 4Suite printer
  to re-serialize from SAX events
* Add C14N and exclusive C14N writers
* Added support for xml:id
* Add native getElementById() method to Domlette documents to allow
  for XPath's id() and XPointer's short-hand expressions.
* Bug fixes, improvements to performance and documentation

4Suite XML is a comprehensive library for XML processing.  It is
implemented in Python and C and supports XML (SAX-like and DOM-like),
XPath, XSLT, RELAX NG, XUpdate, XInclude, XPointer, and more.

The manual is included in the documents, or you can browse it online at

http://uche.ogbuji.net/tech/4suite/etc/CoreManual.html

For general information, see:

http://4suite.org
http://uche.ogbuji.net/tech/4Suite/

For the files, see:

ftp://ftp.4suite.org/pub/4Suite/

Mirrored on www.fourthought.com:

http://www.fourthought.com/files/4Suite/4Suite_XML-1.0rc3-py2.3-win32.egg
http://www.fourthought.com/files/4Suite/4Suite_XML-1.0rc3-py2.4-win32.egg
http://www.fourthought.com/files/4Suite/4Suite_XML-1.0rc3-py2.5-win32.egg
http://www.fourthought.com/files/4Suite/4Suite-XML-1.0rc3.tar.bz2
http://www.fourthought.com/files/4Suite/4Suite-XML-1.0rc3.tar.gz
http://www.fourthought.com/files/4Suite/4Suite-XML-1.0rc3.zip
http://www.fourthought.com/files/4Suite/4Suite-XML-1.0rc3.win32-py2.2.exe
http://www.fourthought.com/files/4Suite/4Suite-XML-1.0rc3.win32-py2.3.exe
http://www.fourthought.com/files/4Suite/4Suite-XML-1.0rc3.win32-py2.4.exe
http://www.fourthought.com/files/4Suite/4Suite-XML-1.0rc3.win32-py2.5.exe
http://www.fourthought.com/files/4Suite/4Suite-XML-docs-1.0rc3.tar.bz2
http://www.fourthought.com/files/4Suite/4Suite-XML-docs-1.0rc3.tar.gz
http://www.fourthought.com/files/4Suite/4Suite-XML-docs-1.0rc3.zip

You can also get the files on Python CheeseShop:

http://cheeseshop.python.org/pypi/4Suite-XML/

We usually upload to Sourceforge, and will do so as soon as we can, but
server errors have made it impossible for us to complete the file release
so far.

https://sourceforge.net/projects/foursuite/
https://sourceforge.net/project/showfiles.php?group_id=39954

Documentation:

In the locations specified above, with filenames of the form

4Suite-XML-docs-1.0rc3.*

Release notes
--
The current installation directory layout document tells where package files
are installed:

http://4suite.org/docs/installation-locations.xhtml


From charlie.groves at gmail.com  Wed Aug 30 04:12:57 2006
From: charlie.groves at gmail.com (Charlie Groves)
Date: Tue, 29 Aug 2006 22:12:57 -0400
Subject: [XML-SIG] Updates to PyXML for Jython 2.2
Message-ID: <96c4692d0608291912g10e96132sb097a2239e50e6f5@mail.gmail.com>

Hi,

I'm working on getting the xml module back into a working state in
Jython for the 2.2 release.  I've gotten to the point where all of the
pieces that are distributed with CPython are working, so I'm starting
to think about how to actually put the xml module into Jython's
release system.  How does CPython pull stuff from pyxml?  Do they
maintain a separate version or do they pull over the current version
of pyxml when they're preparing for a release?

I could just submit a patch to get my changes into pyxml proper and
pull the version for Jython from cvs.  Does that seem reasonable?

Thanks,
Charlie

From jeremy.kloth at 4suite.org  Wed Aug 30 04:32:57 2006
From: jeremy.kloth at 4suite.org (Jeremy Kloth)
Date: Tue, 29 Aug 2006 20:32:57 -0600
Subject: [XML-SIG] Updates to PyXML for Jython 2.2
In-Reply-To: <96c4692d0608291912g10e96132sb097a2239e50e6f5@mail.gmail.com>
References: <96c4692d0608291912g10e96132sb097a2239e50e6f5@mail.gmail.com>
Message-ID: <200608292032.57478.jeremy.kloth@4suite.org>

On Tuesday 29 August 2006 8:12 pm, Charlie Groves wrote:
> Hi,
>
> I'm working on getting the xml module back into a working state in
> Jython for the 2.2 release.  I've gotten to the point where all of the
> pieces that are distributed with CPython are working, so I'm starting
> to think about how to actually put the xml module into Jython's
> release system.  How does CPython pull stuff from pyxml?  Do they
> maintain a separate version or do they pull over the current version
> of pyxml when they're preparing for a release?

The stuff in CPython's xml package is a select set of components from the 
PyXML whole.  The PyXML maintainer would manually sync up the components for 
each CPython release.  Note, PyXML is currently in need of a maintainer, 
Martin v. Loewis  has stepped down from that role.

> I could just submit a patch to get my changes into pyxml proper and
> pull the version for Jython from cvs.  Does that seem reasonable?

That seems fine, but you may want to consider only including the set of 
components included in CPython to be in Jython (for consistency).  Although I 
am by no means versed in Jython packaging. Feel free to include whatever 
parts you desire.

-- 
Jeremy Kloth
http://4suite.org/

From tom.dalton at EDS.COM  Thu Aug 31 11:38:17 2006
From: tom.dalton at EDS.COM (Dalton, Tom)
Date: Thu, 31 Aug 2006 10:38:17 +0100
Subject: [XML-SIG] XML-HOWTO
Message-ID: <D7457E86391D8742990A4C7537FF147B01835539@ukhkm201.emea.corp.eds.com>

Hi,

I would find it very useful if you could make some/all of the
documentation given at http://pyxml.sourceforge.net/topics/docs.html
available in downloadable format (I appreciate some of it already is,
i.e. the PDF files). In particular, the XML HOWTO tutorial style
introduction would be very useful to me, but there's no easy way of
taking a copy of it since it's all separate html files.

The main python documentation is available this way, e.g.
http://docs.python.org/ (top link) and I've found that very useful.

I hope my comments are useful to you :-)

Regards,

Tom Dalton
EDS Ltd
DIS
1-3 Bartley Wood Business Park
Bartley Way
Hook
Hampshire
RG27 9XA
(Tel: 01256 742481)

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/xml-sig/attachments/20060831/6dc8d72a/attachment.html 

From fredrik at pythonware.com  Thu Aug 31 12:09:41 2006
From: fredrik at pythonware.com (Fredrik Lundh)
Date: Thu, 31 Aug 2006 12:09:41 +0200
Subject: [XML-SIG] XML-HOWTO
In-Reply-To: <D7457E86391D8742990A4C7537FF147B01835539@ukhkm201.emea.corp.eds.com>
References: <D7457E86391D8742990A4C7537FF147B01835539@ukhkm201.emea.corp.eds.com>
Message-ID: <ed6ch6$d1a$1@sea.gmane.org>

Dalton, Tom wrote:

> I would find it very useful if you could make some/all of the 
> documentation given at _http://pyxml.sourceforge.net/topics/docs.html_ 
> available in downloadable format (I appreciate some of it already is, 
> i.e. the PDF files). In particular, the XML HOWTO tutorial style 
> introduction would be very useful to me, but there's no easy way of 
> taking a copy of it since it's all separate html files.

well, I guess you could always use the "websucker" tool to get your own 
local copy:

http://effbot.org/zone/websucker.htm

</F>


From dkgunter at lbl.gov  Thu Aug 31 22:27:00 2006
From: dkgunter at lbl.gov (Dan Gunter)
Date: Thu, 31 Aug 2006 13:27:00 -0700
Subject: [XML-SIG] XML-HOWTO
In-Reply-To: <D7457E86391D8742990A4C7537FF147B01835539@ukhkm201.emea.corp.eds.com>
References: <D7457E86391D8742990A4C7537FF147B01835539@ukhkm201.emea.corp.eds.com>
Message-ID: <44F74614.1090800@lbl.gov>

wget -r http://pyxml.sourceforge.net/topics/howto/

-Dan

Dalton, Tom wrote:
>
> Hi,
>
> I would find it very useful if you could make some/all of the 
> documentation given at _http://pyxml.sourceforge.net/topics/docs.html_ 
> available in downloadable format (I appreciate some of it already is, 
> i.e. the PDF files). In particular, the XML HOWTO tutorial style 
> introduction would be very useful to me, but there's no easy way of 
> taking a copy of it since it's all separate html files.
>
> The main python documentation is available this way, e.g. 
> _http://docs.python.org/_ (top link) and I've found that very useful.
>
> I hope my comments are useful to you :-)
>
> Regards,
>
> Tom Dalton
> EDS Ltd
> DIS
> 1-3 Bartley Wood Business Park
> Bartley Way
> Hook
> Hampshire
> RG27 9XA
> (Tel: 01256 742481)
>
> ------------------------------------------------------------------------
>
> _______________________________________________
> XML-SIG maillist  -  XML-SIG at python.org
> http://mail.python.org/mailman/listinfo/xml-sig
>   


-- 
Dan Gunter. voice:510-495-2504 fax:510-486-6363 dsd.lbl.gov/~dang