From martin@loewis.home.cs.tu-berlin.de  Sun Jul  1 19:20:59 2001
From: martin@loewis.home.cs.tu-berlin.de (Martin v. Loewis)
Date: Sun, 1 Jul 2001 20:20:59 +0200
Subject: [XML-SIG] Re: Narval 1.0 and Python 2.1
In-Reply-To: <3B3E05A7.CA87E510@zolera.com> (message from Rich Salz on Sat, 30
 Jun 2001 13:00:23 -0400)
References: <Pine.LNX.4.21.0106210908020.4140-100000@leo.logilab.fr> <200106210745.f5L7jrm01579@mira.informatik.hu-berlin.de> <3B33669D.9EB0C620@FourThought.com> <200106301634.f5UGYDQ08994@mira.informatik.hu-berlin.de> <3B3E05A7.CA87E510@zolera.com>
Message-ID: <200107011820.f61IKxA01012@mira.informatik.hu-berlin.de>

> > I wanted to integrate 4XSLT into PyXML, in a way that does not
> > require 4Suite.
> 
> That means 4XPATH also, right?

Right.

Martin


From tpassin@home.com  Mon Jul  2 03:41:58 2001
From: tpassin@home.com (Thomas B. Passin)
Date: Sun, 1 Jul 2001 22:41:58 -0400
Subject: [XML-SIG] 4xslt bug involving key()
References: <006601c0f077$eb31fc70$f803a8c0@zeus> <m3g0daey55.fsf@lambda.garshol.priv.no> <004101c0f21c$fa2e0560$7cac1218@reston1.va.home.com> <3B3DE70A.653002A6@FourThought.com>
Message-ID: <000f01c102a0$911cef20$7cac1218@reston1.va.home.com>

[Mike Olson]

Thanks, Mike, I'll see if I can retrieve it and get it to work.  Much
appreciated.

Cheers,

Tom P

> "Thomas B. Passin" wrote:
>
>
> Thomas,
>
>   I forget if someone replied to you, but this appears to be fixed in
> CVS.
>
> Mike
> >
> > I've just found that a stylesheet construction that I need to use
> > doesn't work right with 4xslt (the python 1.5.2 version I got from the
> > 4suite.org site several weeks ago).
> >
> > The stylesheet takes a number of elements that have duplicated content
> > and produces a list without duplicates.  It's a simplified Muenchian
> > method, using <xsl:key/> and key().  It works right with msxml3, saxon,
> > and xalan, but not 4xslt.  I need to use this in a project I'm in the
> > middle of at work, so I request the 4thought people (Mike, would that
> > be?) to take a look at it.
> >
> > If 4xslt doesn't implement keys (I thought it did), then at least it
> > should throw an error.
> >


From uche.ogbuji@fourthought.com  Mon Jul  2 06:02:32 2001
From: uche.ogbuji@fourthought.com (Uche Ogbuji)
Date: Sun, 01 Jul 2001 23:02:32 -0600
Subject: [XML-SIG] Re: [4suite] 4xslt: bug and patch: variable import order
References: <15096.25410.753829.204197@lindm.dm>
Message-ID: <3B400068.BF01CAC8@fourthought.com>

Dieter Maurer wrote:
> 
> The XSLT spec specifies that definitions and template rules
> in an importing stylesheet take precedence over those from
> an imported stylesheet. This is essential for easy customization
> of imported stylesheets.
> 
> "4xslt" implements this feature only partially:
> 
>    Top level variables in an importing stylesheet do not
>    take precedence over imported ones.
> 
> The attached patch hopefully fixes the problem.
> It ensures that variables in importing style sheets
> take precedence over those defined in imported style sheets
> and that all style sheets use the same top level variables.

Note: my fix was quite different.  I hadn't applied this patch because I
knew the problem was more fundamental.

Thanks, though.


-- 
Uche Ogbuji                               Principal Consultant
uche.ogbuji@fourthought.com               +1 303 583 9900 x 101
Fourthought, Inc.                         http://Fourthought.com 
4735 East Walnut St, Ste. C, Boulder, CO 80301-2537, USA
XML strategy, XML tools (http://4Suite.org), knowledge management


From uogbuji@fourthought.com  Mon Jul  2 06:56:47 2001
From: uogbuji@fourthought.com (Uche Ogbuji)
Date: Sun, 01 Jul 2001 23:56:47 -0600
Subject: [XML-SIG] Reader() newbie question
Message-ID: <200107020556.f625ulp02108@localhost.local>

> > p = make_parser("xml.sax.drivers.drv_xmlproc")
> > reader = Sax.Reader(parser=p)
> 
> Sorry it took so long to get back to you.  This works fine for me, Thanks.
> In my case since I want to use a validating parser I use:
> 
> p = make_parser("xml.sax.drivers.drv_xmlproc_val")
> reader = Sax.Reader(parser=p)

Oops.  Right.

> Since I can also write:
> 
> p = make_parser("xml.sax.drivers.drv.pyexpat")
> reader = Sax.Reader(parser=p)
> 
> what is considered the "correct" method if I want to use expat?  The above
> line or what I have seen more often:
> 
> reader = PyExpat.Reader()
> 
> I mean, sure the second method is one line shorter, but the first one is
> consistent across all the parsers on my machine under 'xml.sax.drivers'
> Does the second method do important init stuff (or whatever) that I am
> missing?

Their both right, and equivalent, since PyXML sets up
"xml.sax.drivers.drv.pyexpat" as the default non-validating SAX driver.
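The same idea, an explicitly named driver and the default resolving to the same parser, can be checked with the SAX package in today's Python standard library. This is only an analogue, not the PyXML `Sax.Reader` or `PyExpat.Reader` API discussed above: it assumes a modern Python 3 where `xml.sax.expatreader` is the default driver.

```python
import xml.sax

# Ask for the default driver, then name the expat driver explicitly.
default_parser = xml.sax.make_parser()
explicit_parser = xml.sax.make_parser(["xml.sax.expatreader"])

# Both calls should hand back the same parser class.
same_driver = type(default_parser) is type(explicit_parser)
print(type(default_parser).__name__, same_driver)
```

Naming the driver explicitly keeps the code uniform across parsers; relying on the default is shorter but depends on what the installation registers as the default.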


From uche.ogbuji@fourthought.com  Mon Jul  2 19:29:33 2001
From: uche.ogbuji@fourthought.com (Uche Ogbuji)
Date: Mon, 02 Jul 2001 12:29:33 -0600
Subject: [XML-SIG] Reader() newbie question
In-Reply-To: Message from Uche Ogbuji <uogbuji@fourthought.com>
 of "Sun, 01 Jul 2001 23:56:47 MDT." <200107020556.f625ulp02108@localhost.local>
Message-ID: <200107021829.f62ITXN04420@localhost.local>

Me:

> Their both right, and equivalent, since PyXML sets up...
  ^^^^^

Ouch.  I'm more sleep-deprived than I thought.


From tpassin@home.com  Tue Jul  3 01:20:21 2001
From: tpassin@home.com (Thomas B. Passin)
Date: Mon, 2 Jul 2001 20:20:21 -0400
Subject: [XML-SIG] 4xslt bug involving key()
References: <006601c0f077$eb31fc70$f803a8c0@zeus> <m3g0daey55.fsf@lambda.garshol.priv.no> <004101c0f21c$fa2e0560$7cac1218@reston1.va.home.com> <3B3DE70A.653002A6@FourThought.com>
Message-ID: <006b01c10355$f2eff480$7cac1218@reston1.va.home.com>

4xslt from CVS doesn't run. Here's what I did.  I have pyxml 0.65/python
1.5.2 on Windows.  I copied the three directories xslt, xpath, util from the
CVS on SourceForge, renamed the corresponding 0.65 directories to save them,
and copied the three CVS directories in their place.  Then I ran my command
line wrapper for 4xslt.

I get an error, of which this is the salient part:

 File "D:\PROGRA~2\PYTHON\xml\xpath\Conversions.py", line 23, in ?
    from xml.utils import boolean
ImportError: cannot import name boolean

There is no file called "boolean" in the CVS, nor does xml\util\__init__.py
define boolean.  What do I need to make this work?

Cheers,

Tom P


[Mike Olson]
> "Thomas B. Passin" wrote:
>
>
> Thomas,
>
>   I forget if someone replied to you, but this appears to be fixed in
> CVS.
>


From larsga@garshol.priv.no  Tue Jul  3 07:44:21 2001
From: larsga@garshol.priv.no (Lars Marius Garshol)
Date: 03 Jul 2001 08:44:21 +0200
Subject: [XML-SIG] SAX event: internalEntityDecl
In-Reply-To: <3B3DD62E.22498.3C2354@localhost>
References: <Pine.LNX.4.21.0106281018211.11853-100000@leo.logilab.fr> <3B3DD62E.22498.3C2354@localhost>
Message-ID: <m3d77ifr5m.fsf@lambda.garshol.priv.no>

* Arne Krug
|
| is there a way to distinguish between SAX entity-events:
| one Entity is declared in an external dtd and
| the other one is directly in the xml-file 

It is possible to do this, though the information is not directly
present in any specific event. Using other events it is possible to
figure out where you are at any given time.

  startDTD(...)
  # events here come from the internal subset
  startEntity("[dtd]")
  # events here from the external subset
  endEntity("[dtd]")
  endDTD(...)
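
The event sequence above can be turned into a small state tracker. This is a minimal sketch, assuming the parser reports the external subset as the pseudo-entity "[dtd]" through LexicalHandler-style callbacks and reports entity declarations through a DeclHandler-style callback. The class name, the `seen` list and the entity names are hypothetical, and the events are fed by hand rather than by a real parser.

```python
class DeclLocationTracker:
    """Remembers whether DTD events come from the internal or external subset."""

    def __init__(self):
        self.location = None  # None, "internal" or "external"
        self.seen = []        # (entity name, location) pairs

    # LexicalHandler-style callbacks
    def startDTD(self, name, public_id, system_id):
        self.location = "internal"

    def startEntity(self, name):
        if name == "[dtd]":
            self.location = "external"

    def endEntity(self, name):
        if name == "[dtd]":
            self.location = "internal"

    def endDTD(self):
        self.location = None

    # DeclHandler-style callback
    def internalEntityDecl(self, name, value):
        self.seen.append((name, self.location))


# Replay the event order from the listing above by hand.
tracker = DeclLocationTracker()
tracker.startDTD("doc", None, "doc.dtd")
tracker.internalEntityDecl("local", "declared in the document")
tracker.startEntity("[dtd]")
tracker.internalEntityDecl("shared", "declared in the external DTD")
tracker.endEntity("[dtd]")
tracker.endDTD()
print(tracker.seen)
```

Whether a given driver actually emits the "[dtd]" entity events has to be checked against that driver.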
 
I hope this helps. 

--Lars M.


From jeremy.kloth@fourthought.com  Tue Jul  3 15:44:31 2001
From: jeremy.kloth@fourthought.com (Jeremy Kloth)
Date: Tue, 3 Jul 2001 08:44:31 -0600
Subject: [XML-SIG] 4xslt bug involving key()
References: <006601c0f077$eb31fc70$f803a8c0@zeus> <m3g0daey55.fsf@lambda.garshol.priv.no> <004101c0f21c$fa2e0560$7cac1218@reston1.va.home.com> <3B3DE70A.653002A6@FourThought.com> <006b01c10355$f2eff480$7cac1218@reston1.va.home.com>
Message-ID: <002901c103ce$acf26e80$703d64c0@den.xcare.net>

From: "Thomas B. Passin" <tpassin@home.com>
> 4xslt from CVS doesn't run. Here's what I did.  I have pyxml 0.65/python
> 1.5.2 on Windows.  I copied the three directories xslt, xpath, util from the
> CVS on SourceForge, renamed the corresponding 0.65 directories to save them,
> and copied the three CVS directories in their place.  Then I ran my command
> line wrapper for 4xslt.
>
> I get an error, of which this is the salient part:
>
>  File "D:\PROGRA~2\PYTHON\xml\xpath\Conversions.py", line 23, in ?
>     from xml.utils import boolean
> ImportError: cannot import name boolean
>
> There is no file called "boolean" in the CVS, nor does xml\util\__init__.py
> define boolean.  What do I need to make this work?
>

The boolean module is an extension module that should have been made if
'setup.py install' was run.  The source for that extension lives in (from
CVS root) xml/extensions.

--
Jeremy Kloth                              Consultant
jeremy.kloth@fourthought.com              +1 303 583 9900 x 105
Fourthought, Inc.                         http://fourthought.com
4735 East Walnut St, Suite C, Boulder, CO 80301-2537, USA
XML strategy, XML tools (http://4suite.org), knowledge management


From uche.ogbuji@fourthought.com  Tue Jul  3 16:38:04 2001
From: uche.ogbuji@fourthought.com (Uche Ogbuji)
Date: Tue, 03 Jul 2001 09:38:04 -0600
Subject: [XML-SIG] 4xslt bug involving key()
In-Reply-To: Message from "Thomas B. Passin" <tpassin@home.com>
 of "Mon, 02 Jul 2001 20:20:21 EDT." <006b01c10355$f2eff480$7cac1218@reston1.va.home.com>
Message-ID: <200107031538.f63Fc4t09145@localhost.local>

> 4xslt from CVS doesn't run. Here's what I did.  I have pyxml 0.65/python
> 1.5.2 on Windows.  I copied the three directories xslt, xpath, util from the
> CVS on SourceForge, renamed the corresponding 0.65 directories to save them,
> and copied the three CVS directories in their place.  Then I ran my command
> line wrapper for 4xslt.
> 
> I get an error, of which this is the salient part:
> 
>  File "D:\PROGRA~2\PYTHON\xml\xpath\Conversions.py", line 23, in ?
>     from xml.utils import boolean
> ImportError: cannot import name boolean
> 
> There is no file called "boolean" in the CVS, nor does xml\util\__init__.py
> define boolean.  What do I need to make this work?

Weird.  None of this should have changed since the beta.

xml.utils.boolean.so (or .pyd) should have been built with your PyXML build.  
For instance, on my machine:

/usr/local/lib/python2.1/site-packages/_xmlplus/utils/boolean.so

How did you build/install PyXML?

BTW, you'll want the most recent CVS 4Suite (from a few hours ago): important 
fixes.


From law@otelnet.com  Tue Jul  3 16:34:41 2001
From: law@otelnet.com (Katherina Law)
Date: Tue, 3 Jul 2001 08:34:41 -0700
Subject: [XML-SIG] build question
Message-ID: <65E7CA3B34A0D211B65300A0C9E1CF4F065E9D50@bluewhale.otelnet.com>

We have Python 2 running on Sun OS 2.6.  When I try to compile, I get the
following error.  Do I need to have libcurses.so.1?  Where can I find it?

>python setup.py build
ld.so.1: python: fatal: libcurses.so.1: open failed: No such file or
directory
Killed

Many thanks,
Katherina


From Alexandre.Fayolle@logilab.fr  Tue Jul  3 18:38:14 2001
From: Alexandre.Fayolle@logilab.fr (Alexandre Fayolle)
Date: Tue, 3 Jul 2001 19:38:14 +0200 (CEST)
Subject: [XML-SIG] The new version of XPath
Message-ID: <Pine.LNX.4.21.0107031936530.21428-100000@leo.logilab.fr>

Hello,

Just a quick question. The new version of XPath is 8bit character
friendly, and possibly unicode friendly, which is great news as far as I'm
concerned. Is it thread safe ?

Alexandre Fayolle
-- 
http://www.logilab.com 
Narval is the first software agent available as free software (GPL).
LOGILAB, Paris (France).


From uche.ogbuji@fourthought.com  Tue Jul  3 18:46:01 2001
From: uche.ogbuji@fourthought.com (Uche Ogbuji)
Date: Tue, 03 Jul 2001 11:46:01 -0600
Subject: [XML-SIG] The new version of XPath
In-Reply-To: Message from Alexandre Fayolle <Alexandre.Fayolle@logilab.fr>
 of "Tue, 03 Jul 2001 19:38:14 +0200." <Pine.LNX.4.21.0107031936530.21428-100000@leo.logilab.fr>
Message-ID: <200107031746.f63Hk1j09490@localhost.local>

> Just a quick question. The new version of XPath is 8bit character
> friendly, and possibly unicode friendly, which is great news as far as I'm
> concerned.

It's unicode friendly using UTF-8.

> Is it thread safe ?

It should be.  If you find any problems with threading, do let us know.


From jeremy.kloth@fourthought.com  Tue Jul  3 19:46:20 2001
From: jeremy.kloth@fourthought.com (Jeremy Kloth)
Date: Tue, 3 Jul 2001 12:46:20 -0600
Subject: [XML-SIG] The new version of XPath
References: <Pine.LNX.4.21.0107031936530.21428-100000@leo.logilab.fr>
Message-ID: <00da01c103f0$738c6660$703d64c0@den.xcare.net>

From: "Alexandre Fayolle" <Alexandre.Fayolle@logilab.fr>
> Hello,
>
> Just a quick question. The new version of XPath is 8bit character
> friendly, and possibly unicode friendly, which is great news as far as I'm
> concerned. Is it thread safe ?
>

Both the C and pure Python parsers are completely stateless.  The
concurrency issues from before went away when we removed Flex (Bison was
already stateless).



From tpassin@home.com  Wed Jul  4 00:20:14 2001
From: tpassin@home.com (Thomas B. Passin)
Date: Tue, 3 Jul 2001 19:20:14 -0400
Subject: [XML-SIG] 4xslt bug involving key()
References: <200107031538.f63Fc4t09145@localhost.local>
Message-ID: <004801c10416$b6d09880$7cac1218@reston1.va.home.com>

[Uche Ogbuji]

> > There is no file called "boolean" in the CVS, nor does
> > xml\util\__init__.py define boolean.  What do I need to make this work?
>
> Weird.  None of this should have changed since the beta.
>
> xml.utils.boolean.so (or .pyd) should have been built with your PyXML
> build.  For instance, on my machine:
>
> /usr/local/lib/python2.1/site-packages/_xmlplus/utils/boolean.so
>
> How did you build/install PyXML?
>
I installed the Pyxml 0.65 binary for Windows Python 1.5.2.  I did not
install a complete new installation from the CVS.  I also don't own any
Microsoft C compilers and I'm not about to shell out to get one, so any
"setup.py install" that wants to compile something is out of luck.   But the
required file must be pretty simple, right?  Do I have to get the whole CVS
compiled/installed to get the latest version of 4xslt working, or what?

Cheers,

Tom P


From noreply@sourceforge.net  Wed Jul  4 01:12:02 2001
From: noreply@sourceforge.net (noreply@sourceforge.net)
Date: Tue, 03 Jul 2001 17:12:02 -0700
Subject: [XML-SIG] [ pyxml-Bugs-438397 ] truncated content passed to characters()
Message-ID: <E15HaGw-00072Q-00@usw-sf-web1.sourceforge.net>

Bugs item #438397, was opened at 2001-07-03 17:12
You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=106473&aid=438397&group_id=6473

Category: SAX
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Mr. Codepage (codepage)
Assigned to: Nobody/Anonymous (nobody)
Summary: truncated content passed to characters()

Initial Comment:
Parsing a pretty simple 500k xml file.

The bad output lines in question look like

c <--- truncated, should be com.xxxxxx.ejb.domain.intfc
com/xxxxxx/ejb/domain/intfc/AdverseReactionType.java
com.xxxxxx.e <--- truncated
com/xxxxxx/ejb/service/hsif/msgHandler/intfc/HLSevenHan
dler.java

This is an xml file that describes the source pool at 
a certain release point in time.

I rewrote the small script in Java with Xerces and it 
is fine.

The XML file does NOT contain truncated data. If I 
extract the portions of the datafile above that are 
having problems and put it in its own xml file, it 
works fine (with the code below). It is only this 
configuration of the datafile that is truncating the 
value of <b>content</b> passed to characters(). The 
XML file is well formed.

class packageScan(saxutils.DefaultHandler):
	def __init__(self):
		self.showText = 0
		self.grabPath = 0
		self.Path = ""
	def startElement(self, name, attrs):
		if name == "package":
			self.showText = 1
		elif name == "path":
			self.grabPath = 1
	def characters(self, content):
		if self.showText == 1:
			if len(content) < 13:
				print content
				print self.Path
			self.showText = 0
		if self.grabPath == 1:
			self.Path = content
			self.grabPath = 0

python 2.1
pyxml 0.6.5

I would be happy to test any workarounds, patches, etc.
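
One possible explanation, not confirmed in this report: SAX allows a parser to deliver one run of character data in several characters() calls, so a handler that acts on the first call alone can see what looks like truncated text. A minimal sketch of a handler that accumulates the pieces until the element ends, written for the standard library's xml.sax in Python 3; the element name comes from the report, while the class layout and the sample input are assumptions.

```python
import xml.sax
from xml.sax.handler import ContentHandler


class PackageScan(ContentHandler):
    """Collects the complete text of every <package> element."""

    def __init__(self):
        super().__init__()
        self.chunks = None   # list of text pieces while inside <package>
        self.packages = []   # completed package names

    def startElement(self, name, attrs):
        if name == "package":
            self.chunks = []

    def characters(self, content):
        # May be called more than once for a single run of text.
        if self.chunks is not None:
            self.chunks.append(content)

    def endElement(self, name):
        if name == "package":
            self.packages.append("".join(self.chunks))
            self.chunks = None


handler = PackageScan()
parser = xml.sax.make_parser()
parser.setContentHandler(handler)
# Feed the document in two pieces so the text may arrive in separate chunks.
parser.feed(b"<pool><package>com.example.ejb.")
parser.feed(b"domain.intfc</package></pool>")
parser.close()
print(handler.packages)
```

Joining the pieces in endElement() gives the same result whether the parser delivers the text in one call or in many.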


----------------------------------------------------------------------

You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=106473&aid=438397&group_id=6473


From tpassin@home.com  Wed Jul  4 04:43:42 2001
From: tpassin@home.com (Thomas B. Passin)
Date: Tue, 3 Jul 2001 23:43:42 -0400
Subject: [XML-SIG] 4xslt bug involving key()
References: <200107031538.f63Fc4t09145@localhost.local>
Message-ID: <000e01c1043b$85d51380$7cac1218@reston1.va.home.com>

[Uche Ogbuji]

> >
> >  File "D:\PROGRA~2\PYTHON\xml\xpath\Conversions.py", line 23, in ?
> >     from xml.utils import boolean
> > ImportError: cannot import name boolean
> >
> > There is no file called "boolean" in the CVS, nor does
> > xml\util\__init__.py define boolean.  What do I need to make this work?
>
> Weird.  None of this should have changed since the beta.
>
> xml.utils.boolean.so (or .pyd) should have been built with your PyXML
> build.  For instance, on my machine:
>
> /usr/local/lib/python2.1/site-packages/_xmlplus/utils/boolean.so
>
> How did you build/install PyXML?
>
OK, I'm making progress.  My installation on Windows has a boolean.pyd in
both the ft/Lib and ft/extensions directories, both the same file.  This is
the 0.11 version.  I copied that file to  the xml/utils directory so the
xpath script could find it.  Apparently this file is now supposed to be in
xml/utils, not extensions.

Now there is a different failure:

  File "D:\PROGRA~2\PYTHON\xml\xpath\CoreFunctions.py", line 21, in ?
    from xml.xpath import Util, Conversions
  File "D:\PROGRA~2\PYTHON\xml\xpath\Conversions.py", line 179, in ?
    _strConversions = {
AttributeError: BooleanType

I looked at the boolean.h and boolean.c files in the cvs, and they contain
PyBoolean_Type, not PyBooleanType.  There is no string BooleanType.  Also I
looked at my boolean.pyd with a hex editor, and it doesn't contain
BooleanType or Boolean_Type at all.

Right now, it looks like several things are happening:

1) boolean.pyd (or .so, I guess) is expected by xpath.Conversions to be in
xml\utils, but it's in extensions\ in the cvs.

2) It seems that xpath.Conversions now expects objects of type BooleanType,
but boolean.pyd/.so thinks it should be called Boolean_Type.

3) It looks like Boolean_Type and BooleanType were not used in the 0.11
version of 4suite.

Perhaps these are all incorrect deductions, someone please enlighten me.

Anyway, I can't use the new versions in cvs until someone makes a 1.52
binary version for Windows.  Would someone be willing to do that?

Cheers,

Tom P


From b.fathi@gmx.net  Wed Jul  4 10:39:29 2001
From: b.fathi@gmx.net (Bijan Fathi)
Date: Wed, 4 Jul 2001 11:39:29 +0200 (MEST)
Subject: [XML-SIG] illigal character encoding bug in minidom + patch
Message-ID: <8659.994239569@www33.gmx.net>

This is a MIME encapsulated multipart message -
please use a MIME-compliant e-mail program to open it.

Dies ist eine mehrteilige Nachricht im MIME-Format -
bitte verwenden Sie zum Lesen ein MIME-konformes Mailprogramm.

--========GMXBoundary8659994239569
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit

Category: DOM/Minidom
Group: None
Status: Solved/Patch supplied
Resolution: None
Priority: 4
Submitted By: Bijan Fathi (b.fathi@gmx.net)
#Assigned to: Nobody/Anonymous (nobody)
Summary: illegal characters have not been escaped

Initial Comment:

Characters with codes above 127 were written to the XML file by Text,
but minidom could not open the XML file because it contained illegal
characters.  (MS IE did not treat it as well formed either.)

The supplied patch escapes all characters above 127 (including Unicode)
in ordinary hex character reference notation (&#xnnnn;).

Of course this is only the representation in the XML file; after loading
the file the data is represented correctly as Unicode or 8-bit characters.

python 2.0
pyxml 0.6.5

I would be thankful if you would apply this patch to minidom.py
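
The escaping described above can be sketched as follows. This is a minimal illustration in modern Python 3, not the attached patch itself: the function name and the sample text are assumptions, and the markup characters & and < are not handled here.

```python
from xml.dom import minidom


def escape_non_ascii(data):
    """Replace every character above code 127 with a hex character reference."""
    pieces = []
    for ch in data:
        if ord(ch) > 127:
            pieces.append("&#x%04x;" % ord(ch))
        else:
            pieces.append(ch)
    return "".join(pieces)


# Escape on output, then let the parser turn the references back into text.
escaped = escape_non_ascii("caf\u00e9 \u20ac5")
document = minidom.parseString("<price>%s</price>" % escaped)
restored = document.documentElement.firstChild.data
print(escaped)
```

After parsing, the character references become ordinary characters again, so the escaping only affects the serialized file.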

-- 

Bijan Fathi  <b.fathi@gmx.net>

--========GMXBoundary8659994239569
Content-Type: text/plain; name="charenc-bug.patch"
Content-Transfer-Encoding: base64
Content-Disposition: attachment; filename="charenc-bug.patch"

MjY4YTI2OSwyNzMKPiAgICAgZGF0YXRtcCA9IGRhdGEKPiAgICAgZGF0YSA9ICIiCj4gICAgIGZv
ciBpIGluIGRhdGF0bXA6Cj4gICAgICAgICBpZiBvcmQoaSkgPiAxMjIgOiAgZGF0YSA9IGRhdGEg
KyAiJiN4JTA0eDsiICUgb3JkKGkpCQo+ICAgICAgICAgZWxzZSA6IAkJCSAgIGRhdGEgPSBkYXRh
ICsgaQo=
--========GMXBoundary8659994239569--


From noreply@sourceforge.net  Wed Jul  4 13:25:29 2001
From: noreply@sourceforge.net (noreply@sourceforge.net)
Date: Wed, 04 Jul 2001 05:25:29 -0700
Subject: [XML-SIG] [ pyxml-Bugs-438514 ] syntax error on xml.dom.ext.__init__
Message-ID: <E15Hlij-0000py-00@usw-sf-web1.sourceforge.net>

Bugs item #438514, was opened at 2001-07-04 05:25
You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=106473&aid=438514&group_id=6473

Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Alexandre Fayolle (afayolle)
Assigned to: Nobody/Anonymous (nobody)
Summary: syntax error on xml.dom.ext.__init__

Initial Comment:
Using the latest version of PyXML from CVS (just did an
update), I got a SyntaxError on xml.dom.ext

  File "/home/alf/Narval/narval/lib.py", line 32, in ?
    from xml.dom.ext import Print, PrettyPrint, StripXml
  File "/home/alf/lib/python/_xmlplus/dom/ext/__init__.py", line 285
    elif attr.namespaceURI:


Here's a patch which fixes this indentation problem.

--- __init__.py~        Sat Jun 23 19:11:08 2001
+++ __init__.py Wed Jul  4 14:25:00 2001
@@ -282,7 +282,7 @@
                         nss[''] = attr.value
                     else:
                         nss[attr.localName] = attr.value
-            elif attr.namespaceURI:
-                nss[attr.prefix] = attr.namespaceURI
+                elif attr.namespaceURI:
+                    nss[attr.prefix] = attr.namespaceURI
             SeekNss(child, nss)
     return nss


Cheers 

Alexandre Fayolle
Logilab

----------------------------------------------------------------------

You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=106473&aid=438514&group_id=6473


From noreply@sourceforge.net  Fri Jul  6 05:17:45 2001
From: noreply@sourceforge.net (noreply@sourceforge.net)
Date: Thu, 05 Jul 2001 21:17:45 -0700
Subject: [XML-SIG] [ pyxml-Bugs-438967 ] indentation error in current cvs
Message-ID: <E15IN3p-0001JW-00@usw-sf-web3.sourceforge.net>

Bugs item #438967, was opened at 2001-07-05 21:17
You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=106473&aid=438967&group_id=6473

Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Gregory P. Smith (greg)
Assigned to: Nobody/Anonymous (nobody)
Summary: indentation error in current cvs

Initial Comment:
After python setup.py install I get a "bad syntax" on
line 285 of xml/dom/ext/__init__.py in SeekNss(). 
Looks like an indentation error.  The elif lines up with
the for when it should be indented four more spaces to line up
with the if.


----------------------------------------------------------------------

You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=106473&aid=438967&group_id=6473


From noreply@sourceforge.net  Fri Jul  6 13:37:49 2001
From: noreply@sourceforge.net (noreply@sourceforge.net)
Date: Fri, 06 Jul 2001 05:37:49 -0700
Subject: [XML-SIG] [ pyxml-Bugs-439031 ] startEntity/endEntity event
Message-ID: <E15IUrl-00037s-00@usw-sf-web1.sourceforge.net>

Bugs item #439031, was opened at 2001-07-06 05:37
You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=106473&aid=439031&group_id=6473

Category: xmlproc
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Nobody/Anonymous (nobody)
Assigned to: Nobody/Anonymous (nobody)
Summary: startEntity/endEntity event

Initial Comment:
In the following example no startEntity/endEntity 
event (LexicalHandler) occurred:

mail.xml:
<?xml version="1.0"?>
<!DOCTYPE mail SYSTEM "k:\testfiles\mail.dtd" [
<!ENTITY henning "hb@ix.heise.de">  
]>
<mail>
        <Recipient>  &henning;                        </Recipient>
        <Sender>     &ingo;                           </Sender>
        <Date>       Mon, 21 Apr 1997 09:27:55 +0200  </Date>
        <Subject>    XML literature                   </Subject>

</mail>

mail.dtd:
<!ELEMENT mail  (Recipient, Sender,
                         Date, Subject)      >
<!ELEMENT Sender        (#PCDATA)       >
<!ELEMENT Recipient     (#PCDATA)       >
<!ELEMENT Date          (#PCDATA)       >
<!ELEMENT Subject       (#PCDATA)       >
<!ENTITY ingo    "Ingo.Macherius@tu-clausthal.de" >

----------------------------------------------------------------------

You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=106473&aid=439031&group_id=6473


From geert@boskant.nl  Sun Jul  8 16:05:50 2001
From: geert@boskant.nl (Geert Jansen)
Date: Sun, 8 Jul 2001 17:05:50 +0200
Subject: [XML-SIG] DocumentFragment bug in minidom
Message-ID: <POEGJAKPNOCGNJMONMFKIEPPCBAA.geert@boskant.nl>

Hi!

While playing around with minidom and DocumentFragments, I ran across a
small bug in the handling of DocumentFragments. (I'm using vanilla Python
2.1)

When you're adding a DocumentFragment to a node with Node.appendChild(), this
is supposed to add all children of the DocumentFragment to the Node. I
noticed however that when the DocumentFragment has more than one node, its
_last_ node is skipped.

Looking through the sources, the problem seems to be caused in minidom.py,
lines 140-141:

   def appendChild(self, node):
        if node.nodeType == self.DOCUMENT_FRAGMENT_NODE:
            for c in node.childNodes:
                self.appendChild(c)
            ### The DOM does not clearly specify what to return in this case
            return node

The call "self.appendChild(c)" changes the list node.childNodes under our
feet, because it tries to remove the child from its parent. This apparently
works out in such a way that the iteration of node.childNodes skips the last
element.
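
The effect can be reproduced with plain lists. A minimal sketch, using ordinary Python lists as a stand-in for childNodes (this is not minidom code, and the function names are made up for illustration): removing items from a list while iterating over it makes the loop skip entries, and iterating over a copy avoids that.

```python
def move_all_buggy(src, dst):
    """Iterates over src while removing from it, like the original loop."""
    for item in src:
        src.remove(item)
        dst.append(item)


def move_all_fixed(src, dst):
    """Iterates over a snapshot of src, so removals cannot disturb the loop."""
    for item in src[:]:
        src.remove(item)
        dst.append(item)


buggy_src, buggy_dst = ["a", "b"], []
move_all_buggy(buggy_src, buggy_dst)

fixed_src, fixed_dst = ["a", "b"], []
move_all_fixed(fixed_src, fixed_dst)

print(buggy_dst, buggy_src)
print(fixed_dst, fixed_src)
```

With two items the buggy loop moves only the first one and leaves the last behind, which matches the behaviour described above.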

With the patch below, appendChild() does work as expected with
DocumentFragments.

--- minidom.py.old      Sat Jul  7 15:42:51 2001
+++ minidom.py  Sat Jul  7 15:48:59 2001
@@ -137,7 +137,9 @@

     def appendChild(self, node):
         if node.nodeType == self.DOCUMENT_FRAGMENT_NODE:
-            for c in node.childNodes:
+            # Make a copy of childNodes as appendChild() will change it.
+            children = [ c for c in node.childNodes ]
+            for c in children:
                 self.appendChild(c)
             ### The DOM does not clearly specify what to return in this case
             return node


Can this patch be applied? Please CC me in replies, as I'm not subscribed to
the list.

Greetings,
Geert Jansen


From rsalz@zolera.com  Sat Jul  7 19:18:32 2001
From: rsalz@zolera.com (Rich Salz)
Date: Sat, 07 Jul 2001 14:18:32 -0400
Subject: [XML-SIG] DocumentFragment bug in minidom
References: <POEGJAKPNOCGNJMONMFKIEPPCBAA.geert@boskant.nl>
Message-ID: <3B475278.DC93921B@zolera.com>

> +            # Make a copy of childNodes as appendChild() will change it.
> +            children = [ c for c in node.childNodes ]
> +            for c in children:

Probably better to write it this way -- more clear, works in 1.5:
		for c in node.childNodes[:]:

	/r$
-- 
Zolera Systems, Securing web services (XML, SOAP, Signatures, Encryption)
http://www.zolera.com


From brian@sweetapp.com  Sat Jul  7 20:27:13 2001
From: brian@sweetapp.com (Brian Quinlan)
Date: Sat, 7 Jul 2001 12:27:13 -0700
Subject: [XML-SIG] Pyana (a Python interface to the Xalan XSLT engine) 0.1.0 released
Message-ID: <000a01c1071a$d37922c0$445d4540@D1XYVL01>

Windows binaries (you can get the source from CVS) for Pyana 0.1.0 have
been released and are available at:

http://sourceforge.net/project/showfiles.php?group_id=28142

It fixes a lot of bugs and introduces the experimental ability to extend
the Xalan XPath engine with Python functions.

Here is a simple example:

import Pyana

def sum(*args):
    """Compute the sum of all arguments"""
    s = 0
    for i in args:
        s += i
    return s

Pyana.install( 'exampleNS', sum, 'sum' )

inputExampleXSL = r'''
<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
xmlns:py="exampleNS" version="1.0">
    <xsl:output method="text"/>
    <xsl:template match="message"><xsl:value-of
select="py:sum(1,2,3,4,5)"/></xsl:template>
</xsl:stylesheet>
'''

inputExampleXML = r'''
<message>ignored</message>
'''

print Pyana.transform(inputExampleXML, inputExampleXSL) # => '15'


From Alexandre.Fayolle@logilab.fr  Tue Jul 10 08:15:04 2001
From: Alexandre.Fayolle@logilab.fr (Alexandre Fayolle)
Date: Tue, 10 Jul 2001 09:15:04 +0200 (CEST)
Subject: [XML-SIG] Semantext
Message-ID: <Pine.LNX.4.21.0107100912001.12514-100000@leo.logilab.fr>

This has just arrived from comp.lang.python.announce:

---------------------------8<---------------------------
The 0.72 release of SemanText has just been posted at
http://www.semantext.com/

Among the new features are:

* Context-based harvesting - This allows topics and associations to be
automatically constructed from XML documents by identifying specific
information to be harvested.

* Full topic map maintenance capability - Topics, associations,
occurrences, and facets can be added, modified and deleted via the
SemanText interface.

* Choice of look-and-feel - A classic web browser style of interface or a
push-button style of interface.

* Choice of view - Users can select whether to look at the information
from a topic map point of view (only the information contained in the
topic map) or a knowledge base point of view (information based on
interpreting the topic map or generated by the inference rules).

* XTM export - Topic maps can be exported in accordance with the new XTM
specification.

About SemanText

SemanText is a prototype application developed, using Python, to
demonstrate how the topic map standard (ISO/IEC 13250:2000) and XML Topic
Maps (XTM) can be used to represent the knowledge contained within
documents by building semantic networks. Semantic networks are a building
block for artificial intelligence applications such as inference engines
and expert systems.

<!-- ****************************************************************
Eric Freese                                    Email: eric@isogen.com
Senior Consultant                              Voice:    651 636 9180
ISOGEN International/DataChannel               Fax:      651 636 9191
1611 West County Road B - Suite 204            WWW:    www.isogen.com
St. Paul, MN 55113                                www.datachannel.com
***************************************************************** -->

----------------------------8<----------------------------------------------

Alexandre Fayolle
-- 
LOGILAB, Paris (France).
http://www.logilab.com   http://www.logilab.fr  http://www.logilab.org
Narval, the first software agent available as free software (GPL).


From faassen@vet.uu.nl  Tue Jul 10 14:05:36 2001
From: faassen@vet.uu.nl (Martijn Faassen)
Date: Tue, 10 Jul 2001 15:05:36 +0200
Subject: [XML-SIG] XPath and Zope's ParsedXML DOM
Message-ID: <20010710150536.A26286@vet.uu.nl>

Hi there,

I've been trying to make XPath work with Zope's DOM implementation,
ParsedXML. In the process I've discovered some incompatibilities in
XPath that I had to hack around.

The problem is as follows. ParsedXML uses DOM nodes that are 
descendants of ExtensionClass. This means that type(node) != types.InstanceType.

Conversions.py depends on this in several places, however. After hacking
around them XPath works better for me.

Here are the two places where I had to hack:

The function CoreStringValue has this line:

result = _strConversions.get(type(object), _strUnknown)(object)

but, since instances now don't trigger the InstanceType key in 
_strConversions, this fails and returns None for valid instances. I've hacked 
around this by doing the following:

if hasattr(object, 'ownerDocument'):
    result = _strInstance(object)
else:
    result = _strConversions.get(type(object), _strUnknown)(object)

I'm not sure if this succeeds in all cases and it's a hack. I'll study
ExtensionClasses to see if there may be a better way.

The other hack is similar and involves the types.ListType entry in the
_strConversions dictionary. Again the lookup that takes place in the
value lambda fails due to ExtensionClass.

Regards,

Martijn


From jeremy.kloth@fourthought.com  Tue Jul 10 20:07:41 2001
From: jeremy.kloth@fourthought.com (Jeremy Kloth)
Date: Tue, 10 Jul 2001 13:07:41 -0600
Subject: [XML-SIG] XPath and Zope's ParsedXML DOM
References: <20010710150536.A26286@vet.uu.nl>
Message-ID: <005501c10973$981fe280$703d64c0@den.xcare.net>

From: "Martijn Faassen" <faassen@vet.uu.nl>
> Hi there,
>
> I've been trying to make XPath work with Zope's DOM implementation,
> ParsedXML. In the process I've discovered some incompatibilities in
> XPath that I had to hack around.
>
> The problem is as follows. ParsedXML uses DOM nodes that are
> descendants of ExtensionClass. This means that type(node) !=
> types.InstanceType.
>

Instead of doing the check every time, I implemented a more lazy approach to
it.  Additionally, the performance hit happens only the first time through.

def _strUnknown(object):
    # Allow for non-instance DOM node objects
    if hasattr(object, 'nodeType'):
        # Add this type to the mapping for next time through
        _strConversions[type(object)] = _strInstance
        return _strInstance(object)
    return

and change the types.ListType entry in _strConversions to:

    types.ListType : lambda x: x and _strConversions.get(type(x[0]),
                                                         _strUnknown)(x[0]) or '',

--
Jeremy Kloth                              Consultant
jeremy.kloth@fourthought.com              +1 303 583 9900 x 105
Fourthought, Inc.                         http://fourthought.com
4735 East Walnut St, Boulder, CO  80301, USA
XML strategy, XML tools (http://4suite.org), knowledge management


From faassen@vet.uu.nl  Tue Jul 10 22:59:31 2001
From: faassen@vet.uu.nl (Martijn Faassen)
Date: Tue, 10 Jul 2001 23:59:31 +0200
Subject: [XML-SIG] XPath and Zope's ParsedXML DOM
In-Reply-To: <005501c10973$981fe280$703d64c0@den.xcare.net>
References: <20010710150536.A26286@vet.uu.nl> <005501c10973$981fe280$703d64c0@den.xcare.net>
Message-ID: <20010710235931.A28790@vet.uu.nl>

Jeremy Kloth wrote:
> Instead of doing the check every time, I implemented a more lazy approach to
> it.  Additionally, the performance hit happens only the first time through.

[snip source]

Sweet! I'll be playing some more with XPath and ParsedXML next week, when I'm
back from a (Zope) conference.

Thanks,

Martijn


From uche.ogbuji@fourthought.com  Tue Jul 10 23:05:08 2001
From: uche.ogbuji@fourthought.com (Uche Ogbuji)
Date: Tue, 10 Jul 2001 16:05:08 -0600
Subject: [XML-SIG] XPath and Zope's ParsedXML DOM
In-Reply-To: Message from Martijn Faassen <faassen@vet.uu.nl>
 of "Tue, 10 Jul 2001 15:05:36 +0200." <20010710150536.A26286@vet.uu.nl>
Message-ID: <200107102205.f6AM58g04686@localhost.local>

> Hi there,
> 
> I've been trying to make XPath work with Zope's DOM implementation,
> ParsedXML. In the process I've discovered some incompatibilities in
> XPath that I had to hack around.
> 
> The problem is as follows. ParsedXML uses DOM nodes that are 
> descendants of ExtensionClass. This means that type(node) != types.InstanceType.
> 
> Conversions.py depends on this in several places, however. After hacking
> around them XPath works better for me.
> 
> Here are the two places where I had to hack:
> 
> The function CoreStringValue has this line:
> 
> result = _strConversions.get(type(object), _strUnknown)(object)
> 
> but, since instances now don't trigger the InstanceType key in 
> _strConversions, this fails and returns None for valid instances. I've hacked 
> around this by doing the following:
> 
> if hasattr(object, 'ownerDocument'):
>     result = _strInstance(object)
> else:
>     result = _strConversions.get(type(object), _strUnknown)(object)
> 
> I'm not sure if this succeeds in all cases and it's a hack. I'll study
> ExtensionClasses to see if there may be a better way.
> 
> The other hack is similar and involves the types.ListType entry in the
> _strConversions dictionary. Again the lookup that takes place in the
> value lambda fails due to ExtensionClass.

Thanks.  Karl Anderson has pointed out all these issues, and they are on the 
docket to fix, but we haven't had a chance yet.

Thanks for the fixes you offer, but you are right that they are problematic in 
the general case.  If you do find features of ExtensionClass that make for a 
better fix, please let us know and we'll give them a try.

Thanks.


-- 
Uche Ogbuji                               Principal Consultant
uche.ogbuji@fourthought.com               +1 303 583 9900 x 101
Fourthought, Inc.                         http://Fourthought.com 
4735 East Walnut St, Boulder, CO 80301-2537, USA
XML strategy, XML tools (http://4Suite.org), knowledge management


From uche.ogbuji@fourthought.com  Tue Jul 10 23:18:26 2001
From: uche.ogbuji@fourthought.com (Uche Ogbuji)
Date: Tue, 10 Jul 2001 16:18:26 -0600
Subject: [XML-SIG] XPath and Zope's ParsedXML DOM
In-Reply-To: Message from Uche Ogbuji <uche.ogbuji@fourthought.com>
 of "Tue, 10 Jul 2001 16:05:08 MDT." <200107102205.f6AM58g04686@localhost.local>
Message-ID: <200107102218.f6AMIQJ04717@localhost.local>

> Thanks.  Karl Anderson has pointed out all these issues, and they are on the 
> docket to fix, but we haven't had a chance yet.

Never mind.  Looks as if Jeremy has it sorted out.

--Uche


From noreply@sourceforge.net  Wed Jul 11 14:08:04 2001
From: noreply@sourceforge.net (noreply@sourceforge.net)
Date: Wed, 11 Jul 2001 06:08:04 -0700
Subject: [XML-SIG] [ pyxml-Bugs-440396 ] 4Suite and PyXML DOMs differ.
Message-ID: <E15KJim-0005sp-00@usw-sf-web2.sourceforge.net>

Bugs item #440396, was opened at 2001-07-11 06:08
You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=106473&aid=440396&group_id=6473

Category: 4Suite
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Romain Slootmaekers (evilsloot)
Assigned to: Nobody/Anonymous (nobody)
Summary: 4Suite and PyXML DOMs differ.

Initial Comment:
XML Document and Domlette objects are not 
interchangeble for the xml.xslt.Processor api.
(versions: 4Suite-0.11.1b2, PyXML-0.6.5 and Python 
2.1)
I included a small example program (30 or so lines) 
that fully demonstrates the problem.

have fun,
Sloot.


----------------------------------------------------------------------

You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=106473&aid=440396&group_id=6473


From noreply@sourceforge.net  Thu Jul 12 09:31:59 2001
From: noreply@sourceforge.net (noreply@sourceforge.net)
Date: Thu, 12 Jul 2001 01:31:59 -0700
Subject: [XML-SIG] [ pyxml-Patches-440604 ] ns_parse.py and bookmark.py patch
Message-ID: <E15Kbt9-0001mJ-00@usw-sf-web1.sourceforge.net>

Patches item #440604, was opened at 2001-07-12 01:31
You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=306473&aid=440604&group_id=6473

Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Nobody/Anonymous (nobody)
Assigned to: Nobody/Anonymous (nobody)
Summary: ns_parse.py and bookmark.py patch

Initial Comment:
This tiny patch fixes a lot of problems (missing
descriptions, separators, ...) I had when I tried to
generate an XBEL file from my netscape bookmarks. It
now includes all information available in the netcrap
bookmark file in the result. I'M NOT A PYTHON HACKER,
so please excuse the bad quality.

The patch is against PyXML-0.6.5.

----------------------------------------------------------------------

You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=306473&aid=440604&group_id=6473


From nicoml@webmails.com  Thu Jul 12 11:03:46 2001
From: nicoml@webmails.com (Nicolas Villetard)
Date: Thu, 12 Jul 2001 11:3:46 +0100
Subject: [XML-SIG] Xml query language for Python
Message-ID: <20010712090346.29784.qmail@webmails.com>

I have to run queries against a fairly large XML database (up to 5 MB)
for an application written in Python 2.1.

I also need a fairly capable query language (I'd like it to do more
than pattern matching).

Does anybody know which of these XML query languages are supported?
(XML-QL, YATL, Lorel, XQL, XML-RPC, ...)
In which libraries ?

You can also send me your suggestions about this topic.

Thanks

____________________________________________________________________
- http://www.WebMailSPro.com - >> 
YOUR ad-free email service with YOUR own domain name


From hungjunglu@yahoo.com  Fri Jul 13 00:23:03 2001
From: hungjunglu@yahoo.com (Hung Jung Lu)
Date: Thu, 12 Jul 2001 16:23:03 -0700 (PDT)
Subject: [XML-SIG] SAX  with DTD
Message-ID: <20010712232303.61912.qmail@web12607.mail.yahoo.com>

Hi,

I am new to XML in Python. I have a few questions.

(1) I have read that Expat is non-validating. Does it
mean that it ignores DTD completely?

(2) I have a DTD that specifies default attributes
(via #FIXED) of an XML document. Is there some parser
(DOM preferred, SAX ok) in Python that can take into
account the attributes specified in DTD? I tried
xml.dom.minidom and as one would guess, it does not do
anything with DTD. What's a good XML parser in Python
that builds DOM with DTD information? 

(3) If the above is not available in Python (DOM with
DTD), is there any simple downloadable example out
there of some SAX parser that uses both XML and DTD?
I've read a bit about xmlproc, xmlval, but is there
any simple example code?

thanks,

Hung Jung


__________________________________________________
Do You Yahoo!?
Get personalized email addresses from Yahoo! Mail
http://personal.mail.yahoo.com/


From fdrake@acm.org  Fri Jul 13 00:29:02 2001
From: fdrake@acm.org (Fred L. Drake, Jr.)
Date: Thu, 12 Jul 2001 19:29:02 -0400 (EDT)
Subject: [XML-SIG] SAX  with DTD
In-Reply-To: <20010712232303.61912.qmail@web12607.mail.yahoo.com>
References: <20010712232303.61912.qmail@web12607.mail.yahoo.com>
Message-ID: <15182.12990.22102.223577@cj42289-a.reston1.va.home.com>

Hung Jung Lu writes:
 > (1) I have read that Expat is non-validating. Does it
 > mean that it ignores DTD completely?

  Yes, pretty much.  If you use Expat 1.95+ (see
expat.sourceforge.net), then you can coerce Expat into reading the
DTD and reporting what's in it, but it won't perform validation.
You certainly could use that to pick up the default values of
attributes, however.
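
  For the record, here is a small sketch of that with the expat module
from the standard library.  It assumes the DTD is an internal subset,
which Expat reads without extra setup; an external DTD would need more
wiring.  The document and the handler names are made up.

```python
import xml.parsers.expat

DOC = b"""<?xml version="1.0"?>
<!DOCTYPE note [
  <!ELEMENT note (#PCDATA)>
  <!ATTLIST note lang CDATA #FIXED "en">
]>
<note>hello</note>
"""

declared = []   # (element, attribute, default) as declared in the DTD
elements = []   # (element, attributes) as reported for the document


def on_attlist(elname, attname, type_, default, required):
    # Called once per attribute declared in an <!ATTLIST ...>.
    declared.append((elname, attname, default))


def on_start(name, attrs):
    # attrs already contains the defaults Expat took from the DTD.
    elements.append((name, dict(attrs)))


parser = xml.parsers.expat.ParserCreate()
parser.AttlistDeclHandler = on_attlist
parser.StartElementHandler = on_start
parser.Parse(DOC, True)
```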

 > (2) I have a DTD that specifies default attributes
 > (via #FIXED) of an XML document. Is there some parser
 > (DOM preferred, SAX ok) in Python that can take into
 > account the attributes specified in DTD? I tried

  I suspect xmlproc can be used to build a DOM like this.  I've used
the latest versions of Expat to do this as well, but that DOM requires
the acquisition machinery in Zope to work.  It shouldn't be too hard
to adapt that code to build a minidom DOM, but I've not had time to do
so.  You are free to work on that if you like.


  -Fred

-- 
Fred L. Drake, Jr.  <fdrake at acm.org>
PythonLabs at Digital Creations


From jhe@webde-ag.de  Fri Jul 13 10:10:06 2001
From: jhe@webde-ag.de (Juergen Hermann)
Date: Fri, 13 Jul 2001 11:10:06 +0200
Subject: [XML-SIG] SAX  with DTD
In-Reply-To: <20010712232303.61912.qmail@web12607.mail.yahoo.com>
Message-ID: <m15Kyxa-007oGhC@smtp.web.de>

On Thu, 12 Jul 2001 16:23:03 -0700 (PDT), Hung Jung Lu wrote:

>(2) I have a DTD that specifies default attributes
>(via #FIXED) of an XML document. Is there some parser
>(DOM preferred, SAX ok) in Python that can take into
>account the attributes specified in DTD?

This is a full SAX example:

import os
import sys

import xml.sax
import xml.sax.saxutils
import xml.sax.handler
import xml.sax.sax2exts
 
class CopsConfigHandler(xml.sax.saxutils.DefaultHandler): 
 
    xmlns_copscfg = u'http://www.cinetic.de/2000/COPS/Config' 
    _debug = 0 
 
    def __init__(self, configfile): 
        self.configfile = 'file://' + os.path.abspath(configfile) 
        self.params = {} 
        self.in_parameters = 0 
 
        # create parser 
        parser = xml.sax.sax2exts.XMLValParserFactory.make_parser() 
        ##print '+++ parser is', parser 
        parser.setFeature(xml.sax.handler.feature_namespaces, 1) 
        parser.setFeature(xml.sax.handler.feature_validation, 0) 
        parser.setFeature(xml.sax.handler.feature_external_ges, 1) 
        parser.setFeature(xml.sax.handler.feature_external_pes, 1) 
 
        # set handlers 
        parser.setContentHandler(self) 
        parser.setDTDHandler(self) 
        if not self._debug: 
            # no tracebacks, print error msg only! 
            parser.setErrorHandler(self) 
        parser.setEntityResolver(self) 
 
        # parse the XML into events 
        parser.parse(self.configfile) 
 
    ### error handler events 
    def error(self,exception): 
        raise exception
 
    def fatalError(self,exception): 
        raise exception

    def warning(self,exception): 
        sys.stderr.write("*** warning %s\n" % (str(exception),)) 
 
    ### document handler events 
    def startElementNS(self, name, qname, attrs): 
        if name[0] == self.xmlns_copscfg: 
            ##print name, qname, attrs.items() 
            if name[1] == "parameters": 
                self.in_parameters = 1 
            elif self.in_parameters and name[1] == "param": 
                ##print '+++ attrs', attrs._attrs 
                ##print '+++ qnames', attrs._qnames 
                name = attrs.getValueByQName('name') 
                value = attrs.getValueByQName('value') 
                self.params[name] = value 
 
    def endElementNS(self, name, qname): 
        if name[0] == self.xmlns_copscfg: 
            if name[1] == "parameters": 
                self.in_parameters = 0 
 
if __name__ == "__main__": 
    copsconfig = CopsConfigHandler(os.path.join('conf', 'cops-config.xml')) 
    keys = copsconfig.params.keys() 
    keys.sort() 
    klen = reduce(max, map(len, keys), 0) 
    for key in keys: 
        print key.ljust(klen), repr(copsconfig.params[key]) 
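
A much smaller variant of the same idea, for readers without PyXML: the
sketch below uses only the xml.sax package that ships with Python and an
internal DTD subset. It does not validate. The element names and
ParamHandler are invented for the example.

```python
import xml.sax

CONFIG = b"""<?xml version="1.0"?>
<!DOCTYPE parameters [
  <!ELEMENT parameters (param*)>
  <!ELEMENT param EMPTY>
  <!ATTLIST param name  CDATA #REQUIRED
                  value CDATA "unset">
]>
<parameters>
  <param name="host" value="localhost"/>
  <param name="port"/>
</parameters>
"""


class ParamHandler(xml.sax.ContentHandler):
    def __init__(self):
        xml.sax.ContentHandler.__init__(self)
        self.params = {}

    def startElement(self, name, attrs):
        if name == "param":
            # attrs includes defaults the parser took from the DTD
            self.params[attrs["name"]] = attrs["value"]


handler = ParamHandler()
xml.sax.parseString(CONFIG, handler)
```

The second param element has no value attribute in the document, so
whatever ends up in handler.params for "port" comes from the DTD default.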


From larsga@garshol.priv.no  Fri Jul 13 10:30:42 2001
From: larsga@garshol.priv.no (Lars Marius Garshol)
Date: 13 Jul 2001 11:30:42 +0200
Subject: [XML-SIG] SAX  with DTD
In-Reply-To: <20010712232303.61912.qmail@web12607.mail.yahoo.com>
References: <20010712232303.61912.qmail@web12607.mail.yahoo.com>
Message-ID: <m3y9ptdvlp.fsf@lambda.garshol.priv.no>

* Hung Jung Lu
| 
| (2) I have a DTD that specifies default attributes (via #FIXED) of
| an XML document. Is there some parser (DOM preferred, SAX ok) in
| Python that can take into account the attributes specified in DTD?

xmlproc does this, and there is a SAX driver for it, so that you can
access it as a SAX parser. The DOM implementations use SAX to build
their DOM trees, so you can use xmlproc to build your DOMs.

[larsga@pc36 project]$ python2.1
Python 2.1 (#1, May  5 2001, 06:49:59) 
[GCC 2.95.1 19990816/Linux (release)] on linux2
Type "copyright", "credits" or "license" for more information.
>>> from xml.dom.ext.reader.Sax2 import Reader
>>> r = Reader(1)
>>> doc = r.fromStream(open("engine-plan.xml"))
>>> doc
<XML Document at 836d55c>

The 1 argument to Reader tells it to use a validating parser, so it
will do much the same as Jürgen's example, except with the DOM rather
than SAX. It uses xmlproc at the moment, because that's the only
validating parser we have.

--Lars M.


From hungjunglu@yahoo.com  Fri Jul 13 15:12:37 2001
From: hungjunglu@yahoo.com (Hung Jung Lu)
Date: Fri, 13 Jul 2001 07:12:37 -0700 (PDT)
Subject: [XML-SIG] SAX  with DTD
In-Reply-To: <m3y9ptdvlp.fsf@lambda.garshol.priv.no>
Message-ID: <20010713141237.50998.qmail@web12605.mail.yahoo.com>

Cool. It works! Yours is probably the shortest way of
attaching attributes from DTD to XML. Result can be
seen by

from xml.dom.ext import PrettyPrint
PrettyPrint(doc)

I did try xmlproc directly, too. More coding for the
handlers, but I guess it's good if one wants to
convert XML directly into Python objects instead of
going through DOM.

Thanks everyone!

Hung Jung

--- Lars Marius Garshol <larsga@garshol.priv.no>
wrote:
> 
> * Hung Jung Lu
> | 
> | (2) I have a DTD that specifies default attributes
> (via #FIXED) of
> | an XML document. Is there some parser (DOM
> preferred, SAX ok) in
> | Python that can take into account the attributes
> specified in DTD?
> 
> xmlproc does this, and there is a SAX driver for it,
> so that you can
> access it as a SAX parser. The DOM implementations
> use SAX to build
> their DOM trees, so you can use xmlproc to build
> your DOMs.
> 
> [larsga@pc36 project]$ python2.1
> Python 2.1 (#1, May  5 2001, 06:49:59) 
> [GCC 2.95.1 19990816/Linux (release)] on linux2
> Type "copyright", "credits" or "license" for more
> information.
> >>> from xml.dom.ext.reader.Sax2 import Reader
> >>> r = Reader(1)
> >>> doc = r.fromStream(open("engine-plan.xml"))
> >>> doc
> <XML Document at 836d55c>
> 
> The 1 argument to Reader tells it to use a
> validating parser, so it
> will do much the same as Jürgen's example, except
> with the DOM rather
> than SAX. It uses xmlproc at the moment, because
> that's the only
> validating parser we have.
> 
> --Lars M.
> 
> 
> _______________________________________________
> XML-SIG maillist  -  XML-SIG@python.org
> http://mail.python.org/mailman/listinfo/xml-sig


From dkuhlman@cutter.rexx.com  Fri Jul 13 21:29:15 2001
From: dkuhlman@cutter.rexx.com (Dave Kuhlman)
Date: Fri, 13 Jul 2001 13:29:15 -0700
Subject: [XML-SIG] Python wrappers for libxml and libxslt
Message-ID: <20010713132914.A20340@cutter.rexx.com>

I've implemented wrappers for the parser in libxml2 and simple
wrappers for the top level functionality in libxslt.  You can learn
more about libxml and libxslt at:

    http://xmlsoft.org

And you can find my Python wrappers at:

   *** Caution -- This is alpha-ware.  Use at your own risk. ***

    SAX interface:
        http://www.rexx.com/~dkuhlman/libxml_saxlib.html
        http://www.rexx.com/~dkuhlman/libxml_saxlib-1.0a.tar.gz

    DOM interface:
        http://www.rexx.com/~dkuhlman/libxml_domlib.html
        http://www.rexx.com/~dkuhlman/libxml_domlib-1.0a.tar.gz

    XSL-T:
        http://www.rexx.com/~dkuhlman/libxsltmod.html
        http://www.rexx.com/~dkuhlman/libxsltmod-1.0a.tar.gz

Thanks so much to all those whose work made this possible.  Thanks to
the people who did libxml and libxslt (these modules are 99.9%
their work and 0.1% mine.) Thanks for Distutils, which made it so
easy to package these modules.  And, thanks to the core Python team
for a great and extensible language.

My wrappers are at a pretty low level (i.e. close to the libxml C
code).  That made it a bit easier for me.  But it might also help
with speed and memory use considerations for some uses.

But, it also turns out to be very easy for a Python user of the
wrappers.  With libxml_saxlib, just create an instance of a class that
has methods like startDocument, endDocument, startElement,
endElement, characters, etc, then call parse_file(instance,
fileName) or parse_string(instance, string).  With libxml_domlib,
call parse_file or parse_string to parse the document, then call
getRootElement, getFirstChild, getNextSibling, etc to walk the
tree.  With libxslt, just call a function or two.
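
The handler-object pattern described above can be sketched as follows.
This is only an illustration under stated assumptions: it targets Python 3
and drives the handler with the standard library's xml.sax rather than
libxml_saxlib, because the callback names (startDocument, startElement,
characters, and so on) are the same.  The OutlineHandler class and the
sample document are hypothetical.

```python
import xml.sax


class OutlineHandler(xml.sax.ContentHandler):
    """Hypothetical handler: records an indented outline of element names."""

    def __init__(self):
        super().__init__()
        self.depth = 0
        self.outline = []
        self.text = []

    def startDocument(self):
        self.outline.append("[document]")

    def startElement(self, name, attrs):
        self.depth += 1
        self.outline.append("  " * self.depth + name)

    def endElement(self, name):
        self.depth -= 1

    def characters(self, content):
        # May be called several times for one run of text, so accumulate.
        self.text.append(content)

    def endDocument(self):
        self.outline.append("[end]")


handler = OutlineHandler()
# Stand-in for parse_string(instance, string) from the message above.
xml.sax.parseString(
    b"<plan><step>warm up</step><step>launch</step></plan>", handler
)
print("\n".join(handler.outline))
print("".join(handler.text))
```

With the wrappers from this announcement, the same handler instance would
presumably be handed to parse_file or parse_string instead of
xml.sax.parseString.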

An additional educational part of this work -- In providing access
to the DOM tree, I needed to implement several Python extension
datatypes (as part of the Python extension module libxml_domlib). 
I had never done that before, believing that the Python C
structures involved were too difficult for me to deal with.  With
some help, it turned out to be not as difficult as I thought.  Here
are two suggestions if you need to implement a Python extension
type yourself:

  - Start by copying Objects/xxobject.c in the Python source code
    distribution.  The structure and organization in this file will
    put you far ahead of where you would be if you start from
    scratch and it will save many errors, too.

  - Or, use my extension datatype generator.  You can find it at:

        http://www.rexx.com/~dkuhlman/dtGenerator.py

    For restricted purposes, this will save a lot of copy, paste,
    and rename work.

You may be asking, Why did you implement XML capabilities for
Python, when we already have PyXML/4Suite?  PyXML is super.  And
there is no way that these wrappers for libxml/libxslt can be
considered anywhere near as good as PyXML.  (It's presumptuous for
me to suggest that they are comparable.) However, let me give a
couple of reasons for doing and offering this:

  - Because it's there.  libxml2 and libxslt are available. 

  - Because implementing the Python extension modules and extension
    datatypes was good training for me.

  - Because I believe that having a bit more breadth of coverage of
    something as important as XML is good for Python, even if it is
    not used very much.

  - Because it's easy to use.  Using libxsltmod from Python is
    (almost) as easy as one function call.  It won't give enough
    control for some situations.  But where that control is not
    needed, calling from Python is very easy.

  - Because there may be special situations where this
    implementation is useful. For example, installing it on a new
    machine may be as easy as copying a few shared libraries.  For
    some purposes, that may be a benefit.

  - Because I'm grateful for all that the Python community has
    given me and I'd like to try to give a little back.

If you have suggestions or find problems please let me know.

  - Dave

-- 
Dave Kuhlman
dkuhlman@rexx.com


From Mike.Olson@fourthought.com  Mon Jul 16 00:57:44 2001
From: Mike.Olson@fourthought.com (Mike Olson)
Date: Sun, 15 Jul 2001 17:57:44 -0600
Subject: [XML-SIG] [ANN] 4Suite and 4SuiteServer 0.11.1 beta 3 release
Message-ID: <3B522DF8.E083F256@fourthought.com>


All,

   This should be our last beta release for the 0.11.1 final release. 
Thanks to everyone for the help in finding all of our bugs.  I think we
have fixed most of them; the rest we will be fixing this week, and we hope
to have the final release out at the end of the week.

  As always, if any and all of you can download the beta and give it a
try, it would be greatly appreciated.  You can get the beta releases from the
ftp site at

ftp://ftp.fourthought.com/pub/4Suite

or from the web site at:

http://4suite.org/download.html

Thanks
Mike

-- 
Mike Olson                                Principal Consultant
mike.olson@fourthought.com                +1 303 583 9900 x 102
Fourthought, Inc.                         http://Fourthought.com 
4735 East Walnut St,                      http://4Suite.org
Boulder, CO 80301-2537, USA
XML strategy, XML tools, knowledge management


From Alexandre.Fayolle@logilab.fr  Mon Jul 16 08:19:30 2001
From: Alexandre.Fayolle@logilab.fr (Alexandre Fayolle)
Date: Mon, 16 Jul 2001 09:19:30 +0200 (CEST)
Subject: [XML-SIG] 4Suite 0.11.1 and PyXML 0.6.5
In-Reply-To: <3B522DF8.E083F256@fourthought.com>
Message-ID: <Pine.LNX.4.21.0107160914360.23663-100000@leo.logilab.fr>

On Sun, 15 Jul 2001, Mike Olson wrote:

>    This should be our last beta release for the 0.11.1 final release. 
> Thanks to everyone for the help in finding all of our bugs.  I think we
> have fixed most of them, the rest we will be fixing this week and hope
> to have the final release out at the end of the week.

Great news. 

I've been quite busy these last weeks, and have not managed to follow the
various mailing lists as closely as I would have wanted. Is there a
release of PyXML 0.6.6 planned that would mainly feature the changes in
xml.dom.ext that make PyXML compatible with 4Suite-0.11.1's pDomlette ? 

Yet another (dumb) question : is the new version of XPath (thread and
unicode friendly) part of 4Suite 0.11.1 or PyXML 0.7 ?

Thanks

Alexandre Fayolle
-- 
LOGILAB, Paris (France).
http://www.logilab.com   http://www.logilab.fr  http://www.logilab.org
Narval, the first software agent available as free software (GPL).


From martin@loewis.home.cs.tu-berlin.de  Mon Jul 16 13:57:16 2001
From: martin@loewis.home.cs.tu-berlin.de (Martin v. Loewis)
Date: Mon, 16 Jul 2001 14:57:16 +0200
Subject: [XML-SIG] SAX  with DTD
In-Reply-To: <20010712232303.61912.qmail@web12607.mail.yahoo.com> (message
 from Hung Jung Lu on Thu, 12 Jul 2001 16:23:03 -0700 (PDT))
References: <20010712232303.61912.qmail@web12607.mail.yahoo.com>
Message-ID: <200107161257.f6GCvGv02814@mira.informatik.hu-berlin.de>

> (2) I have a DTD that specifies default attributes
> (via #FIXED) of an XML document. Is there some parser
> (DOM preferred, SAX ok) in Python that can take into
> account the attributes specified in DTD?

I recommend using the functions and classes in
xml.dom.ext.reader.Sax2, with validation turned on.
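
To see what "taking DTD attributes into account" looks like in practice,
here is a minimal sketch.  It does not use PyXML or xmlproc; it assumes
Python 3 and the standard library's expat binding, and it only covers a
DTD given as an internal subset.  The document and the collect_start_tags
helper are made up for illustration.

```python
import xml.parsers.expat

# Hypothetical document: the DTD fixes unit="hours" on every <step>.
DOC = """<?xml version="1.0"?>
<!DOCTYPE plan [
  <!ELEMENT plan (step*)>
  <!ELEMENT step (#PCDATA)>
  <!ATTLIST step unit CDATA #FIXED "hours">
]>
<plan><step>4</step></plan>"""


def collect_start_tags(text):
    """Return a list of (element name, attribute dict) in document order."""
    seen = []
    parser = xml.parsers.expat.ParserCreate()

    def start(name, attrs):
        seen.append((name, dict(attrs)))

    parser.StartElementHandler = start
    parser.Parse(text, True)
    return seen


for name, attrs in collect_start_tags(DOC):
    print(name, attrs)
```

The <step> element carries no attribute in the instance, yet the handler
sees unit="hours" because the parser fills it in from the ATTLIST
declaration.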

Regards,
Martin


From martin@loewis.home.cs.tu-berlin.de  Mon Jul 16 14:14:45 2001
From: martin@loewis.home.cs.tu-berlin.de (Martin v. Loewis)
Date: Mon, 16 Jul 2001 15:14:45 +0200
Subject: [XML-SIG] 4Suite 0.11.1 and PyXML 0.6.5
In-Reply-To: <Pine.LNX.4.21.0107160914360.23663-100000@leo.logilab.fr>
 (message from Alexandre Fayolle on Mon, 16 Jul 2001 09:19:30 +0200
 (CEST))
References: <Pine.LNX.4.21.0107160914360.23663-100000@leo.logilab.fr>
Message-ID: <200107161314.f6GDEj902846@mira.informatik.hu-berlin.de>

> I've been quite busy these last weeks, and have not managed to follow the
> various mailing lists as closely as I would have wanted. Is there a
> release of PyXML 0.6.6 planned that would mainly feature the changes in
> xml.dom.ext that make PyXML compatible with 4Suite-0.11.1's pDomlette ? 

The 0.6.6 branch is open for people to commit into it; I trust that
anybody committing changes will follow a "bug fixes only" strategy
there.

Once there is actually stuff to release, I'd happily produce a release.

> Yet another (dumb) question : is the new version of XPath (thread
> and unicode friendly) part of 4Suite 0.11.1 or PyXML 0.7 ?

The PyXPath in the current PyXML CVS is already thread- and
unicode-aware; it is not based on the recent "unicode friendly" code
from 4Suite (which I understand uses UTF-8 strings).

I have currently no plans to integrate the 4Suite XPath parsers into
PyXML, mainly because of the build complexity. Of course, if anybody
would take the challenge and put the extension modules into
extensions/, that would be fine as well.

Regards,
Martin


From emdpek@chron.com  Mon Jul 16 23:27:22 2001
From: emdpek@chron.com (Philip King)
Date: Mon, 16 Jul 2001 17:27:22 -0500
Subject: [XML-SIG] Parsing DTD
Message-ID: <3B536A4A.9DE3859F@chron.com>

I am looking for (or desiring to build) a DTD Browser tool.  I am
imagining a simple window, initially showing the "root" entity.  Each
entity can be optionally (clickably) expanded, which would reveal its
children entity nodes.  In a separate window, perhaps a listing of a
node's attributes.

(For users of Internet Explorer, a similar utility can be found in the NITF
DTD docs:
http://www.nitf.org/nitf-documentation/nitf.html)

Here is my dilemma: I cannot figure out the hoops one must jump through in
order to have a parser (either Expat, xmlproc, xmllib, etc...) to read
and parse a DTD.  They all seem to choke with a syntax error...

Any ideas?


Philip


From uogbuji@fourthought.com  Tue Jul 17 05:36:41 2001
From: uogbuji@fourthought.com (Uche Ogbuji)
Date: Mon, 16 Jul 2001 22:36:41 -0600 (MDT)
Subject: [4suite] Re: [XML-SIG] 4Suite 0.11.1 and PyXML 0.6.5
In-Reply-To: <200107161314.f6GDEj902846@mira.informatik.hu-berlin.de>
Message-ID: <Pine.LNX.4.33.0107162233340.22242-100000@yen.fourthought.com>

On Mon, 16 Jul 2001, Martin v. Loewis wrote:

> > I've been quite busy these last weeks, and have not managed to follow the
> > various mailing lists as closely as I would have wanted. Is there a
> > release of PyXML 0.6.6 planned that would mainly feature the changes in
> > xml.dom.ext that make PyXML compatible with 4Suite-0.11.1's pDomlette ?
>
> The 0.6.6 branch is open for people to commit into it; I trust that
> anybody committing changes will follow a "bug fixes only" strategy
> there.

I missed this.  I'll be sure to sync all my changes from the tip to this
branch.  I would indeed like to see a PyXML 0.6.6 bug-fix release to go
with the 4Suite 0.11.1 release.

> Once there is actually stuff to release, I'd happily produce a release.
>
> > Yet another (dumb) question : is the new version of XPath (thread
> > and unicode friendly) part of 4Suite 0.11.1 or PyXML 0.7 ?
>
> The PyXPath in the current PyXML CVS is already thread- and
> unicode-aware; it is not based on the recent "unicode friendly" code
> from 4Suite (which I understand uses UTF-8 strings).

I had been hoping Jeremy would open up a discussion about the differences
between your approach and the one he chose.

I think this is an important discussion to have, since you've both done
valuable work on the issue and people will be wanting to know what XPath
implementation to use, and why.

I confess that I don't really know the answers to this myself.


-- 
Uche Ogbuji                               Principal Consultant
uche.ogbuji@fourthought.com               +1 303 583 9900 x 101
Fourthought, Inc.                         http://Fourthought.com
4735 East Walnut St, Ste. C, Boulder, CO 80301-2537, USA
XML strategy, XML tools (http://4Suite.org), knowledge management


From Alexandre.Fayolle@logilab.fr  Tue Jul 17 07:40:53 2001
From: Alexandre.Fayolle@logilab.fr (Alexandre Fayolle)
Date: Tue, 17 Jul 2001 08:40:53 +0200 (CEST)
Subject: [XML-SIG] Parsing DTD
In-Reply-To: <3B536A4A.9DE3859F@chron.com>
Message-ID: <Pine.LNX.4.21.0107170834400.25186-100000@leo.logilab.fr>

On Mon, 16 Jul 2001, Philip King wrote:

> Here is my dilemma: I cannot figure out the hoops one must jump through in
> order to have a parser (either Expat, xmlproc, xmllib, etc...) to read
> and parse a DTD.  They all seem to choke with a syntax error...

You want to use xmlproc's DTD parser. For an example, you may want to
check xmltools (http://www.logilab.org/xmltools/), since xmleditor uses
the DTD parser to get the valid elements. And of course, you should have a
look at the full-blown documentation, which is available on Lars Marius
Garshol's xmlproc page
(http://www.garshol.priv.no/download/software/xmlproc/)

Alexandre Fayolle
-- 
LOGILAB, Paris (France).
http://www.logilab.com   http://www.logilab.fr  http://www.logilab.org
Narval, the first software agent available as free software (GPL).


From dirksen_lau@yahoo.com  Tue Jul 17 08:16:04 2001
From: dirksen_lau@yahoo.com (Dirksen)
Date: Tue, 17 Jul 2001 00:16:04 -0700 (PDT)
Subject: [XML-SIG] How to get SAX to parse not well formed HTML doc?
Message-ID: <20010717071604.11011.qmail@web5105.mail.yahoo.com>

 
Hi,

I need to parse a bunch of HTML documents, yet the parser is too 
strict for this task. It stops at places that are considered correct by 
HTML rules, like unquoted attributes. Can I make the parser more 
relaxed toward HTML documents?

Cheers
Dirksen


From larsga@garshol.priv.no  Tue Jul 17 08:45:23 2001
From: larsga@garshol.priv.no (Lars Marius Garshol)
Date: 17 Jul 2001 09:45:23 +0200
Subject: [XML-SIG] Parsing DTD
In-Reply-To: <3B536A4A.9DE3859F@chron.com>
References: <3B536A4A.9DE3859F@chron.com>
Message-ID: <m3elrguhgs.fsf@lambda.garshol.priv.no>

* Philip King
| 
| Here is my dilemma: I cannot figure out the hoops one must jump through in
| order to have a parser (either Expat, xmlproc, xmllib, etc...) to read
| and parse a DTD.  They all seem to choke with a syntax error...

As Alexandre says you need to use a special DTD parser. The XML
parsers will assume they are given an XML document and freak when they
find that they are chewing something completely different. 

The DTD parser of xmlproc is the only way I know of doing this in
Python. It should work just fine, though.
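
For the DTD browser idea, the kind of information a DTD parser yields can
be sketched like this.  Note the swap: this is not xmlproc's DTD parser
but the declaration handlers of the standard library's expat binding
(Python 3 assumed), and it only works for a DTD simple enough to be
wrapped as the internal subset of a stub document.  DTD_TEXT, child_names
and read_dtd are hypothetical names.

```python
import xml.parsers.expat as expat

# Hypothetical DTD, small enough to embed as an internal subset.
DTD_TEXT = """
<!ELEMENT plan (title, step*)>
<!ELEMENT title (#PCDATA)>
<!ELEMENT step (#PCDATA)>
<!ATTLIST step unit CDATA "hours">
"""


def child_names(model):
    """Flatten an expat content model tuple into a list of child names."""
    kind, _quantifier, name, children = model
    names = [name] if kind == expat.model.XML_CTYPE_NAME else []
    for child in children:
        names.extend(child_names(child))
    return names


def read_dtd(dtd_text, root):
    """Return ({element: [children]}, {element: [attribute names]})."""
    elements, attributes = {}, {}
    parser = expat.ParserCreate()

    def on_element(name, model):
        elements[name] = child_names(model)

    def on_attlist(element, attribute, type_, default, required):
        attributes.setdefault(element, []).append(attribute)

    parser.ElementDeclHandler = on_element
    parser.AttlistDeclHandler = on_attlist
    # Wrap the DTD as the internal subset of a stub document.
    parser.Parse("<!DOCTYPE %s [%s]><%s/>" % (root, dtd_text, root), True)
    return elements, attributes


elements, attributes = read_dtd(DTD_TEXT, "plan")
for name in elements:
    print(name, "children:", elements[name], "attributes:", attributes.get(name, []))
```

A tree widget could expand each element into the names listed for it and
show the attribute list in a second pane.  A full-size external DTD with
parameter entities inside declarations will not survive this wrapping,
which is where a dedicated DTD parser is needed.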

--Lars M.


From python-te@mcwords.com  Tue Jul 17 08:54:42 2001
From: python-te@mcwords.com (Martin C Brown)
Date: Tue, 17 Jul 2001 08:54:42 +0100
Subject: [XML-SIG] How to get SAX to parse not well formed HTML doc?
In-Reply-To: <20010717071604.11011.qmail@web5105.mail.yahoo.com>
Message-ID: <B779ADD2.2BEC5%python-te@mcwords.com>

> I need to parse a bunch of HTML documents, yet the parser is too
> strict for this task. It stops at places that are considered correct by
> HTML rules, like unquoted attributes. Can I make the parser more
> relaxed toward HTML documents?

You might have more luck using the HTML parser, rather than SAX, which is
designed for parsing XML.

The HTML parser is in htmllib and works in much the same way, and it handles
unquoted attributes without any problems.

MC

-- 
Martin 'MC' Brown, mc@mcwords.com        http://www.mcwords.com
Writer, Author, Consultant
'Life is pain, anyone who says differently is selling something'


From Alexandre.Fayolle@logilab.fr  Tue Jul 17 10:02:03 2001
From: Alexandre.Fayolle@logilab.fr (Alexandre Fayolle)
Date: Tue, 17 Jul 2001 11:02:03 +0200 (CEST)
Subject: [XML-SIG] xml.dom.ext.reader.HtmlLib
Message-ID: <Pine.LNX.4.21.0107171026550.25417-100000@leo.logilab.fr>

Hello,

I was hunting for a bug in Narval, and ended up in
xml.dom.ext.reader.HtmlLib. I would like some feedback on this to know
if this is indeed a bug, a documentation issue, or just me daydreaming
that all APIs should do what I'd like them to, instead of what the coder
meant.

When I use xml.dom.ext.reader.Sax2, if I pass an ownerDocument to the
reader when reading the data, I'll get back a DocumentFragment, belonging
to the same document. 

With HtmlLib's reader, this is not the case : the owner document I'm
passing is getting emptied. Cf. line 42-46:
        if doc:
            while doc.firstChild:
                # Empty out the document
                node = doc.removeChild(doc.firstChild)
                ReleaseNode(node)

First (minor) thing is, this supposes I'm using a 4DOM document, since it
uses ReleaseNode, second (important) thing is, I'm much annoyed that the
document should be emptied, since in the case at hand, it already had some
contents, and I was merely passing it in order to be sure that the right
DOM implementation would be used, and to avoid an expensive call to
importNode.
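
For reference, the fallback that passing an owner document is meant to
avoid looks roughly like this.  It is a sketch with the standard library's
minidom under Python 3, not with 4DOM or the PyXML readers, and the two
sample documents are invented: the new content is parsed into its own
document and then copied across with importNode.

```python
from xml.dom import minidom

# Existing document whose content must survive the read.
owner = minidom.parseString("<report><title>Existing</title></report>")

# Parse the new content on its own, as a reader given no owner document would.
parsed = minidom.parseString("<p>new content</p>")

# Copy the parsed tree into a fragment that belongs to the owner document.
fragment = owner.createDocumentFragment()
fragment.appendChild(owner.importNode(parsed.documentElement, True))

print(fragment.firstChild.ownerDocument is owner)
print(owner.documentElement.toxml())
```

The deep copy made by importNode is the cost in question; a reader that
builds its nodes directly from the owner document needs no copy and
leaves the existing children alone.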

As a side note, Sgmlop.HtmlParser uses non NS methods to build its
DOM. Is this what is intended ?

I'll be glad to work on some patches, hopefully in time for PyXML 0.6.6,
once the correct behaviour has been agreed on.

Cheers,

Alexandre Fayolle
-- 
LOGILAB, Paris (France).
http://www.logilab.com   http://www.logilab.fr  http://www.logilab.org
Narval, the first software agent available as free software (GPL).


From larsga@garshol.priv.no  Tue Jul 17 11:22:02 2001
From: larsga@garshol.priv.no (Lars Marius Garshol)
Date: 17 Jul 2001 12:22:02 +0200
Subject: [XML-SIG] xml.dom.ext.reader.HtmlLib
In-Reply-To: <Pine.LNX.4.21.0107171026550.25417-100000@leo.logilab.fr>
References: <Pine.LNX.4.21.0107171026550.25417-100000@leo.logilab.fr>
Message-ID: <m3k817etyt.fsf@lambda.garshol.priv.no>

* Alexandre Fayolle
| 
| With HtmlLib's reader, this is not the case : the owner document I'm
| passing is getting emptied. Cf. line 42-46:
|         if doc:
|             while doc.firstChild:
|                 # Empty out the document
|                 node = doc.removeChild(doc.firstChild)
|                 ReleaseNode(node)
| 
| First (minor) thing is, this supposes I'm using a 4DOM document, since it
| uses ReleaseNode, second (important) thing is, I'm much annoyed that the
| document should be emptied, since in the case at hand, it already had some
| contents, and I was merely passing it in order to be sure that the right
| DOM implementation would be used, and to avoid an expensive call to
| importNode.

Part of the problem here is that we have a separate Reader for HTML
documents. IMHO it would be much preferable to have a SAX driver for
the HTML parser instead. That could then use the SAX Reader, and
behaviour would be consistent. 

In addition, we would get increased flexibility by having a SAX driver
for this parser.
 
| As a side note, Sgmlop.HtmlParser uses non NS methods to build its
| DOM. Is this what is intended ?

Should be, shouldn't it? HTML doesn't have namespaces, only XHTML does.
 
--Lars M.


From Alexandre.Fayolle@logilab.fr  Tue Jul 17 11:53:41 2001
From: Alexandre.Fayolle@logilab.fr (Alexandre Fayolle)
Date: Tue, 17 Jul 2001 12:53:41 +0200 (CEST)
Subject: [4suite] Re: [XML-SIG] xml.dom.ext.reader.HtmlLib
In-Reply-To: <m3k817etyt.fsf@lambda.garshol.priv.no>
Message-ID: <Pine.LNX.4.21.0107171237210.25689-100000@leo.logilab.fr>

On 17 Jul 2001, Lars Marius Garshol wrote:

> In addition, we would get increased flexibility by having a SAX driver
> for this parser.

agreed.

>  
> | As a side note, Sgmlop.HtmlParser uses non NS methods to build its
> | DOM. Is this what is intended ?
> 
> Should be, shouldn't it? HTML doesn't have namespaces, only XHTML does.

Well... yes, and no. This is the old setAttributeNS(EMPTY_NS, name,value)
vs setAttribute(name,value) question. The problem happens when you try to
get the value back and you don't know what API was used to set
it. However, using a Sax driver for this parser should help, since then
the DOM builder would be able to call whatever method is deemed necessary.

Alexandre Fayolle
-- 
LOGILAB, Paris (France).
http://www.logilab.com   http://www.logilab.fr  http://www.logilab.org
Narval, the first software agent available as free software (GPL).


From fdrake@acm.org  Tue Jul 17 12:50:16 2001
From: fdrake@acm.org (Fred L. Drake, Jr.)
Date: Tue, 17 Jul 2001 07:50:16 -0400 (EDT)
Subject: [XML-SIG] How to get SAX to parse not well formed HTML doc?
In-Reply-To: <20010717071604.11011.qmail@web5105.mail.yahoo.com>
References: <20010717071604.11011.qmail@web5105.mail.yahoo.com>
 <B779ADD2.2BEC5%python-te@mcwords.com>
Message-ID: <15188.9848.137663.499928@cj42289-a.reston1.va.home.com>

Dirksen writes:
 > I need to parse a bunch of HTML documents, yet the parser is too 
 > strict for this task. It stops at places that are considered correct by 
 > HTML rules, like unquoted attributes. Can I make the parser more 
 > relaxed toward HTML documents?

Martin C Brown writes:
 > The HTML parser is in htmllib and works in much the same way, and it handles
 > unquoted attributes without any problems.

  Another possibility would be to use the HTMLParser module, which is
new in Python 2.2.  It was originally developed for another project
and is stable and well-tested.  Feel free to extract the module from
the Python CVS repository.
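
A minimal sketch of this approach, assuming Python 3, where the module
lives at html.parser (in Python 2.2 it was the top-level HTMLParser
module).  The TagCollector class and the sample markup are hypothetical;
the point is only that unquoted attribute values and unclosed tags are
accepted.

```python
from html.parser import HTMLParser


class TagCollector(HTMLParser):
    """Hypothetical subclass: remembers every start tag and its attributes."""

    def __init__(self):
        super().__init__()
        self.tags = []

    def handle_starttag(self, tag, attrs):
        # attrs arrives as a list of (name, value) pairs.
        self.tags.append((tag, attrs))


collector = TagCollector()
# Unquoted attribute values and an unclosed <br>, as in real-world HTML.
collector.feed("<a href=index.html>Home</a><br><img src=logo.png width=80>")
collector.close()

for tag, attrs in collector.tags:
    print(tag, attrs)
```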


  -Fred

-- 
Fred L. Drake, Jr.  <fdrake at acm.org>
PythonLabs at Digital Creations


From noreply@sourceforge.net  Tue Jul 17 14:44:44 2001
From: noreply@sourceforge.net (noreply@sourceforge.net)
Date: Tue, 17 Jul 2001 06:44:44 -0700
Subject: [XML-SIG] [ pyxml-Patches-442005 ] pDomletteReader.SaxReader patch
Message-ID: <E15MV9Y-0005h6-00@usw-sf-web2.sourceforge.net>

Patches item #442005, was opened at 2001-07-17 06:44
You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=306473&aid=442005&group_id=6473

Category: 4Suite
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Alexandre Fayolle (afayolle)
Assigned to: Nobody/Anonymous (nobody)
Summary: pDomletteReader.SaxReader patch

Initial Comment:
The attached patch fixes several bugs when using
pDomletteReader.SaxReader. It was generated against
4Suite-0.11.1b3.

Cheers

Alexandre Fayolle

----------------------------------------------------------------------

You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=306473&aid=442005&group_id=6473


From Alexandre.Fayolle@logilab.fr  Tue Jul 17 16:40:30 2001
From: Alexandre.Fayolle@logilab.fr (Alexandre Fayolle)
Date: Tue, 17 Jul 2001 17:40:30 +0200 (CEST)
Subject: [XML-SIG] ANN : xmltools 1.3
Message-ID: <Pine.LNX.4.21.0107171738360.26242-100000@leo.logilab.fr>

I've just made python xmltools 1.3 available from
http://www.logilab.org/xmltools/ 

Python XmlTools is a set of high level tools to help using XML in Python.
It features two pyGTK widgets, XmlTree and XmlEditor, which can
respectively display and edit an XML document in a graphical fashion.

This release should fix some compatibility problems with python 2.x that
were observed in xmltools-1.2.

Cheers.

Alexandre Fayolle
-- 
LOGILAB, Paris (France).
http://www.logilab.com   http://www.logilab.fr  http://www.logilab.org
Narval, the first software agent available as free software (GPL).


From noreply@sourceforge.net  Tue Jul 17 18:24:19 2001
From: noreply@sourceforge.net (noreply@sourceforge.net)
Date: Tue, 17 Jul 2001 10:24:19 -0700
Subject: [XML-SIG] [ pyxml-Bugs-442087 ] parsing an XML string
Message-ID: <E15MYa3-0003sZ-00@usw-sf-web2.sourceforge.net>

Bugs item #442087, was opened at 2001-07-17 10:24
You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=106473&aid=442087&group_id=6473

Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Nobody/Anonymous (nobody)
Assigned to: Nobody/Anonymous (nobody)
Summary: parsing an XML string

Initial Comment:
I'm using PyXML 0.6.5 with Python 2.0; the following code:
    
--- code ---
from xml.dom.ext.reader import Sax2
    
parser = Sax2.Reader(validate=1)
xml_dom_object = parser.fromString(VALID_XML_STRING)
--- code ---
  
returns:
    
--- traceback ---
Traceback (most recent call last):
  File "<stdin>", line 1, in ?
  File "/usr/lib/python2.0/site-packages/_xmlplus/dom/ext/reader/__init__.py", line 63, in fromString
    return self.fromStream(stream, ownerDoc)
  File "/usr/lib/python2.0/site-packages/_xmlplus/dom/ext/reader/Sax2.py", line 309, in fromStream
    self.parser.parse(s)
  File "/usr/lib/python2.0/site-packages/_xmlplus/sax/drivers2/drv_xmlproc.py", line 90, in parse
    parser.read_from(source.getByteStream(), bufsize)
  File "/usr/lib/python2.0/site-packages/_xmlplus/parsers/xmlproc/xmlval.py", line 105, in read_from
    self.parser.read_from(file,bufsize)
  File "/usr/lib/python2.0/site-packages/_xmlplus/parsers/xmlproc/xmlutils.py", line 137, in read_from
    self.feed(buf)
  File "/usr/lib/python2.0/site-packages/_xmlplus/parsers/xmlproc/xmlutils.py", line 185, in feed
    self.do_parse()
  File "/usr/lib/python2.0/site-packages/_xmlplus/parsers/xmlproc/xmlproc.py", line 104, in do_parse
    self.parse_doctype()
  File "/usr/lib/python2.0/site-packages/_xmlplus/parsers/xmlproc/xmlproc.py", line 494, in parse_doctype
    sys_id))
  File "/usr/lib/python2.0/site-packages/_xmlplus/parsers/xmlproc/xmlutils.py", line 667, in join_sysids_general
    if urlparse.urlparse(base)[0]=="":
  File "/usr/lib/python2.0/urlparse.py", line 59, in urlparse
    i = find(url, ':')
  File "/usr/lib/python2.0/string.py", line 172, in find
    return s.find(*args)
AttributeError: 'None' object has no attribute 'find'
--- traceback ---


Using a non validating parser (validate=0) the code
works;
it also works using the fromUri() method of the parser
object.

Obviously the VALID_XML_STRING is a valid XML string.

Thank you.


----------------------------------------------------------------------

You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=106473&aid=442087&group_id=6473


From dieter@handshake.de  Tue Jul 17 22:34:19 2001
From: dieter@handshake.de (Dieter Maurer)
Date: Tue, 17 Jul 2001 23:34:19 +0200 (CEST)
Subject: [XML-SIG] How to get SAX to parse not well formed HTML doc?
In-Reply-To: <20010717071604.11011.qmail@web5105.mail.yahoo.com>
References: <20010717071604.11011.qmail@web5105.mail.yahoo.com>
Message-ID: <15188.44891.27142.683220@lindm.dm>

Dirksen writes:
 > I need to parse a bunch of HTML documents, yet the parser is too 
 > strict for this task. It stops at places where considered correct by 
 > HTML rules, like unquoted attributes. Can I make the parser more 
 > relaxed toward HTML documents?
Maybe, you can use "tidy" (--> www.w3.org) beforehand to clean
up your HTML.


Dieter


From martin@loewis.home.cs.tu-berlin.de  Wed Jul 18 00:02:49 2001
From: martin@loewis.home.cs.tu-berlin.de (Martin v. Loewis)
Date: Wed, 18 Jul 2001 01:02:49 +0200
Subject: [XML-SIG] How to get SAX to parse not well formed HTML doc?
In-Reply-To: <B779ADD2.2BEC5%python-te@mcwords.com> (message from Martin C
 Brown on Tue, 17 Jul 2001 08:54:42 +0100)
References: <B779ADD2.2BEC5%python-te@mcwords.com>
Message-ID: <200107172302.f6HN2nG01729@mira.informatik.hu-berlin.de>

> > I need to parse a bunch of HTML documents, yet the parser is too
> > strict for this task. It stops at places where considered correct by
> > HTML rules, like unquoted attributes. Can I make the parser more
> > relaxed toward HTML documents?
> 
> You might have more luck using the HTML parser, rather than SAX, which is
> designed for parsing XML.
> 
> The HTML parser is in htmllib and works in much the same way, and it handles
> unquoted attributes without any problems.

Alternatively, you can use xml.parsers.sgmlop in the SGML mode.

Regards,
Martin


From martin@loewis.home.cs.tu-berlin.de  Wed Jul 18 00:13:45 2001
From: martin@loewis.home.cs.tu-berlin.de (Martin v. Loewis)
Date: Wed, 18 Jul 2001 01:13:45 +0200
Subject: [XML-SIG] xml.dom.ext.reader.HtmlLib
In-Reply-To: <m3k817etyt.fsf@lambda.garshol.priv.no> (message from Lars Marius
 Garshol on 17 Jul 2001 12:22:02 +0200)
References: <Pine.LNX.4.21.0107171026550.25417-100000@leo.logilab.fr> <m3k817etyt.fsf@lambda.garshol.priv.no>
Message-ID: <200107172313.f6HNDj701738@mira.informatik.hu-berlin.de>

> Part of the problem here is that we have a separate Reader for HTML
> documents. IMHO it would be much preferable to have a SAX driver for
> the HTML parser instead. That could then use the SAX Reader, and
> behaviour would be consistent. 
> 
> In addition, we would get increased flexibility by having a SAX driver
> for this parser.

Sounds like an interesting project for a volunteer. I'd personally
recommend to build this SAX driver on top of sgmlop; the true
challenge is to get the events right that only result from the SGML
DTD for HTML (e.g. missing closing tags, etc).
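
A minimal sketch of what such a driver could look like, under several
assumptions: it sits on top of the standard library's html.parser (the
current home of the HTMLParser module) rather than sgmlop, the class
names HTMLSAXDriver and EventLog are made up for illustration, and it
does nothing about the hard part named above, i.e. events implied by
the DTD such as missing closing tags.

```python
from html.parser import HTMLParser
from xml.sax.handler import ContentHandler
from xml.sax.xmlreader import AttributesImpl


class HTMLSAXDriver(HTMLParser):
    """Forward html.parser events to a SAX ContentHandler (hypothetical name)."""

    def __init__(self, handler):
        super().__init__(convert_charrefs=True)
        self._handler = handler

    def handle_starttag(self, tag, attrs):
        # Valueless attributes such as <option selected> arrive with the
        # value None; repeat the name as the value, as XHTML 1.0 asks.
        values = {name: (name if value is None else value)
                  for name, value in attrs}
        self._handler.startElement(tag, AttributesImpl(values))

    def handle_endtag(self, tag):
        self._handler.endElement(tag)

    def handle_data(self, data):
        self._handler.characters(data)

    def parse_string(self, markup):
        self._handler.startDocument()
        self.feed(markup)
        self.close()
        self._handler.endDocument()


class EventLog(ContentHandler):
    """Record the SAX events received so they can be inspected."""

    def __init__(self):
        super().__init__()
        self.events = []

    def startElement(self, name, attrs):
        self.events.append(("start", name, dict(attrs.items())))

    def endElement(self, name):
        self.events.append(("end", name))

    def characters(self, content):
        self.events.append(("chars", content))


log = EventLog()
HTMLSAXDriver(log).parse_string('<a href=index.html>home</a><br>')
for event in log.events:
    print(event)
```

Any existing ContentHandler (a DOM builder, for instance) could be
passed in place of EventLog, which is the consistency argument made
above. Note that the <br> yields a start event with no matching end
event; a real driver would have to supply it.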

Regards,
Martin


From martin@loewis.home.cs.tu-berlin.de  Wed Jul 18 00:17:14 2001
From: martin@loewis.home.cs.tu-berlin.de (Martin v. Loewis)
Date: Wed, 18 Jul 2001 01:17:14 +0200
Subject: [XML-SIG] How to get SAX to parse not well formed HTML doc?
In-Reply-To: <15188.9848.137663.499928@cj42289-a.reston1.va.home.com>
 (fdrake@acm.org)
References: <20010717071604.11011.qmail@web5105.mail.yahoo.com>
 <B779ADD2.2BEC5%python-te@mcwords.com> <15188.9848.137663.499928@cj42289-a.reston1.va.home.com>
Message-ID: <200107172317.f6HNHEp01770@mira.informatik.hu-berlin.de>

>   Another possibility would be to use the HTMLParser module, which is
> new in Python 2.2.  It was originally developed for another project
> and is stable and well-tested.  Feel free to extract the module from
> the Python CVS repository.

Of course, a "true" HTML parser should get the DTD right,
i.e. generate closing elements where they are missing, expand entities
(to unicode strings), etc.
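
To make the distinction concrete, here is a small sketch using the
standard library's html.parser, the current name of the HTMLParser
module (an assumption about the reader's Python version; the class
name EventCollector is made up). It accepts unquoted attributes and,
with convert_charrefs enabled, expands entities, but it reports only
the tags actually present in the input.

```python
from html.parser import HTMLParser


class EventCollector(HTMLParser):
    """Collect parser events in a list (hypothetical helper)."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.events = []

    def handle_starttag(self, tag, attrs):
        self.events.append(("start", tag, attrs))

    def handle_endtag(self, tag):
        self.events.append(("end", tag))

    def handle_data(self, data):
        if data.strip():
            self.events.append(("data", data))


parser = EventCollector()
# Unquoted attribute, an entity reference, and a <p> that is never closed.
parser.feed('<p class=intro>Hall&ouml;chen<p>second</p>')
parser.close()
for event in parser.events:
    print(event)
```

The first <p> produces a start event and no end event, so generating
the missing closing element is left to the caller.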

Regards,
Martin


From martin@loewis.home.cs.tu-berlin.de  Wed Jul 18 00:00:11 2001
From: martin@loewis.home.cs.tu-berlin.de (Martin v. Loewis)
Date: Wed, 18 Jul 2001 01:00:11 +0200
Subject: [4suite] Re: [XML-SIG] 4Suite 0.11.1 and PyXML 0.6.5
In-Reply-To: <Pine.LNX.4.33.0107162233340.22242-100000@yen.fourthought.com>
 (message from Uche Ogbuji on Mon, 16 Jul 2001 22:36:41 -0600 (MDT))
References: <Pine.LNX.4.33.0107162233340.22242-100000@yen.fourthought.com>
Message-ID: <200107172300.f6HN0BV01728@mira.informatik.hu-berlin.de>

> I missed this.  I'll be sure to sync all my changes from the tip to this
> branch.  I would indeed like to see a PyXML 0.6.6 bug-fix release to go
> with the 4Suite 0.11.1 release.

Ok, then I propose the following procedure:

- Copy everything you want to see in 0.6.6 in the branch (it is the
  "o6maint" branch)
- Once you are done, I'll investigate the remaining changes as to whether
  they contain missing pieces; I'll then try to contact the authors of
  these changes to see whether they should be merged (sometimes it may
  be clear from the check-in messages).
- I'll then give advance warning of a couple of days that 0.6.6 is upcoming.

Regards,
Martin


From douglas@paradise.net.nz  Wed Jul 18 05:52:59 2001
From: douglas@paradise.net.nz (Douglas Bagnall)
Date: Wed, 18 Jul 2001 16:52:59 +1200
Subject: [XML-SIG] How to get SAX to parse not well formed HTML doc?
In-Reply-To: <200107172317.f6HNHEp01770@mira.informatik.hu-berlin.de>
References: <15188.9848.137663.499928@cj42289-a.reston1.va.home.com>	(fdrake@acm.org)
Message-ID: <3B55BEEB.4349.1E410E2@localhost>

--Message-Boundary-5927
Content-type: text/plain; charset=US-ASCII
Content-transfer-encoding: 7BIT
Content-description: Mail message body


Hi there,

I've used the attached script to turn html into xml for minidom, and it 
seems to work fairly well so long as the html doesn't contain text cut 
and pasted from Microsoft Word. 

fix(<filename>) prints out xmlish version of the file.
fixstring(<string>) does the same to a string.
obviously, you'd change this somewhere around line 110.
The output is tested against minidom, so if you get no traceback, it 
will be xml safe. Which is not to say it'll look good.


Another thing I've done is put tohtml() and writehtml() methods in my 
version of minidom. They're the same as toxml & writexml, except they 
test empty elements against a tuple: br, img, link and so forth are 
rendered <br /> (note the space) while other empty tags are written the 
long way - <td></td>, <p></p> etc. It's really simple. Would this be of 
any use to anyone else, or would it just clutter up minidom.py?
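
A rough sketch of the idea as a stand-alone function rather than a
minidom method. This is not the actual tohtml() described above, which
is not shown in this message; the tuple of empty tags mirrors
singlelist in the attached script, and attribute order is sorted only
to keep the output predictable.

```python
from xml.dom import Node, minidom
from xml.sax.saxutils import escape, quoteattr

# Elements written in the short form; same names as singlelist in rehtml.py.
EMPTY_TAGS = ("img", "br", "link", "hr", "input", "area", "meta")


def tohtml(node):
    """Serialise a minidom node, using '<tag />' only for EMPTY_TAGS."""
    if node.nodeType == Node.DOCUMENT_NODE:
        return "".join(tohtml(child) for child in node.childNodes)
    if node.nodeType == Node.TEXT_NODE:
        return escape(node.data)
    if node.nodeType == Node.COMMENT_NODE:
        return "<!--%s-->" % node.data
    if node.nodeType != Node.ELEMENT_NODE:
        return ""  # other node types are ignored in this sketch
    attrs = "".join(
        " %s=%s" % (name, quoteattr(value))
        for name, value in sorted(node.attributes.items())
    )
    if not node.childNodes and node.tagName.lower() in EMPTY_TAGS:
        return "<%s%s />" % (node.tagName, attrs)
    inner = "".join(tohtml(child) for child in node.childNodes)
    return "<%s%s>%s</%s>" % (node.tagName, attrs, inner, node.tagName)


doc = minidom.parseString(
    '<div><br/><td/><img src="a.png"/><p>x &amp; y</p></div>')
print(tohtml(doc))
```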


Douglas


--Message-Boundary-5927
Content-type: text/plain; charset=US-ASCII
Content-transfer-encoding: 7BIT
Content-description: Text from file 'rehtml.py'

#!/usr/bin/python
"""
Excerpt from experimental WCN auto page generating version of Kea editor.

copyright katipo communications ltd  2001
by douglas bagnall <douglas@katipo.co.nz>


fix(<filename>) prints out xmlish version of the file.
fixstring(<string>) does the same to a string.
obviously, you'd change this somewhere around line 110.

The output is tested against minidom, so if you get no traceback,
it will be xml safe. Which is not to say it'll look good.

Html entities are not handled, nor are valueless attributes, like
selected in option (xhtml 1.0 asks for selected="selected").
Misunderstood attributes are omitted without notice.

"""

from xml.dom.minidom import parseString
import sys,re,string,os

singlelist=('img','br','link','hr','input','area',"meta")
wf=re.compile(r'''\w+=('|")[^'"]+\1''')

def attrify(tag):
    attrs=tag.split()
    fattrs=[re.sub("[^\w-]","x",attrs.pop(0).lower())] #deals rudely with non-alphanumeric tags
    while attrs:
        trying=attrs.pop(0)
        if wf.match(trying):
            fattrs.append(trying)
        else:
            trying=re.sub(r'[\'"]',"",trying)        # clear quotes
            trying=trying.replace('=','="',1)+'"'    # and requote (won't get valueless html attributes eg <option selected>)
            if wf.match(trying):
                fattrs.append(trying)
                #tried hard enough, so now forget it, return fixed attrs only
    return " ".join(fattrs)

def ent(s):
    s=s.replace("&","&amp;")
    s=s.replace(">","&gt;")
    return s

def splitter(y):
    z=y.split('>',1)
    tag=z[0]
    if len(z)==1 or not tag or not re.match(r"^[A-Za-z_/!]",tag[0]):
        return ["","","&lt;%s"%y.replace('>','&gt;')]  #so loose >s get entitied and empty tag is returned
    if tag[0]=="/":  #ends
        return ["e", re.sub(r"\s+"," ",tag[1:]), ent(z[1])]
    elif tag[-1]=="/" or tag.split()[0].lower() in singlelist:        #tags without closers
        return ["m", attrify(re.sub('/$','',tag)), ent(z[1])]     #normalise to no " />"
    elif tag[:3]=="!--":
        return ["c",tag,z[1]]
    else: #start
        return ["s", attrify(tag), ent(z[1])]

#joiner *almost* reverses splitter, but whitespace in tags remains reduced, and leading <s included
joinerdict={"":"%s%s", "s":"<%s>%s", "m":"<%s />%s", "e": "</%s>%s", "c": "<%s>%s"}
def joiner(z):
    return joinerdict[z[0]]%(z[1],z[2])

def fixstring(z):   #
    zbits=(z).split('<')    #list of string bits, without '<' eg '<p>foo<br>bar' becomes ['','p>foo','br>bar']
    zlist=map(splitter,zbits)
    zbits=[]
    zstack=[]
    for x in zlist:
        if x[0]=="s":
            stag=x[1].split()[0]
            if stag in ("p","a","form","option","select","td","li") and zstack and stag==zstack[-1]: #whatever else too; zstack may be empty
                zstack.pop()
                zbits.append(["e",stag,""])
            zstack.append(stag)
            zbits.append(x)
        elif x[0]=="e":
            etag=x[1].split()[0]
            tstack=[]
            if zstack:
                lasttag=zstack.pop()
                while zstack and etag != lasttag:
                    tstack.append(lasttag)
                    lasttag=zstack.pop()
                if etag == lasttag:
                    for t in tstack:
                        zbits.append(["e",t,""])  #ie </t>
                    zbits.append(x)
                else:    #couldn't find in zstack, probably closed in previous tstack manoeuvre?
                    zbits.append(["","",x[2]])
                    tstack.reverse()
                    zstack.append(lasttag)
                    zstack+=tstack #pile them back on (starting with latest lasttag, which is unprocessed, otherwise lost)
            else:
                zbits.append(["","",x[2]]) #oh well.
        else:    #single or empty tags or comments
            zbits.append(x)    #carry on with no stacking

    zstack.reverse()
    for a in zstack:  #clear any unclosed tags!
        zbits.append(["e",a,""])

    z=''.join(map(joiner,zbits))[4:]   #first 4 are a &lt;

    #  now test it
    try:
        zdom=parseString("<span>%s</span>"%z)  #for test if valid tags
    except:
        print """<!-- Couldn't Parse ! -->\n%s""" % z
        raise
    #so, it worked!
    print """<!-- Yay!, successfully parsed -->\n%s""" % z


def fix(z):
    try:
        f=open(z,"r")
        fixstring(f.read())
    except:
        print "%s is probably not a file" %z
        raise

--Message-Boundary-5927--


From martin@loewis.home.cs.tu-berlin.de  Wed Jul 18 10:02:33 2001
From: martin@loewis.home.cs.tu-berlin.de (Martin v. Loewis)
Date: Wed, 18 Jul 2001 11:02:33 +0200
Subject: [XML-SIG] How to get SAX to parse not well formed HTML doc?
In-Reply-To: <3B55BEEB.4349.1E410E2@localhost> (douglas@paradise.net.nz)
References: <15188.9848.137663.499928@cj42289-a.reston1.va.home.com>	(fdrake@acm.org) <3B55BEEB.4349.1E410E2@localhost>
Message-ID: <200107180902.f6I92XK01169@mira.informatik.hu-berlin.de>

> I've used the attached script to turn html into xml for minidom, and it 
> seems to work fairly well so long as the html doesn't contain text cut 
> and pasted from Microsoft Word. 

Hi Douglas,

Please note that your approach has many problems. In particular, the
converter does not consider the HTML DTD. E.g. converting

<html>
<head>
<title>Hallo
<body>
</html>

will give you

<!-- Yay!, successfully parsed -->
<html>
<head>
<title>Hallo
<body>
</body></title></head></html>

While this is well-formed XML, it is not well-formed XHTML; it should read

<html>
<head>
<title>Hallo</title>
</head>
<body>
</body></html>

instead (i.e. title and head must close before body opens). Another
thing I noticed is that it messes up external entities, e.g.

<html>
<head>
<title>Hall&ouml;chen</title>
</head>
<body>
</body></html>

is converted to

<html>
<head>
<title>Hall&amp;ouml;chen</title>
</head>
<body>
</body></html>
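
One possible way around the entity problem, sketched with the standard
library only: expand HTML entity references to characters first, then
escape what XML requires. The function name expand_and_escape is made
up, and this is a suggestion for a replacement of the script's ent()
helper, not something the script currently does.

```python
import html
from xml.sax.saxutils import escape


def expand_and_escape(text):
    """Expand HTML entity references, then escape &, < and > for XML."""
    return escape(html.unescape(text))


print(expand_and_escape("Hall&ouml;chen"))
print(expand_and_escape("fish & chips"))
print(expand_and_escape("1 &lt; 2"))
```

The first call yields the character itself instead of "&amp;ouml;",
while a bare ampersand is still escaped and an already escaped "&lt;"
survives the round trip unchanged.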


> Another thing I've done is put tohtml() and writehtml() methods in my 
> version of minidom. They're the same as toxml & writexml, except they 
> test empty elements against a tuple: br, img, link and so forth are 
> rendered <br /> (note the space) while other empty tags are written the 
> long way - <td></td>, <p></p> etc. It's really simple. Would this be of 
> any use to anyone else, or would it just clutter up minidom.py?

I don't think it should go into minidom. Instead, it might be useful
to have such a function as a stand-alone library, which prints
arbitrary XHTML DOM trees. In fact, the best thing may be to extend
the XHTML pretty printer with such a feature.

Regards,
Martin


From larsga@garshol.priv.no  Wed Jul 18 10:12:01 2001
From: larsga@garshol.priv.no (Lars Marius Garshol)
Date: 18 Jul 2001 11:12:01 +0200
Subject: [XML-SIG] xml.dom.ext.reader.HtmlLib
In-Reply-To: <200107172313.f6HNDj701738@mira.informatik.hu-berlin.de>
References: <Pine.LNX.4.21.0107171026550.25417-100000@leo.logilab.fr> <m3k817etyt.fsf@lambda.garshol.priv.no> <200107172313.f6HNDj701738@mira.informatik.hu-berlin.de>
Message-ID: <m3bsmieh3y.fsf@lambda.garshol.priv.no>

* Lars Marius Garshol
|
| Part of the problem here is that we have a separate Reader for HTML
| documents. IMHO it would be much preferable to have a SAX driver for
| the HTML parser instead. That could then use the SAX Reader, and
| behaviour would be consistent. 
| 
| In addition, we would get increased flexibility by having a SAX driver
| for this parser.

* Martin v. Loewis
| 
| Sounds like an interesting project for a volunteer. 

I guess it would be. It's a very small task, really, but good for
learning. I would do it, but I haven't got the time.

| I'd personally recommend to build this SAX driver on top of sgmlop;
| the true challenge is to get the events right that only result from
| the SGML DTD for HTML (e.g. missing closing tags, etc).

So perhaps it would be better to integrate Tidy as a Python module?
It's a lot more work, but it would also be a lot more useful. If that
were done I think the module should have SAX as its interface. 

I think using the native expat interface was a mistake that has caused
us all kinds of problems. Instead of having just one interface for
parsers we ended up with several, because many people didn't want to
take the (slight) performance hit of using SAX.

So a SAX driver for expat written in C would be another good thing.

--Lars M.


From Alexandre.Fayolle@logilab.fr  Wed Jul 18 10:24:32 2001
From: Alexandre.Fayolle@logilab.fr (Alexandre Fayolle)
Date: Wed, 18 Jul 2001 11:24:32 +0200 (CEST)
Subject: [4suite] Re: [XML-SIG] xml.dom.ext.reader.HtmlLib
In-Reply-To: <200107172313.f6HNDj701738@mira.informatik.hu-berlin.de>
Message-ID: <Pine.LNX.4.21.0107181123300.1292-100000@leo.logilab.fr>

On Wed, 18 Jul 2001, Martin v. Loewis wrote:

> > Part of the problem here is that we have a separate Reader for HTML
> > documents. IMHO it would be much preferable to have a SAX driver for
> > the HTML parser instead. That could then use the SAX Reader, and
> > behaviour would be consistent. 
> > 
> > In addition, we would get increased flexibility by having a SAX driver
> > for this parser.
> 
> Sounds like an interesting project for a volunteer. 

I'll give it a go. 

Alexandre Fayolle
-- 
LOGILAB, Paris (France).
http://www.logilab.com   http://www.logilab.fr  http://www.logilab.org
Narval, the first software agent available as free software (GPL).


From shivaji_br@yahoo.com  Wed Jul 18 10:30:01 2001
From: shivaji_br@yahoo.com (shivaji raju)
Date: Wed, 18 Jul 2001 02:30:01 -0700 (PDT)
Subject: [4suite] Re: [XML-SIG] xml.dom.ext.reader.HtmlLib
In-Reply-To: <Pine.LNX.4.21.0107181123300.1292-100000@leo.logilab.fr>
Message-ID: <20010718093001.38243.qmail@web9204.mail.yahoo.com>

Hello sir,

I have got a DOM tree  Generated from a xml file.....
I wanted to walk through the tree and use the 
data from that tree.I wanted to  know how to 
do this....

with regards
shivaji


--- Alexandre Fayolle <Alexandre.Fayolle@logilab.fr>
wrote:
> On Wed, 18 Jul 2001, Martin v. Loewis wrote:
> 
> > > Part of the problem here is that we have a
> separate Reader for HTML
> > > documents. IMHO it would be much preferable to
> have a SAX driver for
> > > the HTML parser instead. That could then use the
> SAX Reader, and
> > > behaviour would be consistent. 
> > > 
> > > In addition, we would get increased flexibility
> by having a SAX driver
> > > for this parser.
> > 
> > Sounds like an interesting project for a
> volunteer. 
> 
> I'll give it a go. 
> 
> Alexandre Fayolle
> -- 
> LOGILAB, Paris (France).
> http://www.logilab.com   http://www.logilab.fr 
> http://www.logilab.org
> Narval, the first software agent available as free
> software (GPL).
> 
> 
> _______________________________________________
> XML-SIG maillist  -  XML-SIG@python.org
> http://mail.python.org/mailman/listinfo/xml-sig


__________________________________________________
Do You Yahoo!?
Get personalized email addresses from Yahoo! Mail
http://personal.mail.yahoo.com/


From Alexandre.Fayolle@logilab.fr  Wed Jul 18 10:58:46 2001
From: Alexandre.Fayolle@logilab.fr (Alexandre Fayolle)
Date: Wed, 18 Jul 2001 11:58:46 +0200 (CEST)
Subject: [4suite] Re: [XML-SIG] xml.dom.ext.reader.HtmlLib
In-Reply-To: <20010718093001.38243.qmail@web9204.mail.yahoo.com>
Message-ID: <Pine.LNX.4.21.0107181156250.3420-100000@leo.logilab.fr>

On Wed, 18 Jul 2001, shivaji raju wrote:

> Hello sir,
> 
> I have got a DOM tree  Generated from a xml file.....
> I wanted to walk through the tree and use the 
> data from that tree.I wanted to  know how to 
> do this....

<I bounced your question on the mailing list>

It depends on what you are wanting to do with it. 4DOM provides the DOM
Traversal API, which could be what you're needing. You can find
information on this API on
http://www.w3.org/TR/2000/REC-DOM-Level-2-Traversal-Range-20001113/
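
If a full TreeWalker is more than is needed, a plain recursive walk
over childNodes is often enough. The sketch below uses the standard
library's xml.dom.minidom rather than 4DOM's Traversal API (an
assumption made so that the example runs without PyXML); the function
name walk and the sample document are made up for illustration.

```python
from xml.dom import Node, minidom


def walk(node, depth=0):
    """Yield (depth, node) for node and its descendants in document order."""
    yield depth, node
    for child in node.childNodes:
        yield from walk(child, depth + 1)


doc = minidom.parseString(
    "<company><name>Logilab</name>"
    "<web>http://www.logilab.com/</web></company>")

for depth, node in walk(doc.documentElement):
    if node.nodeType == Node.ELEMENT_NODE:
        print("  " * depth + node.tagName)
    elif node.nodeType == Node.TEXT_NODE and node.data.strip():
        print("  " * depth + repr(node.data))
```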

Alexandre Fayolle
-- 
LOGILAB, Paris (France).
http://www.logilab.com   http://www.logilab.fr  http://www.logilab.org
Narval, the first software agent available as free software (GPL).


From shivaji_br@yahoo.com  Wed Jul 18 11:04:25 2001
From: shivaji_br@yahoo.com (shivaji raju)
Date: Wed, 18 Jul 2001 03:04:25 -0700 (PDT)
Subject: [4suite] Re: [XML-SIG] xml.dom.ext.reader.HtmlLib
In-Reply-To: <Pine.LNX.4.21.0107181156250.3420-100000@leo.logilab.fr>
Message-ID: <20010718100425.38884.qmail@web9203.mail.yahoo.com>

hello sir

i forgot to mention that i am using python's
 Pyxml package.....so plz give me a option
within this package....


My previous  message was....

 Hello sir,
> > 
> > I have got a DOM tree  Generated from a xml
> file.....
> > I wanted to walk through the tree and use the 
> > data from that tree.I wanted to  know how to 
> > do this....




From Alexandre.Fayolle@logilab.fr  Wed Jul 18 11:11:12 2001
From: Alexandre.Fayolle@logilab.fr (Alexandre Fayolle)
Date: Wed, 18 Jul 2001 12:11:12 +0200 (CEST)
Subject: [4suite] Re: [XML-SIG] xml.dom.ext.reader.HtmlLib
In-Reply-To: <20010718100425.38884.qmail@web9203.mail.yahoo.com>
Message-ID: <Pine.LNX.4.21.0107181210230.3420-100000@leo.logilab.fr>

On Wed, 18 Jul 2001, shivaji raju wrote:

> hello sir
> 
> i forgot to mention that i am using python's
>  Pyxml package.....so plz give me a option
> within this package....

Please send your questions to xml-sig@python.org.

The solution I mention using 4DOM's Traversal API is part of pyxml.


Alexandre Fayolle
-- 
LOGILAB, Paris (France).
http://www.logilab.com   http://www.logilab.fr  http://www.logilab.org
Narval, the first software agent available as free software (GPL).


From Nicolas.Chauvat@logilab.fr  Wed Jul 18 12:00:24 2001
From: Nicolas.Chauvat@logilab.fr (Nicolas Chauvat)
Date: Wed, 18 Jul 2001 13:00:24 +0200 (CEST)
Subject: [XML-SIG] xsl:nclude-and-transform
Message-ID: <Pine.LNX.4.21.0107181238470.31099-100000@aries.logilab.fr>

Hi Lists,

Here is what I'd like to do :

company.xml (my own DTD)

        <company>
           <name>Logilab</name>
           <address>10 rue Louis Vicat</address>
           <web>http://www.logilab.com/</web>
        </company>

document.xml (DocBook DTD)

        <article>
          <author>
            <surname>Chauvat</surname>
            <affiliation>
              <!-- insert here the result of company.xml
                   transformed with company2docbook.xsl
                   should be

                   <orgname>Logilab</orgname>
                   <address>
                     <street>10 rue Louis Vicat</street>
                   </address>
              -->
            </affiliation>
          </author>

          <sect>
            <!-- the rest of my document -->
          </sect>
        </article>

Here are the ideas I had so far :

     * forget about my own DTD and write company.xml using docbook, then
       include an entity. It already works like this, but I need to
       make it more flexible...

     * generate the document in two steps : replace the above comment with
       a specific <include file="company.xml"/> tag that gets processed by
       a first XSLT and replaced with the proper docbook elements, then
       feed the result to the docbook generator. The problem is that when
       you generate a document, you have to know which stylesheet to apply
       for the first processing step.

     * generate the document in a single step : modify the above tag to
       specify the stylesheet :
         <include file="company.xml" transform="company2docbook.xsl"/>
       and use something like an XSL extension function to replace that
       include tag with the corresponding elements. That would ask for
       docbook XSL customization, but we already do that.

     * use processing instructions : I don't know much about PI. Could
       I use a PI to do the two-step processing described above by
       including a <?PI use-stylesheet='first-step.xsl'?> in the document
       that I feed to the docbook processing step ?

     * ...

As you understood, I'm missing a <xsl:include-after-transform/> tag...

I look forward to your ideas and comments.

-- 
Nicolas Chauvat

http://www.logilab.com - "Mais où est donc Ornicar ?" - LOGILAB, Paris (France)


From fdrake@acm.org  Wed Jul 18 15:28:42 2001
From: fdrake@acm.org (Fred L. Drake, Jr.)
Date: Wed, 18 Jul 2001 10:28:42 -0400 (EDT)
Subject: [XML-SIG] How to get SAX to parse not well formed HTML doc?
In-Reply-To: <200107172317.f6HNHEp01770@mira.informatik.hu-berlin.de>
References: <20010717071604.11011.qmail@web5105.mail.yahoo.com>
 <B779ADD2.2BEC5%python-te@mcwords.com>
 <15188.9848.137663.499928@cj42289-a.reston1.va.home.com>
 <200107172317.f6HNHEp01770@mira.informatik.hu-berlin.de>
Message-ID: <15189.40218.983911.540691@cj42289-a.reston1.va.home.com>

Martin v. Loewis writes:
 > Of course, a "true" HTML parser should get the DTD right,
 > i.e. generate closing elements where they are missing, expand entities
 > (to unicode strings), etc.

  A "true" HTML parser would do a lot better than the
HTMLParser.HTMLParser class; it exhibits the expectation of the
project that it was created for -- to allow editing the file without
adding new lexical tokens in the output as a side effect of the
parse.  There are certainly other ways to achieve that goal, but this
made the most sense for the original application.
  It should be fairly easy to add a smarter parser as a subclass; this
should arguably be added to the current module.
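
A sketch of what such a subclass might look like, written against the
standard library's html.parser (the current name of the module). The
class name ClosingParser and the tiny CLOSED_BY table are assumptions
standing in for the real HTML DTD; attributes are dropped to keep the
example short, and only the element on top of the stack is considered
for implicit closing.

```python
from html.parser import HTMLParser
from xml.sax.saxutils import escape

# Hypothetical, deliberately incomplete stand-in for the DTD: opening the
# key element implicitly closes a listed element that is still open.
CLOSED_BY = {"p": {"p"}, "li": {"li"}, "td": {"td"}, "tr": {"tr", "td"}}
EMPTY = {"br", "hr", "img", "input", "link", "meta", "area"}


class ClosingParser(HTMLParser):
    """Emit well-formed markup by supplying missing end tags."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.stack = []  # names of the elements currently open
        self.out = []    # pieces of the output document

    def _close_top(self):
        self.out.append("</%s>" % self.stack.pop())

    def handle_starttag(self, tag, attrs):
        while self.stack and self.stack[-1] in CLOSED_BY.get(tag, ()):
            self._close_top()
        if tag in EMPTY:
            self.out.append("<%s />" % tag)
        else:
            self.out.append("<%s>" % tag)
            self.stack.append(tag)

    def handle_endtag(self, tag):
        if tag not in self.stack:
            return  # stray end tag: drop it
        while self.stack[-1] != tag:
            self._close_top()
        self._close_top()

    def handle_data(self, data):
        self.out.append(escape(data))

    def close(self):
        super().close()      # flush any buffered text first
        while self.stack:    # then close whatever is still open
            self._close_top()

    def result(self):
        return "".join(self.out)


parser = ClosingParser()
parser.feed("<ul><li>one<li>two</ul><p>a &amp; b<br><p>c")
parser.close()
print(parser.result())
```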


  -Fred

-- 
Fred L. Drake, Jr.  <fdrake at acm.org>
PythonLabs at Digital Creations


From Mike.Olson@fourthought.com  Wed Jul 18 15:48:00 2001
From: Mike.Olson@fourthought.com (Mike Olson)
Date: Wed, 18 Jul 2001 08:48:00 -0600
Subject: [XML-SIG] xsl:nclude-and-transform
References: <Pine.LNX.4.21.0107181238470.31099-100000@aries.logilab.fr>
Message-ID: <3B55A1A0.F28EC3BA@fourthought.com>

Nicolas Chauvat wrote:
>
> Hi Lists,
>
> Here is what I'd like to do :
>
> company.xml (my own DTD)
>
>         <company>
>            <name>Logilab</name>
>            <address>10 rue Louis Vicat</address>
>            <web>http://www.logilab.com/</web>
>         </company>
>
> document.xml (DocBook DTD)
>
>         <article>
>           <author>
>             <surname>Chauvat</surname>
>             <affiliation>
>               <!-- insert here the result of company.xml
>                    transformed with company2docbook.xsl
>                    should be
>
>                    <orgname>Logilab</orgname>
>                    <address>
>                      <street>10 rue Louis Vicat</street>
>                    </address>
>               -->
>             </affiliation>
>           </author>
>
>          <sect>
>            <!-- the rest of my document -->
>          </sect>
>        </article>


Why not just use xinclude?

 <affiliation>
  <xi:include href='company.xml'/>
 </affiliation>


Then extend the stylesheet that you would use to process the standard
docbook with templates like:
<xsl:include href='docbook.xsl'/>
<xsl:template match='doc:affiliation/logi:company'>
  ...
</xsl:template>
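
For readers who want to try the include-then-transform idea without an
XSLT processor, here is a sketch in Python using the standard
library's xml.etree.ElementInclude. The in-memory loader, the function
name company_to_docbook, and the un-namespaced element names are all
assumptions for illustration; the Python function merely plays the
role of the template above.

```python
import xml.etree.ElementTree as ET
from xml.etree import ElementInclude

COMPANY = (
    "<company><name>Logilab</name>"
    "<address>10 rue Louis Vicat</address>"
    "<web>http://www.logilab.com/</web></company>"
)
DOCUMENT = """<article xmlns:xi="http://www.w3.org/2001/XInclude">
  <author>
    <surname>Chauvat</surname>
    <affiliation><xi:include href="company.xml"/></affiliation>
  </author>
</article>"""


def loader(href, parse, encoding=None):
    """Serve company.xml from memory instead of the file system."""
    if href == "company.xml" and parse == "xml":
        return ET.fromstring(COMPANY)
    raise OSError("cannot load %r" % href)


def company_to_docbook(affiliation):
    """Replace an included <company> child with DocBook elements."""
    company = affiliation.find("company")
    if company is None:
        return
    index = list(affiliation).index(company)
    affiliation.remove(company)
    orgname = ET.Element("orgname")
    orgname.text = company.findtext("name")
    address = ET.Element("address")
    ET.SubElement(address, "street").text = company.findtext("address")
    address.tail = company.tail
    affiliation.insert(index, orgname)
    affiliation.insert(index + 1, address)


article = ET.fromstring(DOCUMENT)
ElementInclude.include(article, loader=loader)   # step 1: resolve xi:include
for aff in list(article.iter("affiliation")):    # step 2: transform in place
    company_to_docbook(aff)
print(ET.tostring(article, encoding="unicode"))
```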

Mike

>

> Here are the ideas I had so far :
>
>      * forget about my own DTD and write company.xml using docbook, then
>        include an entity. It already works like this, but I need to
>        make it more flexible...
>
>      * generate the document in two steps : replace the above comment with
>        a specific <include file="company.xml"/> tag that gets processed by
>        a first XSLT and replaced with the proper docbook elements, then
>        feed the result to the docbook generator. The problem is that when
>        you generate a document, you have to know which stylesheet to apply
>        for the first processing step.
>
>      * generate the document in a single step : modify the above tag to
>        specify the stylesheet :
>          <include file="company.xml" transform="company2docbook.xsl"/>
>        and use something like an XSL extension function to replace that
>        include tag with the corresponding elements. That would ask for
>        docbook XSL customization, but we already do that.
>
>      * use processing instructions : I don't know much about PI. Could
>        I use a PI to do the two-step processing described above by
>        including a <?PI use-stylesheet='first-step.xsl'?> in the document
>        that I feed to the docbook processing step ?
>
>      * ...
>
> As you understood, I'm missing a <xsl:include-after-transform/> tag...
>
> I look forward to your ideas and comments.
>
> --
> Nicolas Chauvat
>
> http://www.logilab.com - "Mais où est donc Ornicar ?" - LOGILAB, Paris (France)
>
> _______________________________________________
> XML-SIG maillist  -  XML-SIG@python.org
> http://mail.python.org/mailman/listinfo/xml-sig

-- 
Mike Olson                                Principal Consultant
mike.olson@fourthought.com                +1 303 583 9900 x 102
Fourthought, Inc.                         http://Fourthought.com
4735 East Walnut St,                      http://4Suite.org
Boulder, CO 80301-2537, USA
XML strategy, XML tools, knowledge management


From uche.ogbuji@fourthought.com  Wed Jul 18 16:25:12 2001
From: uche.ogbuji@fourthought.com (Uche Ogbuji)
Date: Wed, 18 Jul 2001 09:25:12 -0600
Subject: [4suite] Re: [XML-SIG] 4Suite 0.11.1 and PyXML 0.6.5
In-Reply-To: Message from Uche Ogbuji <uogbuji@fourthought.com>
 of "Mon, 16 Jul 2001 22:36:41 MDT." <Pine.LNX.4.33.0107162233340.22242-100000@yen.fourthought.com>
Message-ID: <200107181525.f6IFPCt01813@localhost.local>

> On Mon, 16 Jul 2001, Martin v. Loewis wrote:
> 
> > > I've been quite busy these last weeks, and have not managed to follow the
> > > various mailing lists as closely as I would have wanted. Is there a
> > > release of PyXML 0.6.6 planned that would mainly feature the changes in
> > > xml.dom.ext that make PyXML compatible with 4Suite-0.11.1's pDomlette ?
> >
> > The 0.6.6 branch is open for people to commit into it; I trust that
> > anybody committing changes will follow a "bug fixes only" strategy
> > there.
> 
> I missed this.  I'll be sure to sync all my changes from the tip to this
> branch.  I would indeed like to see a PyXML 0.6.6 bug-fix release to go
> with the 4Suite 0.11.1 release.

This is done and checked in.  I have some more fixes to make on this branch, 
and then I'd like to enter a testing and release prep phase late this 
week/early next.  Is this feasible for others who will need to help?  Martin?

Thanks, all.


-- 
Uche Ogbuji                               Principal Consultant
uche.ogbuji@fourthought.com               +1 303 583 9900 x 101
Fourthought, Inc.                         http://Fourthought.com 
4735 East Walnut St, Boulder, CO 80301-2537, USA
XML strategy, XML tools (http://4Suite.org), knowledge management


From uche.ogbuji@fourthought.com  Wed Jul 18 16:29:55 2001
From: uche.ogbuji@fourthought.com (Uche Ogbuji)
Date: Wed, 18 Jul 2001 09:29:55 -0600
Subject: [XML-SIG] xml.dom.ext.reader.HtmlLib
In-Reply-To: Message from Alexandre Fayolle <Alexandre.Fayolle@logilab.fr>
 of "Tue, 17 Jul 2001 11:02:03 +0200." <Pine.LNX.4.21.0107171026550.25417-100000@leo.logilab.fr>
Message-ID: <200107181529.f6IFTtS01825@localhost.local>

> Hello,
> 
> I was hunting for a bug in Narval, and ended up in
> xml.dom.ext.reader.HtmlLib. I would like some feedback on this to know
> if this is indeed a bug, a documentation issue, or just me daydreaming
> that all APIs should do what I'd like them to, instead of what the coder
> meant.
> 
> When I use xml.dom.ext.reader.Sax2, if I pass an ownerDocument to the
> reader when reading the data, I'll get back a DocumentFragment, belonging
> to the same document. 
> 
> With HtmlLib's reader, this is not the case : the owner document I'm
> passing is getting emptied. Cf. line 42-46:
>         if doc:
>             while doc.firstChild:
>                 # Empty out the document
>                 node = doc.removeChild(doc.firstChild)
>                 ReleaseNode(node)
> 
> First (minor) thing is, this supposes I'm using a 4DOM document, since it
> uses ReleaseNode, second (important) thing is, I'm much annoyed that the
> document should be emptied, since in the case at hand, it already had some
> contents, and I was merely passing it in order to be sure that the right
> DOM implementation would be used, and to avoid an expensive call to
> importNode.

I wasn't aware of this, and I agree it's a nasty bug.  Please do prep a patch 
if you can.  Just be sure to check it in to the o6maint branch, or put it on 
SF for me to do so (yes, we do intend to work down the SF docket before final 
release).
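
For reference while the patch is being written, the behaviour asked
for (owner document left untouched, result handed back as a
DocumentFragment owned by it) can be sketched as follows. This uses
the standard library's minidom and an explicit importNode, i.e. the
slow path the report wanted to avoid, and the function name
read_fragment is made up; it is not the HtmlLib reader's code.

```python
from xml.dom import minidom


def read_fragment(markup, owner_doc):
    """Parse markup into a DocumentFragment owned by owner_doc.

    The existing content of owner_doc is left untouched.
    """
    parsed = minidom.parseString("<wrapper>%s</wrapper>" % markup)
    fragment = owner_doc.createDocumentFragment()
    for child in parsed.documentElement.childNodes:
        # importNode makes a deep copy whose ownerDocument is owner_doc
        fragment.appendChild(owner_doc.importNode(child, True))
    parsed.unlink()
    return fragment


owner = minidom.parseString("<root><keep/></root>")
fragment = read_fragment("<p>hi</p><br/>", owner)
print(owner.documentElement.toxml())
print([node.tagName for node in fragment.childNodes])
```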

> As a side note, Sgmlop.HtmlParser uses non NS methods to build its
> DOM. Is this what is intended ?

I think so.  The main danger is using import to merge HTML and XML+NS DOMs, 
but I think this is a pretty sticky case anyway.  Since namespaces don't even 
really have a meaning in HTML, I think the current approach is the right one.

> I'll be glad to work on some patches, hopefully in time for PyXML 0.6.6,
> once the correct behaviour has been agreed on.

I'd love to see a patch on the ownerDoc misbehavior.


-- 
Uche Ogbuji                               Principal Consultant
uche.ogbuji@fourthought.com               +1 303 583 9900 x 101
Fourthought, Inc.                         http://Fourthought.com 
4735 East Walnut St, Boulder, CO 80301-2537, USA
XML strategy, XML tools (http://4Suite.org), knowledge management


From uche.ogbuji@fourthought.com  Wed Jul 18 16:32:20 2001
From: uche.ogbuji@fourthought.com (Uche Ogbuji)
Date: Wed, 18 Jul 2001 09:32:20 -0600
Subject: [4suite] Re: [XML-SIG] xml.dom.ext.reader.HtmlLib
In-Reply-To: Message from Alexandre Fayolle <Alexandre.Fayolle@logilab.fr>
 of "Tue, 17 Jul 2001 12:53:41 +0200." <Pine.LNX.4.21.0107171237210.25689-100000@leo.logilab.fr>
Message-ID: <200107181532.f6IFWKr01839@localhost.local>

> On 17 Jul 2001, Lars Marius Garshol wrote:

> > | As a side note, Sgmlop.HtmlParser uses non NS methods to build its
> > | DOM. Is this what is intended ?
> > 
> > Should be, shouldn't it? HTML doesn't have namespaces, only XHTML does.
> 
> Well... yes, and no. This is the old setAttributeNS(EMPTY_NS, name,value)
> vs setAttribute(name,value) question. The problem happens when you try to
> get the value back and you don't know what API was used to set
> it. However, using a Sax driver for this parser should help, since then
> the DOM builder would be able to call whatever method is deemed necessary.

Sigh.  I've resisted doing this (the DOM spec itself gives us excuse not to 
bother), but perhaps it's time to bite the performance bullet and make the NS 
and non-NS APIs smarter about each other.


-- 
Uche Ogbuji                               Principal Consultant
uche.ogbuji@fourthought.com               +1 303 583 9900 x 101
Fourthought, Inc.                         http://Fourthought.com 
4735 East Walnut St, Boulder, CO 80301-2537, USA
XML strategy, XML tools (http://4Suite.org), knowledge management


From uche.ogbuji@fourthought.com  Wed Jul 18 16:35:23 2001
From: uche.ogbuji@fourthought.com (Uche Ogbuji)
Date: Wed, 18 Jul 2001 09:35:23 -0600
Subject: [4suite] Re: [XML-SIG] 4Suite 0.11.1 and PyXML 0.6.5
In-Reply-To: Message from "Martin v. Loewis" <martin@loewis.home.cs.tu-berlin.de>
 of "Wed, 18 Jul 2001 01:00:11 +0200." <200107172300.f6HN0BV01728@mira.informatik.hu-berlin.de>
Message-ID: <200107181535.f6IFZOO01861@localhost.local>

> > I missed this.  I'll be sure to sync all my changes from the tip to this
> > branch.  I would indeed like to see a PyXML 0.6.6 bug-fix release to go
> > with the 4Suite 0.11.1 release.
> 
> Ok, then I propose the following procedure:
> 
> - Copy everything you want to see in 0.6.6 in the branch (it is the
>   "o6maint" branch)

Done.

> - Once you are done, I'll investigate the remaining changes as to whether
>   they contain missing pieces; I'll then try to contact the authors of
>   these changes to see whether they should be merged (sometimes it may
>   be clear from the check-in messages).

Please hold off on this, since I have some new changes to make as well.  
Probably by Friday I'll be done.

> - I'll then give advance warning of a couple of days that 0.6.6 is upcoming.

Sounds good.  Thanks.


-- 
Uche Ogbuji                               Principal Consultant
uche.ogbuji@fourthought.com               +1 303 583 9900 x 101
Fourthought, Inc.                         http://Fourthought.com 
4735 East Walnut St, Boulder, CO 80301-2537, USA
XML strategy, XML tools (http://4Suite.org), knowledge management


From uche.ogbuji@fourthought.com  Wed Jul 18 16:40:59 2001
From: uche.ogbuji@fourthought.com (Uche Ogbuji)
Date: Wed, 18 Jul 2001 09:40:59 -0600
Subject: [XML-SIG] xsl:nclude-and-transform
In-Reply-To: Message from Mike Olson <Mike.Olson@fourthought.com>
 of "Wed, 18 Jul 2001 08:48:00 MDT." <3B55A1A0.F28EC3BA@fourthought.com>
Message-ID: <200107181541.f6IFf0q01878@localhost.local>

> Nicolas Chauvat wrote:
> >
> > Hi Lists,
> >
> > Here is what I'd like to do :
> >
> > company.xml (my own DTD)
> >
> >         <company>
> >            <name>Logilab</name>
> >            <address>10 rue Louis Vicat</address>
> >            <web>http://www.logilab.com/</web>
> >         </company>
> >
> > document.xml (DocBook DTD)
> >
> >         <article>
> >           <author>
> >             <surname>Chauvat</surname>
> >             <affiliation>
> >               <!-- insert here the result of company.xml
> >                    transformed with company2docbook.xsl
> >                    should be
> >
> >                    <orgname>Logilab</orgname>
> >                    <address>
> >                      <street>10 rue Louis Vicat</street>
> >                    </address>
> >               -->
> >             </affiliation>
> >           </author>
> >
> >          <sect>
> >            <!-- the rest of my document -->
> >          </sect>
> >        </article>
>
> Why not just use xinclude?
>
>  <affiliation>
>   <xi:include href='company.xml'/>
>  </affiliation>
>
> Then extend the stylesheet that you would use to process the standard
> docbook with templates like:
> <xsl:include href='docbook.xsl'/>
> <xsl:template match='doc:affiliation/logi:company'>
>   ...
> </xsl:template>

Unfortunately, this kills most of the flexibility Nico wants.  For one thing,
I think the intention is to have the inclusion transparent to the stylesheet.

I have several solutions, all in a separate message.


-- 
Uche Ogbuji                               Principal Consultant
uche.ogbuji@fourthought.com               +1 303 583 9900 x 101
Fourthought, Inc.                         http://Fourthought.com
4735 East Walnut St, Boulder, CO 80301-2537, USA
XML strategy, XML tools (http://4Suite.org), knowledge management


From uche.ogbuji@fourthought.com  Wed Jul 18 17:25:40 2001
From: uche.ogbuji@fourthought.com (Uche Ogbuji)
Date: Wed, 18 Jul 2001 10:25:40 -0600
Subject: [XML-SIG] xsl:nclude-and-transform
In-Reply-To: Message from Nicolas Chauvat <Nicolas.Chauvat@logilab.fr>
 of "Wed, 18 Jul 2001 13:00:24 +0200." <Pine.LNX.4.21.0107181238470.31099-100000@aries.logilab.fr>
Message-ID: <200107181625.f6IGPeV02008@localhost.local>

> Hi Lists,
>
> Here is what I'd like to do :
>
> company.xml (my own DTD)
>
>         <company>
>            <name>Logilab</name>
>            <address>10 rue Louis Vicat</address>
>            <web>http://www.logilab.com/</web>
>         </company>
>
> document.xml (DocBook DTD)
>
>         <article>
>           <author>
>             <surname>Chauvat</surname>
>             <affiliation>
>               <!-- insert here the result of company.xml
>                    transformed with company2docbook.xsl
>                    should be
>
>                    <orgname>Logilab</orgname>
>                    <address>
>                      <street>10 rue Louis Vicat</street>
>                    </address>
>               -->
>             </affiliation>
>           </author>
>
>          <sect>
>            <!-- the rest of my document -->
>          </sect>
>        </article>
>
> Here are the ideas I had so far :
>
>      * forget about my own DTD and write company.xml using docbook, then
>        include an entity. It already works like this, but I need to
>        make it more flexible...
>
>      * generate the document in two steps : replace the above comment with
>        a specific <include file="company.xml"/> tag that gets processed by
>        a first XSLT and replaced with the proper docbook elements, then
>        feed the result to the docbook generator. The problem is that when
>        you generate a document, you have to know which stylesheet to apply
>        for the first processing step.
>
>      * generate the document in a single step : modify the above tag to
>        specify the stylesheet :
>          <include file="company.xml" transform="company2docbook.xsl"/>
>        and use something like an XSL extension function to replace that
>        include tag with the corresponding elements. That would ask for
>        docbook XSL customization, but we already do that.
>
>      * use processing instructions : I don't know much about PI. Could
>        I use a PI to do the two-step processing described above by
>        including a <?PI use-stylesheet='first-step.xsl'?> in the document
>        that I feed to the docbook processing step ?
>
>      * ...
>
> As you understood, I'm missing a <xsl:include-after-transform/> tag...
>
> I look forward to your ideas and comments.

For max flexibility I'm guessing you want this to be transparent to the
stylesheet.  The best way I could think of making this happen would be to add
an option to the 4XSLT processor for translating xinclude instructions from
result elements.

This would take the form:

* Update the XmlWriter and RtfWriter to look out for output requests that meet
the XInclude spec and automatically render the resulting text or nodes into
the output.  Probably by parsing to cDomlette in both cases, and calling
xml.dom.ext.Print on the resulting document element in the case of XMLWriter
(watch out for cdata-section-element specs and CDATASections in the XIncluded
file).  In the case of RTFWriter, one could either translate the domlette
nodes to RTFNodes or just write a quick'n dirty writer that generates RTFNodes
directly from text.

* Add a parameter to the processor (off by default) which enables this
automatic translation

This actually would not be that difficult to implement, and if you're itching
badly enough, you might want to work out a patch (on the R0-11-1-prerelease
branch of CVS 4Suite) that implements it.  It's useful enough (and an inert
enough feature) that if all goes well, it could be included in 0.11.1 final,
even though bugfixes are the priority at this point.

Failing that, and if you don't mind minimal XSLT additions, the best solution
is probably

<xsl:template match="affiliation">
  <xsl:variable name="included">
    <xsl:value-of select="document('company.xml')"/>
  </xsl:variable>
  <xsl:copy>
    <xsl:copy-of select="$included"/>
  </xsl:copy>
</xsl:template>

Does all this help?
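
For readers who want to experiment outside an XSLT processor, here is a
rough Python sketch of the "include after transform" idea, using only
xml.etree.ElementTree from the standard library.  It is an illustration
under assumptions, not 4XSLT behaviour: company_to_docbook is a
hypothetical stand-in for company2docbook.xsl, and the two documents are
trimmed copies of the ones in the original post.

```python
import xml.etree.ElementTree as ET

COMPANY_XML = """<company>
  <name>Logilab</name>
  <address>10 rue Louis Vicat</address>
  <web>http://www.logilab.com/</web>
</company>"""

DOCUMENT_XML = """<article>
  <author>
    <surname>Chauvat</surname>
    <affiliation/>
  </author>
</article>"""


def company_to_docbook(company):
    """Hypothetical stand-in for company2docbook.xsl.

    Maps <company> children onto the DocBook elements listed in the
    comment of the original document.xml.
    """
    orgname = ET.Element("orgname")
    orgname.text = company.findtext("name")
    address = ET.Element("address")
    street = ET.SubElement(address, "street")
    street.text = company.findtext("address")
    return [orgname, address]


def include_after_transform(article, company):
    """Append the transformed company data to every <affiliation>."""
    for affiliation in article.iter("affiliation"):
        affiliation.extend(company_to_docbook(company))
    return article


article = include_after_transform(ET.fromstring(DOCUMENT_XML),
                                  ET.fromstring(COMPANY_XML))
print(ET.tostring(article, encoding="unicode"))
```

The DocBook side never looks at the structure of company.xml; only the
stand-in transform does, which is the separation being asked for.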


-- 
Uche Ogbuji                               Principal Consultant
uche.ogbuji@fourthought.com               +1 303 583 9900 x 101
Fourthought, Inc.                         http://Fourthought.com
4735 East Walnut St, Boulder, CO 80301-2537, USA
XML strategy, XML tools (http://4Suite.org), knowledge management


From Mike.Olson@fourthought.com  Wed Jul 18 21:09:14 2001
From: Mike.Olson@fourthought.com (Mike Olson)
Date: Wed, 18 Jul 2001 14:09:14 -0600
Subject: [4suite] Re: [XML-SIG] xsl:nclude-and-transform
References: <200107181625.f6IGPeV02008@localhost.local>
Message-ID: <3B55ECEA.6E5C4E5D@fourthought.com>

> >
> > Here are the ideas I had so far :
> >
> >      * forget about my own DTD and write company.xml using docbook, then
> >        include an entity. It already works like this, but I need to
> >        make it more flexible...
> >
> >      * generate the document in two steps : replace the above comment with
> >        a specific <include file="company.xml"/> tag that gets processed by
> >        a first XSLT and replaced with the proper docbook elements, then
> >        feed the result to the docbook generator. The problem is that when
> >        you generate a document, you have to know which stylesheet to apply
> >        for the first processing step.
> >
> >      * generate the document in a single step : modify the above tag to
> >        specify the stylesheet :
> >          <include file="company.xml" transform="company2docbook.xsl"/>
> >        and use something like an XSL extension function to replace that
> >        include tag with the corresponding elements. That would ask for
> >        docbook XSL customization, but we already do that.
> >
> >      * use processing instructions : I don't know much about PI. Could
> >        I use a PI to do the two-step processing described above by
> >        including a <?PI use-stylesheet='first-step.xsl'?> in the document
> >        that I feed to the docbook processing step ?
> >
> >      * ...

I must be missing the problem statement.  From what I gathered, you want
to insert company.xml into your docbook src, but translate company.xml
into docbook before doing so.  Is that correct?

My first inclination would be to use what you call step 2.  However, I
think you can do this all in XSLT with something like:


<xsl:include href='docbook.xsl'/>
<xsl:include href='company.xsl'/>

<xsl:template match='affiliation'>
  <xsl:variable name='translated'>
    <xsl:apply-templates select='document("company.xml")'
                         mode='company-to-docbook'/>
  </xsl:variable>
  <xsl:apply-templates select='ftext:node-set($translated)'/>
</xsl:template>


> >
> > As you understood, I'm missing a <xsl:include-after-transform/> tag...
> >
> > I look forward to your ideas and comments.
> 
> For max flexibility I'm guessing you want this to be transparent to the
> stylesheet.  The best way I could think of making this happen would be to add
> an option to the 4XSLT processor for translating xinclude instructions from
> result elements.

I don't follow.  Why would you want to output xinclude, when in a
stylesheet you can just use copy-of and the document function?

> 
> Failing that, and if you don't mind minimal XSLT additions, the best solution
> is probably
> 
> <xsl:template match="affiliation">
>   <xsl:variable name="included">
>     <xsl:value-of select="document('company.xml')"/>
>   </xsl:variable>
>   <xsl:copy>
>     <xsl:copy-of select="$included"/>
>   </xsl:copy>
> </xsl:template>
> 
> Does all this help?

I guess I also don't see how this is less flexible than using XInclude
in the source document.  The only way this is a bit more flexible is
that you could parameterize "company.xml".

Mike

-- 
Mike Olson                                Principal Consultant
mike.olson@fourthought.com                +1 303 583 9900 x 102
Fourthought, Inc.                         http://Fourthought.com 
4735 East Walnut St,                      http://4Suite.org
Boulder, CO 80301-2537, USA
XML strategy, XML tools, knowledge management


From uche.ogbuji@fourthought.com  Wed Jul 18 21:27:12 2001
From: uche.ogbuji@fourthought.com (Uche Ogbuji)
Date: Wed, 18 Jul 2001 14:27:12 -0600
Subject: [4suite] Re: [XML-SIG] xsl:nclude-and-transform
References: <200107181625.f6IGPeV02008@localhost.local> <3B55ECEA.6E5C4E5D@fourthought.com>
Message-ID: <3B55F120.ACBDB43E@fourthought.com>

Mike Olson wrote:

> I must be missing the problem statement.  From what I gathered, you want
> to insert company.xml into your docbook src, but translate company.xml
> into docbook before doing so.  Is that correct?

That's not at all what I read, but I'll let Nico speak for himself.


> > > As you understood, I'm missing a <xsl:include-after-transform/> tag...

This is mostly the question I was answering.  Note *after*, not
*before*.


> > Failing that, and if you don't mind minimal XSLT additions, the best solution
> > is probably
> >
> > <xsl:template match="affiliation">
> >   <xsl:variable name="included">
> >     <xsl:value-of select="document('company.xml')"/>
> >   </xsl:variable>
> >   <xsl:copy>
> >     <xsl:copy-of select="$included"/>
> >   </xsl:copy>
> > </xsl:template>
> >
> > Does all this help?
> 
> I guess I also don't see how this is less flexible than using XInclude
> in the source document.  The only way this is a bit more flexible is
> that you could parameterize "company.xml".

No.  You can also parameterize when and where you do the inclusions, as
well as their semantics.

Most importantly, your Docbook stylesheet needn't know anything about
company.xml's structure.


-- 
Uche Ogbuji                               Principal Consultant
uche.ogbuji@fourthought.com               +1 303 583 9900 x 101
Fourthought, Inc.                         http://Fourthought.com 
4735 East Walnut St, Boulder, CO 80301-2537, USA
XML strategy, XML tools (http://4Suite.org), knowledge management


From Mike.Olson@fourthought.com  Wed Jul 18 21:29:51 2001
From: Mike.Olson@fourthought.com (Mike Olson)
Date: Wed, 18 Jul 2001 14:29:51 -0600
Subject: [4suite] Re: [XML-SIG] xsl:nclude-and-transform
References: <200107181625.f6IGPeV02008@localhost.local> <3B55ECEA.6E5C4E5D@fourthought.com> <3B55F120.ACBDB43E@fourthought.com>
Message-ID: <3B55F1BF.C864F378@fourthought.com>

Uche Ogbuji wrote:
> 
> Mike Olson wrote:
> 
> > I must be missing the problem statement.  From what I gathered, you want
> > to insert company.xml into your docbook src, but translate company.xml
> > into docbook before doing so.  Is that correct?
> 
> That's not at all what I read, but I'll let Nico speak for himself.
> 
> > > > As you understood, I'm missing a <xsl:include-after-transform/> tag...
> 
> This is mostly the question I was answering.  Note *after*, not
> *before*.
> 
> > > Failing that, and if you don't mind minimal XSLT additions, the best solution
> > > is probably
> > >
> > > <xsl:template match="affiliation">
> > >   <xsl:variable name="included">
> > >     <xsl:value-of select="document('company.xml')"/>
> > >   </xsl:variable>
> > >   <xsl:copy>
> > >     <xsl:copy-of select="$included"/>
> > >   </xsl:copy>
> > > </xsl:template>
> > >
> > > Does all this help?
> >
> > I guess I also don't see how this is less flexible than using XInclude
> > in the source document.  The only way this is a bit more flexible is
> > that you could parameterize "company.xml".
> 
> No.  You can also parameterize when and where you do the inclusions, as
> well as their semantics.
> 
> Most importantly, your Docbook stylesheet needn't know anything about
> company.xml's structure.

Something has to convert the structure of company.xml into either
docbook, or the expected output.

Mike


-- 
Mike Olson                                Principal Consultant
mike.olson@fourthought.com                +1 303 583 9900 x 102
Fourthought, Inc.                         http://Fourthought.com 
4735 East Walnut St,                      http://4Suite.org
Boulder, CO 80301-2537, USA
XML strategy, XML tools, knowledge management


From uche.ogbuji@fourthought.com  Wed Jul 18 21:44:11 2001
From: uche.ogbuji@fourthought.com (Uche Ogbuji)
Date: Wed, 18 Jul 2001 14:44:11 -0600
Subject: [4suite] Re: [XML-SIG] xsl:nclude-and-transform
References: <200107181625.f6IGPeV02008@localhost.local> <3B55ECEA.6E5C4E5D@fourthought.com> <3B55F120.ACBDB43E@fourthought.com> <3B55F1BF.C864F378@fourthought.com>
Message-ID: <3B55F51B.242E5F15@fourthought.com>

Mike Olson wrote:

> > Most importantly, your Docbook stylesheet needn't know anything about
> > company.xml's structure.
> 
> Something has to convert the structure of company.xml into either
> docbook, or the expected output.

I didn't understand that as part of the requirement.  I understood that
company.xml was a black box through the processing.  I guess here's
where we need Nico's help again in explaining exactly what he wants.


-- 
Uche Ogbuji                               Principal Consultant
uche.ogbuji@fourthought.com               +1 303 583 9900 x 101
Fourthought, Inc.                         http://Fourthought.com 
4735 East Walnut St, Boulder, CO 80301-2537, USA
XML strategy, XML tools (http://4Suite.org), knowledge management


From noreply@sourceforge.net  Wed Jul 18 21:50:58 2001
From: noreply@sourceforge.net (noreply@sourceforge.net)
Date: Wed, 18 Jul 2001 13:50:58 -0700
Subject: [XML-SIG] [ pyxml-Patches-442574 ] Kick-off for the 0.6.6 release
Message-ID: <E15MyHa-0000FW-00@usw-sf-web3.sourceforge.net>

Patches item #442574, was opened at 2001-07-18 13:50
You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=306473&aid=442574&group_id=6473

Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Uche Ogbuji (uche)
Assigned to: Nobody/Anonymous (nobody)
Summary: Kick-off for the 0.6.6 release

Initial Comment:
For reference, here is the patch that I've already
committed to the o6maint branch.  It merges in fixes
from the tip (HEAD) that should be included in the
0.6.6 release.



From Mike.Olson@fourthought.com  Wed Jul 18 21:44:34 2001
From: Mike.Olson@fourthought.com (Mike Olson)
Date: Wed, 18 Jul 2001 14:44:34 -0600
Subject: [4suite] Re: [XML-SIG] xsl:nclude-and-transform
References: <200107181625.f6IGPeV02008@localhost.local> <3B55ECEA.6E5C4E5D@fourthought.com> <3B55F120.ACBDB43E@fourthought.com> <3B55F1BF.C864F378@fourthought.com> <3B55F51B.242E5F15@fourthought.com>
Message-ID: <3B55F532.9EBDF6A2@fourthought.com>

Uche Ogbuji wrote:
> 
> Mike Olson wrote:
> 
> > > Most importantly, your Docbook stylesheet needn't know anything about
> > > company.xml's structure.
> >
> > Something has to convert the structure of company.xml into either
> > docbook, or the expected output.
> 
> I didn't understand that as part of the requirement.  I understood that
> company.xml was a black box through the processing.  I guess here's
> where we need Nico's help again in explaining exactly what he wants.

Agreed.  Nico wake up

Mike

-- 
Mike Olson                                Principal Consultant
mike.olson@fourthought.com                +1 303 583 9900 x 102
Fourthought, Inc.                         http://Fourthought.com 
4735 East Walnut St,                      http://4Suite.org
Boulder, CO 80301-2537, USA
XML strategy, XML tools, knowledge management


From uche.ogbuji@fourthought.com  Wed Jul 18 21:51:41 2001
From: uche.ogbuji@fourthought.com (Uche Ogbuji)
Date: Wed, 18 Jul 2001 14:51:41 -0600
Subject: [XML-SIG] SF category for DOM?
Message-ID: <200107182051.f6IKpgN02616@localhost.local>

I just noticed that there is no category for DOM in bugs or patches for 4Suite 
on SF.  I don't have admin privs or I would have added this.


-- 
Uche Ogbuji                               Principal Consultant
uche.ogbuji@fourthought.com               +1 303 583 9900 x 101
Fourthought, Inc.                         http://Fourthought.com 
4735 East Walnut St, Boulder, CO 80301-2537, USA
XML strategy, XML tools (http://4Suite.org), knowledge management


From fdrake@acm.org  Wed Jul 18 21:57:56 2001
From: fdrake@acm.org (Fred L. Drake, Jr.)
Date: Wed, 18 Jul 2001 16:57:56 -0400 (EDT)
Subject: [XML-SIG] SF category for DOM?
In-Reply-To: <200107182051.f6IKpgN02616@localhost.local>
References: <200107182051.f6IKpgN02616@localhost.local>
Message-ID: <15189.63572.717790.187096@cj42289-a.reston1.va.home.com>

Uche Ogbuji writes:
 > I just noticed that there is no category for DOM in bugs or patches
 > for 4Suite on SF.  I don't have admin privs or I would have added
 > this.

  I've added "DOM" as a category for both bugs and patches, and
renamed the "sax" category for patches to "SAX".


  -Fred

-- 
Fred L. Drake, Jr.  <fdrake at acm.org>
PythonLabs at Digital Creations


From noreply@sourceforge.net  Thu Jul 19 08:26:12 2001
From: noreply@sourceforge.net (noreply@sourceforge.net)
Date: Thu, 19 Jul 2001 00:26:12 -0700
Subject: [XML-SIG] [ pyxml-Patches-442672 ] xml.dom.ext.reader.HtmlLib patch
Message-ID: <E15N8CK-0001V0-00@usw-sf-web3.sourceforge.net>

Patches item #442672, was opened at 2001-07-19 00:26
You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=306473&aid=442672&group_id=6473

Category: DOM
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Alexandre Fayolle (afayolle)
Assigned to: Nobody/Anonymous (nobody)
Summary: xml.dom.ext.reader.HtmlLib patch

Initial Comment:
When passed an ownerDocument, the HtmlParser would
empty it first. This patch fixes this.

Cheers 

Alexandre



From Alexandre.Fayolle@logilab.fr  Thu Jul 19 08:34:04 2001
From: Alexandre.Fayolle@logilab.fr (Alexandre Fayolle)
Date: Thu, 19 Jul 2001 09:34:04 +0200 (CEST)
Subject: [XML-SIG] xml.dom.ext.reader.HtmlLib
In-Reply-To: <200107181529.f6IFTtS01825@localhost.local>
Message-ID: <Pine.LNX.4.21.0107190930570.1054-100000@pisces.logilab.fr>

On Wed, 18 Jul 2001, Uche Ogbuji wrote:

> I wasn't aware of this, and I agree it's a nasty bug.  Please do prep a patch 
> if you can.  Just be sure to check it in to the o6maint branch, or put it on 
> SF for me to do so (yes, we do intend to work down the SF docket before final 
> release).

It's on SF (patch #442672). 

The code was written that way in the first place because
implementation.createHTMLDocument does not create an empty document,
but rather an HTML skeleton (with a HEAD/TITLE and a BODY). 

Alexandre Fayolle
-- 
LOGILAB, Paris (France).
http://www.logilab.com   http://www.logilab.fr  http://www.logilab.org
Narval, the first software agent available as free software (GPL).


From Alexandre.Fayolle@logilab.fr  Thu Jul 19 08:46:16 2001
From: Alexandre.Fayolle@logilab.fr (Alexandre Fayolle)
Date: Thu, 19 Jul 2001 09:46:16 +0200 (CEST)
Subject: [XML-SIG] NS/nonNS apis
In-Reply-To: <200107181532.f6IFWKr01839@localhost.local>
Message-ID: <Pine.LNX.4.21.0107190934340.1054-100000@pisces.logilab.fr>

On Wed, 18 Jul 2001, Uche Ogbuji wrote:

> Sigh.  I've resisted doing this (the DOM spec itself gives us excuse not to 
> bother), but perhaps it's time to bite the performance bullet and make the NS 
> and non-NS APIs smarter about each other.

I know how it feels. 

Would there be a huge penalty in having setAttribute(name,value) use
('',name) as the key for the attribute in the dictionary?

The DOM API is already inconsistent in this regard anyway, with
removeAttributeNodeNS being nonexistent. 
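
To make the suggestion concrete, here is a toy sketch in Python.  AttrStore
is a hypothetical class, not the 4DOM element implementation, and it
ignores prefixes in non-NS names; it only shows both APIs sharing one
dictionary keyed by (namespaceURI, localName), with '' standing for "no
namespace":

```python
EMPTY_NS = ""  # key used for "no namespace", as suggested above


class AttrStore:
    """Hypothetical toy attribute map (an assumption, not 4DOM code)."""

    def __init__(self):
        self._attrs = {}  # (namespaceURI, localName) -> value

    def setAttributeNS(self, namespace_uri, qualified_name, value):
        # Drop any prefix; only the local name is part of the key.
        local_name = qualified_name.split(":")[-1]
        self._attrs[(namespace_uri or EMPTY_NS, local_name)] = value

    def getAttributeNS(self, namespace_uri, local_name):
        return self._attrs.get((namespace_uri or EMPTY_NS, local_name), "")

    # The non-NS methods use the empty namespace as the first half of
    # the key, so both APIs read and write the same dictionary entry.
    def setAttribute(self, name, value):
        self._attrs[(EMPTY_NS, name)] = value

    def getAttribute(self, name):
        return self._attrs.get((EMPTY_NS, name), "")


el = AttrStore()
el.setAttribute("href", "index.html")
# A value set through the non-NS API is visible through the NS API.
print(el.getAttributeNS(EMPTY_NS, "href"))
```

The extra cost in this sketch is one tuple construction per call, though a
real DOM would also have to decide how prefixed names are handled.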

Alexandre Fayolle
-- 
LOGILAB, Paris (France).
http://www.logilab.com   http://www.logilab.fr  http://www.logilab.org
Narval, the first software agent available as free software (GPL).


From Alexandre.Fayolle@logilab.fr  Thu Jul 19 10:48:19 2001
From: Alexandre.Fayolle@logilab.fr (Alexandre Fayolle)
Date: Thu, 19 Jul 2001 11:48:19 +0200 (CEST)
Subject: [XML-SIG] Request For Clarification on packages
Message-ID: <Pine.LNX.4.21.0107191110470.1054-100000@pisces.logilab.fr>

Hi everyone,

So far I have mainly been using the DOM part of PyXML, and since I
volunteered to write a SAX wrapper around sgmlop, I'm getting to know some
other parts of the library. However, not everything is as crystal clear as
I'd like, so I thought maybe I should ask here for a few
clarifications. What I'll do is make a number of statements, which I
believe are true. If some of them are either plain wrong or inaccurate,
I'd appreciate it if someone knowledgeable could correct me. 

There we go:

 * the xml.parsers module contains 4 parsers (pyexpat, sgmllib, sgmlop,
xmlproc), each of which has its own API

 * xml.sax.drivers contains SAX wrappers for the 4 parsers above, plus
wrappers for some other parsers

 * xml.sax.drivers2 contains SAX 2.0 wrappers for pyexpat and xmlproc

 * xml.sax.expatreader really belongs to xml.sax.drivers2 but is there for
backwards compatibility. One should preferably use
xml.sax.drivers2.drv_pyexpat

 * a SAX 2.0 parser should implement the interface defined in
xml.sax.xmlreader.XmlReader

 * xml.sax.handler defines the interface that should be implemented by
someone willing to use a SAX 2.0 parser. A parser making callbacks to
methods not defined in there is not SAX 2.0 compatible.

 * xml.sax.saxlib defines the interfaces of objects manipulated by a SAX
2.0 parser.

 * xml.sax.saxutils provides basic implementations of some of the
interfaces defined in handler and saxlib

 * xml.dom.ext.reader.Sax contains a DOM generator that uses a SAX parser,
with the Reader interface

 * xml.dom.ext.reader.Sax2 contains a DOM generator that uses a SAX 2.0
parser with the Reader interface

 * xml.dom.ext.reader.Sgmlop contains a DOM generator that uses the raw
Sgmlop reader to generate either a DOM or an HTML DOM.

 * xml.dom.ext.reader.HtmlLib is a wrapper around
xml.dom.ext.reader.Sgmlop which provides the Reader interface

 * xml.dom.ext.reader.HtmlSax is an HTML DOM generator which uses a SAX
parser as the input (is this SAX 1 or SAX 2?)

 * xml.dom.ext.reader.PyExpat is a DOM generator that uses the raw Expat
reader together with the Reader interface

 * xml.dom.ext.reader.Sax2Lib is redundant with xml.sax.handler
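
To illustrate the xml.sax.handler statement, here is a minimal sketch of a
SAX 2.0 client.  It uses the xml.sax package as found in the Python
standard library; the PyXML layout discussed above may differ, and
ElementCollector is a made-up name:

```python
import xml.sax
from xml.sax.handler import ContentHandler


class ElementCollector(ContentHandler):
    """Records the name of every element the parser reports."""

    def __init__(self):
        ContentHandler.__init__(self)
        self.names = []

    def startElement(self, name, attrs):
        # Called by the parser for each start tag (non-namespace mode).
        self.names.append(name)


collector = ElementCollector()
xml.sax.parseString(b"<company><name>Logilab</name></company>", collector)
print(collector.names)
```

The client only overrides callbacks defined by ContentHandler, so any
conforming SAX 2.0 driver could be plugged in behind it.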


Alexandre Fayolle
-- 
LOGILAB, Paris (France).
http://www.logilab.com   http://www.logilab.fr  http://www.logilab.org
Narval, the first software agent available as free software (GPL).


From noreply@sourceforge.net  Thu Jul 19 11:07:05 2001
From: noreply@sourceforge.net (noreply@sourceforge.net)
Date: Thu, 19 Jul 2001 03:07:05 -0700
Subject: [XML-SIG] [ pyxml-Bugs-442700 ] localized messages and unicode bug
Message-ID: <E15NAi1-000072-00@usw-sf-web1.sourceforge.net>

Bugs item #442700, was opened at 2001-07-19 03:07
You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=106473&aid=442700&group_id=6473

Category: 4Suite
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Alexandre Fayolle (afayolle)
Assigned to: Nobody/Anonymous (nobody)
Summary: localized messages and unicode bug

Initial Comment:
There are some problems with localized messages and
unicode handling, I believe.

Let's consider the following XSLT:

<?xml version="1.0" encoding='iso-8859-1'?>
<xsl:stylesheet
xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
version="1.0">
<xsl:varialbe name='foo'>bar</xsl:varialbe>
</xsl:stylesheet>

It's obviously wrong. Processing it with LC_ALL set to
C will give the following error:
[alf@pisces alf]$ python2 bin/4xslt TODO_HORN bug.xslt
Illegal Element "varialbe" in XSLT Namespace (see XSLT
Spec: 2.1).


However, if LC_ALL is set to fr_FR, for instance, I get
a Unicode Error:

Traceback (most recent call last):
  File "bin/4xslt", line 3, in ?
    _4xslt.XsltCommandLineApp().run()
  File "/usr/lib/python2.1/site-packages/Ft/Lib/CommandLine/CommandLineApp.py", line 87, in run
    cmd.run_command(self.authenticationFunction)
  File "/usr/lib/python2.1/site-packages/Ft/Lib/CommandLine/Command.py", line 83, in run_command
    self.function(self.clOptions, self.clArguments)
  File "/usr/lib/python2.1/site-packages/_xmlplus/xslt/_4xslt.py", line 106, in Run
    processor.appendStylesheetUri(sty)
  File "/usr/lib/python2.1/site-packages/_xmlplus/xslt/Processor.py", line 101, in appendStylesheetUri
    sty = self._styReader.fromUri(styleSheetUri, baseUri)
  File "/usr/lib/python2.1/site-packages/_xmlplus/xslt/StylesheetReader.py", line 560, in fromUri
    ownerDoc, stripElements)
  File "/usr/lib/python2.1/site-packages/Ft/Lib/ReaderBase.py", line 76, in fromUri
    rt = self.fromStream(stream, newBaseUri, ownerDoc, stripElements)
  File "/usr/lib/python2.1/site-packages/_xmlplus/xslt/StylesheetReader.py", line 578, in fromStream
    success = self.parser.ParseFile(stream)
  File "/usr/lib/python2.1/site-packages/_xmlplus/xslt/StylesheetReader.py", line 344, in startElement
    raise XsltException(Error.XSLT_ILLEGAL_ELEMENT, local)
  File "/usr/lib/python2.1/site-packages/_xmlplus/xslt/__init__.py", line 72, in __init__
    msg = MessageSource.g_errorMessages[errorCode] % args
UnicodeError: ASCII decoding error: ordinal not in range(128)


Tested with Python 2.1.1c1 and 4Suite-0.11.1b3
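
The last frame of the traceback is a `%` formatting operation, which suggests (this is a reading of the traceback, not a confirmed diagnosis) that an 8-bit localized template is being combined with a Unicode argument and implicitly decoded as ASCII. A minimal sketch of that failure mode and of an explicit-decoding workaround, written for a current Python; the French template and both function names are hypothetical and are not taken from 4Suite's MessageSource:

```python
# Hypothetical stand-in for a localized g_errorMessages entry: an 8-bit
# ISO-8859-1 byte string, as an fr_FR message catalog might provide it.
TEMPLATE = b'\xc9l\xe9ment ill\xe9gal "%s" dans l\'espace de noms XSLT'


def format_implicitly(template_bytes, args):
    """Mimic Python 2's implicit promotion: the bytes are decoded as ASCII."""
    return template_bytes.decode("ascii") % args


def format_explicitly(template_bytes, args, encoding="iso-8859-1"):
    """Decode the template with its real encoding before formatting."""
    return template_bytes.decode(encoding) % args
```

With the template above, `format_implicitly` raises a decoding error on the first accented byte, while `format_explicitly` returns the formatted message.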

Cheers

Alexandre

----------------------------------------------------------------------

You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=106473&aid=442700&group_id=6473


From larsga@garshol.priv.no  Thu Jul 19 11:11:15 2001
From: larsga@garshol.priv.no (Lars Marius Garshol)
Date: 19 Jul 2001 12:11:15 +0200
Subject: [XML-SIG] Request For Clarification on packages
In-Reply-To: <Pine.LNX.4.21.0107191110470.1054-100000@pisces.logilab.fr>
References: <Pine.LNX.4.21.0107191110470.1054-100000@pisces.logilab.fr>
Message-ID: <m3itgp1b5o.fsf@lambda.garshol.priv.no>

* Alexandre Fayolle
| 
|  * xml.sax.drivers contains SAX wrappers for the 4 parsers above, plus
| wrappers for some other parsers

Yep. These are SAX 1.0 and obsolete.
 
|  * xml.sax.drivers2 contains SAX 2.0 wrappers for pyexpat and xmlproc

Yep.
 
|  * xml.sax.expatreader really belongs to xml.sax.drivers2 but is there for
| backwards compatibility. One should preferably use
| xml.sax.drivers2.drv_pyexpat

This file is empty and uses xml.sax.expatreader. I think this was done
to make interoperation with the main Python distro easier. drv_pyexpat
could really go away.
 
|  * a SAX 2.0 parser should implement the interface defined in
| xml.sax.xmlreader.XMLReader

Yes. If possible, it should also implement IncrementalParser. That is,
if the underlying parser provides the necessary functionality.
 
|  * xml.sax.handler defines the interface that should be implemented by
| someone willing to use a SAX2.0 parser. A parser making callback to
| methods not defined in there is not SAX 2.0 compatible.

True. Parsers can have properties that are callback handlers, however,
and this is the right way to sneak in new callbacks.
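
As an illustration of a callback handler installed through a property, here is a small sketch using the lexical-handler property of the standard library's xml.sax on a current Python. The `CommentCollector` class and `collect_comments` function are hypothetical names, and the sketch assumes `make_parser()` returns the expat-based reader:

```python
import xml.sax
from xml.sax.handler import ContentHandler, property_lexical_handler


class CommentCollector:
    """Receives lexical events; comments are not part of ContentHandler."""

    def __init__(self):
        self.comments = []

    def comment(self, text):
        self.comments.append(text)

    # The remaining lexical callbacks are accepted and ignored.
    def startCDATA(self):
        pass

    def endCDATA(self):
        pass

    def startDTD(self, name, public_id, system_id):
        pass

    def endDTD(self):
        pass


def collect_comments(xml_text):
    parser = xml.sax.make_parser()
    parser.setContentHandler(ContentHandler())
    lexical = CommentCollector()
    # The extra callbacks are attached as a property, not as a new handler slot.
    parser.setProperty(property_lexical_handler, lexical)
    parser.feed(xml_text)
    parser.close()
    return lexical.comments
```

A parser that does not know the property is expected to raise SAXNotRecognizedException from setProperty, so callers can detect the missing support.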
 
|  * xml.sax.saxlib defines the interfaces of objects manipulated by a SAX
| 2.0 parser.

Yes. 
 
|  * xml.sax.saxutils provides basic implementations of some of the
| interfaces defined in handler and saxlib

Yes.

|  * xml.dom.ext.reader.Sax contains a DOM generator that uses a SAX parser,
| with the Reader interface

SAX 1.0, yes, and therefore obsolete.
 
|  * xml.dom.ext.reader.Sax2 contains a DOM generator that uses a SAX 2.0
| parser with the Reader interface

Yes.
 
|  * xml.dom.ext.reader.HtmlSax is an HTML DOM generator which uses a SAX
| parser as the input (is this SAX 1 or SAX 2?)

This is SAX 1.0, but turning it into SAX 2.0 should be trivial. Only
ignorableWhitespace() and characters() need to be changed.
 
|  * xml.dom.ext.reader.PyExpat is a DOM generator that uses the raw
| Expat reader together with the Reader interface

Yes.
 
|  * xml.dom.ext.reader.Sax2Lib is redundant with xml.sax.handler

Yes. Some of it is also incompatible, it seems. This should be
deleted, IMHO.
 
--Lars M.


From Alexandre.Fayolle@logilab.fr  Thu Jul 19 13:11:01 2001
From: Alexandre.Fayolle@logilab.fr (Alexandre Fayolle)
Date: Thu, 19 Jul 2001 14:11:01 +0200 (CEST)
Subject: [XML-SIG] Request For Clarification on packages
In-Reply-To: <m3itgp1b5o.fsf@lambda.garshol.priv.no>
Message-ID: <Pine.LNX.4.21.0107191228040.1054-100000@pisces.logilab.fr>

Thanks for the quick answer.

On 19 Jul 2001, Lars Marius Garshol wrote:

<snip>
> Yep. These are SAX 1.0 and obsolete.
<snip>
> SAX 1.0, yes, and therefore obsolete.
<snip again>

What would you think of using the new warning feature of Python 2.1 to flag
these modules? Something along the lines of:

message = ('The SAX 1.0 API is obsolete. '
           'Please consider using SAX 2.0 instead')
try:
    import warnings
except ImportError:
    # Python < 2.1 has no warnings module: fall back to a plain message
    print(message)
else:
    warnings.warn(message, DeprecationWarning)

    # Ignore further deprecation warnings about this module
    warnings.filterwarnings("ignore", "", DeprecationWarning, __name__)


Alexandre Fayolle
-- 
LOGILAB, Paris (France).
http://www.logilab.com   http://www.logilab.fr  http://www.logilab.org
Narval, the first software agent available as free software (GPL).


From larsga@garshol.priv.no  Thu Jul 19 13:37:53 2001
From: larsga@garshol.priv.no (Lars Marius Garshol)
Date: 19 Jul 2001 14:37:53 +0200
Subject: [XML-SIG] Request For Clarification on packages
In-Reply-To: <Pine.LNX.4.21.0107191228040.1054-100000@pisces.logilab.fr>
References: <Pine.LNX.4.21.0107191228040.1054-100000@pisces.logilab.fr>
Message-ID: <m366cp3xi6.fsf@lambda.garshol.priv.no>

* Alexandre Fayolle
| 
| What would you think of using the new warning feature of Python 2.1
| to flag these modules?

I think we should do it. It will probably require some rearrangement
(because SAX 2.0 and 1.0 are currently mashed together), but I think
it's worth doing.

I'll do it as and when I can. If anyone gets there before me I don't
mind... 

--Lars M.


From Alexandre.Fayolle@logilab.fr  Thu Jul 19 14:33:30 2001
From: Alexandre.Fayolle@logilab.fr (Alexandre Fayolle)
Date: Thu, 19 Jul 2001 15:33:30 +0200 (CEST)
Subject: [XML-SIG] Sgmlop and IncrementalParser
In-Reply-To: <m3itgp1b5o.fsf@lambda.garshol.priv.no>
Message-ID: <Pine.LNX.4.21.0107191529070.1054-100000@pisces.logilab.fr>

On 19 Jul 2001, Lars Marius Garshol wrote:

> |  * a SAX 2.0 parser should implement the interface defined in
> | xml.sax.xmlreader.XMLReader
> 
> Yes. If possible, it should also implement IncrementalParser. That is,
> if the underlying parser provides the necessary functionality.

Do you happen to know if Sgmlop would support being wrapped into an
IncrementalParser, or am I better off using a plain XMLReader?

Alexandre Fayolle
-- 
LOGILAB, Paris (France).
http://www.logilab.com   http://www.logilab.fr  http://www.logilab.org
Narval, the first software agent available as free software (GPL).


From larsga@garshol.priv.no  Thu Jul 19 14:38:00 2001
From: larsga@garshol.priv.no (Lars Marius Garshol)
Date: 19 Jul 2001 15:38:00 +0200
Subject: [XML-SIG] Re: Sgmlop and IncrementalParser
In-Reply-To: <Pine.LNX.4.21.0107191529070.1054-100000@pisces.logilab.fr>
References: <Pine.LNX.4.21.0107191529070.1054-100000@pisces.logilab.fr>
Message-ID: <m3wv552g5j.fsf@lambda.garshol.priv.no>

* Alexandre Fayolle
| 
| Do you happen to know if Sgmlop would support being wrapped into an
| IncrementalParser, or am I better off using a plain XMLReader?

sgmlop supports the IncrementalParser methods (feed, close, and reset).
So XMLReader.parse should be implemented using reset, feed and close.
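
A rough sketch of that arrangement on a current Python, where xml.sax.xmlreader.IncrementalParser already provides a parse() that loops over feed() and ends with close(). sgmlop itself is not used here: the standard html.parser.HTMLParser serves as a stand-in backend because it also offers feed, close and reset. All class and function names below are hypothetical.

```python
import io
from html.parser import HTMLParser
from xml.sax.handler import ContentHandler
from xml.sax.xmlreader import AttributesImpl, IncrementalParser


class _Backend(HTMLParser):
    """Stand-in for sgmlop: a tolerant parser offering feed/close/reset."""

    def __init__(self, driver):
        HTMLParser.__init__(self, convert_charrefs=True)
        self._driver = driver

    def handle_starttag(self, tag, attrs):
        # Attributes without a value (e.g. <input checked>) arrive as None.
        attributes = {name: value or "" for name, value in attrs}
        handler = self._driver.getContentHandler()
        handler.startElement(tag, AttributesImpl(attributes))

    def handle_endtag(self, tag):
        self._driver.getContentHandler().endElement(tag)

    def handle_data(self, data):
        self._driver.getContentHandler().characters(data)


class FeedingDriver(IncrementalParser):
    """SAX 2 driver whose inherited parse() relies on reset, feed and close."""

    def __init__(self, bufsize=2 ** 16):
        IncrementalParser.__init__(self, bufsize)
        self._backend = _Backend(self)
        self._started = False

    def prepareParser(self, source):
        # IncrementalParser.parse() calls this before the first feed().
        self.reset()

    def reset(self):
        self._backend.reset()
        self._started = False

    def feed(self, data):
        if not self._started:
            self.getContentHandler().startDocument()
            self._started = True
        if isinstance(data, bytes):
            # Assumption: byte input is ISO-8859-1, which always decodes.
            data = data.decode("iso-8859-1")
        self._backend.feed(data)

    def close(self):
        self._backend.close()
        self.getContentHandler().endDocument()
        self._started = False


class Recorder(ContentHandler):
    """Collects the events it receives, for demonstration purposes."""

    def __init__(self):
        ContentHandler.__init__(self)
        self.events = []
        self.text = []

    def startElement(self, name, attrs):
        self.events.append(("start", name))

    def endElement(self, name):
        self.events.append(("end", name))

    def characters(self, content):
        self.text.append(content)


def parse_in_chunks(markup, bufsize):
    """Parse markup through the driver, reading bufsize characters at a time."""
    recorder = Recorder()
    driver = FeedingDriver(bufsize)
    driver.setContentHandler(recorder)
    driver.parse(io.StringIO(markup))
    return recorder.events, "".join(recorder.text)
```

Character data may arrive in several characters() calls when a buffer boundary falls inside a text run, which is why the helper joins the pieces before returning them.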

--Lars M.


From Nicolas.Chauvat@logilab.fr  Thu Jul 19 14:46:31 2001
From: Nicolas.Chauvat@logilab.fr (Nicolas Chauvat)
Date: Thu, 19 Jul 2001 15:46:31 +0200 (CEST)
Subject: [4suite] Re: [XML-SIG] xsl:nclude-and-transform
In-Reply-To: <3B55F532.9EBDF6A2@fourthought.com>
Message-ID: <Pine.LNX.4.21.0107191519190.31099-100000@aries.logilab.fr>

> > company.xml was a black box through the processing.  I guess here's
> > where we need Nico's help again in explaining exactly what he wants.
>
> Agreed.  Nico wake up

Ouch, sorry guys, I can't keep up with Colorado time. But it seems like I
started a nice thread :-) From what I read, Uche got the right
interpretation of my question.

I was asking for a means to fetch part of a document, transform it using a
parametrizable stylesheet, then include the result in the parsed document.

Compared to <xi:include href='company.xml'/>, which would copy the content
of company.xml, I was asking for a

   <include-after-transform
       src='company.xml'
       transform='company2dcbk.xslt'/>

[or even better ...
       src='companies.xml#xpointer(company/name/text()="Logilab")'
       ...
]

that would get replaced with

        <orgname>...</orgname>
        <address>...</address>

*before* the document is processed by the docbook XSL stylesheet.

Of course I could customize my docbook XSL stylesheet, but then I'd have to
include every single transform that converts data from one format to a
docbook representation.

I think that one way of doing it would be to enhance 4xslt as Uche
suggested. Another way to do it might be to forget about the "xsl
processor does all in one step" idea and use a script that first replaces
the include-and-transform tags (with an xsl transform ?), then feeds
the result to the XSL processor that applies the docbook stylesheet.
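
The first step of such a script could look roughly like the following sketch. It uses the standard xml.etree.ElementTree module and a plain Python function in place of an XSLT transform, since no XSLT processor is assumed here; `expand_includes`, `company_to_docbook` and the `<city>` element are hypothetical, and nested includes are not handled.

```python
import xml.etree.ElementTree as ET


def company_to_docbook(company):
    """Hypothetical stand-in for company2dcbk.xslt, written in Python."""
    orgname = ET.Element("orgname")
    orgname.text = company.findtext("name")
    address = ET.Element("address")
    address.text = company.findtext("city")
    return [orgname, address]


def expand_includes(root, load, transforms):
    """Replace each <include-after-transform> child by its transformed source.

    load maps a src value to a parsed element; transforms maps a transform
    name to a callable returning a list of replacement elements.
    """
    for parent in list(root.iter()):
        rebuilt = []
        for child in list(parent):
            if child.tag == "include-after-transform":
                source = load(child.get("src"))
                transform = transforms[child.get("transform")]
                rebuilt.extend(transform(source))
            else:
                rebuilt.append(child)
        parent[:] = rebuilt
    return root
```

The expanded tree can then be serialized and handed to whatever XSL processor applies the docbook stylesheet, which keeps the two steps independent.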

I'll look into it some more and maybe ask for more details about
adding that feature to 4xslt. Let me know if you've got a better idea and
thank you very much for your support :-)

-- 
Nicolas Chauvat

http://www.logilab.com - "Mais où est donc Ornicar ?" - LOGILAB, Paris (France)


From lpc@racemi.com  Thu Jul 19 16:57:40 2001
From: lpc@racemi.com (Luis P Caamano)
Date: Thu, 19 Jul 2001 11:57:40 -0400
Subject: [XML-SIG] Installing to split prefix/exec-prefix
In-Reply-To: <E15Mtl8-0003dn-00@mail.python.org>
Message-ID: <AHEBKIMJLGAIBJCOANKPAEAICBAA.lpc@racemi.com>

I have a python installation on an NFS server that
supports both Linux and BSD systems.  When configuring
python, I set prefix to /python/python_common and
exec-prefix to /python/python_<OS> where <OS> is
either Linux or FreeBSD depending on where I run
configure, make, and make install.

It works great!

Next was installing PyXML.  After running the
appropriate python setup.py [build/install], I noticed
that PyXML stuff always goes to the exec_prefix
directory, including .py files.  I expected those
.py files to go to python_common and not the
OS specific directory.

In other words, I expected .py files to go to

python_common/lib/python2.1/site-packages/_xmlplus

and exec_prefix related stuff to

python_<OS>/lib/python2.1/site-packages/_xmlplus

or better, to

python_<OS>/lib/python2.1/lib-dynload

Is this the way it's supposed to be?
Is this a distutils or PyXML problem?
Should I do things differently?
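
For reference, a small sketch of how to inspect the two install locations involved. It uses the sysconfig module of later Python versions (not available in Python 2.1), and `install_locations` is a hypothetical helper name. As far as I know, distutils has traditionally installed a distribution that contains extension modules entirely under the platform-specific location, which would be consistent with the behaviour described above; treat that as an assumption to verify against your distutils version.

```python
import sys
import sysconfig


def install_locations():
    """Return the directories this interpreter uses for pure and platform code."""
    return {
        "prefix": sys.prefix,
        "exec_prefix": sys.exec_prefix,
        # Where pure-Python packages are installed.
        "purelib": sysconfig.get_path("purelib"),
        # Where packages with compiled extension modules are installed.
        "platlib": sysconfig.get_path("platlib"),
    }
```

On a split prefix/exec-prefix installation the two library paths are expected to differ; on a default build they are usually the same directory.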

Thanks in advance for your reply.

----------------------------------
Luis P. Caamano 
lcaamano@mindspring.com
Atlanta, GA, USA
----------------------------------


From uche.ogbuji@fourthought.com  Thu Jul 19 17:28:02 2001
From: uche.ogbuji@fourthought.com (Uche Ogbuji)
Date: Thu, 19 Jul 2001 10:28:02 -0600
Subject: [XML-SIG] Request For Clarification on packages
In-Reply-To: Message from Alexandre Fayolle <Alexandre.Fayolle@logilab.fr>
 of "Thu, 19 Jul 2001 11:48:19 +0200." <Pine.LNX.4.21.0107191110470.1054-100000@pisces.logilab.fr>
Message-ID: <200107191628.f6JGS2D06405@localhost.local>

> Hi everyone,
> 
> I so far have been mainly using the DOM part of PyXML, and since I
> volunteered to write a SAX wrapper around sgmlop, I'm getting to know some
> other parts of the library. However, everything is not as crystal clear as
> I'd like, so I thought maybe I should ask here for a few
> clarifications. What I'll do is make a number of statements, which I
> believe are true. If some of them are either plain wrong or inaccurate,
> I'd appreciate if someone knowledgeable could correct me. 

[snip]

Thanks so much for doing this.  I'd like to check it in as 

doc/package-summary.txt

Any objections?


-- 
Uche Ogbuji                               Principal Consultant
uche.ogbuji@fourthought.com               +1 303 583 9900 x 101
Fourthought, Inc.                         http://Fourthought.com 
4735 East Walnut St, Boulder, CO 80301-2537, USA
XML strategy, XML tools (http://4Suite.org), knowledge management


From Mike.Olson@fourthought.com  Thu Jul 19 18:41:06 2001
From: Mike.Olson@fourthought.com (Mike Olson)
Date: Thu, 19 Jul 2001 11:41:06 -0600
Subject: [4suite] Re: [XML-SIG] xsl:nclude-and-transform
References: <Pine.LNX.4.21.0107191519190.31099-100000@aries.logilab.fr>
Message-ID: <3B571BB2.816E109E@fourthought.com>

Nicolas Chauvat wrote:
>
> that would get replaced with
>
>         <orgname>...</orgname>
>         <address>...</address>
>
> *before* the document is processed by the docbook XSL stylesheet.
>
> Of course I could customize my docbook XSL stylesheet, but then I'd have to
> include every single transform that converts data from one format to a
> docbook representation.
>
> I think that one way of doing it would be to enhance 4xslt as Uche
> suggested. Another way to do it might be to forget about the "xsl
> processor does all in one step" idea and use a script that first replaces
> the include-and-transform tags (with an xsl transform ?), then feeds
> the result to the XSL processor that applies the docbook stylesheet.
>
> I'll look into it some more and maybe ask for more details about
> adding that feature to 4xslt. Let me know if you've got a better idea and
> thank you very much for your support :-)

Two-step processing will give you the most flexibility.

I guess I don't see how extending 4xslt will help.  I see your solution
as "include and transform", but what Uche said looks more like
"transform and include", which I don't see as helping.

To do it all in one XSLT script, you would need to do something similar
to my second approach of

1.  Read "company.xml" into an RTF (result tree fragment)
2.  Select that RTF into another to "render it to docbook"
3.  Select the resulting RTF against the standard docbook stylesheets.

The downside is that you need all possible stylesheets included in the
main stylesheet, but you can pick and choose how to render it as you
please.

Mike


>
> --
> Nicolas Chauvat
>
> http://www.logilab.com - "Mais où est donc Ornicar ?" - LOGILAB, Paris (France)

-- 
Mike Olson                                Principal Consultant
mike.olson@fourthought.com                +1 303 583 9900 x 102
Fourthought, Inc.                         http://Fourthought.com
4735 East Walnut St,                      http://4Suite.org
Boulder, CO 80301-2537, USA
XML strategy, XML tools, knowledge management


From larsga@garshol.priv.no  Thu Jul 19 19:00:21 2001
From: larsga@garshol.priv.no (Lars Marius Garshol)
Date: 19 Jul 2001 20:00:21 +0200
Subject: [XML-SIG] Request For Clarification on packages
In-Reply-To: <200107191628.f6JGS2D06405@localhost.local>
References: <200107191628.f6JGS2D06405@localhost.local>
Message-ID: <m3u208n6iy.fsf@lambda.garshol.priv.no>

* Uche Ogbuji
| 
| I'd like to check it in as 
| 
| doc/package-summary.txt
| 
| Any objections?

Good idea. I'm all for it.

--Lars m.


From martin@loewis.home.cs.tu-berlin.de  Thu Jul 19 20:08:48 2001
From: martin@loewis.home.cs.tu-berlin.de (Martin v. Loewis)
Date: Thu, 19 Jul 2001 21:08:48 +0200
Subject: [XML-SIG] Request For Clarification on packages
In-Reply-To: <m3itgp1b5o.fsf@lambda.garshol.priv.no> (message from Lars Marius
 Garshol on 19 Jul 2001 12:11:15 +0200)
References: <Pine.LNX.4.21.0107191110470.1054-100000@pisces.logilab.fr> <m3itgp1b5o.fsf@lambda.garshol.priv.no>
Message-ID: <200107191908.f6JJ8m301386@mira.informatik.hu-berlin.de>

> This file is empty and uses xml.sax.expatreader. I think this was done
> to make interoperation with the main Python distro easier. drv_pyexpat
> could really go away.

To remove it, you'd also need to update sax2exts, to refer to the new
module. I also think it is easier for users if they know they can find
all drivers in drivers2.

Regards,
Martin


From martin@loewis.home.cs.tu-berlin.de  Thu Jul 19 20:13:34 2001
From: martin@loewis.home.cs.tu-berlin.de (Martin v. Loewis)
Date: Thu, 19 Jul 2001 21:13:34 +0200
Subject: [XML-SIG] Request For Clarification on packages
In-Reply-To: <Pine.LNX.4.21.0107191110470.1054-100000@pisces.logilab.fr>
 (message from Alexandre Fayolle on Thu, 19 Jul 2001 11:48:19 +0200
 (CEST))
References: <Pine.LNX.4.21.0107191110470.1054-100000@pisces.logilab.fr>
Message-ID: <200107191913.f6JJDYu01394@mira.informatik.hu-berlin.de>

>  * xml.sax.expatreader really belongs to xml.sax.drivers2 but is there for
> backwards compatibility. One should preferably use
> xml.sax.drivers2.drv_pyexpat

No. xml.sax.expatreader is official Python API, since it is part of
the standard Python library. One should not need to care about the
module names most of the time, since looking up drivers should be done
through make_parser.
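
For illustration, a minimal sketch of driver lookup through make_parser with the standard library's xml.sax on a current Python. No driver module is named anywhere in the code; `ElementCounter` and `count_elements` are hypothetical names.

```python
import io
import xml.sax
from xml.sax.handler import ContentHandler


class ElementCounter(ContentHandler):
    """Counts how often each element name starts."""

    def __init__(self):
        ContentHandler.__init__(self)
        self.counts = {}

    def startElement(self, name, attrs):
        self.counts[name] = self.counts.get(name, 0) + 1


def count_elements(xml_text):
    # make_parser() picks an available driver; the caller never imports one.
    parser = xml.sax.make_parser()
    counter = ElementCounter()
    parser.setContentHandler(counter)
    parser.parse(io.StringIO(xml_text))
    return counter.counts
```

make_parser also accepts an optional list of preferred driver module names, for the rare case where a specific driver is required.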

It might be desirable to select a parser not by its name, but by its
feature; such API is currently not available for SAX (it is for DOM).

Regards,
Martin


From ptak@xassist.pha.jhu.edu  Fri Jul 20 05:54:46 2001
From: ptak@xassist.pha.jhu.edu (Andrew Ptak)
Date: Fri, 20 Jul 2001 00:54:46 -0400 (EDT)
Subject: [XML-SIG] xml for software configuration/parameters
Message-ID: <Pine.LNX.4.33.0107200046150.21651-100000@xassist.pha.jhu.edu>

Hello,
I am fairly new to xml but have been using Python extensively.  I am
working on a large python project and would like to use xml for storing
parameters, like user preferences.  I've looked around for a library to do
this but haven't found any yet.  What I'd like to see is a class that has
fields such as data value, type (for now, "string", "number", "boolean"
and "choice" for menus would work), and allowed range (or values for menu
choices) so that user input can be validated automatically.  Later
advanced features like dependencies (i.e., parameter X can take on certain
values if parameter Y is set to true, etc.) and some notion of grouping so
that a gui can be created automatically would be nice.  Does anybody know
of code out there that would work?
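
To make the request concrete, here is one possible shape for such a class, sketched with the xml.etree.ElementTree module of current Python versions. Everything in it is hypothetical: the `<param>` element, its `name`, `type`, `min`, `max` and `choices` attributes, and the `Parameter` and `load_parameters` names are illustrative choices, not an existing library.

```python
import xml.etree.ElementTree as ET

EXAMPLE = """
<parameters>
  <param name="title" type="string">Untitled</param>
  <param name="iterations" type="number" min="1" max="100">10</param>
  <param name="verbose" type="boolean">false</param>
  <param name="color" type="choice" choices="red green blue">red</param>
</parameters>
"""


class Parameter:
    """One configurable value plus the metadata needed to validate input."""

    def __init__(self, name, kind, default, minimum=None, maximum=None, choices=()):
        self.name, self.kind = name, kind
        self.minimum, self.maximum = minimum, maximum
        self.choices = tuple(choices)
        self.value = self.validate(default)

    def validate(self, raw):
        """Return raw converted to the parameter's type, or raise ValueError."""
        if self.kind == "string":
            return str(raw)
        if self.kind == "number":
            number = float(raw)
            if self.minimum is not None and number < self.minimum:
                raise ValueError("%s: %r is below the minimum" % (self.name, number))
            if self.maximum is not None and number > self.maximum:
                raise ValueError("%s: %r is above the maximum" % (self.name, number))
            return number
        if self.kind == "boolean":
            text = str(raw).strip().lower()
            if text in ("true", "false"):
                return text == "true"
            raise ValueError("%s: %r is not a boolean" % (self.name, raw))
        if self.kind == "choice":
            if raw not in self.choices:
                raise ValueError("%s: %r is not an allowed choice" % (self.name, raw))
            return raw
        raise ValueError("%s: unknown type %r" % (self.name, self.kind))

    def set(self, raw):
        """Validate user input and store it only if it is acceptable."""
        self.value = self.validate(raw)


def load_parameters(xml_text):
    """Build a name -> Parameter mapping from a <parameters> document."""
    parameters = {}
    for node in ET.fromstring(xml_text).iter("param"):
        low, high = node.get("min"), node.get("max")
        parameters[node.get("name")] = Parameter(
            node.get("name"),
            node.get("type"),
            node.text or "",
            minimum=float(low) if low is not None else None,
            maximum=float(high) if high is not None else None,
            choices=(node.get("choices") or "").split(),
        )
    return parameters
```

Dependencies between parameters and grouping for a generated GUI are left out of this sketch; they would need extra metadata on top of the per-parameter checks.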

Thanks,
Andy Ptak


From Alexandre.Fayolle@logilab.fr  Fri Jul 20 08:40:39 2001
From: Alexandre.Fayolle@logilab.fr (Alexandre Fayolle)
Date: Fri, 20 Jul 2001 09:40:39 +0200 (CEST)
Subject: [XML-SIG] Request For Clarification on packages
In-Reply-To: <200107191628.f6JGS2D06405@localhost.local>
Message-ID: <Pine.LNX.4.21.0107200935360.1054-100000@pisces.logilab.fr>

On Thu, 19 Jul 2001, Uche Ogbuji wrote:

> Thanks so much for doing this.  I'd like to check it in as 
> 
> doc/package-summary.txt
> 
> Any objections?

Nope, except that some packages are not mentioned:

 * xml.dom : 4DOM implementation
 * xml.dom.ext : extensions to DOM (Print/PrettyPrint, ReleaseNode,
StripXml/StripHtml)
 * xml.dom.html : 4DOM HTML DOM implementation
 * xml.marshal : marshallers for xmlrpc and wddx
 * xml.ns : values of various namespaces
 * xml.schema : trex implementation
 * xml.unicode : utilities for python 1.5.2 backward compatibility
 * xml.utils : date parser + ???
 * xml.xpath : xpath implementation (not released yet) 
 * xml.xslt : xslt implementation (not released yet)
 

Alexandre Fayolle
-- 
LOGILAB, Paris (France).
http://www.logilab.com   http://www.logilab.fr  http://www.logilab.org
Narval, the first software agent available as free software (GPL).


From noreply@sourceforge.net  Fri Jul 20 15:45:55 2001
From: noreply@sourceforge.net (noreply@sourceforge.net)
Date: Fri, 20 Jul 2001 07:45:55 -0700
Subject: [XML-SIG] [ pyxml-Bugs-443099 ] Duplication of attributes
Message-ID: <E15NbXP-0001Cu-00@usw-sf-web3.sourceforge.net>

Bugs item #443099, was opened at 2001-07-20 07:45
You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=106473&aid=443099&group_id=6473

Category: DOM
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Davide Alberani (alberanid)
Assigned to: Nobody/Anonymous (nobody)
Summary: Duplication of attributes

Initial Comment:
Honestly I don't know if this is a bug; maybe it's a normal
side effect of using iterators...

The PyXML version I'm using is 0.6.5 in Python 2.0.

The attached code produces an XML with a duplicated value
for an attribute (look at 'attB').


----------------------------------------------------------------------

You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=106473&aid=443099&group_id=6473


From lpc@racemi.com  Fri Jul 20 15:54:00 2001
From: lpc@racemi.com (Luis P Caamano)
Date: Fri, 20 Jul 2001 10:54:00 -0400
Subject: [XML-SIG] Installing to split prefix/exec-prefix
In-Reply-To: <E15NGEb-0006td-00@mail.python.org>
Message-ID: <AHEBKIMJLGAIBJCOANKPCEALCBAA.lpc@racemi.com>

** This is a resend.  It seems the first message didn't
make it to the list.  I apologize if it's a duplicate **

I have a python installation on an NFS server that
supports both Linux and BSD systems.  When configuring
python, I set prefix to /python/python_common and
exec-prefix to /python/python_<OS> where <OS> is
either Linux or FreeBSD depending on where I run
configure, make, and make install.

It works great!

Next was installing PyXML.  After running the
appropriate python setup.py [build/install], I noticed
that PyXML stuff always goes to the exec_prefix
directory, including .py files.  I expected those
.py files to go to python_common and not the
OS specific directory.

In other words, I expected .py files to go to

python_common/lib/python2.1/site-packages/_xmlplus

and exec_prefix related stuff to

python_<OS>/lib/python2.1/site-packages/_xmlplus

or better, to

python_<OS>/lib/python2.1/lib-dynload

Is this the way it's supposed to be?
Is this a distutils or PyXML problem?
Should I do things differently?

Thanks in advance for your reply.

----------------------------------
Luis P. Caamano | Racemi, Inc.
lpc@racemi.com  | Atlanta, GA, USA
----------------------------------


From lannert@python.net  Fri Jul 20 16:03:03 2001
From: lannert@python.net (Detlef Lannert)
Date: Fri, 20 Jul 2001 17:03:03 +0200
Subject: [XML-SIG] A "tolerant" parser for structure-challenged HTML files
Message-ID: <20010720170303.B22663@det.rz.uni-duesseldorf.de>

A couple of weeks ago I was faced with the problem of processing a few
web pages which were generated by Microsoft Word (and post-processed
by some other structure-pessimizing program).  Among the various Python
*ML parsers I didn't find any that could retrieve the "intended document
structure" (like most browsers can) and that didn't choke on the input.

Therefore I wrote a "TolerantParser" class, based on sgmllib's parser,
which tries to understand input like, for example,

    <B><FONT SIZE=2><P>- 34/2001 -</B></FONT>

assuming here that </B> implies </P></FONT> and ignoring the following
</FONT>.  Although I didn't deal with every imaginable nonsense, this
worked for me; in a derived class I generate a minidom Document and
add the "accepted" HTML nodes to it.  Then I can use dom methods to
extract the data that I actually need.
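
The closing rule described above can be sketched as follows. This is not the code from tweak_html.py: it is a simplified illustration for a current Python that builds on html.parser.HTMLParser rather than sgmllib, and the `TolerantDomBuilder` class, the `parse_tolerantly` function, the wrapping `root` element and the `VOID_ELEMENTS` set are all hypothetical.

```python
from html.parser import HTMLParser
from xml.dom.minidom import Document

# Elements that never have content, so they are not kept on the open stack.
VOID_ELEMENTS = {"br", "hr", "img", "input", "link", "meta"}


class TolerantDomBuilder(HTMLParser):
    """Builds a minidom tree while repairing badly nested end tags."""

    def __init__(self):
        HTMLParser.__init__(self, convert_charrefs=True)
        self.document = Document()
        # Hypothetical wrapper element so top-level text has a parent.
        root = self.document.createElement("root")
        self.document.appendChild(root)
        self._open = [root]

    def handle_starttag(self, tag, attrs):
        element = self.document.createElement(tag)
        for name, value in attrs:
            element.setAttribute(name, value or "")
        self._open[-1].appendChild(element)
        if tag not in VOID_ELEMENTS:
            self._open.append(element)

    def handle_endtag(self, tag):
        if tag not in [node.tagName for node in self._open[1:]]:
            return  # stray end tag such as the trailing </FONT>: ignore it
        # Closing an outer element implicitly closes everything inside it.
        while self._open.pop().tagName != tag:
            pass

    def handle_data(self, data):
        self._open[-1].appendChild(self.document.createTextNode(data))


def parse_tolerantly(markup):
    """Return a minidom Document for possibly mis-nested HTML markup."""
    builder = TolerantDomBuilder()
    builder.feed(markup)
    builder.close()
    return builder.document
```

HTMLParser lower-cases tag and attribute names, so the resulting tree uses `b`, `font` and `p` even when the input is written in upper case.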

Since I haven't seen anyone else doing this so far, I'd like to make
these classes publicly available
(<http://starship.python.net/~lannert/tweak_html.py>) and to solicit
your comments.  If anything like (or, probably, better than) this
exists somewhere, please let me know; I'd also love to hear any
criticism or suggestions.

  Detlef


From rsalz@zolera.com  Fri Jul 20 16:22:50 2001
From: rsalz@zolera.com (Rich Salz)
Date: Fri, 20 Jul 2001 11:22:50 -0400
Subject: [XML-SIG] A "tolerant" parser for structure-challenged HTML files
References: <20010720170303.B22663@det.rz.uni-duesseldorf.de>
Message-ID: <3B584CCA.310714D8@zolera.com>

Detlef Lannert wrote:
> 
> A couple of weeks ago I was faced with the problem of processing a few
> web pages which were generated by Microsoft Word (and post-processed

You might want to look at the "microsoft demoroniser" :)
	http://www.fourmilab.ch/webtools/demoroniser/

-- 
Zolera Systems, Your Key to Online Integrity
Securing Web services: XML, SOAP, Signatures, Encryption
http://www.zolera.com


From Alexandre.Fayolle@logilab.fr  Fri Jul 20 17:09:13 2001
From: Alexandre.Fayolle@logilab.fr (Alexandre Fayolle)
Date: Fri, 20 Jul 2001 18:09:13 +0200 (CEST)
Subject: [XML-SIG] A "tolerant" parser for structure-challenged HTML
 files
In-Reply-To: <3B584CCA.310714D8@zolera.com>
Message-ID: <Pine.LNX.4.21.0107201806440.3451-100000@pisces.logilab.fr>

On Fri, 20 Jul 2001, Rich Salz wrote:

> Detlef Lannert wrote:
> > 
> > A couple of weeks ago I was faced with the problem of processing a few
> > web pages which were generated by Microsoft Word (and post-processed
> 
> You might want to look at the "microsoft demoroniser" :)
> 	http://www.fourmilab.ch/webtools/demoroniser/

You can also use Tidy which has a special mode for MS Word files. 
http://www.w3.org/People/Raggett/tidy/

Alexandre Fayolle
-- 
LOGILAB, Paris (France).
http://www.logilab.com   http://www.logilab.fr  http://www.logilab.org
Narval, the first software agent available as free software (GPL).


From Alexandre.Fayolle@logilab.fr  Fri Jul 20 19:04:06 2001
From: Alexandre.Fayolle@logilab.fr (Alexandre Fayolle)
Date: Fri, 20 Jul 2001 20:04:06 +0200 (CEST)
Subject: [XML-SIG] Sgmlop SAX 2 parser
Message-ID: <Pine.LNX.4.21.0107201959180.3530-200000@pisces.logilab.fr>


---1463793919-1301454201-995652246=:3530
Content-Type: TEXT/PLAIN; charset=US-ASCII

Hello,

Here's a first version of my attempt at providing a SAX2 parser for
Sgmlop. It still features some debugging prints. I'd be very grateful if
you could scrutinize it hard and tell me what you think of it. I'll be
back online on Monday.

from xml.sax.drivers2 import drv_sgmlop
from xml.dom.ext.reader.Sax2 import Reader
p = drv_sgmlop.SaxHtmlParser()
r = Reader(parser=p)
d = r.fromUri('http://www.slashdot.org/')
#debugging output skipped...
from xml.dom.ext import PrettyPrint
PrettyPrint(d)

Cheers,

Alexandre Fayolle
-- 
LOGILAB, Paris (France).
http://www.logilab.com   http://www.logilab.fr  http://www.logilab.org
Narval, the first software agent available as free software (GPL).

---1463793919-1301454201-995652246=:3530
Content-Type: TEXT/PLAIN; charset=US-ASCII; name="drv_sgmlop.py"
Content-Transfer-Encoding: BASE64
Content-ID: <Pine.LNX.4.21.0107202004060.3530@pisces.logilab.fr>
Content-Description: 
Content-Disposition: attachment; filename="drv_sgmlop.py"

IiIiDQpTQVgyIGRyaXZlciBmb3IgdGhlIHNnbWxvcCBwYXJzZXIuDQoNCiRJ
ZCQNCiIiIg0KDQp2ZXJzaW9uPSIwLjEiDQoNCmZyb20geG1sLnBhcnNlcnMu
c2dtbGxpYiBpbXBvcnQgU0dNTFBhcnNlcg0KZnJvbSB4bWwuc2F4IGltcG9y
dCBzYXhsaWINCmZyb20geG1sLnNheC54bWxyZWFkZXIgaW1wb3J0IEF0dHJp
YnV0ZXNJbXBsLFhNTFJlYWRlcg0KZnJvbSB4bWwuc2F4LnNheHV0aWxzIGlt
cG9ydCBDb250ZW50R2VuZXJhdG9yLCBwcmVwYXJlX2lucHV0X3NvdXJjZQ0K
DQoNCnRyeToNCiAgICBpbXBvcnQgY29kZWNzDQogICAgZGVmIHRvX3htbF9z
dHJpbmcoc3RyLGVuY29kaW5nKToNCiAgICAgICAgdHJ5Og0KICAgICAgICAg
ICAgZGVjb2RlciA9IGNvZGVjcy5sb29rdXAoZW5jb2RpbmcpWzFdDQogICAg
ICAgICAgICBlbmNvZGVyID0gY29kZWNzLmxvb2t1cCgndXRmLTgnKVswXQ0K
ICAgICAgICAgICAgcmV0dXJuIGVuY29kZXIoZGVjb2RlcihzdHIpWzBdKVsw
XQ0KICAgICAgICBleGNlcHQgTG9va3VwRXJyb3I6DQogICAgICAgICAgICBy
ZXR1cm4gc3RyDQpleGNlcHQgSW1wb3J0RXJyb3I6DQogICAgZnJvbSB4bWwu
dW5pY29kZS5pc284ODU5IGltcG9ydCB3c3RyaW5nDQogICAgZGVmIHRvX3ht
bF9zdHJpbmcoc3RyLGVuY29kaW5nKToNCiAgICAgICAgaWYgdXBwZXIoc2Vs
Zi5fZW5jb2RpbmcpID09ICdVVEYtOCc6DQogICAgICAgICAgICByZXR1cm4g
c3RyDQogICAgICAgIGVsc2U6DQogICAgICAgICAgICByZXR1cm4gd3N0cmlu
Zy5kZWNvZGUoZW5jb2Rpbmcsc3RyKS51dGY4KCkNCiAgICAgICAgDQoNCg0K
Y2xhc3MgU2F4UGFyc2VyKFNHTUxQYXJzZXIsWE1MUmVhZGVyKToNCiAgICAi
IiIgSW1wbGVtZW50cyBJbmNyZW1lbnRhbFJlYWRlciAiIiINCg0KICAgIGRl
ZiBfX2luaXRfXyhzZWxmLGJ1ZnNpemUgPSA2NTUzNixlbmNvZGluZz0nVVRG
LTgnKToNCiAgICAgICAgWE1MUmVhZGVyLl9faW5pdF9fKHNlbGYpDQogICAg
ICAgIFNHTUxQYXJzZXIuX19pbml0X18oc2VsZikNCiAgICAgICAgc2VsZi5w
YXJzZXIgPSBTR01MUGFyc2VyKCkNCiAgICAgICAgc2VsZi5fYnVmc2l6ZSA9
IGJ1ZnNpemUNCiAgICAgICAgc2VsZi5fbGV4aWNhbF9oYW5kbGVyID0gTm9u
ZQ0KICAgICAgICBzZWxmLl9lbmNvZGluZyA9IGVuY29kaW5nDQogICAgICAg
IA0KICAgIGRlZiBwYXJzZShzZWxmLCBzb3VyY2UpOg0KICAgICAgICBzb3Vy
Y2UgPSBwcmVwYXJlX2lucHV0X3NvdXJjZShzb3VyY2UpDQoNCiAgICAgICAg
c2VsZi5wcmVwYXJlUGFyc2VyKHNvdXJjZSkNCiAgICAgICAgZmlsZSA9IHNv
dXJjZS5nZXRCeXRlU3RyZWFtKCkNCiAgICAgICAgYnVmZmVyID0gZmlsZS5y
ZWFkKHNlbGYuX2J1ZnNpemUpDQogICAgICAgIHdoaWxlIGJ1ZmZlciAhPSAi
IjoNCiAgICAgICAgICAgIHNlbGYuZmVlZChidWZmZXIpDQogICAgICAgICAg
ICBidWZmZXIgPSBmaWxlLnJlYWQoc2VsZi5fYnVmc2l6ZSkNCiAgICAgICAg
c2VsZi5jbG9zZSgpDQoNCiAgICBkZWYgcHJlcGFyZVBhcnNlcihzZWxmLCBz
b3VyY2UpOg0KICAgICAgICAiIiJUaGlzIG1ldGhvZCBpcyBjYWxsZWQgYnkg
dGhlIHBhcnNlIGltcGxlbWVudGF0aW9uIHRvIGFsbG93DQogICAgICAgIHRo
ZSBTQVggMi4wIGRyaXZlciB0byBwcmVwYXJlIGl0c2VsZiBmb3IgcGFyc2lu
Zy4iIiINCiAgICAgICAgc2VsZi5fY29udF9oYW5kbGVyLnN0YXJ0RG9jdW1l
bnQoKQ0KICAgICAgICANCiAgICBkZWYgY2xvc2Uoc2VsZik6DQogICAgICAg
ICIiIlRoaXMgbWV0aG9kIGlzIGNhbGxlZCB3aGVuIHRoZSBlbnRpcmUgWE1M
IGRvY3VtZW50IGhhcyBiZWVuDQogICAgICAgIHBhc3NlZCB0byB0aGUgcGFy
c2VyIHRocm91Z2ggdGhlIGZlZWQgbWV0aG9kLCB0byBub3RpZnkgdGhlDQog
ICAgICAgIHBhcnNlciB0aGF0IHRoZXJlIGFyZSBubyBtb3JlIGRhdGEuIFRo
aXMgYWxsb3dzIHRoZSBwYXJzZXIgdG8NCiAgICAgICAgZG8gdGhlIGZpbmFs
IGNoZWNrcyBvbiB0aGUgZG9jdW1lbnQgYW5kIGVtcHR5IHRoZSBpbnRlcm5h
bA0KICAgICAgICBkYXRhIGJ1ZmZlci4NCg0KICAgICAgICBUaGUgcGFyc2Vy
IHdpbGwgbm90IGJlIHJlYWR5IHRvIHBhcnNlIGFub3RoZXIgZG9jdW1lbnQg
dW50aWwNCiAgICAgICAgdGhlIHJlc2V0IG1ldGhvZCBoYXMgYmVlbiBjYWxs
ZWQuDQoNCiAgICAgICAgY2xvc2UgbWF5IHJhaXNlIFNBWEV4Y2VwdGlvbi4i
IiINCiAgICAgICAgU0dNTFBhcnNlci5jbG9zZShzZWxmKQ0KICAgICAgICBz
ZWxmLl9jb250X2hhbmRsZXIuZW5kRG9jdW1lbnQoKSAgICAgICAgDQoNCiAg
ICBkZWYgX21ha2VfYXR0cl9kaWN0KHNlbGYsYXR0cl9saXN0KToNCiAgICAg
ICAgZCA9IHt9DQogICAgICAgIGN2cnQgPSBsYW1iZGEgc3RyLGU9c2VsZi5f
ZW5jb2Rpbmc6dG9feG1sX3N0cmluZyhzdHIsZSkNCiAgICAgICAgZm9yIChh
LGIpIGluIGF0dHJfbGlzdDoNCiAgICAgICAgICAgIGRbY3ZydChhKV09Y3Zy
dChiKQ0KICAgICAgICByZXR1cm4gZA0KICAgIA0KICAgIGRlZiB1bmtub3du
X3N0YXJ0dGFnKHNlbGYsdGFnLGF0dHJzKToNCiAgICAgICAgc2VsZi5fY29u
dF9oYW5kbGVyLnN0YXJ0RWxlbWVudCh0b194bWxfc3RyaW5nKHRhZyxzZWxm
Ll9lbmNvZGluZyksDQogICAgICAgICAgICAgICAgICAgICAgICAgICAgICAg
ICAgICAgICAgQXR0cmlidXRlc0ltcGwoc2VsZi5fbWFrZV9hdHRyX2RpY3Qo
YXR0cnMpKSkNCg0KICAgIGRlZiB1bmtub3duX2VuZHRhZyhzZWxmLHRhZyk6
DQogICAgICAgIHNlbGYuX2NvbnRfaGFuZGxlci5lbmRFbGVtZW50KHRvX3ht
bF9zdHJpbmcodGFnLHNlbGYuX2VuY29kaW5nKSkNCg0KICAgIGRlZiBoYW5k
bGVfZGF0YShzZWxmLGRhdGEpOg0KICAgICAgICBzZWxmLl9jb250X2hhbmRs
ZXIuY2hhcmFjdGVycyh0b194bWxfc3RyaW5nKGRhdGEsc2VsZi5fZW5jb2Rp
bmcpKQ0KDQogICAgZGVmIGhhbmRsZV9jb21tZW50KHNlbGYsZGF0YSk6DQog
ICAgICAgIGlmIHNlbGYuX2xleGljYWxfaGFuZGxlciBpcyBub3QgTm9uZToN
CiAgICAgICAgICAgIHNlbGYuX2xleGljYWxfaGFuZGxlci5jb21tZW50KHRv
X3htbF9zdHJpbmcoZGF0YSxzZWxmLl9lbmNvZGluZykpDQoNCiAgICBkZWYg
c2V0X3Byb3BlcnR5KHNlbGYsbmFtZSx2YWx1ZSk6DQogICAgICAgIGlmIG5h
bWUgPT0gaGFuZGxlci5wcm9wZXJ0eV9sZXhpY2FsX2hhbmRsZXI6DQogICAg
ICAgICAgICBzZWxmLl9sZXhpY2FsX2hhbmRsZXIgPSB2YWx1ZQ0KICAgICAg
ICBlbHNlOg0KICAgICAgICAgICAgcmFpc2UgU0FYTm90UmVjb2duaXplZEV4
Y2VwdGlvbigiUHJvcGVydHkgJyVzJyBub3QgcmVjb2duaXplZCIgJSBuYW1l
KQ0KICAgIGRlZiBnZXRQcm9wZXJ0eShzZWxmLCBuYW1lKToNCiAgICAgICAg
aWYgbmFtZSA9PSBoYW5kbGVyLnByb3BlcnR5X2xleGljYWxfaGFuZGxlcjoN
CiAgICAgICAgICAgIHJldHVybiBzZWxmLl9sZXhpY2FsX2hhbmRsZXINCiAg
ICAgICAgcmFpc2UgU0FYTm90UmVjb2duaXplZEV4Y2VwdGlvbigiUHJvcGVy
dHkgJyVzJyBub3QgcmVjb2duaXplZCIgJSBuYW1lKQ0KDQojIyAgICBkZWYg
Z2V0RmVhdHVyZShzZWxmLCBuYW1lKToNCiMjICAgICAgICBpZiBuYW1lID09
IGhhbmRsZXIuZmVhdHVyZV9uYW1lc3BhY2VzOg0KIyMgICAgICAgICAgICBy
ZXR1cm4gc2VsZi5fbmFtZXNwYWNlcw0KIyMgICAgICAgIHJhaXNlIFNBWE5v
dFJlY29nbml6ZWRFeGNlcHRpb24oIkZlYXR1cmUgJyVzJyBub3QgcmVjb2du
aXplZCIgJSBuYW1lKQ0KDQojIyAgICBkZWYgc2V0RmVhdHVyZShzZWxmLCBu
YW1lLCBzdGF0ZSk6DQojIyAgICAgICAgaWYgc2VsZi5fcGFyc2luZzoNCiMj
ICAgICAgICAgICAgcmFpc2UgU0FYTm90U3VwcG9ydGVkRXhjZXB0aW9uKCJD
YW5ub3Qgc2V0IGZlYXR1cmVzIHdoaWxlIHBhcnNpbmciKQ0KIyMgICAgICAg
IGlmIG5hbWUgPT0gaGFuZGxlci5mZWF0dXJlX25hbWVzcGFjZXM6DQojIyAg
ICAgICAgICAgIHNlbGYuX25hbWVzcGFjZXMgPSBzdGF0ZQ0KIyMgICAgICAg
IGVsc2U6DQojIyAgICAgICAgICAgIHJhaXNlIFNBWE5vdFJlY29nbml6ZWRF
eGNlcHRpb24oIkZlYXR1cmUgJyVzJyBub3QgcmVjb2duaXplZCIgJQ0KIyMg
ICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgIG5h
bWUpDQoNCiAgICAjIC0tLSBFWFBFUklNRU5UQUwgUFlUSE9OIFNBWCBFWFRF
TlNJT05TDQoNCiAgICBkZWYgZ2V0X3BhcnNlcl9uYW1lKHNlbGYpOg0KICAg
ICAgICByZXR1cm4gInNnbWxvcCINCg0KICAgIGRlZiBnZXRfcGFyc2VyX3Zl
cnNpb24oc2VsZik6DQogICAgICAgIHJldHVybiAiVW5rbm93biINCg0KICAg
IGRlZiBnZXRfZHJpdmVyX3ZlcnNpb24oc2VsZik6DQogICAgICAgIHJldHVy
biB2ZXJzaW9uDQogICAgDQogICAgZGVmIGlzX3ZhbGlkYXRpbmcoc2VsZik6
DQogICAgICAgIHJldHVybiAwDQoNCiAgICBkZWYgaXNfZHRkX3JlYWRpbmco
c2VsZik6DQogICAgICAgIHJldHVybiAwDQoNCg0KZnJvbSB4bWwuZG9tLmh0
bWwgaW1wb3J0IEhUTUxfQ0hBUkFDVEVSX0VOVElUSUVTLCBIVE1MX0ZPUkJJ
RERFTl9FTkQsSFRNTF9PUFRfRU5ELEhUTUxfRFREDQpmcm9tIHN0cmluZyBp
bXBvcnQgc3RyaXAsdXBwZXINCg0KY2xhc3MgU2F4SHRtbFBhcnNlcihTYXhQ
YXJzZXIpOg0KDQogICAgZGVmIF9faW5pdF9fKHNlbGYsYnVmc2l6ZSA9IDY1
NTM2LGVuY29kaW5nPSdpc28tODg1OS0xJyk6DQogICAgICAgIFNheFBhcnNl
ci5fX2luaXRfXyhzZWxmLGJ1ZnNpemUsZW5jb2RpbmcpDQogICAgICAgIA0K
ICAgIGRlZiBmaW5pc2hfc3RhcnR0YWcoc2VsZiwgdGFnLCBhdHRycyk6DQog
ICAgICAgICIiInVzZXMgdGhlIEhUTUwgRFREIHRvIGF1dG9tYXRpY2FsbHkg
Z2VuZXJhdGUgZXZlbnRzDQogICAgICAgIGZvciBtaXNzaW5nIHRhZ3MiIiIN
Cg0KICAgICAgICBwcmludCAnc3RhcnQnLHRhZyxzZWxmLnN0YWNrDQogICAg
ICAgICMgZ3Vlc3Mgb21pdHRlZCBjbG9zZSB0YWdzDQogICAgICAgIHdoaWxl
IHNlbGYuc3RhY2sgYW5kIFwNCiAgICAgICAgICAgICAgdXBwZXIoc2VsZi5z
dGFja1stMV0pIGluIEhUTUxfT1BUX0VORCBhbmQgXA0KICAgICAgICAgICAg
ICB0YWcgbm90IGluIEhUTUxfRFRELmdldChzZWxmLnN0YWNrWy0xXSxbXSk6
DQogICAgICAgICAgICBwcmludCAnIyBmb3JjZSBlbmQnLHNlbGYuc3RhY2tb
LTFdDQogICAgICAgICAgICBzZWxmLnVua25vd25fZW5kdGFnKHNlbGYuc3Rh
Y2tbLTFdKQ0KICAgICAgICAgICAgZGVsIHNlbGYuc3RhY2tbLTFdDQoNCiAg
ICAgICAgaWYgc2VsZi5zdGFjayBhbmQgdGFnIG5vdCBpbiBIVE1MX0RURC5n
ZXQoc2VsZi5zdGFja1stMV0sW10pOg0KICAgICAgICAgICAgcHJpbnQgJyon
KjMwDQogICAgICAgICAgICBwcmludCAnV2FybmluZyA6IHRyeWluZyB0byBh
ZGQgJXMgYXMgYSBjaGlsZCBvZiAlcyclXA0KICAgICAgICAgICAgICAgICAg
KHRhZyxzZWxmLnN0YWNrWy0xXSkNCiAgICAgICAgDQogICAgICAgIHNlbGYu
dW5rbm93bl9zdGFydHRhZyh0YWcsYXR0cnMpDQogICAgICAgIGlmIHVwcGVy
KHRhZykgaW4gSFRNTF9GT1JCSURERU5fRU5EOg0KICAgICAgICAgICAgIyBj
bG9zZSBpbW1lZGlhdGx5IHRhZ3MgZm9yIHdoaWNoIHdlIHdvbid0IGdldCBh
bmQgZW5kDQogICAgICAgICAgICBwcmludCAnZW5kJyx0YWcNCiAgICAgICAg
ICAgIHNlbGYudW5rbm93bl9lbmR0YWcodGFnKQ0KICAgICAgICAgICAgcmV0
dXJuIDANCiAgICAgICAgZWxzZToNCiAgICAgICAgICAgIHNlbGYuc3RhY2su
YXBwZW5kKHRhZykNCiAgICAgICAgcmV0dXJuIDENCg0KICAgIGRlZiBmaW5p
c2hfZW5kdGFnKHNlbGYsIHRhZyk6DQogICAgICAgIHByaW50ICdlbmQnLHRh
ZyxzZWxmLnN0YWNrDQogICAgICAgIGlmIHRhZyBpbiBIVE1MX0ZPUkJJRERF
Tl9FTkQgOg0KICAgICAgICAgICAgIyBkbyBub3RoaW5nOiB3ZSd2ZSBhbHJl
YWR5IGNsb3NlZCBpdA0KICAgICAgICAgICAgcmV0dXJuDQogICAgICAgIGlm
IHRhZyBpbiBzZWxmLnN0YWNrOg0KICAgICAgICAgICAgd2hpbGUgc2VsZi5z
dGFjayBhbmQgc2VsZi5zdGFja1stMV0gIT0gdGFnOg0KICAgICAgICAgICAg
ICAgIHByaW50ICcjIGZvcmNlIGVuZCcsc2VsZi5zdGFja1stMV0NCiAgICAg
ICAgICAgICAgICBzZWxmLnVua25vd25fZW5kdGFnKHNlbGYuc3RhY2tbLTFd
KQ0KICAgICAgICAgICAgICAgIGRlbCBzZWxmLnN0YWNrWy0xXQ0KICAgICAg
ICAgICAgc2VsZi51bmtub3duX2VuZHRhZyh0YWcpDQogICAgICAgICAgICBk
ZWwgc2VsZi5zdGFja1stMV0NCiAgICAgICAgZWxzZToNCiAgICAgICAgICAg
IHByaW50ICcqJyozMA0KICAgICAgICAgICAgcHJpbnQgIldhcm5pbmc6IEkg
ZG9uJ3Qgc2VlIHdoZXJlIHRhZyAlcyB3YXMgb3BlbmVkIiV0YWcNCg0KDQog
ICAgZGVmIGhhbmRsZV9kYXRhKHNlbGYsZGF0YSk6DQogICAgICAgIGlmIHNl
bGYuc3RhY2s6DQogICAgICAgICAgICBpZiAnI1BDREFUQScgbm90IGluIEhU
TUxfRFRELmdldChzZWxmLnN0YWNrWy0xXSxbXSkgYW5kIG5vdCBzdHJpcChk
YXRhKToNCiAgICAgICAgICAgICAgICAjIHRoaXMgaXMgcHJvYmFibHkgaWdu
b3JhYmxlIHdoaXRlc3BhY2UNCiAgICAgICAgICAgICAgICBwcmludCAnd2hp
dGVTcGFjZScNCiAgICAgICAgICAgICAgICBzZWxmLl9jb250X2hhbmRsZXIu
aWdub3JhYmxlV2hpdGVzcGFjZShkYXRhKQ0KICAgICAgICAgICAgZWxzZToN
CiAgICAgICAgICAgICAgICBwcmludCAnZGF0YScsc3RyaXAoZGF0YSkNCiAg
ICAgICAgICAgICAgICBzZWxmLl9jb250X2hhbmRsZXIuY2hhcmFjdGVycyh0
b194bWxfc3RyaW5nKGRhdGEsc2VsZi5fZW5jb2RpbmcpKQ0KDQogICAgZGVm
IGNsb3NlKHNlbGYpOg0KICAgICAgICBwcmludCAnZW5kIGRvY3VtZW50Jyxz
ZWxmLnN0YWNrDQogICAgICAgIFNHTUxQYXJzZXIuY2xvc2Uoc2VsZikNCiAg
ICAgICAgc2VsZi5zdGFjay5yZXZlcnNlKCkNCiAgICAgICAgZm9yIHRhZyBp
biBzZWxmLnN0YWNrOg0KICAgICAgICAgICAgcHJpbnQgJyMgZm9yY2UgZW5k
Jyx0YWcNCiAgICAgICAgICAgIHNlbGYudW5rbm93bl9lbmR0YWcodGFnKQ0K
ICAgICAgICBzZWxmLnN0YWNrID0gW10NCiAgICAgICAgc2VsZi5fY29udF9o
YW5kbGVyLmVuZERvY3VtZW50KCkgICAgICAgIA0KDQoNCg0KDQojIC0tLS0N
Cg0KZGVmIGNyZWF0ZV9wYXJzZXIoKToNCiAgICByZXR1cm4gU2F4UGFyc2Vy
KCkNCg0KZGVmIGNyZWF0ZV9odG1sX3BhcnNlcigpOg0KICAgIHJldHVybiBT
YXhIdG1sUGFyc2VyKCkNCg==
---1463793919-1301454201-995652246=:3530--


From MFioritto@Tribune.com  Tue Jul 24 15:39:00 2001
From: MFioritto@Tribune.com (Fioritto, Mike)
Date: Tue, 24 Jul 2001 09:39:00 -0500
Subject: [XML-SIG] difficulty installing Windows version
Message-ID: <8F2B91951D04D411A5E700508B6D2FF70185FED9@tms-chi-exmb01.tms.trb>

I am having difficulties installing the Windows version of PyXML 0.6.5. When
I launch the .exe file it takes me to the licensing screen and when I hit
next it asks for the Python dir. I am not able to select the dir or enter a
pathname in order to go to the next step.
Any help would be appreciated.
Thanks,
Mike

Michael Fioritto
Executive Producer
Tribune Media Services - Multimedia Products & Services 
435 N Michigan Ave, Chicago, IL  60611
312-222-3032
Fax: 561-673-7711 


From rob@jam.rr.com  Tue Jul 24 15:52:54 2001
From: rob@jam.rr.com (Rob Andrews)
Date: Tue, 24 Jul 2001 09:52:54 -0500
Subject: [XML-SIG] difficulty installing Windows version
In-Reply-To: <8F2B91951D04D411A5E700508B6D2FF70185FED9@tms-chi-exmb01.tms.trb>
Message-ID: <NFBBKIELCLIEEMGGIGKDMEECCBAA.rob@jam.rr.com>

Does it allow you any other options at this point, such as *Next*? And does
it have a default install path that you wish to change, or none at all?
If you can provide the URL to download PyXML, I'll try to reproduce your
problem.

Oh, and which Windows are you using?

Rob

Your one-stop shop for newbie source code!
Useless Python: http://www.lowerstandard.com/python/


From rob@jam.rr.com  Tue Jul 24 16:03:03 2001
From: rob@jam.rr.com (Rob Andrews)
Date: Tue, 24 Jul 2001 10:03:03 -0500
Subject: [XML-SIG] difficulty installing Windows version
In-Reply-To: <NFBBKIELCLIEEMGGIGKDMEECCBAA.rob@jam.rr.com>
Message-ID: <NFBBKIELCLIEEMGGIGKDGEEDCBAA.rob@jam.rr.com>

I was able to install without incident. PyXML's installer saw in the
Registry where I had Python installed. Do you have the right PyXML file for
the version of Python you have installed?

Rob

Your one-stop shop for newbie source code!
Useless Python: http://www.lowerstandard.com/python/


From noreply@sourceforge.net  Tue Jul 24 23:43:48 2001
From: noreply@sourceforge.net (noreply@sourceforge.net)
Date: Tue, 24 Jul 2001 15:43:48 -0700
Subject: [XML-SIG] [ pyxml-Bugs-444289 ] Cygwin build fails.
Message-ID: <E15PAu4-00034e-00@usw-sf-web2.sourceforge.net>

Bugs item #444289, was opened at 2001-07-24 15:43
You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=106473&aid=444289&group_id=6473

Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: George K. Thiruvathukal (gkt)
Assigned to: Nobody/Anonymous (nobody)
Summary: Cygwin build fails.

Initial Comment:
See subject.


----------------------------------------------------------------------

You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=106473&aid=444289&group_id=6473


From martin@loewis.home.cs.tu-berlin.de  Thu Jul 26 07:45:02 2001
From: martin@loewis.home.cs.tu-berlin.de (Martin v. Loewis)
Date: Thu, 26 Jul 2001 08:45:02 +0200
Subject: [XML-SIG] Installing to split prefix/exec-prefix
In-Reply-To: <AHEBKIMJLGAIBJCOANKPCEALCBAA.lpc@racemi.com>
References: <AHEBKIMJLGAIBJCOANKPCEALCBAA.lpc@racemi.com>
Message-ID: <200107260645.f6Q6j2S01528@mira.informatik.hu-berlin.de>

> In other words, I expected .py files to go to
> 
> python_common/lib/python2.1/site-packages/_xmlplus
> 
> and exec_prefix related stuff to
> 
> python_<OS>/lib/python2.1/site-packages/_xmlplus
> 
> or better, to
> 
> python_<OS>/lib/python2.1/lib-dynload
> 
> Is this the way it's supposed to be?
> Is this a distutils or PyXML problem?
> Should I do things differently?

Yes, you should do things differently. First, distutils does not
support installing packages into two locations (exec_prefix/lib and
prefix/lib); it will always put them into install_lib (which will be
either platlib or purelib, depending on whether the package has
extension modules - which PyXML always has).
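
To see which directories are involved on a given installation, a small
sketch like the following can help. It assumes a modern Python with the
standard sysconfig module (which did not exist in Python 2.1); the helper
name install_dirs is made up for this illustration.

```python
import sys
import sysconfig


def install_dirs():
    """Return the two prefixes and the two site-packages locations."""
    paths = sysconfig.get_paths()
    return {
        "prefix": sys.prefix,            # platform-independent files
        "exec_prefix": sys.exec_prefix,  # platform-dependent files
        "purelib": paths["purelib"],     # pure-Python packages
        "platlib": paths["platlib"],     # packages with extension modules
    }


if __name__ == "__main__":
    for name, value in install_dirs().items():
        print("%-12s %s" % (name, value))
```

On an installation without a split prefix, purelib and platlib usually
point at the same directory.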

Even if you managed to install PyXML into two locations, it would not
work. This is a Python problem: If a package directory is found in
multiple locations, only the first location is used. Otherwise, which
__init__ would you execute? In turn, you would not find
xml.parsers.sgmlop, since that would live in
exec_prefix/site-packages/..., which would not be part of the package.
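
The effect can be reproduced with a throwaway package. This is only a
sketch, written for a modern Python 3 rather than Python 2.1; the package
name demo_pkg_split and both helper functions are invented for the
example. Two temporary directories stand in for prefix and exec_prefix,
and the submodule exists only in the second one.

```python
import importlib
import os
import sys
import tempfile


def build(root, files):
    """Create the given {relative path: source text} files under root."""
    for rel, body in files.items():
        path = os.path.join(root, rel)
        os.makedirs(os.path.dirname(path), exist_ok=True)
        with open(path, "w") as f:
            f.write(body)


def demo():
    """Return (which __init__ ran, whether the submodule was found)."""
    with tempfile.TemporaryDirectory() as pure, \
            tempfile.TemporaryDirectory() as plat:
        build(pure, {"demo_pkg_split/__init__.py": "WHERE = 'pure'\n"})
        build(plat, {"demo_pkg_split/__init__.py": "WHERE = 'plat'\n",
                     "demo_pkg_split/ext.py": "VALUE = 1\n"})
        sys.path[:0] = [pure, plat]
        try:
            importlib.invalidate_caches()
            pkg = importlib.import_module("demo_pkg_split")
            try:
                importlib.import_module("demo_pkg_split.ext")
                found = True
            except ImportError:
                # ext.py lives only in the second directory, which is
                # not part of the package's search path.
                found = False
            return pkg.WHERE, found
        finally:
            del sys.path[:2]
            for name in list(sys.modules):
                if name.startswith("demo_pkg_split"):
                    del sys.modules[name]


if __name__ == "__main__":
    print(demo())
```

Only the first directory's __init__ runs, and the submodule in the second
directory is not importable.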

It is possible to work around these limitations; if they are important
to you, you may consider providing a patch.

Regards,
Martin


From larsga@garshol.priv.no  Thu Jul 26 09:18:24 2001
From: larsga@garshol.priv.no (Lars Marius Garshol)
Date: 26 Jul 2001 10:18:24 +0200
Subject: [XML-SIG] Sgmlop SAX 2 parser
In-Reply-To: <Pine.LNX.4.21.0107201959180.3530-200000@pisces.logilab.fr>
References: <Pine.LNX.4.21.0107201959180.3530-200000@pisces.logilab.fr>
Message-ID: <m3ofq8umr3.fsf@lambda.garshol.priv.no>

Hi Alexandre,

* Alexandre Fayolle
| 
| Here's a first version of my attempt at providing a SAX2 parser for
| Sgmlop. It still features some debugging prints. I'd be very grateful if
| you could scrutinize it hard and tell me what you think of it. I'll be
| back online on Monday.

This looks reasonable to me. I haven't tested it, or looked at what it
does with encodings, but the general approach seems like it will work
just fine.

Some minor nits:

 - set_property should be setProperty

 - you don't need prepareParser, it's just there to make subclassing
   IncrementalParser easier, but you don't do that

 - I think you omit the startDocument() event if someone only uses
   the feed, close, and reset methods, without going via parse

 - the experimental Python extensions you can just remove, that is
   stuff from SAX 1.0
 
 - the SaxHtmlParser looks good, but it should get its own module so
   that it is accessible via xml.sax.make_parser
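
The first and third points can be sketched as follows. This is not the
posted driver: SketchReader is a hypothetical stand-in that forwards fed
data as character events instead of tokenizing it, and it is written
against the xml.sax package as shipped with current Python. It only shows
the SAX 2 method names and one way to make sure startDocument() is
emitted when the reader is driven through feed and close alone.

```python
from xml.sax import SAXNotRecognizedException, handler, xmlreader


class SketchReader(xmlreader.XMLReader):
    """Hypothetical reader showing SAX 2 naming and startDocument handling."""

    def __init__(self):
        xmlreader.XMLReader.__init__(self)
        self._lexical_handler = None
        self._started = False

    def setProperty(self, name, value):
        # SAX 2 spelling: setProperty, not set_property.
        if name == handler.property_lexical_handler:
            self._lexical_handler = value
        else:
            raise SAXNotRecognizedException(
                "Property '%s' not recognized" % name)

    def getProperty(self, name):
        if name == handler.property_lexical_handler:
            return self._lexical_handler
        raise SAXNotRecognizedException(
            "Property '%s' not recognized" % name)

    def _ensure_started(self):
        # Emit startDocument exactly once, whichever entry point is used.
        if not self._started:
            self._cont_handler.startDocument()
            self._started = True

    def feed(self, data):
        self._ensure_started()
        # Stand-in for real tokenizing: pass the data through as text.
        self._cont_handler.characters(data)

    def close(self):
        self._ensure_started()
        self._cont_handler.endDocument()
        self._started = False
```
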

BTW: I needed HTML parsing yesterday, and, forgetting Alexandre's
     contribution, I added drivers for sgmllib and htmllib to
     xml.sax.drivers2. So we should be well covered in terms of SGML
     and HTML parsing now.

--Lars M.


From Nicolas.Chauvat@logilab.fr  Thu Jul 26 10:32:24 2001
From: Nicolas.Chauvat@logilab.fr (Nicolas Chauvat)
Date: Thu, 26 Jul 2001 11:32:24 +0200 (CEST)
Subject: [XML-SIG] XML DTD for RPM spec and PO files
Message-ID: <Pine.LNX.4.21.0107261130370.14769-100000@aries.logilab.fr>

Hi List,

Would any of you know if there is an existing DTD for RPM .spec files and
for .po files?

Would any of you care to share your experience with man page generation
from a DocBook source?

Thanks in advance,

-- 
Nicolas Chauvat

http://www.logilab.com - "Mais où est donc Ornicar ?" - LOGILAB, Paris (France)


From fdrake@acm.org  Thu Jul 26 13:49:25 2001
From: fdrake@acm.org (Fred L. Drake, Jr.)
Date: Thu, 26 Jul 2001 08:49:25 -0400 (EDT)
Subject: [XML-SIG] XML DTD for RPM spec and PO files
In-Reply-To: <Pine.LNX.4.21.0107261130370.14769-100000@aries.logilab.fr>
References: <Pine.LNX.4.21.0107261130370.14769-100000@aries.logilab.fr>
Message-ID: <15200.4565.457829.950729@cj42289-a.reston1.va.home.com>

Nicolas Chauvat writes:
 > Would any of you know if there is an existing DTD for RPM .spec files and
 > for .po files ?

  Neither of these are XML-based languages, so I presume you're asking
for XML-based equivalents?  There might be something for .po files --
the Translation Memory eXchange specification is available at:

	http://www.lisa.org/tmx/tmx.htm

I'm not particularly knowledgeable about it; it may not be a good
match.
  I don't think I've heard anything about a .spec equivalent.


  -Fred

-- 
Fred L. Drake, Jr.  <fdrake at acm.org>
PythonLabs at Digital Creations


From Nicolas.Chauvat@logilab.fr  Thu Jul 26 14:44:54 2001
From: Nicolas.Chauvat@logilab.fr (Nicolas Chauvat)
Date: Thu, 26 Jul 2001 15:44:54 +0200 (CEST)
Subject: [XML-SIG] XML DTD for RPM spec and PO files
In-Reply-To: <15200.4565.457829.950729@cj42289-a.reston1.va.home.com>
Message-ID: <Pine.LNX.4.21.0107261534020.14769-100000@aries.logilab.fr>

>  > Would any of you know if there is an existing DTD for RPM .spec files and
>  > for .po files ?
>
>   Neither of these are XML-based languages, so I presume you're asking
> for XML-based equivalents?  There might be something for .po files --
> the Translation Memory eXchange specification is available at:

Actually I'm coordinating the French translation of the LDP and I'm
looking for existing XML DTDs that would map the structure of .po and
.spec files in order to generalize XML as an exchange, archive and
manipulation format for translations.

For a .po file, the idea would be to map

"Project-Id-Version: PyXML 0.6.5"
"PO-Revision-Date: 2001-07-21"

msgid "error"
msgstr "erreur"

msgid "OK"
msgstr "OK"

to something along the lines of

<po id='PyXML' version='0.6.5'>
  <msg>
   <id>error</id>
   <str>erreur</str>
  </msg>
  <msg>
   <id>OK</id>
   <str>OK</str>
  </msg>
</po>

And the same thing for .spec files.
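
A rough Python sketch of that mapping, for illustration only. The element
and attribute names are the ones proposed above; the function name
po_to_xml is made up, and the parser only understands single-line
msgid/msgstr entries and a Project-Id-Version line shaped like the sample
(no multi-line strings, escapes, comments or plural forms).

```python
import xml.etree.ElementTree as ET

SAMPLE_PO = '''\
"Project-Id-Version: PyXML 0.6.5"
"PO-Revision-Date: 2001-07-21"

msgid "error"
msgstr "erreur"

msgid "OK"
msgstr "OK"
'''


def po_to_xml(text):
    """Build a <po> element tree from a very simple .po text."""
    root = ET.Element("po")
    msg = None
    for line in text.splitlines():
        line = line.strip()
        if line.startswith('"Project-Id-Version:'):
            # "Project-Id-Version: PyXML 0.6.5" -> id, version
            name, version = line.strip('"').split(":", 1)[1].split()[:2]
            root.set("id", name)
            root.set("version", version)
        elif line.startswith("msgid "):
            msg = ET.SubElement(root, "msg")
            ET.SubElement(msg, "id").text = line[len("msgid "):].strip('"')
        elif line.startswith("msgstr ") and msg is not None:
            ET.SubElement(msg, "str").text = line[len("msgstr "):].strip('"')
    return root


if __name__ == "__main__":
    print(ET.tostring(po_to_xml(SAMPLE_PO), encoding="unicode"))
```
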

Then I can store all these files and their translations and use the same
tools (xmldiff and XSL transforms) to deal with new versions, detect parts
that changed, produce reports and even generate the original format (XML
-> po or XML -> spec).

-- 
Nicolas Chauvat

http://www.logilab.com - "Mais où est donc Ornicar ?" - LOGILAB, Paris (France)


From martin@loewis.home.cs.tu-berlin.de  Thu Jul 26 17:46:13 2001
From: martin@loewis.home.cs.tu-berlin.de (Martin v. Loewis)
Date: Thu, 26 Jul 2001 18:46:13 +0200
Subject: [XML-SIG] XML DTD for RPM spec and PO files
In-Reply-To: <Pine.LNX.4.21.0107261534020.14769-100000@aries.logilab.fr>
 (message from Nicolas Chauvat on Thu, 26 Jul 2001 15:44:54 +0200
 (CEST))
References: <Pine.LNX.4.21.0107261534020.14769-100000@aries.logilab.fr>
Message-ID: <200107261646.f6QGkDR00923@mira.informatik.hu-berlin.de>

> Actually I'm coordinating the french translation of the LDP and I'm
> looking for existing XML DTDs that would map the structure of .po and
> .spec files in order to generalize XML as an exchange, archive and
> manipulation format for translations.

What do you gain by doing that? For the existing formats, all kinds of
tools exist. For the XML equivalents, nothing exists.

> Then I can store all these files and there translation and use the same
> tools (xmldiff and XSL transforms) to deal with new versions, detect parts
> that changed, produce reports and even generate the original format (XML
> -> po or XML -> spec).

Why do you want to use, say, XSL, on a po file? The typical output
processing of such a file is into a binary .mo file, which is structured
for efficient access at run time. I very much doubt you can do msgfmt
in XSLT.  Likewise, to combine revisions of catalogs, I doubt any tool
would be as good as msgmerge, with support for fuzzy messages and all
that.

Regards,
Martin


From Nicolas.Chauvat@logilab.fr  Fri Jul 27 08:02:27 2001
From: Nicolas.Chauvat@logilab.fr (Nicolas Chauvat)
Date: Fri, 27 Jul 2001 09:02:27 +0200 (CEST)
Subject: [XML-SIG] XML DTD for RPM spec and PO files
In-Reply-To: <200107261646.f6QGkDR00923@mira.informatik.hu-berlin.de>
Message-ID: <Pine.LNX.4.21.0107270850580.14769-100000@aries.logilab.fr>

> What do you gain by doing that? For the existing formats, all kinds of
> tools exist. For the XML equivalents, nothing exists.

My idea is to use XML as a common format and reuse the XML tools to do the
same processing for different documents that I receive in different source
formats.

In French I'd call it a "format pivot". I suppose I could translate it to
"hub format".

I know that tools exist for other formats. But I'd like not to have to
install, use and know of as many tools as (format, processing) couples I
have to deal with.

> > Then I can store all these files and there translation and use the same
> > tools (xmldiff and XSL transforms) to deal with new versions, detect parts
> > that changed, produce reports and even generate the original format (XML
> > -> po or XML -> spec).
>
> Why do you want to use, say, XSL, on a po file? The typical output
> processing of such file is into a binary .mo file, which is structured
> for efficient access at run time. I very much doubt you can do msgfmt
> in XSLT.

That is not what I meant. My idea is to archive source documents in XML
format, do version diffing and checking with XML, maybe write the source
directly to XML, then turn these documents to the proper format when
needed.

For .po files, it would be something like :

XML-formatted .po -- XSL --> .po -- usual po generation tools --> whatever

For .spec files, it would be something like :

XML-formatted .spec -- XSL --> .spec -- RPM --> package

For man pages, it would be :

Docbook -- XSL --> man

or Docbook -- XSL --> man source -- groff (?) --> man

Disclaimer: I don't know much about source po and man usage and generation;
I probably used the wrong tool names above.
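
For the first pipeline, the "XML-formatted .po --> .po" step could look
roughly like this. The sketch uses plain Python rather than an XSL
transform, assumes the <po>/<msg>/<id>/<str> layout suggested earlier in
this thread, and writes the same simplified .po layout as that example
(a real catalog header looks different). The function name xml_to_po is
made up for the illustration.

```python
import xml.etree.ElementTree as ET

SAMPLE_XML = """\
<po id='PyXML' version='0.6.5'>
  <msg><id>error</id><str>erreur</str></msg>
  <msg><id>OK</id><str>OK</str></msg>
</po>
"""


def xml_to_po(xml_text):
    """Turn the hypothetical <po> document back into simplified .po text."""
    root = ET.fromstring(xml_text)
    lines = ['"Project-Id-Version: %s %s"'
             % (root.get("id"), root.get("version")), ""]
    for msg in root.findall("msg"):
        # No quoting or escaping is done here; real .po strings need it.
        lines.append('msgid "%s"' % msg.findtext("id", ""))
        lines.append('msgstr "%s"' % msg.findtext("str", ""))
        lines.append("")
    return "\n".join(lines)


if __name__ == "__main__":
    print(xml_to_po(SAMPLE_XML))
```

The resulting text would then go through the usual po tools.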

> Likewise, to combine revisions of catalogs, I doubt any tool
> would be as good as msgmerge, with support for fuzzy messages and all
> that.

Agreed. That's definitely an issue with .po files. I suppose one could
write an XSL extension that does the same, but is it worth it ?

-- 
Nicolas Chauvat

http://www.logilab.com - "Mais où est donc Ornicar ?" - LOGILAB, Paris (France)


From lpc@racemi.com  Fri Jul 27 13:57:20 2001
From: lpc@racemi.com (Luis P Caamano)
Date: Fri, 27 Jul 2001 08:57:20 -0400
Subject: [XML-SIG] Installing to split prefix/exec-prefix
In-Reply-To: <200107260645.f6Q6j2S01528@mira.informatik.hu-berlin.de>
Message-ID: <AHEBKIMJLGAIBJCOANKPEEBLCBAA.lpc@racemi.com>

> -----Original Message-----
> From: Martin v. Loewis [mailto:martin@loewis.home.cs.tu-berlin.de]
> Sent: Thursday, July 26, 2001 2:45 AM
> To: lpc@racemi.com
> Cc: xml-sig@python.org
> Subject: Re: [XML-SIG] Installing to split prefix/exec-prefix
> 
> 
> > In other words, I expected .py files to go to
> > 
> > python_common/lib/python2.1/site-packages/_xmlplus
> > 
> > and exec_prefix related stuff to
> > 
> > python_<OS>/lib/python2.1/site-packages/_xmlplus
> > 
> > or better, to
> > 
> > python_<OS>/lib/python2.1/lib-dynload
> > 
> > Is this the way it's supposed to be?
> > Is this a distutils or PyXML problem?
> > Should I do things differently?
> 
> Yes, you should do things differently. First, distutils does not
> support installing packages into two location (exec_prefix/lib and
> prefix/lib); it will always put them into install_lib (which will be
> either platlib or purelib, depending on whether the package has
> extension modules - which PyXML always has).

Guess it's time to send a distutils bug report then.

> 
> Even if you'd manage to install PyXML into two locations, it would not
> work. This is a Python problem: If a package directory is found in
> multiple location, only the first location is used. Otherwise, which
> __init__ would you execute? 

Understood, a python problem.

Pardon my ignorance, but would it be true to say that
.py[co] files should NEVER exist in exec_prefix?  If so,
then __init__.py would always be in prefix.  Also,
'unpure' packages would always be split between prefix
and exec_prefix, and thus, should be tested that way.

python bug?

> In turn, you would not find
> xml.parsers.sgmlop, since that would live in
> exec_prefix/site-packages/..., which would not be part of the package.

Understood.

> 
> It is possible to work around these limitations; if they are important
> to you, you may consider providing a patch.

Well, currently the workaround is to have packages like PyXML
(almost all the packages I've used) reside completely in exec_prefix.

After playing with split prefix/exec_prefix I've found that it
has not been tested much outside of python itself, which works
fine with split dirs.  My opinion is that split dirs just
don't work as advertised in practice.  Too bad.

> 
> Regards,
> Martin
> 

Thanks for your reply Martin.


----------------------------------
Luis P. Caamano 
lcaamano@mindspring.com
Atlanta, GA, USA
----------------------------------


From lannert@uni-duesseldorf.de  Fri Jul 27 14:15:22 2001
From: lannert@uni-duesseldorf.de (Detlef Lannert)
Date: Fri, 27 Jul 2001 15:15:22 +0200
Subject: [XML-SIG] Re: A "tolerant" parser for structure-challenged HTML files
In-Reply-To: <Pine.LNX.4.21.0107201806440.3451-100000@pisces.logilab.fr>; from Alexandre.Fayolle@logilab.fr on Fri, Jul 20, 2001 at 06:09:13PM +0200
References: <3B584CCA.310714D8@zolera.com> <Pine.LNX.4.21.0107201806440.3451-100000@pisces.logilab.fr>
Message-ID: <20010727151522.A10829@det.rz.uni-duesseldorf.de>

On Fri, Jul 20, 2001 at 06:09:13PM +0200, Alexandre Fayolle wrote:
> On Fri, 20 Jul 2001, Rich Salz wrote:
> > 
> > You might want to look at the "microsoft demoroniser" :)
> > 	http://www.fourmilab.ch/webtools/demoroniser/
> 
> You can also use Tidy which has a special mode for MS Word files. 
> http://www.w3.org/People/Raggett/tidy/

Many (albeit belated) thanks to Rich and Alexandre for these pointers.
I'll have a closer look into both of these tools (lying at the beach
during the next three weeks might get me in the mood for reading P**l
and C code ... ;).

  Detlef


From sorifu_info@ec-shock.com  Fri Jul 27 15:57:39 2001
From: sorifu_info@ec-shock.com (=?ISO-2022-JP?B?GyRCJWEhPCVrJSIlcyUxITwlSDt2TDM2SRsoQg==?=)
Date: Fri, 27 Jul 2001 23:57:39 +0900
Subject: [XML-SIG] =?ISO-2022-JP?B?GyRCPi5AdEZiM1UbKEIgGyRCO1k7fUlUO1k7fTZbNV4bKEI=?=
 =?ISO-2022-JP?B?GyRCJSIlcyUxITwlSBsoQg==?=
Message-ID: <20010727.2357380298.babaq@sorifu_info-ec-shock.com>

Koizumi Cabinet: Support or Oppose? Urgent Survey

We apologize for troubling you when you are busy, but please click
the URL below and take part in the survey.

http://211.9.37.210/koizumi/koizumi_an.asp?id=168905


From Nicolas.Chauvat@logilab.fr  Fri Jul 27 18:26:00 2001
From: Nicolas.Chauvat@logilab.fr (Nicolas Chauvat)
Date: Fri, 27 Jul 2001 19:26:00 +0200 (CEST)
Subject: [XML-SIG] ANN: xmldiff 0.1.1
Message-ID: <Pine.LNX.4.21.0107271921470.26845-100000@aries.logilab.fr>

Hi Folks,

Logilab today released the first beta of xmldiff. It's version 0.1.1 but
is fully functional. As you already guessed, xmldiff figures out the
differences between two XML trees in the same way that diff does it for
text files.

Homepage:	http://www.logilab.org/xmldiff/

More Info:	http://www.logilab.org/xmldiff/HELP.txt

Download:	ftp://ftp.logilab.org/pub/xmldiff/

It's still a bit slow, but it works and gives correct results. As usual
all comments and ideas are welcome.

Happy XML-diff'ing !

-- 
Nicolas Chauvat

http://www.logilab.com - "Mais où est donc Ornicar ?" - LOGILAB, Paris (France)


From martin@loewis.home.cs.tu-berlin.de  Fri Jul 27 23:45:28 2001
From: martin@loewis.home.cs.tu-berlin.de (Martin v. Loewis)
Date: Sat, 28 Jul 2001 00:45:28 +0200
Subject: [XML-SIG] Installing to split prefix/exec-prefix
In-Reply-To: <AHEBKIMJLGAIBJCOANKPEEBLCBAA.lpc@racemi.com>
References: <AHEBKIMJLGAIBJCOANKPEEBLCBAA.lpc@racemi.com>
Message-ID: <200107272245.f6RMjSI02115@mira.informatik.hu-berlin.de>

> Pardon my ignorance, but, would it be true to say that
> .py[co], files should NEVER exist in exec_prefix?  

I can't see why they should not exist in exec_prefix. It is true to
say that they never need to exist there.

> If so, then __init__.py would always be in prefix.  Also, 'unpure'
> packages would always be split between prefix and exec_prefix, and
> thus, should be tested that way.
>
> python bug?

No, it is documented behaviour. You can add additional directories to
a package by setting some package attribute (__path__?), but by
default, a package only lives in a single directory.
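
A minimal sketch of the `__path__` mechanism, assuming a hypothetical
package named `splitpkg`. The "prefix" and "exec_prefix" directories here
are just temporary directories created for the demonstration, not real
installation prefixes.

```python
import importlib
import os
import sys
import tempfile


def build_split_package(root):
    """Create a hypothetical package whose modules live in two directories."""
    pure_dir = os.path.join(root, "prefix", "splitpkg")
    plat_dir = os.path.join(root, "exec_prefix", "splitpkg")
    os.makedirs(pure_dir)
    os.makedirs(plat_dir)
    # Only the first directory has an __init__.py; it extends __path__ so
    # that submodules are also searched for in the second directory.
    with open(os.path.join(pure_dir, "__init__.py"), "w") as f:
        f.write("__path__.append(%r)\n" % plat_dir)
    with open(os.path.join(plat_dir, "platmod.py"), "w") as f:
        f.write("WHERE = 'exec_prefix'\n")
    return os.path.join(root, "prefix")


root = tempfile.mkdtemp()
sys.path.insert(0, build_split_package(root))

# The submodule is found even though it is not in the package's own directory.
platmod = importlib.import_module("splitpkg.platmod")
print(platmod.WHERE)
```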

> After playing with split prefix/exec_prefix I've found that it
> has not been tested much outside of python itself, which works
> fine with split dirs.  My opinion is that split dirs just
> doesn't work as advertised in practice.

Where did you see any advertisement for exec_prefix that made you
think PyXML should not install into it? Exact quote please, if
possible.

Regards,
Martin


From noreply@sourceforge.net  Sat Jul 28 04:44:35 2001
From: noreply@sourceforge.net (noreply@sourceforge.net)
Date: Fri, 27 Jul 2001 20:44:35 -0700
Subject: [XML-SIG] [ pyxml-Patches-445405 ] Cygwin Build Attempt
Message-ID: <E15QL1n-0001Or-00@usw-sf-web2.sourceforge.net>

Patches item #445405, was opened at 2001-07-27 20:44
You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=306473&aid=445405&group_id=6473

Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Garth T Kidd (gtk)
Assigned to: Nobody/Anonymous (nobody)
Summary: Cygwin Build Attempt

Initial Comment:
Trying to build PyXML from source under Cygwin and 
install it with the Cygwin build of Python is pretty 
frustrating. Following some clues I found via Google, 
it looks like use of the DL_EXPORT macro is pretty 
handy. The next problem was the linker complaining 
that it couldn't find the (static) PyBoolean_Type to 
export it, so I removed static. 

Now it builds. Whee! 

Lots of tests fail, but that also happens with the 
PyXML 0.6.5 Windows distribution kit, so I figure you 
know about some of them already.

Failed test results after applying the patch to either 
PyXML 0.6.5 or the current CVS contents: 

test test_howto crashed -- exceptions.TypeError : 
__init__() takes at least 2 arguments (1 given)
test test_marshal crashed -- exceptions.TypeError : 
__init__() takes at least 2 arguments (1 given)
test test_minidom failed -- Writing: 'Test Failed: ', 
expected: 'Passed testAA'
test test_sax crashed -- exceptions.TypeError : 
__init__() takes at least 2 arguments (1 given)
test test_saxdrivers crashed -- exceptions.TypeError : 
__init__() takes at least 2 arguments (1 given)


----------------------------------------------------------------------

You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=306473&aid=445405&group_id=6473


From fdrake@acm.org  Sat Jul 28 04:44:31 2001
From: fdrake@acm.org (Fred L. Drake, Jr.)
Date: Fri, 27 Jul 2001 23:44:31 -0400 (EDT)
Subject: [XML-SIG] Expat 1.95.2 released
Message-ID: <15202.13599.964445.917473@cj42289-a.reston1.va.home.com>

  In case anyone is interested, Expat 1.95.2 has been released, with
both a source archive for Unix users and a handy installer for Windows
victims (thanks to Tim Peters for getting me started!).  This release
fixes some small bugs and improves the portability of the build
process (and there is one for Windows this time).
  You can pick up the 1.95.2 release at:

	http://sourceforge.net/projects/expat/


  -Fred

-- 
Fred L. Drake, Jr.  <fdrake at acm.org>
PythonLabs at Zope Corporation


From noreply@sourceforge.net  Sat Jul 28 09:07:42 2001
From: noreply@sourceforge.net (noreply@sourceforge.net)
Date: Sat, 28 Jul 2001 01:07:42 -0700
Subject: [XML-SIG] [ pyxml-Patches-445441 ] Cygwin Fixes
Message-ID: <E15QP8Q-0000m1-00@usw-sf-web2.sourceforge.net>

Patches item #445441, was opened at 2001-07-28 01:07
You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=306473&aid=445441&group_id=6473

Category: 4Suite
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Garth T Kidd (gtk)
Assigned to: Nobody/Anonymous (nobody)
Summary: Cygwin Fixes

Initial Comment:
To be able to install 4Suite (and PyXML, and many 
other Python modules) under the Cygwin environment, 
DL_EXPORT must be rigorously applied to any non-static 
identifiers. 

This patch makes appropriate DL_EXPORT changes and 
removes 'static' from a definition of PyBoolean_Type 
so that 4Suite builds and installs under Cygwin. 

Apply the patch with -p1. 


----------------------------------------------------------------------

You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=306473&aid=445441&group_id=6473


From just@letterror.com  Sun Jul 29 10:48:08 2001
From: just@letterror.com (Just van Rossum)
Date: Sun, 29 Jul 2001 11:48:08 +0200
Subject: [XML-SIG] 4suite build troubles
Message-ID: <20010729114812-r01010700-9730901c-0910-010c@10.0.0.151>

I have some assorted troubles building 4suite for MacOS. The first thing I
noticed is that several C extensions use strdup(). Unfortunately the Metrowerks compiler
doesn't have strdup(). If I understand it correctly, strdup() is *not* ANSI C,
so for portability, using it is an error. I've worked around the problem for
now.

Then, when I build XPathc, I get the following link errors:

Link Error   : multiply-defined 'g_errorTraceback' (data)
Defined in XPath_wrap.c
Defined in XPathSwig.c

Link Error   : multiply-defined 'g_errorValue' (data)
Defined in XPath_wrap.c
Defined in XPathSwig.c

Link Error   : multiply-defined 'g_errorType' (data)
Defined in XPath_wrap.c
Defined in XPathSwig.c

Link Error   : multiply-defined 'g_prodNum' (data)
Defined in XPath_wrap.c
Defined in XPathSwig.c

Link Error   : multiply-defined 'g_errorOccured' (data)
Defined in XPath_wrap.c
Defined in XPathSwig.c

Link Error   : multiply-defined 'g_errorLocation' (data)
Defined in XPath_wrap.c
Defined in XPathSwig.c

Am I doing something wrong here?

On another note, even PyXML doesn't build out of the box with distutils on the
Mac. Seems to have to do with path names that should get converted to Mac paths.
Whether this is a bug in the Mac backend of distutils or not I don't know yet --
I'll look into it further.

Just


From dgoodger@bigfoot.com  Sun Jul 29 17:59:16 2001
From: dgoodger@bigfoot.com (David Goodger)
Date: Sun, 29 Jul 2001 12:59:16 -0400
Subject: [XML-SIG] Validation from Python code?
Message-ID: <B789B922.15315%dgoodger@bigfoot.com>

In the Docstring Processing System, there is a class library module [1]_
with one class for each element in the DTDs [2]_. The nodes (text & element
objects) have an asdom() method, which constructs an xml.dom.minidom tree. I
would like to implement a validate() method, which would verify that the
content of each element node adheres to the content models from the DTDs. I
could manually code each validate() method, but thought that there must be
some way to encode the content models in the classes such that they can be
automatically validated (without running an external parser). Basically I'm
looking for the XML/DOM equivalent of regular expression matching, something
that can answer the question: "given a pattern P, is DOM tree D valid?"

Does anybody know of ways to accomplish this validation? Pointers to
existing standards and/or implementations would be much appreciated. Any
Python implementations out there?
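
One way to picture the "regular expression over a DOM tree" idea is the
sketch below. It is only an illustration: the element names and content
models are hypothetical and are not taken from the DPS DTDs, and each
content model is written by hand as a regular expression over a
comma-terminated list of child names.

```python
import re
from xml.dom import minidom

# Hypothetical content models: each child contributes "name," to a string,
# and the regular expression must match that whole string.
CONTENT_MODELS = {
    "section": r"title,(paragraph,|section,)*",
    "title": r"(#text,)*",
    "paragraph": r"(#text,|emphasis,)*",
    "emphasis": r"(#text,)*",
}


def child_tokens(element):
    """List the child element names, with '#text' for non-blank text."""
    tokens = []
    for child in element.childNodes:
        if child.nodeType == child.ELEMENT_NODE:
            tokens.append(child.tagName)
        elif child.nodeType == child.TEXT_NODE and child.data.strip():
            tokens.append("#text")
    return tokens


def validate(element):
    """Check this element and all its descendants against the models."""
    model = CONTENT_MODELS.get(element.tagName)
    if model is None:
        return False
    sequence = "".join(token + "," for token in child_tokens(element))
    if re.fullmatch(model, sequence) is None:
        return False
    return all(
        validate(child)
        for child in element.childNodes
        if child.nodeType == child.ELEMENT_NODE
    )


good = minidom.parseString(
    "<section><title>T</title>"
    "<paragraph>a <emphasis>b</emphasis></paragraph></section>"
)
bad = minidom.parseString(
    "<section><paragraph>x</paragraph><title>T</title></section>"
)
print(validate(good.documentElement), validate(bad.documentElement))
```

Attributes and the derivation of the patterns from a DTD are not covered
here; the patterns would have to be maintained by hand.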

Thanks in advance.

David Goodger

.. _[1] http://docstring.sourceforge.net/dps/nodes.py

.. _[2] http://docstring.sourceforge.net/spec/gpdi.dtd,
   http://docstring.sourceforge.net/spec/ppdi.dtd,
   http://docstring.sourceforge.net/spec/soextblx.dtd

-- 
David Goodger    dgoodger@bigfoot.com    Open-source projects:
 - Python Docstring Processing System: http://docstring.sourceforge.net
 - reStructuredText: http://structuredtext.sourceforge.net
 - The Go Tools Project: http://gotools.sourceforge.net


From rsalz@zolera.com  Sun Jul 29 20:11:42 2001
From: rsalz@zolera.com (Rich Salz)
Date: Sun, 29 Jul 2001 15:11:42 -0400
Subject: [XML-SIG] Validation from Python code?
References: <B789B922.15315%dgoodger@bigfoot.com>
Message-ID: <3B645FEE.409F2CA0@zolera.com>

> Basically I'm
> looking for the XML/DOM equivalent of regular expression matching, something
> that can answer the question: "given a pattern P, is DOM tree D valid?"

Sounds like you want pytrex.
	/r$


From garth@deadlybloodyserious.com  Mon Jul 30 00:02:37 2001
From: garth@deadlybloodyserious.com (Garth T Kidd)
Date: Mon, 30 Jul 2001 09:02:37 +1000
Subject: [Docstring-develop] Re: [XML-SIG] Validation from Python code?
In-Reply-To: <3B645FEE.409F2CA0@zolera.com>
Message-ID: <NBBBIJGOIKKLHHFHILDNKEONJLAA.garth@deadlybloodyserious.com>

> > Basically I'm looking for the XML/DOM equivalent of regular
> > expression matching, something that can answer the question:
> > "given a pattern P, is DOM tree D valid?"
>
> Sounds like you want pytrex.

PyTREX looks like it can nicely validate the parser output (via
``.asdom().toxml()``), but doesn't seem to provide tools for
automatically generating the patterns from the DTD.

Can anyone suggest a way of filling the gap so that we don't have to
hand-maintain the tests in sync with the DTD?

Suspicions: if we switch to using an XML schema, perhaps we can pull
PyTREX patterns out of the schema using XSLT. Perhaps we can build the
schema from the PyTREX patterns. Perhaps we can establish some
middle-format from which we can derive both schema and patterns --
`permittedContents` attributes in ``nodes.py``, or something. It's a fun
problem to mull over, that's for sure.

I can understand David's reluctance to use an external parser if he can
possibly avoid it; it just took me Much Work to get PyXML and 4Suite to
build and install under Cygwin.

David, are you intending your tests to be run only during testing, or as
an optional "paranoid mode" for dps and restructuredtext where people
can have all of their output checked for validity versus the DTD? Both?

Regards,
Garth.


From lcaamano@mindspring.com  Mon Jul 30 00:20:03 2001
From: lcaamano@mindspring.com (Luis P Caamano)
Date: Sun, 29 Jul 2001 19:20:03 -0400
Subject: [XML-SIG] Installing to split prefix/exec-prefix
In-Reply-To: <200107272245.f6RMjSI02115@mira.informatik.hu-berlin.de>
Message-ID: <AHEBKIMJLGAIBJCOANKPAEBOCBAA.lcaamano@mindspring.com>

> -----Original Message-----
> From: Martin v. Loewis [mailto:martin@loewis.home.cs.tu-berlin.de]
> Sent: Friday, July 27, 2001 6:45 PM
> To: lpc@racemi.com
> Cc: xml-sig@python.org
> Subject: Re: [XML-SIG] Installing to split prefix/exec-prefix
> 
> 
> > After playing with split prefix/exec_prefix I've found that it
> > has not been tested much outside of python itself, which works
> > fine with split dirs.  My opinion is that split dirs just
> > doesn't work as advertised in practice.
> 
> Where did you see any advertisement for exec_prefix that made you
> think PyXML should not install into it? Exact quote please, if
> possible.

Not PyXML specifically.  What made me think that was this quote from
the Python distribution README file:

***
This will install all platform-independent files in subdirectories of
the directory given with the --prefix option to configure or to the
`prefix' Make variable (default /usr/local).  All binary and other
platform-specific files will be installed in subdirectories if the
directory given by --exec-prefix or the `exec_prefix' Make variable
(defaults to the --prefix directory) is given.
***

So, that works great with Python, but then when you install other
extensions/packages, things are not so neat ... for example ...
PyXML.  I've learned now that this is not a PyXML specific issue,
but regardless, "things don't work for everything as the quote
above led me to believe."  Granted, it doesn't say anything about
PyXML or any other extension/package but still, I just thought
that exec_prefix was for "binaries and platform specific files."
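
For reference, a small sketch that prints where an interpreter expects
pure-Python and platform-specific packages to go. It assumes a current
Python with the standard `sysconfig` module, which did not exist in
Python 2.1; the actual directories depend on how Python was configured.

```python
import sys
import sysconfig

# Installation directories as computed for the running interpreter.
paths = sysconfig.get_paths()

print("prefix:     ", sys.prefix)
print("exec_prefix:", sys.exec_prefix)
print("purelib:    ", paths["purelib"])   # pure-Python packages
print("platlib:    ", paths["platlib"])   # packages with extension modules
```

When prefix and exec_prefix are the same directory, purelib and platlib
typically coincide as well.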

Regards,
Luis

----------------------------------
Luis P. Caamano 
lcaamano@mindspring.com
Atlanta, GA, USA
----------------------------------


From lcaamano@mindspring.com  Mon Jul 30 00:36:36 2001
From: lcaamano@mindspring.com (Luis P Caamano)
Date: Sun, 29 Jul 2001 19:36:36 -0400
Subject: [XML-SIG] Installing to split prefix/exec-prefix
In-Reply-To: <200107272245.f6RMjSI02115@mira.informatik.hu-berlin.de>
Message-ID: <AHEBKIMJLGAIBJCOANKPEEBOCBAA.lcaamano@mindspring.com>

> -----Original Message-----
> From: Martin v. Loewis [mailto:martin@loewis.home.cs.tu-berlin.de]
> Sent: Friday, July 27, 2001 6:45 PM
> To: lpc@racemi.com
> Cc: xml-sig@python.org
> Subject: Re: [XML-SIG] Installing to split prefix/exec-prefix
> 
> 
> > Pardon my ignorance, but, would it be true to say that
> > .py[co], files should NEVER exist in exec_prefix?  
> 
> I can't see why they should not exist in exec_prefix.

Because you end up with multiple copies of the files on
every platform-specific directory, just like I have them
now.  :-(

----------------------------------
Luis P. Caamano
lcaamano@mindspring.com
Atlanta, GA, USA
----------------------------------


From fdrake@acm.org  Mon Jul 30 06:38:49 2001
From: fdrake@acm.org (Fred L. Drake, Jr.)
Date: Mon, 30 Jul 2001 01:38:49 -0400 (EDT)
Subject: [XML-SIG] Installing to split prefix/exec-prefix
In-Reply-To: <AHEBKIMJLGAIBJCOANKPEEBOCBAA.lcaamano@mindspring.com>
References: <200107272245.f6RMjSI02115@mira.informatik.hu-berlin.de>
 <AHEBKIMJLGAIBJCOANKPEEBOCBAA.lcaamano@mindspring.com>
Message-ID: <15204.62185.296425.784320@cj42289-a.reston1.va.home.com>

Luis P Caamano writes:
 > Because you end up with multiple copies of the files on
 > every platform-specific directory, just like I have them

  While this is a bit of a nuisance, it should not be significant.
The extra copies should not include configuration files; those don't
belong in the source area.  (They should go in /usr/share/lib/ or some
place like that.)
  Using exec_prefix for both allows different versions of a package to
be installed for different platforms.  This can reasonably be expected
to be necessary if some of the platforms become unsupported by newer
versions of the package, but users need to use the newer versions
where possible.


  -Fred

-- 
Fred L. Drake, Jr.  <fdrake at acm.org>
PythonLabs at Zope Corporation


From martin@loewis.home.cs.tu-berlin.de  Mon Jul 30 05:42:48 2001
From: martin@loewis.home.cs.tu-berlin.de (Martin v. Loewis)
Date: Mon, 30 Jul 2001 06:42:48 +0200
Subject: [XML-SIG] Installing to split prefix/exec-prefix
In-Reply-To: <AHEBKIMJLGAIBJCOANKPEEBOCBAA.lcaamano@mindspring.com>
References: <AHEBKIMJLGAIBJCOANKPEEBOCBAA.lcaamano@mindspring.com>
Message-ID: <200107300442.f6U4gmR00839@mira.informatik.hu-berlin.de>

> Because you end up with multiple copies of the files on
> every platform-specific directory, just like I have them
> now.  :-(

Out of curiosity: How many different platforms do you use?

Regards,
Martin


From rsalz@zolera.com  Mon Jul 30 17:11:00 2001
From: rsalz@zolera.com (Rich Salz)
Date: Mon, 30 Jul 2001 12:11:00 -0400
Subject: [XML-SIG] Bug in dom/ext/reader/PyExpat.py
Message-ID: <3B658714.2A3455EA@zolera.com>

In startElement, around line 130:
                if (prefix or value):
                    self._namespaces[prefix] = attribs[curr_attrib_key]
                else:
                    del self._namespaces[prefix]

If nested nodes change the default namespace, this can raise an
exception.  I'm not sure which fix is better:
	if (prefix or prefix == '' or value):
or
	if (prefix != None or value):

comments?
-- 
Zolera Systems, Your Key to Online Integrity
Securing Web services: XML, SOAP, Dig-sig, Encryption
http://www.zolera.com


From rsalz@zolera.com  Mon Jul 30 17:57:28 2001
From: rsalz@zolera.com (Rich Salz)
Date: Mon, 30 Jul 2001 12:57:28 -0400
Subject: [XML-SIG] How to use PyExpat ExternalParsedEntityDeclHandler?
Message-ID: <3B6591F8.C0DE2F77@zolera.com>

I have an input document
	<!DOCTYPE doc [ 
		<!ENTITY ent1 "Hello">
		<!ENTITY ent2 SYSTEM "world.txt">
	]>
	<doc>&ent1;, &ent2;!</doc>

I'm subclassing PyExpat.Reader:

PYE = PyExpat.Reader
class ReaderforC14NExamples(PYE):
    def initParser(self):
        PYE.initParser(self)
        self.parser.UnparsedEntityDeclHandler = self.unparsedEntityDecl
        self.parser.NotationDeclHandler = self.notationDecl
        self.parser.ExternalParsedEntityDeclHandler = self.entityDecl
 
    def entityDecl(self, *args):
        if args != ('ent2', None, 'world.txt', None): return
        print 'match'
>>>     self.parser.CharacterDataHandler('world')
        return 0

Doesn't do what I thought it would.

help?  tnx.
	/r$


From noreply@sourceforge.net  Mon Jul 30 18:37:48 2001
From: noreply@sourceforge.net (noreply@sourceforge.net)
Date: Mon, 30 Jul 2001 10:37:48 -0700
Subject: [XML-SIG] [ pyxml-Patches-446023 ] Add-on to Cygwin 4Suite patch 445441
Message-ID: <E15RGzE-0003tO-00@usw-sf-web2.sourceforge.net>

Patches item #446023, was opened at 2001-07-30 10:37
You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=306473&aid=446023&group_id=6473

Category: 4Suite
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Bill Eldridge (dcbill)
Assigned to: Nobody/Anonymous (nobody)
Summary: Add-on to Cygwin 4Suite patch 445441

Initial Comment:
Seems like the 4Suite Cygwin patch for boolean.c
needed some additional lines that were added in
the PyXML patch for boolean.c

Applies to 4Suite-0.11.1b3, don't know about later.


----------------------------------------------------------------------

You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=306473&aid=446023&group_id=6473


From yuba@cyberback.com  Mon Jul 30 23:24:11 2001
From: yuba@cyberback.com (Greg & Janet LINDSTROM)
Date: Mon, 30 Jul 2001 17:24:11 -0500
Subject: [XML-SIG] Where to start?
Message-ID: <000801c11946$5d49b040$42de3dd8@glindshome>

Greetings-
I am ready to leave my secure world of fixed length records and
enter the brave new world of XML.  I have looked through the
website, including this SIG; I have purchased the XML Processing
with Python book and worked through the examples (though they do not
seem to work with Python 2.1.1.  There appear to be many choices for
XML processing software (or perhaps I'm reading it wrong)...what do
you suggest I use to create and process XML for transmission via
sockets?  Are there any good tutorials? The How-to was hard to
follow (IMHO, I may be a bit slower than the average bear)?

Any help you can supply would be appreciated.

Greg Lindstrom
Vilonia, AR


From martin@loewis.home.cs.tu-berlin.de  Tue Jul 31 07:10:19 2001
From: martin@loewis.home.cs.tu-berlin.de (Martin v. Loewis)
Date: Tue, 31 Jul 2001 08:10:19 +0200
Subject: [XML-SIG] Bug in dom/ext/reader/PyExpat.py
In-Reply-To: <3B658714.2A3455EA@zolera.com> (message from Rich Salz on Mon, 30
 Jul 2001 12:11:00 -0400)
References: <3B658714.2A3455EA@zolera.com>
Message-ID: <200107310610.f6V6AJn01058@mira.informatik.hu-berlin.de>

> comments?

If there are none at the moment, can you please file a bug report at
SF with a test case?

Thanks,
Martin


From martin@loewis.home.cs.tu-berlin.de  Tue Jul 31 07:09:02 2001
From: martin@loewis.home.cs.tu-berlin.de (Martin v. Loewis)
Date: Tue, 31 Jul 2001 08:09:02 +0200
Subject: [XML-SIG] How to use PyExpat ExternalParsedEntityDeclHandler?
In-Reply-To: <3B6591F8.C0DE2F77@zolera.com> (message from Rich Salz on Mon, 30
 Jul 2001 12:57:28 -0400)
References: <3B6591F8.C0DE2F77@zolera.com>
Message-ID: <200107310609.f6V692F01057@mira.informatik.hu-berlin.de>

> PYE = PyExpat.Reader
> class ReaderforC14NExamples(PYE):
>     def initParser(self):
>         PYE.initParser(self)
>         self.parser.UnparsedEntityDeclHandler = self.unparsedEntityDecl
>         self.parser.NotationDeclHandler = self.notationDecl
>         self.parser.ExternalParsedEntityDeclHandler = self.entityDecl
>  
>     def entityDecl(self, *args):
>         if args != ('ent2', None, 'world.txt', None): return
>         print 'match'
> >>>     self.parser.CharacterDataHandler('world')
>         return 0
> 
> Doesn't do what I thought it would.

Well, what did you think it would do? Replace every occurrence of the
external entity with "world"? Then you should handle the references to
the external entities, not the declaration. I.e. you should set the
ExternalEntityRefHandler. See expatreader for an example of how to do
this.
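
A minimal sketch of handling the entity reference rather than the
declaration. It assumes the standard-library `xml.parsers.expat` module is
used directly instead of the PyXML `PyExpat.Reader` class, and the
`ENTITY_CONTENT` dictionary is a hypothetical stand-in for reading
world.txt from disk.

```python
import xml.parsers.expat

DOC = b"""<!DOCTYPE doc [
  <!ENTITY ent1 "Hello">
  <!ENTITY ent2 SYSTEM "world.txt">
]>
<doc>&ent1;, &ent2;!</doc>"""

# Hypothetical replacement for the file system: system id -> entity content.
ENTITY_CONTENT = {"world.txt": b"world"}


def parse_with_entities(data):
    """Return the character data of the document, external entities included."""
    chunks = []
    parser = xml.parsers.expat.ParserCreate()
    parser.CharacterDataHandler = chunks.append

    def external_entity_ref(context, base, system_id, public_id):
        # Parse the entity content with a child parser; the child inherits
        # the handlers of the parent, so its text lands in the same list.
        child = parser.ExternalEntityParserCreate(context)
        child.Parse(ENTITY_CONTENT[system_id], True)
        return 1  # non-zero tells expat the reference was handled

    parser.ExternalEntityRefHandler = external_entity_ref
    parser.Parse(data, True)
    return "".join(chunks)


text = parse_with_entities(DOC)
print(text)
```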

Regards,
Martin


From martin@loewis.home.cs.tu-berlin.de  Tue Jul 31 07:03:13 2001
From: martin@loewis.home.cs.tu-berlin.de (Martin v. Loewis)
Date: Tue, 31 Jul 2001 08:03:13 +0200
Subject: [XML-SIG] Where to start?
In-Reply-To: <000801c11946$5d49b040$42de3dd8@glindshome> (yuba@cyberback.com)
References: <000801c11946$5d49b040$42de3dd8@glindshome>
Message-ID: <200107310603.f6V63Dg01054@mira.informatik.hu-berlin.de>

> I am ready to leave my secure world of fixed length records and
> enter the brave new world of XML.  I have looked through the
> website, including this SIG; I have purchased the XML Processing
> with Python book and worked through the examples (though they do not
> seem to work with Python 2.1.1.  There appear to be many choices for
> XML processing software (or perhaps I'm reading it wrong)...what do
> you suggest I use to create and process XML for transmission via
> sockets?  

For creating XML, plain print/write statements are usually sufficient,
unless you already have some XML which you use to create other XML.

For processing, there are many options, and knowing that the output
will get over a socket doesn't help much in deciding which one is
best. With no further information, I'd recommend to use the DOM API.
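
A short sketch of both halves, assuming a hypothetical `record` element
with made-up field names: the XML is produced with plain write calls
(escaping the data on the way out) and then read back through the DOM API.

```python
import io
from xml.dom import minidom
from xml.sax.saxutils import escape, quoteattr


def write_record(out, rec_id, fields):
    """Write one hypothetical <record> element using plain write() calls."""
    out.write("<record id=%s>" % quoteattr(rec_id))
    for name, value in fields:
        # escape() protects &, < and > in the data
        out.write("<%s>%s</%s>" % (name, escape(value), name))
    out.write("</record>")


buf = io.StringIO()
write_record(buf, "42", [("name", "Smith & Sons"), ("city", "Vilonia")])

# Processing side: parse the text and pull values out through the DOM.
doc = minidom.parseString(buf.getvalue())
record = doc.documentElement
name = record.getElementsByTagName("name")[0].firstChild.data
print(record.getAttribute("id"), name)
```

The same string could equally be sent over a socket before parsing; the
transport does not change either half.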

> Are there any good tutorials? The How-to was hard to
> follow (IMHO, I may be a bit slower than the average bear)?

Don't be tricked into assuming XML was easy to do, even if people tell
you it is. It takes quite some learning effort, as there are many
issues involved. I would recommend that you attempt to solve the
problem at hand; that will already provide you with an understanding
of the application domain. If you then find specific questions of the
type "how do I", don't hesitate to ask them here.

Regards,
Martin


From noreply@sourceforge.net  Tue Jul 31 10:44:01 2001
From: noreply@sourceforge.net (noreply@sourceforge.net)
Date: Tue, 31 Jul 2001 02:44:01 -0700
Subject: [XML-SIG] [ pyxml-Bugs-446326 ] extensions dir doesn't build?
Message-ID: <E15RW4H-0003bZ-00@usw-sf-web2.sourceforge.net>

Bugs item #446326, was opened at 2001-07-31 02:44
You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=106473&aid=446326&group_id=6473

Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Bill Eldridge (dcbill)
Assigned to: Nobody/Anonymous (nobody)
Summary: extensions dir doesn't build?

Initial Comment:
Seems that now to get xml/extensions/boolean.c et al.
compiled, I have to add:

        clobber: clean all
and change 
        LIB=xmlparse/libexpat.a 
to
        LIB=libexpat.a

in xml/extensions/expat/Makefile

and then go to xml/extensions and

        make -f Makefile.pre.in boot
        make
        cp pyexpat.so /usr/lib/python2.1/site-examples
        cp sgmlop.so /usr/lib/python2.1/site-examples

Did something strange happen to the PyXML tree?

(I had to do this both with RH 7.0 & Cygwin)

----------------------------------------------------------------------

You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=106473&aid=446326&group_id=6473


From noreply@sourceforge.net  Tue Jul 31 16:23:14 2001
From: noreply@sourceforge.net (noreply@sourceforge.net)
Date: Tue, 31 Jul 2001 08:23:14 -0700
Subject: [XML-SIG] [ pyxml-Bugs-446436 ] Nested xmlns='' declarations break
Message-ID: <E15RbMY-0005M3-00@usw-sf-web1.sourceforge.net>

Bugs item #446436, was opened at 2001-07-31 08:23
You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=106473&aid=446436&group_id=6473

Category: expat
Group: None
Status: Open
Resolution: None
Priority: 7
Submitted By: Rich Salz (rsalz)
Assigned to: Nobody/Anonymous (nobody)
Summary: Nested xmlns='' declarations break

Initial Comment:
Nested declarations of the default namespace raise an
exception:
eg3 = """<outer xmlns="">
   <inner xmlns="http://www.ietf.org"/>
</outer>"""

from xml.dom.ext.reader import PyExpat
dom = PyExpat.Reader().fromString(eg3)

Traceback (most recent call last):
  File "x.py", line 7, in ?
    dom = PyExpat.Reader().fromString(eg3)
  File
"/usr/local/lib/python2.0/site-packages/_xmlplus/dom/ext/reader/__init__.py",
line 63, in fromString
    return self.fromStream(stream, ownerDoc)
  File
"/usr/local/lib/python2.0/site-packages/_xmlplus/dom/ext/reader/PyExpat.py",
line 65, in fromStream
    success = self.parser.ParseFile(stream)
  File
"/usr/local/lib/python2.0/site-packages/_xmlplus/dom/ext/reader/PyExpat.py",
line 134, in startElement
    del self._namespaces[prefix]
KeyError

This seems like a good fix.  Line 130 of PyExpat, in
startElement, add "or prefix == ''"
   if (prefix or prefix == '' or value):
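
For illustration only, a simplified model of the logic above, written
against a plain dictionary rather than the real PyExpat.py reader. The
function `tolerant_update` is a hypothetical third alternative and is not
one of the two fixes proposed on the list: it treats undeclaring an
unknown prefix as a no-op.

```python
def original_update(namespaces, prefix, value):
    """Simplified model of the check quoted from startElement."""
    if prefix or value:
        namespaces[prefix] = value
    else:
        del namespaces[prefix]  # KeyError if the prefix was never declared


def tolerant_update(namespaces, prefix, value):
    """Hypothetical alternative: removing an unknown prefix is harmless."""
    if value:
        namespaces[prefix] = value
    else:
        namespaces.pop(prefix, None)


def replay(update, declarations):
    """Apply a sequence of (prefix, value) declarations to an empty map."""
    namespaces = {}
    for prefix, value in declarations:
        update(namespaces, prefix, value)
    return namespaces


# xmlns="" on <outer>, then xmlns="http://www.ietf.org" on <inner>
DECLS = [("", ""), ("", "http://www.ietf.org")]

try:
    replay(original_update, DECLS)
    original_failed = False
except KeyError:
    original_failed = True

result = replay(tolerant_update, DECLS)
print(original_failed, result)
```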


----------------------------------------------------------------------

You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=106473&aid=446436&group_id=6473


From timbl@w3.org  Tue Jul 31 18:09:13 2001
From: timbl@w3.org (Tim Berners-Lee)
Date: Tue, 31 Jul 2001 13:09:13 -0400
Subject: [XML-SIG] problems building pyXML for cygwin
Message-ID: <003e01c119e3$85d69df0$e0061812@CREST>

I tried installing PyXML under Cygwin.  Any ideas why it didn't work?  I
haven't looked into it in any detail.  Python is:
Python 2.1 (#1, Apr 17 2001, 09:45:01)   [GCC 2.95.3-2 (cygwin special)] on cygwin_nt-4.01

It would of course be great to have pyXML in the cygwin distribution, as
python is.

Tim BL

$ python setup.py build
running build
running build_py
creating build
creating build/lib.cygwin_nt-5.0-1.3.2-i686-2.1
creating build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus
copying xml/__init__.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus
copying xml/_checkversion.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus
creating build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/Attr.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/CDATASection.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/CharacterData.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/Comment.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/DOMImplementation.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/Document.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/DocumentFragment.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/DocumentType.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/Element.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/Entity.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/EntityReference.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/Event.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/FtNode.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/MessageSource.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/NamedNodeMap.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/NodeFilter.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/NodeIterator.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/NodeList.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/Notation.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/ProcessingInstruction.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/Range.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/Text.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/TreeWalker.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/__init__.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/domreg.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/javadom.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/minidom.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/minitraversal.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
copying xml/dom/pulldom.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom
creating build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/GenerateHtml.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLAnchorElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLAppletElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLAreaElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLBRElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLBaseElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLBaseFontElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLBodyElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLButtonElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLCollection.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLDListElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLDOMImplementation.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLDirectoryElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLDivElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLDocument.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/__init__.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLFieldSetElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLFontElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLFormElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLFrameElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLFrameSetElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLHRElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLHeadElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLHeadingElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLHtmlElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLIFrameElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLImageElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLInputElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLIsIndexElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLLIElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLLabelElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLLegendElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLLinkElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLMapElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLMenuElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLMetaElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLModElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLOListElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLObjectElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLOptGroupElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLOptionElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLParagraphElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLParamElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLPreElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLQuoteElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLScriptElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLSelectElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLStyleElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLTableCaptionElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLTableCellElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLTableColElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLTableElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLTableRowElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLTableSectionElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLTextAreaElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLTitleElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
copying xml/dom/html/HTMLUListElement.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/html
creating build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/ext
copying xml/dom/ext/Printer.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/ext
copying xml/dom/ext/Visitor.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/ext
copying xml/dom/ext/XHtml2HtmlPrinter.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/ext
copying xml/dom/ext/XHtmlPrinter.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/ext
copying xml/dom/ext/__init__.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/ext
creating build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/ext/reader
copying xml/dom/ext/reader/HtmlLib.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/ext/reader
copying xml/dom/ext/reader/HtmlSax.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/ext/reader
copying xml/dom/ext/reader/PyExpat.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/ext/reader
copying xml/dom/ext/reader/Sax.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/ext/reader
copying xml/dom/ext/reader/Sax2.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/ext/reader
copying xml/dom/ext/reader/Sax2Lib.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/ext/reader
copying xml/dom/ext/reader/Sgmlop.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/ext/reader
copying xml/dom/ext/reader/__init__.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/dom/ext/reader
creating build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/marshal
copying xml/marshal/__init__.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/marshal
copying xml/marshal/generic.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/marshal
copying xml/marshal/wddx.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/marshal
copying xml/marshal/xmlrpc.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/marshal
creating build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/unicode
copying xml/unicode/__init__.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/unicode
copying xml/unicode/iso8859.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/unicode
copying xml/unicode/utf8_iso.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/unicode
creating build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/parsers
copying xml/parsers/__init__.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/parsers
copying xml/parsers/expat.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/parsers
copying xml/parsers/sgmllib.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/parsers
creating build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/parsers/xmlproc
copying xml/parsers/xmlproc/__init__.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/parsers/xmlproc
copying xml/parsers/xmlproc/_outputters.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/parsers/xmlproc
copying xml/parsers/xmlproc/catalog.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/parsers/xmlproc
copying xml/parsers/xmlproc/charconv.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/parsers/xmlproc
copying xml/parsers/xmlproc/dtdparser.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/parsers/xmlproc
copying xml/parsers/xmlproc/errors.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/parsers/xmlproc
copying xml/parsers/xmlproc/namespace.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/parsers/xmlproc
copying xml/parsers/xmlproc/utils.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/parsers/xmlproc
copying xml/parsers/xmlproc/xcatalog.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/parsers/xmlproc
copying xml/parsers/xmlproc/xmlapp.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/parsers/xmlproc
copying xml/parsers/xmlproc/xmldtd.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/parsers/xmlproc
copying xml/parsers/xmlproc/xmlproc.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/parsers/xmlproc
copying xml/parsers/xmlproc/xmlutils.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/parsers/xmlproc
copying xml/parsers/xmlproc/xmlval.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/parsers/xmlproc
creating build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax
copying xml/sax/__init__.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax
copying xml/sax/_exceptions.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax
copying xml/sax/expatreader.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax
copying xml/sax/handler.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax
copying xml/sax/sax2exts.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax
copying xml/sax/saxexts.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax
copying xml/sax/saxlib.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax
copying xml/sax/saxutils.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax
copying xml/sax/writer.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax
copying xml/sax/xmlreader.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax
creating build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax/drivers
copying xml/sax/drivers/__init__.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax/drivers
copying xml/sax/drivers/drv_htmllib.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax/drivers
copying xml/sax/drivers/drv_ltdriver.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax/drivers
copying xml/sax/drivers/drv_ltdriver_val.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax/drivers
copying xml/sax/drivers/drv_pyexpat.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax/drivers
copying xml/sax/drivers/drv_sgmllib.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax/drivers
copying xml/sax/drivers/drv_sgmlop.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax/drivers
copying xml/sax/drivers/drv_xmldc.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax/drivers
copying xml/sax/drivers/drv_xmllib.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax/drivers
copying xml/sax/drivers/drv_xmlproc.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax/drivers
copying xml/sax/drivers/drv_xmlproc_val.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax/drivers
copying xml/sax/drivers/drv_xmltoolkit.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax/drivers
copying xml/sax/drivers/pylibs.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax/drivers
creating build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax/drivers2
copying xml/sax/drivers2/__init__.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax/drivers2
copying xml/sax/drivers2/drv_pyexpat.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax/drivers2
copying xml/sax/drivers2/drv_xmlproc.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/sax/drivers2
creating build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/utils
copying xml/utils/__init__.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/utils
copying xml/utils/iso8601.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/utils
copying xml/utils/qp_xml.py -> build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/utils
running build_ext
building '_xmlplus.parsers.pyexpat' extension
creating build/temp.cygwin_nt-5.0-1.3.2-i686-2.1
gcc -g -O2 -Wall -Wstrict-prototypes -DUSE_DL_IMPORT -DXML_NS -DXML_DTD -DEXPAT_VERSION=0x010200 -Iextensions/expat/xmltok -Iextensions/expat/xmlparse -I/usr/include/python2.1 -c extensions/pyexpat.c -o build/temp.cygwin_nt-5.0-1.3.2-i686-2.1/pyexpat.o
extensions/pyexpat.c:143: warning: `conv_atts_using_string' defined but not used
extensions/pyexpat.c:181: warning: `conv_atts_using_unicode' defined but not used
gcc -g -O2 -Wall -Wstrict-prototypes -DUSE_DL_IMPORT -DXML_NS -DXML_DTD -DEXPAT_VERSION=0x010200 -Iextensions/expat/xmltok -Iextensions/expat/xmlparse -I/usr/include/python2.1 -c extensions/expat/xmltok/xmltok.c -o build/temp.cygwin_nt-5.0-1.3.2-i686-2.1/xmltok.o
gcc -g -O2 -Wall -Wstrict-prototypes -DUSE_DL_IMPORT -DXML_NS -DXML_DTD -DEXPAT_VERSION=0x010200 -Iextensions/expat/xmltok -Iextensions/expat/xmlparse -I/usr/include/python2.1 -c extensions/expat/xmltok/xmlrole.c -o build/temp.cygwin_nt-5.0-1.3.2-i686-2.1/xmlrole.o
gcc -g -O2 -Wall -Wstrict-prototypes -DUSE_DL_IMPORT -DXML_NS -DXML_DTD -DEXPAT_VERSION=0x010200 -Iextensions/expat/xmltok -Iextensions/expat/xmlparse -I/usr/include/python2.1 -c extensions/expat/xmlwf/xmlfile.c -o build/temp.cygwin_nt-5.0-1.3.2-i686-2.1/xmlfile.o
extensions/expat/xmlwf/xmlfile.c: In function `processStream':
extensions/expat/xmlwf/xmlfile.c:149: warning: implicit declaration of function `close'
extensions/expat/xmlwf/xmlfile.c:153: warning: implicit declaration of function `read'
gcc -g -O2 -Wall -Wstrict-prototypes -DUSE_DL_IMPORT -DXML_NS -DXML_DTD -DEXPAT_VERSION=0x010200 -Iextensions/expat/xmltok -Iextensions/expat/xmlparse -I/usr/include/python2.1 -c extensions/expat/xmlwf/xmlwf.c -o build/temp.cygwin_nt-5.0-1.3.2-i686-2.1/xmlwf.o
gcc -g -O2 -Wall -Wstrict-prototypes -DUSE_DL_IMPORT -DXML_NS -DXML_DTD -DEXPAT_VERSION=0x010200 -Iextensions/expat/xmltok -Iextensions/expat/xmlparse -I/usr/include/python2.1 -c extensions/expat/xmlwf/codepage.c -o build/temp.cygwin_nt-5.0-1.3.2-i686-2.1/codepage.o
gcc -g -O2 -Wall -Wstrict-prototypes -DUSE_DL_IMPORT -DXML_NS -DXML_DTD -DEXPAT_VERSION=0x010200 -Iextensions/expat/xmltok -Iextensions/expat/xmlparse -I/usr/include/python2.1 -c extensions/expat/xmlparse/xmlparse.c -o build/temp.cygwin_nt-5.0-1.3.2-i686-2.1/xmlparse.o
gcc -g -O2 -Wall -Wstrict-prototypes -DUSE_DL_IMPORT -DXML_NS -DXML_DTD -DEXPAT_VERSION=0x010200 -Iextensions/expat/xmltok -Iextensions/expat/xmlparse -I/usr/include/python2.1 -c extensions/expat/xmlwf/unixfilemap.c -o build/temp.cygwin_nt-5.0-1.3.2-i686-2.1/unixfilemap.o
extensions/expat/xmlwf/unixfilemap.c: In function `filemap':
extensions/expat/xmlwf/unixfilemap.c:36: warning: implicit declaration of function `close'
gcc -shared -Wl,--enable-auto-image-base build/temp.cygwin_nt-5.0-1.3.2-i686-2.1/pyexpat.o build/temp.cygwin_nt-5.0-1.3.2-i686-2.1/xmltok.o build/temp.cygwin_nt-5.0-1.3.2-i686-2.1/xmlrole.o build/temp.cygwin_nt-5.0-1.3.2-i686-2.1/xmlfile.o build/temp.cygwin_nt-5.0-1.3.2-i686-2.1/xmlwf.o build/temp.cygwin_nt-5.0-1.3.2-i686-2.1/codepage.o build/temp.cygwin_nt-5.0-1.3.2-i686-2.1/xmlparse.o build/temp.cygwin_nt-5.0-1.3.2-i686-2.1/unixfilemap.o -L/usr/lib/python2.1/config -lpython2.1 -o build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/parsers/pyexpat.dll
building '_xmlplus.parsers.sgmlop' extension
gcc -g -O2 -Wall -Wstrict-prototypes -DUSE_DL_IMPORT -I/usr/include/python2.1 -c extensions/sgmlop.c -o build/temp.cygwin_nt-5.0-1.3.2-i686-2.1/sgmlop.o
gcc -shared -Wl,--enable-auto-image-base build/temp.cygwin_nt-5.0-1.3.2-i686-2.1/sgmlop.o -L/usr/lib/python2.1/config -lpython2.1 -o build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/parsers/sgmlop.dll
Cannot export _bss_end__: symbol not defined
Cannot export _bss_start__: symbol not defined
Cannot export _data_end__: symbol not defined
Cannot export _data_start__: symbol not defined
collect2: ld returned 1 exit status
error: command 'gcc' failed with exit status 1


timbl@CREST /cygdrive/c/Download/pyXML/PyXML-0.6.5
$ python setup.py install
running install
running build
running build_py
not copying xml/__init__.py (output up-to-date)
not copying xml/_checkversion.py (output up-to-date)
not copying xml/dom/Attr.py (output up-to-date)
not copying xml/dom/CDATASection.py (output up-to-date)
not copying xml/dom/CharacterData.py (output up-to-date)
not copying xml/dom/Comment.py (output up-to-date)
not copying xml/dom/DOMImplementation.py (output up-to-date)
not copying xml/dom/Document.py (output up-to-date)
not copying xml/dom/DocumentFragment.py (output up-to-date)
not copying xml/dom/DocumentType.py (output up-to-date)
not copying xml/dom/Element.py (output up-to-date)
not copying xml/dom/Entity.py (output up-to-date)
not copying xml/dom/EntityReference.py (output up-to-date)
not copying xml/dom/Event.py (output up-to-date)
not copying xml/dom/FtNode.py (output up-to-date)
not copying xml/dom/MessageSource.py (output up-to-date)
not copying xml/dom/NamedNodeMap.py (output up-to-date)
not copying xml/dom/NodeFilter.py (output up-to-date)
not copying xml/dom/NodeIterator.py (output up-to-date)
not copying xml/dom/NodeList.py (output up-to-date)
not copying xml/dom/Notation.py (output up-to-date)
not copying xml/dom/ProcessingInstruction.py (output up-to-date)
not copying xml/dom/Range.py (output up-to-date)
not copying xml/dom/Text.py (output up-to-date)
not copying xml/dom/TreeWalker.py (output up-to-date)
not copying xml/dom/__init__.py (output up-to-date)
not copying xml/dom/domreg.py (output up-to-date)
not copying xml/dom/javadom.py (output up-to-date)
not copying xml/dom/minidom.py (output up-to-date)
not copying xml/dom/minitraversal.py (output up-to-date)
not copying xml/dom/pulldom.py (output up-to-date)
not copying xml/dom/html/GenerateHtml.py (output up-to-date)
not copying xml/dom/html/HTMLAnchorElement.py (output up-to-date)
not copying xml/dom/html/HTMLAppletElement.py (output up-to-date)
not copying xml/dom/html/HTMLAreaElement.py (output up-to-date)
not copying xml/dom/html/HTMLBRElement.py (output up-to-date)
not copying xml/dom/html/HTMLBaseElement.py (output up-to-date)
not copying xml/dom/html/HTMLBaseFontElement.py (output up-to-date)
not copying xml/dom/html/HTMLBodyElement.py (output up-to-date)
not copying xml/dom/html/HTMLButtonElement.py (output up-to-date)
not copying xml/dom/html/HTMLCollection.py (output up-to-date)
not copying xml/dom/html/HTMLDListElement.py (output up-to-date)
not copying xml/dom/html/HTMLDOMImplementation.py (output up-to-date)
not copying xml/dom/html/HTMLDirectoryElement.py (output up-to-date)
not copying xml/dom/html/HTMLDivElement.py (output up-to-date)
not copying xml/dom/html/HTMLDocument.py (output up-to-date)
not copying xml/dom/html/HTMLElement.py (output up-to-date)
not copying xml/dom/html/__init__.py (output up-to-date)
not copying xml/dom/html/HTMLFieldSetElement.py (output up-to-date)
not copying xml/dom/html/HTMLFontElement.py (output up-to-date)
not copying xml/dom/html/HTMLFormElement.py (output up-to-date)
not copying xml/dom/html/HTMLFrameElement.py (output up-to-date)
not copying xml/dom/html/HTMLFrameSetElement.py (output up-to-date)
not copying xml/dom/html/HTMLHRElement.py (output up-to-date)
not copying xml/dom/html/HTMLHeadElement.py (output up-to-date)
not copying xml/dom/html/HTMLHeadingElement.py (output up-to-date)
not copying xml/dom/html/HTMLHtmlElement.py (output up-to-date)
not copying xml/dom/html/HTMLIFrameElement.py (output up-to-date)
not copying xml/dom/html/HTMLImageElement.py (output up-to-date)
not copying xml/dom/html/HTMLInputElement.py (output up-to-date)
not copying xml/dom/html/HTMLIsIndexElement.py (output up-to-date)
not copying xml/dom/html/HTMLLIElement.py (output up-to-date)
not copying xml/dom/html/HTMLLabelElement.py (output up-to-date)
not copying xml/dom/html/HTMLLegendElement.py (output up-to-date)
not copying xml/dom/html/HTMLLinkElement.py (output up-to-date)
not copying xml/dom/html/HTMLMapElement.py (output up-to-date)
not copying xml/dom/html/HTMLMenuElement.py (output up-to-date)
not copying xml/dom/html/HTMLMetaElement.py (output up-to-date)
not copying xml/dom/html/HTMLModElement.py (output up-to-date)
not copying xml/dom/html/HTMLOListElement.py (output up-to-date)
not copying xml/dom/html/HTMLObjectElement.py (output up-to-date)
not copying xml/dom/html/HTMLOptGroupElement.py (output up-to-date)
not copying xml/dom/html/HTMLOptionElement.py (output up-to-date)
not copying xml/dom/html/HTMLParagraphElement.py (output up-to-date)
not copying xml/dom/html/HTMLParamElement.py (output up-to-date)
not copying xml/dom/html/HTMLPreElement.py (output up-to-date)
not copying xml/dom/html/HTMLQuoteElement.py (output up-to-date)
not copying xml/dom/html/HTMLScriptElement.py (output up-to-date)
not copying xml/dom/html/HTMLSelectElement.py (output up-to-date)
not copying xml/dom/html/HTMLStyleElement.py (output up-to-date)
not copying xml/dom/html/HTMLTableCaptionElement.py (output up-to-date)
not copying xml/dom/html/HTMLTableCellElement.py (output up-to-date)
not copying xml/dom/html/HTMLTableColElement.py (output up-to-date)
not copying xml/dom/html/HTMLTableElement.py (output up-to-date)
not copying xml/dom/html/HTMLTableRowElement.py (output up-to-date)
not copying xml/dom/html/HTMLTableSectionElement.py (output up-to-date)
not copying xml/dom/html/HTMLTextAreaElement.py (output up-to-date)
not copying xml/dom/html/HTMLTitleElement.py (output up-to-date)
not copying xml/dom/html/HTMLUListElement.py (output up-to-date)
not copying xml/dom/ext/Printer.py (output up-to-date)
not copying xml/dom/ext/Visitor.py (output up-to-date)
not copying xml/dom/ext/XHtml2HtmlPrinter.py (output up-to-date)
not copying xml/dom/ext/XHtmlPrinter.py (output up-to-date)
not copying xml/dom/ext/__init__.py (output up-to-date)
not copying xml/dom/ext/reader/HtmlLib.py (output up-to-date)
not copying xml/dom/ext/reader/HtmlSax.py (output up-to-date)
not copying xml/dom/ext/reader/PyExpat.py (output up-to-date)
not copying xml/dom/ext/reader/Sax.py (output up-to-date)
not copying xml/dom/ext/reader/Sax2.py (output up-to-date)
not copying xml/dom/ext/reader/Sax2Lib.py (output up-to-date)
not copying xml/dom/ext/reader/Sgmlop.py (output up-to-date)
not copying xml/dom/ext/reader/__init__.py (output up-to-date)
not copying xml/marshal/__init__.py (output up-to-date)
not copying xml/marshal/generic.py (output up-to-date)
not copying xml/marshal/wddx.py (output up-to-date)
not copying xml/marshal/xmlrpc.py (output up-to-date)
not copying xml/unicode/__init__.py (output up-to-date)
not copying xml/unicode/iso8859.py (output up-to-date)
not copying xml/unicode/utf8_iso.py (output up-to-date)
not copying xml/parsers/__init__.py (output up-to-date)
not copying xml/parsers/expat.py (output up-to-date)
not copying xml/parsers/sgmllib.py (output up-to-date)
not copying xml/parsers/xmlproc/__init__.py (output up-to-date)
not copying xml/parsers/xmlproc/_outputters.py (output up-to-date)
not copying xml/parsers/xmlproc/catalog.py (output up-to-date)
not copying xml/parsers/xmlproc/charconv.py (output up-to-date)
not copying xml/parsers/xmlproc/dtdparser.py (output up-to-date)
not copying xml/parsers/xmlproc/errors.py (output up-to-date)
not copying xml/parsers/xmlproc/namespace.py (output up-to-date)
not copying xml/parsers/xmlproc/utils.py (output up-to-date)
not copying xml/parsers/xmlproc/xcatalog.py (output up-to-date)
not copying xml/parsers/xmlproc/xmlapp.py (output up-to-date)
not copying xml/parsers/xmlproc/xmldtd.py (output up-to-date)
not copying xml/parsers/xmlproc/xmlproc.py (output up-to-date)
not copying xml/parsers/xmlproc/xmlutils.py (output up-to-date)
not copying xml/parsers/xmlproc/xmlval.py (output up-to-date)
not copying xml/sax/__init__.py (output up-to-date)
not copying xml/sax/_exceptions.py (output up-to-date)
not copying xml/sax/expatreader.py (output up-to-date)
not copying xml/sax/handler.py (output up-to-date)
not copying xml/sax/sax2exts.py (output up-to-date)
not copying xml/sax/saxexts.py (output up-to-date)
not copying xml/sax/saxlib.py (output up-to-date)
not copying xml/sax/saxutils.py (output up-to-date)
not copying xml/sax/writer.py (output up-to-date)
not copying xml/sax/xmlreader.py (output up-to-date)
not copying xml/sax/drivers/__init__.py (output up-to-date)
not copying xml/sax/drivers/drv_htmllib.py (output up-to-date)
not copying xml/sax/drivers/drv_ltdriver.py (output up-to-date)
not copying xml/sax/drivers/drv_ltdriver_val.py (output up-to-date)
not copying xml/sax/drivers/drv_pyexpat.py (output up-to-date)
not copying xml/sax/drivers/drv_sgmllib.py (output up-to-date)
not copying xml/sax/drivers/drv_sgmlop.py (output up-to-date)
not copying xml/sax/drivers/drv_xmldc.py (output up-to-date)
not copying xml/sax/drivers/drv_xmllib.py (output up-to-date)
not copying xml/sax/drivers/drv_xmlproc.py (output up-to-date)
not copying xml/sax/drivers/drv_xmlproc_val.py (output up-to-date)
not copying xml/sax/drivers/drv_xmltoolkit.py (output up-to-date)
not copying xml/sax/drivers/pylibs.py (output up-to-date)
not copying xml/sax/drivers2/__init__.py (output up-to-date)
not copying xml/sax/drivers2/drv_pyexpat.py (output up-to-date)
not copying xml/sax/drivers2/drv_xmlproc.py (output up-to-date)
not copying xml/utils/__init__.py (output up-to-date)
not copying xml/utils/iso8601.py (output up-to-date)
not copying xml/utils/qp_xml.py (output up-to-date)
running build_ext
skipping '_xmlplus.parsers.pyexpat' extension (up-to-date)
building '_xmlplus.parsers.sgmlop' extension
skipping extensions/sgmlop.c (build/temp.cygwin_nt-5.0-1.3.2-i686-2.1/sgmlop.o up-to-date)
gcc -shared -Wl,--enable-auto-image-base build/temp.cygwin_nt-5.0-1.3.2-i686-2.1/sgmlop.o -L/usr/lib/python2.1/config -lpython2.1 -o build/lib.cygwin_nt-5.0-1.3.2-i686-2.1/_xmlplus/parsers/sgmlop.dll
Cannot export _bss_end__: symbol not defined
Cannot export _bss_start__: symbol not defined
Cannot export _data_end__: symbol not defined
Cannot export _data_start__: symbol not defined
collect2: ld returned 1 exit status
error: command 'gcc' failed with exit status 1

timbl@CREST /cygdrive/c/Download/pyXML/PyXML-0.6.5
$ uname -a
CYGWIN_NT-5.0 CREST 1.3.2(0.39/3/2) 2001-05-20 23:28 i686 unknown


From moscowworkshops@email.com  Tue Jul 31 19:13:17 2001
From: moscowworkshops@email.com (MOSCOW WORKSHOPS)
Date: Tue, 31 Jul 2001 22:13:17 +0400
Subject: [XML-SIG] Increase business with wealthy Russian clients
Message-ID: <200107311843.f6VIhRJ43854@addr21.addr.com>

This is a multi-part message in MIME format.

------=_NextPart_000_0181_01C11A0E.069F6BE0
Content-Type: text/plain;
        charset="koi8-r"
Content-Transfer-Encoding: quoted-printable

MOSCOW INTERNATIONAL WINTER WORKSHOP
06 September 2001


The Russian travel market continues to expand with many destinations
posting increases of more than 15% this summer.

The Russian Travel market is a lucrative market and the Travel Companies
are looking for new offers for their clients. Doing business with
Russia is often perceived of being difficult and complicated but this is
not true. The Russian Travel companies are better organised, payments
are always made on time and the clients enjoy spending and are generous
to staff.

The Moscow International Winter Workshop will take place in Moscow on
September 6th and is the best opportunity to meet with the Russian
Travel Trade as they prepare their winter programs. Strategically
planned 6 weeks prior to the winter exhibition participation at the
Workshop will ensure that your offers are included in the Russian travel
companies winter programs.

Winter Sun. 40% of all Vacations booked are during the winter season.
Wealthy Russian clients are looking for exotic destinations, warmth,
attractions and activities.

Business Travel. Russian business people are prolific travellers. 64% of
the Russian companies travel on business up to 10 times a year and 29%
take between 10 and 50 trips a year!

Meetings & Incentives. 28% of Russian companies organised Incentive
Tours and more than a quarter organise International Meetings &
Incentives. More than half book these travel arrangements through Travel
Agents.

Skiing. Russians are keen skiers and are always looking for special
accommodation and new ski destinations.

Summer 2002. Many Russian companies are starting to plan their next
summer programs much earlier than in the past. Contact with Travel
Companies in September will ensure your position for next summer.

The Moscow international Workshop on September 6th, is THE event to make
contact with the Russian Travel Trade. Please contact us for more
details.

 MOSCOW INTERNATIONAL WINTER WORKSHOP
www.MoscowWorkshop.com


We do apologise if we have contacted you in error. Please use the delete
link if you wish to be removed from this list. Please delete.

------=_NextPart_000_0181_01C11A0E.069F6BE0--


From martin@loewis.home.cs.tu-berlin.de  Tue Jul 31 19:43:50 2001
From: martin@loewis.home.cs.tu-berlin.de (Martin v. Loewis)
Date: Tue, 31 Jul 2001 20:43:50 +0200
Subject: [XML-SIG] problems building pyXML for cygwin
In-Reply-To: <003e01c119e3$85d69df0$e0061812@CREST> (timbl@w3.org)
References: <003e01c119e3$85d69df0$e0061812@CREST>
Message-ID: <200107311843.f6VIhoD01239@mira.informatik.hu-berlin.de>

> I tried installing pyXML under cygwin.  Any ideas why it didn't?

Just a few days ago, Garth Kidd contributed a few patches for compiling
PyXML under Cygwin; they are in the SourceForge tracker at

http://sourceforge.net/tracker/index.php?func=detail&aid=445405&group_id=6473&atid=306473

He has also made a binary release available at

http://www.deadlybloodyserious.com/python/bdist/cygwin/

Hope this helps; I'll try to provide binary releases for use with
Cygwin starting with 0.7 (which is still some way off).

Regards,
Martin