From Anthony Baxter <arb@connect.com.au>  Fri Oct  4 01:53:28 1996
From: Anthony Baxter <arb@connect.com.au> (Anthony Baxter)
Date: Fri, 04 Oct 1996 10:53:28 +1000
Subject: [PYTHON DOC-SIG] Re: Dynamic Loading of modules on Solaris
In-Reply-To: Your message of "03 Oct 1996 10:37:04 -0400."
 <ydak9t83yi7.fsf@fargo.i-have-a-misconfigured-system-so-shoot-me>
References: <ydak9t83yi7.fsf@fargo.i-have-a-misconfigured-system-so-shoot-me>
Message-ID: <199610040053.KAA00457@jambu.off.connect.com.au>


>>> In message <ydak9t83yi7.fsf@fargo.i-have-a-misconfigured-system-so-shoot-me> Ben Hurwitz wrote

(gotta love that message-id :)

> Does dynamic loading of modules work on Solaris for Python-1.4beta3?
> I'm trying it unsuccessfully on Solaris 2.5.  At first I thought it
> was just the module that I wrote, but when I build the python-included
> modules as shared libraries, it doesn't work either.

Yes. Make sure

you're not using GNU ld (GNU ld under Solaris is broken - remove it
entirely.)
you link the .so with 'ld -G foomodule.so foomodule.o otherbit.o -lneededlib'
(substitute in real filenames and library names, of course)
Note that you don't need to link with -lPython, -LParser, or the like.


Hm, maybe DOC-SIG could come up with a page of 'how to dynamically link
a module on O/S Foo-ix' instructions...

=================
DOC-SIG  - SIG for the Python Documentation Project

send messages to: doc-sig@python.org
administrivia to: doc-sig-request@python.org
=================

From friedric@phoenix.net  Sat Oct 19 17:27:57 1996
From: friedric@phoenix.net (Robin Friedrich)
Date: Sat, 19 Oct 1996 11:27:57 -0500
Subject: [PYTHON DOC-SIG] New regex module (long)
References: <32684984.6152@phoenix.net> <3268BED1.15FB@aw.sgi.com>
Message-ID: <32690188.45CE@phoenix.net>

Ka-Ping Yee wrote in reply to email from me:
> 
> Hi, Robin.  Look out -- this is going to be long.
> 
> > I believe, as do others, that the Python regex module
> > needs a serious upgrade.
> 
> Yup.  I agree.
> 
> I mentioned to Guido a little while ago that a "text processing SIG"
> would be a good idea, especially as the DOC-SIG starts to use and need
> text-processing features more extensively.
> 
> > Ping's patches have helped me on occasion but I
> > can't code with them in mind if I want my stuff to be used by others. So
> > I will propose a development SIG at the workshop to come up with a plug
> > replacement for regexmodule.c such that Guido will be tempted to drop it
> > in by 1.5 (2.0 failing that).
> 
> Actually, i've already been working on exactly such a module for a
> little
> while now (ever since making those patches) -- i'm trying to do a
> complete
> rewrite of a regular expression module.  I haven't told anybody about it
> yet, because i wanted to get something working and to actually run some
> tests and speed comparisons to see if it's any good.  It isn't complete,
> and i'm still learning as i go (this is the first time i've tried to
> code
> regex functionality), but it is coming along reasonably well and i have
> lots of ideas that i am trying to incorporate.
> 
> But, since we want the same thing in the end, i might as well just stop
> keeping secrets and spill all the beans to you in the hope that i can
> at least make a useful contribution of concepts.  This letter will try
> to be a summary of all the ideas i've come up with so far.
> 
> My main goals (which seem to fit quite nicely with yours) are:
> 
>   (a) no global state (complete thread safety)
>   (b) almost-perfect compatibility with the existing regex module
>   (c) speed
>   (d) extra syntax features
> 
> (The ordering does not intend to express priority.)
> 
> I would like this module to supplant both regexmodule.c and pregex
> to a large degree, although it cannot entirely replace pregex for
> the reasons explained in (b).
> 
> (a) no global state
> 
> There is an apparent contradiction between (a) and the ability to choose
> variant syntaxes.  To deal with this, i've come up with a scheme where
> the most commonly-used syntax tables (currently perl, emacs, egrep,
> grep,
> awk, posix) are held static and referenced, but if the user requests to
> change individual flags, a new syntax table is created and kept within
> the compiled regex structure.
> 
> (My term for "compiled regex" is "pat", for "pattern".  I now use "prog"
> to mean just the program-bytecode part of the "pat" structure; a "pat"
> will contain other things like the syntax table and optimization data
> like a fastmap, etc.)
> 
> (b) almost-perfect compatibility with the existing regex module
> 
> "Almost-perfect" here means as close as possible, but i intend to fix
> obvious bugs in the implementation.  For example, as i pointed out on
> the Python list a little while ago, backreference matching does not
> backtrack correctly.  I think this should be fixed.  However, i feel
> that the new module should be made as compatible as possible in other
> respects, for if it isn't a "drop-in" replacement, it probably won't
> make it into Python version N.
> 
> > 1) Should this be based on or incorporate the pregexmodule that's out
> > there? or start from scratch with a non-POSIX approach?
> 
> For me, this is a compatibility issue.  As Jeffrey Friedl explained on
> the Python list, there are two quite different ways to select which
> match to return.  There is the POSIX way -- the longest overall match --
> and there is the way that the current regex module (along with Perl,
> Tcl,
> and others) do it, which is summarized in the Tcl manual by:
> 
>     1.  If a regular expression could match two different parts of an
>         input string, then it will match the one that begins earliest.
> 
>     2.  If a regular expression contains | operators then the leftmost
>         matching subexpression is chosen.
> 
>     3.  In *, +, and ? constructs, longer matches are chosen in
>         preference to shorter ones.
> 
>     4.  In sequences of expression components, the components are
>         considered from left to right.
> 
> in order of precedence.  I feel that this latter is the way to go.
> Although it is not POSIX compliant, it is the way programmers are
> used to dealing with regular expressions in most scripting languages
> (including current Python) and is somewhat more predictable than the
> POSIX leftmost-longest rule.  Because the two schemes can yield such
> different results, i think it's necessary to do things the latter way
> to maintain compatibility with the existing regex module.
> 
> (c) speed!
> 
> > 2) I feel performance vs perl is an issue. How far should we strive for
> > in speed?
> 
> I agree that performance versus Perl is an important issue.  Although
> we are not at war, it is true that Python and Perl can be considered
> "competing" languages, and a big way to find new Python users is to
> show Perl users what they can gain without having to admit a great
> deal that they will lose.  I don't like to be ashamed of Python.  So,
> how far should we strive?
> 
> Pretty far, i think.  My current timings show that the Python 1.4b3
> regex module handles "ordinary" (i.e. non-pathological; what i consider
> to be commonly-used kinds) regular expressions approximately 6 times
> slower than Perl does.  This is a serious hit for programmers (CGI among
> others), and so serious that i am almost afraid to tell anyone for fear
> it will make them quit Python.  Worse, i found that pregex is 2.5 times
> slower than *that*, which amounts to a some 15 times slower than Perl.
> I don't intend to offend the implementors of these modules, but i do
> feel that a speed improvement is needed.  And because of the room for
> improvement, people will get more excited if we say regexes are e.g.
> "3 times faster now" rather than "only half as fast as Perl's". :)
> 
> How can this be accomplished?  I haven't looked much at the Perl source.
> But here are some of my own thoughts on speed/efficiency issues:
> 
>     1.  A survey of a few scripts led me to discover that by far the
>         majority of regular expressions actually represent *very*
>         *simple* things and often don't require the full power of a
>         regular expression (well, according to my survey, anyway).
> 
>         This means that there are lots of opportunities for optimization
>         everywhere.  I think it makes sense to target the most common
>         idioms in regular expressions and make them as fast as possible
>         (even at the expense of expressions that fall outside these
>         special cases).
> 
>         My way of doing this is by introducing a whole host of extra
>         opcodes for very specific purposes, to reduce cycles through
>         the bytecode interpreter and give us more chances to run loops
>         in C.  For example, possibly the single most common idiom in
>         Perl regular expressions is "\s*" (slurp whitespace).  There
>         is almost never any need for backtracking because the next
>         thing in the regex is almost always non-whitespace.  So this
>         can be replaced with a single opcode which makes a C routine
>         skip all available whitespace, fast.
> 
>         This particular optimization method can be generalized: any
>         character class followed by a maximal-matching operator can
>         be converted into a single loop if the set of possible matches
>         following it does not intersect that character class.
> 
>     2.  With the similar goal of keeping execution in C as much as
>         possible, we can also incorporate extra regex features.
>         This will be dealt with below, but i wanted to point out that
>         it is also an important speed issue.  A single new feature,
>         which activates a compact and optimized C routine, can be
>         tremendously faster than the equivalent, complicated regular
>         expression as it would have had to have been expressed
>         without using the feature.
> 
>     3.  One of the biggest things that bothers me to no end about
>         Python is the immutability of strings.  It cripples you.
>         You can't pass around large files.  You can't cut up and
>         append long strings.  Every time you try, you have to wait
>         for the entire string to get duplicated.  This is awful,
>         and it means quadratic times for linear algorithms.
> 
>         So, one of my ideas has been to build regex functionality
>         into a new "mutable string" class which is capable of faster
>         appends and slices.  For programs which chew through files
>         and want to assemble the results piece by piece, i think this
>         could make a huge difference.

I think a new mutable string type could be very powerful as well.
Maybe exploring synergy with the new array module could bare fruit?
> 
>     4.  The regsub module does exactly that: chews through a string
>         and assembles the results piece by piece.  Because of the
>         immutability of strings, it's quadratic (arrghh!) which makes
>         it extremely slow for large strings.  (And global substitutions
>         on *big* strings is one of the abilities in which Perl excels.)
> 
>         Using mutable strings in regsub would speed it up a lot, but i
>         think the way to go is to do regsub in C once and for all.
> 
> (d) i.  extra regex syntax features
> 
> I have a few ideas on this front, too.  One of the most common
> comments i hear with regard to regular expressions goes something
> like, "yeah, regular expressions are pretty powerful, but they
> can't do everything.  For example, you can't do balanced matching
> in a regular expression."
> 
> Well, why not?  Even something as simple as a new metacharacter:
> 
>     \=xy    immediately match an "x" and skip until the balancing "y"
> 
> commonly used, perhaps, as \=() or \=[] or \={}, would be useful.
> A single use of this in a regular expression (implemented in a quick
> C loop) would easily beat any kind of scan/count/slice loop in Python
> -- or even Perl -- hands down.
> 
Yes, another good idea, but a lot of thought will be needed on this one.

> (As a side note, i now realize that '=' isn't a very good choice since
> it is a non-word character and it is nice to rely on all non-word
> characters being literal when they are backslashes.  So we should pick
> an appropriate letter of the alphabet, say -- \m() for "matching"?)
> 
> I added \l (letter) and \h (hex digit) to my regex patch because i
> find them commonly used and useful, too.  They may be worth considering.
> Also, the interpretation of class metacharacters like \w, \s, etc.
> within [bracketed] classes is also a great timesaver and should be
> on the list.
> 
> > 3) What are the perl5 features which might come in handy?
> 
> Minimal matching with ??, +?, and *? was the most-immediately-useful
> and easiest-to-implement feature, so i did that in my regex patch.
> This is a prime candidate.  Also very good, as Tom Christiansen
> recently pointed out, are the negative look-ahead assertions (?!...).
> 
> Perl purists may complain that we are copying or stealing their syntax,
> but i think there is no reason why we should confuse people who already
> know a perfectly good way of doing these things by making up our own.

Tom also mentioned that Larry closely examined Guido's object system
when implementing the perl5 object support so I don't feel guilty of
anything at all if we borrow as much good regex stuff as possible from
perl5. We will ignore any spurious perluddite comments.
(just tell them immitation is the sincerest form of flattery and leave
it at that;-)

> 
> (d) ii. extra syntax features on strings in general
> 
> Despite possible cries from Python users about it "looking like Perl"
> or "looking like Tcl", i think Python needs string interpolation.
> Although "foo %(spam)s bar" % vars() may be adequate, it's just too
> clumsy for people who are doing text processing.  It may increase
> the amount of typing you have to do by 30-50%, and also increase the
> opportunities for mistakes.
> 
> I'm currently thinking of a string interpolation scheme which is
> slightly more powerful than Perl's.  Python's consistency lets me
> describe it in just two rules.  For conciseness, i'll describe
> the syntax using regular expressions with the above extensions, and
> include rxb-like expressions for readability:
> 
>     1.  $\w+(\m[]|\m()|\.\w+|)*
> 
>         '$' + some(wordchars) + some(
>             either(balanced('[]'), balanced('()'), '.' +
> some(wordchars)))
> 
>     2.  $\m{}
> 
>         '$' + balanced('{}')
> 
Dollar signs sends shivers down my spine, but carry on.

> Either of the above things, when found, is replaced with its
> value when evaluated as a Python expression and then converted
> to a string with 'str' (not 'repr').  In all cases when you
> want to use a literal "[", "(", or "." immediately following
> an interpolation, you can just protect the interpolation with {}
> and nothing changes because it's already an expression.  To get
> a '$' you use '$$'.  These rules make it possible to write:
> 
>     a1. "Here is a $string."
> 
>     a2. "Here is a $module.member."
> 
>     a3. "Here is an $object.member."
> 
>     a4. "Here is a $functioncall(with, arguments)."
> 
>     a5. "Here is an ${arbitrary + expression}."
> 
>     a6. "Here is an $array[3] member."
> 
>     a7. "Here is a $dictionary['foo'] member."
> 
> This is, i think, about as compact as you can get and in my opinion
> very lucid.  The following are some comparisons, with typing length
> differences in parentheses (always relative to a1-a7 above).  The
> Perl equivalents are:
> 
>     b1. "Here is a $string."  (+0)
> 
>     b2. "Here is a $module::member."  (+1)
> 
>     b3. "Here is an $object->{member}."  (+3)
> 
>     b4. "Here is a " . &functioncall($with, $arguments) . "."  (+10)
> 
>     b5. "Here is an " . ($arbitrary + $expression) . "."  (+9)
> 
>     b6. "Here is an $array[3] member."  (+0)
> 
>     b7. "Here is a $dictionary{foo} member."  (-2)
> 
> In Perl 5 you can also do
> 
>     c4. "Here is an @{[&functioncall($with, $arguments)]}."  (+8)
> 
>     c5. "Here is an ${[$arbitrary + $expression]}."  (+4)
> 
> Finally, the current-Python equivalents, for comparison, are:
> 
>     d1. "Here is a " + string + "."  (+7)
>     e1. "Here is a %s." % string  (+4)
>     f1. "Here is a %(string)s." % vars()  (+12)
> 
>     d2. "Here is a " + str(module.member) + "." (+12)
>     e2. "Here is a %s." % module.member  (+4)
> 
>     e3. "Here is an %s." % object.member  (+4)
> 
>     e4. "Here is a %s." % functioncall(with, arguments)  (+4)
> 
>     e5. "Here is an %s." % (arbitrary + expression)  (+4)
> 
>     e6. "Here is an %s member." % array[3]  (+4)
> 
>     e7. "Here is a %s member." % dictionary['foo']  (+4)
> 
> Clearly, typing is not the be-all and end-all of a programming
> language!  But since the above data were easy to collect, i
> thought i'd present them.  You can see that my proposed syntax
> for interpolation wins almost every time (and the design of
> Python should take more credit for that than i do).  Moreover,
> i feel that it is a lot cleaner and more readable than either
> of the current alternatives in Python or Perl.  (I am ignoring
> Tcl on purpose, as it is even far clumsier at this.)

I think this will be the most controversial of your proposals
but well worth the debate. I'll include it as a definate topic
at the workshop.
> 
> So, given the syntax, what is the implementation?
> 
> Trying to force this into anything internal would just not
> work -- at least not initially.  I don't intend to change
> any Python syntax; i was thinking of putting this functionality
> into an "interpolation" module (just in Python to begin with,
> but later in a compiled C module).
> 
> What i had in mind was a class "Itpl" which, much like the
> rxb.Pattern class, holds only a string, but gives it special
> meaning just by virtue of its type.  Itpl.__repr__ would
> actually perform the interpolation.  So a casual user could:
> 
>     print Itpl("Here is a $string.")
> 
> Or the Itpl module could export a "printpl" function as shorthand:
> 
>     printpl("Here is a $string.")
> 
> Why encapsulate this behaviour in a class?  It means we can
> delay the interpolation from happening until the __repr__
> is actually requested.  When __repr__ is called, it should
> evaluate the expressions in its caller's local and global
> namespaces.
> 
> This is relevant to our current discussion because it means
> an Itpl instance could be passed as the replacement argument
> to a regsub, which makes it possible to evaluate real
> expressions for each match to yield the substitution string.
> (The regsub function would then simply define names in the
> Itpl's global name space to represent matching subgroups.)
> 
> This gives us Perl's /e option for evaluating expressions
> during regex substitutions, which is very powerful functionality
> that definitely should be available in Python.  Additionally,
> regsub should probably also accept a Python function object as
> the replacement argument, which it calls on each match.

I agree, (although in the function object case performance will
suffer and shouldn't be relied on too heavily).
> 
> -- Anyway, so there are all of my ideas i've been collecting
> so far on improving text processing in Python.  Combining
> everything in d(i) and d(ii) would give Python a great deal of
> text-processing power, and just might even place it in a
> position to seriously compete with Perl for a larger part of
> this application area.
> 
> > 4) Who would be the best people to work on this?
> > 5) How much work to you see for such an effort?
> 
> A lot of work, probably?  I don't know exactly.  These questions
> are probably the hardest to answer.
> 
> I wanted to keep working on this without committing to it
> in public because i'm not very sure about how much time i'll
> be able to devote to it.  I think this project would have a
> great deal of potential, though, and i'm very interested.
> Actually, i already feel better about it now that i've let
> go with all these ideas and contributed something.
> 
> Okay, well there it is.  I've been at this letter a *lot* longer
> than i'd planned, so i'd better go eat something.  Let me know
> what you think of these ideas, and also if you have issues in
> mind that i've missed.  (Even if you don't have time to respond to
> everything, i'd rather hear your thoughts in pieces.)
> 
> Thanks!
> 
> Ping
>                                  Developer, Alias|Wavefront Inc. (Tokyo)
> http://www.lfw.org/math/ brings math to the Web as easy as <se>?pi?</se>

Ok guys what do you think? I'll be recruiting for contributions of
ideas and coding on the newsgroup as well as at the workshop. I forsee
a major thread in the c.l.p and then we'll probably move it to a TEXT-SIG.
I know I'm tired of having to answer people about why we should use Python
when Perl is so much better at text processing. I'd love to remove that 
thorn from Python's side!  Python has too much going for it to be dismissed
with one easy criticism.

-Robin Friedrich

=================
DOC-SIG  - SIG for the Python Documentation Project

send messages to: doc-sig@python.org
administrivia to: doc-sig-request@python.org
=================

From kpyee@aw.sgi.com  Sun Oct 20 15:27:03 1996
From: kpyee@aw.sgi.com (Ka-Ping Yee)
Date: Sun, 20 Oct 1996 23:27:03 +0900
Subject: [PYTHON DOC-SIG] String interpolation module
References: <32684984.6152@phoenix.net> <3268BED1.15FB@aw.sgi.com> <32690188.45CE@phoenix.net>
Message-ID: <326A36B7.31DF@aw.sgi.com>

This is a multi-part message in MIME format.

--------------237C2F1C7DE1
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit

I wrote:
> I'm currently thinking of a string interpolation scheme
[...]

Robin Friedrich wrote:
> 
> Dollar signs sends shivers down my spine, but carry on.

Perhaps, but i really don't see much other alternative.
Can you think of something better?

> I think this will be the most controversial of your proposals
> but well worth the debate. I'll include it as a definate topic
> at the workshop.

Thanks.  Yes, i imagine it is controversial because it makes
Python look a little like other languages.  But i don't think
that this reaction should overshadow its utility.  Comparing

    "Here is an @{[&functioncall($with, $arguments)]}." (Perl 5)

to

    "Here is a $functioncall(with, arguments)."  (Python + Itpl)

should help to show people that interpolation does NOT necessarily
mean riddling the entire language with punctuation.  We're talking
one and only one new special character here.

To back up my proposal and give everyone (including myself!) a
chance to actually try this out and see how it feels, i've coded
up the Itpl module as i described.  Because it is so short
(90 lines) i've decided to attach the module to this letter.
I hope that subscribers to this list don't mind.

The module contains the "Itpl" class and three functions:

    printpl(str): interpolate and print the string 'str'

    filter(file): wrap 'file' in an interpolating filter for
        the 'write' method and return a new file-like object

    unfilter(itplfile): unwrap the filtered 'itplfile' and
        return the original file object

By default, filter and unfilter operate on "sys.stdout".

This allows you to do:

    Python 1.4b3 (Oct  3 1996) [C]
    Copyright 1991-1996 Stichting Mathematisch Centrum, Amsterdam
    >>> from Itpl import printpl
    >>> x = 3
    >>> y = ['spam', (4, 6), 'eggs']
    >>>
    >>> printpl("The number of the count shall be $x.")
    The number of the count shall be 3.
    >>> printpl("This list contains $y[0], $y[1], and $y[2].")
    This list contains spam, (4, 6), and eggs.

Or, to make 'print' temporarily assume this functionality:

    Python 1.4b3 (Oct  3 1996) [C]
    Copyright 1991-1996 Stichting Mathematisch Centrum, Amsterdam
    >>> import Itpl, sys
    >>> sys.stdout = Itpl.filter()
    >>> z = 'python'
    >>>
    >>> print "$sys has been imported."
    <module 'sys'> has been imported.
    >>> print "$sys.stdout"
    <interpolated <open file '<stdout>', mode 'w' at 10051700>>
    >>> print "$z[1:]$z[0]ay"
    ythonpay
    >>> sys.stdout = Itpl.unfilter()
    >>> print "$phew, $back $to $normal."
    $phew, $back $to $normal.

Notice how Python's string slicing gives us a bit more convenience
than Perl in generating Pig Latin. :)  Anyway, i'd be very grateful
if you could try it out and send me your comments.


Ping
                                 Developer, Alias|Wavefront Inc. (Tokyo)
http://www.lfw.org/math/ brings math to the Web as easy as <se>?pi?</se>

--------------237C2F1C7DE1
Content-Type: text/plain; charset=us-ascii; name="Itpl.py"
Content-Transfer-Encoding: 7bit
Content-Disposition: inline; filename="Itpl.py"

import sys, string
from types import StringType
from tokenize import tokenprog

ItplError = "ItplError"
class Itpl:
    def __init__(self, fmt):
        if type(fmt) != StringType:
            raise TypeError, "needs string initializer"

        namechars = 'abcdefghijklmnopqrstuvwxyz' \
            'ABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789_';
        chunks = []
        pos = 0

        try:
            while 1:
                dollar = string.index(fmt, '$', pos)
                nextchar = fmt[dollar+1]

                if nextchar == '{':
                    chunks.append((0, fmt[pos:dollar]))
                    pos, level = dollar+2, 1
                    while level:
                        pos = pos + tokenprog.match(fmt, pos)
                        tstart, tend = tokenprog.regs[3]
                        token = fmt[tstart:tend]
                        if token == '{': level = level+1
                        elif token == '}': level = level-1
                    chunks.append((1, fmt[dollar+2:pos-1]))

                elif nextchar in namechars:
                    chunks.append((0, fmt[pos:dollar]))
                    pos = dollar + 1
                    pos = pos + tokenprog.match(fmt, pos)
                    while pos < len(fmt):
                        if fmt[pos] == '.' and \
                            pos+1 < len(fmt) and fmt[pos+1] in namechars:
                            pos = pos+2
                            pos = pos + tokenprog.match(fmt, pos)
                        elif fmt[pos] in '([':
                            pos, level = pos+1, 1
                            while level:
                                pos = pos + tokenprog.match(fmt, pos)
                                tstart, tend = tokenprog.regs[3]
                                token = fmt[tstart:tend]
                                if token[0] in '([': level = level+1
                                elif token[0] in ')]': level = level-1
                        else: break
                    chunks.append((1, fmt[dollar+1:pos]))

                else:
                    chunks.append((0, fmt[pos:dollar+1]))
                    pos = dollar + 1 + (nextchar == '$')

        except TypeError:       # token regex did not match, regs[] failed
            import traceback
            traceback.print_exc()
            raise ItplError, "unfinished expression"
        except ValueError:      # index did not find a dollar sign
            pass

        if pos < len(fmt): chunks.append((0, fmt[pos:]))
        self.chunks = chunks

    def __repr__(self, prog=None):
        try: 1/0
        except: frame = sys.exc_traceback.tb_frame

        while frame.f_globals['__name__'] == name: frame = frame.f_back
        loc, glob = frame.f_locals, frame.f_globals

        result = []
        for live, chunk in self.chunks:
            if live: result.append(str(eval(chunk, loc, glob)))
            else: result.append(chunk)

        return string.join(result, '')

def printpl(str): print Itpl(str)
def itpl(str): return repr(Itpl(str))

class Itplfile:
    def __init__(self, file): self.file = file
    def __repr__(self): return '<interpolated ' + repr(self.file) + '>'
    def __getattr__(self, attr): return getattr(self.file, attr)
    def write(self, str): self.file.write(repr(Itpl(str)))

def filter(file=sys.stdout): return Itplfile(file)
def unfilter(ifile=None): return ifile and ifile.file or sys.stdout.file

--------------237C2F1C7DE1--


=================
DOC-SIG  - SIG for the Python Documentation Project

send messages to: doc-sig@python.org
administrivia to: doc-sig-request@python.org
=================

From kpyee@aw.sgi.com  Sun Oct 20 15:55:10 1996
From: kpyee@aw.sgi.com (Ka-Ping Yee)
Date: Sun, 20 Oct 1996 23:55:10 +0900
Subject: [PYTHON DOC-SIG] New regex module (long)
References: <32684984.6152@phoenix.net> <3268BED1.15FB@aw.sgi.com> <32690188.45CE@phoenix.net>
Message-ID: <326A3D4E.4487@aw.sgi.com>

Robin Friedrich wrote:
> I forsee
> a major thread in the c.l.p and then we'll probably move it to a TEXT-SIG.
> I know I'm tired of having to answer people about why we should use Python
> when Perl is so much better at text processing. I'd love to remove that
> thorn from Python's side!  Python has too much going for it to be dismissed
> with one easy criticism.

Agreed.  I'm glad this group will be forming.  It's only too bad i can't
meet you at the conference.


Ping
                                 Developer, Alias|Wavefront Inc. (Tokyo)
http://www.lfw.org/math/ brings math to the Web as easy as <se>?pi?</se>


=================
DOC-SIG  - SIG for the Python Documentation Project

send messages to: doc-sig@python.org
administrivia to: doc-sig-request@python.org
=================

From friedric@rose.rsoc.rockwell.com  Sun Oct 20 18:43:35 1996
From: friedric@rose.rsoc.rockwell.com (Robin Friedrich)
Date: Sun, 20 Oct 1996 12:43:35 -0500
Subject: [PYTHON DOC-SIG] String interpolation module
Message-ID: <199610201743.MAA13773@darwin.rsoc.rockwell.com>

|> From: Ka-Ping Yee <kpyee@aw.sgi.com>
|> Organization: Alias|Wavefront Inc.
|> To: friedric@phoenix.net
|> Cc: doc-sig@python.org
|> Subject: [PYTHON DOC-SIG] String interpolation module
|> 
|> I wrote:
|> > I'm currently thinking of a string interpolation scheme
|> [...]
|> 
|> Robin Friedrich wrote:
|> > 
|> > Dollar signs sends shivers down my spine, but carry on.
|> 
|> Perhaps, but i really don't see much other alternative.
|> Can you think of something better?

No, that's why I said carry on:-)  I'll get over it.

|> 
|> > I think this will be the most controversial of your proposals
|> > but well worth the debate. I'll include it as a definate topic
|> > at the workshop.
|> 
|> Thanks.  Yes, i imagine it is controversial because it makes
|> Python look a little like other languages.  But i don't think
|> that this reaction should overshadow its utility.  Comparing
|> 
|>     "Here is an @{[&functioncall($with, $arguments)]}." (Perl 5)
|> 
|> to
|> 
|>     "Here is a $functioncall(with, arguments)."  (Python + Itpl)
|> 
|> should help to show people that interpolation does NOT necessarily
|> mean riddling the entire language with punctuation.  We're talking
|> one and only one new special character here.

Good point. 
|> 
|> To back up my proposal and give everyone (including myself!) a
|> chance to actually try this out and see how it feels, i've coded
|> up the Itpl module as i described.  Because it is so short
|> (90 lines) i've decided to attach the module to this letter.
|> I hope that subscribers to this list don't mind.

This is one of the many unsung beauties of Python. We can so easily
explore new ideas for improving the language. First we just whip up
something in a python implementation and hash it through until the
design is right and then go ahead and recast it in C if needed for
speed!

I played a little with the Itpl and needed to make the following
change... line 70
while frame.f_globals['__name__'] == name: frame = frame.f_back
to
while frame.f_globals['__name__'] == __name__: frame = frame.f_back

I'm not sure how it ever worked for you. 

Once that's done it worked fine.  I like it alot!

Anyone else have an opinion?    (well I guess this IS the weekend...)

-Robin

=================
DOC-SIG  - SIG for the Python Documentation Project

send messages to: doc-sig@python.org
administrivia to: doc-sig-request@python.org
=================

From kpyee@aw.sgi.com  Mon Oct 21 03:16:16 1996
From: kpyee@aw.sgi.com (Ka-Ping Yee)
Date: Mon, 21 Oct 1996 11:16:16 +0900
Subject: [PYTHON DOC-SIG] String interpolation module
References: <199610201743.MAA13773@darwin.rsoc.rockwell.com>
Message-ID: <326ADCF0.2781@aw.sgi.com>

Robin Friedrich wrote:
> 
> This is one of the many unsung beauties of Python. We can so easily
> explore new ideas for improving the language.

Yes, absolutely.  I love being able to try things out this fast.

> I played a little with the Itpl and needed to make the following
> change... line 70
> while frame.f_globals['__name__'] == name: frame = frame.f_back
> to
> while frame.f_globals['__name__'] == __name__: frame = frame.f_back
> 
> I'm not sure how it ever worked for you.

Apologies to everyone!  I attached the wrong copy.  The script
(with this correction) is available now on my web site of collected
Python tidbits at http://www.lfw.org/python/.
 
> Once that's done it worked fine.  I like it alot!

Thanks.
 
> Anyone else have an opinion?    (well I guess this IS the weekend...)

I'll be watching... :)


Ping
                                 Developer, Alias|Wavefront Inc. (Tokyo)
http://www.lfw.org/math/ brings math to the Web as easy as <se>?pi?</se>


=================
DOC-SIG  - SIG for the Python Documentation Project

send messages to: doc-sig@python.org
administrivia to: doc-sig-request@python.org
=================

From ikds@fashion-forum.de  Wed Oct 23 13:18:32 1996
From: ikds@fashion-forum.de (Marc-Andre Lemburg)
Date: Wed, 23 Oct 1996 14:18:32 +0200
Subject: [PYTHON DOC-SIG] String interpolation module
References: <32684984.6152@phoenix.net> <3268BED1.15FB@aw.sgi.com> <32690188.45CE@phoenix.net> <326A36B7.31DF@aw.sgi.com>
Message-ID: <326E0D18.7B11@fashion-forum.de>

Hi !

I just tried your Itpl class. Looks like a nice approach in general,
although the implementation does have it's quirks (jumping through
frames and stuff like that).

Here two suggestions I'd like to make:

1. Why not use a syntax that would also include on-the-fly
   formatting (e.g. something resembling '%2.2f'%y) ? I would
   suggest using:

   "global optimum found at x = %2.2f{calc_opt(params)}"

   which could be done in your syntax with:

   "global optimum found at x = ${'%2.2f'%calc_opt(params)}"

    That way we wouldn't leave the tracks of using C style
    %2i too much, while still having all the goodies of doing
    inline evaluation.

    The syntax would then be:
       
      %{some + expression}  ... having no formatting string

    and

      %2.2f{some + calculations} ... using %2.2f as formatting string

    (i.e. you can look for %{...} while parsing the string)

2. Wouldn't it be easier to just convert the above strings to
   normal Python syntax and then letting Python evaluate the whole
   string%tupel, rather than doing the whole parsing by yourself ?

cheers, Marc
-----------------------------------------------------------------------
lemburg@informatik.uni-koeln.de                      Marc-Andre Lemburg

=================
DOC-SIG  - SIG for the Python Documentation Project

send messages to: doc-sig@python.org
administrivia to: doc-sig-request@python.org
=================

From jim.fulton@digicool.com  Wed Oct 23 14:19:00 1996
From: jim.fulton@digicool.com (Jim Fulton)
Date: Wed, 23 Oct 1996 09:19:00 -0400
Subject: [PYTHON DOC-SIG] Document Templates
Message-ID: <326E1B44.13BB@digicool.com>

We have developed a module named DocumentTemplate that we will
be releasing at the Python conference in a couple of weeks:

Many Web based applications must provide the capability for customers
to be able to edit the look and feel of the application's Web pages.
Futhermore, our experience has been that mixing Python and Hypertext
Markup Language (HTML) together makes both less readable. We have
developed a Python module named DocumentTemplate that allows documents
to be created with embedded markup that causes data from Python objects
to be inserted into documents when they are output from an application.
The data insertion goes beyond simple insertion of data with C-language
formats to include:

 - Insertion of data with application-defined formats, such as 
   date-time formats,

 - Automatic evaluation of python functions and methods,

 - Nested documented templates, 

 - Conditional insertion,

 - Iterative insertion, by iterating over python sequences, including
   support for browse by batch, and

 - Automatic computation of summary statistics when iterating over   
sequences.

See http://www.digicool.com/papers/DocumentTemplate.html
for more details.

Jim

-- 
Jim Fulton         Digital Creations
jim@digicool.com   540.371.6909
## Python is my favorite language ##
##     http://www.python.org/     ##

=================
DOC-SIG  - SIG for the Python Documentation Project

send messages to: doc-sig@python.org
administrivia to: doc-sig-request@python.org
=================

From paul@digicool.com  Wed Oct 23 15:38:47 1996
From: paul@digicool.com (Paul Everitt)
Date: Wed, 23 Oct 1996 09:38:47 -0500
Subject: [PYTHON DOC-SIG] Document Templates
References: <326E1B44.13BB@digicool.com>
Message-ID: <326E2DF7.314@digicool.com>

I'd like to mention that this is certainly used beyond HTML.  For
instance, most of our apps send e-mail in the classic "Feedback Form"
genre.

The form that is actually sent is a DocumentTemplate that looks like an
SMTP-formatted mail message.  It is editable through-the-web, obeys
access control restrictions, and supports "acquisition" of container
attributes.  All in the same way described for HTML.

Ostensibly e-mail is not the only use beyond HTML for
DocumentTemplates.  We've also used them for Web-editing of config
files.

Some on this SIG might not have heard the background behind all this. 
We have been working on, and are releasing at the workshop, an object
publishing environment called Bobo:

   http://www.digicool.com/papers/PythonObjectPublisher.html

Benefits:

  o Applications do not have to include code for interfacing with the
web server. 
  o Applications can be moved from one publishing mechanism, such as
CGI, to another mechanism, such as Fast CGI or ILU Requestor, with no
change. 
  o Python objects are published as Python objects. The web server
"calls" the objects in much the same way that other
  o Python objects would. 
  o Automatic conversion of URL to object/subobject traversal. 
  o Automatic marshaling of form data, cookie data, and request
meta-data to Python function arguments. 
  o Automated exception handling. 
  o Automatic generation of CGI headers. 
  o Automated authentication and authorization using a rich mechanism. 
  o With a name like Bobo, it's gotta be good.

It is tightly integrated with DocumentTemplate and our nested lazy
persistence, as well as other infrastructure we're building.

--Paul

-- 
Paul Everitt       Digital Creations
paul@digicool.com  540.371.6909
## Python is my favorite language ##
##     http://www.python.org/     ##

=================
DOC-SIG  - SIG for the Python Documentation Project

send messages to: doc-sig@python.org
administrivia to: doc-sig-request@python.org
=================

From jim.fulton@digicool.com  Wed Oct 23 14:41:42 1996
From: jim.fulton@digicool.com (Jim Fulton)
Date: Wed, 23 Oct 1996 09:41:42 -0400
Subject: [PYTHON DOC-SIG] Document Templates
References: <326E1B44.13BB@digicool.com> <326E2DF7.314@digicool.com>
Message-ID: <326E2096.3AEF@digicool.com>

Paul Everitt wrote:
> 
> I'd like to mention that this is certainly used beyond HTML.  For
> instance, most of our apps send e-mail in the classic "Feedback Form"
> genre.

We also use DocumentTemplates to generate SQL in a database access
application.

Jim

-- 
Jim Fulton         Digital Creations
jim@digicool.com   540.371.6909
## Python is my favorite language ##
##     http://www.python.org/     ##

=================
DOC-SIG  - SIG for the Python Documentation Project

send messages to: doc-sig@python.org
administrivia to: doc-sig-request@python.org
=================

From jim.fulton@digicool.com  Wed Oct 23 15:03:53 1996
From: jim.fulton@digicool.com (Jim Fulton)
Date: Wed, 23 Oct 1996 10:03:53 -0400
Subject: [PYTHON DOC-SIG] New StructuredText module
Message-ID: <326E25C9.74EE@digicool.com>

I've updated the StructuredText module at:

  http://www.digicool.com/jim/python/StructuredText.html

to fix a couple of bugs:

  - Titles in descriptive lists did not get character tags inserted.

  - Example text did get character tags inserted, and shouldn't have.

I also added a function to insert interbal links for [references] and
to turn absolute URLs that are preceeded by white-space into anchors.

Jim

-- 
Jim Fulton         Digital Creations
jim@digicool.com   540.371.6909
## Python is my favorite language ##
##     http://www.python.org/     ##

=================
DOC-SIG  - SIG for the Python Documentation Project

send messages to: doc-sig@python.org
administrivia to: doc-sig-request@python.org
=================

From da@maigret.cog.brown.edu  Wed Oct 23 18:21:12 1996
From: da@maigret.cog.brown.edu (David Ascher)
Date: Wed, 23 Oct 1996 13:21:12 -0400 (EDT)
Subject: [PYTHON DOC-SIG] Re: [PYTHON MATRIX-SIG] Alpha release of new tutorial for 1.0a5
In-Reply-To: <199610231704.NAA22149@maigret> from "Geoffrey Furnish" at Oct 23, 96 10:04:22 am
Message-ID: <199610231721.NAA22272@maigret>

In my announcement of the alpha tutorial for the numeric extension, I
wrote:

>  > And there are HTML, DVI, TeX, Info, Text and PS versions (TIM is
>  > pretty cool really!).

Which prompted Geoffrey Furnish to ask in email:
 
> BTW, what is TIM?  Can you give me a brief low-down on it?  Can it do
> math like tex/latex?

TIM is a tool developed by Bill Janssen as part of the ILU project at
Xerox PARC, but independent of ILU.  Some documentation can be found at:

ftp://ftp.parc.xerox.com/pub/ilu/2.0/20a8-manual-html/manual_17.html#SEC275

The key sentence is:

  TIM is essentially a superset of the GNU texinfo language, version
  2.  It adds several features to allow more precise discrimination of
  semantics when documenting software systems. You should be familiar
  with the basic texinfo system first.

I am still a novice at it, so I'm not sure about it possibilities.  I
suspect that one can insert TeX commands in it, and they'll come out
fine in the TeX mode.  I suspect also that they won't come out at all in
HTML mode.  

I chose it because it seemed reasonable.  I'm probably using very few
features of TIM which aren't part of Texinfo.  I also thought that
since we're looking for a common documentation format, this could be
somewhat of a test for TIM.

For most of the python modules, I think TIM would be good enough.
For the scientific extensions which are bound to arrive in droves =),
it'd be nice to have one which allowed math like LaTeX, but also
allowed it to be displayed in HTML form.  This makes me think that an
interface between a markup language like TIM's and a system like
Ka-Ping Yee's system would be nice.  It's a big project though, and one
I'm not volunteering for.

Thoughts anyone?

--david

[I'm sending this to matrix-sig, doc-sig and Bill Janssen and Ping
since they all seem relevant.  Maybe followups should just go to
doc-sig?].

PS: One of the reasons I like TIM is that it seems it shouldn't be too
    hard to do a reasonable TIM->SGML conversion someday, when the
    tools become available.

=================
DOC-SIG  - SIG for the Python Documentation Project

send messages to: doc-sig@python.org
administrivia to: doc-sig-request@python.org
=================

From fdrake@CNRI.Reston.Va.US  Wed Oct 23 18:31:49 1996
From: fdrake@CNRI.Reston.Va.US (Fred L. Drake)
Date: Wed, 23 Oct 1996 13:31:49 -0400 (EDT)
Subject: [PYTHON DOC-SIG] Re: [PYTHON MATRIX-SIG] Alpha release of new tutorial for 1.0a5
In-Reply-To: <199610231721.NAA22272@maigret> from "David Ascher" at Oct 23, 96 01:21:12 pm
Message-ID: <199610231731.NAA26615@weyr.CNRI.Reston.Va.US>

> PS: One of the reasons I like TIM is that it seems it shouldn't be too
>     hard to do a reasonable TIM->SGML conversion someday, when the
>     tools become available.

  So I'm not the only SGML enthusiast here.  (Hooray!)

  Are there any reasons not to switch directly to DocBook?  There
would definately be a need for a Python Documentation Author's Guide,
but that's ok.


  -Fred

--
Fred L. Drake, Jr.
fdrake@cnri.reston.va.us
Corporation for National Research Initiatives
1895 Preston White Drive
Reston, VA    20191-5434

=================
DOC-SIG  - SIG for the Python Documentation Project

send messages to: doc-sig@python.org
administrivia to: doc-sig-request@python.org
=================

From jim.fulton@digicool.com  Mon Oct 28 16:24:16 1996
From: jim.fulton@digicool.com (Jim Fulton)
Date: Mon, 28 Oct 1996 11:24:16 -0500
Subject: [PYTHON DOC-SIG] Re: Q: Structured Text markup doesn't span lines
References: <327256BE.2158@digicool.com>
Message-ID: <3274DE30.5E94@digicool.com>

Paul Everitt wrote:
> 
> If I start emphasis with a * and end it several lines later, it doesn't
> get translated.  Even worse:
> 
>    Example Text -- Say this is some glossary-type list that is
>                    fairly heavily indented.  This can cause *strange
>                    things* to happen when you M-q to re-fill.
> 
> In the above, strange things is not in italics after StructuredText is
> run.  It is hard to control the break if you have more than one word in
> italics, as M-q may put a break in for you.

Hm.  This limitation was intentional.  At the time I implemented this, I
didn't trust the rule for recognizing character tabs to span multiple
lines.  It would probably be safe to extend the search to the entire
paragraph, however.


Jim

-- 
Jim Fulton         Digital Creations
jim@digicool.com   540.371.6909
## Python is my favorite language ##
##     http://www.python.org/     ##

=================
DOC-SIG  - SIG for the Python Documentation Project

send messages to: doc-sig@python.org
administrivia to: doc-sig-request@python.org
=================

From friedric@rose.rsoc.rockwell.com  Mon Oct 21 15:41:20 1996
From: friedric@rose.rsoc.rockwell.com (Robin Friedrich)
Date: Mon, 21 Oct 1996 09:41:20 -0500
Subject: [PYTHON DOC-SIG] Re: [PYTHON MATRIX-SIG] Where to go (was Numerical Recipes)
Message-ID: <199610211441.JAA00648@darwin.rsoc.rockwell.com>

|> From: da@maigret.cog.brown.edu (David Ascher)
|> Subject: Re: [PYTHON MATRIX-SIG] Where to go (was Numerical Recipes)
|> To: jhauser@ifm.uni-kiel.de (janko hauser)
|> Date: Sun, 20 Oct 1996 15:30:32 -0400 (EDT)
|>  
|> Clearly this would make most sense if all Python docs were available in
|> a TIM-like format.  Maybe it's time to ask the DOC-SIG what's up.  The
|> Reference manual has been converted to FrameMaker, which might actually
|> allow conversion to SGML and from there TIM?  The other manuals were
|> last I checked available in info format (in fact it's with the library
|> reference in mind that I wrote the `ihelp' module, when some users were
|> complaining that Python had lousy help facilities compared to
|> matlab...).
|> 
|> --david
|> 

It should be made clear that the Language Reference and Tutorial
are the only docs converted to FrameMaker. The Lib Ref is still
up in the air as to format. The last view I heard was that it should
be written in a very easy system that's text-based and assembled
into it's entirety with some higher level tools. TIM looks to be the
odds on favorite here but there needs to be a whole lot more 
concurrence among module writers. The structured text standard
for doc strings needs to be considered and incorporated as well.
A TIM formatter for gendoc is an obvious approach.

=================
DOC-SIG  - SIG for the Python Documentation Project

send messages to: doc-sig@python.org
administrivia to: doc-sig-request@python.org
=================

From da@maigret.cog.brown.edu  Wed Oct 23 18:21:12 1996
From: da@maigret.cog.brown.edu (David Ascher)
Date: Wed, 23 Oct 1996 13:21:12 -0400 (EDT)
Subject: [PYTHON DOC-SIG] Re: [PYTHON MATRIX-SIG] Alpha release of new tutorial for 1.0a5
In-Reply-To: <199610231704.NAA22149@maigret> from "Geoffrey Furnish" at Oct 23, 96 10:04:22 am
Message-ID: <199610231721.NAA22272@maigret>

In my announcement of the alpha tutorial for the numeric extension, I
wrote:

>  > And there are HTML, DVI, TeX, Info, Text and PS versions (TIM is
>  > pretty cool really!).

Which prompted Geoffrey Furnish to ask in email:
 
> BTW, what is TIM?  Can you give me a brief low-down on it?  Can it do
> math like tex/latex?

TIM is a tool developed by Bill Janssen as part of the ILU project at
Xerox PARC, but independent of ILU.  Some documentation can be found at:

ftp://ftp.parc.xerox.com/pub/ilu/2.0/20a8-manual-html/manual_17.html#SEC275

The key sentence is:

  TIM is essentially a superset of the GNU texinfo language, version
  2.  It adds several features to allow more precise discrimination of
  semantics when documenting software systems. You should be familiar
  with the basic texinfo system first.

I am still a novice at it, so I'm not sure about it possibilities.  I
suspect that one can insert TeX commands in it, and they'll come out
fine in the TeX mode.  I suspect also that they won't come out at all in
HTML mode.  

I chose it because it seemed reasonable.  I'm probably using very few
features of TIM which aren't part of Texinfo.  I also thought that
since we're looking for a common documentation format, this could be
somewhat of a test for TIM.

For most of the python modules, I think TIM would be good enough.
For the scientific extensions which are bound to arrive in droves =),
it'd be nice to have one which allowed math like LaTeX, but also
allowed it to be displayed in HTML form.  This makes me think that an
interface between a markup language like TIM's and a system like
Ka-Ping Yee's system would be nice.  It's a big project though, and one
I'm not volunteering for.

Thoughts anyone?

--david

[I'm sending this to matrix-sig, doc-sig and Bill Janssen and Ping
since they all seem relevant.  Maybe followups should just go to
doc-sig?].

PS: One of the reasons I like TIM is that it seems it shouldn't be too
    hard to do a reasonable TIM->SGML conversion someday, when the
    tools become available.

=================
DOC-SIG  - SIG for the Python Documentation Project

send messages to: doc-sig@python.org
administrivia to: doc-sig-request@python.org
=================