Who's minister of propaganda this week?

Michael Chermside mcherm_python at yahoo.com
Sun Mar 18 12:49:20 EST 2001


First of all, I want to thank Alex for engaging me in this discussion...
if nothing else, I'm sure to learn something before we're through.


Alex Martelli wrote:

> "Michael Chermside" <mcherm_python at yahoo.com> wrote in message
> news:mailman.984619039.7856.python-list at python.org...
> 
>> Alex Martelli wrote:
>>          ... [snip]...
>>   > So, all the compile-time checking is buying is catching (a small
>>   > subset of) the errors that would be caught in testing anyway, a
>>   > little bit earlier (thus, a little bit cheaper) -- it's never the
>>   > case that one has to write a test which would not be needed at
>>   > all if type-checking was static, since "the object has the right
>>   > type" is a small subcase of "the object _behaves_ per the specs".
>>   >
>> I'm really not sure I see it this way. If the method  foo(x) is known to
>> take a FancyDateObject
> 
> Assume that is an abstract interface (no gain in terms of
> functionality assurance if it's concrete) and (without loss
> of generality) that it has two methods First and Second that
> foo uses (it may have others that foo is ignoring, but they
> don't affect the following argument).  OK so far?  Good.


Yep... ok so far

> 
>> there are three kinds of errors we could make. One is that foo() is
>> written badly so it
>> doesn't do what it's supposed to. The unit tests of foo() need to guard
>> against this.
> 
> Right.  Specifically, they'll test the combinations of calls
> to First and Second methods of the x argument that foo needs
> to actually perform, say in certain cases First only, in
> others Second only, in others yet, First then Second (if foo
> never needs to call Second before, and First after, on its
> argument, then its unit-tests will not exercise that path).
> 
>> Another possible error is that FancyDateObject isn't written properly.
> 
> As I assumed this is an interface, let's say you're talking
> about some specific implementation thereof -- FDO_impl1, say.
> 
>> The unit tests
>> of FancyDateObject need to guard against this.
> 
> Right again -- specifically, they'll test that FDO_impl1's
> implementations of First and Second support the call patterns
> that the specification demands.  For example, if that is what
> the specs say, the implementation will work fine if only First
> is called, or if only Second is called, or if Second is called
> before and _then_ First is called -- 'First before, Second after'
> may not be in the specs and thus doesn't get exercised by unit
> tests.


Okay... we're still together here.

> 
>> And the third type of
>> error is that
>> that somewhere where we CALL foo(), we might pass it a DateObject
>> instead... or
>> even a String... which will cause it to behave incorrectly. To guard against
> 
> More generally: we might erroneously pass to foo some object that
> does not even _implement_ First and Second methods with acceptable
> signatures, or some that _does_ implement the methods with signatures
> that appear good BUT not with the semantics that foo needs -- e.g.,
> it does not let First be called earlier and Second later (because
> there is a mismatch in semantics specs between the specs that foo
> requires and the specs that x actually ensures -- just as in the
> other cases, regarding existence or signature of the methods).
> 
> No compiler warns you at compile-time against all, or even _most_,
> errors of this very common kind.  (The existence of more than one
> method is not needed to have this behavior -- just a single method
> suffices to exhibit this error, e.g. it could take an argument i
> with a prereq of i>23 and be called with i==23 by foo; or foo might
> call it 7 times when the semantics specify it must be called no
> more than 6 times; etc, etc).
> 
> Another way to express it: an interface is not just, not even
> _mostly_, about existence and signature of methods -- it's mostly
> about prereq's, post-conditions, and invariants; and nobody can
> check those at compile-time in enough cases to make a difference
> to your software's reliability.
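To make Alex's point concrete before I respond -- here's a quick sketch (the names are my own invention, not from the thread). The call below is perfectly well-typed, an int passed where an int is expected, yet it violates the callee's semantic precondition, which no compiler can see:

```python
# Type-correct but contract-breaking: the precondition i > 23 lives
# only in the semantics, invisible to any static type-checker.

def narrow_op(i):
    # Contract (a comment as far as any compiler is concerned): requires i > 23.
    assert i > 23, "precondition violated: need i > 23"
    return i - 23

def foo():
    # An int where an int is expected -- statically fine, semantically wrong.
    return narrow_op(23)
```

Only a test that actually exercises this call path would catch it.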


Okay, this is an interesting point.

First of all, I completely agree that telling whether an object has
the right method names and signatures (an "interface" in the Java
sense) is only a very limited subset of defining the signatures,
prerequisites, post-conditions, and invariants. Eiffel allows
something like this... let's call the idea a "contract". I
think it would be GREAT if we could specify our strict typing in terms
of "contracts" instead of the more limited "interfaces".
Compilers could NOT be expected to verify at compile time that the
prerequisites were being satisfied, but they could at LEAST
verify that the object being used claimed to implement that
contract.
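Here's a rough sketch of what I mean by checking a "contract" at run time -- all the names are invented for illustration. The interface half (does .advance exist?) is what a Java-style compiler verifies; the precondition/postcondition half, the bulk of the contract, can only be checked when the code runs:

```python
# A contract wrapper: checks a precondition before the call and a
# postcondition after it, Eiffel-style, but by hand in Python.

def contract_checked(advance):
    def checked(self, n):
        assert n >= 0, "precondition: n must be non-negative"
        before = self.day
        result = advance(self, n)
        assert self.day >= before, "postcondition: date must not go backward"
        return result
    return checked

class SimpleDate:
    def __init__(self, day):
        self.day = day

    @contract_checked
    def advance(self, n):
        self.day += n
        return self.day
```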

> So, statically-typed languages make a LOT of the not-very-important
> issues of method-existence and signature -- because those issues
> are THE ones they can check statically, not because their importance
> (wrt the importance of real semantics issues, the programming-by-
> contract parts of the interface) warrants special attention.  It's
> like the drunkard who was looking for his housekeys at one end of
> the street, opposite to the end where he had dropped them, because
> the end he was searching at was the one that a streetlamp lit...
> 
> 
>> this in a
>> dynamically typed language, we have to write unit tests for *every
>> single place*
>> that we call foo(). Of course, we'd be writing unit tests for those
>> functions anyway,
>> but we won't be able to assume that foo() works properly, and will need
>> extra tests to ensure this.
> 
> You can never 'assume that foo(x) works properly' for a given x
> in an untested case -- and syntactic-level compatibility between
> the methods x offers (& their signatures), and the ones foo
> requires of its arguments, doesn't buy you much, since the likely
> and troublesome errors are with contract-expectations mismatches.
> 
> If the implementation of foo and its call are inside the same
> component, then unit-tests should exercize the relevant paths
> (or else the component is being released in a state of incomplete
> testing -- thus, dubious reliability, whether with or without
> static typechecks).
> 
> If the implementation of foo and its call are in different
> components, then you have a system-integration problem (again,
> one that remains independently of static type-checking) and
> thus more 'strategic' kinds of troubles.  You **STILL** need
> 'extra' tests to ensure the actually-implemented semantics
> of a given actual argument and the ones foo requires of its
> formal argument match -- for all distinct cases that occur
> in the (system-level) acceptance criteria tests.  If then, in
> later system operation, you meet a failing case that was not
> tested (makes no difference whether the mismatch is in
> method existence, signature, or semantics), then your acceptance
> tests were insufficient -- and static checking would not have
> made them sufficient.
> 
> 
>> In a statically typed language, we still need the unit tests for cases 1
>> and 2, but the
>> third type of error is caught by the compiler.
> 
> Not in most cases of interest, no.
> 
>> And in my mind, ANY TIME that
>> I can have a machine do my work for me it's better... I can be lazy, and
>> the machine
>> never gets tired after a long day and forgets to test sometimes. Of
> 
> Extra checks (even if theoretically redundant) are good insurance EXCEPT
> where they breed exactly this kind of 'complacency'.  Release procedures
> MUST 'never .. forget to test' either -- and it's not that hard a problem
> to set up your build/release environment so that a machine ensures this.


You misunderstand. I'm not saying that type checking allows me to
be lazy... it certainly doesn't. I'm just claiming that it's better
to have a computer doing the check instead of a human whenever possible.
I believe that statement is quite general... no matter what the test,
it's better to have a computer do it than to count on the programmer.
It catches mistakes sooner and more reliably than humans and (for that
matter) expects less in the way of salary and benefits! As far as I
can tell, you agree with me on this.

>> course, I can
>> put an assert at the top of foo() which asserts that x is of type
>> FancyDateObject,
>> but if I always assert the types of my arguments then I'm basically
>> using a statically typed language.
> 
> If your 'assert ... is of type' actually runs a sensible albeit small
> 'type'-testing procedure, which exercises the whole contract that the
> interface implements, then you have gone statically typed languages
> one better -- unfortunately, this is most often impractical (as such
> infinitely-repeated tests are far too slow, can't be made non-invasive,
> etc, etc).  And it doesn't buy you all that much either (a bit more
> than just statically checking types, but not all that much more) --
> you still need to have **exercised** the call-cases of interest.
> 
> Asserting a single, specific, invariable concrete type would be on
> a different plane -- perhaps feasible (at non-inconsiderable cost)
> for some within-the-component cases (where you can commit to never
> needing ANY polymorphism EVER), definitely unfeasible across any
> component-boundary (cfr. Lakos' "Large Scale" book, again -- it's
> still the best treatment I know of dependency management in large
> scale software development, and, in particular, of the inevitability
> of purely abstract interface-classes across component boundaries
> to manage those dependencies; Martin's articles, which can be found
> on his objectmentor site, are more readable, although not quite as
> deep and extensive as Lakos' big book of course).
> 
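(To illustrate the two flavors Alex distinguishes -- FancyDateObject here is just a stand-in for the hypothetical type from earlier in the thread. Asserting one concrete type rejects every polymorphic substitute; checking only the methods foo will actually call is closer to what foo really requires, though neither exercises the contract's semantics:)

```python
class FancyDateObject:
    def First(self):
        return 1
    def Second(self):
        return 2

def foo_concrete(x):
    # Forbids ALL polymorphism -- even a perfectly good substitute fails.
    assert type(x) is FancyDateObject
    return x.First() + x.Second()

def foo_duck(x):
    # Checks only the existence of the methods foo actually calls.
    assert hasattr(x, "First") and hasattr(x, "Second")
    return x.First() + x.Second()
```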
> 
>> There are times when a dynamically typed language is more flexible, and
>> it's certainly
>> nice not to have to declare everything just to specify its type, but
>> there ARE
>> advantages to static typing, and this, I believe, is the biggest one.
> 
> We agree that the (redundant, but earlier) checks performed by the
> statically-checking compiler are the 'biggest' (least small?-)
> advantage of said compiler (well, apart from speed issues, which
> may at times be paramount for certain well-identified components).
> 
> We disagree on how big that 'biggest' is.  I estimate that (speed
> apart) this specific advantage may buy me about a 5% productivity
> improvement -- so I agree it's an advantage, and I agree it's
> larger than any other advantages of static checks (none of which,
> it seems to me, may make even a 1% further improvement even when
> all taken together -- again, speed of resulting code apart), but
> I don't think it's worth anywhere near the _bother_ (productivity
> impact, _negative_ improvement) of the contortions I have to
> perform to satisfy the checks (which cost me _at least_ 10% of
> my sw-lifetime-coding-productivity, even in the cases where I'm
> least interested in specifically taking advantage of dynamism).


You know... I understood and agreed with every single point you
made up until this one. I agree that static typechecking probably
only buys 5% productivity (maybe less), because the kinds of mistakes
it catches are the kind that are easy to diagnose and correct
anyhow.

But I'm quite surprised that you think you pay a 10% penalty
for going through contortions to adhere to static typechecking.
You see... in my own experience, I hardly ever want to violate
it. I generally design type hierarchies and use objects in places
where I expect an object of that type or of a direct ancestor.
On the occasions when I want to write something which works on
all objects of a certain behavior, I define an interface for
the behavior (even in languages which don't have direct support
for "interface") and specify that all appropriate objects
implement the interface. In fact, I often write multi-paragraph
comments on how the interface should behave, so that it's really
more of a "contract" than an interface. When I'm being good,
(and I'm good much more often lately) I even write unit tests to
enforce it.

It is actually VERY rare that I do something like (and here I
dig up my most-likely-to-happen example and phrase it in terms
of Python) write a function expecting a "file-like-object" --
which, in Python, is reasonably-well-understood but not
formally documented behavior (well, set of behaviors). But as
you point out...

> I come to these (tentative) conclusions after a lifetime spent
> working _mostly_ in statically-typed languages


... I'm just learning this stuff. In fact, I have been working
with dynamically typed languages only for a very short time. So
my own take-away is that I should really try to exercise the
ability to make more extravagant use of polymorphism when working
in dynamically typed languages, because I may discover (if my
experience is similar to yours) that I'm saving 10% of my time,
by avoiding awkward contortions that I don't even realize I'm
doing now.
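For instance, a sketch of that file-like-object style (count_lines is my own made-up example, using the modern io.StringIO): the function never names a type -- it calls the one method it needs, so real files, in-memory buffers, or anything else with a .readlines() method all work.

```python
from io import StringIO

def count_lines(flo):
    # Duck typing: any object with .readlines() is acceptable here.
    return len(flo.readlines())

count_lines(StringIO("one\ntwo\nthree\n"))  # 3 for this buffer
```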

>                                                 -- because the
> performance characteristics of the machines I was targeting just
> didn't afford me the luxury of doing otherwise (when they DID,
> I repeatedly tried out dynamically typed languages, and, over
> and over again, I was tickled pink at how well they worked --
> most people are surprised when they learn how much of the
> 'background processing' of the programs I was doing in the
> mid-80's [on IBM mainframes] was done in Rexx, but then, they
> are equally surprised at how fast and reliably I delivered:-).
> 
> Today, I'm still relying mostly on C++ to earn my daily bread
> (3-D modeling &c being pretty compute-intensive even by today's
> standards -- and 3D for mechanical engineering is what
> we're mostly doing here at my current employer), but more and
> more dynamically typed code 'sneaks in' (thanks be!-)...
> 
> 
> Alex
> 
Thanks for the detailed input... I still feel like static
types are a good thing (5% gain perhaps) but if there's
something out there that dynamic typing can give me which
is even BETTER (10%?), then I'd REALLY like to find it!

-- Michael Chermside

