Processing output of TM program

MK mark_removethis_ at _removethis_btweng.krakow.pl
Sat May 15 09:47:57 EDT 1999


Hi,

I am a Python newbie, and I would like to ask more experienced
programmers how to solve the following problem. In our company we use
the output stats of a Translation Memory program (ugh), which look like this:

---------

E:\COMPAQ\MARKED~1\R-APPB.RTF

 Match Types  Segments        Words Percent Placeables
 Repetitions        27           64       2          1
 100%               68          801      25          0
 95% - 99%           5           83       2          0
 85% - 94%           2           18       0          0
 75% - 84%           2           40       1          0
 50% - 74%           4           10       0          0
 No Match          127        2,122      70          1
 Total             235        3,138     100          2

 Chars/Wo         5.27


E:\COMPAQ\MARKED~1\S-APPC.RTF

 Match Types  Segments        Words Percent Placeables
 Repetitions        21           46      15          1
 100%               10           96      33          0
 95% - 99%           3           67      23          0
 85% - 94%           1           25       8          0
 75% - 84%           1            3       1          0
 50% - 74%           1            3       1          0
 No Match            7           49      19          0
 Total              44          289     100          1

 Chars/Wo         5.82



Analyze Total

 Match Types  Segments        Words Percent Placeables
 Repetitions       899        3,330      11         16
 100%              371        1,826       6         15
 95% - 99%          48          415       1          1
 85% - 94%          47          403       1          1
 75% - 84%          86          638       2          4
 50% - 74%          98          553       1         18
 No Match        2,154       21,389      78        548
 Total           3,703       28,554     100        603

 Chars/Wo         5.33


Analyze finished successfully! Wed Apr 14 18:25:12 1999

--------


Only the 'Match Types' and 'Words' columns are relevant.

Various people work on particular chapters. Depending on
how well the sentences in a chapter matched sentences
from the translation memory (which obviously reflects how
much work has to be done on them), a fraction
of the per-word rate is paid. E.g. "No Match" is paid 100%,
"50% - 74%" is say 80%, "75% - 99%" is say 50%, and so on.
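
To make the arithmetic concrete, here is a rough sketch for the first file
in the report above. Only the three fractions mentioned (100%, 80%, 50%)
come from the scheme as described; the fractions for "Repetitions" and
"100%" matches are placeholders I made up, and the names PAY_FRACTION and
payable_words are just for illustration.

```python
# Fraction of the per-word rate paid for each match band.
# "No Match" (1.0), "50% - 74%" (0.8) and the 75%-99% bands (0.5) follow
# the example figures in the text; the first two entries are ASSUMPTIONS.
PAY_FRACTION = {
    "Repetitions": 0.0,   # assumption, not stated in the text
    "100%": 0.0,          # assumption, not stated in the text
    "95% - 99%": 0.5,
    "85% - 94%": 0.5,
    "75% - 84%": 0.5,
    "50% - 74%": 0.8,
    "No Match": 1.0,
}

# "Words" column for R-APPB.RTF, copied from the report.
words = {
    "Repetitions": 64,
    "100%": 801,
    "95% - 99%": 83,
    "85% - 94%": 18,
    "75% - 84%": 40,
    "50% - 74%": 10,
    "No Match": 2122,
}

def payable_words(words, fractions):
    """Sum of word counts, each weighted by its band's pay fraction."""
    return sum(count * fractions[band] for band, count in words.items())

total = payable_words(words, PAY_FRACTION)
print(round(total, 1))
```

Multiplying the result by the per-word price would give the amount due
for that file.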

Calculating what a particular person is owed is very time
consuming. Attempts to use Excel did not help much.
I would like to write a program in Python that would calculate
the translators' wages. The simplest approach is to slurp
the file into a list using readlines(), and then use string.split
on each row. But maybe there is some cleverer approach?
I mean, this is a more general problem -- Perl has this 'form' facility
(I have not really used it, just browsed, but it looked interesting),
so maybe there is some module that makes such things
a breeze? Unfortunately I have not been able to find
anything like that on the net.
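
For reference, the simple split-based approach could be sketched roughly
like this. It assumes that every data row ends in exactly four numeric
columns (Segments, Words, Percent, Placeables) and that any other
non-empty line, apart from the header, "Chars/Wo" and "Analyze finished"
lines, starts a new section. The names parse_report and is_count are
illustrative only, and the sample is cut down from the report above.

```python
def is_count(token):
    """True for integer columns such as '49' or '2,122'."""
    return token.replace(",", "").isdigit()

def parse_report(lines):
    """Return {section name: {match type: word count}}."""
    report = {}
    section = None
    for line in lines:
        parts = line.split()
        if not parts:
            continue                      # blank line
        text = line.strip()
        if len(parts) >= 5 and all(is_count(p) for p in parts[-4:]):
            # Data row: everything before the last four columns is the label.
            label = " ".join(parts[:-4])
            word_count = int(parts[-3].replace(",", ""))
            if section is not None:
                report[section][label] = word_count
        elif (text.startswith("Match Types")
              or text.startswith("Chars/Wo")
              or text.startswith("Analyze finished")
              or set(text) == {"-"}):
            continue                      # header, footer or separator
        else:
            section = text                # file name or "Analyze Total"
            report[section] = {}
    return report

sample = r"""
E:\COMPAQ\MARKED~1\S-APPC.RTF

 Match Types  Segments        Words Percent Placeables
 Repetitions        21           46      15          1
 95% - 99%           3           67      23          0
 No Match            7           49      19          0
 Total              44          289     100          1

 Chars/Wo         5.82
"""

report = parse_report(sample.splitlines())
for name, rows in report.items():
    print(name, rows)
```

For a real report the list of lines would come from readlines() on the
opened file instead of the sample string.
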
TIA





--------------------------------------------------
Reality is something that does not disappear after
you cease believing in it - VALIS, Philip K. Dick
--------------------------------------------------

Delete _removethis_ from address to email me



