Most Effective Way to Build Up a Histogram of Words?

Oleg Broytmann phd at phd.russ.ru
Thu Oct 12 12:19:42 EDT 2000


On Fri, 13 Oct 2000, June Kim wrote:
> Thank you for your clear and clean code.
> The problem, however, is that I may need to run through several files of a
> few MB each, adding up to tens of megabytes when combined into one file.
> Therefore, doing the sorting all at once might be infeasible
> or inefficient. Am I trying to make Python a panacea here? (I know it has
> no snake oil though)

   Today I ran a similar program on a 42 MB file. It ran for about 50
minutes (it uses regular expressions to extract words). Since I run the
program only once a month, I am quite satisfied with the speed :)
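
   For what it's worth, here is a minimal sketch of how such a count can be
built up one line at a time, so that memory use depends on the number of
distinct words rather than on the total size of the files. It is not the
program I ran: the names WORD_RE, count_words and histogram, the word
pattern, and the use of collections.Counter (from present-day Python) are
all assumptions made for illustration.

```python
import re
from collections import Counter

# Assumed definition of a "word": a run of ASCII letters or apostrophes.
WORD_RE = re.compile(r"[A-Za-z']+")


def count_words(lines, counts=None):
    """Add the lowercased words from an iterable of lines to a Counter."""
    if counts is None:
        counts = Counter()
    for line in lines:
        # Only one line is held in memory at a time.
        counts.update(word.lower() for word in WORD_RE.findall(line))
    return counts


def histogram(paths):
    """Accumulate word counts over several files, read one after another."""
    counts = Counter()
    for path in paths:
        # errors="replace" keeps the sketch from failing on odd bytes.
        with open(path, encoding="utf-8", errors="replace") as f:
            count_words(f, counts)
    return counts
```

   Only the final table of distinct words needs sorting, for example with
histogram(paths).most_common(20), and that table is usually far smaller
than the input text.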

Oleg.            (All opinions are mine and not of my employer)
---- 
    Oleg Broytmann      Foundation for Effective Policies      phd at phd.russ.ru
           Programmers don't die, they just GOSUB without RETURN.




