new user needs help!

Steve Holden steve at holdenweb.com
Tue Apr 8 20:57:47 EDT 2008


drjekil77 at gmail.com wrote:
> thanks!

Please keep all replies on the list: somebody else may also wish to help 
(and they will also pick up mistakes I make ;-)

> I am working with a text file that looks like this:
> #NAME AA TOPO ACCESS DSSP STRIDE Z-COORD
> 1lghB A i 79.8 H H -24.58
> 1lghB V i 79.6 H H -22.06
> 1lghB H i 71.9 H H -19.94
> I need to check which lines have a Z-COORD value between 10 and 22 and present each line in the following way:
> True/false A C D E F G H I K L M N P Q R S T V X Y W (here the letters represent amino acids)
> 1 1:1 2:0 3:0 and so on for the rest of the amino acids.
> 
> 
> I have given 2 examples below to make it clear a bit:
> 
> ex.1:
> #NAME AA TOPO ACCESS DSSP STRIDE Z-COORD
> 1lghB A i 79.8 H H -24.58    # for this line the amino acid is A and the Z-COORD value is more than 22, so the output should look like:
> 
> True/false A C D E F G H I K L M N P Q R S T V X Y W (here the letters represent amino acids)
> -1 1:1 2:0 3:0 4:0 and so on up to 20, because there are 20 amino acids and A is in the 1st position. Every line represents one amino acid with a value, so the output shows which position it is in (here A is in the 1st position, which is why its value is 1:1 and all the other positions are 0, like 2:0, 3:0, 4:0). If its Z-COORD value is between 10 and 22 then the true/false value is 1, otherwise -1.
> 
> another ex:
> 
> 1lghB H i 71.9 H H -19.94    # for this line the amino acid is H and its value is between 10 and 22, so the output should look like:
> 
> True/false A C D E F G H I K L M N P Q R S T V X Y W (here the letters represent the 20 amino acids)
> 
> 1 1:0 2:0 3:0 4:0 5:0 6:0 7:1 8:0, with every other position zero up to 20, because H is in the 7th position, so that position will be 1.
> 
> So for every line the output will have 21 columns: the 1st column is the true/false value and the others are for the 20 amino acids. The true/false value will be either 1 or -1, and within the other 20 columns one value will be n:1 and the others will be n:0, where n = 1, 2, 3 ... 20.
> It's a bit tricky.
> Thank you so much for your help.
> Waiting for your opinion.
>  
So, you are using 1 for true and -1 for false? Can there be multiple 
amino acids on each line?

Would this be a possible input:

1lghB AHG i 87.3 Q Q   -23.45

If so, would this be the required output?

-1 1:1 2:0 3:0 4:0 5:0 6:1 7:1 8:0 9:0 10:0 ... 19:0 20:0
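
To pin the question down, here is a minimal sketch of one possible reading, 
not a finished solution. It assumes that the range test applies to the 
absolute value of Z-COORD (your -19.94 example counts as in range), that 1 
means in range and -1 means out of range, that the AA field may hold one or 
more letters, and that positions follow the header exactly as you quoted it. 
That header has 21 letters because it includes X, so the sketch emits 21 
position columns; drop X if you only want 20. The names AMINO_ACIDS and 
encode_line are mine.

```python
# Position order copied from the quoted header (21 letters, including X).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVXYW"


def encode_line(line, low=10.0, high=22.0):
    """Encode one data line; return None for blank or comment-only lines."""
    # Assumption: anything after '#' is a comment, as in the quoted examples.
    data = line.split("#", 1)[0].strip()
    if not data:
        return None
    fields = data.split()
    residues = fields[1]      # AA column; may hold several letters (assumed)
    z = float(fields[-1])     # Z-COORD is the last column
    # Assumption: the 10..22 test is on the magnitude of Z-COORD, inclusive.
    label = 1 if low <= abs(z) <= high else -1
    flags = ["%d:%d" % (pos, 1 if aa in residues else 0)
             for pos, aa in enumerate(AMINO_ACIDS, 1)]
    return " ".join([str(label)] + flags)


if __name__ == "__main__":
    sample = [
        "#NAME AA TOPO ACCESS DSSP STRIDE Z-COORD",
        "1lghB A i 79.8 H H -24.58",
        "1lghB H i 71.9 H H -19.94",
    ]
    for row in sample:
        encoded = encode_line(row)
        if encoded is not None:
            print(encoded)
```

With your two examples this gives -1 with only 1:1 set for the A line, and 
1 with only 7:1 set for the H line.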

There is no point trying to write more code until we understand the 
requirements properly.

regards
  Steve
-- 
Steve Holden        +1 571 484 6266   +1 800 494 3119
Holden Web LLC              http://www.holdenweb.com/


