[Baypiggies] upper case, lower case, and....?

Vikram K kpguy1975 at gmail.com
Fri Apr 1 15:01:22 CEST 2011


I have this protein sequence (each character represents an amino acid):

MAKFNTGSNPTEEAATSSRPFKVAGQSSPSGIQSRKNLFDNQGNASPPAGPSSMPKFGTTKPPLAAKPTYEEKPEKEP
KPPFLKPTGGSPRFGTQPNSVSRDPEVKVGFLKPVSPKPTSLTKEDSKPVVLRPPGNKLHNLNQESDLKTPGPKPGPAP
PVPENELKPGFSKVAGAKSKFMPAAQDTDSKPRFPRHTFGQKPSLSTEDSQEENTSKNVPVQKGSPVQLGAKSKGAPF
KPPKEDPEDKDHGAPSSPFPGVVLKPAASRGSPGLSKNFEEKKEDRKTDLAKNIFLNKLNQEEPARFPKAPSKLTAGTPW
GQSQEKEGDKNSATPKQKALPPLSVLGPPPPKPNRPPNVDLTRFRKADSANSATKSQTPYSTTSLPPPPPTHPASQPPLPASHP
AHPPVPSLPPRNIKPPLDLKHPINDENQDGVMHSDGTGNLEEEQESEGETYEDIDSSKERDKKREKEEKKRLELERKEQKEREKK
EQELKKKFKLTGPIQVIHHAKACCDVKGGKNELSFKQGEDIEIIRITDNPEGKWLGRTARGSYGYIKTTAVEIDYDSLKRKKNSLNAVP
PRLVEDDQDVYDDVAEQDAPNSHGQSGSGGMFPPPPTDDEIYDGIEEEDDDDGSVPQVDEKTNAWSWGILKMLKGKDDRKKSIRE
KPKVSESDNNEGSSLPSQHKQLDVGEEVYDDVDASDFPPPPAEMSQGMSVGRAKTEEKDPKKLKKQEKEEKDLRKKFKYDGEIRVL
YSTKVASSLTSKKWGARDLQIKPGESLEVIQSTDDTKVLCRNEEGKYGYVLRSYLVDNDGEIYDDIADGCIYDND

I identified the presence of the following peptide sequence in this protein:
TTAVEIDYDsLKR

The small s ('s') in the peptide sequence represents a phosphoyrlated serine
(phospho serine) which is different from a normal serine which is
represented by
the big s ('S').

Now i replace the occurence of the peptide in the protein with small case to
identify where the peptide is occuring in the protein:

MAKFNTGSNPTEEAATSSRPFKVAGQSSPSGIQSRKNLFDNQGNASPPAGPSSMPKFGTTKPPLAAKPTYEEKPEKEPKPPFLKPT
GGSPRFGTQPNSVSRDPEVKVGFLKPVSPKPTSLTKEDSKPVVLRPPGNKLHNLNQESDLKTPGPKPGPAPPVPENELKPGFSKVA
GAKSKFMPAAQDTDSKPRFPRHTFGQKPSLSTEDSQEENTSKNVPVQKGSPVQLGAKSKGAPFKPPKEDPEDKDHGAPSSPFPGVVL
KPAASRGSPGLSKNFEEKKEDRKTDLAKNIFLNKLNQEEPARFPKAPSKLTAGTPWGQSQEKEGDKNSATPKQKALPPLSVLGPPPPKP
NRPPNVDLTRFRKADSANSATKSQTPYSTTSLPPPPPTHPASQPPLPASHPAHPPVPSLPPRNIKPPLDLKHPINDENQDGVMHSDGTGNL
EEEQESEGETYEDIDSSKERDKKREKEEKKRLELERKEQKEREKKEQELKKKFKLTGPIQVIHHAKACCDVKGGKNELSFKQGEDIEIIRITDN
PEGKWLGRTARGSYGYIKttaveidydslkrKKNSLNAVPPRLVEDDQDVYDDVAEQDAPNSHGQSGSGGMFPPPPTDDEIYDGIEEEDDDDGSV
PQVDEKTNAWSWGILKMLKGKDDRKKSIREKPKVSESDNNEGSSLPSQHKQLDVGEEVYDDVDASDFPPPPAEMSQGMSVGRAKTEEKDP
KKLKKQEKEEKDLRKKFKYDGEIRVLYSTKVASSLTSKKWGARDLQIKPGESLEVIQSTDDTKVLCRNEEGKYGYVLRSYLVDNDGEIYDDIADGCIYDND


My problem is that i wish to distinguish the phospho serine character from
the rest of the small case letters in the modified protein sequence shown
above.
Any suggestions?
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/baypiggies/attachments/20110401/1577a31e/attachment.html>


More information about the Baypiggies mailing list