Sequence
analysis of MLH1 protein.
1. Search human MutL protein homolog 1, Mlh1, in SWISS-PROT database. Give its (a) accession number, (b) entry name and (b) release date of last modification.
Ans: (a) primary accession number: P40692
(b) entry name: MLH1_HUMAN
(c) Annotations were last modified in: Release 40, October 2001
http://www.expasy.ch/cgi-bin/niceprot.pl?P40692
2. Give its number of amino acids, molecular weight and theoretical pI.
Ans: Molecular weight: 84600.98
Theoretical pI: 5.51
http://tw.expasy.org/cgi-bin/pi_tool1?P40692@noft@
3. Calculate the total number of negatively charged residues and positively
charged residues.
Ans: Total number of
negatively charged residues (Asp + Glu): 104
Total number of positively charged residues (Arg + Lys): 83
http://tw.expasy.org/cgi-bin/protparam1?P40692@noft@
4. Calculate its hydrophobicity (Kyte & Doolittle
scale, window size 11, Relative weight 60%).
Ans:p
http://tw.expasy.org/cgi-bin/protscale.pl?P40692@noft@Hphob.--/--Kyte--&--Doolittle_11_60_linear_no
5. Performe the trypsin (higher specificity) cleavage
of the protein. (a) How many peptides will you get after cleavage? (b) Give the
list of peptides with a mass bigger than 1000 dalton.
Ans: (a)81
(b)
mass | position | #MC | peptide sequence |
4615.429 | AEMLADYFSLEIDEEGNLIG LPLLIDNYVPPLEGLPIFIL R | ||
3115.482 | LSEPAPLFDLAMLALDSPES GWTEEDGPK | ||
2862.478 | NTHPFLYLSLEISPQNVDVN VHPTK | ||
2804.404 | APPKPCAGNQGTQITVEDLF YNIATR | ||
2775.324 | EMLHNHSFVGCVNPQWALAQ HQTK | ||
2751.311 | QYISEESTLSGQQSEVPGSI PNSWK | ||
2634.410 | IINLTSVLSLQEEINEQGHE VLR | ||
2625.413 | LDAFLQPLSKPLSSQPQAIV TEDK | ||
2550.328 | LSEELFYQILIYDFANFGVL R | ||
2201.124 | HFTEDGNILQLANLPDLYK | ||
2140.082 | MYFTQTLLPGLAGPSGEMVK | ||
1968.958 | QQDEEMLELPAPAEVAAK | ||
1835.002 | GEALASISHVAHVTITTK | ||
1833.902 | LQSFEDLASISTYGFR | ||
1774.887 | HEVHFLHEESILER | ||
1550.901 | IAAGEVIQRPANAIK | ||
1532.692 | STTSLTSSSTSGSSDK | ||
1525.596 | EDSDVEMVEDDSR | ||
1460.684 | MNGYISNANYSVK | ||
1410.751 | EGLAEYIVEFLK | ||
1408.722 | YSVHNAGISFSVK | ||
1338.730 | AIETVYAAYLPK | ||
1333.627 | LATEVNWDEEK | ||
1327.733 | LIQIQDNGTGIR | ||
1300.686 | TLPNASTVDNIR | ||
1275.703 | CIFLLFINHR | ||
1174.625 | WTVEHIVYK | ||
1165.523 | EMIENCLDAK | ||
1148.550 | ELIEIGCEDK | ||
1119.496 | ECAMFYSIR | ||
1092.517 | NQSLEGDTTK | ||
1091.504 | EDLDIVCER | ||
1003.514 | VYAHQMVR |