Sequence
analysis of MLH1 protein.
1. Search human MutL protein homolog 1, Mlh1, in SWISS-PROT database. Give its (a) accession number, (b) entry name and (b) release date of last modification.
Ans: (a) primary accession number: P40692
(b) entry name: MLH1_HUMAN
(c) Annotations were last modified in: Release 40, October 2001
http://www.expasy.ch/cgi-bin/niceprot.pl?P40692
2. Give its number of amino acids, molecular weight and theoretical pI.
Ans: Molecular weight: 84600.98
Theoretical pI: 5.51
http://tw.expasy.org/cgi-bin/pi_tool1?P40692@noft@
3. Calculate the total number of negatively charged residues and positively
charged residues.
Ans: Total number of
negatively charged residues (Asp + Glu): 104
Total number of positively charged residues (Arg + Lys): 83
http://tw.expasy.org/cgi-bin/protparam1?P40692@noft@
4. Calculate its hydrophobicity (Kyte & Doolittle
scale, window size 11, Relative weight 60%).
Ans:p

http://tw.expasy.org/cgi-bin/protscale.pl?P40692@noft@Hphob.--/--Kyte--&--Doolittle_11_60_linear_no
5. Performe the trypsin (higher specificity) cleavage
of the protein. (a) How many peptides will you get after cleavage? (b) Give the
list of peptides with a mass bigger than 1000 dalton.
Ans: (a)81
(b)
| mass | position | #MC | peptide sequence |
| 4615.429 | AEMLADYFSLEIDEEGNLIG LPLLIDNYVPPLEGLPIFIL R | ||
| 3115.482 | LSEPAPLFDLAMLALDSPES GWTEEDGPK | ||
| 2862.478 | NTHPFLYLSLEISPQNVDVN VHPTK | ||
| 2804.404 | APPKPCAGNQGTQITVEDLF YNIATR | ||
| 2775.324 | EMLHNHSFVGCVNPQWALAQ HQTK | ||
| 2751.311 | QYISEESTLSGQQSEVPGSI PNSWK | ||
| 2634.410 | IINLTSVLSLQEEINEQGHE VLR | ||
| 2625.413 | LDAFLQPLSKPLSSQPQAIV TEDK | ||
| 2550.328 | LSEELFYQILIYDFANFGVL R | ||
| 2201.124 | HFTEDGNILQLANLPDLYK | ||
| 2140.082 | MYFTQTLLPGLAGPSGEMVK | ||
| 1968.958 | QQDEEMLELPAPAEVAAK | ||
| 1835.002 | GEALASISHVAHVTITTK | ||
| 1833.902 | LQSFEDLASISTYGFR | ||
| 1774.887 | HEVHFLHEESILER | ||
| 1550.901 | IAAGEVIQRPANAIK | ||
| 1532.692 | STTSLTSSSTSGSSDK | ||
| 1525.596 | EDSDVEMVEDDSR | ||
| 1460.684 | MNGYISNANYSVK | ||
| 1410.751 | EGLAEYIVEFLK | ||
| 1408.722 | YSVHNAGISFSVK | ||
| 1338.730 | AIETVYAAYLPK | ||
| 1333.627 | LATEVNWDEEK | ||
| 1327.733 | LIQIQDNGTGIR | ||
| 1300.686 | TLPNASTVDNIR | ||
| 1275.703 | CIFLLFINHR | ||
| 1174.625 | WTVEHIVYK | ||
| 1165.523 | EMIENCLDAK | ||
| 1148.550 | ELIEIGCEDK | ||
| 1119.496 | ECAMFYSIR | ||
| 1092.517 | NQSLEGDTTK | ||
| 1091.504 | EDLDIVCER | ||
| 1003.514 | VYAHQMVR |