Sequence analysis of MLH1 protein.

1. Search for the human MutL protein homolog 1 (MLH1) in the SWISS-PROT database. Give its (a) accession number, (b) entry name and (c) release date of last modification.

Ans: (a) primary accession number: P40692

        (b) entry name: MLH1_HUMAN

       (c) Annotations were last modified in: Release 40, October 2001

        http://www.expasy.ch/cgi-bin/niceprot.pl?P40692
2. Give its number of amino acids, molecular weight and theoretical pI.

Ans: Number of amino acids: 756

        Molecular weight: 84600.98

        Theoretical pI: 5.51

        http://tw.expasy.org/cgi-bin/pi_tool1?P40692@noft@
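
As a rough illustration of how a molecular weight like the one above is obtained, the sketch below sums average residue masses and adds one water molecule for the free termini. The function name average_mass is hypothetical, the residue masses are the standard average values, and the code assumes an unmodified sequence made of the 20 common amino acids; it is not the ExPASy implementation.

```python
# Average (isotopic-mean) residue masses in Da for the 20 standard amino acids.
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_AVERAGE = 18.01524  # one H2O accounts for the free N- and C-termini


def average_mass(seq):
    """Return the average molecular weight (Da) of an unmodified sequence.

    Hypothetical helper: raises KeyError on non-standard residue letters.
    """
    seq = seq.upper()
    return sum(AVERAGE_RESIDUE_MASS[aa] for aa in seq) + WATER_AVERAGE


if __name__ == "__main__":
    # Free glycine as a sanity check (about 75.07 Da).
    print(round(average_mass("G"), 4))
```

Applying the same function to the full P40692 sequence should give a value close to the one reported, provided the sequence is unmodified.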
3. Calculate the total number of negatively charged residues and positively charged residues.
Ans: Total number of negatively charged residues (Asp + Glu): 104 

        Total number of positively charged residues (Arg + Lys): 83

        http://tw.expasy.org/cgi-bin/protparam1?P40692@noft@
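
The residue counts above are simple tallies of Asp + Glu and Arg + Lys. A minimal sketch of that tally, using a hypothetical helper count_charged and one of the MLH1 tryptic peptides (EDSDVEMVEDDSR, positions 475-487) as example input:

```python
NEGATIVE = "DE"  # Asp, Glu
POSITIVE = "RK"  # Arg, Lys


def count_charged(seq):
    """Return (negative, positive) residue counts for a one-letter sequence."""
    seq = seq.upper()
    negative = sum(seq.count(aa) for aa in NEGATIVE)
    positive = sum(seq.count(aa) for aa in POSITIVE)
    return negative, positive


if __name__ == "__main__":
    # 4 Asp + 3 Glu, and a single Arg
    print(count_charged("EDSDVEMVEDDSR"))  # (7, 1)
```

Run on the full P40692 sequence, the same function should reproduce the totals reported by ProtParam.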

4. Calculate its hydrophobicity profile (Kyte & Doolittle scale, window size 11, relative weight of the window edges 60%).
Ans: [ProtScale graph: Kyte & Doolittle hydrophobicity profile of P40692]

      http://tw.expasy.org/cgi-bin/protscale.pl?P40692@noft@Hphob.--/--Kyte--&--Doolittle_11_60_linear_no
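
The settings in question 4 describe a sliding-window average in which the window centre has full weight and the two edge positions have 60% weight, with a linear ramp in between. The sketch below illustrates that idea with the Kyte & Doolittle scale. The function names are hypothetical, and dividing by the sum of the weights is an assumption: ProtScale's own scaling may differ, so absolute values need not match its plot exactly.

```python
# Kyte & Doolittle hydropathy values.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5, "Q": -3.5, "E": -3.5,
    "G": -0.4, "H": -3.2, "I": 4.5, "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8,
    "P": -1.6, "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def window_weights(window=11, edge=0.6):
    """Linear weights: 1.0 at the window centre, `edge` at both ends."""
    if window < 3 or window % 2 == 0:
        raise ValueError("window must be an odd number >= 3")
    center = window // 2
    return [edge + (1.0 - edge) * (1.0 - abs(i - center) / center)
            for i in range(window)]


def hydropathy_profile(seq, window=11, edge=0.6):
    """Return [(position, score)] with 1-based positions of window centres.

    Assumption: each score is the weighted mean over the window.
    """
    seq = seq.upper()
    weights = window_weights(window, edge)
    total = sum(weights)
    center = window // 2
    profile = []
    for start in range(len(seq) - window + 1):
        chunk = seq[start:start + window]
        score = sum(w * KD[aa] for w, aa in zip(weights, chunk)) / total
        profile.append((start + center + 1, score))
    return profile


if __name__ == "__main__":
    # MLH1 residues 85-100 as a short example; positions are relative to it.
    for pos, score in hydropathy_profile("LQSFEDLASISTYGFR"):
        print(pos, round(score, 3))
```

The first and last five residues get no score because a full window of 11 cannot be centred on them.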

5. Perform the trypsin (higher specificity) cleavage of the protein. (a) How many peptides will you get after cleavage? (b) Give the list of peptides with a mass greater than 1000 Da.
Ans: (a) 81

        (b)

mass (Da)   position   #MC   peptide sequence
4615.429    619-659    0     AEMLADYFSLEIDEEGNLIGLPLLIDNYVPPLEGLPIFILR
3115.482    576-604    0     LSEPAPLFDLAMLALDSPESGWTEEDGPK
2862.478    287-311    0     NTHPFLYLSLEISPQNVDVNVHPTK
2804.404    137-162    0     APPKPCAGNQGTQITVEDLFYNIATR
2775.324    523-546    0     EMLHNHSFVGCVNPQWALAQHQTK
2751.311    689-713    0     QYISEESTLSGQQSEVPGSIPNSWK
2634.410    500-522    0     IINLTSVLSLQEEINEQGHEVLR
2625.413    393-416    0     LDAFLQPLSKPLSSQPQAIVTEDK
2550.328    555-575    0     LSEELFYQILIYDFANFGVLR
2201.124    733-751    0     HFTEDGNILQLANLPDLYK
2140.082    342-361    0     MYFTQTLLPGLAGPSGEMVK
1968.958    426-443    0     QQDEEMLELPAPAEVAAK
1835.002    101-118    0     GEALASISHVAHVTITTK
1833.902    85-100     0     LQSFEDLASISTYGFR
1774.887    312-325    0     HEVHFLHEESILER
1550.901    19-33      0     IAAGEVIQRPANAIK
1532.692    362-377    0     STTSLTSSSTSGSSDK
1525.596    475-487    0     EDSDVEMVEDDSR
1460.684    242-254    0     MNGYISNANYSVK
1410.751    605-616    0     EGLAEYIVEFLK
1408.722    183-195    0     YSVHNAGISFSVK
1338.730    275-286    0     AIETVYAAYLPK
1333.627    660-670    0     LATEVNWDEEK
1327.733    58-69      0     LIQIQDNGTGIR
1300.686    206-217    0     TLPNASTVDNIR
1275.703    256-265    0     CIFLLFINHR
1174.625    714-722    0     WTVEHIVYK
1165.523    34-43      0     EMIENCLDAK
1148.550    227-236    0     ELIEIGCEDK
1119.496    679-687    0     ECAMFYSIR
1092.517    444-453    0     NQSLEGDTTK
1091.504    71-79      0     EDLDIVCER
1003.514    378-385    0     VYAHQMVR

http://tw.expasy.org/cgi-bin/peptide-mass.pl
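
The digest in question 5 can be approximated in a few lines. The sketch below is an illustration under stated assumptions, not the PeptideMass implementation: it uses the simple trypsin rule (cleave after K or R unless the next residue is P) rather than the tool's "higher specificity" exceptions, allows no missed cleavages, and computes the monoisotopic [M+H]+ mass of unmodified peptides, which appears to be what the listed masses correspond to (for example VYAHQMVR at 1003.514). The function names are hypothetical.

```python
# Monoisotopic residue masses in Da for the 20 standard amino acids.
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276, "V": 99.06841,
    "T": 101.04768, "C": 103.00919, "L": 113.08406, "I": 113.08406,
    "N": 114.04293, "D": 115.02694, "Q": 128.05858, "K": 128.09496,
    "E": 129.04259, "M": 131.04049, "H": 137.05891, "F": 147.06841,
    "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056   # monoisotopic H2O
PROTON = 1.00728   # charge carrier for the [M+H]+ ion


def tryptic_digest(seq):
    """Cleave after K or R unless followed by P (simple rule, assumption)."""
    seq = seq.upper()
    peptides, start = [], 0
    for i, aa in enumerate(seq):
        nxt = seq[i + 1] if i + 1 < len(seq) else ""
        if aa in "KR" and nxt != "P":
            peptides.append(seq[start:i + 1])
            start = i + 1
    if start < len(seq):
        peptides.append(seq[start:])  # C-terminal peptide without K/R
    return peptides


def mh_plus(peptide):
    """Monoisotopic [M+H]+ mass of an unmodified peptide."""
    return sum(MONO[aa] for aa in peptide.upper()) + WATER + PROTON


def peptides_above(seq, threshold=1000.0):
    """Digest `seq` and keep (mass, peptide) pairs heavier than `threshold`."""
    pairs = [(mh_plus(p), p) for p in tryptic_digest(seq)]
    return sorted((m, p) for m, p in pairs if m > threshold)[::-1]


if __name__ == "__main__":
    # MLH1 residues 85-118, joined from two adjacent peptides in the table.
    segment = "LQSFEDLASISTYGFRGEALASISHVAHVTITTK"
    for mass, pep in peptides_above(segment):
        print(f"{mass:.3f}  {pep}")
```

A peptide such as APPKPCAGNQGTQITVEDLFYNIATR (137-162) stays intact under this rule because its internal K is followed by P. With the full sequence the simple rule may give a slightly different peptide count than the 81 reported, since the higher-specificity exceptions are not modelled.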