Sequence analysis of MLH1 protein. - To search protein file in SWISS-PROT Database - To use analysis tools in ExPASy - Deadline - 11/29/2001 1. Search human MutL protein homolog 1, Mlh1, in SWISS-PROT database. Give its (a) accession number, (b) entry name and (c) release date of last modification. ans:(a)P40692(b)MLH1_HUMAN(c)Release 40, October 2001 2. Give its number of amino acids, molecular weight and theoretical pI. ans:(a)Number of amino acids:756(b)Molecularweight:84600.98 (c)Theoretical pI: 5.51 3. Calculate the total number of negatively charged residues and positively charged residues. ans:Total number of negatively charged residues (Asp + Glu): 104 Total number of positively charged residues (Arg + Lys): 83 4. Calculate its hydrophobicity (Kyte & Doolittle scale, window size 11, Relative weight 60%). ans:Ala: 1.800 Arg: -4.500 Asn: -3.500 Asp: -3.500 Cys: 2.500 Gln: -3.500 Glu: -3.500 Gly: -0.400 His: -3.200 Ile: 4.500 Leu: 3.800 Lys: -3.900 Met: 1.900 Phe: 2.800 Pro: -1.600 Ser: -0.800 Thr: -0.700 Trp: -0.900 Tyr: -1.300 Val: 4.200 Asx: -3.500 Glx: -3.500 Xaa: -0.490 5. Performe the trypsin (higher specificity) cleavage of the protein. (a) How many peptides will you get after cleavage? (b) Give the list of peptides with a mass bigger than 1000 dalton. ans:(a)33 (b)mass position #MC peptide sequence 4615.429 619-659 0 AEMLADYFSLEIDEEGNLIG LPLLIDNYVPPLEGLPIFIL R 3115.482 576-604 0 LSEPAPLFDLAMLALDSPES GWTEEDGPK 2862.478 287-311 0 NTHPFLYLSLEISPQNVDVN VHPTK 2804.404 137-162 0 APPKPCAGNQGTQITVEDLF YNIATR 2775.324 523-546 0 EMLHNHSFVGCVNPQWALAQ HQTK 2751.311 689-713 0 QYISEESTLSGQQSEVPGSI PNSWK 2634.410 500-522 0 IINLTSVLSLQEEINEQGHE VLR 2625.413 393-416 0 LDAFLQPLSKPLSSQPQAIV TEDK 2550.328 555-575 0 LSEELFYQILIYDFANFGVL R 2201.124 733-751 0 HFTEDGNILQLANLPDLYK 2140.082 342-361 0 MYFTQTLLPGLAGPSGEMVK 1968.958 426-443 0 QQDEEMLELPAPAEVAAK 1835.002 101-118 0 GEALASISHVAHVTITTK 1833.902 85-100 0 LQSFEDLASISTYGFR 1774.887 312-325 0 HEVHFLHEESILER 1550.901 19-33 0 IAAGEVIQRPANAIK 1532.692 362-377 0 STTSLTSSSTSGSSDK 1525.596 475-487 0 EDSDVEMVEDDSR 1460.684 242-254 0 MNGYISNANYSVK 1410.751 605-616 0 EGLAEYIVEFLK 1408.722 183-195 0 YSVHNAGISFSVK 1338.730 275-286 0 AIETVYAAYLPK 1333.627 660-670 0 LATEVNWDEEK 1327.733 58-69 0 LIQIQDNGTGIR 1300.686 206-217 0 TLPNASTVDNIR 1275.703 256-265 0 CIFLLFINHR 1174.625 714-722 0 WTVEHIVYK 1165.523 34-43 0 EMIENCLDAK 1148.550 227-236 0 ELIEIGCEDK 1119.496 679-687 0 ECAMFYSIR 1092.517 444-453 0 NQSLEGDTTK 1091.504 71-79 0 EDLDIVCER 1003.514 378-385 0 VYAHQMVR