assignment 6


1.
DNA MISMATCH REPAIR PROTEIN MLH1 (MUTL PROTEIN HOMOLOG 1) [Homo sapiens (Human)]

SequenceMSFVAGVIRR LDETVVNRIA AGEVIQRPAN AIKEMIENCL DAKSTSIQVIStructureCCCCEEEEEE CCHHHHHHHH HCHHHHHHHH HHHHHHHHHC CCCCCCHHHHSequenceVKEGGLKLIQ IQDNGTGIRK EDLDIVCERF TTSKLQSFED LASISTYGFRStructureHHHCCCEEEE ECCCCCCCHH HHHHHHHCCC CCCCCCCCHH HHHHCCCCCCSequenceGEALASISHV AHVTITTKTA DGKCAYRASY SDGKLKAPPK PCAGNQGTQIStructureCCHHHHHHHE EEEEEEECCC CCCCEEEECC CCCCCCCCCC CCCCCCCCEESequenceTVEDLFYNIA TRRKALKNPS EEYGKILEVV GRYSVHNAGI SFSVKKQGETStructureEEHHHHHHHH HHHHHHCCCC HHHHHEEEEE ECCCCCCCCE EEEECCCCCESequenceVADVRTLPNA STVDNIRSIF GNAVSRELIE IGCEDKTLAF KMNGYISNANStructureEEEEEECCCC CCCCCEEEEC CCCCCHHHHH HCCCHHHHHH CCCCCEECCCSequenceYSVKKCIFLL FINHRLVEST SLRKAIETVY AAYLPKNTHP FLYLSLEISPStructureCCCCCEEEEE ECCCCHHHHH HHHHHHHHHH HHCCCCCCCC EEEECCCCCCSequenceQNVDVNVHPT KHEVHFLHEE SILERVQQHI ESKLLGSNSS RMYFTQTLLPStructureCCCCEEECCC CCHHHHHHHH HHHHHHHHHH HHHHHCCCCC CEEEEEEECCSequenceGLAGPSGEMV KSTTSLTSSS TSGSSDKVYA HQMVRTDSRE QKLDAFLQPLStructureCCCCCCCCEE EEEEEEEEEC CCCCCCHHHH HHHHHHHHHH HHHHHHHCCCSequenceSKPLSSQPQA IVTEDKTDIS SGRARQQDEE MLELPAPAEV AAKNQSLEGDStructureCCCCCCCCCE EECCCCCCHH HHHHHHHHHH HHHCCCHHHH HHHHHCCCCCSequenceTTKGTSEMSE KRGPTSSNPR KRHREDSDVE MVEDDSRKEM TAACTPRRRIStructureCCCCCCHHHC CCCCCCCCCC CCCCCCCCHH HHHHHHHHHH HHHCCCCCEESequenceINLTSVLSLQ EEINEQGHEV LREMLHNHSF VGCVNPQWAL AQHQTKLYLLStructureECCCCHHHHH HHHHHHHHHH HHHHHCCCCE EEEECCCCHH HHHHHHHHHHSequenceNTTKLSEELF YQILIYDFAN FGVLRLSEPA PLFDLAMLAL DSPESGWTEEStructureHCCCCHHHHH HHHHHHCCCC CCEECCCCCC CHHHHHHHHC CCCCCCCCCCSequenceDGPKEGLAEY IVEFLKKKAE MLADYFSLEI DEEGNLIGLP LLIDNYVPPLStructureCCCCCCHHHH HHHHHHHHHH HHHHHHHHHH HHCCCCCCCC EEECCCCCCCSequenceEGLPIFILRL ATEVNWDEEK ECFESLSKEC AMFYSIRKQY ISEESTLSGQStructureCCCCHHHHHH HHHHCHHHHH HCCCCCCCCC HHHHHCCCCC CCHHHHCCCCSequenceQSEVPGSIPN SWKWTVEHIV YKALRSHILP PKHFTEDGNI LQLANLPDLYStructureCCCCCCCCCC CCCEEEECHH HHHHHCCCCC CCCCCCCCHH HHHHCCCCCESequenceKVFERCStructureEEEEEC
LEGEND: Alpha Helix = H Beta Sheet = E Random Coil = C

2.
GENPEPT_7595954 -----------------MAFVAGVIRRLDETVVNRIAAGEVIQRPANAIK GENPEPT_1724118 -----------------MSFVAGVIRRLDETVVNRIAAGEVIQRPANAIK MLH1_HUMAN -----------------MSFVAGVIRRLDETVVNRIAAGEVIQRPANAIK GENPEPT_3192877 ---------------MAEYLQPGVIRKLDEVVVNRIAAGEIIQRPANALK GENPEPT_460627 --------------------MSLRIKALDASVVNKIAAGEIIISPVNALK hypothetical_protein_T28A8.7 MWHCGYRTRNCDEFSKIEFSLMGLIQRLPQDVVNRMAAGEVLARPCNAIK*:****::****::***:* GENPEPT_7595954 EMIENCLDAKSTNIQVVVKEGGLKLIQIQDNGTGIRKEDLDIVCERFTTS GENPEPT_1724118 EMTENCLDAKSTNIQVIVREGGLKLIQIQDNGTGIRKEDLDIVCERFTTS MLH1_HUMAN EMIENCLDAKSTSIQVIVKEGGLKLIQIQDNGTGIRKEDLDIVCERFTTS GENPEPT_3192877 ELLENSLDAQSTHIQVQVKAGGLKLLQIQDNGTGIRREDLAIVCERFTTS GENPEPT_460627 EMMENSIDANATMIDILVKEGGIKVLQITDNGSGINKADLPILCERFTTS hypothetical_protein_T28A8.7 ELVENSLDAGATEIMVNMQNGGLKLLQVSDNGKGIEREDFALVCERFATS*:**.:**:**:::**:*::*:***.**.:*:::****:** GENPEPT_7595954 KLQTFEDLASISTYGFRGEALASISHVAHVTITTKTADGKCAYRASYSDG GENPEPT_1724118 KLQTFEDLAMISTYGFRGEALASISHVAHVTITTKTADGKCAYRASYSDG MLH1_HUMAN KLQSFEDLASISTYGFRGEALASISHVAHVTITTKTADGKCAYRASYSDG GENPEPT_3192877 KLTRFEDLSQIATFGFRGEALASISHVAHLSIQTKTAKEKCGYKATYADG GENPEPT_460627 KLQKFEDLSQIQTYGFRGEALASISHVARVTVTTKVKEDRCAWRVSYAEG hypothetical_protein_T28A8.7 KLQKFEDLMHMKTYGFRGEALASLSHVAKVNIVSKRADAKCAYQANFLDG******:*:*********:****::.::*.:*.::..::* GENPEPT_7595954 KLQAPPKPCAGNQGTLITVEDLFYNIITRRKALKNPSEEYGKILEVVGRY GENPEPT_1724118 KLQAPPKPCAGNQGTLITVEDLFYNIITRKKALKNPSEEYGKILEVVGRY MLH1_HUMAN KLKAPPKPCAGNQGTQITVEDLFYNIATRRKALKNPSEEYGKILEVVGRY GENPEPT_3192877 KLQGQPKPCAGNQGTIICIEDLFYNMPQRRQALRSPAEEFQRLSEVLARY GENPEPT_460627 KMLESPKPVAGKDGTTILVEDLFFNIPSRLRALRSHNDEYSKILDVVGRY hypothetical_protein_T28A8.7 KMTADTKPAAGKNGTCITATDLFYNLPTRRNKMTTHGEEAKMVNDTLLRF*:.****::******:*:*.:.:*::.:*: