·  Fasta label (*)

Workbench label

MLH1_HUMAN

DNA MISMATCH REPAIR PROTEIN MLH1 (MUTL PROTEIN HOMOLOG 1) [Homo sapiens (Human)]

MLH1_YEAST

MUTL PROTEIN HOMOLOG 1 (DNA MISMATCH REPAIR PROTEIN MLH1) [Saccharomyces cerevisiae (Baker's yeast)]

MLH1_RAT

DNA MISMATCH REPAIR PROTEIN MLH1 (MUTL PROTEIN HOMOLOG 1) [Rattus norvegicus (Rat)]

MLH1_MOUSE

DNA MISMATCH REPAIR PROTEIN MLH1 (MUTL PROTEIN HOMOLOG 1) [Mus musculus (Mouse)]

GENPEPT:3880333

Caenorhabditis elegans cosmid T28A8, complete sequence_

GENPEPT:3192877

Drosophila melanogaster mutL homolog (Mlh1) gene, complete cds_

GENPEPT:7304079

Drosophila melanogaster genomic scaffold 142000013386047 section 5

(*) Clustalw cuts off Fasta labels after the first space (e.g. ">abc def" becomes ">abc").


Sequence alignment

Consensus key (see documentation for details)
* - single, fully conserved residue
: - conservation of strong groups
. - conservation of weak groups
  - no consensus
 
 
CLUSTAL W (1.81) multiple sequence alignment
 
 
GENPEPT_7304079      ---------------MAEYLQPGVIRKLDEVVVNRIAAGEIIQRPANALKELLENSLDAQ
GENPEPT_3192877      ---------------MAEYLQPGVIRKLDEVVVNRIAAGEIIQRPANALKELLENSLDAQ
MLH1_MOUSE           -----------------MAFVAGVIRRLDETVVNRIAAGEVIQRPANAIKEMIENCLDAK
MLH1_RAT             -----------------MSFVAGVIRRLDETVVNRIAAGEVIQRPANAIKEMTENCLDAK
MLH1_HUMAN           -----------------MSFVAGVIRRLDETVVNRIAAGEVIQRPANAIKEMIENCLDAK
MLH1_YEAST           --------------------MSLRIKALDASVVNKIAAGEIIISPVNALKEMMENSIDAN
GENPEPT_3880333      MWHCGYRTRNCDEFSKIEFSLMGLIQRLPQDVVNRMAAGEVLARPCNAIKELVENSLDAG
                                             *: *   ***::****::  * **:**: **.:** 
 
GENPEPT_7304079      STHIQVQVKAGGLKLLQIQDNGTGIRREDLAIVCERFTTSKLTRFEDLSQIATFGFRGEA
GENPEPT_3192877      STHIQVQVKAGGLKLLQIQDNGTGIRREDLAIVCERFTTSKLTRFEDLSQIATFGFRGEA
MLH1_MOUSE           STNIQVVVKEGGLKLIQIQDNGTGIRKEDLDIVCERFTTSKLQTFEDLASISTYGFRGEA
MLH1_RAT             STNIQVIVREGGLKLIQIQDNGTGIRKEDLDIVCERFTTSKLQTFEDLAMISTYGFRGEA
MLH1_HUMAN           STSIQVIVKEGGLKLIQIQDNGTGIRKEDLDIVCERFTTSKLQSFEDLASISTYGFRGEA
MLH1_YEAST           ATMIDILVKEGGIKVLQITDNGSGINKADLPILCERFTTSKLQKFEDLSQIQTYGFRGEA
GENPEPT_3880333      ATEIMVNMQNGGLKLLQVSDNGKGIEREDFALVCERFATSKLQKFEDLMHMKTYGFRGEA
                     :* * : :: **:*::*: ***.**.: *: ::****:****  ****  : *:******
 
GENPEPT_7304079      LASISHVAHLSIQTKTAKEKCGYKATYADGKLQGQPKPCAGNQGTIICIEDLFYNMPQRR
GENPEPT_3192877      LASISHVAHLSIQTKTAKEKCGYKATYADGKLQGQPKPCAGNQGTIICIEDLFYNMPQRR
MLH1_MOUSE           LASISHVAHVTITTKTADGKCAYRASYSDGKLQAPPKPCAGNQGTLITVEDLFYNIITRR
MLH1_RAT             LASISHVAHVTITTKTADGKCAYRASYSDGKLQAPPKPCAGNQGTLITVEDLFYNIITRK
MLH1_HUMAN           LASISHVAHVTITTKTADGKCAYRASYSDGKLKAPPKPCAGNQGTQITVEDLFYNIATRR
MLH1_YEAST           LASISHVARVTVTTKVKEDRCAWRVSYAEGKMLESPKPVAGKDGTTILVEDLFFNIPSRL
GENPEPT_3880333      LASLSHVAKVNIVSKRADAKCAYQANFLDGKMTADTKPAAGKNGTCITATDLFYNLPTRR
                     ***:****::.: :*  . :*.::..: :**:   .** **::** *   ***:*:  * 
 
GENPEPT_7304079      QALRSPAEEFQRLSEVLARYAVHNPRVGFTLRKQGDAQPALRTPVASSRSENIRIIYGAA
GENPEPT_3192877      QALRSPAEEFQRLSEVLARYAVHNPRVGFTLRKQGDAQPALRTPVASSRSENIRIIYGAA
MLH1_MOUSE           KALKNPSEEYGKILEVVGRYSIHNSGISFSVKKQGETVSDVRTLPNATTVDNIRSIFGNA
MLH1_RAT             KALKNPSEEYGKILEVVGRYSIHNSGISFSVKKQGETVSDVRTLPNATTVDNIRSIFGNA
MLH1_HUMAN           KALKNPSEEYGKILEVVGRYSVHNAGISFSVKKQGETVADVRTLPNASTVDNIRSIFGNA
MLH1_YEAST           RALRSHNDEYSKILDVVGRYAIHSKDIGFSCKKFGDSNYSLSVKPSYTVQDRIRTVFNKS
GENPEPT_3880333      NKMTTHGEEAKMVNDTLLRFAIHRPDVSFALRQ--NQAGDFRTKGDGNFRDVVCNLLGRD
                     . : .  :*   : :.: *:::*   :.*: ::  :    . .    .  : :  : .  
 
GENPEPT_7304079      ISKELL--EFSHRDEVYKFEAECLITQVNYSAKKCQM----------LLFINQRLVESTA
GENPEPT_3192877      ISKELL--EFSHRDEVYKFEAECLITQVNYSAKKCQM----------LLFINQRLVESTA
MLH1_MOUSE           VSRELI--EVGCEDKTLAFKMNGYISNANYSVKKCIF----------LLFINHRLVESAA
MLH1_RAT             VSRELI--EVGCEDKTLAFKMNGYISNANYSVKKCIF----------LLFINHRLVESAA
MLH1_HUMAN           VSRELI--EIGCEDKTLAFKMNGYISNANYSVKKCIF----------LLFINHRLVESTS
MLH1_YEAST           VASNLITFHISKVEDLNLESVDGKVCNLNFISKKSISP---------IFFINNRLVTCDL
GENPEPT_3880333      VADTILP--LSLNSTRLKFTFTGHISKPIASATAAIAQNRKTSRSFFSVFINGRSVRCDI
                     ::  ::   ..  .          : :     . .             .*** * * .  
 
GENPEPT_7304079      LRTSVDSIYATYLPRGHHPFVYMSLTLPPQNLDVNVHPTKHEVHFLYQEEIVDSIKQQVE
GENPEPT_3192877      LRTSVDSIYATYLPRGHHPFVYMSLTLPPQNLDVNVHPTKHEVHFLYQEEIVDSIKQQVE
MLH1_MOUSE           LRKAIETVYAAYLPKNTHPFLYLSLEISPQNVDVNVHPTKHEVHFLHEESILQRVQQHIE
MLH1_RAT             LKKAIEAVYAAYLPKNTHPFLYLILEISPQNVDVNVHPTKHEVHFLHEESILERVQQHIE
MLH1_HUMAN           LRKAIETVYAAYLPKNTHPFLYLSLEISPQNVDVNVHPTKHEVHFLHEESILERVQQHIE
MLH1_YEAST           LRRALNSVYSNYLPKGNRPFIYLGIVIDPAAVDVNVHPTKREVRFLSQDEIIEKIANQLH
GENPEPT_3880333      LKHPIDEVLG--ARQLHAQFCALHLQIDETRIDVNVHPTKNSVIFLEKEEIIEEIRAYFE
                     *: .:: : .    :    *  : : :    :********..* ** ::.*:: :   ..
 
GENPEPT_7304079      ARLLGSNATRTFYKQLRLPGAP-----------------DLDETQLADKTQRIYPKEMVR
GENPEPT_3192877      ARLLGSNATRTFYKQLRLPGAP-----------------DLDETQLADKTQRIYPKEMVR
MLH1_MOUSE           SKLLGSNSSRMYFTQTLLPGLAG------PSGEAARPTTGVASSSTSGSGDKVYAYQMVR
MLH1_RAT             SKLLGSNSSRMYFTQTLLPGLAG------PSGEAVKSTTGIASSSTSGSGDKVHAYQMVR
MLH1_HUMAN           SKLLGSNSSRMYFTQTLLPGLAG------PSGEMVKSTTSLTSSSTSGSSDKVYAHQMVR
MLH1_YEAST           AELSAIDTSRTFKASSISTNKPESLIPFNDTIESDRNRKSLRQAQVVENSYTTANSQLRK
GENPEPT_3880333      KVIGEIFGFEALDVEKPEEEQPD--------IENLVMIPMSQSLKSIEAIRKPDTKPEFK
                       :      .    .      .                    . .              :
 
GENPEPT_7304079      TDSTEQKLDKFLAPLVK-------------------------------------------
GENPEPT_3192877      TDSTEQKLDKFLAPLVK-------------------------------------------
MLH1_MOUSE           TDSRDQKLDAFLQPVSSLVPSQPQDPAPVRGARTEGSPERATREDEEMLALPAPAEAAAE
MLH1_RAT             TDSRDQKLDAFMQPVSRRLPSQPQD--PVPGNRTEGSPEKAMQKDQEISELPAPMEAAAD
MLH1_HUMAN           TDSREQKLDAFLQPLSKPLSSQPQ--AIVTEDKTDISSGRARQQDEEMLELPAPAEVAAK
MLH1_YEAST           AKRQENKLVRIDASQAKITSFLSSS--QQFNFEGSSTKRQLSEPKVTNVSHSQEAEKLTL
GENPEPT_3880333      SSPSAWKSDKKRVDYMEVRTDAKERKIDEFVTRGGAVGPTTSNDDIFGGSGILKRARTED
                     :.    *                                                     
 
GENPEPT_7304079      ----------------SDSGVSSSSSQEASRLPEES------------FRVTAAKKSREV
GENPEPT_3192877      ----------------SDSGVSSSSSQEASRLPEES------------FRVTAAKKSREV
MLH1_MOUSE           SENLERESLMETSDAAQKAAPTSSPGSSRKRHREDSDVEMVENASGKEMTAACYPRRRII
MLH1_RAT             SASLERESVIGASEVVAPQRHPSSPGSSRKRHPEDSDVEMMENDSRKEMTAACYPRRRII
MLH1_HUMAN           NQSLEGDTTKGTSEMSEKRGPTSS--NPRKRHREDSDVEMVEDDSRKEMTAACTPRRRII
MLH1_YEAST           NESEQPRDANTINDNDLKDQPKKKQKLGDYKVPSIADDEKNALPISKDGYIRVPKERVNV
GENPEPT_3880333      STGGEKEPEDLNTDFDDVSMVSLVSTADGRRLNESQD-----LGEDDDVDFEYGKTHREF
                                                   :  .                         .
 
GENPEPT_7304079      RLSSVLDMRKRVERQCSVQLRSTLKNLVYVGCVDERR--ALFQHETRLYMCNTRSFSEEL
GENPEPT_3192877      RLSSVLDMRKRVERQCSVQLRSTLKNLVYVGCVDERR--ALFQHETRLYMCNTRSFSEEL
MLH1_MOUSE           NLTSVLSLQEEISERCHETLREILRNHSFVGCVNPQW--ALAQHQTKLYLLNTTKLSEEL
MLH1_RAT             NLTSVLSLQEEINDRGHETLREMLRNHTFVGCVNPQW--ALAQHQTKLYLLNTTKLSEEL
MLH1_HUMAN           NLTSVLSLQEEINEQGHEVLREMLHNHSFVGCVNPQW--ALAQHQTKLYLLNTTKLSEEL
MLH1_YEAST           NLTSIKKLREKVDDSIHRELTDIFANLNYVGVVDEERRLAAIQHDLKLFLIDYGSVCYEL
GENPEPT_3880333      HFESIEVLRKEIIANSSQSLREMFKTSTFVGSINVKQ--VLIQFGTSLYHLDFSTVLREF
                     .: *:  :::.:       * . : .  :** :: .   .  *.   *:  :  ..  *:
 
GENPEPT_7304079      FYQRMIYEFQNCSEITISPPLPLKELLILSLESEAAGWTPEDGDKAELA-----DGAADI
GENPEPT_3192877      FYQRMIYEFQNCSEITICPPLPLKELLILSLESRAAGWTPEDEDKAELA-----DGAADI
MLH1_MOUSE           FYQILIYDFANFGVLRLSEPAPLFDLAMLALDSPESGWTEDDGPKEGLA-----EYIVEF
MLH1_RAT             FYQILIYDFANFGVLRLPEPAPLFDFAMLALDSPESGWTEEDGPKEGLA-----EYIVEF
MLH1_HUMAN           FYQILIYDFANFGVLRLSEPAPLFDLAMLALDSPESGWTEEDGPKEGLA-----EYIVEF
MLH1_YEAST           FYQIGLTDFANFGKINLQSTNVSDDIVLYNLLSEFDELN-DDASK---------EKIISK
GENPEPT_3880333      FYQISVFSFGNYGSYRLDE-EPPAIIEILELLGELSTREPNYAAFEVFANVENRFAAEKL
                     ***  : .* * .   :        : :  * .       :                 . 
 
GENPEPT_7304079      LLKKAPIMREYFGLRISEDGM--------LESLPSLLHQHRPCVAHLPVYLLRLATEVDW
GENPEPT_3192877      LLKKAPIMREYFGLRISEDGM--------LESLPSLLHQHRPCVAHLPVYLLRLATEVDW
MLH1_MOUSE           LKKKAEMLADYFSVEIDEEGN--------LIGLPLLIDSYVPPLEGLPIFILRLATEVNW
MLH1_RAT             LKKKAKMLADYFSVEIDEEGN--------LIGLPLLIDSYVPPLEGLPIFILRLATEVNW
MLH1_HUMAN           LKKKAEMLADYFSLEIDEEGN--------LIGLPLLIDNYVPPLEGLPIFILRLATEVNW
MLH1_YEAST           IWDMSSMLNEYYSIELVNDGLDNDLKSVKLKSLPLLLKGYIPSLVKLPFFIYRLGKEVDW
GENPEPT_3880333      LAEHADLLHDYFAIKLDQLENGR----LHITEIPSLVHYFVPQLEKLPFLIATLVLNVDY
                     : . : :: :*:.:.: :           :  :* *:. . * :  **. :  *  :*::
 
GENPEPT_7304079      EQETRCFETFCRETARFY--------------AQLDWREGATAGFSRWT--MEHVLFPAF
GENPEPT_3192877      EQETRCFETFCRETARFY--------------AQLDWREGATAVFSRWT--MEHVLFPAF
MLH1_MOUSE           DEEKECFESLSKECAMFYSIRKQYILEESTLSGQQSDMPGSTSKPWKWT--VEHIIYKAF
MLH1_RAT             DEE-ECFESLSKECAVFYSIRKQYILEESALSGQQSDMPGSPSKPWKWT--VEHIIYKAF
MLH1_HUMAN           DEEKECFESLSKECAMFYSIRKQYISEESTLSGQQSEVPGSIPNSWKWT--VEHIVYKAL
MLH1_YEAST           EDEQECLDGILREIALLYIPDMVPKVDTSDASLSEDEKAQFINRKEHISSLLEHVLFPCI
GENPEPT_3880333      DDEQNTFRTICRAIGDLFTLDTN---------FITLDKKISAFSATPWKTLIKEVLMPLV
                     ::* . :  : :  . ::                              .  ::.::   .
 
GENPEPT_7304079      KKYLLPPPRIKD--QIYELTNLPTLYKVFERC--
GENPEPT_3192877      KKYLLPP-RIKD--QIYELTNLPTLYKVFERC--
MLH1_MOUSE           RSHLLPPKHFTEDGNVLQLANLPDLYKVFERC--
MLH1_RAT             RSHLLPPKHFTEDGNVLQLANLPDLCKVFERC--
MLH1_HUMAN           RSHILPPKHFTEDGNILQLANLPDLYKVFERC--
MLH1_YEAST           KRRFLAPRHILK--DVVEIANLPDLYKVFERC--
GENPEPT_3880333      KRKFIPPEHFKQAGVIRQLADSHDLYKVFERCGT
                     :  ::.* :: .   : ::::   * ******