Sequence alignment

GENPEPT:1724118 Rattus norvegicus mismatch repair protein (MLH1) mRNA, complete
GENPEPT:3192877 Drosophila melanogaster mutL homolog (Mlh1) gene, complete cds_
GENPEPT:7595954 Mus musculus MutL homolog 1 protein (MLH1) mRNA, complete cds.
GENPEPT:460627 Saccharomyces cerevisiae DNA mismatch repair (MLH1) gene, complete
MLH1_HUMAN DNA MISMATCH REPAIR PROTEIN MLH1 (MUTL PROTEIN HOMOLOG 1) [Homo sapiens (Human)]
C_elegans_gene:3876842_emb_CAA98478.1_ 3876842 similar to DNA mismatch repair protein_ cDNA EST EMBL:D37616 comes from this gene_ cDNA EST EMBL:D68353 comes from this gene_ cDNA EST EMBL:D68679 comes from this gene_ cDNA EST EMBL:C12838 comes from this gene_ cDNA EST EMBL:D67162 comes from this gene
Consensus key (see documentation for details)
* - single, fully conserved residue
: - conservation of strong groups
. - conservation of weak groups
  - no consensus


CLUSTAL W (1.81) multiple sequence alignment


GENPEPT_7595954                   --MAFVAGVIRRLDETVVNRIAAGEVIQRPANAIKEMIENCLDAKSTNIQ
GENPEPT_1724118                   --MSFVAGVIRRLDETVVNRIAAGEVIQRPANAIKEMTENCLDAKSTNIQ
MLH1_HUMAN                        --MSFVAGVIRRLDETVVNRIAAGEVIQRPANAIKEMIENCLDAKSTSIQ
GENPEPT_3192877                   MAEYLQPGVIRKLDEVVVNRIAAGEIIQRPANALKELLENSLDAQSTHIQ
GENPEPT_460627                    -----MSLRIKALDASVVNKIAAGEIIISPVNALKEMMENSIDANATMID
C_elegans_gene_3876842_emb_C      ----MSQNKIERISKEVAERLTTAQVVVSLSSAIRQLIDNSIDAGSTIID
                                           *. :.  *.:::::.:::    .*:::: :*.:** :* *:

GENPEPT_7595954                   VVVKEGGLKLIQIQDNGTGIRKEDLDIVCERFTTSKLQTFEDLASISTYG
GENPEPT_1724118                   VIVREGGLKLIQIQDNGTGIRKEDLDIVCERFTTSKLQTFEDLAMISTYG
MLH1_HUMAN                        VIVKEGGLKLIQIQDNGTGIRKEDLDIVCERFTTSKLQSFEDLASISTYG
GENPEPT_3192877                   VQVKAGGLKLLQIQDNGTGIRREDLAIVCERFTTSKLTRFEDLSQIATFG
GENPEPT_460627                    ILVKEGGIKVLQITDNGSGINKADLPILCERFTTSKLQKFEDLSQIQTYG
C_elegans_gene_3876842_emb_C      IRVKNNGFESIEVQDNGSGIEARNFDALCKPHSTSKLTQFSDFDKLATLG
                                  : *: .*:: ::: ***:**.  ::  :*: .:****  *.*:  : * *

GENPEPT_7595954                   FRGEALASISHVAHVTITTKTADGKCAYRASYSDGKLQAPPKPCAGNQGT
GENPEPT_1724118                   FRGEALASISHVAHVTITTKTADGKCAYRASYSDGKLQAPPKPCAGNQGT
MLH1_HUMAN                        FRGEALASISHVAHVTITTKTADGKCAYRASYSDGKLKAPPKPCAGNQGT
GENPEPT_3192877                   FRGEALASISHVAHLSIQTKTAKEKCGYKATYADGKLQGQPKPCAGNQGT
GENPEPT_460627                    FRGEALASISHVARVTVTTKVKEDRCAWRVSYAEGKMLESPKPVAGKDGT
C_elegans_gene_3876842_emb_C      FRGEALNALCTVSSVSIFTRASDTEIGTRLTYDHSGNIICRQSAARELGT
                                  ****** ::. *: ::: *:. . . . : :* ..      :. * : **

GENPEPT_7595954                   LITVEDLFYNIITRRKALK-NPSEEYGKILEVVGRYSIHNSGISFSVKK-
GENPEPT_1724118                   LITVEDLFYNIITRKKALK-NPSEEYGKILEVVGRYSIHNSGISFSVKK-
MLH1_HUMAN                        QITVEDLFYNIATRRKALK-NPSEEYGKILEVVGRYSVHNAGISFSVKK-
GENPEPT_3192877                   IICIEDLFYNMPQRRQALR-SPAEEFQRLSEVLARYAVHNPRVGFTLRK-
GENPEPT_460627                    TILVEDLFFNIPSRLRALR-SHNDEYSKILDVVGRYAIHSKDIGFSCKK-
C_elegans_gene_3876842_emb_C      TIIVNKLFETLPVRRKELERSQKREFVKLLSTVQSFALLCPHIKILCTNN
                                   * ::.** .:  * : *. .   *: :: ..:  :::    : :   : 

GENPEPT_7595954                   -QGETVSDVRTLPNATTVDNIRS--------IFGNAVSRELIEVG-----
GENPEPT_1724118                   -QGETVSDVRTLPNATTVDNIRS--------IFGNAVSRELIEVG-----
MLH1_HUMAN                        -QGETVADVRTLPNASTVDNIRS--------IFGNAVSRELIEIG-----
GENPEPT_3192877                   -QGDAQPALRTPVASSRSENIRI--------IYGAAISKELLEFS-----
GENPEPT_460627                    -FGDSNYSLSVKPSYTVQDRIRT--------VFNKSVASNLITFHI----
C_elegans_gene_3876842_emb_C      INGKKTNLICTPGGTTSIQDVVANLFGIARKIENSKIGSGLIPIQQNQPD
                                    *.    : .    :  : :          : .  :.  *: .      

GENPEPT_7595954                   -----------CEDKTLAFK-MNGYISNANYSVKKCI---FLLFINHRLV
GENPEPT_1724118                   -----------CEDKTLAFK-MNGYISNANYSVKKCI---FLLFINHRLV
MLH1_HUMAN                        -----------CEDKTLAFK-MNGYISNANYSVKKCI---FLLFINHRLV
GENPEPT_3192877                   -----------HRDEVYKFE-AECLITQVNYSAKKCQ---MLLFINQRLV
GENPEPT_460627                    -----------SKVEDLNLESVDGKVCNLNFISKKSIS--LIFFINNRLV
C_elegans_gene_3876842_emb_C      VEIMTIHSVPMEEMHFFDLFKIRGFVSSCEHGCGRGTSDRQFVYINNRPV
                                              . .   :      : . :.   :      :.:**:* *

GENPEPT_7595954                   ESAALRKAIETVYAAYLPKNTHPFLYLSLEISPQNVDVNVHPTKHEVHFL
GENPEPT_1724118                   ESAALKKAIEAVYAAYLPKNTHPFLYLILEISPQNVDVNVHPTKHEVHFL
MLH1_HUMAN                        ESTSLRKAIETVYAAYLPKNTHPFLYLSLEISPQNVDVNVHPTKHEVHFL
GENPEPT_3192877                   ESTALRTSVDSIYATYLPRGHHPFVYMSLTLPPQNLDVNVHPTKHEVHFL
GENPEPT_460627                    TCDLLRRALNSVYSNYLPKGFRPFIYLGIVIDPAAVDVNVHPTKREVRFL
C_elegans_gene_3876842_emb_C      EYSRVCSVINDVYKQFNKK-QYPIIVLFIDVPPEKIDVNVTPDKKTVMLE
                                      :   :: :*  :  :   *:: : : : *  :**** * *: * : 

GENPEPT_7595954                   HEESILQRVQQHIESKLLGSNSSRMYFTQTLLPGLAG---------PSGE
GENPEPT_1724118                   HEESILERVQQHIESKLLGSNSSRMYFTQTLLPGLAG---------PSGE
MLH1_HUMAN                        HEESILERVQQHIESKLLGSNSSRMYFTQTLLPGLAG---------PSGE
GENPEPT_3192877                   YQEEIVDSIKQQVEARLLGSNATRTFYKQLRLPGAP--------------
GENPEPT_460627                    SQDEIIEKIANQLHAELSAIDTSRTFKASSISTNKPESLIPF---NDTIE
C_elegans_gene_3876842_emb_C      KERHLLAVVRASMMKTYLKIVGSHSTVRSSVEDRRIMNLSQQSFSNASFM
                                   :  ::  :   :         ::    .                     

GENPEPT_7595954                   AARPTTGVASSSTSGSGDKVYAYQMVRTDSRDQKLDAFLQPVSSLVPSQP
GENPEPT_1724118                   AVKSTTGIASSSTSGSGDKVHAYQMVRTDSRDQKLDAFMQPVSRRLPSQP
MLH1_HUMAN                        MVKSTTSLTSSSTSGSSDKVYAHQMVRTDSREQKLDAFLQPLSKPLSSQP
GENPEPT_3192877                   ------DLDETQLADKTQRIYPKEMVRTDSTEQKLDKFLAPLVK------
GENPEPT_460627                    SDRNRKSLRQAQVVENSYTTANSQLRKAKRQENKLVRIDASQAKITSFLS
C_elegans_gene_3876842_emb_C      SSKSSTPDDFNNTTLNSTYPEDSLLNTSDLLKQRKENRSPPAKKSCPMIR
                                             .   .        :  :.  .::      .         

GENPEPT_7595954                   QDPAPVRGARTEGSPERATREDEEMLALPAPAEAAAESENLERESLMETS
GENPEPT_1724118                   QD--PVPGNRTEGSPEKAMQKDQEISELPAPMEAAADSASLERESVIGAS
MLH1_HUMAN                        Q--AIVTEDKTDISSGRARQQDEEMLELPAPAEVAAKNQSLEGDTTKGTS
GENPEPT_3192877                   --------------------------------------------------
GENPEPT_460627                    SS--QQFNFEGSSTKRQLSEPKVTNVSHSQEAEKLTLNESEQPRDANTIN
C_elegans_gene_3876842_emb_C      RT----EPFHSVPSTSNSRTQRLENFSFTMEPKRVEVSKKIPSKSDKKLT


GENPEPT_7595954                   DAAQKAAPTSSPGSSRKRHREDSDVEMVENASGKEMTAACYPRRRIINLT
GENPEPT_1724118                   EVVAPQRHPSSPGSSRKRHPEDSDVEMMENDSRKEMTAACYPRRRIINLT
MLH1_HUMAN                        EMSEKRGPTSS--NPRKRHREDSDVEMVEDDSRKEMTAACTPRRRIINLT
GENPEPT_3192877                   ---SDSGVSSSSSQEASRLPEES------------FRVTAAKKSREVRLS
GENPEPT_460627                    DNDLKDQPKKKQKLGDYKVPSIADDEKNALPISKDGYIRVPKERVNVNLT
C_elegans_gene_3876842_emb_C      DEELRSAVIEENPLKKAGEIDDIEILEQSQESQDVNESQCSQDSQTSQNS
                                           ..         .                          . :

GENPEPT_7595954                   SVLSLQEEISERCHETLREILRNHS-FVGCVNPQW--ALAQHQTKLYLLN
GENPEPT_1724118                   SVLSLQEEINDRGHETLREMLRNHT-FVGCVNPQW--ALAQHQTKLYLLN
MLH1_HUMAN                        SVLSLQEEINEQGHEVLREMLHNHS-FVGCVNPQW--ALAQHQTKLYLLN
GENPEPT_3192877                   SVLDMRKRVERQCSVQLRSTLKNLV-YVGCVDERR--ALFQHETRLYMCN
GENPEPT_460627                    SIKKLREKVDDSIHRELTDIFANLN-YVGVVDEERRLAAIQHDLKLFLID
C_elegans_gene_3876842_emb_C      RVSYFTLRPQQKIKFSMKLLREAYSPKTDETDDNTEEAEVSAEKDVLNEI
                                   :  :  . .      :          .. .: .   *  . :  :    

GENPEPT_7595954                   TTKLSEELFYQILIYDFANFGVLRLSEPAPLFDLAMLALDSPESGWTEDD
GENPEPT_1724118                   TTKLSEELFYQILIYDFANFGVLRLPEPAPLFDFAMLALDSPESGWTEED
MLH1_HUMAN                        TTKLSEELFYQILIYDFANFGVLRLSEPAPLFDLAMLALDSPESGWTEED
GENPEPT_3192877                   TRSFSEELFYQRMIYEFQNCSEITICPPLPLKELLILSLESRAAGWTPED
GENPEPT_460627                    YGSVCYELFYQIGLTDFANFGKINLQSTNVSDDIVLYNLLSEFDELN-DD
C_elegans_gene_3876842_emb_C      TTKINKEENDDAERQLSRSLTKDDFSKMKIIGQFNHGFIICRLRGHLFIV
                                    ..  *   :       .     :       ::    : .         

GENPEPT_7595954                   GPKEGLAEYIVEFLKKKAEMLADYFSVEIDEEGN--------LIGLPLLI
GENPEPT_1724118                   GPKEGLAEYIVEFLKKKAKMLADYFSVEIDEEGN--------LIGLPLLI
MLH1_HUMAN                        GPKEGLAEYIVEFLKKKAEMLADYFSLEIDEEGN--------LIGLPLLI
GENPEPT_3192877                   EDKAELADGAADILLKKAPIMREYFGLRISEDGM--------LESLPSLL
GENPEPT_460627                    ASK----EKIISKIWDMSSMLNEYYSIELVNDGLDNDLKSVKLKSLPLLL
C_elegans_gene_3876842_emb_C      DQHASDEKYNFERLQSSAKLTKQPLFMPT-ALGFGAVQELIIRENLPIFH
                                    :    .   . : . : :  :   :     *           .** : 

GENPEPT_7595954                   DS-----YVPPLEGLPIFILRLATEVNWDEEKECFESLSKECAMFYSIRK
GENPEPT_1724118                   DS-----YVPPLEGLPIFILRLATEVNWDEE-ECFESLSKECAVFYSIRK
MLH1_HUMAN                        DN-----YVPPLEGLPIFILRLATEVNWDEEKECFESLSKECAMFYSIRK
GENPEPT_3192877                   HQ-----HRPCVAHLPVYLLRLATEVDWEQETRCFETFCRETARFY----
GENPEPT_460627                    KG-----YIPSLVKLPFFIYRLGKEVDWEDEQECLDGILREIALLYIPDM
C_elegans_gene_3876842_emb_C      ANGFDFEFSENDGCIKTFLTARPELLNQQLTNSDLEEILAVVSQYPNQMY
                                         .      :  ::      :: :     :: :    :       

GENPEPT_7595954                   QYILEESTLSGQQSDMPGSTSKPWKWT--VEHIIYKAFRSHLLPPKHFTE
GENPEPT_1724118                   QYILEESALSGQQSDMPGSPSKPWKWT--VEHIIYKAFRSHLLPPKHFTE
MLH1_HUMAN                        QYISEESTLSGQQSEVPGSIPNSWKWT--VEHIVYKALRSHILPPKHFTE
GENPEPT_3192877                   ----------AQLDWREGATAVFSRWT--MEHVLFPAFKKYLLPPR---I
GENPEPT_460627                    VPKVDTLDASLSEDEKAQFINRKEHISSLLEHVLFPCIKRRFLAPRHILK
C_elegans_gene_3876842_emb_C      RPVRIRKIFASKACRKSVMIGKPLNQR--EMTQIIRHLAKLDQPWNCPHG
                                             .            .        :   :     . .    

GENPEPT_7595954                   DGNVLQLANLPDLYKVFERC
GENPEPT_1724118                   DGNVLQLANLPDLCKVFERC
MLH1_HUMAN                        DGNILQLANLPDLYKVFERC
GENPEPT_3192877                   KDQIYELTNLPTLYKVFERC
GENPEPT_460627                    D--VVEIANLPDLYKVFERC
C_elegans_gene_3876842_emb_C      RPTIRHLASLPDRAEFE---
                                     : .::.**   :.