Compare human colon cancer gene MLH1 with other genes


- Use ORF Finder to translate a DNA sequence into protein sequences in all six reading frames.
- Use blastn, blastp, CD-Search and BLAST 2 Sequences for searching and comparison.
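
What ORF Finder does in its translation step can be sketched in a few lines of Python. This is only an illustration, assuming the standard genetic code; it is not the ORF Finder implementation. The function names and the toy DNA string are made up for the example.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G order
# (first base changes slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def reverse_complement(dna):
    """Return the reverse complement of a DNA string (A/C/G/T only)."""
    return dna.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(dna):
    """Translate from the first base; unknown codons become 'X', stops '*'."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def six_frame_translation(dna):
    """Translate the three forward and three reverse-strand reading frames."""
    rc = reverse_complement(dna)
    frames = {}
    for offset in range(3):
        frames[f"+{offset + 1}"] = translate(dna[offset:])
        frames[f"-{offset + 1}"] = translate(rc[offset:])
    return frames


if __name__ == "__main__":
    # Toy sequence, invented for this example (not the MLH1 gene sequence).
    for name, protein in six_frame_translation("ATGTCTTTCGTGGCA").items():
        print(name, protein)
```

A real ORF search would additionally look for start codons and report only stretches between a start and a stop; the sketch stops at raw translation.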

1. Compare the MLH1 (answer to assignment 2.6) and mutS (answer to 2.7) sequences.

answer: No significant similarity was found.


2. Translate the above two gene sequences to protein sequences.

answer:

>gi|463989|gb|AAC50285.1| hMLH1
MSFVAGVIRRLDETVVNRIAAGEVIQRPANAIKEMIENCLDAKSTSIQVIVKEGGLKLIQIQDNGTGIRKEDLDIVCERF
TTSKLQSFEDLASISTYGFRGEALASISHVAHVTITTKTADGKCAYRASYSDGKLKAPPKPCAGNQGTQITVEDLFYNIA
TRRKALKNPSEEYGKILEVVGRYSVHNAGISFSVKKQGETVADVRTLPNASTVDNIRSIFGNAVSRELIEIGCEDKTLAF
KMNGYISNANYSVKKCIFLLFINHRLVESTSLRKAIETVYAAYLPKNTHPFLYLSLEISPQNVDVNVHPTKHEVHFLHEE
SILERVQQHIESKLLGSNSSRMYFTQTLLPGLAGPSGEMVKSTTSLTSSSTSGSSDKVYAHQMVRTDSREQKLDAFLQPL
SKPLSSQPQAIVTEDKTDISSGRARQQDEEMLELPAPAEVAAKNQSLEGDTTKGTSEMSEKRGPTSSNPRKRHREDSDVE
MVEDDSRKEMTAACTPRRRIINLTSVLSLQEEINEQGHEVLREMLHNHSFVGCVNPQWALAQHQTKLYLLNTTKLSEELF
YQILIYDFANFGVLRLSEPAPLFDLAMLALDSPESGWTEEDGPKEGLAEYIVEFLKKKAEMLADYFSLEIDEEGNLIGLP
LLIDNYVPPLEGLPIFILRLATEVNWDEEKECFESLSKECAMFYSIRKQYISEESTLSGQQSEVPGSIPNSWKWTVEHIV
YKALRSHILPPKHFTEDGNILQLANLPDLYKVFERC

>gi|1592569|gb|AAB97931.1| DNA mismatch repair protein [Escherichia coli]
MSAIENFDAHTPMMQQYLKLKAQHPEILLFYRMGDFYELFYDDAKRASQLLDISLTKRSASAGEPIPMAGIPYHAVENYL
AKLVNQGESVAICEQIGDPATSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFAYATLDISSGRFRLSEP
ADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRPLWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCL
LQYAKDTQRTTLPHIRSITMEREQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVTPMGSRMLKRWLHMPVRHTRV
LLERQQTIGALQDFTAELQPVLRQVGDLERILARLALRTARPRDLARMRHAFQQLPELRAQLETVDSAPVQALREKMGEF
AELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERLEVRERERTGLDTLKVGFNAVHGYYIQISRG
QSHLAPINYMRRQTLKNAERYIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALAELDVLVNLAE
RAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANPLNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYV
PAQKVEIGPIDRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTSTYDGLSLAWACAENLANKIK
ALTLFATHYFELTQLPEKMEGVANVHLDALEHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESIS
PNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLKSLV
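
The two records above are in FASTA format: a header line starting with ">" followed by wrapped sequence lines. A minimal sketch for reading such text into (header, sequence) pairs is given here; the function name parse_fasta and the short sample records are assumptions for illustration only.

```python
def parse_fasta(text):
    """Return a list of (header, sequence) tuples from FASTA-formatted text."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)  # wrapped sequence lines are joined later
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


if __name__ == "__main__":
    # Shortened, made-up records; paste the full entries to use real data.
    sample = """>example_1 first toy protein
MSFVAG
VIRRLD

>example_2 second toy protein
MSAIEN
"""
    for header, sequence in parse_fasta(sample):
        print(header, len(sequence))
```

Joining the wrapped lines first makes it easy to check a sequence length or to pass the sequence to another tool.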



3. Perform a protein sequence homology search for MLH1 in GenBank. Give the 10 highest-scoring hits.

answer:

GI:13878583, MLH1_Mouse, DNA MISMATCH REPAIR PROTEIN MLH1 (MUTL PROTEIN HOMOLOG 1).
GI:13591989, NP_112315, mismatch repair protein [Rattus norvegicus].
GI:4557757, NP_000240, mutL homolog 1; mutL (E. coli) homolog 1 [Homo sapiens].
GI:466462, AAA17374, (U07418) human homolog of E. coli mutL gene product, Swiss-Prot Accession Number P23367 [Homo sapiens]
GI:604369, AAA85687.1, hMLH1 gene product [Homo sapiens].
GI:12835158, BAB23172.1, putative [Mus musculus].
GI:13543339, AAH05833.1, Similar to mutL (E. coli) homolog 1 (colon cancer, nonpolyposis type 2) [Homo sapiens].
GI:7304079, AAF59117.1, Mlh1 gene product [Drosophila melanogaster].
GI:3192877, AAC19117.1, mutL homolog [Drosophila melanogaster].
GI:460627, AAA16835.1, Mlh1p [Saccharomyces cerevisiae].



4. Compare human MLH1 protein with MLH1 in M. musculus, R. norvegicus and D. melanogaster. Give the pairwise alignments and % sequence similarity.

answer: (link)

Mus musculus: Identities = 651/760 (85%), Positives = 693/760 (90%), Gaps = 4/760 (0%)
Rattus norvegicus: Identities = 639/758 (84%), Positives = 684/758 (89%), Gaps = 3/758 (0%)
Drosophila melanogaster: Identities = 335/751 (44%), Positives = 453/751 (59%), Gaps = 94/751 (12%)
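
The identity and gap counts in such a report are simple column counts over the aligned sequences. The sketch below shows how they could be computed from two gapped strings of equal length. It is an illustration, not the BLAST code: the function names and the toy alignment are invented, and "Positives" are left out because they require a substitution matrix.

```python
def alignment_stats(aligned_a, aligned_b, gap="-"):
    """Count identical columns and gap columns in a pairwise alignment.

    Both inputs must be the gapped rows of one alignment, so they have
    the same length. Returns (identities, gaps, alignment_length).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    identities = 0
    gaps = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            gaps += 1
        elif a == b:
            identities += 1
    return identities, gaps, len(aligned_a)


def percent(count, length):
    """Express a column count as a percentage of the alignment length."""
    return 100.0 * count / length


if __name__ == "__main__":
    # Toy alignment, invented for the example.
    ident, gaps, length = alignment_stats("MSFVAG-V", "MSFIAGEV")
    print(f"Identities = {ident}/{length} ({percent(ident, length):.1f}%)")
    print(f"Gaps = {gaps}/{length} ({percent(gaps, length):.1f}%)")
```

Applied to the mouse counts reported above, percent(651, 760) is about 85.7, which the report lists as 85%.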

5. Search for the conserved domain (CD) in MLH1. Give the position of the CD, the name of the CD and the Pfam ID number.

position: amino acids 147-327
name: DNA_mis_repair, DNA mismatch repair protein. Also known as the mutL/hexB/PMS1 pfam family.
ID: pfam01119 (link)
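
Domain positions such as 147-327 are 1-based and include both end residues, while Python slices are 0-based and exclude the end index. A small sketch of the conversion is shown here; the helper name extract_region and the placeholder sequence are assumptions for illustration.

```python
def extract_region(sequence, start, end):
    """Return residues start..end of a sequence, 1-based and inclusive."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("region lies outside the sequence")
    # Shift only the start: the slice end is exclusive, so 'end' stays as is.
    return sequence[start - 1:end]


if __name__ == "__main__":
    # Placeholder string; substitute the real protein sequence to get the domain.
    placeholder = "A" * 400
    domain = extract_region(placeholder, 147, 327)
    print(len(domain))  # 327 - 147 + 1 residues
```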


6. Show the multiple alignment of the MLH1 conserved domain with the top 5 sequences from the CD alignment.

answer: (link)