Search genes associated with colon cancer in human genome


MLH1 was identified as a locus frequently mutated in hereditary nonpolyposis colon cancer (HNPCC). When cloned, it was discovered to be a human homolog of the E. coli mismatch repair gene mutS.




1. How many hits will you get if you search genes associated with colon cancer in human genome?

Ans: 17hits

2. How many loci will you find if you search locus link for human in Genebank?

Ans: 21

3. Give the locus ID and position of MLH1.

Ans: ID:4292 position:3p21.3


4. Find the %ID of nucleotide sequence for its possible orthologs in mouse.

Ans: 89.3%ID with LocusLink 17350 MLH1

5. Find the total number of mutations of MLH1 reported in human gene mutation database.

Ans: 147

6. Give the DNA sequence of MLH1.

Ans:

1 ggctcttctg gcgccaaaat gtcgttcgtg gcaggggtta ttcggcggct ggacgagaca
61 gtggtgaacc gcatcgcggc gggggaagtt atccagcggc cagctaatgc tatcaaagag
121 atgattgaga actgtttaga tgcaaaatcc acaagtattc aagtgattgt taaagaggga
181 ggcctgaagt tgattcagat ccaagacaat ggcaccggga tcaggaaaga agatctggat
241 attgtatgtg aaaggttcac tactagtaaa ctgcagtcct ttgaggattt agccagtatt
301 tctacctatg gctttcgagg tgaggctttg gccagcataa gccatgtggc tcatgttact
361 attacaacga aaacagctga tggaaagtgt gcatacagag caagttactc agatggaaaa
421 ctgaaagccc ctcctaaacc atgtgctggc aatcaaggga cccagatcac ggtggaggac
481 cttttttaca acatagccac gaggagaaaa gctttaaaaa atccaagtga agaatatggg
541 aaaattttgg aagttgttgg caggtattca gtacacaatg caggcattag tttctcagtt
601 aaaaaacaag gagagacagt agctgatgtt aggacactac ccaatgcctc aaccgtggac
661 aatattcgct ccatctttgg aaatgctgtt agtcgagaac tgatagaaat tggatgtgag
721 gataaaaccc tagccttcaa aatgaatggt tacatatcca atgcaaacta ctcagtgaag
781 aagtgcatct tcttactctt catcaaccat cgtctggtag aatcaacttc cttgagaaaa
841 gccatagaaa cagtgtatgc agcctatttg cccaaaaaca cacacccatt cctgtacctc
901 agtttagaaa tcagtcccca gaatgtggat gttaatgtgc accccacaaa gcatgaagtt
961 cacttcctgc acgaggagag catcctggag cgggtgcagc agcacatcga gagcaagctc
1021 ctgggctcca attcctccag gatgtacttc acccagactt tgctaccagg acttgctggc
1081 ccctctgggg agatggttaa atccacaaca agtctgacct cgtcttctac ttctggaagt
1141 agtgataagg tctatgccca ccagatggtt cgtacagatt cccgggaaca gaagcttgat
1201 gcatttctgc agcctctgag caaacccctg tccagtcagc cccaggccat tgtcacagag
1261 gataagacag atatttctag tggcagggct aggcagcaag atgaggagat gcttgaactc
1321 ccagcccctg ctgaagtggc tgccaaaaat cagagcttgg agggggatac aacaaagggg
1381 acttcagaaa tgtcagagaa gagaggacct acttccagca accccagaaa gagacatcgg
1441 gaagattctg atgtggaaat ggtggaagat gattcccgaa aggaaatgac tgcagcttgt
1501 accccccgga gaaggatcat taacctcact agtgttttga gtctccagga agaaattaat
1561 gagcagggac atgaggttct ccgggagatg ttgcataacc actccttcgt gggctgtgtg
1621 aatcctcagt gggccttggc acagcatcaa accaagttat accttctcaa caccaccaag
1681 cttagtgaag aactgttcta ccagatactc atttatgatt ttgccaattt tggtgttctc
1741 aggttatcgg agccagcacc gctctttgac cttgccatgc ttgccttaga tagtccagag
1801 agtggctgga cagaggaaga tggtcccaaa gaaggacttg ctgaatacat tgttgagttt
1861 ctgaagaaga aggctgagat gcttgcagac tatttctctt tggaaattga tgaggaaggg
1921 aacctgattg gattacccct tctgattgac aactatgtgc cccctttgga gggactgcct
1981 atcttcattc ttcgactagc cactgaggtg aattgggacg aagaaaagga atgttttgaa
2041 agcctcagta aagaatgcgc tatgttctat tccatccgga agcagtacat atctgaggag
2101 tcgaccctct caggccagca gagtgaagtg cctggctcct ggaagtggac tgtggaacac
2161 attgtctata aagccttgcg ctcacacatt ctgcctccta aacatttcac agaagatgga
2221 aatatcctgc agcttgctaa cctgcctgat ctatacaaag tctttgagag gtgttaaata
2281 tggttattta tgcactgtgg gatgtgttct tctttctctg tattc
(only exons)

7. Give the DNA sequence of E. coli mismatch repair gene mutS

Ans:

1 tcgcgcattt tcttcaacca ggaggtgagg aggtttcgac atggcggtgc agccgaagga
61 gacgctgcag ttggagagcg cggccgaggt cggcttcgtg cgcttctttc agggcatgcc
121 ggagaagccg accaccacag tgcgcctttt cgaccggggc gacttctata cggcgcacgg
181 cgaggacgcg ctgctggccg cccgggaggt gttcaagacc cagggggtga tcaagtacat
241 ggggccggca ggagcaaaga atctgcagag tgttgtgctt agtaaaatga attttgaatc
301 ttttgtaaaa gatcttcttc tggttcgtca gtatagagtt gaagtttata agaatagagc
361 tggaaataag gcatccaagg agaatgattg gtatttggca tataaggctt ctcctggcaa
421 tctctctcag tttgaagaca ttctctttgg taacaatgat atgtcagctt ccattggtgt
481 tgtgggtgtt aaaatgtccg cagttgatgg ccagagacag gttggagttg ggtatgtgga
541 ttccatacag aggaaactag gactgtgtga attccctgat aatgatcagt tctccaatct
601 tgaggctctc ctcatccaga ttggaccaaa ggaatgtgtt ttacccggag gagagactgc
661 tggagacatg gggaaactga gacagataat tcaaagagga ggaattctga tcacagaaag
721 aaaaaaagct gacttttcca caaaagacat ttatcaggac ctcaaccggt tgttgaaagg
781 caaaaaggga gagcagatga atagtgctgt attgccagaa atggagaatc aggttgcagt
841 ttcatcactg tctgcggtaa tcaagttttt agaactctta tcagatgatt ccaactttgg
901 atagtttgaa ctgactactt ttgacttcag ccagtatatg aaattggata ttgcagcagt
961 cagagccctt aacctttttc agggttctgt tgaagatacc actggctctc agtctctggc
1021 tgccttgctg aataagtgta aaacccctca aggacaaaga cttgttaacc agtggattaa
1081 gcagcctctc atggataaga acagaataga ggagagattg aatttagtgg aagcttttgt
1141 agaagatgca gaattgaggc agactttaca agaagattta cttcgtcgat tcccagatct
1201 taaccgactt gccaagaagt ttcaaagaca agcagcaaac ttacaagatt gttaccgact
1261 ctatcagggt ataaatcaac tacctaatgt tatacaggct ctggaaaaac atgaaggaaa
1321 acaccagaaa ttattgttgg cagtttttgt gactcctctt actgatcttc gttctgactt
1381 ctccaagttt caggaaatga tagaaacaac tttagatatg gatcaggtgg aaaaccatga
1441 attccttgta aaaccttcat ttgatcctaa tctcagtgaa ttaagagaaa taatgaatga
1501 cttggaaaag aagatgcagt caacattaat aagtgcagcc agagatcttg gcttggaccc
1561 tggcaaacag attaaactgg attccagtgc acagtttgga tattactttc gtgtaacctg
1621 taaggaagaa aaagtccttc gtaacaataa aaactttagt actgtagata tccagaagaa
1681 tggtgttaaa tttaccaaca gcaaattgac ttctttaaat gaagagtata ccaaaaataa
1741 aacagaatat gaagaagccc aggatgccat tgttaaagaa attgtcaata tttcttcagg
1801 ctatgtagaa ccaatgcaga cactcaatga tgtgttagct cagctagatg ctgttgtcag
1861 ctttgctcac gtgtcaaatg gagcacctgt tccatatgta cgaccagcca ttttggagaa
1921 aggacaagga agaattatat taaaagcatc caggcatgct tgtgttgaag ttcaagatga
1981 aattgcattt attcctaatg acgtatactt tgaaaaagat aaacagatgt tccacatcat
2041 tactggcccc aatatgggag gtaaatcaac atatattcga caaactgggg tgatagtact
2101 catggcccaa attgggtgtt ttgtgccatg tgagtcagca gaagtgtcca ttgtggactg
2161 catcttagcc cgagtagggg ctggtgacag tcaattgaaa ggagtctcca cgttcatggc
2221 tgaaatgttg gaaactgctt ctatcctcag gtctgcaacc aaagattcat taataatcat
2281 agatgaattg ggaagaggaa cttctaccta cgatggattt gggttagcat gggctatatc
2341 agaatacatt gcaacaaaga ttggtgcttt ttgcatgttt gcaacccatt ttcatgaact
2401 tactgccttg gccaatcaga taccaactgt taataatcta catgtcacag cactcaccac
2461 tgaagagacc ttaactatgc tttatcaggt gaagaaaggt gtctgtgatc aaagttttgg
2521 gattcatgtt gcagagcttg ctaatttccc taagcatgta atagagtgtg ctaaacagaa
2581 agccctggaa cttgaggagt ttcagtatat tggagaatcg caaggatatg atatcatgga
2641 accagcagca aagaagtgct atctggaaag agagcaaggt gaaaaaatta ttcaggagtt
2701 cctgtccaag gtgaaacaaa tgccctttac tgaaatgtca gaagaaaaca tcacaataaa
2761 gttaaaacag ctaaaagctg aagtaatagc aaagaataat agctttgtaa atgaaatcat
2821 ttcacgaata aaagttacta cgtgaaaaat cccagtaatg gaatgaaggt aatattgata
2881 agctattgtc tgtaatagtt ttatattgtt ttatattaa
(only exons)