Sequences producing significant alignments: Score (bits) E Value
ref|NP_002909.1| regulatory factor X, 1 (influences HLA cla... 1555 0.0
ref|NP_033081.1| regulatory factor (trans-acting) 1 [Mus mu... 1437 0.0
ref|NP_002910.1| regulatory factor X, 3 (influences HLA cla... 730 0.0
ref|NP_033082.1| regulatory factor (trans-acting) 2 [Mus mu... 699 0.0
ref|NP_000626.1| regulatory factor X, 2 (influences HLA cla... 687 0.0
gb|AAF54526.2| (AE003686) Rfx gene product [Drosophila mela... 479 e-134
emb|CAB63452.1| (AJ133103) RFX transcription factor [Drosop... 479 e-134
sp|Q09555|YQW1_CAEEL HYPOTHETICAL 88.3 KD PROTEIN F33H1.1 F... 217 5e-55
gb|AAF61564.1|AF226156_1 (AF226156) RFX-like transcription ... 217 5e-55
sp|P48381|RFX3_MOUSE DNA BINDING PROTEIN RFX3 >gi|1083294|p... 192 2e-47
pdb|1DP7|P Chain P, Cocrystal Structure Of Rfx-Dbd In Compl... 128 3e-28
gb|AAC62839.1| (AC005784) RFX2_HUMAN, [AA 620-723] [Homo sa... 108 2e-22
emb|CAB85619.1| (AJ243296) putative RFX transcription facto... 80 1e-13
ref|NP_002911.1| regulatory factor X, 4 (influences HLA cla... 79 3e-13
sp|P48383|SAK1_SCHPO SAK1 PROTEIN >gi|7493354|pir||T11650 s... 71 4e-11
gb|AAA67937.1| (U19978) RFX family DNA-binding protein [Sch... 68 4e-10
emb|CAB85587.1| (AJ132014) cephalosporin C regulator 1 [Acr... 59 2e-07
pir||S51421 hypothetical protein YLR176c - yeast (Saccharom... 57 9e-07
ref|NP_013277.1| DNA binding protein, homologous to mammali... 57 9e-07
gb|AAF54121.2| (AE003675) CG9727 gene product [Drosophila m... 53 1e-05
pir||S55315 mucin (clone PGM-2A) - pig >gi|2136504|pir||I47... 49 2e-04
pir||T34433 hypothetical protein K06A9.1a - Caenorhabditis ... 46 0.002
pir||T34434 hypothetical protein K06A9.1a - Caenorhabditis ... 46 0.002
ref|NP_012284.1| cell surface flocculin with structure simi... 45 0.005
ref|NP_000440.1| regulatory factor X, 5 [Homo sapiens] >gi|... 44 0.009
ref|NP_059091.1| regulatory factor (trans-acting) 5 [Mus mu... 43 0.012
pir||F75518 hypothetical protein - Deinococcus radiodurans ... 43 0.012
gb|AAB71465.1| (AC000098) EST gb|ATTS1136 comes from this g... 43 0.012
gb|AAB91441.1| (U80743) CAGH32 [Homo sapiens] 43 0.016
gb|AAF48345.1| (AE003495) CG11584 gene product [Drosophila ... 43 0.016
ref|NP_011528.1| putative integral membrane protein; Msb2p ... 43 0.021
dbj|BAB13413.1| (AB046807) KIAA1587 protein [Homo sapiens] 43 0.021
pir||T33369 hypothetical protein H02F09.3 - Caenorhabditis ... 43 0.021
pir||T45462 membrane glycoprotein [imported] - equine herpe... 41 0.046
pir||T33247 hypothetical protein H05O09.1 - Caenorhabditis ... 41 0.061
ref|XP_000707.1| hypothetical protein XP_000707 [Homo sapiens] 41 0.061
ref|NP_034869.1| lymphocyte antigen 64 [Mus musculus] >gi|1... 41 0.061
pir||T16509 hypothetical protein F59A6.3 - Caenorhabditis e... 41 0.079
gb|AAB03569.2| (U35622) EWS protein/E1A enhancer binding pr... 40 0.10
emb|CAA19845.2| (AL031028) /prediction=(method:"genscan",... 40 0.14
pir||S55316 mucin (clone PGM-2B) - pig >gi|915207|gb|AAC485... 40 0.14
ref|NP_041080.1| membrane glycoprotein [Equine herpesvirus ... 39 0.18
emb|CAB37867.1| (AJ133273) atrophin-1 [Hylobates lar] 39 0.23
gb|AAC51331.2| (U85962) CREB-binding protein [Homo sapiens] 39 0.23
gb|AAF70456.1|AF221952_1 (AF221952) mu-protocadherin [Rattu... 39 0.23
ref|NP_004371.1| CREB binding protein (Rubinstein-Taybi syn... 39 0.23
pir||S39162 transcription coactivator CREB-binding protein ... 39 0.23
gb|AAB01610.1| (L36831) transcription regulator [Mus musculus] 39 0.31
ref|NP_053733.1| Ewing sarcoma breakpoint region 1, isoform... 39 0.31
ref|NP_035508.1| transcriptional regulator, SIN3 yeast homo... 39 0.31
pir||A56068 co-repressor protein - mouse >gi|642617|gb|AAA6... 39 0.31
gb|AAC60129.1| (U43200) antifreeze glycopeptide AFGP polypr... 39 0.31
pir||I61713 co-repressor protein - mouse >gi|642619|gb|AAA6... 39 0.31
dbj|BAA36223.1| (D87895) chitinase [Aspergillus nidulans] 39 0.31
pir||JW0067 chitinase (EC 3.2.1.14) A - Emericella nidulans 38 0.40
sp|P45481|CBP_MOUSE CREB-BINDING PROTEIN >gi|481698|pir||S3... 38 0.40
prf||1923401A protein CBP [Mus musculus] 38 0.40
ref|NP_013101.1| Ylr001cp [Saccharomyces cerevisiae] >gi|21... 38 0.53
gb|AAC41377.1| (AF036382) MLL [Takifugu rubripes] 38 0.53
gb|AAG16733.1|AF258676_1 (AF258676) MUCDHL-FL [Homo sapiens] 38 0.53
gb|AAD43152.1|AC007504_7 (AC007504) Hypothetical Protein [A... 38 0.69
gb|AAG33495.1| (AF301909) mu-protocadherin [Homo sapiens] 38 0.69
ref|NP_060187.1| hypothetical protein FLJ20219 [Homo sapien... 38 0.69
emb|CAB37921.1| (AJ133274) atrophin-1 [Macaca fascicularis] 37 0.90
pir||A53577 ascites sialoglycoprotein 1 - rat (fragments) 37 0.90
pir||T02057 fructose-bisphosphate aldolase (EC 4.1.2.13) - ... 37 0.90
gb|AAB61358.1| (U62397) rhamnogalacturonan hydrolase [Botry... 37 0.90
gb|AAC36003.1| (AC002397) DRPLA [Mus musculus] 37 1.2
pir||S27920 nuclear antigen EBNA-3A - human herpesvirus 4 >... 37 1.2
dbj|BAB09176.1| (AB018113) gene_id:MFC19.15~pir||T04430~sim... 37 1.2
gb|AAF56853.1| (AE003768) CG11873 gene product [Drosophila ... 37 1.2
pir||PC4397 mucin 3 T10 - human (fragment) >gi|2454619|gb|A... 37 1.2
gb|AAG09037.1|AF254088_1 (AF254088) EWS/ZSG fusion protein ... 36 1.5
emb|CAB88653.1| (AL353822) hypothetical protein [Neurospora... 36 1.5
ref|NP_012143.1| (putative) invovled in control of DNA repl... 36 1.5
sp|Q06154|PM17_BOVIN MELANOCYTE PROTEIN PMEL 17 (RETINAL PI... 36 1.5
sp|P45384|IGA2_HAEIN IMMUNOGLOBULIN A1 PROTEASE PRECURSOR (... 36 1.5
pir||T39903 serine-rich protein - fission yeast (Schizosacc... 36 1.5
ref|NP_005234.1| Ewing sarcoma breakpoint region 1, isoform... 36 1.5
gb|AAG09036.1|AF254087_1 (AF254087) EWS/ZSG fusion protein ... 36 1.5
gb|AAF60629.1| (AC024790) Hypothetical protein Y47D7A.a [Ca... 36 1.5
gb|AAG16731.1| (AF258674) MUCDHL-FL [Homo sapiens] 36 1.5
gb|AAF60743.1| (AC006797) contains similarity to Pfam famil... 36 1.5
gb|AAG09035.1|AF254086_1 (AF254086) EWS/ZSG fusion protein ... 36 1.5
gb|AAA29652.1| (M34047) major merozoite surface antigen [Pl... 36 2.0
gb|AAC47557.1| (AF000606) insect intestinal mucin IIM22 [Tr... 36 2.0
dbj|BAA92544.1| (AB037727) KIAA1306 protein [Homo sapiens] 36 2.0
gb|AAF51744.1| (AE003594) CG7177 gene product [Drosophila m... 36 2.0
gb|AAF82185.1| (AF076776) helicase DOMINO A [Drosophila mel... 36 2.7
gb|AAF62173.1|AF247450_1 (AF247450) hyperpolarization-activ... 36 2.7
gb|AAG22189.1| (AE003453) CG9696 gene product [Drosophila m... 36 2.7
pir||T45463 membrane glycoprotein [imported] - equine herpe... 36 2.7
sp|Q90718|SRF_CHICK SERUM RESPONSE FACTOR (SRF) >gi|1245462... 36 2.7
gb|AAF13032.1|AF113616_1 (AF113616) intestinal mucin 3 [Hom... 36 2.7
sp|Q92777|SYN2_HUMAN SYNAPSIN II >gi|3386486|gb|AAC28368.1|... 35 3.5
pir||T18535 high molecular mass nuclear antigen - chicken (... 35 3.5
emb|CAB46680.1| (AJ243460) proteophosphoglycan [Leishmania ... 35 3.5
emb|CAB46679.1| (AJ243459) proteophosphoglycan [Leishmania ... 35 3.5
ref|XP_000928.1| similar to Ewing sarcoma breakpoint region... 35 3.5
gb|AAF56831.1| (AE003767) CG14522 gene product [Drosophila ... 35 4.6
Alignments
>ref|NP_002909.1| regulatory factor X, 1 (influences HLA class II expression);
Regulatory factor (trans-acting) 1 (influences HLA class
II [Homo sapiens]
sp|P22670|RFX1_HUMAN MHC CLASS II REGULATORY FACTOR RFX1 (RFX) (ENHANCER FACTOR C)
(EF-C)
pir||A35913 regulatory factor X - human
emb|CAA41730.1| (X58964) MHC class II regulatory factor RFX [Homo sapiens]
Length = 979
Score = 1555 bits (3983), Expect = 0.0
Identities = 800/926 (86%), Positives = 800/926 (86%)
Query: 54 YVTELXXXXXXXXXXXXXXXYVTELPAVPAPSQPTGAPTPSPAPQQYIVVTVSEGAMRAS 113
YVTEL YVTELPAVPAPSQPTGAPTPSPAPQQYIVVTVSEGAMRAS
Sbjct: 54 YVTELQSPQPQAQPPGGQKQYVTELPAVPAPSQPTGAPTPSPAPQQYIVVTVSEGAMRAS 113
Query: 114 ETVSEASPGSTASXXXXXXXXXXXXXXXXXXXXXXXSVQAKPGHVSPLQLTNIQVPQQAL 173
ETVSEASPGSTAS SVQAKPGHVSPLQLTNIQVPQQAL
Sbjct: 114 ETVSEASPGSTASQTGVPTQVVQQVQGTQQRLLVQTSVQAKPGHVSPLQLTNIQVPQQAL 173
Query: 174 PTQRLVVQSAAPGSKGGQVSLTVHGTQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQ 233
PTQRLVVQSAAPGSKGGQVSLTVHGTQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQ
Sbjct: 174 PTQRLVVQSAAPGSKGGQVSLTVHGTQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQ 233
Query: 234 VHGVQQSVPVTQERSVVQATPQAPKPGPVQPLTVQGLQPXXXXXXXXXXXXXXXXXXYSS 293
VHGVQQSVPVTQERSVVQATPQAPKPGPVQPLTVQGLQP YSS
Sbjct: 234 VHGVQQSVPVTQERSVVQATPQAPKPGPVQPLTVQGLQPVHVAQEVQQLQQVPVPHVYSS 293
Query: 294 QVQYVEGGDASYTASAIRSSTYSYPETPLYTQTASTSYYEAAGTATQVSTPATSQAVASS 353
QVQYVEGGDASYTASAIRSSTYSYPETPLYTQTASTSYYEAAGTATQVSTPATSQAVASS
Sbjct: 294 QVQYVEGGDASYTASAIRSSTYSYPETPLYTQTASTSYYEAAGTATQVSTPATSQAVASS 353
Query: 354 GSMPMYVSGSQVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXY 413
GSMPMYVSGSQVV Y
Sbjct: 354 GSMPMYVSGSQVVASSASTGAGASNSSGGGGSGGGGGGGGGGGGGGSGSTGGGGSGAGTY 413
Query: 414 VIQGGYMLGSASQSYSHTTRASPATVQWLLDNYETAEGVSLPRSTLYCHYLLHCQEQKLE 473
VIQGGYMLGSASQSYSHTTRASPATVQWLLDNYETAEGVSLPRSTLYCHYLLHCQEQKLE
Sbjct: 414 VIQGGYMLGSASQSYSHTTRASPATVQWLLDNYETAEGVSLPRSTLYCHYLLHCQEQKLE 473
Query: 474 PVNAASFGKLIRSVFMXXXXXXXXXXXNSKYHYYGLRIKASSPLLRLMEDQQHMAMRGQP 533
PVNAASFGKLIRSVFM NSKYHYYGLRIKASSPLLRLMEDQQHMAMRGQP
Sbjct: 474 PVNAASFGKLIRSVFMGLRTRRLGTRGNSKYHYYGLRIKASSPLLRLMEDQQHMAMRGQP 533
Query: 534 FSQKQRLKPIQKMEGMTNGVAVGQQPSTGLSDISAQVQQYQQFLDASRSLPDFTELDLQG 593
FSQKQRLKPIQKMEGMTNGVAVGQQPSTGLSDISAQVQQYQQFLDASRSLPDFTELDLQG
Sbjct: 534 FSQKQRLKPIQKMEGMTNGVAVGQQPSTGLSDISAQVQQYQQFLDASRSLPDFTELDLQG 593
Query: 594 KVLPEGVGPGDIKAFQVLYREHCEAIVDVMVNLQFTLVETLWKTFWRYNLSQPSEAPPLA 653
KVLPEGVGPGDIKAFQVLYREHCEAIVDVMVNLQFTLVETLWKTFWRYNLSQPSEAPPLA
Sbjct: 594 KVLPEGVGPGDIKAFQVLYREHCEAIVDVMVNLQFTLVETLWKTFWRYNLSQPSEAPPLA 653
Query: 654 VHDEAEKRLPKAILVLLSKFEPVLQWTKHCDNVLYQGLVEILIPDVLRPIPSALTQAIRN 713
VHDEAEKRLPKAILVLLSKFEPVLQWTKHCDNVLYQGLVEILIPDVLRPIPSALTQAIRN
Sbjct: 654 VHDEAEKRLPKAILVLLSKFEPVLQWTKHCDNVLYQGLVEILIPDVLRPIPSALTQAIRN 713
Query: 714 FAKSLESWLTHAMVNIPEEMLRVKVAAAGAFAQTLRRYTSLNHLAQAARAVLQNTAQINQ 773
FAKSLESWLTHAMVNIPEEMLRVKVAAAGAFAQTLRRYTSLNHLAQAARAVLQNTAQINQ
Sbjct: 714 FAKSLESWLTHAMVNIPEEMLRVKVAAAGAFAQTLRRYTSLNHLAQAARAVLQNTAQINQ 773
Query: 774 MLSDLNRVDFANVQEQASWVCRCEDRVVQRLEQDFKVTLQQQNSLEQWAAWLDGVVSQVL 833
MLSDLNRVDFANVQEQASWVCRCEDRVVQRLEQDFKVTLQQQNSLEQWAAWLDGVVSQVL
Sbjct: 774 MLSDLNRVDFANVQEQASWVCRCEDRVVQRLEQDFKVTLQQQNSLEQWAAWLDGVVSQVL 833
Query: 834 KPYQGSAGFPKAAKLFLLKWSFYSSMVIRDLTLRSAASFGSFHLIRLLYDEYMYYLIEHR 893
KPYQGSAGFPKAAKLFLLKWSFYSSMVIRDLTLRSAASFGSFHLIRLLYDEYMYYLIEHR
Sbjct: 834 KPYQGSAGFPKAAKLFLLKWSFYSSMVIRDLTLRSAASFGSFHLIRLLYDEYMYYLIEHR 893
Query: 894 VAQAKGETPIAVMGEFANLATSLNPLDPDKXXXXXXXXXXXXXLPQDISLAAGGESPALG 953
VAQAKGETPIAVMGEFANLATSLNPLDPDK LPQDISLAAGGESPALG
Sbjct: 894 VAQAKGETPIAVMGEFANLATSLNPLDPDKDEEEEEEEESEDELPQDISLAAGGESPALG 953
Query: 954 PETLEPPAKLARTDARGLFVQALPSS 979
PETLEPPAKLARTDARGLFVQALPSS
Sbjct: 954 PETLEPPAKLARTDARGLFVQALPSS 979
>ref|NP_033081.1| regulatory factor (trans-acting) 1 [Mus musculus]
sp|P48377|RFX1_MOUSE DNA BINDING PROTEIN RFX1
pir||A55926 DNA binding protein RFX1 - mouse
emb|CAA53702.1| (X76088) DNA binding protein RFX1 [Mus musculus]
Length = 963
Score = 1437 bits (3680), Expect = 0.0
Identities = 749/926 (80%), Positives = 763/926 (81%), Gaps = 10/926 (1%)
Query: 54 YVTELXXXXXXXXXXXXXXXYVTELPAVPAPSQPTGAPTPSPAPQQYIVVTVSEGAMRAS 113
YVTEL YV ELPA PAPSQP P PSP QQYIVVTVSEGAMRAS
Sbjct: 48 YVTELQSPQPQTQPPGSQKQYVAELPAAPAPSQPA-TPAPSPVAQQYIVVTVSEGAMRAS 106
Query: 114 ETVSEASPGSTASXXXXXXXXXXXXXXXXXXXXXXXSVQAKPGHVSPLQLTNIQVPQQAL 173
ETVSEASP STAS SVQAKPGHVSPLQLTNIQVPQQA+
Sbjct: 107 ETVSEASPSSTASQTGVPTQVVQQVQGTQQRLLVQASVQAKPGHVSPLQLTNIQVPQQAI 166
Query: 174 PTQRLVVQSAAPGSKGGQVSLTVHGTQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQ 233
PT LVVQS APG+K GQVSLTVH QQVHS PE+SPVQAN+S+SKTAG P TV QQLQ
Sbjct: 167 PTHDLVVQSPAPGTKSGQVSLTVHSAQQVHSAPERSPVQANNSTSKTAGTPAATV-QQLQ 225
Query: 234 VHGVQQSVPVTQERSVVQATPQAPKPGPVQPLTVQGLQPXXXXXXXXXXXXXXXXXXYSS 293
VH VQQSVPVTQERSVVQATPQ K GPVQ LTVQGLQP YSS
Sbjct: 226 VHSVQQSVPVTQERSVVQATPQT-KAGPVQQLTVQGLQPVHVAQEVQQLPQVPVPHVYSS 284
Query: 294 QVQYVEGGDASYTASAIRSSTYSYPETPLYTQTASTSYYEAAGTATQVSTPATSQAVASS 353
QVQYVEGGDASYTASAIRSSTY YPETP+YTQTA TSYYEA+GTA QVSTPATSQ VASS
Sbjct: 285 QVQYVEGGDASYTASAIRSSTYQYPETPIYTQTAGTSYYEASGTAAQVSTPATSQTVASS 344
Query: 354 GSMPMYVSGSQVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXY 413
GS+PMYVSGS +V Y
Sbjct: 345 GSVPMYVSGSPIVASSSSSEAGASNSSVGAGGNGGGGSSGGGSGGSSGSGAGT------Y 398
Query: 414 VIQGGYMLGSASQSYSHTTRASPATVQWLLDNYETAEGVSLPRSTLYCHYLLHCQEQKLE 473
VIQGGYMLG+ASQSYSHTTRASPATVQWLLDNYETAEGVSLPRSTLYCHYLLHCQEQKLE
Sbjct: 399 VIQGGYMLGNASQSYSHTTRASPATVQWLLDNYETAEGVSLPRSTLYCHYLLHCQEQKLE 458
Query: 474 PVNAASFGKLIRSVFMXXXXXXXXXXXNSKYHYYGLRIKASSPLLRLMEDQQHMAMRGQP 533
PVNAASFGKLIRSVFM NSKYHYYGLRIKASSPLLRLMEDQQHMAMRGQP
Sbjct: 459 PVNAASFGKLIRSVFMGLRTRRLGTRGNSKYHYYGLRIKASSPLLRLMEDQQHMAMRGQP 518
Query: 534 FSQKQRLKPIQKMEGMTNGVAVGQQPSTGLSDISAQVQQYQQFLDASRSLPDFTELDLQG 593
FSQKQRLKPIQKMEG+ NGVAVGQQ STGLSDISAQVQQYQQFLDASRSLPDF ELDLQG
Sbjct: 519 FSQKQRLKPIQKMEGVANGVAVGQQ-STGLSDISAQVQQYQQFLDASRSLPDFAELDLQG 577
Query: 594 KVLPEGVGPGDIKAFQVLYREHCEAIVDVMVNLQFTLVETLWKTFWRYNLSQPSEAPPLA 653
KVLPEGVGPGDIKAFQVLYREHCEAIVDVMVNLQFTLVETLWKTFWRYNLSQPSEAPPLA
Sbjct: 578 KVLPEGVGPGDIKAFQVLYREHCEAIVDVMVNLQFTLVETLWKTFWRYNLSQPSEAPPLA 637
Query: 654 VHDEAEKRLPKAILVLLSKFEPVLQWTKHCDNVLYQGLVEILIPDVLRPIPSALTQAIRN 713
VHDEAEKRLP+A LVLLSKF+PVLQWTKHCDNVLYQGLVEILIPDVLRPIPSALTQAIRN
Sbjct: 638 VHDEAEKRLPRASLVLLSKFQPVLQWTKHCDNVLYQGLVEILIPDVLRPIPSALTQAIRN 697
Query: 714 FAKSLESWLTHAMVNIPEEMLRVKVAAAGAFAQTLRRYTSLNHLAQAARAVLQNTAQINQ 773
FAKSLESWLTHAMVNIPEEMLRVKVAAAGAFAQTLRRYTSLNHLAQAARAVLQNTAQINQ
Sbjct: 698 FAKSLESWLTHAMVNIPEEMLRVKVAAAGAFAQTLRRYTSLNHLAQAARAVLQNTAQINQ 757
Query: 774 MLSDLNRVDFANVQEQASWVCRCEDRVVQRLEQDFKVTLQQQNSLEQWAAWLDGVVSQVL 833
MLSDLNRVDFANVQEQASWVCRCEDRVVQRLEQDFKVTLQQQNSLEQWAAWLDGVVSQVL
Sbjct: 758 MLSDLNRVDFANVQEQASWVCRCEDRVVQRLEQDFKVTLQQQNSLEQWAAWLDGVVSQVL 817
Query: 834 KPYQGSAGFPKAAKLFLLKWSFYSSMVIRDLTLRSAASFGSFHLIRLLYDEYMYYLIEHR 893
KPYQGS+GFPKAAKLFLLKWSFYSSMVIRDLTLRSAASFGSFHLIRLLYDEYMYYLIEHR
Sbjct: 818 KPYQGSSGFPKAAKLFLLKWSFYSSMVIRDLTLRSAASFGSFHLIRLLYDEYMYYLIEHR 877
Query: 894 VAQAKGETPIAVMGEFANLATSLNPLDPDKXXXXXXXXXXXXXLPQDISLAAGGESPALG 953
VAQAKGETPIAVMGEFANLATSLNPLDPDK LPQDISLAAG ESPALG
Sbjct: 878 VAQAKGETPIAVMGEFANLATSLNPLDPDKDEEEEEEEESEDELPQDISLAAGSESPALG 937
Query: 954 PETLEPPAKLARTDARGLFVQALPSS 979
PE LEPPAKLARTD RGLFVQALPSS
Sbjct: 938 PEALEPPAKLARTDTRGLFVQALPSS 963
>ref|NP_002910.1| regulatory factor X, 3 (influences HLA class II expression) [Homo
sapiens]
sp|P48380|RFX3_HUMAN DNA BINDING PROTEIN RFX3
pir||D55926 DNA binding protein RFX3 - human
emb|CAA53706.1| (X76092) DNA binding protein RFX3 [Homo sapiens]
Length = 707
Score = 730 bits (1865), Expect = 0.0
Identities = 398/722 (55%), Positives = 480/722 (66%), Gaps = 98/722 (13%)
Query: 217 SSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVVQATPQAPKPGPVQPLTVQGLQPXXXX 276
+S+T TV Q V Q +VP VVQ P + VQ TVQ +Q
Sbjct: 3 TSETGSDTGSTVTLQTSVAS-QAAVPT----QVVQQVPVQQQVQQVQ--TVQQVQ----- 50
Query: 277 XXXXXXXXXXXXXXYSSQVQYVEGGDASYTASAIRSSTYSYPETPLYTQTASTSYYEAAG 336
Y +QVQYVEG D YT AIR++TY Y ET +Y+Q +Y++ G
Sbjct: 51 ------------HVYPAQVQYVEGSDTVYTNGAIRTTTYPYTETQMYSQNTGGNYFDTQG 98
Query: 337 TATQVSTPATSQAVASSGSMPMYVSGSQVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 396
++ QV+T +S ++ +G + M V+G Q++
Sbjct: 99 SSAQVTTVVSSHSMVGTGGIQMGVTGGQLISSSGGT------------------------ 134
Query: 397 XXXXXXXXXXXXXXXXYVIQGGYMLGSASQSYSHTTRASPATV----------------- 439
Y+I G + ++ S +HTTRASPAT+
Sbjct: 135 ----------------YLI--GNSMENSGHSVTHTTRASPATIEMAIETLQKSDGLSTHR 176
Query: 440 --------QWLLDNYETAEGVSLPRSTLYCHYLLHCQEQKLEPVNAASFGKLIRSVFMXX 491
QWLLDNYETAEGVSLPRSTLY HYL HCQE KL+PVNAASFGKLIRS+FM
Sbjct: 177 SSLLNSHLQWLLDNYETAEGVSLPRSTLYNHYLRHCQEHKLDPVNAASFGKLIRSIFMGL 236
Query: 492 XXXXXXXXXNSKYHYYGLRIKASSPLLRLMEDQQHMAMRGQPFSQKQRLKPIQKMEGMTN 551
NSKYHYYG+R+K SPL RL ED Q+MAMR QP QKQR KP+QK++G+ +
Sbjct: 237 RTRRLGTRGNSKYHYYGIRVKPDSPLNRLQEDMQYMAMRQQPMQQKQRYKPMQKVDGVAD 296
Query: 552 G-VAVGQQPSTGLSD-ISAQVQQYQQFLDASRSLPDFTELDLQGKVLPEGVGPGDIKAFQ 609
G GQQ T + + AQ Q +QQFLDASR+LP+F E+++ LP+G DIK+ Q
Sbjct: 297 GFTGSGQQTGTSVEQTVIAQSQHHQQFLDASRALPEFGEVEISS--LPDGTTFEDIKSLQ 354
Query: 610 VLYREHCEAIVDVMVNLQFTLVETLWKTFWRYNLSQPSEAPPLAVHD---EAEKRLPKAI 666
LYREHCEAI+DV+VNLQF+L+E LW+TFWRY+ S P++ + E E RLPKA
Sbjct: 355 SLYREHCEAILDVVVNLQFSLIEKLWQTFWRYSPSTPTDGTTITESSNLSEIESRLPKAK 414
Query: 667 LVLLSKFEPVLQWTKHCDNVLYQGLVEILIPDVLRPIPSALTQAIRNFAKSLESWLTHAM 726
L+ L K E +L+W +CD+ +YQ LVEILIPDVLRPIPSALTQAIRNFAKSLE WL++AM
Sbjct: 415 LITLCKHESILKWMCNCDHGMYQALVEILIPDVLRPIPSALTQAIRNFAKSLEGWLSNAM 474
Query: 727 VNIPEEMLRVKVAAAGAFAQTLRRYTSLNHLAQAARAVLQNTAQINQMLSDLNRVDFANV 786
NIP+ M++ KVAA AFAQTLRRYTSLNHLAQAARAVLQNT+QINQMLSDLNRVDFANV
Sbjct: 475 NNIPQRMIQTKVAAVSAFAQTLRRYTSLNHLAQAARAVLQNTSQINQMLSDLNRVDFANV 534
Query: 787 QEQASWVCRCEDRVVQRLEQDFKVTLQQQNSLEQWAAWLDGVVSQVLKPYQGSAGFPKAA 846
QEQASWVC+C+D +VQRLE DFK+TLQQQ++LEQWAAWLD V+ Q LKPY+G FPKAA
Sbjct: 535 QEQASWVCQCDDNMVQRLETDFKMTLQQQSTLEQWAAWLDNVMMQALKPYEGRPSFPKAA 594
Query: 847 KLFLLKWSFYSSMVIRDLTLRSAASFGSFHLIRLLYDEYMYYLIEHRVAQAKGETPIAVM 906
+ FLLKWSFYSSMVIRDLTLRSAASFGSFHLIRLLYDEYM+YL+EHRVAQA GETPIAVM
Sbjct: 595 RQFLLKWSFYSSMVIRDLTLRSAASFGSFHLIRLLYDEYMFYLVEHRVAQATGETPIAVM 654
Query: 907 GE 908
GE
Sbjct: 655 GE 656
>ref|NP_033082.1| regulatory factor (trans-acting) 2 [Mus musculus]
sp|P48379|RFX2_MOUSE DNA BINDING PROTEIN RFX2
pir||C55926 DNA binding protein RFX2 - mouse
emb|CAA53703.1| (X76089) DNA binding protein RFX2 [Mus musculus]
Length = 692
Score = 699 bits (1785), Expect = 0.0
Identities = 379/700 (54%), Positives = 459/700 (65%), Gaps = 60/700 (8%)
Query: 229 PQQLQVHGVQQSVPVTQERSVVQATPQAPKPGPVQPLTVQGLQPXXXXXXXXXXXXXXXX 288
P + + Q +P + +R +VQA PK P+Q LT+ +QP
Sbjct: 11 PASVALRPAAQPMPASPQRVLVQAAGSTPKGTPMQTLTLPRVQPVPPQVQHV-------- 62
Query: 289 XXYSSQVQYVEGGDASYTASAIRSSTYSYPETPLYTQTASTSYYEAAGTATQVSTPATSQ 348
Y +QVQYVEGGDA Y AIR++ P+ LY +++ SY+E G TQV+ A+S
Sbjct: 63 --YPAQVQYVEGGDAVYANGAIRAAYAYNPDPQLYAPSSAASYFETPG-GTQVTVAASSP 119
Query: 349 -AVASSG--SMPMYVSGSQVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 405
AV S G + M VSG+ +V
Sbjct: 120 PAVPSHGMVGITMDVSGTPIVSGAGA---------------------------------- 145
Query: 406 XXXXXXXYVIQGGYMLGSASQSYSHTTRASPATVQWLLDNYETAEGVSLPRSTLYCHYLL 465
Y+I GG + S +HT R+SPAT+QWLLDNYETAEGVSLPRS+LY HYL
Sbjct: 146 -------YLIHGG--MDGTRHSLAHTARSSPATLQWLLDNYETAEGVSLPRSSLYNHYLR 196
Query: 466 HCQEQKLEPVNAASFGKLIRSVFMXXXXXXXXXXXNSKYHYYGLRIKASSPLLRLMEDQQ 525
HCQE KLEPVNAASFGKLIRSVFM NSKYHYYG+R+K SPL RL ED Q
Sbjct: 197 HCQEHKLEPVNAASFGKLIRSVFMGLRTRRLGTRGNSKYHYYGIRLKPDSPLNRLQEDTQ 256
Query: 526 HMAMRGQPFSQKQRLKPIQKMEGMTNGVAVGQQPSTGLSDISAQVQQYQQFLDASRSLPD 585
+MAMR QP QK R +P QK + + +G A ++ Q Q +QQ++D S P+
Sbjct: 257 YMAMRQQPTHQKPRYRPAQKSDSLGDGSAHSNMHGMPDQAMATQGQHHQQYIDVSHVFPE 316
Query: 586 FTELDLQGKVLPEGVGPGDIKAFQVLYREHCEAIVDVMVNLQFTLVETLWKTFWRYNLSQ 645
F DL +L E V D+KA Q++YR HCEA +DV++NLQF +E LW +FW +
Sbjct: 317 FPAPDLGSTLLQESVTLHDVKALQLVYRRHCEATLDVVMNLQFQYIEKLWLSFWNCKATS 376
Query: 646 PSEAPPLAVHDEAEK--RLPKAILVLLSKFEPVLQWTKHCDNVLYQGLVEILIPDVLRPI 703
L DE + LPK L+ L K EP+LQW + CD++LYQ LVE LIPDVLRP+
Sbjct: 377 SDSCASLPASDEDPEVTLLPKEKLISLCKCEPILQWMRSCDHILYQTLVETLIPDVLRPV 436
Query: 704 PSALTQAIRNFAKSLESWLTHAMVNIPEEMLRVKVAAAGAFAQTLRRYTSLNHLAQAARA 763
PS+LTQAIRNFAKSLE WL +AM P+++++ KV AFAQTLRRYTSLNHLAQAARA
Sbjct: 437 PSSLTQAIRNFAKSLEGWLINAMSGFPQQVIQTKVGVVSAFAQTLRRYTSLNHLAQAARA 496
Query: 764 VLQNTAQINQMLSDLNRVDFANVQEQASWVCRCEDRVVQRLEQDFKVTLQQQNSLEQWAA 823
VLQNT+QINQMLSDLNRVDFANVQEQASWVC+CE+ +VQRLE DFKVTLQQQ+SL+QWA+
Sbjct: 497 VLQNTSQINQMLSDLNRVDFANVQEQASWVCQCEESLVQRLEHDFKVTLQQQSSLDQWAS 556
Query: 824 WLDGVVSQVLKPYQGSAGFPKAAKLFLLKWSFYSSMVIRDLTLRSAASFGSFHLIRLLYD 883
WLD VV+QVLK + GS FPKAA+ FLLKWSFYSSMVIRDLTLRSAASFGSFHLIRLLYD
Sbjct: 557 WLDNVVTQVLKQHSGSPSFPKAARQFLLKWSFYSSMVIRDLTLRSAASFGSFHLIRLLYD 616
Query: 884 EYMYYLIEHRVAQAKGETPIAVMGEFANLAT-SLNPLDPD 922
EYM+YL+EHRVAQA GETPIAVMGEF +LA+ SL LD +
Sbjct: 617 EYMFYLVEHRVAQATGETPIAVMGEFNDLASLSLTLLDKE 656
>ref|NP_000626.1| regulatory factor X, 2 (influences HLA class II expression);
Regulatory factor (trans-acting) 2 (influences HLA class
II [Homo sapiens]
sp|P48378|RFX2_HUMAN DNA BINDING PROTEIN RFX2
pir||B55926 DNA binding protein RFX2 - human
emb|CAA53705.1| (X76091) DNA binding protein RFX2 [Homo sapiens]
Length = 723
Score = 687 bits (1753), Expect = 0.0
Identities = 378/713 (53%), Positives = 460/713 (64%), Gaps = 82/713 (11%)
Query: 241 VPVTQERSVVQATPQAPKPGPVQPLTVQGLQPXXXXXXXXXXXXXXXXXXYSSQVQYVEG 300
VP + +R +VQA PK +QP+++ +Q Y +QVQYVEG
Sbjct: 25 VPASPQRVLVQAASSNPKGSQMQPISLPRVQQVPQQVQPVQHV-------YPAQVQYVEG 77
Query: 301 GDASYTASAIRSSTYSYPETPLYTQTASTSYYEAAGTATQVSTPATSQAVASSGSM---P 357
GDA YT AIR++ PE +Y +++ SY+EA G A QV+ A+S S SM
Sbjct: 78 GDAVYTNGAIRTAYTYNPEPQMYAPSSTASYFEAPGGA-QVTVAASSPPAVPSHSMVGIT 136
Query: 358 MYVSGSQVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYVIQG 417
M V GS +V Y+I G
Sbjct: 137 MDVGGSPIVSSAGA-----------------------------------------YLIHG 155
Query: 418 GYMLGSASQSYSHTTRASPATV-------------------------QWLLDNYETAEGV 452
G + S S +HT+R+SPAT+ QWLLDNYETAEGV
Sbjct: 156 G--MDSTRHSLAHTSRSSPATLEMAIENLQKSEGITSHKSGLLNSHLQWLLDNYETAEGV 213
Query: 453 SLPRSTLYCHYLLHCQEQKLEPVNAASFGKLIRSVFMXXXXXXXXXXXNSKYHYYGLRIK 512
SLPRS+LY HYL HCQE KL+PVNAASFGKLIRSVFM NSKYHYYG+R+K
Sbjct: 214 SLPRSSLYNHYLRHCQEHKLDPVNAASFGKLIRSVFMGLRTRRLGTRGNSKYHYYGIRLK 273
Query: 513 ASSPLLRLMEDQQHMAMRGQPFSQKQRLKPIQKMEGMTNGVAVGQQPSTGLSDISAQVQQ 572
SPL RL ED Q+MAMR QP QK R +P QK + + + + ST ++ Q Q
Sbjct: 274 PDSPLNRLQEDTQYMAMRQQPMHQKPRYRPAQKTDSLGDSGSHSGLHSTPEQTMAVQSQH 333
Query: 573 YQQFLDASRSLPDFTELDLQGKVLPEGVGPGDIKAFQVLYREHCEAIVDVMVNLQFTLVE 632
+QQ++D S P+F DL +L +GV D+KA Q++YR HCEA VDV++NLQF +E
Sbjct: 334 HQQYIDVSHVFPEFPAPDLGSFLLQDGVTLHDVKALQLVYRRHCEATVDVVMNLQFHYIE 393
Query: 633 TLWKTFWRYNLSQPSEAPPLAVHDEAEKR--LPKAILVLLSKFEPVLQWTKHCDNVLYQG 690
LW +FW S L DE + LPK L+ L + +P+L+W + CD++LYQ
Sbjct: 394 KLWLSFWNSKASSSDGPTSLPASDEDPEGAVLPKDKLISLCQCDPILRWMRSCDHILYQA 453
Query: 691 LVEILIPDVLRPIPSALTQAIRNFAKSLESWLTHAMVNIPEEMLRVKVAAAGAFAQTLRR 750
LVEILIPDVLRP+PS LTQAIRNFAKSLE WLT+AM + P+++++ KV AFAQTLRR
Sbjct: 454 LVEILIPDVLRPVPSTLTQAIRNFAKSLEGWLTNAMSDFPQQVIQTKVGVVSAFAQTLRR 513
Query: 751 YTSLNHLAQAARAVLQNTAQINQMLSDLNRVDFANVQEQASWVCRCEDRVVQRLEQDFKV 810
YTSLNHLAQAARAVLQNT+QINQMLSDLNRVDFANVQEQASWVC+CE+ VVQRLEQDFK+
Sbjct: 514 YTSLNHLAQAARAVLQNTSQINQMLSDLNRVDFANVQEQASWVCQCEESVVQRLEQDFKL 573
Query: 811 TLQQQNSLEQWAAWLDGVVSQVLKPYQGSAGFPKAAKLFLLKWSFYSSMVIRDLTLRSAA 870
TLQQQ+SL+QWA+WLD VV+QVLK + GS FPKAA+ FLLKWSFYSSMVIRDLTLRSAA
Sbjct: 574 TLQQQSSLDQWASWLDSVVTQVLKQHAGSPSFPKAARQFLLKWSFYSSMVIRDLTLRSAA 633
Query: 871 SFGSFHLIRLLYDEYMYYLIEHRVAQAKGETPIAVMGEFANLAT-SLNPLDPD 922
SFGSFHLIRLLYDEYM+YL+EHRVA+A GETPIAVMGEF +LA+ SL LD D
Sbjct: 634 SFGSFHLIRLLYDEYMFYLVEHRVAEATGETPIAVMGEFNDLASLSLTLLDKD 686
>gb|AAF54526.2| (AE003686) Rfx gene product [Drosophila melanogaster]
Length = 897
Score = 479 bits (1219), Expect = e-134
Identities = 292/720 (40%), Positives = 384/720 (52%), Gaps = 37/720 (5%)
Query: 215 SSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVVQATPQAPKPGPVQPLTVQGLQPXX 274
+SSS T G T T P V S+P+ + V V +Q
Sbjct: 143 NSSSDTMGTITTTEPNGTTV---THSIPIHSMADLAAIKDGVDLAQQVANGQVTVVQTTE 199
Query: 275 XXXXXXXXXXXXXXXXYSSQVQYVEGGDASYTASAIRSSTYSYPETPLYTQTASTSYYEA 334
+ QVQYV+ + ++ S+ TY + Y T+YY
Sbjct: 200 DDDGTPFITVTVSGQEQNYQVQYVDS-ELYHSNSSQTQMTYPFCPVGDYQGNGQTAYYST 258
Query: 335 AGTATQVSTPATSQAVASSGSMPMYVSGSQVVXXXXXXXXXXXXXXXXXXXXXXXXXXXX 394
G S+ S A S ++P V + +
Sbjct: 259 TGQYGTTSSAGGSSNGAHSTTLPYLVPVEEGILLNGSAHSLSQSQSQSHGRDSPHSLTEV 318
Query: 395 XXXXXXXXXXXXXXXXXXYVIQGGYMLG---------SASQSYSHTTRASPATVQWLLDN 445
G LG S + + + + AT++WL N
Sbjct: 319 AYIQEAQSTPQTPTSTTTTHSASGGSLGTGGGGASPDSDQSALGSSNKIASATIKWLSRN 378
Query: 446 YETAEGVSLPRSTLYCHYLLHCQEQKLEPVNAASFGKLIRSVFMXXXXXXXXXXXNSKYH 505
YETA+GVSLPRSTLY HY+ HC E KLEPVNAASFGKLIRSVF NSKYH
Sbjct: 379 YETADGVSLPRSTLYNHYMQHCSEHKLEPVNAASFGKLIRSVFSGLRTRRLGTRGNSKYH 438
Query: 506 YYGLRIKASSPL-LRLMEDQQHMAMRGQPFSQKQRLKPIQKMEGMTNGVAV--------- 555
YYG+RIK S L + M+D+Q +A P S M +T+ A
Sbjct: 439 YYGIRIKPGSLLNSQAMDDKQMLAAGYGPSSDGTGGPGSGPMVSVTSSTAGQLTGSNGLG 498
Query: 556 ---GQQPSTGLSDISAQVQQYQQFL----DASRSLPDFTELDLQGKVLPEGVGPGDIKAF 608
GQ+ S G + + + Y+ + D + +LP F ++L E + D+ F
Sbjct: 499 GGHGQRHSNGTKKHTFKPETYEACIQYIGDGTSALPSFPPIELNHSFNSE-LTLEDVDTF 557
Query: 609 QVLYREHCEAIVDVMVNLQFTLVETLWKTFWRYNLSQPSEAPPLAVHDEAEKRLPKAILV 668
+ LYREHCE+ +D ++NL+F VE L + FWR + + + E EK L K L
Sbjct: 558 RGLYREHCESFLDAVLNLEFNTVEFLLRDFWRASDNNNLD------ECEEEKYLSKTKLY 611
Query: 669 LLSKFEPVLQWTKHCDNVLYQGLVEILIPDVLRPIPSALTQAIRNFAKSLESWLTHAMVN 728
LL V ++ + D YQ V+++IPDVLR IP+ALTQAIRNFAK+LE WL +M+
Sbjct: 612 LLCHCAEVQKFVREVDYQFYQNTVDVIIPDVLRSIPNALTQAIRNFAKNLEIWLCESMLG 671
Query: 729 IPEEMLRVKVAAAGAFAQTLRRYTSLNHLAQAARAVLQNTAQINQMLSDLNRVDFANVQE 788
+PE++ ++K +A AF QTLRRYTSLNHLAQAARAVLQN AQI+QMLSDLNRVDF NVQE
Sbjct: 672 VPEQLAQIKTSAVSAFCQTLRRYTSLNHLAQAARAVLQNGAQISQMLSDLNRVDFHNVQE 731
Query: 789 QASWVCRCEDRVVQRLEQDFKVTLQQQNSLEQWAAWLDGVVSQVLKPYQGSAGFPKAAKL 848
QA+WV +C VVQRLE DFK LQQQ+SLEQWA+WL VV ++ Y G + +AA+
Sbjct: 732 QAAWVSQCAPAVVQRLESDFKAALQQQSSLEQWASWLQLVVESAMEEYNGKPTYARAARQ 791
Query: 849 FLLKWSFYSSMVIRDLTLRSAASFGSFHLIRLLYDEYMYYLIEHRVAQAKGETPIAVMGE 908
FLLKWSFYSSM+IRDLTLRSA+SFGSFHLIRLL+DEYM+YL+EH++A+A+ +T IAV+ E
Sbjct: 792 FLLKWSFYSSMIIRDLTLRSASSFGSFHLIRLLFDEYMFYLVEHKIAEAQDKTAIAVICE 851
>emb|CAB63452.1| (AJ133103) RFX transcription factor [Drosophila melanogaster]
Length = 897
Score = 479 bits (1219), Expect = e-134
Identities = 293/720 (40%), Positives = 385/720 (52%), Gaps = 37/720 (5%)
Query: 215 SSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVVQATPQAPKPGPVQPLTVQGLQPXX 274
+SSS T G T T P V S+P+ + V V +Q
Sbjct: 143 NSSSDTMGTITTTEPNGTTV---THSIPIHSMADLAAIKDGVDLAQQVANGQVTVVQTTE 199
Query: 275 XXXXXXXXXXXXXXXXYSSQVQYVEGGDASYTASAIRSSTYSYPETPLYTQTASTSYYEA 334
+ QVQYV+ + ++ S+ TY + Y T+YY
Sbjct: 200 DDDGTPFITVTVSGQEQNYQVQYVDS-ELYHSNSSQTQMTYPFCPVGDYQGNGQTAYYST 258
Query: 335 AGTATQVSTPATSQAVASSGSMPMYVSGSQVVXXXXXXXXXXXXXXXXXXXXXXXXXXXX 394
G S+ S A S ++P V + +
Sbjct: 259 TGQYGTTSSAGGSSNGAHSTTLPYLVPVEEGILLNGSAHSLSQSQSQSHGRDSPHSLTEV 318
Query: 395 XXXXXXXXXXXXXXXXXXYVIQGGYMLG---------SASQSYSHTTRASPATVQWLLDN 445
G LG S + S + + + AT++WL N
Sbjct: 319 AYIQEAQSTPQTPTSTTTTHSASGGSLGTGGGGASPDSDQSALSSSNKIASATIKWLSRN 378
Query: 446 YETAEGVSLPRSTLYCHYLLHCQEQKLEPVNAASFGKLIRSVFMXXXXXXXXXXXNSKYH 505
YETA+GVSLPRSTLY HY+ HC E KLEPVNAASFGKLIRSVF NSKYH
Sbjct: 379 YETADGVSLPRSTLYNHYMHHCSEHKLEPVNAASFGKLIRSVFSGLRTRRLGTRGNSKYH 438
Query: 506 YYGLRIKASSPL-LRLMEDQQHMAMRGQPFSQKQRLKPIQKMEGMTNGVAV--------- 555
YYG+RIK S L + M+D+Q +A P S M +T+ A
Sbjct: 439 YYGIRIKPGSLLNSQAMDDKQMLAAGYGPSSDGTGGPGSGPMVSVTSSTAGQLTGSNGLG 498
Query: 556 ---GQQPSTGLSDISAQVQQYQQFL----DASRSLPDFTELDLQGKVLPEGVGPGDIKAF 608
GQ+ S G + + + Y+ + D + +LP F ++L E + D+ F
Sbjct: 499 GGHGQRHSNGTKKHTFKPETYEACIQYIGDGTSALPSFPPIELNHSFNSE-LTLEDVDTF 557
Query: 609 QVLYREHCEAIVDVMVNLQFTLVETLWKTFWRYNLSQPSEAPPLAVHDEAEKRLPKAILV 668
+ LYREHCE+ +D ++NL+F VE L + FWR + + + E EK L K L
Sbjct: 558 RGLYREHCESFLDAVLNLEFNTVEFLLRDFWRASDNNNLD------ECEEEKYLSKTKLY 611
Query: 669 LLSKFEPVLQWTKHCDNVLYQGLVEILIPDVLRPIPSALTQAIRNFAKSLESWLTHAMVN 728
LL V ++ + D YQ V+++IPDVLR IP+ALTQAIRNFAK+LE WL +M+
Sbjct: 612 LLCHCAEVQKFVREVDYQFYQNTVDVIIPDVLRSIPNALTQAIRNFAKNLEIWLCESMLG 671
Query: 729 IPEEMLRVKVAAAGAFAQTLRRYTSLNHLAQAARAVLQNTAQINQMLSDLNRVDFANVQE 788
+PE++ ++K +A AF QTLRRYTSLNHLAQAARAVLQN AQI+QMLSDLNRVDF NVQE
Sbjct: 672 VPEQLAQIKNSAVSAFCQTLRRYTSLNHLAQAARAVLQNGAQISQMLSDLNRVDFHNVQE 731
Query: 789 QASWVCRCEDRVVQRLEQDFKVTLQQQNSLEQWAAWLDGVVSQVLKPYQGSAGFPKAAKL 848
QA+WV +C VVQRLE DFK LQQQ+SLEQWA+WL VV ++ Y G + +AA+
Sbjct: 732 QAAWVSQCAPAVVQRLESDFKAALQQQSSLEQWASWLQLVVESAMEEYNGKPTYARAARQ 791
Query: 849 FLLKWSFYSSMVIRDLTLRSAASFGSFHLIRLLYDEYMYYLIEHRVAQAKGETPIAVMGE 908
FLLKWSFYSSM+IRDLTLRSA+SFGSFHLIRLL+DEYM+YL+EH++A+A+ +T IAV+ E
Sbjct: 792 FLLKWSFYSSMIIRDLTLRSASSFGSFHLIRLLFDEYMFYLVEHKIAEAQDKTAIAVICE 851
>sp|Q09555|YQW1_CAEEL HYPOTHETICAL 88.3 KD PROTEIN F33H1.1 FROM CHROMOSOME II
pir||T21708 hypothetical protein F33H1.1 - Caenorhabditis elegans
emb|CAA88701.1| (Z48783) similarity with the transcripton factor involved in the
expression of the human MHC class II gene (Swiss Prot
accession number P22670)~cDNA EST EMBL:T01603 comes from
this gene~cDNA EST yk116b6.5 comes from this gene~cDNA
EST yk151d8.5 comes f>
gb|AAF63475.1|AF233652_1 (AF233652) RFX-type transcription factor DAF-19 short variant
[Caenorhabditis elegans]
Length = 780
Score = 217 bits (547), Expect = 5e-55
Identities = 125/325 (38%), Positives = 177/325 (54%), Gaps = 22/325 (6%)
Query: 599 GVGPGDIKAFQVLYREHCEAIVDVMVNLQFTLVETLWKTFWRYNLSQPSEAPPLAVHDEA 658
GVG ++ + +Y C I+ ++ N+ F VE W FW N + H A
Sbjct: 451 GVGEEELNSLIDIYEILCREILALIKNIDFASVEDTWSKFWSGNFGVDRD------HISA 504
Query: 659 EKRLPKAILVLLSKFEPVLQWTKHCDNVLYQGLVEILIPDVL-RPIPSALTQAIRNFAKS 717
L + V + D LYQ +V+ LIP+VL + + +TQ R FAK+
Sbjct: 505 -----------LCTLDQVQDYIIEVDLALYQTIVDTLIPNVLLSELSTGMTQTCRTFAKN 553
Query: 718 LESWLTHAMV--NIPEEMLRVKVAAAGAFAQTLRRYTSLNHLAQAARAVLQNTAQINQML 775
++ +L +++ N+ E ++ K+ A Q L+RYTSLNHLA A+R VL Q+ QM
Sbjct: 554 IDVYLRKSLMLANLGEFFVKKKIQAIKYLQQGLKRYTSLNHLAHASRGVLMKPEQVQQMY 613
Query: 776 SDLNRVDFANVQEQASWVCRCEDRVVQRLEQDFKVTLQQQNSLEQWAAWLDGVVSQVLKP 835
D RVD V +QA W+C C+ +V + FK LQ+ +++E WA WL+ +V QVL
Sbjct: 614 QDYIRVDINTVHQQAGWICGCDSVMVHHVNNAFKHNLQRMSAMEVWAEWLESIVDQVLAK 673
Query: 836 YQGSAG--FPKAAKLFLLKWSFYSSMVIRDLTLRSAASFGSFHLIRLLYDEYMYYLIEHR 893
Y K FLL WSFY+SM+IRDLTLRSA SFGSF LIRLL D+YMYYLIE +
Sbjct: 674 YHDKPANVIANVGKQFLLNWSFYTSMIIRDLTLRSAMSFGSFTLIRLLADDYMYYLIESK 733
Query: 894 VAQAKGETPIAVMGEFANLATSLNP 918
+A+A + I V+ + + NP
Sbjct: 734 IAKAGKQQLITVIRADKDWPLTTNP 758
Score = 108 bits (267), Expect = 3e-22
Identities = 54/92 (58%), Positives = 61/92 (65%), Gaps = 1/92 (1%)
Query: 433 RASPATVQWLLDNYETAEGVSLPRSTLYCHYLLHCQEQKLEPVNAASFGKLIRSVFMXXX 492
RASPATV WL +NYE EG SLPR LY HY HC E +++PVNAASFGKLIRSVF
Sbjct: 230 RASPATVNWLFENYEIGEG-SLPRCELYDHYKKHCAEHRMDPVNAASFGKLIRSVFHNLK 288
Query: 493 XXXXXXXXNSKYHYYGLRIKASSPLLRLMEDQ 524
NSKYHYYG+R+K SS L + Q
Sbjct: 289 TRRLGTRGNSKYHYYGIRLKDSSTLHSMQHPQ 320
>gb|AAF61564.1|AF226156_1 (AF226156) RFX-like transcription factor DAF-19 [Caenorhabditis
elegans]
Length = 805
Score = 217 bits (547), Expect = 5e-55
Identities = 125/325 (38%), Positives = 177/325 (54%), Gaps = 22/325 (6%)
Query: 599 GVGPGDIKAFQVLYREHCEAIVDVMVNLQFTLVETLWKTFWRYNLSQPSEAPPLAVHDEA 658
GVG ++ + +Y C I+ ++ N+ F VE W FW N + H A
Sbjct: 476 GVGEEELNSLIDIYEILCREILALIKNIDFASVEDTWSKFWSGNFGVDRD------HISA 529
Query: 659 EKRLPKAILVLLSKFEPVLQWTKHCDNVLYQGLVEILIPDVL-RPIPSALTQAIRNFAKS 717
L + V + D LYQ +V+ LIP+VL + + +TQ R FAK+
Sbjct: 530 -----------LCTLDQVQDYIIEVDLALYQTIVDTLIPNVLLSELSTGMTQTCRTFAKN 578
Query: 718 LESWLTHAMV--NIPEEMLRVKVAAAGAFAQTLRRYTSLNHLAQAARAVLQNTAQINQML 775
++ +L +++ N+ E ++ K+ A Q L+RYTSLNHLA A+R VL Q+ QM
Sbjct: 579 IDVYLRKSLMLANLGEFFVKKKIQAIKYLQQGLKRYTSLNHLAHASRGVLMKPEQVQQMY 638
Query: 776 SDLNRVDFANVQEQASWVCRCEDRVVQRLEQDFKVTLQQQNSLEQWAAWLDGVVSQVLKP 835
D RVD V +QA W+C C+ +V + FK LQ+ +++E WA WL+ +V QVL
Sbjct: 639 QDYIRVDINTVHQQAGWICGCDSVMVHHVNNAFKHNLQRMSAMEVWAEWLESIVDQVLAK 698
Query: 836 YQGSAG--FPKAAKLFLLKWSFYSSMVIRDLTLRSAASFGSFHLIRLLYDEYMYYLIEHR 893
Y K FLL WSFY+SM+IRDLTLRSA SFGSF LIRLL D+YMYYLIE +
Sbjct: 699 YHDKPANVIANVGKQFLLNWSFYTSMIIRDLTLRSAMSFGSFTLIRLLADDYMYYLIESK 758
Query: 894 VAQAKGETPIAVMGEFANLATSLNP 918
+A+A + I V+ + + NP
Sbjct: 759 IAKAGKQQLITVIRADKDWPLTTNP 783
Score = 108 bits (267), Expect = 3e-22
Identities = 54/92 (58%), Positives = 61/92 (65%), Gaps = 1/92 (1%)
Query: 433 RASPATVQWLLDNYETAEGVSLPRSTLYCHYLLHCQEQKLEPVNAASFGKLIRSVFMXXX 492
RASPATV WL +NYE EG SLPR LY HY HC E +++PVNAASFGKLIRSVF
Sbjct: 255 RASPATVNWLFENYEIGEG-SLPRCELYDHYKKHCAEHRMDPVNAASFGKLIRSVFHNLK 313
Query: 493 XXXXXXXXNSKYHYYGLRIKASSPLLRLMEDQ 524
NSKYHYYG+R+K SS L + Q
Sbjct: 314 TRRLGTRGNSKYHYYGIRLKDSSTLHSMQHPQ 345
>sp|P48381|RFX3_MOUSE DNA BINDING PROTEIN RFX3
pir||E55926 DNA binding protein RFX3 - mouse (fragment)
emb|CAA53704.1| (X76090) DNA binding protein RFX3 [Mus musculus]
Length = 189
Score = 192 bits (482), Expect = 2e-47
Identities = 102/188 (54%), Positives = 120/188 (63%), Gaps = 27/188 (14%)
Query: 418 GYMLGSASQSYSHTTRASPATV-------------------------QWLLDNYETAEGV 452
G + ++ S +HTTRASPAT+ QWLLDNYETAEGV
Sbjct: 2 GNSMENSGHSVTHTTRASPATIEMAIETLQKSDGLSTHRSSLLNSHLQWLLDNYETAEGV 61
Query: 453 SLPRSTLYCHYLLHCQEQKLEPVNAASFGKLIRSVFMXXXXXXXXXXXNSKYHYYGLRIK 512
SLPRSTLY HYL HCQE KL+PVNAASFGKLIRS+FM NSKYHYYG+R+K
Sbjct: 62 SLPRSTLYNHYLRHCQEHKLDPVNAASFGKLIRSIFMGLRTRRLGTRGNSKYHYYGIRVK 121
Query: 513 ASSPLLRLMEDQQHMAMRGQPFSQKQRLKPIQKMEGMTNG-VAVGQQPSTGLSD-ISAQV 570
SPL RL ED Q+MAMR QP QKQR KP+QK++G+ +G GQQ T + + AQ
Sbjct: 122 PDSPLNRLQEDMQYMAMRQQPMQQKQRYKPMQKVDGVADGFTGSGQQTGTSVEQTVIAQS 181
Query: 571 QQYQQFLD 578
Q +QQFLD
Sbjct: 182 QHHQQFLD 189
>pdb|1DP7|P Chain P, Cocrystal Structure Of Rfx-Dbd In Complex With Its Cognate
X-Box Binding Site
Length = 76
Score = 128 bits (318), Expect = 3e-28
Identities = 63/76 (82%), Positives = 63/76 (82%)
Query: 438 TVQWLLDNYETAEGVSLPRSTLYCHYLLHCQEQKLEPVNAASFGKLIRSVFMXXXXXXXX 497
TVQWLLDNYETAEGVSLPRSTLY HYLLH QEQKLEPVNAASFGKLIRSVFM
Sbjct: 1 TVQWLLDNYETAEGVSLPRSTLYNHYLLHSQEQKLEPVNAASFGKLIRSVFMGLRTRRLG 60
Query: 498 XXXNSKYHYYGLRIKA 513
NSKYHYYGLRIKA
Sbjct: 61 TRGNSKYHYYGLRIKA 76
>gb|AAC62839.1| (AC005784) RFX2_HUMAN, [AA 620-723] [Homo sapiens]
Length = 104
Score = 108 bits (268), Expect = 2e-22
Identities = 56/67 (83%), Positives = 61/67 (90%), Gaps = 1/67 (1%)
Query: 857 SSMVIRDLTLRSAASFGSFHLIRLLYDEYMYYLIEHRVAQAKGETPIAVMGEFANLAT-S 915
SSMVIRDLTLRSAASFGSFHLIRLLYDEYM+YL+EHRVA+A GETPIAVMGEF +LA+ S
Sbjct: 1 SSMVIRDLTLRSAASFGSFHLIRLLYDEYMFYLVEHRVAEATGETPIAVMGEFNDLASLS 60
Query: 916 LNPLDPD 922
L LD D
Sbjct: 61 LTLLDKD 67
>emb|CAB85619.1| (AJ243296) putative RFX transcription factor [Penicillium
chrysogenum]
Length = 855
Score = 80.0 bits (194), Expect = 1e-13
Identities = 110/516 (21%), Positives = 185/516 (35%), Gaps = 88/516 (17%)
Query: 441 WLLDNYETAEGVSLPRSTLYCHYLLHCQEQKLEPVNAASFGKLIRSVFMXXXXXXXXXXX 500
WL +N + G S+ R +YC Y +C +++ +N ASFGKL+R +F
Sbjct: 235 WLRENCRKSSG-SVRRDRVYCCYAENCGTERVSVLNPASFGKLVRIIFPNVQTRRLGVRG 293
Query: 501 NSKYHYYGLRI--------------KASS---PLLRLMEDQQHMAMRGQP------FSQK 537
SKYHY L + SS P +E Q A++ QP F
Sbjct: 294 ESKYHYVDLSVIEEKQQKLAPLQPTNCSSFNWPCFNALEGCQCDALKKQPTADTAVFPSP 353
Query: 538 QRLKPIQKMEGMT-------NGVAVGQQPSTGLSDISAQ----VQQYQQFLDASRSLPDF 586
P + + + G + + +++ Q + Q QF L D
Sbjct: 354 TTSFPPRFPNNASPADCNCQSHTPSGPEATITRENVAQQAGKMIHQMLQFPTDENPLVDN 413
Query: 587 TELDLQG--KVLPEGVGPGDIKAFQVLYREHCEAIVDVMVNLQFTLVETLWKTFWRYNLS 644
L L LP A LYR HC +++D + ++ L K F ++ +
Sbjct: 414 DTLQLPDIRAYLPSNTDLKVAAALAALYRSHCISVID---SFRYCKERNLMKYFSAFHGT 470
Query: 645 QPSEAPPLAVHDEAEKRLPKAILVLLSKFEPVLQWTKHCDNVLYQGLVEILIPDVLRPIP 704
L H + W K CD ++YQ ++ + P + +P
Sbjct: 471 LTVPVQKLLTHPN------------------LAPWIKECDWMMYQKMIAFVAPLTTQVVP 512
Query: 705 SALTQAIRNFAKSLESWLTHAMVNIPEEMLRVKVAAAGAFAQTLRRYTSLNHLAQAARAV 764
+ A + ++ L + P + ++ A F L+ +N A AA A
Sbjct: 513 KPVLDAFNSISQRLCGHIAETFKTQPTHVSIARLIPAHIFCNLLKHMLDVNQAANAAAAW 572
Query: 765 LQNTAQINQMLSDL-NRVDFANVQEQASWVCRCEDRVVQRLEQDFKVTLQQQNSLEQWAA 823
L + NQM +D V+ ++ +A+ E Q L+ D + L + + A+
Sbjct: 573 LCHPDNRNQMWTDFKTMVNPRDMMTKANIPTCAELATEQILKHDIRALLTPLSDADPSAS 632
Query: 824 WL--------DGVVSQVLKPYQGSAG----FPKAAKLFLL----------------KWSF 855
L D V + P + + G FP F+L K
Sbjct: 633 LLFFTQPDTPDSVEAHKF-PVESAPGDEYNFPDKWVQFILNIPAAFANHRTQCVIEKVDA 691
Query: 856 YSSMVIRDLTLRSAASFGSFHLIRLLYDEYMYYLIE 891
V+ LTL A SF ++ + ++ + E M + E
Sbjct: 692 LWDSVLHRLTLAGAPSFSAWWMTKVFFHEMMVWQAE 727
>ref|NP_002911.1| regulatory factor X, 4 (influences HLA class II expression) [Homo
sapiens]
gb|AAA58461.1| (M69296) estrogen receptor-related protein [Homo sapiens]
Length = 336
Score = 78.8 bits (191), Expect = 3e-13
Identities = 37/73 (50%), Positives = 50/73 (67%), Gaps = 2/73 (2%)
Query: 418 GYMLGSASQSYSHTTR--ASPATVQWLLDNYETAEGVSLPRSTLYCHYLLHCQEQKLEPV 475
G M G + + ++ ++PAT+QWL +NYE AEGV +PRS LY HYL C++ +PV
Sbjct: 249 GMMKGDEEKENNRASKPHSTPATLQWLEENYEIAEGVCIPRSALYMHYLDFCEKNDTQPV 308
Query: 476 NAASFGKLIRSVF 488
NAASFGK+IR F
Sbjct: 309 NAASFGKIIRQQF 321
>sp|P48383|SAK1_SCHPO SAK1 PROTEIN
pir||T11650 sak1 protein - fission yeast (Schizosaccharomyces pombe)
emb|CAA15923.1| (AL021046) sak1 protein. [Schizosaccharomyces pombe]
Length = 766
Score = 71.4 bits (172), Expect = 4e-11
Identities = 70/306 (22%), Positives = 130/306 (41%), Gaps = 44/306 (14%)
Query: 607 AFQVLYREHCEAIVDVMVNLQFTLVETLWKTFWRYNLSQPSEAPPLAVHDEAEKRLPKAI 666
A +Y HC TL+E++ + LS+ S P L ++
Sbjct: 392 ALMNIYSSHC-----------ITLIESVRYMHLKQFLSEISNFP---------NSLSPSL 431
Query: 667 LVLLSKFEPVLQWTKHCDNVLYQGLVEILIPDVLRPIPSALTQAIRNFAKSLESWLTHAM 726
L LLS +W + D V+Y+ ++++L P L+ +P + +R+ A++L + ++
Sbjct: 432 LALLSS-PYFTKWIERSDTVMYREILKLLFPMTLQVVPPPVLVLLRHLAENLVNHISSIY 490
Query: 727 VNIPEEMLRVKVAAAGAFAQTLRRYTSLNHLAQAARAVLQNTAQINQMLSDLNRV----- 781
+ +L+VK A F+ L R +N A AA L N A + + +D R
Sbjct: 491 ASHSSCLLQVKSETAAIFSNLLSRLLRVNDTAHAAARFLANPADRHLICNDWERFVSTRF 550
Query: 782 ---------DFANVQEQASW-----VCRCEDRVVQRLEQDFKVTLQQQNSLEQWAAWLDG 827
D V W C ++ L+ + + N +E +DG
Sbjct: 551 IVHRELMCNDKEAVAALDEWYSILSTCSNPSELLDPLKDKHEASDTSMNRVE--LRQIDG 608
Query: 828 VVSQVLKPY-QGSAGFPKAA-KLFLLKWSFYSSMVIRDLTLRSAASFGSFHLIRLLYDEY 885
V+ ++ + + + FP + ++FLL + V+R++T+ +FG+ +IR DEY
Sbjct: 609 VLDRMADFFLELPSRFPSCSPRMFLLCLGALQTSVLREITVSGGEAFGALWVIRCWVDEY 668
Query: 886 MYYLIE 891
M ++ E
Sbjct: 669 MTWVAE 674
Score = 59.3 bits (141), Expect = 2e-07
Identities = 26/80 (32%), Positives = 43/80 (53%)
Query: 441 WLLDNYETAEGVSLPRSTLYCHYLLHCQEQKLEPVNAASFGKLIRSVFMXXXXXXXXXXX 500
WL E + ++ R+ +Y HY+ C ++P+N+ASFGKL+R +F
Sbjct: 104 WLKRACEEQQDAAVQRNQIYAHYVEICNSLHIKPLNSASFGKLVRLLFPSIKTRRLGMRG 163
Query: 501 NSKYHYYGLRIKASSPLLRL 520
+SKYHY G++++ RL
Sbjct: 164 HSKYHYCGIKLRGQDSFRRL 183
>gb|AAA67937.1| (U19978) RFX family DNA-binding protein [Schizosaccharomyces pombe]
Length = 734
Score = 68.3 bits (164), Expect = 4e-10
Identities = 71/310 (22%), Positives = 133/310 (42%), Gaps = 52/310 (16%)
Query: 607 AFQVLYREHCEAIVDVMVNLQFTLVETLWKTFWRYNLSQPSEAPPLAVHDEAEKRLPKAI 666
A +Y HC TL+E++ + LS+ S P L ++
Sbjct: 392 ALMNIYSSHC-----------ITLIESVRYMHLKQFLSEISNFP---------NSLSPSL 431
Query: 667 LVLLSKFEPVLQWTKHCDNVLYQGLVEILIPDVLRPIPSALTQAIRNFAKSLESWLTHAM 726
L LLS +W + D V+Y+ + ++L P L+ +P + +R+ A++L + ++
Sbjct: 432 LALLSS-PYFTKWIERSDTVMYREIRKLLFPMTLQVVPPPVLVLLRHLAENLVNHISSIY 490
Query: 727 VNIPEEMLRVKVAAAGAFAQTLRRYTSLNHLAQAARAVLQNTAQINQMLSDLNRVDFANV 786
+ +L+VK A F+ L R +N A AA L N A + + +D R F +
Sbjct: 491 ASHSSCLLQVKSETAAIFSNLLSRLLRVNDTAHAAARFLANPADRHLICNDWER--FVST 548
Query: 787 QEQASWVCRCEDR-VVQRLEQDFKV----------------------TLQQQNSLEQWAA 823
+ C D+ V L++ + + T ++ L Q
Sbjct: 549 RFIVHRELMCNDKEAVAALDEWYSILSTCSNPSDLLDPLKDKHEASDTSMKRVELRQ--- 605
Query: 824 WLDGVVSQVLKPY-QGSAGFPKAA-KLFLLKWSFYSSMVIRDLTLRSAASFGSFHLIRLL 881
+DGV+ ++ + + + FP + ++FLL + V+R++T+ +FG+ +IR
Sbjct: 606 -IDGVLDRMADFFLELPSRFPSCSPRMFLLCLGALQTSVLREITVSGGEAFGALWVIRCW 664
Query: 882 YDEYMYYLIE 891
DEYM ++ E
Sbjct: 665 VDEYMTWVAE 674
Score = 59.3 bits (141), Expect = 2e-07
Identities = 26/80 (32%), Positives = 43/80 (53%)
Query: 441 WLLDNYETAEGVSLPRSTLYCHYLLHCQEQKLEPVNAASFGKLIRSVFMXXXXXXXXXXX 500
WL E + ++ R+ +Y HY+ C ++P+N+ASFGKL+R +F
Sbjct: 104 WLKRACEEQQDAAVQRNQIYAHYVEICNSLHIKPLNSASFGKLVRLLFPSIKTRRLGMRG 163
Query: 501 NSKYHYYGLRIKASSPLLRL 520
+SKYHY G++++ RL
Sbjct: 164 HSKYHYCGIKLRGQDSFRRL 183
>emb|CAB85587.1| (AJ132014) cephalosporin C regulator 1 [Acremonium chrysogenum]
Length = 830
Score = 58.9 bits (140), Expect = 2e-07
Identities = 57/235 (24%), Positives = 97/235 (41%), Gaps = 29/235 (12%)
Query: 676 VLQWTKHCDNVLYQGLVEILIPDVLRPIPSALTQAIRNFAKSLESWLTHAMVNIPEEMLR 735
+ W + CD VLYQ L + L +P+ +R+ A L + PE +++
Sbjct: 480 IAPWIEECDLVLYQRLTKFAFTMSLVVVPNVYLNRMRSIADRLVPHIVDVFGGHPEHVVQ 539
Query: 736 VKVAAAGAFAQTLRRYTSLNHLAQAARAVLQNTAQINQMLSDLNRVDFANVQEQASWV-C 794
K A F L R T +N A AA + A +QM +D ++ N ++ A V
Sbjct: 540 AKAGPAAIFVGILERMTRVNKTAHAAARPVALDANRDQMYADW--LELVNARKIAECVPT 597
Query: 795 RCEDRVVQRLEQDFKVTLQQQNSLEQWAA----------------WLDGVVSQVLKPYQG 838
R D V + L ++ + + +N + W D ++S K
Sbjct: 598 RGMDDVAELLVREMRYLVDPKNYIPDDETSNADAGGARSPFAVNRWRDFLMSLPGK---- 653
Query: 839 SAGFPKAAKLFLLKWSF--YSSMVIRDLTLRSAASFGSFHLIRLLYDEYMYYLIE 891
FP A+ ++ W + ++R+LTL+ SF ++ I+ DE + YL E
Sbjct: 654 ---FPYASHEDIV-WCVERVGTAIVRELTLQGGTSFTTWWSIKTFLDEEIMYLAE 704
Score = 46.9 bits (109), Expect = 0.001
Identities = 22/64 (34%), Positives = 32/64 (49%)
Query: 453 SLPRSTLYCHYLLHCQEQKLEPVNAASFGKLIRSVFMXXXXXXXXXXXNSKYHYYGLRIK 512
S+PR +Y Y C + ++ +N ASFGKL+R VF SKYHY +++
Sbjct: 238 SIPRGRVYHTYASKCTDDRVVVLNPASFGKLVRVVFPSIKTRRLGVRGESKYHYCNFQLR 297
Query: 513 ASSP 516
P
Sbjct: 298 DPPP 301
>pir||S51421 hypothetical protein YLR176c - yeast (Saccharomyces cerevisiae)
Length = 771
Score = 57.0 bits (135), Expect = 9e-07
Identities = 25/71 (35%), Positives = 38/71 (53%)
Query: 441 WLLDNYETAEGVSLPRSTLYCHYLLHCQEQKLEPVNAASFGKLIRSVFMXXXXXXXXXXX 500
WL+ N ++ +PR ++ Y C + L+P++ AS GKLIR+VF
Sbjct: 248 WLMKNCKSQHDSYVPRGKIFAQYASSCSQNNLKPLSQASLGKLIRTVFPDLTTRRLGMRG 307
Query: 501 NSKYHYYGLRI 511
SKYHY GL++
Sbjct: 308 QSKYHYCGLKL 318
Score = 43.8 bits (101), Expect = 0.009
Identities = 35/178 (19%), Positives = 74/178 (40%), Gaps = 27/178 (15%)
Query: 584 PDFTELDLQGKVLPEGVGPGD-----IKAFQVLYREHCEAIVDVMVNLQFTLVETLWKTF 638
P T L +P GV P D I + + LY HC ++ + ++F + +
Sbjct: 457 PLLTSYKLDFPKIPAGVLPTDTDSDVISSLESLYHIHCNSVYEC---IKFLKSDNISNAL 513
Query: 639 WRYNLSQPSEAPPLAVHDEAEKRLPKAILVLLSKFEPVLQWTKHCDNVLYQGLVEILIPD 698
+ N + S P + +S EP++ W CD + Y GL++
Sbjct: 514 FFSNSNSIS---------------PTMFNLFIS--EPLIDWVTKCDLITYTGLIKFFSQF 556
Query: 699 VL--RPIPSALTQAIRNFAKSLESWLTHAMVNIPEEMLRVKVAAAGAFAQTLRRYTSL 754
++ I ++ Q + + K L + A++ +P+ +++ K++ F + +++ L
Sbjct: 557 IIHSNEISDSIIQKLESMIKLLPEQINKAVLELPKALVQRKLSIINNFTKLVKKLIKL 614
>ref|NP_013277.1| DNA binding protein, homologous to mammalian RFX1-4 proteins; Rfx1p
[Saccharomyces cerevisiae]
sp|P48743|RFXL_YEAST HYPOTHETICAL 90.6 KD PROTEIN IN CBF5-DKA1 INTERGENIC REGION
gb|AAB67470.1| (U17246) Ylr176cp [Saccharomyces cerevisiae]
Length = 811
Score = 57.0 bits (135), Expect = 9e-07
Identities = 25/71 (35%), Positives = 38/71 (53%)
Query: 441 WLLDNYETAEGVSLPRSTLYCHYLLHCQEQKLEPVNAASFGKLIRSVFMXXXXXXXXXXX 500
WL+ N ++ +PR ++ Y C + L+P++ AS GKLIR+VF
Sbjct: 288 WLMKNCKSQHDSYVPRGKIFAQYASSCSQNNLKPLSQASLGKLIRTVFPDLTTRRLGMRG 347
Query: 501 NSKYHYYGLRI 511
SKYHY GL++
Sbjct: 348 QSKYHYCGLKL 358
Score = 43.8 bits (101), Expect = 0.009
Identities = 35/178 (19%), Positives = 74/178 (40%), Gaps = 27/178 (15%)
Query: 584 PDFTELDLQGKVLPEGVGPGD-----IKAFQVLYREHCEAIVDVMVNLQFTLVETLWKTF 638
P T L +P GV P D I + + LY HC ++ + ++F + +
Sbjct: 497 PLLTSYKLDFPKIPAGVLPTDTDSDVISSLESLYHIHCNSVYEC---IKFLKSDNISNAL 553
Query: 639 WRYNLSQPSEAPPLAVHDEAEKRLPKAILVLLSKFEPVLQWTKHCDNVLYQGLVEILIPD 698
+ N + S P + +S EP++ W CD + Y GL++
Sbjct: 554 FFSNSNSIS---------------PTMFNLFIS--EPLIDWVTKCDLITYTGLIKFFSQF 596
Query: 699 VL--RPIPSALTQAIRNFAKSLESWLTHAMVNIPEEMLRVKVAAAGAFAQTLRRYTSL 754
++ I ++ Q + + K L + A++ +P+ +++ K++ F + +++ L
Sbjct: 597 IIHSNEISDSIIQKLESMIKLLPEQINKAVLELPKALVQRKLSIINNFTKLVKKLIKL 654
>gb|AAF54121.2| (AE003675) CG9727 gene product [Drosophila melanogaster]
Length = 1280
Score = 53.5 bits (126), Expect = 1e-05
Identities = 36/128 (28%), Positives = 63/128 (49%), Gaps = 14/128 (10%)
Query: 438 TVQWLLDNYETAEGVSLPRSTLYCHYLLHCQEQKLEPVNAASFGKLIRSVFMXXXXXXXX 497
T+ W+ + E VS+P+ +Y Y+ +C+ ++P++ A FGK+++ VF
Sbjct: 310 TINWVRSHLEHDAQVSIPKQDVYNDYIAYCERLSIKPLSTADFGKVMKQVFPGVRPRRLG 369
Query: 498 XXXNSKYHYYGLR--IKASSPLL-RLMEDQQHMAMRGQPFSQKQRLKPIQKMEGMTNGVA 554
NS+Y Y +R K + P L +L + +Q +A G S +L T+ +
Sbjct: 370 TRGNSRYCYAAMRKTTKLTPPQLPQLCKTEQIVA--GDSNSDPSQL---------TSASS 418
Query: 555 VGQQPSTG 562
+G PSTG
Sbjct: 419 LGGLPSTG 426
>pir||S55315 mucin (clone PGM-2A) - pig
pir||I47141 gastric mucin (clone PGM-2A) - pig (fragment)
gb|AAC48526.1| (U10281) gastric mucin [Sus scrofa]
Length = 528
Score = 49.2 bits (115), Expect = 2e-04
Identities = 62/284 (21%), Positives = 100/284 (34%), Gaps = 13/284 (4%)
Query: 80 AVPAPSQPTGAPTPS---PAPQQYIVVTVSEGAMRASETVSEASPGSTASXXXXXXXXXX 136
+VP PS + P+ S P V T S + S T+S S++S
Sbjct: 27 SVPIPSTTSVQPSSSGSAPTTSATSVQTSSSSSPPISSTIS-VQTSSSSSVPTTSTTSVQ 85
Query: 137 XXXXXXXXXXXXXSVQAKPGHVSPLQLTNIQVPQQA--LPTQRLVVQSAAPGSKGGQVSL 194
SVQ+ +P+ T P + +PT ++ S S
Sbjct: 86 PSSSSSAPTTRATSVQSSSSSSAPISSTTSVQPSSSGSVPTTSATSVQSSSSSSAPTTSA 145
Query: 195 TVHGTQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVVQATP 254
T SPP S V SSS +A + T Q S V+ + S + P
Sbjct: 146 TSVQPSSSSSPPISSTVSVQPSSSSSAPTTSATSVQPSSSSSPPISSTVSVQTSSSSSVP 205
Query: 255 QAPKPGPVQPLTVQGLQPXXXXXXXXXXXXXXXXXXYSSQVQYVEGGDASYTASAIRSST 314
VQP + + P ++ VQ +S +++ S+T
Sbjct: 206 TTSTTS-VQPSSSSSV-PTTSATSVRSSSSSSTPIPSTTSVQ-----PSSSSSAPTTSAT 258
Query: 315 YSYPETPLYTQTASTSYYEAAGTATQVSTPATSQAVASSGSMPM 358
P + T ST+ + + +++ +T ATS +SS S P+
Sbjct: 259 SVQPSSSSSTPIPSTTSVQPSSSSSAPTTSATSVQPSSSSSPPI 302
Score = 44.9 bits (104), Expect = 0.004
Identities = 65/281 (23%), Positives = 89/281 (31%), Gaps = 39/281 (13%)
Query: 80 AVPAPSQPTGAPTPSPAPQQYIVVTVSEGAMRASETVSEAS--PGSTASXXXXXXXXXXX 137
+ P S + P+ S +P ++V + +S T S S P S+ S
Sbjct: 283 SAPTTSATSVQPSSSSSPPISSTISVQPSSSSSSPTTSTTSVQPSSSGSAPTTSATSVQP 342
Query: 138 XXXXXXXXXXXXSVQAKPGHVSPLQLTNIQVPQQALPTQRLVVQSAAPGSKGGQVSLTVH 197
SVQ SP T P + S P S
Sbjct: 343 SSSSSPPISSTISVQPSSSSSSPTTSTTSVQPSSSGSAPTTSATSVQPSSSS-------- 394
Query: 198 GTQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVVQATPQAP 257
S P S SSSS + PT T Q SVP T S VQ + +
Sbjct: 395 ------SVPTTSATSVRSSSSSSTPIPTTTSVQP----SSSSSVPTTSATS-VQTSSSSS 443
Query: 258 KPGPVQPLTVQGLQPXXXXXXXXXXXXXXXXXXYSSQVQYVEGGDASYTASAIRSSTYSY 317
P P + +QP SS S T S SS+ S
Sbjct: 444 TPIP----STTSVQPSSSSSAPTTSATSVQPSSSSSP-------PISSTISVQPSSSSSS 492
Query: 318 PETPLYTQTASTSYYEAAGTATQVSTPATSQAVASSGSMPM 358
P T ST+ + + + + +T ATS +SS S P+
Sbjct: 493 P-------TTSTTSVQPSSSGSAPTTSATSVQPSSSSSPPI 526
Score = 41.8 bits (96), Expect = 0.035
Identities = 60/285 (21%), Positives = 94/285 (32%), Gaps = 16/285 (5%)
Query: 78 LPAVPAPSQPTGAPTPSPAPQQYIVVTVSEGAMRASETVSEASPGSTASXXXXXXXXXXX 137
+P A S + + + +P V S + S TVS P S++S
Sbjct: 124 VPTTSATSVQSSSSSSAPTTSATSVQPSSSSSPPISSTVS-VQPSSSSSAPTTSATSVQP 182
Query: 138 XXXXXXXXXXXXSVQAKPGHVSPLQLTNIQVPQQA--LPTQRLVVQSAAPGSKGGQVSLT 195
SVQ P T P + +PT ++ S S T
Sbjct: 183 SSSSSPPISSTVSVQTSSSSSVPTTSTTSVQPSSSSSVPTTSATSVRSSSSSSTPIPSTT 242
Query: 196 VHGTQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVVQATPQ 255
S P S SSS + P+ T Q S P T SV ++
Sbjct: 243 SVQPSSSSSAPTTSATSVQPSSSSSTPIPSTTSVQPSS----SSSAPTTSATSVQPSSSS 298
Query: 256 APKPGP---VQPLTVQGLQPXXXXXXXXXXXXXXXXXXYSSQVQYVEGGDASYTASAIRS 312
+P VQP + P ++ VQ +S ++ I S
Sbjct: 299 SPPISSTISVQPSSSSS-SPTTSTTSVQPSSSGSAPTTSATSVQ-----PSSSSSPPISS 352
Query: 313 STYSYPETPLYTQTASTSYYEAAGTATQVSTPATSQAVASSGSMP 357
+ P + + T ST+ + + + + +T ATS +SS S+P
Sbjct: 353 TISVQPSSSSSSPTTSTTSVQPSSSGSAPTTSATSVQPSSSSSVP 397
>pir||T34433 hypothetical protein K06A9.1a - Caenorhabditis elegans
gb|AAC70889.1| (U80846) K06A9.1a gene product [Caenorhabditis elegans]
Length = 1032
Score = 45.7 bits (106), Expect = 0.002
Identities = 68/316 (21%), Positives = 101/316 (31%), Gaps = 40/316 (12%)
Query: 75 VTELPAVPAPSQPTGAPTPSPAPQQY-IVVTVSEGAMRASETVSEASPGSTASXXXXXXX 133
VT +P P+ +P PS +P +T+S + TVS GST S
Sbjct: 519 VTVVPGSSTSPAPSSSPNPSSSPASTGSTITISGSSSIIVSTVS----GSTVSGSTGTSQ 574
Query: 134 XXXXXXXXXXXXXXX--XSVQAKPGHVSPLQLTNIQVPQQAL---PTQRLVVQSAAP--- 185
S +P SP T P Q P+ + S+ P
Sbjct: 575 STLASSTATPGSSSTVPSSSSPQPSSQSPAPNTGSTTPSQTSSQSPSPSMNPSSSTPTGS 634
Query: 186 ----------------GSKGGQVSLTVHGTQQVHSPPEQS--PVQANSSSSKTAGAPTGT 227
GS G S+ T Q P S NSS S ++ +P+ +
Sbjct: 635 SQSTITPEGSTASSPTGSTGSTFSVATEVTSQSTVPSGSSLGTQSTNSSPSPSSLSPSTS 694
Query: 228 VPQQLQVHGVQQSVPVTQERSVVQATPQAPKPGPVQPLTVQGLQPXXXXXXXXXXXXXXX 287
L + P + + S Q+T P P P Q +
Sbjct: 695 GMSTL----TSEPSPSSTQSSGAQSTLTTPSPNPSQSTSSLESSTSGATTSSGSAGTTMT 750
Query: 288 XXXYSSQVQYVEGGDASYTASAIRSSTYSYPETPLYTQTASTSYYEAAGTATQVSTPATS 347
SS V G + + S S+T + TQT +S +A T ++
Sbjct: 751 SPSQSSSV-----GSSQGSTSPAASTTSGEMTSQGSTQTPGSSVSTSAAILTSTQQSVST 805
Query: 348 QAVASSGSMPMYVSGS 363
+ S+ + P VSGS
Sbjct: 806 NSPGSTVTRPSTVSGS 821
Score = 34.8 bits (78), Expect = 4.6
Identities = 60/292 (20%), Positives = 92/292 (30%), Gaps = 25/292 (8%)
Query: 82 PAPSQPTGAPTPSPAPQQYIVVTVSEGAMRASETVSEASPGSTASXXXXXXXXXXXXXXX 141
P S G+ TPS + ++ + G+ ++ TV+ S + S
Sbjct: 389 PGSSSTYGSSTPSASSSSSGTMSTNSGSTGSTVTVAPVSSSTFGSSTPIASSSSSGSTVT 448
Query: 142 XXXXXXXXSVQAKPGHVSPLQLTNIQVPQQALPTQRLV----------VQSAAPGSKGGQ 191
+ P S T + T +V QSA+P S G
Sbjct: 449 VVSGSSSTYGSSTPSASSSSAGTASTISGSTGSTATIVPGSSSSVGSSTQSASPSSPGTM 508
Query: 192 VSLTVHGTQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVV- 250
+++ V P S A SSS + +P T + + G + T S V
Sbjct: 509 STVSGPTGSTVTVVPGSSTSPAPSSSPNPSSSPAST-GSTITISGSSSIIVSTVSGSTVS 567
Query: 251 ------QATPQAPKPGPVQPLTV-QGLQPXXXXXXXXXXXXXXXXXXYSSQVQYVEGGDA 303
Q+T + P TV P SSQ +
Sbjct: 568 GSTGTSQSTLASSTATPGSSSTVPSSSSPQPSSQSPAPNTGSTTPSQTSSQSPSPSMNPS 627
Query: 304 SYTASAIRSSTYSYPETPLYTQTASTSYYEAAGTATQVSTPATSQAVASSGS 355
S T + ST + + + T ST G+ V+T TSQ+ SGS
Sbjct: 628 SSTPTGSSQSTITPEGSTASSPTGST------GSTFSVATEVTSQSTVPSGS 673
>pir||T34434 hypothetical protein K06A9.1a - Caenorhabditis elegans
gb|AAC70890.1| (U80846) K06A9.1a gene product [Caenorhabditis elegans]
Length = 2232
Score = 45.7 bits (106), Expect = 0.002
Identities = 68/316 (21%), Positives = 101/316 (31%), Gaps = 40/316 (12%)
Query: 75 VTELPAVPAPSQPTGAPTPSPAPQQY-IVVTVSEGAMRASETVSEASPGSTASXXXXXXX 133
VT +P P+ +P PS +P +T+S + TVS GST S
Sbjct: 519 VTVVPGSSTSPAPSSSPNPSSSPASTGSTITISGSSSIIVSTVS----GSTVSGSTGTSQ 574
Query: 134 XXXXXXXXXXXXXXX--XSVQAKPGHVSPLQLTNIQVPQQAL---PTQRLVVQSAAP--- 185
S +P SP T P Q P+ + S+ P
Sbjct: 575 STLASSTATPGSSSTVPSSSSPQPSSQSPAPNTGSTTPSQTSSQSPSPSMNPSSSTPTGS 634
Query: 186 ----------------GSKGGQVSLTVHGTQQVHSPPEQS--PVQANSSSSKTAGAPTGT 227
GS G S+ T Q P S NSS S ++ +P+ +
Sbjct: 635 SQSTITPEGSTASSPTGSTGSTFSVATEVTSQSTVPSGSSLGTQSTNSSPSPSSLSPSTS 694
Query: 228 VPQQLQVHGVQQSVPVTQERSVVQATPQAPKPGPVQPLTVQGLQPXXXXXXXXXXXXXXX 287
L + P + + S Q+T P P P Q +
Sbjct: 695 GMSTL----TSEPSPSSTQSSGAQSTLTTPSPNPSQSTSSLESSTSGATTSSGSAGTTMT 750
Query: 288 XXXYSSQVQYVEGGDASYTASAIRSSTYSYPETPLYTQTASTSYYEAAGTATQVSTPATS 347
SS V G + + S S+T + TQT +S +A T ++
Sbjct: 751 SPSQSSSV-----GSSQGSTSPAASTTSGEMTSQGSTQTPGSSVSTSAAILTSTQQSVST 805
Query: 348 QAVASSGSMPMYVSGS 363
+ S+ + P VSGS
Sbjct: 806 NSPGSTVTRPSTVSGS 821
Score = 42.2 bits (97), Expect = 0.027
Identities = 64/310 (20%), Positives = 95/310 (30%), Gaps = 29/310 (9%)
Query: 82 PAPSQPTGAPT-PSPAPQQYIVVTVSEGAMRASETVSEASPGSTASXXXXXXXXXXXXXX 140
P PSQ + PT P+ + + +++ + T+S+AS GST+
Sbjct: 1413 PVPSQTSSTPTNPTGSTESSTLLSSTISGSTQHTTMSKASSGSTSPSTNSQTGSTVTMGS 1472
Query: 141 XXXXXXXXXSVQAKPGHVSPLQLTNIQVPQQALPTQRLVVQSAAPGSKG-------GQVS 193
S + +S Q ++ A T S AP S G G V
Sbjct: 1473 SSTSGVSTSSASSTQPQMSTSQGSSAG-STVASSTASPAASSTAPSSTGTMSSTSSGTVG 1531
Query: 194 LTVHGTQQVHSPPEQSPVQANSSSSKTAGAPTG----TVPQQLQVHGVQQSVPVTQERSV 249
T+ + S Q+ SS T+G T T PQ G V +
Sbjct: 1532 STISESSTTASASSQTGSTVTMGSSSTSGVSTSSASSTQPQMSTSQGSSAGSTVASSTAG 1591
Query: 250 VQATPQAPKP----GPVQPLTVQGLQPXXXXXXXXXXXXXXXXXXYSSQVQYVEGGDASY 305
+ +T P G TV SS V AS
Sbjct: 1592 LVSTSTVPSSTGTMGSTSSGTVGSTISESSTTASASSQTGSTVTMGSSSTSGVSTSSASS 1651
Query: 306 T------------ASAIRSSTYSYPETPLYTQTASTSYYEAAGTATQVSTPATSQAVASS 353
T S + SST T + T ++GT + +++ A ASS
Sbjct: 1652 TQPQMSTSQGSSAGSTVASSTTGLVSTSTVPSSTGTMGSTSSGTVGSTISESSTAASASS 1711
Query: 354 GSMPMYVSGS 363
+ GS
Sbjct: 1712 QTGSTVTMGS 1721
Score = 36.0 bits (81), Expect = 2.0
Identities = 66/277 (23%), Positives = 97/277 (34%), Gaps = 47/277 (16%)
Query: 82 PAPSQPTGAP--TPSPAPQQYIVVTVSEGAMRASETVSEASPGSTASXXXXXXXXXXXXX 139
P PS +G+ T SP P Q S + +S T S SPG+T +
Sbjct: 855 PNPSTSSGSSMITQSPYPSQ------STSPVESSTTPSPGSPGTTLTSTSPSPSQSTTIG 908
Query: 140 XXXXXXXXXXSVQAKPGHVSPLQLTNIQVPQQALPTQRLVVQ-SAAPGSKGGQVSLTVHG 198
S ++ ++T+ Q T V Q S S ++TV
Sbjct: 909 STQGSTSPGISTTSE-------EMTSQGSTQTPGSTGSTVTQPSTVSDSTSSGSTVTVGS 961
Query: 199 TQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVVQATPQAPK 258
T+ SP + N S+S + T T PQ Q + PV E S AT +
Sbjct: 962 TEGSSSPIPSTSQNTNPSTSSGSSMSTQT-PQSSQ-----STSPV--ESSTSGATSSSGS 1013
Query: 259 PGPVQPLTVQGLQPXXXXXXXXXXXXXXXXXXYSSQVQYVEGGDASYTASAIRSSTYSYP 318
PG T+ + P SS + +G + ++ + ST
Sbjct: 1014 PGT----TLTSISPSPSP---------------SSTIGSSQGSTSPVVSTISQGST---- 1050
Query: 319 ETPLYTQTASTSYYEAAGTATQVSTPATSQAVASSGS 355
ETP T + T +G+A+ ST ASS S
Sbjct: 1051 ETPGSTGSTVTKPSTVSGSASSGSTATMGSTEASSTS 1087
Score = 36.0 bits (81), Expect = 2.0
Identities = 62/304 (20%), Positives = 101/304 (32%), Gaps = 35/304 (11%)
Query: 82 PAPSQPTGAPTPSPAPQQYI--VVTVSEGAMRA-SETVSEASPGST----ASXXXXXXXX 134
P PS +G+ TP+P P Q VV+ + G M + T + ++ GST ++
Sbjct: 1265 PNPST-SGSSTPTPNPSQSTSPVVSTTTGEMTSHGSTQTPSTIGSTVTQPSTVSGSNSSG 1323
Query: 135 XXXXXXXXXXXXXXXSVQAKPGHVSPLQLTNIQVPQ---------------QALPTQRLV 179
S + P +SP+ T+ +P ++ T L
Sbjct: 1324 STVTIGSSEASTSGSSFKTSPSSISPVP-TSSPIPSTTFASSTSGSTISDVSSVSTTSLA 1382
Query: 180 VQSAAPGSKGGQVSLTVHGTQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQVHGVQQ 239
S++ S + + T + S SPV + +SS+ T PTG+ +
Sbjct: 1383 PLSSSLPSTVPSSTQSFSSTSEGSSKASSSPVPSQTSSTPT--NPTGSTESSTLLSS--- 1437
Query: 240 SVPVTQERSVVQATPQAPKPGPVQPLTVQGLQPXXXXXXXXXXXXXXXXXXYSSQVQYVE 299
T S T G P T Q SS +
Sbjct: 1438 ----TISGSTQHTTMSKASSGSTSPST--NSQTGSTVTMGSSSTSGVSTSSASSTQPQMS 1491
Query: 300 GGDASYTASAIRSSTYSYPETPLYTQTASTSYYEAAGTATQVSTPATSQAVASSGSMPMY 359
S S + SST S + + T ++GT + +++ A ASS +
Sbjct: 1492 TSQGSSAGSTVASSTASPAASSTAPSSTGTMSSTSSGTVGSTISESSTTASASSQTGSTV 1551
Query: 360 VSGS 363
GS
Sbjct: 1552 TMGS 1555
Score = 35.2 bits (79), Expect = 3.5
Identities = 54/266 (20%), Positives = 80/266 (29%), Gaps = 21/266 (7%)
Query: 115 TVSEASPGSTASXXXXXXXXXXXXXXXXXXXXXXXSVQAKPGHVSPLQLTNIQVPQQALP 174
T+SE+S ++AS S Q + +S Q ++ A
Sbjct: 1616 TISESSTTASASSQTGSTVTMGSSSTSGVSTSSASSTQPQ---MSTSQGSSAG-STVASS 1671
Query: 175 TQRLVVQSAAPGSKG-------GQVSLTVHGTQQVHSPPEQSPVQANSSSSKTAGAPTGT 227
T LV S P S G G V T+ + S Q+ SS T+G T +
Sbjct: 1672 TTGLVSTSTVPSSTGTMGSTSSGTVGSTISESSTAASASSQTGSTVTMGSSSTSGVSTSS 1731
Query: 228 V----PQQLQVHGVQQSVPVTQERSVVQATPQAPKPGPVQPLTVQGL------QPXXXXX 277
PQ G V + A+ AP T G Q
Sbjct: 1732 ASSGQPQMSTSQGSSAGSTVVSSTASPAASSTAPSSTGTMSSTSSGTVGSTMSQSSTAAS 1791
Query: 278 XXXXXXXXXXXXXYSSQVQYVEGGDASYTASAIRSSTYSYPETPLYTQTASTSYYEAAGT 337
S+ + S S + SST T + T ++GT
Sbjct: 1792 TTSHTGSTVTLGSSSTSSNQMSTSQGSSVGSTVASSTAGLVSTSTVPSSTGTMGSTSSGT 1851
Query: 338 ATQVSTPATSQAVASSGSMPMYVSGS 363
+ +++ A ASS + GS
Sbjct: 1852 VGSTISESSTTASASSQTGSTVTMGS 1877
Score = 34.8 bits (78), Expect = 4.6
Identities = 60/292 (20%), Positives = 92/292 (30%), Gaps = 25/292 (8%)
Query: 82 PAPSQPTGAPTPSPAPQQYIVVTVSEGAMRASETVSEASPGSTASXXXXXXXXXXXXXXX 141
P S G+ TPS + ++ + G+ ++ TV+ S + S
Sbjct: 389 PGSSSTYGSSTPSASSSSSGTMSTNSGSTGSTVTVAPVSSSTFGSSTPIASSSSSGSTVT 448
Query: 142 XXXXXXXXSVQAKPGHVSPLQLTNIQVPQQALPTQRLV----------VQSAAPGSKGGQ 191
+ P S T + T +V QSA+P S G
Sbjct: 449 VVSGSSSTYGSSTPSASSSSAGTASTISGSTGSTATIVPGSSSSVGSSTQSASPSSPGTM 508
Query: 192 VSLTVHGTQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVV- 250
+++ V P S A SSS + +P T + + G + T S V
Sbjct: 509 STVSGPTGSTVTVVPGSSTSPAPSSSPNPSSSPAST-GSTITISGSSSIIVSTVSGSTVS 567
Query: 251 ------QATPQAPKPGPVQPLTV-QGLQPXXXXXXXXXXXXXXXXXXYSSQVQYVEGGDA 303
Q+T + P TV P SSQ +
Sbjct: 568 GSTGTSQSTLASSTATPGSSSTVPSSSSPQPSSQSPAPNTGSTTPSQTSSQSPSPSMNPS 627
Query: 304 SYTASAIRSSTYSYPETPLYTQTASTSYYEAAGTATQVSTPATSQAVASSGS 355
S T + ST + + + T ST G+ V+T TSQ+ SGS
Sbjct: 628 SSTPTGSSQSTITPEGSTASSPTGST------GSTFSVATEVTSQSTVPSGS 673
>ref|NP_012284.1| cell surface flocculin with structure similar to
serine/threonine-rich GPI-anchored cell wall proteins;
Muc1p [Saccharomyces cerevisiae]
sp|P08640|AMYH_YEAST GLUCOAMYLASE S1/S2 PRECURSOR (GLUCAN 1,4-ALPHA-GLUCOSIDASE)
(1,4-ALPHA-D-GLUCAN GLUCOHYDROLASE)
pir||S48478 glucan 1,4-alpha-glucosidase (EC 3.2.1.3) - yeast (Saccharomyces
cerevisiae)
emb|CAA86176.1| (Z38061) mal5, sta1, len: 1367, CAI: 0.3, AMYH_YEAST P08640
GLUCOAMYLASE S1 (EC 3.2.1.3) [Saccharomyces cerevisiae]
gb|AAC49609.1| (U30626) glucoamylase [Saccharomyces cerevisiae var. diastaticus]
Length = 1367
Score = 44.5 bits (103), Expect = 0.005
Identities = 61/291 (20%), Positives = 94/291 (31%), Gaps = 36/291 (12%)
Query: 81 VPAPSQPTGAPTPSPAPQQYIVVTVSEGA-MRASETVSEASPGSTASXXXXXXXXXXXXX 139
VP PS T + +PAP T S A + +S T S ++P T S
Sbjct: 512 VPTPSSSTTESSSAPAPTPSSSTTESSSAPVTSSTTESSSAPVPTPSSSTTESSSTPVTS 571
Query: 140 XXXXXXXXXXSVQAKPGHVSPLQLTNIQVPQQALPTQRLVVQSAAPGSKGGQVSLTVHGT 199
+ S + ++ VP P+ S+AP
Sbjct: 572 STTESSSAPVPTPSS----STTESSSAPVPT---PSSSTTESSSAPAPTPSS-------- 616
Query: 200 QQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVVQATPQAPKP 259
S E S SS+++++ AP VP S PV S + AP P
Sbjct: 617 ----STTESSSAPVTSSTTESSSAP---VPTPSSSTTESSSAPVPTPSSSTTESSSAPVP 669
Query: 260 GPVQPLTVQGLQPXXXXXXXXXXXXXXXXXXYSSQVQYVEGGDASYTASAI-----RSST 314
P T P SS ++ +S+ SST
Sbjct: 670 TPSSSTTESSSAPVTSSTTESSSAPVTSSTTESSSAPVPTPSSSTTESSSAPVPTPSSST 729
Query: 315 YSYPETPLYTQTASTSYYEAA--------GTATQVSTPATSQAVASSGSMP 357
P+ T ++ST+ +A ++ V TP++S +SS +P
Sbjct: 730 TESSSAPVPTPSSSTTESSSAPVTSSTTESSSAPVPTPSSSTTESSSAPVP 780
Score = 43.8 bits (101), Expect = 0.009
Identities = 64/294 (21%), Positives = 91/294 (30%), Gaps = 27/294 (9%)
Query: 81 VPAPSQPTGAPTPSPAPQQYIVVTVSEGAMRASETVSEAS---PGSTASXXXXXXXXXXX 137
VP PS T + +P P T S A S T +S P ++S
Sbjct: 314 VPTPSSSTTESSSAPVPTPSSSTTESSSAPVTSSTTESSSAPVPTPSSSTTESSSAPVTS 373
Query: 138 XXXXXXXXXXXXSVQAKPGHVSPLQLTNIQVPQQALPTQRLVVQSAAPGSKGGQVSLTVH 197
S P ++ A T S+AP + S +
Sbjct: 374 STTESSSAPVTSSTTESSSAPVPTPSSSTTESSSAPVTSSTTESSSAPVTSSTTESSSAP 433
Query: 198 GTQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVVQATPQAP 257
T S E S SS+++++ AP VP S PVT S + AP
Sbjct: 434 VTS---STTESSSAPVTSSTTESSSAP---VPTPSSSTTESSSAPVT---SSTTESSSAP 484
Query: 258 KPGPVQPLTVQGLQPXXXXXX----------XXXXXXXXXXXXYSSQVQYVEGGDASYTA 307
P P T P + E A T+
Sbjct: 485 VPTPSSSTTESSSAPVTSSTTESSSAPVPTPSSSTTESSSAPAPTPSSSTTESSSAPVTS 544
Query: 308 SAIRSSTYSYPETPLYTQTASTSYYEAAGT----ATQVSTPATSQAVASSGSMP 357
S SS+ P TP + T S+S + T + V TP++S +SS +P
Sbjct: 545 STTESSSAPVP-TPSSSTTESSSTPVTSSTTESSSAPVPTPSSSTTESSSAPVP 597
Score = 40.6 bits (93), Expect = 0.079
Identities = 60/302 (19%), Positives = 95/302 (30%), Gaps = 33/302 (10%)
Query: 81 VPAPSQPTGAPTPSPAPQQYIVVTVSEGAMRASETVSEASPGSTASXXXXXXXXXXXXXX 140
VP PS T + +P P T S A S T +S T+S
Sbjct: 653 VPTPSSSTTESSSAPVPTPSSSTTESSSAPVTSSTTESSSAPVTSS---TTESSSAPVPT 709
Query: 141 XXXXXXXXXSVQAKPGHVSPLQLTNIQVPQQALPT---QRLVVQSAAPGSKGGQVSLTVH 197
S S + ++ VP + T V S+ S V
Sbjct: 710 PSSSTTESSSAPVPTPSSSTTESSSAPVPTPSSSTTESSSAPVTSSTTESSSAPVPTPSS 769
Query: 198 GTQQVHSPPEQSPVQANSSSSKTAGAPTGT------------VPQQLQVHGVQQSVPVTQ 245
T + S P +P +SS+++++ AP T VP + S P +
Sbjct: 770 STTESSSAPVPTP---SSSTTESSSAPVPTPSSSTTESSVAPVPTPSSSSNITSSAPSST 826
Query: 246 ERSVVQATPQAPKPGPVQPLTVQGLQPXXXXXXXXX-------XXXXXXXXXYSSQVQYV 298
S + P P P T P S + +
Sbjct: 827 PFSSSTESSSVPVPTPSSSTTESSSAPVSSSTTESSVAPVPTPSSSSNITSSAPSSIPFS 886
Query: 299 EGGDASYTASAIRSSTYSYPETPLYTQTAST--SYYEAAGTATQVSTPAT---SQAVASS 353
++ T + + S+ YP + T +ST + T T V+TP+T + V S+
Sbjct: 887 STTESFSTGTTVTPSSSKYPGSQTETSVSSTTETTIVPTKTTTSVTTPSTTTITTTVCST 946
Query: 354 GS 355
G+
Sbjct: 947 GT 948
Score = 39.9 bits (91), Expect = 0.14
Identities = 60/285 (21%), Positives = 88/285 (30%), Gaps = 23/285 (8%)
Query: 76 TELPAVPAPSQPTGA-----PTPSPAPQQYIVVTVSEGAMRASETVSEASPGSTASXXXX 130
TE + P S T + PTPS + + S + +S T S ++P T S
Sbjct: 439 TESSSAPVTSSTTESSSAPVPTPSSSTTE-----SSSAPVTSSTTESSSAPVPTPSSSTT 493
Query: 131 XXXXXXXXXXXXXXXXXXXSVQAKPGHVSPLQLTNIQVPQQALPTQRLVVQSAAPGSKGG 190
+ P T P+ S+AP +
Sbjct: 494 ESSSAPVTSSTTES-------SSAPVPTPSSSTTESSSAPAPTPSSSTTESSSAPVTSST 546
Query: 191 QVSLTVHGTQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVV 250
S + S E S SS+++++ AP VP S PV S
Sbjct: 547 TESSSAPVPTPSSSTTESSSTPVTSSTTESSSAP---VPTPSSSTTESSSAPVPTPSSST 603
Query: 251 QATPQAPKPGPVQPLTVQGLQPXXXXXXXXXXXXXXXXXXYSSQVQYVEGGDASYTASAI 310
+ AP P P T P SS + + ++S
Sbjct: 604 TESSSAPAPTPSSSTTESSSAPVTSSTTESSSAPVPTPS--SSTTESSSAPVPTPSSSTT 661
Query: 311 RSSTYSYPETPLYTQTASTSYYEAAGTATQVSTPATSQAVASSGS 355
SS+ P TP + T S+S + T S P TS SS +
Sbjct: 662 ESSSAPVP-TPSSSTTESSSAPVTSSTTESSSAPVTSSTTESSSA 705
>ref|NP_000440.1| regulatory factor X, 5 [Homo sapiens]
sp|P48382|RFX5_HUMAN DNA BINDING PROTEIN RFX5
pir||I38155 DNA-binding regulatory factor X5 - human
emb|CAA59771.1| (X85786) binding regulatory factor [Homo sapiens]
Length = 616
Score = 43.8 bits (101), Expect = 0.009
Identities = 22/74 (29%), Positives = 37/74 (49%), Gaps = 1/74 (1%)
Query: 440 QWLLDNYETAEGVSLPRSTLYCHYLLHCQEQKL-EPVNAASFGKLIRSVFMXXXXXXXXX 498
+W+ ++ E LP+ ++Y Y +C+ P++ A+FGK+IR +F
Sbjct: 94 RWIRNHLEEHTDTCLPKQSVYDAYRKYCESLACCRPLSTANFGKIIREIFPDIKARRLGG 153
Query: 499 XXNSKYHYYGLRIK 512
SKY Y G+R K
Sbjct: 154 RGQSKYCYSGIRRK 167
>ref|NP_059091.1| regulatory factor (trans-acting) 5 [Mus musculus]
gb|AAF68260.1| (AF209854) regulator factor X 5 [Mus musculus]
Length = 658
Score = 43.4 bits (100), Expect = 0.012
Identities = 22/74 (29%), Positives = 37/74 (49%), Gaps = 1/74 (1%)
Query: 440 QWLLDNYETAEGVSLPRSTLYCHYLLHCQEQKL-EPVNAASFGKLIRSVFMXXXXXXXXX 498
+W+ ++ E LP+ ++Y Y +C+ P++ A+FGK+IR +F
Sbjct: 93 RWIRNHLEEHMDTCLPKQSVYDAYRKYCESLACCRPLSTANFGKIIREIFPDIKARRLGG 152
Query: 499 XXNSKYHYYGLRIK 512
SKY Y G+R K
Sbjct: 153 RGQSKYCYSGIRRK 166
>pir||F75518 hypothetical protein - Deinococcus radiodurans (strain R1)
gb|AAF10038.1|AE001904_14 (AE001904) hypothetical protein [Deinococcus radiodurans]
Length = 839
Score = 43.4 bits (100), Expect = 0.012
Identities = 69/325 (21%), Positives = 107/325 (32%), Gaps = 46/325 (14%)
Query: 76 TELPAVPAPSQPT---GAPTPSPAPQQYIV--------VTVSEGAMRASETV-------- 116
T+ PA PAP+ GAP+P+PAP Q TV E + A+ +
Sbjct: 250 TQTPATPAPAAQRPAGGAPSPAPAPAQANAPAGSVVPEATVPESSTPAAPSAQTPPTPTR 309
Query: 117 ----SEASPGSTASXXXXXXXXXXXXXXXXXXXXXXXSVQAKPGHVSPLQLTNIQVPQQA 172
+EASP + S A P V+P T
Sbjct: 310 ETAQTEASPAAPNSSAAAPNEPASEPVAGRPGTAASSPESASPVTVTPRGETPDTAASAG 369
Query: 173 LPTQRLVVQSAAPGSKGGQVSLTVHGTQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQL 232
P+ V + AP + G + G P +P+ A + + +++G GT +
Sbjct: 370 TPSAGRVTPAPAPSASEGASAARTPGAGSQTPPIPATPIPA-TPAGRSSGESAGTAAARP 428
Query: 233 QVHGVQQSVPVTQERSVVQATPQ---AP---KPGPVQPLTVQGLQPXXXXXXXXXXXXXX 286
PV+++RS V P+ AP P P P
Sbjct: 429 NA----APAPVSEDRSDVSGLPRREDAPAESSPVAASPARGASSAPSSAPAAAVPSRAPV 484
Query: 287 XXXXYSS-----QVQYVEGGDASYTASAIR----SSTYSYPETPLYTQTASTSYYEAAGT 337
S+ E G+ + SA +S+ + P P + S + A G
Sbjct: 485 SGGSVSAPRTAPTAPVAEQGEVPVSPSAAAPRGGASSAAAPSAPAAARGGSGA---AGGA 541
Query: 338 ATQVSTPATSQAVASSGSMPMYVSG 362
A S PA ++ + G+ SG
Sbjct: 542 AGGASAPAAARPAQTPGASAGGASG 566
Score = 39.5 bits (90), Expect = 0.18
Identities = 55/261 (21%), Positives = 81/261 (30%), Gaps = 30/261 (11%)
Query: 79 PAVPAPSQPTGAPTPSPAPQQYIVVTVSEGAMRASETVSEASPGSTASXXXXXXXXXXXX 138
P VPAP+ T P PAP + T A R + +P
Sbjct: 193 PPVPAPTSQTPTPPVQPAPTR----TPPPQAARPTPNAPAQTPAPATQAPAAQTPTAQAP 248
Query: 139 XXXXXXXXXXXSVQAKPGHVSPLQL-TNIQVPQQALPTQRLVVQSAAPGSKGGQVSLTVH 197
+ + G SP P ++ + V +S+ P + Q
Sbjct: 249 ATQTPATPAPAAQRPAGGAPSPAPAPAQANAPAGSVVPEATVPESSTPAAPSAQ------ 302
Query: 198 GTQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVVQATPQAP 257
+PP + A + +S A + P + S PV ++P++
Sbjct: 303 ------TPPTPTRETAQTEASPAAPNSSAAAPNE------PASEPVAGRPGTAASSPESA 350
Query: 258 KPGPVQPLTVQGLQPXXXXXXXXXXXXXXXXXXYSSQVQYVEGGDASYTASAIRSSTYSY 317
P V P +G P S EG A+ T A S T
Sbjct: 351 SPVTVTP---RGETPDTAASAGTPSAGRVTPAPAPSA---SEGASAARTPGA-GSQTPPI 403
Query: 318 PETPLYTQTASTSYYEAAGTA 338
P TP+ A S E+AGTA
Sbjct: 404 PATPIPATPAGRSSGESAGTA 424
>gb|AAB71465.1| (AC000098) EST gb|ATTS1136 comes from this gene. [Arabidopsis
thaliana]
Length = 402
Score = 43.4 bits (100), Expect = 0.012
Identities = 60/305 (19%), Positives = 95/305 (30%), Gaps = 39/305 (12%)
Query: 79 PAVPAPSQPTGAPTPSPAPQQYIVVTVSEGAMRASETVSEASPGSTA------------- 125
P + S P + P+ V S + ++ T +E+S G+TA
Sbjct: 23 PNTYSNSNPNADASSMPSTNTTTVPQTSSSSTSSTTTATESSSGTTAESSSSTKSATMSG 82
Query: 126 SXXXXXXXXXXXXXXXXXXXXXXXSVQAKPGHVSPLQLTNIQVPQQALPTQRLVVQSAA- 184
S S + + + I A PT +++
Sbjct: 83 STTHTTSSATASSTASTSTSSYSTSYSTSSTKTTTMTGSTISTTASAAPTSTASTSTSSY 142
Query: 185 ------PGSKGGQVSLTVHGTQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQVHGVQ 238
+K V+ + GT +P S ANSS+S T +G+ P +
Sbjct: 143 STSYSTSSTKTTTVTGSTIGTTASAAPTSTSTSTANSSASSTTNPSSGSKPTAM------ 196
Query: 239 QSVPVTQERSVVQATPQAPKPGPVQPLTVQGLQPXXXXXXXXXXXXXXXXXXYSSQVQYV 298
S ++P T G +P S
Sbjct: 197 TGTTANTSPSAPTSSPSTTNSSSTAAYTSSGSKPTTVTRTTANTSSSASTSSASP----- 251
Query: 299 EGGDASYTASAIRSSTYSYPE--TPLYTQTASTSYYEAAGTATQVSTPATSQAVASSGSM 356
S T++ SS S P T T T+ST+ +A T S+ AT+ +SSGS
Sbjct: 252 ---TNSSTSTPTNSSAGSKPTTMTGTTTNTSSTTTTSSASTTKSSSSSATN---SSSGSK 305
Query: 357 PMYVS 361
P +S
Sbjct: 306 PSTLS 310
>gb|AAB91441.1| (U80743) CAGH32 [Homo sapiens]
Length = 556
Score = 43.0 bits (99), Expect = 0.016
Identities = 33/108 (30%), Positives = 48/108 (43%), Gaps = 10/108 (9%)
Query: 155 PGHVSPLQLTNIQVPQQALPTQRLVVQSAAPGSKGGQVSLTVHGTQQVHSPPEQSPVQAN 214
P H+ +Q +Q+P Q P Q +AP QV + Q P +QSP
Sbjct: 353 PEHLIKMQKQKLQMPPQPPPPQA----QSAPPQPAAQVQV-----QTSQPPQQQSPQLTT 403
Query: 215 SSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVVQATPQAPKPGPV 262
++ + TGT LQV + + VP +Q +S Q QAP+P V
Sbjct: 404 VTAPRPGALLTGTTVANLQVARLTR-VPTSQLQSQGQMQTQAPQPAQV 450
>gb|AAF48345.1| (AE003495) CG11584 gene product [Drosophila melanogaster]
Length = 1015
Score = 43.0 bits (99), Expect = 0.016
Identities = 65/275 (23%), Positives = 88/275 (31%), Gaps = 51/275 (18%)
Query: 87 PTGAPTPSPAPQQYIVVTVSEGAMRASETVSEASPGSTASXXXXXXXXXXXXXXXXXXXX 146
P AP P+PAP V V E +T S +P
Sbjct: 203 PAAAPVPAPAPIAIPVPAVQEQHQVIQQTYSVPAPAPAVQQSYSAPAPAPVVQ------- 255
Query: 147 XXXSVQAKPGHVSPLQLTNIQVPQQALPTQRLVVQSAAPGSKGGQVSLTVHGTQQVHSPP 206
+ P V Q P QA+ Q+L + AP ++ +Q +S P
Sbjct: 256 ---QTYSAPAPVVQEQTYTAAAPVQAV--QQLTYSAPAPVTQ-----------EQYYSAP 299
Query: 207 EQSPVQANSSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVVQATPQAPKPGPVQPLT 266
S VQ SS+ A AP +QQ+ VVQ T AP P P Q +
Sbjct: 300 A-SVVQQTSSAPAPAPAPVQEQFYSAPAPAIQQTYSAPAPAPVVQQTYSAPAPAPQQTYS 358
Query: 267 VQGLQPXXXXXXXXXXXXXXXXXXYSSQVQYVEGGDASYTASA---IRSSTYSYPETPLY 323
Q Q V+ SY+A A + TYSYP +
Sbjct: 359 AP-------------------APAVQEQTQVVQ----SYSAPAPAPVAQQTYSYPAPVVQ 395
Query: 324 TQTASTSYYEAAGTATQ-VSTPATSQAVASSGSMP 357
+ + A Q S PA + V + S P
Sbjct: 396 QAPVVQAVAQQAPVVQQSYSAPAPAPVVQQTYSAP 430
Score = 34.8 bits (78), Expect = 4.6
Identities = 49/205 (23%), Positives = 73/205 (34%), Gaps = 25/205 (12%)
Query: 82 PAPSQPTGAPTPSPAPQQYIVVTVSEGAMR--ASETVSEASPGSTASXXXXXXXXXXXXX 139
PAP Q AP P+ Q +V + S A A +T S +P +
Sbjct: 351 PAPQQTYSAPAPAVQEQTQVVQSYSAPAPAPVAQQTYSYPAPVVQQAPVVQAVAQQAPVV 410
Query: 140 XXXXXXXXXXSVQAKPGHVSPLQLTNIQVPQQALPTQRLVVQSAAPGSKGGQVSLTVHGT 199
V + + +P + + Q + Q VVQ + V
Sbjct: 411 QQSYSAPAPAPV-VQQTYSAPAPVVQETIQQAPVIQQAPVVQQSYSAPAPAPV------V 463
Query: 200 QQVHSPP--------EQSPV-QANSSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQER--- 247
QQ +S P +Q+PV Q ++ AP V + +Q V Q PV Q+
Sbjct: 464 QQSYSAPAPVVQETIQQAPVIQQAPVVQQSYSAPAPVVQETIQQAPVIQQAPVVQQSYSA 523
Query: 248 ----SVVQATPQAPKPGPVQPLTVQ 268
VVQ + AP P PV ++Q
Sbjct: 524 PAPAPVVQQSYSAPAPAPVVQESIQ 548
>ref|NP_011528.1| putative integral membrane protein; Msb2p [Saccharomyces
cerevisiae]
sp|P32334|MSB2_YEAST MSB2 PROTEIN (MULTICOPY SUPPRESSION OF A BUDDING DEFECT 2)
pir||S25370 MSB2 protein - yeast (Saccharomyces cerevisiae)
gb|AAA34798.1| (M77354) multicopy suppressor of a budding defect [Saccharomyces
cerevisiae]
emb|CAA96997.1| (Z72799) ORF YGR014w [Saccharomyces cerevisiae]
Length = 1306
Score = 42.6 bits (98), Expect = 0.021
Identities = 62/300 (20%), Positives = 98/300 (32%), Gaps = 25/300 (8%)
Query: 88 TGAPTPSPAPQQYIVVTVSEGAMRASETVSEASPGSTASXXXXXXXXXXXXXXXXXXXXX 147
T AP+ + Y + +M + + ST S
Sbjct: 503 TSAPSVVSSSFSYTSLQAGGSSMTNPSSSTIVYSSSTGSSEESAASTASATLSGSSSTYM 562
Query: 148 XXSVQAKPGHVSPLQLTNIQVPQQALPTQRLVVQSAAPGSKGGQ--------VSLTVHGT 199
++Q++P S L L+ Q + V + +P + G +S T T
Sbjct: 563 AGNLQSQPPSTSSL-LSESQATSTSAVLASSSVSTTSPYTTAGGASTEASSLISSTSAET 621
Query: 200 QQVHSPPEQSPVQANS--SSSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVVQATPQAP 257
QV + +Q +S SSS T G+ T + VQ ++ E S Q T Q
Sbjct: 622 SQVSYSQSTTALQTSSFASSSTTEGSETSSQGFSTSSVLVQMPSSISSEFSPSQTTTQMN 681
Query: 258 KPGPVQPLTVQGL------------QPXXXXXXXXXXXXXXXXXXYSSQVQYVEGGDASY 305
T+ SS V V SY
Sbjct: 682 SASSSSQYTISSTGILSQVSDTSVSYTTSSSSVSQVSDTPVSYTTSSSSVSQVSDTPVSY 741
Query: 306 TASAIRSSTYSYPETPLYTQTASTSYYEAAGTATQVSTPATSQAVASSGSMPMYVSGSQV 365
T S+ SS +TP+ T+S+S + + T +T ++S + S S+P S S V
Sbjct: 742 TTSS--SSVSQVSDTPVSYTTSSSSVSQVSDTPVSYTTSSSSVSQVSDTSVPSTSSRSSV 799
>dbj|BAB13413.1| (AB046807) KIAA1587 protein [Homo sapiens]
Length = 991
Score = 42.6 bits (98), Expect = 0.021
Identities = 57/274 (20%), Positives = 87/274 (30%), Gaps = 16/274 (5%)
Query: 91 PTPSPAPQQYIVVTVSEGAMRASETVSEASPGSTASXXXXXXXXXXXXXXXXXXXXXXXS 150
PT + P ++ T+SE + + + PG++ +
Sbjct: 104 PTSAEGPSTFVPPTISEASSASGQPTISEGPGTSVLPTPSEGLSTSGPPTISKGLCTSVT 163
Query: 151 VQAKPGHVSPLQLTNIQVPQQAL-PTQRLVVQSAAPGSKGGQVSLTVHGTQQVHSPPEQS 209
+ A G + T+ + P ++ PT V ++ P + G S +V T + P S
Sbjct: 164 LAASEGRNTSRPPTSSEEPSTSVPPTASEVPSTSLPPTPGEGTSTSVPPT--AYEGPSTS 221
Query: 210 PVQANSSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSV---VQATPQAPKPGPVQPLT 266
V T+ PT G SVP+ + VQATP V P
Sbjct: 222 VVPTPDEGPSTSVLPT-------PGEGPGTSVPLAATEGLSTSVQATPDEGPSTSVPPTA 274
Query: 267 VQGLQ---PXXXXXXXXXXXXXXXXXXYSSQVQYVEGGDASYTASAIRSSTYSYPETPLY 323
+GL P S+ V S + R S P
Sbjct: 275 TEGLSTPVPPTRDEGPSTSVPATPGEGPSTSVLPAASDGQSISLVPTRGKGSSTSVPPTA 334
Query: 324 TQTASTSYYEAAGTATQVSTPATSQAVASSGSMP 357
T+ STS AG + S P T S+ P
Sbjct: 335 TEGLSTSVQPTAGEGSSTSVPPTPGGGLSTSVPP 368
Score = 41.0 bits (94), Expect = 0.061
Identities = 47/203 (23%), Positives = 67/203 (32%), Gaps = 8/203 (3%)
Query: 76 TELPAVPAPSQPTGAP-TPSPAPQQYIVVTVSEGAMRASETVSEASPGSTASXXXXXXXX 134
T LP P T P T P +V T EG + PG++
Sbjct: 196 TSLPPTPGEGTSTSVPPTAYEGPSTSVVPTPDEGPSTSVLPTPGEGPGTSVPLAATEGLS 255
Query: 135 XXXXXXXXXXXXXXXSVQAKPGHVSPLQLTNIQVPQQALPTQ--RLVVQSAAPGSKGGQ- 191
A G +P+ T + P ++P S P + GQ
Sbjct: 256 TSVQATPDEGPSTSVPPTATEGLSTPVPPTRDEGPSTSVPATPGEGPSTSVLPAASDGQS 315
Query: 192 VSLT-VHGTQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVV 250
+SL G S P + ++S TAG + T G+ SVP T +
Sbjct: 316 ISLVPTRGKGSSTSVPPTATEGLSTSVQPTAGEGSSTSVPPTPGGGLSTSVPPTATEELS 375
Query: 251 QATPQAPKPGP---VQPLTVQGL 270
+ P P GP V P+ +GL
Sbjct: 376 TSVPPTPGEGPSTSVLPIPGEGL 398
>pir||T33369 hypothetical protein H02F09.3 - Caenorhabditis elegans
gb|AAC64622.1| (AF077538) unknown [Caenorhabditis elegans]
Length = 1275
Score = 42.6 bits (98), Expect = 0.021
Identities = 70/316 (22%), Positives = 109/316 (34%), Gaps = 41/316 (12%)
Query: 80 AVPAPSQPTGAPTPSPAPQQYIVVTVSEGAMRASETVSEASPGSTASXXXXXXXXXXXXX 139
AV PS AP+ VVTV + TV +SP +
Sbjct: 328 AVTKPSTVVTAPST--------VVTVPSTVVTKPNTVVTSSPTVATTPTTVVTTPSTVVT 379
Query: 140 XXXXXXXXXXSVQAKPGHVSPLQLTNIQVPQQALPTQRLVVQS-----AAPGSKGGQVSL 194
+V P V T + VP + ++ V+ + ++P + G ++
Sbjct: 380 VPSTVVTVPTTVVTNPSTVVTAPSTVVTVPTTVMTSRSTVITTPTTGGSSPSTAGTSLAS 439
Query: 195 TVHGTQ-QVHSPPEQSPVQANS----------SSSKTAGAPTGTVPQQLQ--VHGVQQSV 241
T T+ + S P Q+ S SS TAGA + Q + + S
Sbjct: 440 TAVTTETSIGSSSTPLPSQSTSLSMSSLSTYTPSSSTAGATSPATQQSTKPTIGTSMSSG 499
Query: 242 PVT------QERSVVQA-TPQAPK---PGPVQPLTVQGLQPXXXXXXXXXXXXXXXXXXY 291
P T E +V+Q+ TP P T G P
Sbjct: 500 PTTVAPGASTESTVLQSSTPSGTTVTLPSGSSTATA-GTSPQASTVTTVTDISTVSGSTV 558
Query: 292 SSQVQYVEGGDASYTASAIRSSTYS----YPETPLYTQTASTSYYEAAGTATQVSTPATS 347
+SQ S T++ ST S P T + +AS+ Y +G+ ++P T+
Sbjct: 559 TSQTAESSLSTESPTSAGSSISTVSTVSSQPSTYIPVSSASSIYSTLSGSTGSTASPGTT 618
Query: 348 QAVASSGSMPMYVSGS 363
++ SS S P +SGS
Sbjct: 619 ESSGSSTSGPSTISGS 634
>pir||T45462 membrane glycoprotein [imported] - equine herpesvirus 1
dbj|BAA20037.1| (D88733) membrane glycoprotein [Equine herpesvirus 1]
Length = 866
Score = 41.4 bits (95), Expect = 0.046
Identities = 38/175 (21%), Positives = 60/175 (33%), Gaps = 12/175 (6%)
Query: 181 QSAAPGSKGGQVSLTVHGTQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQVHGVQQS 240
QS + G+ S T SPP + + SS+S + + T S
Sbjct: 40 QSTSSGTTNSSSSPTTSPPTTSSSPPTSTHTSSPSSTSTQSSSTAATSSSAPSTASSTTS 99
Query: 241 VPVTQERSVVQATPQAPKPGPVQPLTVQGLQPXXXXXXXXXXXXXXXXXXYSSQVQYVEG 300
+P + TP A P T P S++
Sbjct: 100 IPTSTSTETTTTTPTASTTTP----TTTTAAPTTAATTTAVTTAASTAASTSAET----- 150
Query: 301 GDASYTASAIRSSTYSYPETPLYTQTASTSYYEAAGTATQVSTPATSQAVASSGS 355
+ TA+A + T + P T T TA+T+ A T T +T AT+ A ++ +
Sbjct: 151 --TTATATATSTPTTTTP-TSTTTTTATTTVPTTASTTTDTTTAATTTAATTTAA 202
Score = 34.8 bits (78), Expect = 4.6
Identities = 44/256 (17%), Positives = 70/256 (27%), Gaps = 13/256 (5%)
Query: 104 TVSEGAMRASETVSEASPGSTASXXXXXXXXXXXXXXXXXXXXXXXSVQAKPGHVSPLQL 163
T + A+ T + + +T S + +
Sbjct: 223 TTTAATTTAATTTAATTTAATTSSATTAATTTAATTTAATTTAATTTAATTTAATTTAAT 282
Query: 164 TNIQVPQQALPTQRLVVQSAAPGSKGGQVSLTVHGTQQVHSPPEQSPVQANSSSSKTAGA 223
T A T + + + T T + + S S+ T GA
Sbjct: 283 TTAATTTAATTTAATTTAATTTAATTTAATTTAATTTAATTTAATTTGSPTSGSTSTTGA 342
Query: 224 PTGTVPQQLQVHGVQQSVPVTQERSVVQATPQAPKPGPVQPLTVQGLQ-PXXXXXXXXXX 282
T T S T + AT P P + P
Sbjct: 343 STSTPSASTAT-----SATPTSTSTSAAATTSTPTPTSAATSAESTTEAPTSTPTTDTTT 397
Query: 283 XXXXXXXXYSSQVQYVEGGDASYTASAIRSSTYSYPETPLYTQTASTSYYEAAGTAT--- 339
S + V S T +A + +++ P++ T STS E + T T
Sbjct: 398 PSEATTATTSPESTTVSASTTSATTTAFTTESHTSPDS----STGSTSTAEPSSTFTLTP 453
Query: 340 QVSTPATSQAVASSGS 355
+TP+T Q SS S
Sbjct: 454 STATPSTDQFTGSSAS 469
>pir||T33247 hypothetical protein H05O09.1 - Caenorhabditis elegans (fragment)
gb|AAC19224.1| (AF067951) contains similarity to the immunoglobin superfamily (Pfam:
ig.hmm, score: 16.18, 22.17, 15.93) [Caenorhabditis
elegans]
Length = 2109
Score = 41.0 bits (94), Expect = 0.061
Identities = 44/195 (22%), Positives = 67/195 (33%), Gaps = 36/195 (18%)
Query: 77 ELPAVPAPSQPTGAPTPSPAP-------QQYIVVTVSEGAMRASETVSEASPGSTASXXX 129
E+P V PS+PT A P A QQ + + ++ EA+P +
Sbjct: 1565 EVPKVAEPSEPTQADVPKIAAPLEQSQIQQEVPTVAAPSEPTQADVPKEAAPSEPSQADV 1624
Query: 130 XXXXXXXXXXXXXXXXXXXXSVQAKPGHVSPLQLTNIQVPQQALPTQRLVVQSAAPGSKG 189
Q P +PL+ T VP+ A P ++ +Q P
Sbjct: 1625 PKVAAPLEQTQIQ---------QEVPMVAAPLEPTQADVPKVAAPLEQSQIQQEVP---- 1671
Query: 190 GQVSLTVHGTQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQVHGVQQSVPVT----- 244
T S P Q+ V ++ S+ + A V L+ +QQ VP+
Sbjct: 1672 ---------TVAAPSEPTQADVPKEAAPSEPSQADVPKVAAPLEQTQIQQEVPMVAAPLE 1722
Query: 245 --QERSVVQATPQAP 257
QE +A P P
Sbjct: 1723 PIQEEVPKEAAPSEP 1737
>ref|XP_000707.1| hypothetical protein XP_000707 [Homo sapiens]
Length = 1039
Score = 41.0 bits (94), Expect = 0.061
Identities = 64/293 (21%), Positives = 93/293 (30%), Gaps = 19/293 (6%)
Query: 84 PSQPTGAPTPSPAPQQYIVVTVSEGAMRASETVSEASPGSTASXXXXXXXXXXXXXXXXX 143
P++P P+ S P S+ A +AS+ A S S
Sbjct: 527 PAKPAKPPSQSRQPATQPARQASQ-ASQASQATHPAGQPSRPSQPTSQVSQPAKPSSQQS 585
Query: 144 XXXXXXSVQAKPGHVSPLQLTNIQVPQQALPTQRLVV--QSAAPGSKGGQVSLTV----- 196
S ++P S T Q Q A P + QS+ P S+ Q
Sbjct: 586 QPSHPASQASQPAKPSQPSQTTSQASQPAKPAKPASQPSQSSQPSSQASQARKATQPAKP 645
Query: 197 --HGTQQVHSPPEQ--SPVQANSSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVVQA 252
H Q PP Q P Q +S+ + A P + H Q S P +QE S
Sbjct: 646 SSHPASQASQPPSQPSQPNQPGQPASQVSQASEPPKPAKPASHLSQPSQP-SQEASQATQ 704
Query: 253 TPQAPKPG-PVQPLTV-----QGLQPXXXXXXXXXXXXXXXXXXYSSQVQYVEGGDASYT 306
Q KP P +P + Q QP ++ + +
Sbjct: 705 VRQTAKPAKPAKPASQASQASQASQPAKTTRLASHPSQPSHQASQPAKQASQPSQPSQPS 764
Query: 307 ASAIRSSTYSYPETPLYTQTASTSYYEAAGTATQVSTPATSQAVASSGSMPMY 359
+A ++S S P P T + A ATQ++ A A S P +
Sbjct: 765 QAASQASQASQPAKPASQPTQPSQPASQASQATQLAKTAKRAKPACQPSQPSH 817
>ref|NP_034869.1| lymphocyte antigen 64 [Mus musculus]
sp|P19467|C114_MOUSE CELL SURFACE ANTIGEN 114/A10 PRECURSOR
pir||A33533 cell surface glycoprotein precursor - mouse
gb|AAA37239.1| (J04634) cell surface antigen 114/A10 precursor [Mus musculus]
Length = 573
Score = 41.0 bits (94), Expect = 0.061
Identities = 53/205 (25%), Positives = 80/205 (38%), Gaps = 21/205 (10%)
Query: 155 PGHVSPLQLTNIQVPQQALPTQRLVVQSAAPGSKGGQVSLTVHGTQQVHSPP----EQSP 210
PG S T + PT VQS +PGS Q S T + SPP QSP
Sbjct: 43 PGSSSQASTTTSSSGGASPPT---TVQSQSPGSSS-QASTTTSSSGGA-SPPTTVQSQSP 97
Query: 211 VQANSSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVVQATPQAPKPGPVQPLTVQGL 270
++ +S+ T+ + + P +Q QS + + S ++ P P TVQ
Sbjct: 98 GSSSQASTTTSSSGGASPPTTVQ----SQSPGSSSQASTTTSSSGGASP----PTTVQSQ 149
Query: 271 QPXXXXXXXXXXXXXXXXXXYSSQVQYVEGGDASYTASAIRSSTYSYPETPLYTQTASTS 330
P ++ VQ G +S ++ SS + P T + +Q+ +S
Sbjct: 150 SPGSSSQASTTTSSSGGASPPTT-VQSQSPGSSSQVSTTTSSSGGASPPTTVQSQSPGSS 208
Query: 331 YYEAAGTATQVSTPATSQAVASSGS 355
+ TQ S A+S V S GS
Sbjct: 209 ---SQPGPTQPSGGASSSTVPSGGS 230
Score = 38.7 bits (88), Expect = 0.31
Identities = 45/177 (25%), Positives = 71/177 (39%), Gaps = 23/177 (12%)
Query: 180 VQSAAPGSKGGQVSLTVHGTQQVHSPP----EQSPVQANSSSSKTAGAPTGTVPQQLQVH 235
VQS +PGS Q S T + SPP QSP ++ +S+ T+ + + P +Q
Sbjct: 38 VQSQSPGSSS-QASTTTSSSGGA-SPPTTVQSQSPGSSSQASTTTSSSGGASPPTTVQ-- 93
Query: 236 GVQQSVPVTQERSVVQATPQAPKPGPVQPLTVQGLQPXXXXXXXXXXXXXXXXXXYSSQV 295
QS + + S ++ P P TVQ P + V
Sbjct: 94 --SQSPGSSSQASTTTSSSGGASP----PTTVQSQSPGSSSQASTTTSSSGGASP-PTTV 146
Query: 296 QYVEGGDASYTASAIRSSTYSYPETPLYTQTASTSYYEAAGTATQVSTPATSQAVAS 352
Q G +S ++ SS + P T + +Q + G+++QVST +S AS
Sbjct: 147 QSQSPGSSSQASTTTSSSGGASPPTTVQSQ--------SPGSSSQVSTTTSSSGGAS 195
Score = 38.3 bits (87), Expect = 0.40
Identities = 47/157 (29%), Positives = 57/157 (35%), Gaps = 23/157 (14%)
Query: 85 SQPTGAPTPSP-APQQYIVVTVSEGAMRASETVSEASPGSTASXXXXXXXXXXXXXXXXX 143
S PT + SP + Q T S G TV SPGS++
Sbjct: 60 SPPTTVQSQSPGSSSQASTTTSSSGGASPPTTVQSQSPGSSSQASTTTSSSGGASPPT-- 117
Query: 144 XXXXXXSVQAK-PGHVSPLQLTNIQVPQQALPTQRLVVQSAAPGSKGGQVSLTVHG---- 198
+VQ++ PG S T + PT VQS +PGS Q S T
Sbjct: 118 ------TVQSQSPGSSSQASTTTSSSGGASPPT---TVQSQSPGSSS-QASTTTSSSGGA 167
Query: 199 ----TQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQ 231
T Q SP S V +SSS A PT TV Q
Sbjct: 168 SPPTTVQSQSPGSSSQVSTTTSSSGGASPPT-TVQSQ 203
Score = 36.0 bits (81), Expect = 2.0
Identities = 46/186 (24%), Positives = 67/186 (35%), Gaps = 21/186 (11%)
Query: 182 SAAPGSKGGQVSLTVHGTQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQVHGVQQSV 241
S+ S GG T T Q SP S +SSS A PT VQ
Sbjct: 23 SSTTSSSGGTSPPT---TVQSQSPGSSSQASTTTSSSGGASPPTT----------VQSQS 69
Query: 242 PVTQERSVVQATPQAPKPGPVQPLTVQGLQPXXXXXXXXXXXXXXXXXXYSSQVQYVEGG 301
P + ++ +T + G P TVQ P + VQ G
Sbjct: 70 PGSSSQA---STTTSSSGGASPPTTVQSQSPGSSSQASTTTSSSGGASP-PTTVQSQSPG 125
Query: 302 DASYTASAIRSSTYSYPETPLYTQTASTSYYEAAGTATQ--VSTPAT--SQAVASSGSMP 357
+S ++ SS + P T + +Q+ +S + T++ S P T SQ+ SS +
Sbjct: 126 SSSQASTTTSSSGGASPPTTVQSQSPGSSSQASTTTSSSGGASPPTTVQSQSPGSSSQVS 185
Query: 358 MYVSGS 363
S S
Sbjct: 186 TTTSSS 191
>pir||T16509 hypothetical protein F59A6.3 - Caenorhabditis elegans
gb|AAA83456.1| (U41994) similar to glycoproteins [Caenorhabditis elegans]
Length = 786
Score = 40.6 bits (93), Expect = 0.079
Identities = 57/267 (21%), Positives = 83/267 (30%), Gaps = 30/267 (11%)
Query: 109 AMRASETVSEASPGSTASXXXXXXXXXXXXXXXXXXXXXXXSVQAKPGHVSPLQLTNIQV 168
A AS+ + P +T+ SV PG P ++ I
Sbjct: 353 ASSASDDPTTTGPSTTSGSTASTTSGSLFSTSLGSSQSPGSSVSTTPG---PSTISGISQ 409
Query: 169 PQQALPTQR---------LVVQSAAPGSKGGQVSLTVHGTQQVHSPPEQSP---VQANSS 216
+ PT V ++ P + G S T+ TQ S P +P + SS
Sbjct: 410 STTSGPTTTSEPSTTSGSTVSDTSGPSTTSGP-STTLGTTQSTTSGPSTTPGSTISTTSS 468
Query: 217 SSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVVQATPQAPKPGPVQPLTVQGLQPXXXX 276
+S T+G T + +V T +S T ++ GP +
Sbjct: 469 ASTTSGPSTSS----------GSTVSTTSGQSTSSGTTKSTTSGPTTSSGPSTVSERTLS 518
Query: 277 XXXXXXXXXXXXXXYSSQVQYVEGGDASYTASAIRSSTYSYPETPLYTQTASTSYYEAAG 336
S V G T S ST S P T TAS S
Sbjct: 519 TTSGPSTTSGPSTTSGSTVSTTPGAS---TTSGSTQSTTSGPSTSSGPSTASRSTVSTTS 575
Query: 337 TATQVSTPATSQAVA-SSGSMPMYVSG 362
+ S P+T+ + +SGS SG
Sbjct: 576 GPSTTSGPSTTSGPSTTSGSTKSTTSG 602
>gb|AAB03569.2| (U35622) EWS protein/E1A enhancer binding protein chimera [Homo
sapiens]
Length = 478
Score = 40.2 bits (92), Expect = 0.10
Identities = 55/271 (20%), Positives = 85/271 (31%), Gaps = 10/271 (3%)
Query: 96 APQQYIVVTV--SEGAMRASETVSEASPGSTASXXXXXXXXXXXXXXXXXXXXXXXSVQA 153
A Q Y T ++G + ++ + S G+ Q
Sbjct: 14 AQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQP 73
Query: 154 KPGHVSPLQLTNIQVPQQALPTQRLVVQSAA-PGSKGGQVSLTVHGTQQVHSPPEQSPVQ 212
G+ +P P Q T +A ++ + + +GTQ + Q P
Sbjct: 74 PTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAA 133
Query: 213 ANSSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVVQATPQAPKPGPVQPLTVQGLQP 272
+ + PT T Q G Q P + PQ P P+QP+T P
Sbjct: 134 TAPTRPQDGNKPTETSQPQSSTGGYNQ--PSLGYGQSNYSYPQVPGSYPMQPVTAPPSYP 191
Query: 273 XXXXXXXXXXXXXXXXXXYSSQVQYVEGGDASYTASAIRSSTYSYPETPLYTQTAST-SY 331
YS Q Y G +SY + SY + P + T SY
Sbjct: 192 --PTSYSSTQPTSYDQSSYSQQNTY--GQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSY 247
Query: 332 YEAAGTATQVSTPATSQAVASSGSMPMYVSG 362
+A +Q S+ Q V SM ++ G
Sbjct: 248 SQAPSQYSQQSSSYGQQNVTGCASMYLHTEG 278
>emb|CAA19845.2| (AL031028) /prediction=(method:""genscan"", version:""1.0"",
score:""477.26"")~/prediction=(method:""genefinder"",
version:""084"") [Drosophila melanogaster]
gb|AAF45644.1| (AE003421) EG:56G7.1 gene product [Drosophila melanogaster]
Length = 1795
Score = 39.9 bits (91), Expect = 0.14
Identities = 51/291 (17%), Positives = 90/291 (30%), Gaps = 29/291 (9%)
Query: 82 PAPSQPTGAPTPSPAPQQYIV-----------VTVSEGAMRASETVSEASPGSTASXXXX 130
P P+ TG PT +P P +T + ++ET S P +T
Sbjct: 698 PKPTSSTGKPTTTPKPSTRTTPTTTKVTTTTQITTTTPLRSSTETTSTQPPTTTTPQPTT 757
Query: 131 XXXXXXXXXXXXXXXXXXXSVQAKPGHVSPLQLTNIQVPQQALPTQRLVVQSAAPGSKGG 190
+ + P + Q T P T ++ + + +
Sbjct: 758 TTTLTVTPKTSTTTTTTEKPITSSPKPTTTTQKTTSTAPN----TTKVAITTQKETTPTQ 813
Query: 191 QVSLTVHGTQQVHSPPE----QSPVQANSSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQE 246
S T+ + + PE + P+ + + T T TV + + P T E
Sbjct: 814 STSTTIFTRKTTTNNPEPTSTEKPITSTTPKPSTTTPKTSTVASSTEKTTISSPKPTT-E 872
Query: 247 RSVVQATPQAPKPGPV----QPLTVQGLQPXXXXXXXXXXXXXXXXXXYSSQVQYVEGGD 302
+S T + K + Q T +P S+Q
Sbjct: 873 KSTENPTTNSVKTSALTSSTQRATSTTSEPTKTTQNITTTTPKPTTLKTSTQEATTSTQK 932
Query: 303 ASYTASAIRSSTYSYPETPLYTQTASTSYYEAAGTATQVSTPATSQAVASS 353
S + +T S P T L T+ +T+ + +TP T+ A++
Sbjct: 933 VSTVTITTKKATESSPLTTLSTEEPNTT-----PKPLRTTTPTTTSVTATT 978
>pir||S55316 mucin (clone PGM-2B) - pig
gb|AAC48525.1| (U10281) gastric mucin [Sus scrofa]
Length = 317
Score = 39.9 bits (91), Expect = 0.14
Identities = 70/301 (23%), Positives = 101/301 (33%), Gaps = 36/301 (11%)
Query: 79 PAVPAPSQPTGAPTPSPAPQQYIVVTVSEGAMRASETVSEAS-------------PGSTA 125
P P P PT + T S GA ++ +V +S P S+
Sbjct: 29 PKKDCPVSPITLPTTTSVRVTSPPETSSHGATSSTTSVQPSSSSSAPTTSATSVQPSSSG 88
Query: 126 SXXXXXXXXXXXXXXXXXXXXXXXSVQAKPGHVSPLQLTNIQVPQQ---ALPTQRLVVQS 182
S SVQ P+ T P A T VQS
Sbjct: 89 SAPTTSATSVQSSSSGSAPTTSATSVQPSSSSSPPISSTISVQPSSSSSAPTTSATSVQS 148
Query: 183 AAPGSKGGQVSLTVHGTQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQVHGVQQSVP 242
++ GS + +V + SPP S + SSS + APT T +Q S P
Sbjct: 149 SSSGSAPTTSATSVQPSSS-SSPPISSTISVQPSSS--SSAPT-TSATSVQ-SSSSSSAP 203
Query: 243 VTQERSVVQATPQAPKPGPVQPLT-VQGLQ----PXXXXXXXXXXXXXXXXXXYSSQVQY 297
T SV P + P T VQ P ++ VQ
Sbjct: 204 TTSATSV---QPSSSGSAPTTSATSVQSSSSSSPPISSTISVQTSSSSSSPTTSTTSVQP 260
Query: 298 VEGGDASYT-ASAIRSSTYSYPETPLYTQTASTSYYEAAGTATQVSTPATSQAVASSGSM 356
G A T A++++ S+ S P +ST + + +++ +T ATS +SS S
Sbjct: 261 SSSGSAPTTSATSVQPSSSSSP------PISSTISVQPSSSSSAPTTSATSVQSSSSSSA 314
Query: 357 P 357
P
Sbjct: 315 P 315
>ref|NP_041080.1| membrane glycoprotein [Equine herpesvirus 1]
sp|P28968|VGLX_HSVEB GLYCOPROTEIN X PRECURSOR
pir||VGBEX1 glycoprotein X precursor - equine herpesvirus 1 (strain Ab4p)
gb|AAB02506.1| (M86664) membrane glycoprotein [Equine herpesvirus 1]
Length = 797
Score = 39.5 bits (90), Expect = 0.18
Identities = 44/258 (17%), Positives = 70/258 (27%), Gaps = 20/258 (7%)
Query: 181 QSAAPGSKGGQVSLTVHGTQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQVHGVQQS 240
QS + G+ S T SPP + + SS+S + + T S
Sbjct: 40 QSTSSGTTNSSSSPTTSPPTTSSSPPTSTHTSSPSSTSTQSSSTAATSSSAPSTASSTTS 99
Query: 241 VPVTQERSVVQATPQAPKPGPVQPLTVQGLQPXXXXXXXXXXXXXXXXXXYSSQVQYVEG 300
+P + TP A P T P ++
Sbjct: 100 IPTSTSTETTTTTPTASTTTP----TTTTAAPTTAATTTAVTTAASTSAETTTA------ 149
Query: 301 GDASYTASAIRSSTYSYPETPLYTQTASTSYYEAAGTATQVSTPATSQAVASSGSMPMYV 360
TA+A + T + P T T TA+T+ A T T +T AT+ A ++ +
Sbjct: 150 -----TATATSTPTTTTP-TSTTTTTATTTVPTTASTTTDTTTAATTTAATTTAA----T 199
Query: 361 SGSQVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYVIQGGYM 420
+ +
Sbjct: 200 TTAATTTAATTTAATTTAATTTAATTSSATTAATTTAATTTAATTTAATTTAATTTAATT 259
Query: 421 LGSASQSYSHTTRASPAT 438
GS + + TT AS +T
Sbjct: 260 TGSPTSGSTSTTGASTST 277
Score = 37.5 bits (85), Expect = 0.69
Identities = 52/283 (18%), Positives = 82/283 (28%), Gaps = 41/283 (14%)
Query: 76 TELPAVPAPSQPTGAPTPSPAPQQYIVVTVSEGAMRASETVSEASPGSTASXXXXXXXXX 135
T P + T A T P T + A+ T + + +T +
Sbjct: 156 TPTTTTPTSTTTTTATTTVPTTASTTTDTTTAATTTAATTTAATTTAATTTAATTTAATT 215
Query: 136 XXXXXXXXXXXXXXSVQAKPGHVSPLQLTNIQVPQQALPTQRLVVQSAAPGSKGGQVSLT 195
+ + T A T S GS S T
Sbjct: 216 TAATTTAATTSSATTAATTTAATTTAATTTAATTTAATTTAATTTGSPTSGS----TSTT 271
Query: 196 VHGTQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVVQATPQ 255
T +P + A +S+ T+ A T + P T + ++T +
Sbjct: 272 GASTS---TPSASTATSATPTSTSTSAAATTSTPTP------------TSAATSAESTTE 316
Query: 256 APKPGPVQPLTVQGLQPXXXXXXXXXXXXXXXXXXYSSQVQYVEGGDASYTASAIRSSTY 315
AP P T S + V S T +A + ++
Sbjct: 317 APTSTPTTDTTTP---------------SEATTATTSPESTTVSASTTSATTTAFTTESH 361
Query: 316 SYPETPLYTQTASTSYYEAAGTAT---QVSTPATSQAVASSGS 355
+ P+ + T STS E + T T +TP+T Q SS S
Sbjct: 362 TSPD----SSTGSTSTAEPSSTFTLTPSTATPSTDQFTGSSAS 400
>emb|CAB37867.1| (AJ133273) atrophin-1 [Hylobates lar]
Length = 301
Score = 39.1 bits (89), Expect = 0.23
Identities = 59/286 (20%), Positives = 95/286 (32%), Gaps = 19/286 (6%)
Query: 82 PAP-SQPTGAPTPSPAPQQYIVVTVSEGAMRASETVSEASPGSTASXXXXXXXXXXXXXX 140
P+P S P + + +PAP + S + A+ S +S S++S
Sbjct: 15 PSPHSLPPASSSSAPAPPMRFPYSSSSSSSAAA---SSSSSSSSSSASPYPASQALPSYP 71
Query: 141 XXXXXXXXXSVQAKPGHVS----PLQLTNIQVPQQALPTQRLVVQSAA-----PGSKGGQ 191
SV +P + P Q Q P P RL+ S A P S G Q
Sbjct: 72 HSFPPPTSLSVSNQPPKYTQPSLPSQAVWSQGPPPPPPYGRLLANSNAHPGPFPPSTGAQ 131
Query: 192 VSLTVHGTQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVVQ 251
+ + H +Q Q + P G P L+ + P S+
Sbjct: 132 STAHAPASTHHHHHQQQQQQQQQQHHGSSGPPPPGAFPHPLEGGSSHHAHPYAMSPSLGS 191
Query: 252 ATPQAPKPGPVQPLTVQGLQPXXXXXXXXXXXXXXXXXXYSSQVQYVEGGDASYTASAIR 311
P P P + P Q + +SQ Y + + ++
Sbjct: 192 LRPYPPGPAHLPPPHSQ-VSYSQAGPNGPPVSSSSNSSSSTSQGSYPCSHPS--PSQGLQ 248
Query: 312 SSTYSYPETPLYTQTASTSYYEAAGTATQVSTPATSQAVASSGSMP 357
+ Y +P P T +++T + AT VS+PA + + G P
Sbjct: 249 GAPYPFPPVPTVTTSSATL---STVIATVVSSPAGYKTASPPGPPP 291
>gb|AAC51331.2| (U85962) CREB-binding protein [Homo sapiens]
Length = 2442
Score = 39.1 bits (89), Expect = 0.23
Identities = 35/115 (30%), Positives = 49/115 (42%), Gaps = 12/115 (10%)
Query: 159 SPLQLTNIQVPQQALP--TQRLVVQSAAPGSKGGQVSLTVHGTQQVHSPPEQS-PVQANS 215
+PL + Q Q P TQ + + P S + H T +PP+ + P Q ++
Sbjct: 831 NPLNMLGPQASQLPCPPVTQSPLHPTPPPASTAAGMPSLQHTTPPGMTPPQPAAPTQPST 890
Query: 216 SSSKTAGAPT---GTVPQQLQVHGVQQSVPVTQERSVVQATPQAPKPGPVQPLTV 267
S + PT G+VP Q QS P Q + Q TPQ PVQP +V
Sbjct: 891 PVSSSGQTPTPTPGSVPSATQT----QSTPTVQAAAQAQVTPQPQT--PVQPPSV 939
>gb|AAF70456.1|AF221952_1 (AF221952) mu-protocadherin [Rattus norvegicus]
Length = 862
Score = 39.1 bits (89), Expect = 0.23
Identities = 56/217 (25%), Positives = 79/217 (35%), Gaps = 45/217 (20%)
Query: 163 LTNIQVPQQALPTQRLVVQSAAPGSKGGQVSLTVHGTQQVHSPPEQSPVQANSSSSKTAG 222
+ IQV ++ P+ +S P GG + + T + S S A +SS +AG
Sbjct: 449 IVEIQVSEREPPS----TESPTPPEAGGTTGPSSNTTLETPSTSGTSQGPATTSSGGSAG 504
Query: 223 A--PTGTVPQQLQ----VHGVQQSVPVTQERSVVQATP-----QAPKPGPVQPLTVQGLQ 271
P GT L V G ++ ++ S ATP Q PKPG QP+
Sbjct: 505 PFPPAGTTLSPLTSAPTVPGGSPTLGIST--SPQTATPGGDATQTPKPGTSQPMVP---- 558
Query: 272 PXXXXXXXXXXXXXXXXXXYSSQVQYVEGGDASYTASAIRSSTYSYPETPLYTQTASTSY 331
S Q + G + +ST S P TP
Sbjct: 559 ------TPGASTSSQPATPSGSSTQTPKPGTSQPMVPTPGASTSSQPATP---------- 602
Query: 332 YEAAGTATQVSTPATSQAV-----ASSGSMPMYVSGS 363
+G++TQ P TSQ + AS+ S P SGS
Sbjct: 603 ---SGSSTQTPRPGTSQPMVPTPGASTSSQPATPSGS 636
Score = 34.0 bits (76), Expect = 7.8
Identities = 44/198 (22%), Positives = 58/198 (29%), Gaps = 14/198 (7%)
Query: 77 ELPAVPAPSQPTGAPTPSPAPQQYIVVTVSEGAMRASETVSE---ASPGSTASXXXXXXX 133
E P+ +P+ P T P+ + + G + T S A P A
Sbjct: 458 EPPSTESPTPPEAGGTTGPSSNTTLETPSTSGTSQGPATTSSGGSAGPFPPAGTTLSPLT 517
Query: 134 XXXXXXXXXXXXXXXXSVQAKPGHVSPLQLTNIQVPQQALPTQRLVVQSAAPGSKGGQVS 193
S Q Q Q +PT S G
Sbjct: 518 SAPTVPGGSPTLGISTSPQTATPGGDATQTPKPGTSQPMVPTPGASTSSQPATPSGSSTQ 577
Query: 194 LTVHGTQQVHSPPEQSPVQANSSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVVQ-A 252
GT Q P +S+S P+G+ Q + Q VP + Q A
Sbjct: 578 TPKPGTSQPMVPTP------GASTSSQPATPSGSSTQTPRPGTSQPMVPTPGASTSSQPA 631
Query: 253 TP----QAPKPGPVQPLT 266
TP Q PKPG QP T
Sbjct: 632 TPSGSTQTPKPGTSQPTT 649
Score = 34.0 bits (76), Expect = 7.8
Identities = 51/203 (25%), Positives = 71/203 (34%), Gaps = 27/203 (13%)
Query: 168 VPQQALPTQRLVVQSAAPGSKGGQVSLTVHGTQQVHSPPEQSPVQANSSSSKTAGAPTGT 227
VP + T R+ V++ +K ++ + P +SP + T G + T
Sbjct: 423 VPMETERTIRIEVEANNTVTKDIATTIVEIQVSEREPPSTESPTPPEAGG--TTGPSSNT 480
Query: 228 VPQQLQVHGVQQSVPVTQERSVVQATPQAPKPGPVQPLTVQGLQPXXXXXXXXXXXXXXX 287
+ G Q T A P P + PLT P
Sbjct: 481 TLETPSTSGTSQGPATTSSGG--SAGPFPPAGTTLSPLTSAPTVPGGSPTLGIS------ 532
Query: 288 XXXYSSQVQYVEGGDASYTASAIRSSTYSYPETPLYTQTASTSYYEA--AGTATQVSTPA 345
+S GGDA+ T S P P T ASTS A +G++TQ P
Sbjct: 533 ----TSPQTATPGGDATQTPKPGTSQ----PMVP--TPGASTSSQPATPSGSSTQTPKPG 582
Query: 346 TSQAV-----ASSGSMPMYVSGS 363
TSQ + AS+ S P SGS
Sbjct: 583 TSQPMVPTPGASTSSQPATPSGS 605
>ref|NP_004371.1| CREB binding protein (Rubinstein-Taybi syndrome); CREB binding
protein [Homo sapiens]
sp|Q92793|CBP_HUMAN CREB-BINDING PROTEIN
gb|AAC51770.1| (U47741) CREB-binding protein [Homo sapiens]
Length = 2442
Score = 39.1 bits (89), Expect = 0.23
Identities = 35/115 (30%), Positives = 49/115 (42%), Gaps = 12/115 (10%)
Query: 159 SPLQLTNIQVPQQALP--TQRLVVQSAAPGSKGGQVSLTVHGTQQVHSPPEQS-PVQANS 215
+PL + Q Q P TQ + + P S + H T +PP+ + P Q ++
Sbjct: 831 NPLNMLGPQASQLPCPPVTQSPLHPTPPPASTAAGMPSLQHTTPPGMTPPQPAAPTQPST 890
Query: 216 SSSKTAGAPT---GTVPQQLQVHGVQQSVPVTQERSVVQATPQAPKPGPVQPLTV 267
S + PT G+VP Q QS P Q + Q TPQ PVQP +V
Sbjct: 891 PVSSSGQTPTPTPGSVPSATQT----QSTPTVQAAAQAQVTPQPQT--PVQPPSV 939
>pir||S39162 transcription coactivator CREB-binding protein - human
Length = 2440
Score = 39.1 bits (89), Expect = 0.23
Identities = 35/115 (30%), Positives = 49/115 (42%), Gaps = 12/115 (10%)
Query: 159 SPLQLTNIQVPQQALP--TQRLVVQSAAPGSKGGQVSLTVHGTQQVHSPPEQS-PVQANS 215
+PL + Q Q P TQ + + P S + H T +PP+ + P Q ++
Sbjct: 831 NPLNMLGPQASQLPCPPVTQSPLHPTPPPASTAAGMPSLQHTTPPGMTPPQPAAPTQPST 890
Query: 216 SSSKTAGAPT---GTVPQQLQVHGVQQSVPVTQERSVVQATPQAPKPGPVQPLTV 267
S + PT G+VP Q QS P Q + Q TPQ PVQP +V
Sbjct: 891 PVSSSGQTPTPTPGSVPSATQT----QSTPTVQAAAQAQVTPQPQT--PVQPPSV 939
>gb|AAB01610.1| (L36831) transcription regulator [Mus musculus]
Length = 1282
Score = 38.7 bits (88), Expect = 0.31
Identities = 29/88 (32%), Positives = 40/88 (44%), Gaps = 15/88 (17%)
Query: 190 GQV-SLTVHGTQQVHSPPEQSPVQANSSSSKTAGAPTG-------TVPQQLQVH--GVQQ 239
GQV + HG Q PP Q P Q +S S+ T P + P QLQ H QQ
Sbjct: 206 GQVHQIPTHGIQPQPQPPPQHPSQPSSQSAPTPAQPAPQPTAAKVSKPSQLQAHTPASQQ 265
Query: 240 SVPVTQERSVVQATPQAPKPGPVQPLTV 267
+ P+ A+P++P P P+T+
Sbjct: 266 TPPLPP-----YASPRSPPVQPHTPVTI 288
>ref|NP_053733.1| Ewing sarcoma breakpoint region 1, isoform EWS [Homo sapiens]
ref|XP_000929.1| similar to Ewing sarcoma breakpoint region 1 (H. sapiens) [Homo
sapiens]
Length = 583
Score = 38.7 bits (88), Expect = 0.31
Identities = 53/266 (19%), Positives = 83/266 (30%), Gaps = 15/266 (5%)
Query: 96 APQQYIVVTV--SEGAMRASETVSEASPGSTASXXXXXXXXXXXXXXXXXXXXXXXSVQA 153
A Q Y T ++G + ++ + S G+ Q
Sbjct: 14 AQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQP 73
Query: 154 KPGHVSPLQLTNIQVPQQALPTQRLVVQSAA-PGSKGGQVSLTVHGTQQVHSPPEQSPVQ 212
G+ +P P Q T +A ++ + + +GTQ + Q P
Sbjct: 74 PTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAA 133
Query: 213 ANSSSSKTAGAPTGTVPQQLQVHGVQQSVPVTQERSVVQATPQAPKPGPVQPLTVQGLQP 272
+ + PT T Q G Q P + PQ P P+QP+T P
Sbjct: 134 TAPTRPQDGNKPTETSQPQSSTGGYNQ--PSLGYGQSNYSYPQVPGSYPMQPVTAPPSYP 191
Query: 273 XXXXXXXXXXXXXXXXXXYSSQVQYVEGGDASYTASAIRSSTYSYPETPLYTQTASTSYY 332
YS Q Y G +SY + SY + Y Q TSY
Sbjct: 192 PTSYSSTQPTSYDQSS--YSQQNTY--GQPSSYGQQS------SYGQQSSYGQQPPTSYP 241
Query: 333 EAAGTATQVSTPATSQAVASSGSMPM 358
G+ +Q + + Q+ + PM
Sbjct: 242 PQTGSYSQAPSQYSQQSSSYGQQRPM 267
>ref|NP_035508.1| transcriptional regulator, SIN3 yeast homolog A [Mus musculus]
gb|AAA89119.1| (U22394) mSin3A [Mus musculus]
Length = 1274
Score = 38.7 bits (88), Expect = 0.31
Identities = 29/88 (32%), Positives = 40/88 (44%), Gaps = 15/88 (17%)
Query: 190 GQV-SLTVHGTQQVHSPPEQSPVQANSSSSKTAGAPTG-------TVPQQLQVH--GVQQ 239
GQV + HG Q PP Q P Q +S S+ T P + P QLQ H QQ
Sbjct: 206 GQVHQIPTHGIQPQPQPPPQHPSQPSSQSAPTPAQPAPQPTAAKVSKPSQLQAHTPASQQ 265
Query: 240 SVPVTQERSVVQATPQAPKPGPVQPLTV 267
+ P+ A+P++P P P+T+
Sbjct: 266 TPPLPP-----YASPRSPPVQPHTPVTI 288
Database: nr
Posted date: Nov 26, 2000 9:38 PM
Number of letters in database: 184,285,689
Number of sequences in database: 585,701
Lambda K H
0.314 0.129 0.368
Gapped
Lambda K H
0.270 0.0470 0.230
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 283896506
Number of Sequences: 585701
Number of extensions: 10712628
Number of successful extensions: 49962
Number of sequences better than 10.0: 137
Number of HSP's better than 10.0 without gapping: 25
Number of HSP's successfully gapped in prelim test: 118
Number of HSP's that attempted gapping in prelim test: 49205
Number of HSP's gapped (non-prelim): 653
length of query: 979
length of database: 184,285,689
effective HSP length: 62
effective length of query: 917
effective length of database: 147,972,227
effective search space: 135690532159
effective search space used: 135690532159
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 41 (21.6 bits)
S2: 76 (34.0 bits)
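
The footer parameters above are enough to recompute the bit scores and Expect values shown in the alignment headers. The following is a minimal sketch, assuming the standard Karlin-Altschul relations (bits = (Lambda*S - ln K) / ln 2, E = K * search_space * exp(-Lambda*S)) with the gapped Lambda and K, and assuming the effective lengths are obtained by subtracting the effective HSP length once from the query and once per database sequence. The function and constant names are hypothetical and are not part of BLAST itself.

```python
import math

# Values copied from the statistics footer of this report.
GAPPED_LAMBDA = 0.270
GAPPED_K = 0.0470
QUERY_LENGTH = 979
DB_LETTERS = 184_285_689
DB_SEQUENCES = 585_701
EFFECTIVE_HSP_LENGTH = 62


def effective_search_space(query_len, db_letters, db_sequences, hsp_len):
    """Product of the effective query length and effective database length."""
    eff_query = query_len - hsp_len                # 979 - 62 = 917
    eff_db = db_letters - db_sequences * hsp_len   # one correction per sequence
    return eff_query * eff_db


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Expected number of chance hits scoring at least raw_score."""
    return k * search_space * math.exp(-lam * raw_score)


if __name__ == "__main__":
    space = effective_search_space(
        QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
    )
    print("effective search space:", space)
    for raw in (100, 90):
        print(raw, round(bit_score(raw), 1), f"{expect_value(raw, space):.2g}")
```

Under these assumptions, a raw score of 90 works out to about 39.5 bits with an Expect near 0.18, and a raw score of 100 to about 43.4 bits with an Expect near 0.012, which agrees with the headers in this report. Small differences from BLAST's printed values can arise from its internal rounding.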