BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv3769 Length=90 Score E Sequences producing significant alignments: (Bits) Value gi|15610905|ref|NP_218286.1| hypothetical protein Rv3769 [Mycoba... 175 2e-42 gi|306778661|ref|ZP_07416998.1| hypothetical protein TMBG_02307 ... 173 8e-42 gi|289441205|ref|ZP_06430949.1| LOW QUALITY PROTEIN: conserved h... 94.4 5e-18 gi|289759938|ref|ZP_06519316.1| predicted protein [Mycobacterium... 82.8 1e-14 gi|145224255|ref|YP_001134933.1| hypothetical protein Mflv_3671 ... 67.0 8e-10 gi|240170024|ref|ZP_04748683.1| hypothetical protein MkanA1_1197... 64.7 5e-09 gi|289572016|ref|ZP_06452243.1| hypothetical protein TBJG_02904 ... 59.3 2e-07 gi|183984927|ref|YP_001853218.1| hypothetical protein MMAR_4959 ... 50.1 1e-04 gi|321444297|gb|EFX60363.1| hypothetical protein DAPPUDRAFT_7199... 36.2 1.7 gi|321465406|gb|EFX76407.1| hypothetical protein DAPPUDRAFT_1885... 35.8 2.1 gi|260814598|ref|XP_002602001.1| hypothetical protein BRAFLDRAFT... 35.8 2.3 gi|49119498|gb|AAH73619.1| Eps15R protein [Xenopus laevis] 35.8 2.3 gi|148231027|ref|NP_001084490.1| epidermal growth factor recepto... 35.0 3.4 gi|260787745|ref|XP_002588912.1| hypothetical protein BRAFLDRAFT... 34.3 5.9 gi|45185571|ref|NP_983287.1| ACL117Wp [Ashbya gossypii ATCC 1089... 34.3 6.2 gi|189230039|ref|NP_001121513.1| epidermal growth factor recepto... 33.9 8.1 >gi|15610905|ref|NP_218286.1| hypothetical protein Rv3769 [Mycobacterium tuberculosis H37Rv] gi|31794939|ref|NP_857432.1| hypothetical protein Mb3795 [Mycobacterium bovis AF2122/97] gi|121639683|ref|YP_979907.1| hypothetical protein BCG_3828 [Mycobacterium bovis BCG str. Pasteur 1173P2] 53 more sequence titlesLength=90 Score = 175 bits (444), Expect = 2e-42, Method: Compositional matrix adjust. Identities = 89/90 (99%), Positives = 90/90 (100%), Gaps = 0/90 (0%) Query 1 VTTLKELGARVAALEANQADYRAVLAAVNPPGANQREIATTVREHTGRLDRVTTKVGQLA 60 +TTLKELGARVAALEANQADYRAVLAAVNPPGANQREIATTVREHTGRLDRVTTKVGQLA Sbjct 1 MTTLKELGARVAALEANQADYRAVLAAVNPPGANQREIATTVREHTGRLDRVTTKVGQLA 60 Query 61 AKSDDTNARVRSLEEGQAEIKDLLLRALDK 90 AKSDDTNARVRSLEEGQAEIKDLLLRALDK Sbjct 61 AKSDDTNARVRSLEEGQAEIKDLLLRALDK 90 >gi|306778661|ref|ZP_07416998.1| hypothetical protein TMBG_02307 [Mycobacterium tuberculosis SUMu002] gi|306791050|ref|ZP_07429372.1| hypothetical protein TMDG_01505 [Mycobacterium tuberculosis SUMu004] gi|306791369|ref|ZP_07429671.1| hypothetical protein TMEG_00264 [Mycobacterium tuberculosis SUMu005] gi|308328384|gb|EFP17235.1| hypothetical protein TMBG_02307 [Mycobacterium tuberculosis SUMu002] gi|308332631|gb|EFP21482.1| hypothetical protein TMDG_01505 [Mycobacterium tuberculosis SUMu004] gi|308340122|gb|EFP28973.1| hypothetical protein TMEG_00264 [Mycobacterium tuberculosis SUMu005] Length=90 Score = 173 bits (438), Expect = 8e-42, Method: Compositional matrix adjust. Identities = 88/90 (98%), Positives = 90/90 (100%), Gaps = 0/90 (0%) Query 1 VTTLKELGARVAALEANQADYRAVLAAVNPPGANQREIATTVREHTGRLDRVTTKVGQLA 60 +TTLKELGARVAALEANQADYRAVLAAVNPPGANQREIATTVRE+TGRLDRVTTKVGQLA Sbjct 1 MTTLKELGARVAALEANQADYRAVLAAVNPPGANQREIATTVREYTGRLDRVTTKVGQLA 60 Query 61 AKSDDTNARVRSLEEGQAEIKDLLLRALDK 90 AKSDDTNARVRSLEEGQAEIKDLLLRALDK Sbjct 61 AKSDDTNARVRSLEEGQAEIKDLLLRALDK 90 >gi|289441205|ref|ZP_06430949.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium tuberculosis T46] gi|289414124|gb|EFD11364.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium tuberculosis T46] Length=91 Score = 94.4 bits (233), Expect = 5e-18, Method: Compositional matrix adjust. Identities = 46/47 (98%), Positives = 47/47 (100%), Gaps = 0/47 (0%) Query 1 VTTLKELGARVAALEANQADYRAVLAAVNPPGANQREIATTVREHTG 47 +TTLKELGARVAALEANQADYRAVLAAVNPPGANQREIATTVREHTG Sbjct 1 MTTLKELGARVAALEANQADYRAVLAAVNPPGANQREIATTVREHTG 47 >gi|289759938|ref|ZP_06519316.1| predicted protein [Mycobacterium tuberculosis T85] gi|289715502|gb|EFD79514.1| predicted protein [Mycobacterium tuberculosis T85] Length=109 Score = 82.8 bits (203), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 41/46 (90%), Positives = 43/46 (94%), Gaps = 0/46 (0%) Query 38 IATTVREHTGRLDRVTTKVGQLAAKSDDTNARVRSLEEGQAEIKDL 83 IATTVR+ +GRLDRVTTKVGQL AKSDDTNARVRSLEEGQ EIKDL Sbjct 41 IATTVRKQSGRLDRVTTKVGQLVAKSDDTNARVRSLEEGQDEIKDL 86 >gi|145224255|ref|YP_001134933.1| hypothetical protein Mflv_3671 [Mycobacterium gilvum PYR-GCK] gi|315444590|ref|YP_004077469.1| hypothetical protein Mspyr1_30170 [Mycobacterium sp. Spyr1] gi|145216741|gb|ABP46145.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK] gi|315262893|gb|ADT99634.1| hypothetical protein Mspyr1_30170 [Mycobacterium sp. Spyr1] Length=93 Score = 67.0 bits (162), Expect = 8e-10, Method: Compositional matrix adjust. Identities = 40/87 (46%), Positives = 58/87 (67%), Gaps = 4/87 (4%) Query 4 LKELGARVAALEANQADYRAVLAAVNPPGANQREIATTVREHTGRL----DRVTTKVGQL 59 L + AR++ALEA+ ADYRAVLAA+N GAN+R+ T + RL +R+ + +L Sbjct 2 LDDHEARISALEASHADYRAVLAAINALGANERDHVTRLTSVDNRLIAVDNRLISVETEL 61 Query 60 AAKSDDTNARVRSLEEGQAEIKDLLLR 86 A +T AR+RS++E AEIKDL++R Sbjct 62 ADFRQETRARLRSVDEHLAEIKDLIIR 88 >gi|240170024|ref|ZP_04748683.1| hypothetical protein MkanA1_11976 [Mycobacterium kansasii ATCC 12478] Length=87 Score = 64.7 bits (156), Expect = 5e-09, Method: Compositional matrix adjust. Identities = 37/87 (43%), Positives = 55/87 (64%), Gaps = 10/87 (11%) Query 4 LKELGARVAALEANQADYRAVLAAVNPPGANQREIATTVREHTGRLDRVTTKVGQLAAKS 63 L L ARVAA+E +QADYR+++ A+ G Q+ +A +R + G++ A + Sbjct 11 LANLKARVAAVERSQADYRSMVEAIKAFGETQQLLADVLRGY----------AGEMRATA 60 Query 64 DDTNARVRSLEEGQAEIKDLLLRALDK 90 DD+N R+RSLE AEIK+LL RAL++ Sbjct 61 DDSNQRIRSLETSTAEIKNLLTRALER 87 >gi|289572016|ref|ZP_06452243.1| hypothetical protein TBJG_02904 [Mycobacterium tuberculosis T17] gi|289545770|gb|EFD49418.1| hypothetical protein TBJG_02904 [Mycobacterium tuberculosis T17] Length=31 Score = 59.3 bits (142), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 29/31 (94%), Positives = 30/31 (97%), Gaps = 0/31 (0%) Query 1 VTTLKELGARVAALEANQADYRAVLAAVNPP 31 +TTLKELGARVAALEAN ADYRAVLAAVNPP Sbjct 1 MTTLKELGARVAALEANPADYRAVLAAVNPP 31 >gi|183984927|ref|YP_001853218.1| hypothetical protein MMAR_4959 [Mycobacterium marinum M] gi|183178253|gb|ACC43363.1| conserved hypothetical protein [Mycobacterium marinum M] Length=87 Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 28/83 (34%), Positives = 47/83 (57%), Gaps = 10/83 (12%) Query 2 TTLKELGARVAALEANQADYRAVLAAVNPPGANQREIATTVREHTGRLDRVTTKVGQLAA 61 +TL++L AR+AALEA++ Y ++ A+ G Q+ +A +R + G + Sbjct 12 STLRDLKARIAALEASRTSYEEIVDAIKAFGQTQQMLADVLRAYG----------GDMRG 61 Query 62 KSDDTNARVRSLEEGQAEIKDLL 84 ++D+N R+R LE AEIK +L Sbjct 62 TAEDSNERIRKLEASVAEIKKML 84 >gi|321444297|gb|EFX60363.1| hypothetical protein DAPPUDRAFT_71996 [Daphnia pulex] Length=156 Score = 36.2 bits (82), Expect = 1.7, Method: Compositional matrix adjust. Identities = 16/33 (49%), Positives = 23/33 (70%), Gaps = 0/33 (0%) Query 43 REHTGRLDRVTTKVGQLAAKSDDTNARVRSLEE 75 REH R++ + T V QL AK++DT R+R LE+ Sbjct 65 REHHTRVNELETHVHQLRAKNEDTTKRIRQLEQ 97 >gi|321465406|gb|EFX76407.1| hypothetical protein DAPPUDRAFT_188557 [Daphnia pulex] Length=307 Score = 35.8 bits (81), Expect = 2.1, Method: Compositional matrix adjust. Identities = 16/33 (49%), Positives = 23/33 (70%), Gaps = 0/33 (0%) Query 43 REHTGRLDRVTTKVGQLAAKSDDTNARVRSLEE 75 REH R++ + T V QL AK++DT R+R LE+ Sbjct 88 REHHTRVNELETHVHQLRAKNEDTTKRIRQLEQ 120 >gi|260814598|ref|XP_002602001.1| hypothetical protein BRAFLDRAFT_82587 [Branchiostoma floridae] gi|229287306|gb|EEN58013.1| hypothetical protein BRAFLDRAFT_82587 [Branchiostoma floridae] Length=1201 Score = 35.8 bits (81), Expect = 2.3, Method: Compositional matrix adjust. Identities = 27/80 (34%), Positives = 41/80 (52%), Gaps = 0/80 (0%) Query 4 LKELGARVAALEANQADYRAVLAAVNPPGANQREIATTVREHTGRLDRVTTKVGQLAAKS 63 L ++ A V AL+ Q D R + A V+ +Q ++ TTV D V+ V LA Sbjct 367 LDDMSATVNALKRGQDDIRRLSATVDALKRDQDKMYTTVGTLKRDQDDVSATVDALAGDL 426 Query 64 DDTNARVRSLEEGQAEIKDL 83 DD + V +L+ GQ +I+ L Sbjct 427 DDMSTTVNALKRGQDDIRRL 446 >gi|49119498|gb|AAH73619.1| Eps15R protein [Xenopus laevis] Length=850 Score = 35.8 bits (81), Expect = 2.3, Method: Composition-based stats. Identities = 25/88 (29%), Positives = 45/88 (52%), Gaps = 11/88 (12%) Query 1 VTTLKELGARVAALE----ANQADYRAVLAAVNPPGANQREIATTVREHTGRLDRVTTKV 56 V L E+ +A L+ A + D R A+ R+ +T V+E LDR T+ + Sbjct 382 VKELDEISQEIAQLQREKYALEQDIREKEEAI-------RQKSTEVQELQNDLDRETSTL 434 Query 57 GQLAAKSDDTNARVRSLEEGQAEIKDLL 84 +L A+ D R+ +++ +A++KD+L Sbjct 435 QELEAQKQDAQDRLDEMDQQKAKLKDML 462 >gi|148231027|ref|NP_001084490.1| epidermal growth factor receptor pathway substrate 15-like 1 [Xenopus laevis] gi|32364687|gb|AAP80383.1| EH domain protein [Xenopus laevis] Length=897 Score = 35.0 bits (79), Expect = 3.4, Method: Composition-based stats. Identities = 25/88 (29%), Positives = 45/88 (52%), Gaps = 11/88 (12%) Query 1 VTTLKELGARVAALE----ANQADYRAVLAAVNPPGANQREIATTVREHTGRLDRVTTKV 56 V L E+ +A L+ A + D R A+ R+ +T V+E LDR T+ + Sbjct 382 VKELDEISQEIAQLQREKYALEQDIREKEEAI-------RQKSTEVQELQNDLDRETSTL 434 Query 57 GQLAAKSDDTNARVRSLEEGQAEIKDLL 84 +L A+ D R+ +++ +A++KD+L Sbjct 435 QELEAQKQDAQDRLDEMDQQKAKLKDML 462 >gi|260787745|ref|XP_002588912.1| hypothetical protein BRAFLDRAFT_89100 [Branchiostoma floridae] gi|229274084|gb|EEN44923.1| hypothetical protein BRAFLDRAFT_89100 [Branchiostoma floridae] Length=833 Score = 34.3 bits (77), Expect = 5.9, Method: Compositional matrix adjust. Identities = 27/89 (31%), Positives = 45/89 (51%), Gaps = 3/89 (3%) Query 4 LKELGARVAALEANQADYRAVLAAVNPPGANQREIATTVREHTGRLDRVTTKVGQLAAKS 63 +++L V AL+ +Q D R + A V+ +Q +++TTV D +T V L Sbjct 303 MRQLSTTVDALKRDQDDMRHLSATVDALKRDQDDMSTTVDALKRDQDDTSTTVDALKRDQ 362 Query 64 DDTNARVRSLEEGQAEIK---DLLLRALD 89 DD + V +L+ Q ++ D+L R LD Sbjct 363 DDMSTTVDALKRDQDDMSTTVDVLKRDLD 391 >gi|45185571|ref|NP_983287.1| ACL117Wp [Ashbya gossypii ATCC 10895] gi|44981289|gb|AAS51111.1| ACL117Wp [Ashbya gossypii ATCC 10895] Length=581 Score = 34.3 bits (77), Expect = 6.2, Method: Composition-based stats. Identities = 29/79 (37%), Positives = 37/79 (47%), Gaps = 6/79 (7%) Query 3 TLKELGARVAALEANQADYRAVLAAVNPPGANQREIATTVREHTGRLDRVTTKVGQL--- 59 TLKE A+ A L Q R V +NP G Q I G + ++T K G Sbjct 368 TLKERAAKTAPLPEGQDIIRPVSQPLNPRGHLQ--ILYGSLAPGGAVGKITGKEGTFFQG 425 Query 60 -AAKSDDTNARVRSLEEGQ 77 A D+ NA +R+LEEGQ Sbjct 426 RARVFDEENAFIRALEEGQ 444 >gi|189230039|ref|NP_001121513.1| epidermal growth factor receptor pathway substrate 15-like 1 [Xenopus (Silurana) tropicalis] gi|183985776|gb|AAI66356.1| LOC100158630 protein [Xenopus (Silurana) tropicalis] Length=898 Score = 33.9 bits (76), Expect = 8.1, Method: Composition-based stats. Identities = 24/88 (28%), Positives = 45/88 (52%), Gaps = 11/88 (12%) Query 1 VTTLKELGARVAALE----ANQADYRAVLAAVNPPGANQREIATTVREHTGRLDRVTTKV 56 V L E+ +A L+ A + D R A+ R+ +T V++ LDR T+ + Sbjct 382 VKELDEISQEIAQLQREKYALEQDIREKEEAI-------RQKSTEVQDLQNDLDRETSTL 434 Query 57 GQLAAKSDDTNARVRSLEEGQAEIKDLL 84 +L A+ D R+ +++ +A++KD+L Sbjct 435 QELEAQKQDAQDRLDEMDQQKAKLKDML 462 Lambda K H 0.312 0.127 0.329 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 129182109240 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40