BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3898c
Length=110
Score E
Sequences producing significant alignments: (Bits) Value
gi|326905731|gb|EGE52664.1| hypothetical protein TBPG_03696 [Myc... 214 4e-54
gi|289748435|ref|ZP_06507813.1| conserved hypothetical protein [... 213 6e-54
gi|289441339|ref|ZP_06431083.1| conserved hypothetical protein [... 213 6e-54
gi|289747742|ref|ZP_06507120.1| conserved hypothetical protein [... 213 6e-54
gi|340628869|ref|YP_004747321.1| hypothetical protein MCAN_39191... 213 7e-54
gi|298527370|ref|ZP_07014779.1| conserved hypothetical protein [... 213 7e-54
gi|31795071|ref|NP_857564.1| hypothetical protein Mb3927c [Mycob... 213 7e-54
gi|289572154|ref|ZP_06452381.1| conserved hypothetical protein [... 213 8e-54
gi|253800948|ref|YP_003033950.1| hypothetical protein TBMG_03946... 211 3e-53
gi|15611034|ref|NP_218415.1| hypothetical protein Rv3898c [Mycob... 211 3e-53
gi|339633888|ref|YP_004725530.1| hypothetical protein MAF_39130 ... 211 3e-53
gi|289760070|ref|ZP_06519448.1| conserved hypothetical protein [... 166 1e-39
gi|289756032|ref|ZP_06515410.1| conserved hypothetical protein [... 166 1e-39
gi|240168378|ref|ZP_04747037.1| hypothetical protein MkanA1_0364... 105 2e-21
gi|333988703|ref|YP_004521317.1| hypothetical protein JDM601_006... 39.7 0.16
gi|183983066|ref|YP_001851357.1| hypothetical protein MMAR_3067 ... 37.0 0.90
gi|118617824|ref|YP_906156.1| hypothetical protein MUL_2312 [Myc... 37.0 1.0
gi|41410423|ref|NP_963259.1| hypothetical protein MAP4325c [Myco... 36.2 1.5
gi|195382089|ref|XP_002049764.1| GJ20572 [Drosophila virilis] >g... 35.4 2.6
gi|195382091|ref|XP_002049765.1| GJ20571 [Drosophila virilis] >g... 35.4 2.6
gi|31793266|ref|NP_855759.1| hypothetical protein Mb2109 [Mycoba... 35.0 4.1
gi|289447704|ref|ZP_06437448.1| conserved hypothetical protein [... 35.0 4.1
gi|323719381|gb|EGB28520.1| hypothetical protein TMMG_01360 [Myc... 34.7 4.9
gi|308376907|ref|ZP_07668376.1| hypothetical protein TMHG_01272 ... 34.7 4.9
gi|254232248|ref|ZP_04925575.1| conserved hypothetical protein [... 34.3 5.6
gi|15609220|ref|NP_216599.1| hypothetical protein Rv2083 [Mycoba... 34.3 5.8
gi|298525583|ref|ZP_07012992.1| conserved hypothetical protein [... 34.3 6.0
gi|15841576|ref|NP_336613.1| hypothetical protein MT2145 [Mycoba... 34.3 6.0
>gi|326905731|gb|EGE52664.1| hypothetical protein TBPG_03696 [Mycobacterium tuberculosis W-148]
Length=350
Score = 214 bits (544), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 110/110 (100%), Positives = 110/110 (100%), Gaps = 0/110 (0%)
Query 1 MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA 60
MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA
Sbjct 22 MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA 81
Query 61 GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA 110
GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA
Sbjct 82 GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA 131
>gi|289748435|ref|ZP_06507813.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289689022|gb|EFD56451.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
Length=176
Score = 213 bits (543), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 110/110 (100%), Positives = 110/110 (100%), Gaps = 0/110 (0%)
Query 1 MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA 60
MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA
Sbjct 1 MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA 60
Query 61 GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA 110
GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA
Sbjct 61 GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA 110
>gi|289441339|ref|ZP_06431083.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289414258|gb|EFD11498.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
Length=329
Score = 213 bits (543), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 110/110 (100%), Positives = 110/110 (100%), Gaps = 0/110 (0%)
Query 1 MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA 60
MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA
Sbjct 1 MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA 60
Query 61 GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA 110
GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA
Sbjct 61 GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA 110
>gi|289747742|ref|ZP_06507120.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|289688270|gb|EFD55758.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|339296704|gb|AEJ48815.1| hypothetical protein CCDC5079_3626 [Mycobacterium tuberculosis
CCDC5079]
gi|339300296|gb|AEJ52406.1| hypothetical protein CCDC5180_3569 [Mycobacterium tuberculosis
CCDC5180]
Length=331
Score = 213 bits (543), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 110/110 (100%), Positives = 110/110 (100%), Gaps = 0/110 (0%)
Query 1 MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA 60
MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA
Sbjct 3 MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA 62
Query 61 GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA 110
GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA
Sbjct 63 GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA 112
>gi|340628869|ref|YP_004747321.1| hypothetical protein MCAN_39191 [Mycobacterium canettii CIPT
140010059]
gi|340007059|emb|CCC46250.1| putative uncharacterized protein bcg_3954c [Mycobacterium canettii
CIPT 140010059]
Length=329
Score = 213 bits (542), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 110/110 (100%), Positives = 110/110 (100%), Gaps = 0/110 (0%)
Query 1 MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA 60
MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA
Sbjct 1 MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA 60
Query 61 GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA 110
GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA
Sbjct 61 GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA 110
>gi|298527370|ref|ZP_07014779.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|298497164|gb|EFI32458.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
Length=280
Score = 213 bits (542), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 110/110 (100%), Positives = 110/110 (100%), Gaps = 0/110 (0%)
Query 1 MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA 60
MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA
Sbjct 1 MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA 60
Query 61 GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA 110
GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA
Sbjct 61 GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA 110
>gi|31795071|ref|NP_857564.1| hypothetical protein Mb3927c [Mycobacterium bovis AF2122/97]
gi|121639809|ref|YP_980033.1| hypothetical protein BCG_3954c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|224992304|ref|YP_002646994.1| hypothetical protein JTY_3956 [Mycobacterium bovis BCG str. Tokyo
172]
8 more sequence titles
Length=329
Score = 213 bits (542), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 110/110 (100%), Positives = 110/110 (100%), Gaps = 0/110 (0%)
Query 1 MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA 60
MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA
Sbjct 1 MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA 60
Query 61 GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA 110
GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA
Sbjct 61 GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA 110
>gi|289572154|ref|ZP_06452381.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289545909|gb|EFD49556.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
Length=329
Score = 213 bits (542), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 110/110 (100%), Positives = 110/110 (100%), Gaps = 0/110 (0%)
Query 1 MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA 60
MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA
Sbjct 3 MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA 62
Query 61 GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA 110
GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA
Sbjct 63 GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA 112
>gi|253800948|ref|YP_003033950.1| hypothetical protein TBMG_03946 [Mycobacterium tuberculosis KZN
1435]
gi|254233384|ref|ZP_04926710.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
gi|254548902|ref|ZP_05139349.1| hypothetical protein Mtube_00295 [Mycobacterium tuberculosis
'98-R604 INH-RIF-EM']
7 more sequence titles
Length=112
Score = 211 bits (537), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 110/110 (100%), Positives = 110/110 (100%), Gaps = 0/110 (0%)
Query 1 MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA 60
MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA
Sbjct 3 MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA 62
Query 61 GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA 110
GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA
Sbjct 63 GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA 112
>gi|15611034|ref|NP_218415.1| hypothetical protein Rv3898c [Mycobacterium tuberculosis H37Rv]
gi|15843529|ref|NP_338566.1| hypothetical protein MT4014 [Mycobacterium tuberculosis CDC1551]
gi|148663765|ref|YP_001285288.1| hypothetical protein MRA_3937 [Mycobacterium tuberculosis H37Ra]
24 more sequence titles
Length=110
Score = 211 bits (537), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 110/110 (100%), Positives = 110/110 (100%), Gaps = 0/110 (0%)
Query 1 MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA 60
MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA
Sbjct 1 MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA 60
Query 61 GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA 110
GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA
Sbjct 61 GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA 110
>gi|339633888|ref|YP_004725530.1| hypothetical protein MAF_39130 [Mycobacterium africanum GM041182]
gi|339333244|emb|CCC28981.1| conserved hypothetical protein [Mycobacterium africanum GM041182]
Length=111
Score = 211 bits (536), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 110/110 (100%), Positives = 110/110 (100%), Gaps = 0/110 (0%)
Query 1 MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA 60
MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA
Sbjct 1 MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAA 60
Query 61 GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA 110
GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA
Sbjct 61 GNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA 110
>gi|289760070|ref|ZP_06519448.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
gi|289715634|gb|EFD79646.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
Length=305
Score = 166 bits (420), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 85/86 (99%), Positives = 86/86 (100%), Gaps = 0/86 (0%)
Query 25 LQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAAGNSGVPGDVESSGLEDLDRRAHAA 84
+QVLTTPPASGPAPFPAVPVDLPAPADIANGALFAAGNSGVPGDVESSGLEDLDRRAHAA
Sbjct 1 MQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAAGNSGVPGDVESSGLEDLDRRAHAA 60
Query 85 DAVQKFSANEADAAQQFQGVGAQAEA 110
DAVQKFSANEADAAQQFQGVGAQAEA
Sbjct 61 DAVQKFSANEADAAQQFQGVGAQAEA 86
>gi|289756032|ref|ZP_06515410.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
gi|289696619|gb|EFD64048.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
Length=305
Score = 166 bits (419), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 85/86 (99%), Positives = 86/86 (100%), Gaps = 0/86 (0%)
Query 25 LQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAAGNSGVPGDVESSGLEDLDRRAHAA 84
+QVLTTPPASGPAPFPAVPVDLPAPADIANGALFAAGNSGVPGDVESSGLEDLDRRAHAA
Sbjct 1 MQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAAGNSGVPGDVESSGLEDLDRRAHAA 60
Query 85 DAVQKFSANEADAAQQFQGVGAQAEA 110
DAVQKFSANEADAAQQFQGVGAQAEA
Sbjct 61 DAVQKFSANEADAAQQFQGVGAQAEA 86
>gi|240168378|ref|ZP_04747037.1| hypothetical protein MkanA1_03647 [Mycobacterium kansasii ATCC
12478]
Length=343
Score = 105 bits (263), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 73/123 (60%), Positives = 78/123 (64%), Gaps = 15/123 (12%)
Query 1 MTGDQNPA-------------PGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLP 47
MTGDQNP P P+ GVPIKVTPEIL QVL P S P VP P
Sbjct 1 MTGDQNPFPFPFPGPGVPAIPPSPS-GVPIKVTPEILQQVLYGP-LSPPVEPNIVPPPGP 58
Query 48 APADIANGALFAAGNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQGVGAQ 107
APAD AL AAGN GV GD SS +DLDRRAHAADA +KF NE+DAAQQFQGVG+Q
Sbjct 59 APADQGQSALAAAGNLGVLGDAASSAQDDLDRRAHAADAGRKFQTNESDAAQQFQGVGSQ 118
Query 108 AEA 110
A A
Sbjct 119 AGA 121
>gi|333988703|ref|YP_004521317.1| hypothetical protein JDM601_0063 [Mycobacterium sp. JDM601]
gi|333484671|gb|AEF34063.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=281
Score = 39.7 bits (91), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 16/36 (45%), Positives = 26/36 (73%), Gaps = 0/36 (0%)
Query 75 EDLDRRAHAADAVQKFSANEADAAQQFQGVGAQAEA 110
+ +R+A A+DA +KF+ NEADA +F+GV + +A
Sbjct 24 DTTERKAGASDAARKFATNEADAVSKFEGVAGEGDA 59
>gi|183983066|ref|YP_001851357.1| hypothetical protein MMAR_3067 [Mycobacterium marinum M]
gi|183176392|gb|ACC41502.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=322
Score = 37.0 bits (84), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 18/22 (82%), Positives = 18/22 (82%), Gaps = 0/22 (0%)
Query 89 KFSANEADAAQQFQGVGAQAEA 110
KF ANEADAAQQFQGVGA A
Sbjct 74 KFPANEADAAQQFQGVGADGMA 95
>gi|118617824|ref|YP_906156.1| hypothetical protein MUL_2312 [Mycobacterium ulcerans Agy99]
gi|118569934|gb|ABL04685.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=322
Score = 37.0 bits (84), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 18/22 (82%), Positives = 18/22 (82%), Gaps = 0/22 (0%)
Query 89 KFSANEADAAQQFQGVGAQAEA 110
KF ANEADAAQQFQGVGA A
Sbjct 74 KFPANEADAAQQFQGVGADGMA 95
>gi|41410423|ref|NP_963259.1| hypothetical protein MAP4325c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|118465693|ref|YP_884398.1| hypothetical protein MAV_5287 [Mycobacterium avium 104]
gi|41399257|gb|AAS06875.1| hypothetical protein MAP_4325c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|118166980|gb|ABK67877.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=306
Score = 36.2 bits (82), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 35/76 (47%), Positives = 45/76 (60%), Gaps = 1/76 (1%)
Query 27 VLTTPPASGPAPFPAVPVDLPAPADIANGALFAAGNSGVPGDVESSGLEDLDRRAHAADA 86
++ PPAS P PFP P +PAD G + AGN+ VP D + D DR+A AADA
Sbjct 18 IVGGPPASIPRPFPVPPGGT-SPADAGQGMVGGAGNANVPKDAADAAAADADRKARAADA 76
Query 87 VQKFSANEADAAQQFQ 102
KF ANEA+ +Q+ Q
Sbjct 77 AAKFPANEANGSQEMQ 92
>gi|195382089|ref|XP_002049764.1| GJ20572 [Drosophila virilis]
gi|194144561|gb|EDW60957.1| GJ20572 [Drosophila virilis]
Length=234
Score = 35.4 bits (80), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 28/94 (30%), Positives = 42/94 (45%), Gaps = 2/94 (2%)
Query 7 PAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAAGNSGVP 66
P P P P PI+V ++L LT+ PA G A+ + P G +FAAGN+
Sbjct 32 PRPTPQP-RPIRVRRQVLGGSLTSNPAGGSDARLALSKGIGTPDHNVIGQVFAAGNT-QK 89
Query 67 GDVESSGLEDLDRRAHAADAVQKFSANEADAAQQ 100
G + + G + H D + + D+ QQ
Sbjct 90 GPITTGGSVAYNNHGHGFDLTKTHTPGVQDSFQQ 123
>gi|195382091|ref|XP_002049765.1| GJ20571 [Drosophila virilis]
gi|194144562|gb|EDW60958.1| GJ20571 [Drosophila virilis]
Length=234
Score = 35.4 bits (80), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 28/94 (30%), Positives = 42/94 (45%), Gaps = 2/94 (2%)
Query 7 PAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPADIANGALFAAGNSGVP 66
P P P P PI+V ++L LT+ PA G A+ + P G +FAAGN+
Sbjct 32 PRPTPQP-RPIRVRRQVLGGSLTSNPAGGSDARLALSKGIGTPDHNVIGQVFAAGNT-QK 89
Query 67 GDVESSGLEDLDRRAHAADAVQKFSANEADAAQQ 100
G + + G + H D + + D+ QQ
Sbjct 90 GPITTGGSVAYNNHGHGFDLTKTHTPGVQDSFQQ 123
>gi|31793266|ref|NP_855759.1| hypothetical protein Mb2109 [Mycobacterium bovis AF2122/97]
gi|121637968|ref|YP_978192.1| hypothetical protein BCG_2102 [Mycobacterium bovis BCG str. Pasteur
1173P2]
gi|224990462|ref|YP_002645149.1| hypothetical protein JTY_2096 [Mycobacterium bovis BCG str. Tokyo
172]
16 more sequence titles
Length=314
Score = 35.0 bits (79), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 40/81 (50%), Positives = 46/81 (57%), Gaps = 8/81 (9%)
Query 35 GPAPFPAVPVDLPAPADI-ANGALFAAGNS----GVPGDVESSGLEDLDRRAHAADAVQK 89
GP P PV P I ALF + G D+E +D DRRAHAADA +K
Sbjct 19 GPVPLALGPVHPGGPTLIDLLMALFGLSTNADLGGTNADIEG---DDTDRRAHAADAARK 75
Query 90 FSANEADAAQQFQGVGAQAEA 110
FSANEA+AA+Q QGVGAQ A
Sbjct 76 FSANEANAAEQMQGVGAQGMA 96
>gi|289447704|ref|ZP_06437448.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
gi|289420662|gb|EFD17863.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
Length=1033
Score = 35.0 bits (79), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 40/81 (50%), Positives = 46/81 (57%), Gaps = 8/81 (9%)
Query 35 GPAPFPAVPVDLPAPADI-ANGALFAAGNS----GVPGDVESSGLEDLDRRAHAADAVQK 89
GP P PV P I ALF + G D+E +D DRRAHAADA +K
Sbjct 738 GPVPLALGPVHPGGPTLIDLLMALFGLSTNADLGGTNADIEG---DDTDRRAHAADAARK 794
Query 90 FSANEADAAQQFQGVGAQAEA 110
FSANEA+AA+Q QGVGAQ A
Sbjct 795 FSANEANAAEQMQGVGAQGMA 815
>gi|323719381|gb|EGB28520.1| hypothetical protein TMMG_01360 [Mycobacterium tuberculosis CDC1551A]
Length=255
Score = 34.7 bits (78), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 40/81 (50%), Positives = 46/81 (57%), Gaps = 8/81 (9%)
Query 35 GPAPFPAVPVDLPAPADI-ANGALFAAGNS----GVPGDVESSGLEDLDRRAHAADAVQK 89
GP P PV P I ALF + G D+E +D DRRAHAADA +K
Sbjct 19 GPVPLALGPVHPGGPTLIDLLMALFGLSTNADLGGANADIEG---DDTDRRAHAADAARK 75
Query 90 FSANEADAAQQFQGVGAQAEA 110
FSANEA+AA+Q QGVGAQ A
Sbjct 76 FSANEANAAEQMQGVGAQGMA 96
>gi|308376907|ref|ZP_07668376.1| hypothetical protein TMHG_01272 [Mycobacterium tuberculosis SUMu008]
gi|308349563|gb|EFP38414.1| hypothetical protein TMHG_01272 [Mycobacterium tuberculosis SUMu008]
Length=255
Score = 34.7 bits (78), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 40/81 (50%), Positives = 46/81 (57%), Gaps = 8/81 (9%)
Query 35 GPAPFPAVPVDLPAPADI-ANGALFAAGNS----GVPGDVESSGLEDLDRRAHAADAVQK 89
GP P PV P I ALF + G D+E +D DRRAHAADA +K
Sbjct 19 GPVPLALGPVHPGGPTLIDLLMALFGLSTNADLGGANADIEG---DDTDRRAHAADAARK 75
Query 90 FSANEADAAQQFQGVGAQAEA 110
FSANEA+AA+Q QGVGAQ A
Sbjct 76 FSANEANAAEQMQGVGAQGMA 96
>gi|254232248|ref|ZP_04925575.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
gi|124601307|gb|EAY60317.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
Length=178
Score = 34.3 bits (77), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 40/81 (50%), Positives = 46/81 (57%), Gaps = 8/81 (9%)
Query 35 GPAPFPAVPVDLPAPADI-ANGALFAAGNS----GVPGDVESSGLEDLDRRAHAADAVQK 89
GP P PV P I ALF + G D+E +D DRRAHAADA +K
Sbjct 19 GPVPLALGPVHPGGPTLIDLLMALFGLSTNADLGGANADIEG---DDTDRRAHAADAARK 75
Query 90 FSANEADAAQQFQGVGAQAEA 110
FSANEA+AA+Q QGVGAQ A
Sbjct 76 FSANEANAAEQMQGVGAQGMA 96
>gi|15609220|ref|NP_216599.1| hypothetical protein Rv2083 [Mycobacterium tuberculosis H37Rv]
gi|148661899|ref|YP_001283422.1| hypothetical protein MRA_2099 [Mycobacterium tuberculosis H37Ra]
gi|167966798|ref|ZP_02549075.1| hypothetical protein MtubH3_01505 [Mycobacterium tuberculosis
H37Ra]
gi|307084728|ref|ZP_07493841.1| hypothetical protein TMLG_03576 [Mycobacterium tuberculosis SUMu012]
gi|1731339|sp|Q10691.1|Y2083_MYCTU RecName: Full=Uncharacterized protein Rv2083/MT2145
gi|1370249|emb|CAA98195.1| CONSERVED HYPOTHETICAL PROTEIN [Mycobacterium tuberculosis H37Rv]
gi|148506051|gb|ABQ73860.1| hypothetical protein MRA_2099 [Mycobacterium tuberculosis H37Ra]
gi|308365710|gb|EFP54561.1| hypothetical protein TMLG_03576 [Mycobacterium tuberculosis SUMu012]
Length=314
Score = 34.3 bits (77), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 40/81 (50%), Positives = 46/81 (57%), Gaps = 8/81 (9%)
Query 35 GPAPFPAVPVDLPAPADI-ANGALFAAGNS----GVPGDVESSGLEDLDRRAHAADAVQK 89
GP P PV P I ALF + G D+E +D DRRAHAADA +K
Sbjct 19 GPVPLALGPVHPGGPTLIDLLMALFGLSTNADLGGANADIEG---DDTDRRAHAADAARK 75
Query 90 FSANEADAAQQFQGVGAQAEA 110
FSANEA+AA+Q QGVGAQ A
Sbjct 76 FSANEANAAEQMQGVGAQGMA 96
>gi|298525583|ref|ZP_07012992.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|298495377|gb|EFI30671.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
Length=314
Score = 34.3 bits (77), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 40/81 (50%), Positives = 46/81 (57%), Gaps = 8/81 (9%)
Query 35 GPAPFPAVPVDLPAPADI-ANGALFAAGNS----GVPGDVESSGLEDLDRRAHAADAVQK 89
GP P PV P I ALF + G D+E +D DRRAHAADA +K
Sbjct 19 GPVPLALGPVHPGGPTLIDLLMALFGLSTNADLGGANADIEG---DDTDRRAHAADAARK 75
Query 90 FSANEADAAQQFQGVGAQAEA 110
FSANEA+AA+Q QGVGAQ A
Sbjct 76 FSANEANAAEQMQGVGAQGMA 96
>gi|15841576|ref|NP_336613.1| hypothetical protein MT2145 [Mycobacterium tuberculosis CDC1551]
gi|148823299|ref|YP_001288053.1| hypothetical protein TBFG_12120 [Mycobacterium tuberculosis F11]
gi|253798858|ref|YP_003031859.1| hypothetical protein TBMG_01898 [Mycobacterium tuberculosis KZN
1435]
35 more sequence titles
Length=314
Score = 34.3 bits (77), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 40/81 (50%), Positives = 46/81 (57%), Gaps = 8/81 (9%)
Query 35 GPAPFPAVPVDLPAPADI-ANGALFAAGNS----GVPGDVESSGLEDLDRRAHAADAVQK 89
GP P PV P I ALF + G D+E +D DRRAHAADA +K
Sbjct 19 GPVPLALGPVHPGGPTLIDLLMALFGLSTNADLGGANADIEG---DDTDRRAHAADAARK 75
Query 90 FSANEADAAQQFQGVGAQAEA 110
FSANEA+AA+Q QGVGAQ A
Sbjct 76 FSANEANAAEQMQGVGAQGMA 96
Lambda K H
0.310 0.130 0.379
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 129022162688
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40