BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2077A
Length=99
                                                                   Score    E
Sequences producing significant alignments:                        (Bits) Value
gi|15841567|ref|NP_336604.1| hypothetical protein MT2138 [Mycoba... 182 1e-44
gi|118466069|ref|YP_882823.1| hypothetical protein MAV_3646 [Myc... 100 5e-20
gi|183980151|ref|YP_001848442.1| hypothetical protein MMAR_0117 ... 79.7 1e-13
gi|41408569|ref|NP_961405.1| hypothetical protein MAP2471 [Mycob... 74.7 4e-12
gi|254774329|ref|ZP_05215845.1| hypothetical protein MaviaA2_066... 74.7 5e-12
gi|118465025|ref|YP_880721.1| hypothetical protein MAV_1480 [Myc... 73.9 7e-12
gi|183984078|ref|YP_001852369.1| hypothetical protein MMAR_4107 ... 72.0 3e-11
gi|183984031|ref|YP_001852322.1| hypothetical protein MMAR_4060 ... 67.4 7e-10
gi|342858516|ref|ZP_08715171.1| hypothetical protein MCOL_06561 ... 66.6 1e-09
gi|240172764|ref|ZP_04751423.1| hypothetical protein MkanA1_2584... 61.2 5e-08
gi|183980509|ref|YP_001848800.1| hypothetical protein MMAR_0480 ... 60.5 7e-08
gi|118463827|ref|YP_879970.1| hypothetical protein MAV_0691 [Myc... 55.1 4e-06
gi|118467080|ref|YP_881457.1| hypothetical protein MAV_2253 [Myc... 48.9 3e-04
gi|240173426|ref|ZP_04752084.1| hypothetical protein MkanA1_2919... 40.4 0.082
gi|41407316|ref|NP_960152.1| hypothetical protein MAP1218c [Myco... 37.0 1.1
gi|340626960|ref|YP_004745412.1| hypothetical protein MCAN_19671... 35.0 3.5
gi|15609088|ref|NP_216467.1| hypothetical protein Rv1951c [Mycob... 34.7 4.3
>gi|15841567|ref|NP_336604.1| hypothetical protein MT2138 [Mycobacterium tuberculosis CDC1551]
gi|31793260|ref|NP_855753.1| hypothetical protein Mb2103c [Mycobacterium bovis AF2122/97]
gi|57116940|ref|YP_177658.1| hypothetical protein Rv2077A [Mycobacterium tuberculosis H37Rv]
68 more sequence titles
Length=99
Score = 182 bits (463), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 98/99 (99%), Positives = 99/99 (100%), Gaps = 0/99 (0%)
Query 1 VGSNELQVVLGQLEVAASQSQGLGAQFAASATPPESGQPFQATTVAVSGINAAICCAAAE 60
+GSNELQVVLGQLEVAASQSQGLGAQFAASATPPESGQPFQATTVAVSGINAAICCAAAE
Sbjct 1 MGSNELQVVLGQLEVAASQSQGLGAQFAASATPPESGQPFQATTVAVSGINAAICCAAAE 60
Query 61 FATRTQATATGVAAAAAAYAHQEATAASEMAAVTQVTVV 99
FATRTQATATGVAAAAAAYAHQEATAASEMAAVTQVTVV
Sbjct 61 FATRTQATATGVAAAAAAYAHQEATAASEMAAVTQVTVV 99
>gi|118466069|ref|YP_882823.1| hypothetical protein MAV_3646 [Mycobacterium avium 104]
gi|118167356|gb|ABK68253.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=100
Score = 100 bits (250), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 67/99 (68%), Positives = 73/99 (74%), Gaps = 1/99 (1%)
Query 1 VGSNELQVVLGQLEVAASQSQGLGAQFAASATPPESGQPFQATTVAVSGINAAICCAAAE 60
VGS+ELQVVL QL AA Q QGL AQ A A PP GQPFQATT AVSG+NAAI A A
Sbjct 3 VGSDELQVVLDQLRAAAGQWQGLSAQLAEVA-PPSPGQPFQATTAAVSGVNAAIAVARAA 61
Query 61 FATRTQATATGVAAAAAAYAHQEATAASEMAAVTQVTVV 99
FA+R QA A V +AAA YA+QEAT A EMAAVT+V VV
Sbjct 62 FASRIQAEAARVTSAAADYANQEATGAGEMAAVTRVAVV 100
>gi|183980151|ref|YP_001848442.1| hypothetical protein MMAR_0117 [Mycobacterium marinum M]
gi|183173477|gb|ACC38587.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=156
Score = 79.7 bits (195), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 70/99 (71%), Positives = 73/99 (74%), Gaps = 0/99 (0%)
Query 1 VGSNELQVVLGQLEVAASQSQGLGAQFAASATPPESGQPFQATTVAVSGINAAICCAAAE 60
VGS+EL V LGQL V ASQ Q L AQ AA ATPP GQPFQAT AVSGI AAI
Sbjct 58 VGSDELHVKLGQLGVTASQWQDLRAQLAAGATPPAPGQPFQATAAAVSGIGAAIGGTGVA 117
Query 61 FATRTQATATGVAAAAAAYAHQEATAASEMAAVTQVTVV 99
ATRTQATA V AAAA+YA+QEATAA EMAAVTQV VV
Sbjct 118 LATRTQATAAAVTAAAASYANQEATAAGEMAAVTQVRVV 156
>gi|41408569|ref|NP_961405.1| hypothetical protein MAP2471 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41396927|gb|AAS04788.1| hypothetical protein MAP_2471 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|336460918|gb|EGO39801.1| hypothetical protein MAPs_35840 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=97
Score = 74.7 bits (182), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 47/99 (48%), Positives = 59/99 (60%), Gaps = 2/99 (2%)
Query 1 VGSNELQVVLGQLEVAASQSQGLGAQFAASATPPESGQPFQATTVAVSGINAAICCAAAE 60
+G + LQVV +L A+Q Q L +QF + PP G F+ATT AV+ +NAAI AA
Sbjct 1 MGQDSLQVVPAELVATAAQWQALSSQFIGA--PPSPGPSFEATTAAVNALNAAIGATAAS 58
Query 61 FATRTQATATGVAAAAAAYAHQEATAASEMAAVTQVTVV 99
F RTQ T GV A+ Y QEAT+A+ M VT V VV
Sbjct 59 FVARTQETVGGVTTASGGYLSQEATSAAAMNDVTNVRVV 97
>gi|254774329|ref|ZP_05215845.1| hypothetical protein MaviaA2_06630 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=97
Score = 74.7 bits (182), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 47/99 (48%), Positives = 59/99 (60%), Gaps = 2/99 (2%)
Query 1 VGSNELQVVLGQLEVAASQSQGLGAQFAASATPPESGQPFQATTVAVSGINAAICCAAAE 60
+G + LQVV +L A+Q Q L +QF + PP G F+ATT AV+ +NAAI AA
Sbjct 1 MGQDSLQVVPAELVATAAQWQALSSQFIGA--PPSPGLSFEATTAAVNALNAAIGATAAS 58
Query 61 FATRTQATATGVAAAAAAYAHQEATAASEMAAVTQVTVV 99
F RTQ T GV A+ Y QEAT+A+ M VT V VV
Sbjct 59 FVARTQETVGGVTTASGGYLSQEATSAAAMNDVTNVRVV 97
>gi|118465025|ref|YP_880721.1| hypothetical protein MAV_1480 [Mycobacterium avium 104]
gi|118465919|ref|YP_880693.1| hypothetical protein MAV_1451 [Mycobacterium avium 104]
gi|118166312|gb|ABK67209.1| conserved hypothetical protein [Mycobacterium avium 104]
gi|118167206|gb|ABK68103.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=97
Score = 73.9 bits (180), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 46/99 (47%), Positives = 58/99 (59%), Gaps = 2/99 (2%)
Query 1 VGSNELQVVLGQLEVAASQSQGLGAQFAASATPPESGQPFQATTVAVSGINAAICCAAAE 60
+G + LQVV +L A+Q Q L +QF + PP G F+ATT AV+ +NAAI A
Sbjct 1 MGQDSLQVVPAELVATAAQWQALSSQFIGA--PPSPGPSFEATTAAVNALNAAIAATTAS 58
Query 61 FATRTQATATGVAAAAAAYAHQEATAASEMAAVTQVTVV 99
F RTQ T GV A+ Y QEAT+A+ M VT V VV
Sbjct 59 FVARTQETVGGVTTASGGYLSQEATSAAAMNDVTNVRVV 97
>gi|183984078|ref|YP_001852369.1| hypothetical protein MMAR_4107 [Mycobacterium marinum M]
gi|183177404|gb|ACC42514.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=98
Score = 72.0 bits (175), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 52/94 (56%), Positives = 63/94 (68%), Gaps = 2/94 (2%)
Query 6 LQVVLGQLEVAASQSQGLGAQFAASATPPESGQPFQATTVAVSGINAAICCAAAEFATRT 65
LQVV +L A+Q L +Q TPP SGQPFQATT A++ +NAAI AAA F RT
Sbjct 7 LQVVPAELAATAAQWSALSSQLVG--TPPTSGQPFQATTAAMNAVNAAIDVAAAAFTART 64
Query 66 QATATGVAAAAAAYAHQEATAASEMAAVTQVTVV 99
Q TA+GV AA+ Y QEA +A+EM A+T VTVV
Sbjct 65 QTTASGVTAASGGYTAQEAASAAEMGAITGVTVV 98
>gi|183984031|ref|YP_001852322.1| hypothetical protein MMAR_4060 [Mycobacterium marinum M]
gi|183177357|gb|ACC42467.1| conserved hypothetical secreted protein [Mycobacterium marinum
M]
Length=99
Score = 67.4 bits (163), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 59/99 (60%), Positives = 68/99 (69%), Gaps = 0/99 (0%)
Query 1 VGSNELQVVLGQLEVAASQSQGLGAQFAASATPPESGQPFQATTVAVSGINAAICCAAAE 60
+G++ QL V ASQ +GLGAQ A PP G PFQATT AVSGI++AI A
Sbjct 1 MGTDRTTGRTDQLGVTASQRRGLGAQLTRGAAPPAPGHPFQATTAAVSGIDSAIGATATA 60
Query 61 FATRTQATATGVAAAAAAYAHQEATAASEMAAVTQVTVV 99
ATRTQATA VAAAAA YA+QEA+AA EMAAV Q+ VV
Sbjct 61 LATRTQATAAAVAAAAARYANQEASAADEMAAVAQLKVV 99
>gi|342858516|ref|ZP_08715171.1| hypothetical protein MCOL_06561 [Mycobacterium colombiense CECT
3035]
gi|342134220|gb|EGT87400.1| hypothetical protein MCOL_06561 [Mycobacterium colombiense CECT
3035]
Length=97
Score = 66.6 bits (161), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 53/99 (54%), Positives = 63/99 (64%), Gaps = 2/99 (2%)
Query 1 VGSNELQVVLGQLEVAASQSQGLGAQFAASATPPESGQPFQATTVAVSGINAAICCAAAE 60
+G + LQVV +L A Q L +Q A PP GQPFQATT AV+ +NAAI AA
Sbjct 1 MGQDSLQVVPAELAAIAGQWGALTSQLAG--VPPVPGQPFQATTAAVNAVNAAIGVTAAS 58
Query 61 FATRTQATATGVAAAAAAYAHQEATAASEMAAVTQVTVV 99
FA RTQ T GV AA Y QEAT+A+++AAVT VTVV
Sbjct 59 FAARTQETVGGVTQAAGGYTAQEATSAADIAAVTGVTVV 97
>gi|240172764|ref|ZP_04751423.1| hypothetical protein MkanA1_25842 [Mycobacterium kansasii ATCC
12478]
Length=202
Score = 61.2 bits (147), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 42/96 (44%), Positives = 53/96 (56%), Gaps = 1/96 (1%)
Query 3 SNELQVVLGQLEVAASQSQGLGAQFAASATPPESGQPFQATTVAVSGINAAICCAAAEFA 62
+ EL+V + QL +ASQ + L +F+ A+P G+P Q TT AV G + A+ AA
Sbjct 108 TGELRVDVYQLRASASQWRELSTRFSVLASP-TPGRPCQPTTAAVGGAHTAVGLAAEVLT 166
Query 63 TRTQATATGVAAAAAAYAHQEATAASEMAAVTQVTV 98
RTQAT V A A Y E TAA EMAAV V
Sbjct 167 IRTQATTGAVKAGAEGYGSNEVTAAGEMAAVRPRMV 202
>gi|183980509|ref|YP_001848800.1| hypothetical protein MMAR_0480 [Mycobacterium marinum M]
gi|183173835|gb|ACC38945.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=97
Score = 60.5 bits (145), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 41/89 (47%), Positives = 50/89 (57%), Gaps = 1/89 (1%)
Query 5 ELQVVLGQLEVAASQSQGLGAQFAASATPPESGQPFQATTVAVSGINAAICCAAAEFATR 64
+L+V + QLE A + A AA TPP GQPFQ TT AVS + A+ AAA T
Sbjct 5 QLRVEVPQLEATAGEWSQRSATLAA-LTPPSPGQPFQPTTAAVSSAHTAVGLAAAALTTH 63
Query 65 TQATATGVAAAAAAYAHQEATAASEMAAV 93
TQ T V + A YA E T+ +EMAAV
Sbjct 64 TQETIVAVESGATRYATNETTSPAEMAAV 92
>gi|118463827|ref|YP_879970.1| hypothetical protein MAV_0691 [Mycobacterium avium 104]
gi|118165114|gb|ABK66011.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=70
Score = 55.1 bits (131), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 41/67 (62%), Positives = 46/67 (69%), Gaps = 0/67 (0%)
Query 33 PPESGQPFQATTVAVSGINAAICCAAAEFATRTQATATGVAAAAAAYAHQEATAASEMAA 92
PP GQ FQATT AV+ +NAAI AAA F RTQAT GV AA Y QEATAA +M+
Sbjct 4 PPAPGQSFQATTAAVNAVNAAIGVAAASFTARTQATVGGVTQAAGGYTAQEATAAGQMSN 63
Query 93 VTQVTVV 99
+T VTVV
Sbjct 64 ITGVTVV 70
>gi|118467080|ref|YP_881457.1| hypothetical protein MAV_2253 [Mycobacterium avium 104]
gi|118168367|gb|ABK69264.1| hypothetical protein MAV_2253 [Mycobacterium avium 104]
Length=53
Score = 48.9 bits (115), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/50 (58%), Positives = 35/50 (70%), Gaps = 0/50 (0%)
Query 50 INAAICCAAAEFATRTQATATGVAAAAAAYAHQEATAASEMAAVTQVTVV 99
+NAAI AA F RTQ T GV AAA Y QEAT+A+++AA+T VTVV
Sbjct 4 VNAAIGVTAAAFTARTQETVGGVTAAAGGYTAQEATSAADIAAITGVTVV 53
>gi|240173426|ref|ZP_04752084.1| hypothetical protein MkanA1_29196 [Mycobacterium kansasii ATCC
12478]
Length=97
Score = 40.4 bits (93), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 35/79 (45%), Positives = 44/79 (56%), Gaps = 6/79 (7%)
Query 11 GQLEVAASQSQGLGAQFAASAT------PPESGQPFQATTVAVSGINAAICCAAAEFATR 64
GQL + Q +G+ +Q+ + PP GQPFQ TT AV G +AA+ AAA R
Sbjct 4 GQLRIDVPQLEGVASQWGQRSLELAVLAPPSLGQPFQRTTAAVRGAHAAVEFAAAALLAR 63
Query 65 TQATATGVAAAAAAYAHQE 83
TQATA+ V A A YA E
Sbjct 64 TQATASTVQAGATGYASNE 82
>gi|41407316|ref|NP_960152.1| hypothetical protein MAP1218c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41395668|gb|AAS03535.1| hypothetical protein MAP_1218c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|336458046|gb|EGO37033.1| hypothetical protein MAPs_17440 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=97
Score = 37.0 bits (84), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 32/90 (36%), Positives = 50/90 (56%), Gaps = 6/90 (6%)
Query 10 LGQLEVAASQSQGLGAQFAASA------TPPESGQPFQATTVAVSGINAAICCAAAEFAT 63
+ +LEV +++ + L ++ +A TPP SG +Q + VAV +AA+ AA
Sbjct 1 MDRLEVTSAELRMLSGKWHTNAARLRVATPPPSGMSYQPSAVAVDAAHAAVEVAANSLIG 60
Query 64 RTQATATGVAAAAAAYAHQEATAASEMAAV 93
R TAT VAAA +Y EA +A +M+A+
Sbjct 61 RMIETATKVAAADFSYTANEADSADKMSAI 90
>gi|340626960|ref|YP_004745412.1| hypothetical protein MCAN_19671 [Mycobacterium canettii CIPT
140010059]
gi|340005150|emb|CCC44299.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=98
Score = 35.0 bits (79), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 28/62 (46%), Positives = 35/62 (57%), Gaps = 1/62 (1%)
Query 5 ELQVVLGQLEVAASQSQGLGAQFAASATPPESGQPFQATTVAVSGINAAICCAAAEFATR 64
EL+V + Q+ ASQ G + + A PP GQPFQ TT AV G +AA+ A A F R
Sbjct 5 ELRVNIQQVAATASQWSGRSTELSVLA-PPPLGQPFQPTTAAVGGAHAAVGLAVAAFTAR 63
Query 65 TQ 66
T
Sbjct 64 TH 65
>gi|15609088|ref|NP_216467.1| hypothetical protein Rv1951c [Mycobacterium tuberculosis H37Rv]
gi|31793143|ref|NP_855636.1| hypothetical protein Mb1986c [Mycobacterium bovis AF2122/97]
gi|121637856|ref|YP_978079.1| hypothetical protein BCG_1990c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
74 more sequence titles
Length=98
Score = 34.7 bits (78), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 28/62 (46%), Positives = 35/62 (57%), Gaps = 1/62 (1%)
Query 5 ELQVVLGQLEVAASQSQGLGAQFAASATPPESGQPFQATTVAVSGINAAICCAAAEFATR 64
EL+V + Q+ ASQ G + + A PP GQPFQ TT AV G +AA+ A A F R
Sbjct 5 ELRVNIQQVAATASQWSGRSTELSVLA-PPPLGQPFQPTTAAVGGAHAAVGLAVAAFTAR 63
Query 65 TQ 66
T
Sbjct 64 TH 65
Lambda K H
0.310 0.116 0.303
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 129711308684
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
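
Note on reading the scores: the bit scores and E-values listed above can be approximately reproduced from the raw scores (the numbers in parentheses on each "Score =" line) using the gapped Lambda and K and the effective search space printed in this report. The sketch below uses the standard Karlin-Altschul relations S' = (Lambda*S - ln K) / ln 2 and E = N * 2^(-S'). The function and constant names are illustrative, not part of BLAST. It assumes the rounded values printed here are accurate enough, and it ignores any per-alignment effect of compositional matrix adjustment, so small deviations from the reported figures are expected.

```python
import math

# Values copied from the "Gapped" statistics block and footer of this report.
# They are printed rounded, so everything computed from them is approximate.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 129711308684


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits: S' = (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=EFFECTIVE_SEARCH_SPACE):
    """Estimate the E-value as search_space * 2**(-S')."""
    return search_space * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    # Raw scores taken from three of the alignments above.
    for raw in (463, 195, 78):
        print(raw, round(bit_score(raw), 1), f"{expect_value(raw):.1e}")
```

For the MMAR_0117 hit (raw score 195) this gives a bit score close to the reported 79.7 and an E-value on the order of 1e-13. For weak hits near the bottom of the list the estimate can drift further from the reported E-value because of the rounded parameters.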