BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2628
Length=120
Sequences producing significant alignments: Score (Bits) E Value
gi|15609765|ref|NP_217144.1| hypothetical protein Rv2628 [Mycoba... 243 8e-63
gi|121638517|ref|YP_978741.1| hypothetical protein BCG_2655 [Myc... 240 4e-62
gi|289448280|ref|ZP_06438024.1| conserved hypothetical protein [... 239 1e-61
gi|340627649|ref|YP_004746101.1| hypothetical protein MCAN_26741... 238 3e-61
gi|31793814|ref|NP_856307.1| hypothetical protein Mb2661 [Mycoba... 237 4e-61
gi|308232194|ref|ZP_07415243.2| hypothetical protein TMAG_02437 ... 180 6e-44
gi|240170621|ref|ZP_04749280.1| hypothetical protein MkanA1_1500... 114 4e-24
gi|296169060|ref|ZP_06850720.1| conserved hypothetical protein [... 102 2e-20
gi|118462821|ref|YP_882685.1| hypothetical protein MAV_3503 [Myc... 99.0 2e-19
gi|254775954|ref|ZP_05217470.1| hypothetical protein MaviaA2_149... 99.0 2e-19
gi|41408826|ref|NP_961662.1| hypothetical protein MAP2728 [Mycob... 98.6 3e-19
gi|296170912|ref|ZP_06852449.1| conserved hypothetical protein [... 95.9 2e-18
gi|254818991|ref|ZP_05223992.1| hypothetical protein MintA_03646... 93.6 8e-18
gi|342858548|ref|ZP_08715203.1| hypothetical protein MCOL_06721 ... 93.6 9e-18
gi|120402408|ref|YP_952237.1| hypothetical protein Mvan_1397 [My... 77.8 6e-13
gi|294993095|ref|ZP_06798786.1| hypothetical protein Mtub2_00940... 62.8 2e-08
gi|319791257|ref|YP_004152897.1| type VI secretion protein [Vari... 36.6 1.2
gi|158341517|ref|YP_001522681.1| hypothetical protein AM1_H0014 ... 36.2 1.7
gi|158339633|ref|YP_001521022.1| hypothetical protein AM1_A0372 ... 35.8 2.0
>gi|15609765|ref|NP_217144.1| hypothetical protein Rv2628 [Mycobacterium tuberculosis H37Rv]
gi|15842168|ref|NP_337205.1| hypothetical protein MT2703 [Mycobacterium tuberculosis CDC1551]
gi|148662468|ref|YP_001283991.1| hypothetical protein MRA_2656 [Mycobacterium tuberculosis H37Ra]
30 more sequence titles
Length=120
Score = 243 bits (620), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 120/120 (100%), Positives = 120/120 (100%), Gaps = 0/120 (0%)
Query 1 MSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHPRKVQSATIYQVTDRSH 60
MSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHPRKVQSATIYQVTDRSH
Sbjct 1 MSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHPRKVQSATIYQVTDRSH 60
Query 61 DGRTARVPGDEITSTVSGWLSELGTQSPLADELARAVRIGDWPAAYAIGEHLSVEIAVAV 120
DGRTARVPGDEITSTVSGWLSELGTQSPLADELARAVRIGDWPAAYAIGEHLSVEIAVAV
Sbjct 61 DGRTARVPGDEITSTVSGWLSELGTQSPLADELARAVRIGDWPAAYAIGEHLSVEIAVAV 120
>gi|121638517|ref|YP_978741.1| hypothetical protein BCG_2655 [Mycobacterium bovis BCG str. Pasteur
1173P2]
gi|224991011|ref|YP_002645698.1| hypothetical protein JTY_2649 [Mycobacterium bovis BCG str. Tokyo
172]
gi|289444169|ref|ZP_06433913.1| hypothetical protein TBLG_01271 [Mycobacterium tuberculosis T46]
14 more sequence titles
Length=120
Score = 240 bits (613), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 119/120 (99%), Positives = 119/120 (99%), Gaps = 0/120 (0%)
Query 1 MSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHPRKVQSATIYQVTDRSH 60
MSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHPRKVQSATIYQVTDR H
Sbjct 1 MSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHPRKVQSATIYQVTDRLH 60
Query 61 DGRTARVPGDEITSTVSGWLSELGTQSPLADELARAVRIGDWPAAYAIGEHLSVEIAVAV 120
DGRTARVPGDEITSTVSGWLSELGTQSPLADELARAVRIGDWPAAYAIGEHLSVEIAVAV
Sbjct 61 DGRTARVPGDEITSTVSGWLSELGTQSPLADELARAVRIGDWPAAYAIGEHLSVEIAVAV 120
>gi|289448280|ref|ZP_06438024.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
gi|289421238|gb|EFD18439.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
Length=120
Score = 239 bits (610), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 118/120 (99%), Positives = 118/120 (99%), Gaps = 0/120 (0%)
Query 1 MSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHPRKVQSATIYQVTDRSH 60
MSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHPRKVQSATIYQVTDR H
Sbjct 1 MSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHPRKVQSATIYQVTDRLH 60
Query 61 DGRTARVPGDEITSTVSGWLSELGTQSPLADELARAVRIGDWPAAYAIGEHLSVEIAVAV 120
DGRTARVPGDEITSTVSGWLSELGTQSPLADELAR VRIGDWPAAYAIGEHLSVEIAVAV
Sbjct 61 DGRTARVPGDEITSTVSGWLSELGTQSPLADELARTVRIGDWPAAYAIGEHLSVEIAVAV 120
>gi|340627649|ref|YP_004746101.1| hypothetical protein MCAN_26741 [Mycobacterium canettii CIPT
140010059]
gi|340005839|emb|CCC45005.1| hypothetical protein MCAN_26741 [Mycobacterium canettii CIPT
140010059]
Length=120
Score = 238 bits (606), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 117/120 (98%), Positives = 118/120 (99%), Gaps = 0/120 (0%)
Query 1 MSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHPRKVQSATIYQVTDRSH 60
MSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHPRKVQ+ATIYQVTDR H
Sbjct 1 MSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHPRKVQAATIYQVTDRLH 60
Query 61 DGRTARVPGDEITSTVSGWLSELGTQSPLADELARAVRIGDWPAAYAIGEHLSVEIAVAV 120
DGRTARVPGDEITSTVSGWLSELG QSPLADELARAVRIGDWPAAYAIGEHLSVEIAVAV
Sbjct 61 DGRTARVPGDEITSTVSGWLSELGAQSPLADELARAVRIGDWPAAYAIGEHLSVEIAVAV 120
>gi|31793814|ref|NP_856307.1| hypothetical protein Mb2661 [Mycobacterium bovis AF2122/97]
gi|31619408|emb|CAD94846.1| HYPOTHETICAL PROTEIN Mb2661 [Mycobacterium bovis AF2122/97]
Length=120
Score = 237 bits (605), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 118/120 (99%), Positives = 118/120 (99%), Gaps = 0/120 (0%)
Query 1 MSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHPRKVQSATIYQVTDRSH 60
MSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHPRKVQSATIYQVTDR H
Sbjct 1 MSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHPRKVQSATIYQVTDRLH 60
Query 61 DGRTARVPGDEITSTVSGWLSELGTQSPLADELARAVRIGDWPAAYAIGEHLSVEIAVAV 120
DGRTARV GDEITSTVSGWLSELGTQSPLADELARAVRIGDWPAAYAIGEHLSVEIAVAV
Sbjct 61 DGRTARVRGDEITSTVSGWLSELGTQSPLADELARAVRIGDWPAAYAIGEHLSVEIAVAV 120
>gi|308232194|ref|ZP_07415243.2| hypothetical protein TMAG_02437 [Mycobacterium tuberculosis SUMu001]
gi|308369805|ref|ZP_07419144.2| hypothetical protein TMBG_02766 [Mycobacterium tuberculosis SUMu002]
gi|308371076|ref|ZP_07423755.2| hypothetical protein TMCG_01876 [Mycobacterium tuberculosis SUMu003]
22 more sequence titles
Length=90
Score = 180 bits (457), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 89/90 (99%), Positives = 90/90 (100%), Gaps = 0/90 (0%)
Query 31 VHQEAMMNLAIWHPRKVQSATIYQVTDRSHDGRTARVPGDEITSTVSGWLSELGTQSPLA 90
+HQEAMMNLAIWHPRKVQSATIYQVTDRSHDGRTARVPGDEITSTVSGWLSELGTQSPLA
Sbjct 1 MHQEAMMNLAIWHPRKVQSATIYQVTDRSHDGRTARVPGDEITSTVSGWLSELGTQSPLA 60
Query 91 DELARAVRIGDWPAAYAIGEHLSVEIAVAV 120
DELARAVRIGDWPAAYAIGEHLSVEIAVAV
Sbjct 61 DELARAVRIGDWPAAYAIGEHLSVEIAVAV 90
>gi|240170621|ref|ZP_04749280.1| hypothetical protein MkanA1_15005 [Mycobacterium kansasii ATCC
12478]
Length=82
Score = 114 bits (285), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 56/83 (68%), Positives = 66/83 (80%), Gaps = 2/83 (2%)
Query 37 MNLAIWHPRKVQSATIYQVTDRSHDGRTARVPGDEITSTVSGWLSELGTQSPLADELARA 96
MN AIW KV TIYQVTDR HDGRTARV +EIT+TV+ WLSELG Q+PL D+LA A
Sbjct 1 MNAAIWRHHKV--TTIYQVTDRLHDGRTARVTANEITATVASWLSELGVQTPLVDDLACA 58
Query 97 VRIGDWPAAYAIGEHLSVEIAVA 119
VR GDWP AYAIGE LS+++++A
Sbjct 59 VRTGDWPTAYAIGECLSIQVSIA 81
>gi|296169060|ref|ZP_06850720.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295896251|gb|EFG75912.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=107
Score = 102 bits (253), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 52/90 (58%), Positives = 65/90 (73%), Gaps = 2/90 (2%)
Query 33 QEAMMNLAI--WHPRKVQSATIYQVTDRSHDGRTARVPGDEITSTVSGWLSELGTQSPLA 90
+EA+M A H R V +AT+Y+VTDR H GRT RV G+EI VS WL+ELG SPL
Sbjct 16 EEAVMRFATRRHHLRAVPAATMYRVTDRLHGGRTVRVSGNEIAPIVSAWLAELGAHSPLV 75
Query 91 DELARAVRIGDWPAAYAIGEHLSVEIAVAV 120
+ELARA +GDW AAYAIG+ LSV++ +AV
Sbjct 76 EELARAACVGDWSAAYAIGDQLSVDVVIAV 105
>gi|118462821|ref|YP_882685.1| hypothetical protein MAV_3503 [Mycobacterium avium 104]
gi|118164108|gb|ABK65005.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=87
Score = 99.0 bits (245), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 49/76 (65%), Positives = 60/76 (79%), Gaps = 0/76 (0%)
Query 44 PRKVQSATIYQVTDRSHDGRTARVPGDEITSTVSGWLSELGTQSPLADELARAVRIGDWP 103
P++ +ATIYQ+TDR H G ARVP +IT+TVS WL++LG SPL DEL RAVR GDW
Sbjct 11 PQRKPTATIYQLTDRLHRGHVARVPAHQITATVSAWLADLGADSPLVDELERAVRGGDWA 70
Query 104 AAYAIGEHLSVEIAVA 119
AA+A+GE LS+EIAVA
Sbjct 71 AAHALGECLSIEIAVA 86
>gi|254775954|ref|ZP_05217470.1| hypothetical protein MaviaA2_14960 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=81
Score = 99.0 bits (245), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 49/76 (65%), Positives = 60/76 (79%), Gaps = 0/76 (0%)
Query 44 PRKVQSATIYQVTDRSHDGRTARVPGDEITSTVSGWLSELGTQSPLADELARAVRIGDWP 103
P++ +ATIYQ+TDR H G ARVP +IT+TVS WL++LG SPL DEL RAVR GDW
Sbjct 5 PQRKPTATIYQLTDRLHRGHVARVPAHQITATVSAWLADLGADSPLVDELERAVRGGDWA 64
Query 104 AAYAIGEHLSVEIAVA 119
AA+A+GE LS+EIAVA
Sbjct 65 AAHALGECLSIEIAVA 80
>gi|41408826|ref|NP_961662.1| hypothetical protein MAP2728 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41397185|gb|AAS05045.1| hypothetical protein MAP_2728 [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=87
Score = 98.6 bits (244), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 49/76 (65%), Positives = 60/76 (79%), Gaps = 0/76 (0%)
Query 44 PRKVQSATIYQVTDRSHDGRTARVPGDEITSTVSGWLSELGTQSPLADELARAVRIGDWP 103
P++ +ATIYQ+TDR H G ARVP +IT+TVS WL++LG SPL DEL RAVR GDW
Sbjct 11 PQRRPTATIYQLTDRLHRGHVARVPAHQITATVSAWLADLGADSPLVDELERAVRGGDWA 70
Query 104 AAYAIGEHLSVEIAVA 119
AA+A+GE LS+EIAVA
Sbjct 71 AAHALGECLSIEIAVA 86
>gi|296170912|ref|ZP_06852449.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295894461|gb|EFG74205.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=87
Score = 95.9 bits (237), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 46/71 (65%), Positives = 58/71 (82%), Gaps = 0/71 (0%)
Query 49 SATIYQVTDRSHDGRTARVPGDEITSTVSGWLSELGTQSPLADELARAVRIGDWPAAYAI 108
+ATIYQ+TDR H+G ARVP ++I +TVS WL+ELG SPL D+L RAVR GDW AA+A+
Sbjct 16 AATIYQLTDRLHEGHVARVPANQIPATVSAWLAELGAHSPLVDDLERAVRGGDWSAAHAL 75
Query 109 GEHLSVEIAVA 119
GE LS+EIA+A
Sbjct 76 GECLSIEIALA 86
>gi|254818991|ref|ZP_05223992.1| hypothetical protein MintA_03646 [Mycobacterium intracellulare
ATCC 13950]
Length=87
Score = 93.6 bits (231), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 47/76 (62%), Positives = 58/76 (77%), Gaps = 0/76 (0%)
Query 44 PRKVQSATIYQVTDRSHDGRTARVPGDEITSTVSGWLSELGTQSPLADELARAVRIGDWP 103
P++ +ATIYQ+TDR H G ARVP +IT+TVS WL++LG SPL DEL RAV GDW
Sbjct 11 PQRRPTATIYQLTDRLHRGHVARVPAHQITATVSAWLADLGADSPLVDELERAVCGGDWA 70
Query 104 AAYAIGEHLSVEIAVA 119
AA+A+ E LS+EIAVA
Sbjct 71 AAHALSECLSIEIAVA 86
>gi|342858548|ref|ZP_08715203.1| hypothetical protein MCOL_06721 [Mycobacterium colombiense CECT
3035]
gi|342134252|gb|EGT87432.1| hypothetical protein MCOL_06721 [Mycobacterium colombiense CECT
3035]
Length=87
Score = 93.6 bits (231), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 46/76 (61%), Positives = 59/76 (78%), Gaps = 0/76 (0%)
Query 44 PRKVQSATIYQVTDRSHDGRTARVPGDEITSTVSGWLSELGTQSPLADELARAVRIGDWP 103
P++ +ATIYQ+TDR H G+ ARVP +IT+TV+ WL++LG SPL DEL RAV GDW
Sbjct 11 PQRRPTATIYQLTDRLHKGQIARVPAHQITATVAAWLADLGADSPLVDELERAVCGGDWA 70
Query 104 AAYAIGEHLSVEIAVA 119
AA+A+ E LS+EIAVA
Sbjct 71 AAHALSECLSIEIAVA 86
>gi|120402408|ref|YP_952237.1| hypothetical protein Mvan_1397 [Mycobacterium vanbaalenii PYR-1]
gi|119955226|gb|ABM12231.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=85
Score = 77.8 bits (190), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 38/66 (58%), Positives = 46/66 (70%), Gaps = 0/66 (0%)
Query 53 YQVTDRSHDGRTARVPGDEITSTVSGWLSELGTQSPLADELARAVRIGDWPAAYAIGEHL 112
Y +TDR H R+ RV DEIT TVS WL +LG S LAD+ AR VR GDW A+A+ E L
Sbjct 18 YVLTDRLHRERSVRVTADEITVTVSSWLEQLGVHSALADDFARTVREGDWAGAHALAEAL 77
Query 113 SVEIAV 118
SV++AV
Sbjct 78 SVDVAV 83
>gi|294993095|ref|ZP_06798786.1| hypothetical protein Mtub2_00940 [Mycobacterium tuberculosis
210]
Length=32
Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 31/32 (97%), Positives = 32/32 (100%), Gaps = 0/32 (0%)
Query 89 LADELARAVRIGDWPAAYAIGEHLSVEIAVAV 120
+ADELARAVRIGDWPAAYAIGEHLSVEIAVAV
Sbjct 1 MADELARAVRIGDWPAAYAIGEHLSVEIAVAV 32
>gi|319791257|ref|YP_004152897.1| type VI secretion protein [Variovorax paradoxus EPS]
gi|315593720|gb|ADU34786.1| type VI secretion protein, VC_A0111 family [Variovorax paradoxus
EPS]
Length=360
Score = 36.6 bits (83), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/84 (33%), Positives = 39/84 (47%), Gaps = 11/84 (13%)
Query 1 MSTQRPRHSGIR-AVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHPRKVQSATIYQVTDRS 59
++ RP R VG + G GRIGR VH +A ++ + W R+V +A + RS
Sbjct 149 VALDRPGEDRFRLQVGAFVGMGTPGRIGRDEVHDDARLHFSGWLARRVHNAESVESVLRS 208
Query 60 HDGRTA----------RVPGDEIT 73
+ G RVP DE+T
Sbjct 209 YFGVPVTLERWVGHWMRVPSDEVT 232
>gi|158341517|ref|YP_001522681.1| hypothetical protein AM1_H0014 [Acaryochloris marina MBIC11017]
gi|158311758|gb|ABW33367.1| hypothetical protein AM1_H0014 [Acaryochloris marina MBIC11017]
Length=74
Score = 36.2 bits (82), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 15/48 (32%), Positives = 24/48 (50%), Gaps = 0/48 (0%)
Query 53 YQVTDRSHDGRTARVPGDEITSTVSGWLSELGTQSPLADELARAVRIG 100
YQ H R+P D+I VS WL+E P +L+ ++++G
Sbjct 17 YQTDYGQHPDPCTRLPSDQILDRVSSWLAEFNPLLPPESQLSESIQVG 64
>gi|158339633|ref|YP_001521022.1| hypothetical protein AM1_A0372 [Acaryochloris marina MBIC11017]
gi|158340441|ref|YP_001521797.1| hypothetical protein AM1_C0369 [Acaryochloris marina MBIC11017]
gi|158309874|gb|ABW31490.1| hypothetical protein AM1_A0372 [Acaryochloris marina MBIC11017]
gi|158310682|gb|ABW32296.1| hypothetical protein AM1_C0369 [Acaryochloris marina MBIC11017]
Length=69
Score = 35.8 bits (81), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 19/55 (35%), Positives = 27/55 (50%), Gaps = 1/55 (1%)
Query 47 VQSATIYQVTDR-SHDGRTARVPGDEITSTVSGWLSELGTQSPLADELARAVRIG 100
++ A IY TD H +P D+I V+ WLSE P E A ++R+G
Sbjct 10 IEQAWIYHQTDYGQHPDPCTGLPSDQILDRVASWLSEFNPLIPPESETALSIRVG 64
Lambda K H
0.319 0.132 0.422
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 128530997826
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40