BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2811
Length=202
Sequences producing significant alignments: Score (Bits) E Value
gi|308232250|ref|ZP_07415430.2| hypothetical protein TMAG_03193 ... 387 6e-106
gi|15609948|ref|NP_217327.1| hypothetical protein Rv2811 [Mycoba... 386 1e-105
gi|289448470|ref|ZP_06438214.1| conserved hypothetical protein [... 384 3e-105
gi|289575509|ref|ZP_06455736.1| conserved hypothetical protein [... 383 1e-104
gi|167967616|ref|ZP_02549893.1| hypothetical protein MtubH3_0610... 269 1e-70
gi|294994095|ref|ZP_06799786.1| hypothetical protein Mtub2_06173... 262 2e-68
gi|289444360|ref|ZP_06434104.1| conserved hypothetical protein [... 241 5e-62
gi|296169419|ref|ZP_06851041.1| conserved hypothetical protein [... 232 2e-59
gi|15842347|ref|NP_337384.1| hypothetical protein MT2878 [Mycoba... 176 1e-42
gi|340628741|ref|YP_004747193.1| hypothetical protein MCAN_37911... 165 4e-39
gi|120401572|ref|YP_951401.1| hypothetical protein Mvan_0551 [My... 155 4e-36
gi|258655053|ref|YP_003204209.1| hypothetical protein Namu_4947 ... 152 3e-35
gi|111026944|ref|YP_708922.1| hypothetical protein RHA1_ro11117 ... 139 2e-31
gi|336117532|ref|YP_004572300.1| hypothetical protein MLP_18830 ... 139 3e-31
gi|258651520|ref|YP_003200676.1| hypothetical protein Namu_1284 ... 134 7e-30
gi|226334813|ref|YP_002784485.1| hypothetical protein ROP_pKNR-0... 133 2e-29
gi|15610907|ref|NP_218288.1| hypothetical protein Rv3771c [Mycob... 128 4e-28
gi|119854967|ref|YP_935572.1| hypothetical protein Mkms_5573 [My... 125 4e-27
gi|315441603|ref|YP_004074480.1| RNA polymerase sigma factor, si... 97.1 1e-18
gi|226349348|ref|YP_002776462.1| hypothetical protein ROP_pROB01... 77.4 1e-12
gi|296169410|ref|ZP_06851032.1| conserved hypothetical protein [... 73.9 1e-11
gi|78044239|ref|YP_360085.1| ISChy3, orf1 [Carboxydothermus hydr... 46.6 0.002
gi|78044213|ref|YP_361433.1| ISChy3, orf1 [Carboxydothermus hydr... 45.8 0.003
gi|167628183|ref|YP_001678682.1| hypothetical protein HM1_0046 [... 43.1 0.024
gi|291559600|emb|CBL38400.1| hypothetical protein CL2_14540 [but... 41.2 0.097
gi|333006170|gb|EGK25679.1| transposase IS66 family protein [Shi... 38.1 0.69
gi|332088549|gb|EGI93664.1| transposase IS66 family protein [Shi... 38.1 0.76
gi|145226040|ref|YP_001136694.1| hypothetical protein Mflv_5445 ... 38.1 0.76
gi|332094570|gb|EGI99616.1| transposase IS66 family protein [Shi... 37.7 1.0
gi|332086342|gb|EGI91494.1| transposase IS66 family protein [Shi... 37.4 1.3
gi|332995920|gb|EGK15550.1| transposase IS66 family protein [Shi... 37.4 1.4
gi|281357906|ref|ZP_06244391.1| hypothetical protein Vvad_PD1259... 37.0 1.5
gi|333003611|gb|EGK23149.1| transposase IS66 family protein [Shi... 37.0 1.6
gi|332094995|gb|EGJ00034.1| transposase IS66 family protein [Shi... 36.6 2.0
gi|332091050|gb|EGI96140.1| transposase IS66 family protein [Shi... 36.6 2.3
gi|333002481|gb|EGK22043.1| transposase IS66 family protein [Shi... 36.6 2.4
gi|332083450|gb|EGI88675.1| transposase IS66 family protein [Shi... 36.6 2.4
gi|335572844|gb|EGM59215.1| transposase IS66 family protein [Shi... 36.2 2.5
gi|335575330|gb|EGM61626.1| transposase IS66 family protein [Shi... 36.2 2.5
gi|335575322|gb|EGM61618.1| transposase IS66 family protein [Shi... 36.2 2.5
gi|345367754|gb|EGW99765.1| transposase IS66 family protein [Esc... 36.2 2.6
gi|335572364|gb|EGM58744.1| transposase IS66 family protein [Shi... 36.2 2.7
gi|320180591|gb|EFW55521.1| ISSfl4 ORF3 [Shigella boydii ATCC 9905] 36.2 2.7
gi|335573262|gb|EGM59625.1| transposase IS66 family protein [Shi... 36.2 2.7
gi|332097570|gb|EGJ02549.1| transposase IS66 family protein [Shi... 36.2 2.8
gi|333017847|gb|EGK37154.1| transposase IS66 family protein [Shi... 35.8 3.3
gi|335574467|gb|EGM60791.1| transposase IS66 family protein [Shi... 35.8 3.4
gi|320176337|gb|EFW51396.1| ISSfl4 ORF3 [Shigella dysenteriae CD... 35.8 3.5
gi|333011052|gb|EGK30466.1| transposase IS66 family protein [Shi... 35.8 3.5
gi|332756642|gb|EGJ86991.1| transposase IS66 family protein [Shi... 35.8 3.5
>gi|308232250|ref|ZP_07415430.2| hypothetical protein TMAG_03193 [Mycobacterium tuberculosis SUMu001]
gi|308369866|ref|ZP_07419331.2| hypothetical protein TMBG_02945 [Mycobacterium tuberculosis SUMu002]
gi|308371136|ref|ZP_07423946.2| hypothetical protein TMCG_02057 [Mycobacterium tuberculosis SUMu003]
21 more sequence titles
Length=215
Score = 387 bits (993), Expect = 6e-106, Method: Compositional matrix adjust.
Identities = 202/202 (100%), Positives = 202/202 (100%), Gaps = 0/202 (0%)
Query 1 VVTVEADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQLRGPAGPVELCPRRSRCTGCG 60
VVTVEADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQLRGPAGPVELCPRRSRCTGCG
Sbjct 14 VVTVEADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQLRGPAGPVELCPRRSRCTGCG 73
Query 61 VTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERV 120
VTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERV
Sbjct 74 VTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERV 133
Query 121 EAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFSLPTVSLAETAVAV 180
EAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFSLPTVSLAETAVAV
Sbjct 134 EAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFSLPTVSLAETAVAV 193
Query 181 SGGRLLAPGWPGEWVQHESTLP 202
SGGRLLAPGWPGEWVQHESTLP
Sbjct 194 SGGRLLAPGWPGEWVQHESTLP 215
>gi|15609948|ref|NP_217327.1| hypothetical protein Rv2811 [Mycobacterium tuberculosis H37Rv]
gi|31793986|ref|NP_856479.1| hypothetical protein Mb2834 [Mycobacterium bovis AF2122/97]
gi|121638690|ref|YP_978914.1| hypothetical protein BCG_2829 [Mycobacterium bovis BCG str. Pasteur
1173P2]
41 more sequence titles
Length=202
Score = 386 bits (991), Expect = 1e-105, Method: Compositional matrix adjust.
Identities = 201/202 (99%), Positives = 202/202 (100%), Gaps = 0/202 (0%)
Query 1 VVTVEADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQLRGPAGPVELCPRRSRCTGCG 60
+VTVEADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQLRGPAGPVELCPRRSRCTGCG
Sbjct 1 MVTVEADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQLRGPAGPVELCPRRSRCTGCG 60
Query 61 VTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERV 120
VTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERV
Sbjct 61 VTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERV 120
Query 121 EAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFSLPTVSLAETAVAV 180
EAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFSLPTVSLAETAVAV
Sbjct 121 EAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFSLPTVSLAETAVAV 180
Query 181 SGGRLLAPGWPGEWVQHESTLP 202
SGGRLLAPGWPGEWVQHESTLP
Sbjct 181 SGGRLLAPGWPGEWVQHESTLP 202
>gi|289448470|ref|ZP_06438214.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
gi|289421428|gb|EFD18629.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
Length=202
Score = 384 bits (987), Expect = 3e-105, Method: Compositional matrix adjust.
Identities = 200/202 (99%), Positives = 202/202 (100%), Gaps = 0/202 (0%)
Query 1 VVTVEADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQLRGPAGPVELCPRRSRCTGCG 60
+VTVEADVDQVERRLAAGELSCPSCGGVLAGWGRARSR+LRGPAGPVELCPRRSRCTGCG
Sbjct 1 MVTVEADVDQVERRLAAGELSCPSCGGVLAGWGRARSRKLRGPAGPVELCPRRSRCTGCG 60
Query 61 VTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERV 120
VTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERV
Sbjct 61 VTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERV 120
Query 121 EAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFSLPTVSLAETAVAV 180
EAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFSLPTVSLAETAVAV
Sbjct 121 EAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFSLPTVSLAETAVAV 180
Query 181 SGGRLLAPGWPGEWVQHESTLP 202
SGGRLLAPGWPGEWVQHESTLP
Sbjct 181 SGGRLLAPGWPGEWVQHESTLP 202
>gi|289575509|ref|ZP_06455736.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
gi|289539940|gb|EFD44518.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
Length=202
Score = 383 bits (983), Expect = 1e-104, Method: Compositional matrix adjust.
Identities = 200/202 (99%), Positives = 201/202 (99%), Gaps = 0/202 (0%)
Query 1 VVTVEADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQLRGPAGPVELCPRRSRCTGCG 60
+VTVEADVDQVERRLAAGELSCPSCGGVLAGWGRA SRQLRGPAGPVELCPRRSRCTGCG
Sbjct 1 MVTVEADVDQVERRLAAGELSCPSCGGVLAGWGRAGSRQLRGPAGPVELCPRRSRCTGCG 60
Query 61 VTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERV 120
VTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERV
Sbjct 61 VTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERV 120
Query 121 EAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFSLPTVSLAETAVAV 180
EAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFSLPTVSLAETAVAV
Sbjct 121 EAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFSLPTVSLAETAVAV 180
Query 181 SGGRLLAPGWPGEWVQHESTLP 202
SGGRLLAPGWPGEWVQHESTLP
Sbjct 181 SGGRLLAPGWPGEWVQHESTLP 202
>gi|167967616|ref|ZP_02549893.1| hypothetical protein MtubH3_06106 [Mycobacterium tuberculosis
H37Ra]
gi|297732421|ref|ZP_06961539.1| hypothetical protein MtubKR_15092 [Mycobacterium tuberculosis
KZN R506]
Length=142
Score = 269 bits (688), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 141/142 (99%), Positives = 142/142 (100%), Gaps = 0/142 (0%)
Query 61 VTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERV 120
+THVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERV
Sbjct 1 MTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERV 60
Query 121 EAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFSLPTVSLAETAVAV 180
EAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFSLPTVSLAETAVAV
Sbjct 61 EAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFSLPTVSLAETAVAV 120
Query 181 SGGRLLAPGWPGEWVQHESTLP 202
SGGRLLAPGWPGEWVQHESTLP
Sbjct 121 SGGRLLAPGWPGEWVQHESTLP 142
>gi|294994095|ref|ZP_06799786.1| hypothetical protein Mtub2_06173 [Mycobacterium tuberculosis
210]
Length=139
Score = 262 bits (670), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 138/139 (99%), Positives = 139/139 (100%), Gaps = 0/139 (0%)
Query 64 VLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERVEAV 123
+LLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERVEAV
Sbjct 1 MLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERVEAV 60
Query 124 RSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFSLPTVSLAETAVAVSGG 183
RSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFSLPTVSLAETAVAVSGG
Sbjct 61 RSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFSLPTVSLAETAVAVSGG 120
Query 184 RLLAPGWPGEWVQHESTLP 202
RLLAPGWPGEWVQHESTLP
Sbjct 121 RLLAPGWPGEWVQHESTLP 139
>gi|289444360|ref|ZP_06434104.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289417279|gb|EFD14519.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
Length=175
Score = 241 bits (614), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 131/142 (93%), Positives = 132/142 (93%), Gaps = 1/142 (0%)
Query 1 VVTVEADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQLRGPAGPVELCPRRSRCTGCG 60
+VTVEADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQLRGPAGPVELCPRRSRCTGCG
Sbjct 1 MVTVEADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQLRGPAGPVELCPRRSRCTGCG 60
Query 61 VTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERV 120
VTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERV
Sbjct 61 VTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERV 120
Query 121 EA-VRSVFTVWLCAVDADPVMP 141
EA V V VDADPVMP
Sbjct 121 EAGAVGVHGVGCARVDADPVMP 142
>gi|296169419|ref|ZP_06851041.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295895921|gb|EFG75614.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=192
Score = 232 bits (592), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 128/160 (80%), Positives = 134/160 (84%), Gaps = 1/160 (0%)
Query 1 VVTVEADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQLRGPAGPVELCPRRSRCTGCG 60
VVTVEADVD VERRLAAGELSCP+C VLA WG AR RQLRG G V LCPRRSRCTGCG
Sbjct 29 VVTVEADVDVVERRLAAGELSCPACSSVLARWGWARPRQLRGRDGSVRLCPRRSRCTGCG 88
Query 61 VTHVLLPVSALLRRADTAAVIVSALAAKAT-SRVGFRRIATDVARPAETVRGWLRRFAER 119
VTHVLLPV+ALLRRADTAAVIVSALAAKA RVGFRRIA D+ARP ETVRGWLRRFAER
Sbjct 89 VTHVLLPVTALLRRADTAAVIVSALAAKALRRRVGFRRIAADLARPVETVRGWLRRFAER 148
Query 120 VEAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALA 159
EAVRS+FTVWL AVD DPVMP+ GG DAV I A+A
Sbjct 149 AEAVRSMFTVWLRAVDPDPVMPEPAGGVVADAVTVIAAVA 188
>gi|15842347|ref|NP_337384.1| hypothetical protein MT2878 [Mycobacterium tuberculosis CDC1551]
gi|13882644|gb|AAK47198.1| hypothetical protein MT2878 [Mycobacterium tuberculosis CDC1551]
Length=178
Score = 176 bits (447), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 92/98 (94%), Positives = 92/98 (94%), Gaps = 0/98 (0%)
Query 105 PAETVRGWLRRFAERVEAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGR 164
P G LRRFAERVEAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGR
Sbjct 81 PGGDGAGLLRRFAERVEAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGR 140
Query 165 RFSLPTVSLAETAVAVSGGRLLAPGWPGEWVQHESTLP 202
RFSLPTVSLAETAVAVSGGRLLAPGWPGEWVQHESTLP
Sbjct 141 RFSLPTVSLAETAVAVSGGRLLAPGWPGEWVQHESTLP 178
>gi|340628741|ref|YP_004747193.1| hypothetical protein MCAN_37911 [Mycobacterium canettii CIPT
140010059]
gi|340006931|emb|CCC46122.1| putative uncharacterized protein [Mycobacterium canettii CIPT
140010059]
Length=121
Score = 165 bits (417), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 93/121 (77%), Positives = 99/121 (82%), Gaps = 0/121 (0%)
Query 82 VSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERVEAVRSVFTVWLCAVDADPVMP 141
+SA A KA SRVGFRRIA D+ARPAETVRGWLRRFAER EAVRSVFTV L AVD DPVMP
Sbjct 1 MSAPAEKALSRVGFRRIAADLARPAETVRGWLRRFAERAEAVRSVFTVMLRAVDPDPVMP 60
Query 142 DAGGGGFVDAVVAIGALAAAIGRRFSLPTVSLAETAVAVSGGRLLAPGWPGEWVQHESTL 201
DA G F AV I A+ I R+F+L TVSLAETAVAVS GRL+APGWPGEWVQHESTL
Sbjct 61 DAAVGVFAYAVTVIAAVVTVIERQFALSTVSLAETAVAVSSGRLVAPGWPGEWVQHESTL 120
Query 202 P 202
P
Sbjct 121 P 121
>gi|120401572|ref|YP_951401.1| hypothetical protein Mvan_0551 [Mycobacterium vanbaalenii PYR-1]
gi|120401588|ref|YP_951417.1| hypothetical protein Mvan_0570 [Mycobacterium vanbaalenii PYR-1]
gi|120402692|ref|YP_952521.1| hypothetical protein Mvan_1687 [Mycobacterium vanbaalenii PYR-1]
7 more sequence titles
Length=199
Score = 155 bits (391), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 100/196 (52%), Positives = 121/196 (62%), Gaps = 8/196 (4%)
Query 1 VVTVEADVDQVERRLAAGELSCPSC-GGVLAGWGRARSRQLRGPAGPVELCPRRSRCTGC 59
+VTVE D +VE RL+ G ++CPSC GGVL GWG ARSRQ+ G PV PRR+RC C
Sbjct 1 MVTVEVDPVRVESRLSGGAIACPSCVGGVLGGWGFARSRQVEGLDHPVR--PRRARCRSC 58
Query 60 GVTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAER 119
VTHVLLPV+ LLRRA A I AL+ +A VG RRIA + P TVRGWLRR +R
Sbjct 59 LVTHVLLPVTVLLRRAHGAEQIWMALSTRAEG-VGHRRIAAWLQVPPATVRGWLRRAGQR 117
Query 120 VEAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFS----LPTVSLAE 175
+E +R+ F D +PD G G+ D V A+ AAIG+RF L V+ A+
Sbjct 118 LEPMRAWFLTVAVRTGIDVTIPDGFGCGWRDLVAALRCAVAAIGQRFGPAGLLGAVTPAQ 177
Query 176 TAVAVSGGRLLAPGWP 191
A SG RLLAPGWP
Sbjct 178 VMAAASGSRLLAPGWP 193
>gi|258655053|ref|YP_003204209.1| hypothetical protein Namu_4947 [Nakamurella multipartita DSM
44233]
gi|258558278|gb|ACV81220.1| hypothetical protein Namu_4947 [Nakamurella multipartita DSM
44233]
Length=197
Score = 152 bits (384), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 102/195 (53%), Positives = 120/195 (62%), Gaps = 8/195 (4%)
Query 1 VVTVEADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQLRGPAGPVELCPRRSRCTGCG 60
+VTVEAD VE RL AG + CP C GVL WG AR R++RG L PRR+RC+ C
Sbjct 1 MVTVEADQVLVESRLTAGGVPCPVCPGVLTPWGWARRREVRGVG---TLQPRRARCSLCL 57
Query 61 VTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERV 120
VTHVLLPV+ LLRRAD AAVI +AL A+A G R IA V P T RGWLRR + R+
Sbjct 58 VTHVLLPVTVLLRRADAAAVIWTALVARAAGH-GHRTIAALVGTPTSTARGWLRRMSTRL 116
Query 121 EAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFS----LPTVSLAET 176
E VR FTV D +PDA G + D V A+ A AI RF + TV+ A+
Sbjct 117 EPVRVHFTVVTRRAGVDQAVPDAAGDAWRDVVAAVAAAWLAITSRFGSAGLVGTVTAAQV 176
Query 177 AVAVSGGRLLAPGWP 191
A A SGGRLL+PGWP
Sbjct 177 ACASSGGRLLSPGWP 191
>gi|111026944|ref|YP_708922.1| hypothetical protein RHA1_ro11117 [Rhodococcus jostii RHA1]
gi|110825483|gb|ABH00764.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=218
Score = 139 bits (351), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 101/196 (52%), Positives = 121/196 (62%), Gaps = 8/196 (4%)
Query 1 VVTVEADVDQVERRLAAGELSCPSC-GGVLAGWGRARSRQLRGPAGPVELCPRRSRCTGC 59
+VTVE D +VE RL+ G++SCPSC GGVLAGWG AR R + G A PV PRR+RC GC
Sbjct 1 MVTVEVDPVRVESRLSRGDMSCPSCTGGVLAGWGFARPRPVAGMAAPVR--PRRARCRGC 58
Query 60 GVTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAER 119
VTHVLLPV+ LLRRA A +I +ALAAKA G R IA + P TVRGWLR A R
Sbjct 59 AVTHVLLPVTLLLRRAYLAELIWAALAAKARGH-GHRPIAQRLGIPGSTVRGWLRVEAAR 117
Query 120 VEAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAI----GALAAAIGRRFSLPTVSLAE 175
+AVRS F D +P + D + A+ GA+ A GR L V+ A+
Sbjct 118 ADAVRSWFLAVAVTTGVDVAVPRTTESVWGDVLAAVHAAHGAITARFGRSAVLGAVTAAQ 177
Query 176 TAVAVSGGRLLAPGWP 191
AVA S GRLL+PGWP
Sbjct 178 VAVAASAGRLLSPGWP 193
>gi|336117532|ref|YP_004572300.1| hypothetical protein MLP_18830 [Microlunatus phosphovorus NM-1]
gi|334685312|dbj|BAK34897.1| hypothetical protein MLP_18830 [Microlunatus phosphovorus NM-1]
Length=205
Score = 139 bits (349), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 94/195 (49%), Positives = 113/195 (58%), Gaps = 7/195 (3%)
Query 1 VVTVEADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQLRGPAGPVELCPRRSRCTGCG 60
++TVEAD QVE RLA G L+CP C L WG AR R + G G L PRR+RC GCG
Sbjct 1 MLTVEADRAQVESRLAGGRLACPGCAASLRPWGWARPRGVWGLPG--LLRPRRARCPGCG 58
Query 61 VTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERV 120
VTHVLLPV+ LLRRA VI +A+ A+A G RRI V PA TVRGWLRR R+
Sbjct 59 VTHVLLPVTVLLRRAYAVEVIGAAVVARADG-AGHRRIGEAVGVPAATVRGWLRRIGTRL 117
Query 121 EAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFS----LPTVSLAET 176
E R + D ++P A G + D + + A AA+ RF L V+ +
Sbjct 118 ETTRGYLLQVVVRAGVDRLVPKAQGSPWRDLLAGLAAATAAVTSRFGPIGVLGPVTAWQV 177
Query 177 AVAVSGGRLLAPGWP 191
A A SGGRLLAPGWP
Sbjct 178 AAACSGGRLLAPGWP 192
>gi|258651520|ref|YP_003200676.1| hypothetical protein Namu_1284 [Nakamurella multipartita DSM
44233]
gi|258652264|ref|YP_003201420.1| hypothetical protein Namu_2051 [Nakamurella multipartita DSM
44233]
gi|258653877|ref|YP_003203033.1| hypothetical protein Namu_3745 [Nakamurella multipartita DSM
44233]
gi|258654633|ref|YP_003203789.1| hypothetical protein Namu_4521 [Nakamurella multipartita DSM
44233]
gi|258554745|gb|ACV77687.1| hypothetical protein Namu_1284 [Nakamurella multipartita DSM
44233]
gi|258555489|gb|ACV78431.1| hypothetical protein Namu_2051 [Nakamurella multipartita DSM
44233]
gi|258557102|gb|ACV80044.1| hypothetical protein Namu_3745 [Nakamurella multipartita DSM
44233]
gi|258557858|gb|ACV80800.1| hypothetical protein Namu_4521 [Nakamurella multipartita DSM
44233]
Length=197
Score = 134 bits (338), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 100/195 (52%), Positives = 120/195 (62%), Gaps = 8/195 (4%)
Query 1 VVTVEADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQLRGPAGPVELCPRRSRCTGCG 60
+VTVEA+ D VE RL G + CP C GVLA WG AR R +RG L PRR+RC+ C
Sbjct 1 MVTVEANQDLVESRLTGGGVPCPVCPGVLAPWGWARRRDVRGVG---LLRPRRARCSSCL 57
Query 61 VTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERV 120
+THVLLPV+ LLRRAD AAVI +AL A++ G R +A V PA TVRGWLRR + R+
Sbjct 58 ITHVLLPVTVLLRRADAAAVIWAALVARSAGH-GHRAVAVLVGAPAPTVRGWLRRMSTRL 116
Query 121 EAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFS----LPTVSLAET 176
E VR FTV D PDA G + D V A+ A AI RF + TV+ +
Sbjct 117 EPVRVHFTVAARRAGVDQPAPDATGDAWRDVVAAVAAAWVAIASRFGSAGLVGTVTAGQV 176
Query 177 AVAVSGGRLLAPGWP 191
A A SGGRLL+PGWP
Sbjct 177 ACASSGGRLLSPGWP 191
>gi|226334813|ref|YP_002784485.1| hypothetical protein ROP_pKNR-00410 [Rhodococcus opacus B4]
gi|226246033|dbj|BAH56133.1| hypothetical protein [Rhodococcus opacus B4]
Length=213
Score = 133 bits (334), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/196 (51%), Positives = 119/196 (61%), Gaps = 8/196 (4%)
Query 1 VVTVEADVDQVERRLAAGELSCPSCG-GVLAGWGRARSRQLRGPAGPVELCPRRSRCTGC 59
+VTVEAD VE RLAAG + CPSCG GVL GWG AR+R++ G + R+RC C
Sbjct 1 MVTVEADPVHVESRLAAGTIGCPSCGDGVLGGWGYARARRIVGLGDRLRPR--RARCRAC 58
Query 60 GVTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAER 119
VTHVLLPV+ LLRRA A +I +AL +A G RR+A V PA TVRGWLRR AER
Sbjct 59 SVTHVLLPVAVLLRRAYAAELIWAALTVRAEGG-GHRRVAAVVGVPATTVRGWLRRMAER 117
Query 120 VEAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFS----LPTVSLAE 175
+E VRS F D +PD G + D + A+ AAA+ RF + V+
Sbjct 118 LEEVRSWFLGVAVVAGVDVTIPDTTGCRWRDVLSAVETAAAAVRSRFGPAGFVGAVTPVR 177
Query 176 TAVAVSGGRLLAPGWP 191
AVA SGGRLLAPGWP
Sbjct 178 VAVAASGGRLLAPGWP 193
>gi|15610907|ref|NP_218288.1| hypothetical protein Rv3771c [Mycobacterium tuberculosis H37Rv]
gi|15843394|ref|NP_338431.1| hypothetical protein MT3880 [Mycobacterium tuberculosis CDC1551]
gi|31794943|ref|NP_857436.1| hypothetical protein Mb3799c [Mycobacterium bovis AF2122/97]
54 more sequence titles
Length=108
Score = 128 bits (322), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 77/104 (75%), Positives = 83/104 (80%), Gaps = 0/104 (0%)
Query 86 AAKATSRVGFRRIATDVARPAETVRGWLRRFAERVEAVRSVFTVWLCAVDADPVMPDAGG 145
A KA S+VGFRRIA D+ARPAETVRGWLRRFAER EAVRSVFTV L AVD DPVMPDA
Sbjct 5 AEKALSQVGFRRIAADLARPAETVRGWLRRFAERAEAVRSVFTVMLRAVDPDPVMPDAAV 64
Query 146 GGFVDAVVAIGALAAAIGRRFSLPTVSLAETAVAVSGGRLLAPG 189
G F AV I A+ I +F+L TVSLAETAVAVSGGRL+APG
Sbjct 65 GVFAYAVTVIAAVVTVIECQFALSTVSLAETAVAVSGGRLVAPG 108
>gi|119854967|ref|YP_935572.1| hypothetical protein Mkms_5573 [Mycobacterium sp. KMS]
gi|315441536|ref|YP_004074413.1| hypothetical protein Mspyr1_54160 [Mycobacterium sp. Spyr1]
gi|315444102|ref|YP_004076981.1| hypothetical protein Mspyr1_25090 [Mycobacterium sp. Spyr1]
7 more sequence titles
Length=206
Score = 125 bits (313), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 96/196 (49%), Positives = 119/196 (61%), Gaps = 8/196 (4%)
Query 1 VVTVEADVDQVERRLAAGELSCPSC-GGVLAGWGRARSRQLRGPAGPVELCPRRSRCTGC 59
+VTVE D VE RL G +SCP+C GVL GWG AR+R + G V RR+RC C
Sbjct 1 MVTVEVDRVCVESRLVGGAISCPACPDGVLGGWGYARARHVEGLDDRVRP--RRARCRSC 58
Query 60 GVTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAER 119
VTHVLLPV+ LLRRA A + AL+A+A VG R IA + P TVRGWLRR +R
Sbjct 59 LVTHVLLPVTMLLRRAYAAERVWMALSARAEG-VGHRGIAARLQVPPSTVRGWLRRAGQR 117
Query 120 VEAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFS----LPTVSLAE 175
+E++R+ F D ++PD G G+ D V A+ A+AIG+RF L V+ A
Sbjct 118 LESMRTWFLTVAVGTGIDVMIPDGLGCGWRDVVAAVMVAASAIGQRFGPAGLLGVVTPAL 177
Query 176 TAVAVSGGRLLAPGWP 191
VAVSG RLLAPGWP
Sbjct 178 VVVAVSGARLLAPGWP 193
>gi|315441603|ref|YP_004074480.1| RNA polymerase sigma factor, sigma-70 family [Mycobacterium sp.
Spyr1]
gi|315265258|gb|ADU01999.1| RNA polymerase sigma factor, sigma-70 family [Mycobacterium sp.
Spyr1]
Length=192
Score = 97.1 bits (240), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 76/192 (40%), Positives = 92/192 (48%), Gaps = 23/192 (11%)
Query 9 DQVERRLAAGELSCPSCGGVLAGWGRARSRQLRGP-AGPVELCPRRSRCTGCGVTHVLLP 67
E LA + CP CGG LA WG AR R +R P A V + PRR RC C TH+LLP
Sbjct 8 QDAEAHLADAVMCCPHCGGTLAKWGYARERTVRAPGAATVTVRPRRLRCRNCTTTHILLP 67
Query 68 VSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERVEAVRSVF 127
+ RRADT VI ALA K +GFRRIAT + R TVR WLRR +
Sbjct 68 TALQPRRADTTEVIGIALAHKVNG-LGFRRIATLMGRSESTVRRWLRRATD-------TH 119
Query 128 TVWLCAVDADPVM---PDA-----GGGGFVDAVVAIGALAAAIGRR---FSLPTVSLAET 176
W C A ++ P+A G + + I + AA RR F P +L
Sbjct 120 LNWACQQGATRLIQLAPEAFTEIRYAGNQLRYTLTILSAAAYWDRRRCGFEEPPWTLIG- 178
Query 177 AVAVSGGRLLAP 188
+ GRLLAP
Sbjct 179 --MYTRGRLLAP 188
>gi|226349348|ref|YP_002776462.1| hypothetical protein ROP_pROB01-01110 [Rhodococcus opacus B4]
gi|226245263|dbj|BAH55610.1| hypothetical protein [Rhodococcus opacus B4]
Length=195
Score = 77.4 bits (189), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 59/141 (42%), Positives = 74/141 (53%), Gaps = 6/141 (4%)
Query 51 PRRSRCTGCGVTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVR 110
P R RC CG TH+LLP + +RRADTA VI +ALA KA +GFRRIA + RP TVR
Sbjct 54 PTRVRCRDCGATHILLPTALQVRRADTAEVIGNALAHKAKG-LGFRRIAERMGRPESTVR 112
Query 111 GWLRR-FAERVEAV--RSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFS 167
WLRR E V+ + R + L A +A + G DA+ + A A RRF
Sbjct 113 RWLRRTTGEHVQWLHRRGTERLGLVAREAFCTIRYV-GNPLGDALCVLSAAAVEDRRRFG 171
Query 168 LPTVSLAETAVAVSGGRLLAP 188
P + GRLL+P
Sbjct 172 FPDPPWDLIGIYTQ-GRLLSP 191
>gi|296169410|ref|ZP_06851032.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295895912|gb|EFG75605.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=48
Score = 73.9 bits (180), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 36/41 (88%), Positives = 37/41 (91%), Gaps = 0/41 (0%)
Query 162 IGRRFSLPTVSLAETAVAVSGGRLLAPGWPGEWVQHESTLP 202
I RRF+LP VSLAE AVAVSGGRLLA GWPGEWVQHESTLP
Sbjct 2 IERRFALPEVSLAEVAVAVSGGRLLASGWPGEWVQHESTLP 42
>gi|78044239|ref|YP_360085.1| ISChy3, orf1 [Carboxydothermus hydrogenoformans Z-2901]
gi|77996354|gb|ABB15253.1| ISChy3, orf1 [Carboxydothermus hydrogenoformans Z-2901]
Length=226
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/123 (26%), Positives = 53/123 (44%), Gaps = 6/123 (4%)
Query 3 TVEADVDQVERRLAAGELSCPSCGGVLA--GWGRARSRQLRGPAGPVELCPRRSRCTGCG 60
+V+ ++ L + CP C ++ GW R + L G + + R RC+ C
Sbjct 46 SVKEYLENYLNFLEENQWYCPVCSAKMSFHGWYRRKIITLDGTTTRIPIA--RYRCSNCR 103
Query 61 VTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERV 120
TH +LP R + +I + + + +V R+ + P T R W+RRF +R
Sbjct 104 KTHAILPDFVAPYRHYSQVLIAAVVEEVVSKQVPPERVEGNQDIP--TTRRWIRRFLKRC 161
Query 121 EAV 123
V
Sbjct 162 HEV 164
>gi|78044213|ref|YP_361433.1| ISChy3, orf1 [Carboxydothermus hydrogenoformans Z-2901]
gi|77996328|gb|ABB15227.1| ISChy3, orf1 [Carboxydothermus hydrogenoformans Z-2901]
Length=189
Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/123 (26%), Positives = 53/123 (44%), Gaps = 6/123 (4%)
Query 3 TVEADVDQVERRLAAGELSCPSCGGVLA--GWGRARSRQLRGPAGPVELCPRRSRCTGCG 60
+V+ ++ L + CP C ++ GW R + L G + + R RC+ C
Sbjct 9 SVKEYLENYLNFLEENQWYCPVCSAKMSFHGWYRRKIITLDGTTTRIPIA--RYRCSNCR 66
Query 61 VTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPAETVRGWLRRFAERV 120
TH +LP R + +I + + + +V R+ + P T R W+RRF +R
Sbjct 67 KTHAILPDFVAPYRHYSQVLIAAVVEEVVSKQVPPERVEGNQDIP--TTRRWIRRFLKRC 124
Query 121 EAV 123
V
Sbjct 125 HEV 127
>gi|167628183|ref|YP_001678682.1| hypothetical protein HM1_0046 [Heliobacterium modesticaldum Ice1]
gi|167590923|gb|ABZ82671.1| conserved hypothetical protein [Heliobacterium modesticaldum
Ice1]
Length=111
Score = 43.1 bits (100), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 30/92 (33%), Positives = 44/92 (48%), Gaps = 3/92 (3%)
Query 2 VTVEADVDQVERRLAAGELS-CPSCGGVLAGWGRARSRQLRGPAGPVE-LCPRRSRCTGC 59
TVE D + R+ E CP CG +L+G+ + R + +G V RR RC GC
Sbjct 7 YTVEYDEARSVYRIRNMEAPVCPQCGLLLSGYD-TKKRHVIDSSGAVRWFLLRRLRCPGC 65
Query 60 GVTHVLLPVSALLRRADTAAVIVSALAAKATS 91
G H+ LP ++ A +I+ LA + S
Sbjct 66 GKLHIELPDFMQPKKHYEAQLIMDVLAGHSDS 97
>gi|291559600|emb|CBL38400.1| hypothetical protein CL2_14540 [butyrate-producing bacterium
SSC/2]
Length=103
Score = 41.2 bits (95), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 30/103 (30%), Positives = 41/103 (40%), Gaps = 6/103 (5%)
Query 17 AGELSCPSCGGVLAGWGRARSRQLRGPAGPVELCPRRSRCTGCGVTHVLLPVSALLRRAD 76
E CP CGG L G R G + RR RCT CG H LP S L +
Sbjct 4 ESEHLCPLCGGELKYLGHVRRIMKTGSGHSKWIEVRRLRCTECGTIHRELPNSLLPYKHY 63
Query 77 TAAVIVSALAAKATSRVGFRRIATDVARPAE-TVRGWLRRFAE 118
++ +I ++ + T I P E T++ W F +
Sbjct 64 SSDIINRVVSGEITP-----DILEYEDYPCELTMKHWTEEFTK 101
>gi|333006170|gb|EGK25679.1| transposase IS66 family protein [Shigella flexneri K-218]
Length=371
Score = 38.1 bits (87), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 35/128 (28%), Positives = 55/128 (43%), Gaps = 6/128 (4%)
Query 6 ADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQL---RGPAGPVELCPRRSRCTGCGV- 61
A++ + RL E SCP+CGGVL G S QL +E + C+ C V
Sbjct 126 AELPRETHRLLPAETSCPACGGVLKEMGETISEQLDIINTAFKVIETIRPKLACSRCDVI 185
Query 62 THVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARP-AETVRGWL-RRFAER 119
LP + R +A ++ L +K + R + AR E R + RR +E
Sbjct 186 VQAPLPPKPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRRVSEM 245
Query 120 VEAVRSVF 127
+ +R ++
Sbjct 246 ADKLRPLY 253
>gi|332088549|gb|EGI93664.1| transposase IS66 family protein [Shigella boydii 5216-82]
Length=583
Score = 38.1 bits (87), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 39/156 (25%), Positives = 61/156 (40%), Gaps = 14/156 (8%)
Query 6 ADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQL---RGPAGPVELCPRRSRCTGCGV- 61
A++ + RL E SCP+CGGVL G S QL +E + C+ C V
Sbjct 126 AELPRETHRLLPAETSCPACGGVLKEMGETISEQLDIINTAFKVIETIRPKLACSRCDVI 185
Query 62 THVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPA-----ETVRGWLRRF 116
V LP + R +A ++ L +K + R + AR T+ W+
Sbjct 186 VQVPLPPKPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEM 245
Query 117 AERVEAVRSVFTVWLCA-----VDADPVMPDAGGGG 147
A+++ + ++ D PV A G G
Sbjct 246 ADKLRPLYIALNDYVLEAGKVHADDTPVKVLAPGNG 281
>gi|145226040|ref|YP_001136694.1| hypothetical protein Mflv_5445 [Mycobacterium gilvum PYR-GCK]
gi|145218503|gb|ABP47906.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
Length=86
Score = 38.1 bits (87), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 35/77 (46%), Positives = 42/77 (55%), Gaps = 4/77 (5%)
Query 119 RVEAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFS----LPTVSLA 174
RVE++R+ F D +PD G G+ D V AI AAAIG RF L ++
Sbjct 4 RVESMRAWFLQVAVGTGIDVAIPDGSGCGWSDLVAAIATAAAAIGARFGPAGVLGVMTPP 63
Query 175 ETAVAVSGGRLLAPGWP 191
VAVSGGRLLAP WP
Sbjct 64 LVMVAVSGGRLLAPCWP 80
>gi|332094570|gb|EGI99616.1| transposase IS66 family protein [Shigella boydii 5216-82]
Length=536
Score = 37.7 bits (86), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 39/156 (25%), Positives = 61/156 (40%), Gaps = 14/156 (8%)
Query 6 ADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQL---RGPAGPVELCPRRSRCTGCGV- 61
A++ + RL E SCP+CGGVL G S QL +E + C+ C V
Sbjct 129 AELPRETHRLLPAETSCPACGGVLKEMGETISEQLDIINTAFKVIETIRPKLACSRCDVI 188
Query 62 THVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPA-----ETVRGWLRRF 116
V LP + R +A ++ L +K + R + AR T+ W+
Sbjct 189 VQVPLPPKPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEM 248
Query 117 AERVEAVRSVFTVWLCA-----VDADPVMPDAGGGG 147
A+++ + ++ D PV A G G
Sbjct 249 ADKLRPLYIALNDYVLEAGKVHADDTPVKVLAPGNG 284
>gi|332086342|gb|EGI91494.1| transposase IS66 family protein [Shigella boydii 5216-82]
Length=263
Score = 37.4 bits (85), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 38/148 (26%), Positives = 57/148 (39%), Gaps = 14/148 (9%)
Query 14 RLAAGELSCPSCGGVLAGWGRARSRQL---RGPAGPVELCPRRSRCTGCGV-THVLLPVS 69
RL E SCP+CGGVL G S QL +E + C+ C V V LP
Sbjct 95 RLLPAETSCPACGGVLKEMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQVPLPPK 154
Query 70 ALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPA-----ETVRGWLRRFAERVEAVR 124
+ R +A ++ L +K + R + AR T+ W+ A+++ +
Sbjct 155 PIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLRPLY 214
Query 125 SVFTVWLCA-----VDADPVMPDAGGGG 147
++ D PV A G G
Sbjct 215 IALNDYVLEAGKVHADDTPVKVLAPGNG 242
>gi|332995920|gb|EGK15550.1| transposase IS66 family protein [Shigella flexneri K-272]
Length=346
Score = 37.4 bits (85), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 42/162 (26%), Positives = 64/162 (40%), Gaps = 14/162 (8%)
Query 6 ADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQL---RGPAGPVELCPRRSRCTGCGV- 61
A++ + RL E SCP+CGGVL G S QL +E + C+ C V
Sbjct 126 AELPRETHRLLPAETSCPACGGVLKEMGETISEQLDIINTAFKVIETIRPKLACSRCDVI 185
Query 62 THVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPA-----ETVRGWLRRF 116
LP + R +A ++ L +K + R + AR T+ W+
Sbjct 186 VQAPLPPKPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEM 245
Query 117 AERVEAVRSVFTVWLC---AVDAD--PVMPDAGGGGFVDAVV 153
A+++ + ++ V AD PV A G G VV
Sbjct 246 ADKLRPLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTKRVV 287
>gi|281357906|ref|ZP_06244391.1| hypothetical protein Vvad_PD1259 [Victivallis vadensis ATCC BAA-548]
gi|281315564|gb|EFA99592.1| hypothetical protein Vvad_PD1259 [Victivallis vadensis ATCC BAA-548]
Length=232
Score = 37.0 bits (84), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 38/138 (28%), Positives = 54/138 (40%), Gaps = 18/138 (13%)
Query 37 SRQLRGPAGPVELCPRRSRCTGCGVTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFR 96
R++R G ++ R C+ CG T L L R T + L +A S +R
Sbjct 78 ERRIRTSLGEFKMSFWRVSCSACGKTFSPLQRFIHLGRYQTKTNELEKLVIEAASETNYR 137
Query 97 RIATDVARPAE------TVRGW-LRRFAERVEAVRSVFTVWLCAVDADP--VMPDAGG-- 145
R D+AR + T GW LR + ++ R V + + P +MPD G
Sbjct 138 RAVRDLARDGKLPVSFHTAHGWVLRTDCDEIDLSRQV-------IGSVPIQIMPDGTGFK 190
Query 146 GGFVDAVVAIGALAAAIG 163
G D G L IG
Sbjct 191 GEGRDGKARKGDLKVVIG 208
>gi|333003611|gb|EGK23149.1| transposase IS66 family protein [Shigella flexneri K-272]
Length=346
Score = 37.0 bits (84), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 42/162 (26%), Positives = 64/162 (40%), Gaps = 14/162 (8%)
Query 6 ADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQL---RGPAGPVELCPRRSRCTGCGV- 61
A++ + RL E SCP+CGGVL G S QL +E + C+ C V
Sbjct 126 AELPRETHRLLPAETSCPACGGVLKEMGETISEQLDIINTAFKVIETIRPKLACSRCDVI 185
Query 62 THVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPA-----ETVRGWLRRF 116
LP + R +A ++ L +K + R + AR T+ W+
Sbjct 186 VQAPLPPKPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEM 245
Query 117 AERVEAVRSVFTVWLC---AVDAD--PVMPDAGGGGFVDAVV 153
A+++ + ++ V AD PV A G G VV
Sbjct 246 ADKLRPLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTKRVV 287
>gi|332094995|gb|EGJ00034.1| transposase IS66 family protein [Shigella boydii 5216-82]
Length=511
Score = 36.6 bits (83), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 39/156 (25%), Positives = 61/156 (40%), Gaps = 14/156 (8%)
Query 6 ADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQL---RGPAGPVELCPRRSRCTGCGV- 61
A++ + RL E SCP+CGGVL G S QL +E + C+ C V
Sbjct 129 AELPRETHRLLPAETSCPACGGVLKEMGETISEQLDIINTAFKVIETIRPKLACSRCDVI 188
Query 62 THVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPA-----ETVRGWLRRF 116
V LP + R +A ++ L +K + R + AR T+ W+
Sbjct 189 VQVPLPPKPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEM 248
Query 117 AERVEAVRSVFTVWLCA-----VDADPVMPDAGGGG 147
A+++ + ++ D PV A G G
Sbjct 249 ADKLRPLYIALNDYVLEAGKVHADDTPVKVLAPGNG 284
>gi|332091050|gb|EGI96140.1| transposase IS66 family protein [Shigella dysenteriae 155-74]
Length=429
Score = 36.6 bits (83), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 37/149 (25%), Positives = 56/149 (38%), Gaps = 14/149 (9%)
Query 13 RRLAAGELSCPSCGGVLAGWGRARSRQL---RGPAGPVELCPRRSRCTGCGV-THVLLPV 68
RL E SCP+CGGVL G S QL +E + C+ C V LP
Sbjct 29 HRLLPAETSCPACGGVLKEMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPP 88
Query 69 SALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPA-----ETVRGWLRRFAERVEAV 123
+ R +A ++ L +K + R + AR T+ W+ A+++ +
Sbjct 89 KPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLRPL 148
Query 124 RSVFTVWLCA-----VDADPVMPDAGGGG 147
++ D PV A G G
Sbjct 149 YIALNDYVLEAGKVHADDTPVKVLAPGNG 177
>gi|333002481|gb|EGK22043.1| transposase IS66 family protein [Shigella flexneri K-218]
Length=317
Score = 36.6 bits (83), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 34/128 (27%), Positives = 55/128 (43%), Gaps = 6/128 (4%)
Query 6 ADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQL---RGPAGPVELCPRRSRCTGCGV- 61
A++ + RL E SCP+CGGVL G S QL +E + C+ C V
Sbjct 126 AELPRETHRLLPAETSCPACGGVLKEMGETISEQLDIINTAFKVIETIRPKLACSRCDVI 185
Query 62 THVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARP-AETVRGWLRRF-AER 119
LP + R +A ++ L +K + R + AR E R + R+ +E
Sbjct 186 VQAPLPPKPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMERWVSEM 245
Query 120 VEAVRSVF 127
+ +R ++
Sbjct 246 ADKLRPLY 253
>gi|332083450|gb|EGI88675.1| transposase IS66 family protein [Shigella dysenteriae 155-74]
gi|332083774|gb|EGI88992.1| transposase IS66 family protein [Shigella dysenteriae 155-74]
gi|332083919|gb|EGI89130.1| transposase IS66 family protein [Shigella dysenteriae 155-74]
10 more sequence titles
Length=429
Score = 36.6 bits (83), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 37/149 (25%), Positives = 56/149 (38%), Gaps = 14/149 (9%)
Query 13 RRLAAGELSCPSCGGVLAGWGRARSRQL---RGPAGPVELCPRRSRCTGCGV-THVLLPV 68
RL E SCP+CGGVL G S QL +E + C+ C V LP
Sbjct 29 HRLLPAETSCPACGGVLKEMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPP 88
Query 69 SALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPA-----ETVRGWLRRFAERVEAV 123
+ R +A ++ L +K + R + AR T+ W+ A+++ +
Sbjct 89 KPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLRPL 148
Query 124 RSVFTVWLCA-----VDADPVMPDAGGGG 147
++ D PV A G G
Sbjct 149 YIALNDYVLEAGKVHADDTPVKVLAPGNG 177
>gi|335572844|gb|EGM59215.1| transposase IS66 family protein [Shigella flexneri J1713]
Length=335
Score = 36.2 bits (82), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 38/156 (25%), Positives = 60/156 (39%), Gaps = 14/156 (8%)
Query 6 ADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQL---RGPAGPVELCPRRSRCTGCGV- 61
A++ + RL E SCP+CGGVL G S QL +E + C+ C V
Sbjct 126 AELPRETHRLLPAETSCPACGGVLKEMGETISEQLDIINTAFKVIETIRPKLACSRCDVI 185
Query 62 THVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPA-----ETVRGWLRRF 116
LP + R +A ++ L +K + R + AR T+ W+
Sbjct 186 VQAPLPPKPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWISEM 245
Query 117 AERVEAVRSVFTVWLCA-----VDADPVMPDAGGGG 147
A+++ + ++ D PV A G G
Sbjct 246 ADKLRPLYIALNDYVLEAGKVHADDTPVKVLAPGNG 281
>gi|335575330|gb|EGM61626.1| transposase IS66 family protein [Shigella flexneri J1713]
Length=533
Score = 36.2 bits (82), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 38/156 (25%), Positives = 60/156 (39%), Gaps = 14/156 (8%)
Query 6 ADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQL---RGPAGPVELCPRRSRCTGCGV- 61
A++ + RL E SCP+CGGVL G S QL +E + C+ C V
Sbjct 126 AELPRETHRLLPAETSCPACGGVLKEMGETISEQLDIINTAFKVIETIRPKLACSRCDVI 185
Query 62 THVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPA-----ETVRGWLRRF 116
LP + R +A ++ L +K + R + AR T+ W+
Sbjct 186 VQAPLPPKPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWISEM 245
Query 117 AERVEAVRSVFTVWLCA-----VDADPVMPDAGGGG 147
A+++ + ++ D PV A G G
Sbjct 246 ADKLRPLYIALNDYVLEAGKVHADDTPVKVLAPGNG 281
>gi|335575322|gb|EGM61618.1| transposase IS66 family protein [Shigella flexneri J1713]
Length=533
Score = 36.2 bits (82), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 38/156 (25%), Positives = 60/156 (39%), Gaps = 14/156 (8%)
Query 6 ADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQL---RGPAGPVELCPRRSRCTGCGV- 61
A++ + RL E SCP+CGGVL G S QL +E + C+ C V
Sbjct 126 AELPRETHRLLPAETSCPACGGVLKEMGETISEQLDIINTAFKVIETIRPKLACSRCDVI 185
Query 62 THVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPA-----ETVRGWLRRF 116
LP + R +A ++ L +K + R + AR T+ W+
Sbjct 186 VQAPLPPKPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWISEM 245
Query 117 AERVEAVRSVFTVWLCA-----VDADPVMPDAGGGG 147
A+++ + ++ D PV A G G
Sbjct 246 ADKLRPLYIALNDYVLEAGKVHADDTPVKVLAPGNG 281
>gi|345367754|gb|EGW99765.1| transposase IS66 family protein [Escherichia coli G58-1]
Length=533
Score = 36.2 bits (82), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 35/128 (28%), Positives = 55/128 (43%), Gaps = 6/128 (4%)
Query 6 ADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQL---RGPAGPVELCPRRSRCTGCGV- 61
A++ + RL E SCP+CGGVL G S QL +E + C+ C V
Sbjct 126 AELPRETHRLLPAETSCPACGGVLKEMGETISEQLDIINTAFKVIETIRPKLACSRCDVI 185
Query 62 THVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARP-AETVRGWLRRF-AER 119
LP + R +A ++ L +K + R + AR E R + R+ +E
Sbjct 186 VQAPLPPKPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEM 245
Query 120 VEAVRSVF 127
+ +R +F
Sbjct 246 ADKLRPLF 253
>gi|335572364|gb|EGM58744.1| transposase IS66 family protein [Shigella flexneri J1713]
gi|335573336|gb|EGM59693.1| transposase IS66 family protein [Shigella flexneri J1713]
gi|335573362|gb|EGM59719.1| transposase IS66 family protein [Shigella flexneri J1713]
Length=533
Score = 36.2 bits (82), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 38/156 (25%), Positives = 60/156 (39%), Gaps = 14/156 (8%)
Query 6 ADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQL---RGPAGPVELCPRRSRCTGCGV- 61
A++ + RL E SCP+CGGVL G S QL +E + C+ C V
Sbjct 126 AELPRETHRLLPAETSCPACGGVLKEMGETISEQLDIINTAFKVIETIRPKLACSRCDVI 185
Query 62 THVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPA-----ETVRGWLRRF 116
LP + R +A ++ L +K + R + AR T+ W+
Sbjct 186 VQAPLPPKPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWISEM 245
Query 117 AERVEAVRSVFTVWLCA-----VDADPVMPDAGGGG 147
A+++ + ++ D PV A G G
Sbjct 246 ADKLRPLYIALNDYVLEAGKVHADDTPVKVLAPGNG 281
>gi|320180591|gb|EFW55521.1| ISSfl4 ORF3 [Shigella boydii ATCC 9905]
Length=484
Score = 36.2 bits (82), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 38/156 (25%), Positives = 60/156 (39%), Gaps = 14/156 (8%)
Query 6 ADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQL---RGPAGPVELCPRRSRCTGCGV- 61
A++ + RL E SCP+CGGVL G S QL +E + C+ C V
Sbjct 77 AELPRETHRLLPAETSCPACGGVLKEMGETISEQLDIINTAFKVIETIRPKLACSRCDVI 136
Query 62 THVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPA-----ETVRGWLRRF 116
LP + R +A ++ L +K + R + AR T+ W+
Sbjct 137 VQAPLPPKPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQVVELSRNTMVRWVSEM 196
Query 117 AERVEAVRSVFTVWLCA-----VDADPVMPDAGGGG 147
A+++ + ++ D PV A G G
Sbjct 197 ADKLRPLYIALNDYVLEAGKVHADDTPVKVLAPGNG 232
>gi|335573262|gb|EGM59625.1| transposase IS66 family protein [Shigella flexneri J1713]
Length=477
Score = 36.2 bits (82), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 38/156 (25%), Positives = 60/156 (39%), Gaps = 14/156 (8%)
Query 6 ADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQL---RGPAGPVELCPRRSRCTGCGV- 61
A++ + RL E SCP+CGGVL G S QL +E + C+ C V
Sbjct 70 AELPRETHRLLPAETSCPACGGVLKEMGETISEQLDIINTAFKVIETIRPKLACSRCDVI 129
Query 62 THVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPA-----ETVRGWLRRF 116
LP + R +A ++ L +K + R + AR T+ W+
Sbjct 130 VQAPLPPKPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEM 189
Query 117 AERVEAVRSVFTVWLCA-----VDADPVMPDAGGGG 147
A+++ + ++ D PV A G G
Sbjct 190 ADKLRPLYIALNDYVLEAGKVHADDTPVKVLAPGNG 225
>gi|332097570|gb|EGJ02549.1| transposase IS66 family protein [Shigella dysenteriae 155-74]
Length=186
Score = 36.2 bits (82), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 31/117 (27%), Positives = 47/117 (41%), Gaps = 9/117 (7%)
Query 13 RRLAAGELSCPSCGGVLAGWGRARSRQL---RGPAGPVELCPRRSRCTGCGV-THVLLPV 68
RL E SCP+CGGVL G S QL +E + C+ C V LP
Sbjct 29 HRLLPAETSCPACGGVLKEMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPP 88
Query 69 SALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPA-----ETVRGWLRRFAERV 120
+ R +A ++ L +K + R + AR T+ W+ A+++
Sbjct 89 KPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKL 145
>gi|333017847|gb|EGK37154.1| transposase IS66 family protein [Shigella flexneri K-227]
Length=294
Score = 35.8 bits (81), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 41/165 (25%), Positives = 65/165 (40%), Gaps = 14/165 (8%)
Query 6 ADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQL---RGPAGPVELCPRRSRCTGCGV- 61
A++ + RL E SCP+CGGVL G S QL +E + C+ C V
Sbjct 126 AELPRETHRLLPAETSCPACGGVLKEMGETISEQLDIINTAFKVIETIRPKLACSRCDVI 185
Query 62 THVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPA-----ETVRGWLRRF 116
LP + R +A ++ L +K + R + AR T+ W+
Sbjct 186 VQAPLPPKPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEM 245
Query 117 AERVEAVRSVFTVWLC---AVDAD--PVMPDAGGGGFVDAVVAIG 156
A+++ + ++ V AD PV A G G ++G
Sbjct 246 ADKLRPLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTKNGSSVG 290
>gi|335574467|gb|EGM60791.1| transposase IS66 family protein [Shigella flexneri J1713]
Length=533
Score = 35.8 bits (81), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 37/149 (25%), Positives = 56/149 (38%), Gaps = 14/149 (9%)
Query 13 RRLAAGELSCPSCGGVLAGWGRARSRQL---RGPAGPVELCPRRSRCTGCGV-THVLLPV 68
RL E SCP+CGGVL G S QL +E + C+ C V LP
Sbjct 133 HRLLPAETSCPACGGVLKEMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPP 192
Query 69 SALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPA-----ETVRGWLRRFAERVEAV 123
+ R +A ++ L +K + R + AR T+ W+ A+++ +
Sbjct 193 KPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLRPL 252
Query 124 RSVFTVWLCA-----VDADPVMPDAGGGG 147
++ D PV A G G
Sbjct 253 YIALNDYVLEAGKVHADDTPVKVLAPGNG 281
>gi|320176337|gb|EFW51396.1| ISSfl4 ORF3 [Shigella dysenteriae CDC 74-1112]
Length=533
Score = 35.8 bits (81), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 38/156 (25%), Positives = 60/156 (39%), Gaps = 14/156 (8%)
Query 6 ADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQL---RGPAGPVELCPRRSRCTGCGV- 61
A++ + RL E SCP+CGGVL G S QL +E + C+ C V
Sbjct 126 AELPRETHRLLPAETSCPACGGVLKEMGETISEQLDIINTAFKVIETIRPKLACSRCDVI 185
Query 62 THVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPA-----ETVRGWLRRF 116
LP + R +A ++ L +K + R + AR T+ W+
Sbjct 186 VQAPLPPKPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEM 245
Query 117 AERVEAVRSVFTVWLCA-----VDADPVMPDAGGGG 147
A+++ + ++ D PV A G G
Sbjct 246 ADKLRPLYIALNDYVLEAGKVHADDTPVKVLAPGNG 281
>gi|333011052|gb|EGK30466.1| transposase IS66 family protein [Shigella flexneri K-272]
Length=547
Score = 35.8 bits (81), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 38/156 (25%), Positives = 60/156 (39%), Gaps = 14/156 (8%)
Query 6 ADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQL---RGPAGPVELCPRRSRCTGCGV- 61
A++ + RL E SCP+CGGVL G S QL +E + C+ C V
Sbjct 126 AELPRETHRLLPAETSCPACGGVLKEMGETISEQLDIINTAFKVIETIRPKLACSRCDVI 185
Query 62 THVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPA-----ETVRGWLRRF 116
LP + R +A ++ L +K + R + AR T+ W+
Sbjct 186 VQAPLPPKPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEM 245
Query 117 AERVEAVRSVFTVWLCA-----VDADPVMPDAGGGG 147
A+++ + ++ D PV A G G
Sbjct 246 ADKLRPLYIALNDYVLEAGKVHADDTPVKVLAPGNG 281
>gi|332756642|gb|EGJ86991.1| transposase IS66 family protein [Shigella flexneri K-671]
Length=317
Score = 35.8 bits (81), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 38/156 (25%), Positives = 60/156 (39%), Gaps = 14/156 (8%)
Query 6 ADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQL---RGPAGPVELCPRRSRCTGCGV- 61
A++ + RL E SCP+CGGVL G S QL +E + C+ C V
Sbjct 126 AELPRETHRLLPAETSCPACGGVLKEMGETISEQLDIINTAFKVIETIRPKLACSRCDVI 185
Query 62 THVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDVARPA-----ETVRGWLRRF 116
LP + R +A ++ L +K + R + AR T+ W+
Sbjct 186 VQAPLPPQPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEM 245
Query 117 AERVEAVRSVFTVWLCA-----VDADPVMPDAGGGG 147
A+++ + ++ D PV A G G
Sbjct 246 ADKLRPLYIALNDYVLEAGKVHADDTPVKVLAPGNG 281
Lambda K H
0.322 0.135 0.422
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 220408776486
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40