BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0613c
Length=855
Score E
Sequences producing significant alignments: (Bits) Value
gi|15607753|ref|NP_215127.1| hypothetical protein Rv0613c [Mycob... 1658 0.0
gi|289446169|ref|ZP_06435913.1| conserved hypothetical protein [... 1657 0.0
gi|289744327|ref|ZP_06503705.1| conserved hypothetical protein [... 1657 0.0
gi|148821815|ref|YP_001286569.1| hypothetical protein TBFG_10624... 1656 0.0
gi|340625636|ref|YP_004744088.1| hypothetical protein MCAN_06151... 1655 0.0
gi|308374058|ref|ZP_07667701.1| putative SEC-C motif containing ... 1182 0.0
gi|289752655|ref|ZP_06512033.1| conserved hypothetical protein [... 990 0.0
gi|315446130|ref|YP_004079009.1| SEC-C motif-containing protein,... 981 0.0
gi|120402155|ref|YP_951984.1| SecC motif-containing protein [Myc... 966 0.0
gi|254230949|ref|ZP_04924276.1| hypothetical protein TBCG_00608 ... 934 0.0
gi|118473383|ref|YP_885675.1| hypothetical protein MSMEG_1285 [M... 870 0.0
gi|240169386|ref|ZP_04748045.1| hypothetical protein MkanA1_0873... 832 0.0
gi|289568551|ref|ZP_06448778.1| hypothetical protein TBJG_01055 ... 815 0.0
gi|298524103|ref|ZP_07011512.1| conserved hypothetical protein [... 737 0.0
gi|289749112|ref|ZP_06508490.1| hypothetical protein TBDG_03748 ... 709 0.0
gi|289760736|ref|ZP_06520114.1| conserved hypothetical protein [... 648 0.0
gi|306781550|ref|ZP_07419887.1| hypothetical protein TMBG_03471 ... 630 3e-178
gi|333989271|ref|YP_004521885.1| hypothetical protein JDM601_063... 514 3e-143
gi|308374057|ref|ZP_07667700.1| hypothetical protein TMFG_03279 ... 467 4e-129
gi|226359545|ref|YP_002777323.1| hypothetical protein ROP_01310 ... 458 2e-126
gi|226365319|ref|YP_002783102.1| hypothetical protein ROP_59100 ... 327 6e-87
gi|325676580|ref|ZP_08156258.1| tetratricopeptide repeat family ... 299 1e-78
gi|312138728|ref|YP_004006064.1| hypothetical protein REQ_12860 ... 299 2e-78
gi|226308702|ref|YP_002768662.1| hypothetical protein RER_52150 ... 292 2e-76
gi|229488928|ref|ZP_04382794.1| tetratricopeptide repeat family ... 291 3e-76
gi|229489113|ref|ZP_04382979.1| tetratricopeptide repeat family ... 238 3e-60
gi|226308703|ref|YP_002768663.1| hypothetical protein RER_52160 ... 234 4e-59
gi|333989270|ref|YP_004521884.1| SecC motif-containing protein [... 192 2e-46
gi|111022811|ref|YP_705783.1| hypothetical protein RHA1_ro05848 ... 110 1e-21
gi|111025241|ref|YP_707661.1| hypothetical protein RHA1_ro08459 ... 89.0 3e-15
gi|333920963|ref|YP_004494544.1| Tetratricopeptide repeat family... 87.0 1e-14
gi|116624812|ref|YP_826968.1| SecC motif-containing protein [Can... 80.9 9e-13
gi|253699919|ref|YP_003021108.1| SEC-C motif domain protein [Geo... 80.1 2e-12
gi|31790367|gb|AAP58624.1| hypothetical protein [uncultured Acid... 77.4 1e-11
gi|316932200|ref|YP_004107182.1| hypothetical protein Rpdx1_0815... 68.2 6e-09
gi|338531363|ref|YP_004664697.1| hypothetical protein LILAB_0854... 68.2 7e-09
gi|153006122|ref|YP_001380447.1| SecC motif-containing protein [... 67.8 8e-09
gi|254451191|ref|ZP_05064628.1| conserved hypothetical protein [... 67.0 1e-08
gi|257094679|ref|YP_003168320.1| SEC-C motif domain-containing p... 66.6 2e-08
gi|148655739|ref|YP_001275944.1| SecC motif-containing protein [... 65.9 3e-08
gi|156743381|ref|YP_001433510.1| SecC motif-containing protein [... 64.3 1e-07
gi|146279553|ref|YP_001169711.1| hypothetical protein Rsph17025_... 62.8 3e-07
gi|309791557|ref|ZP_07686056.1| SecC motif-containing protein [O... 61.6 6e-07
gi|339483667|ref|YP_004695453.1| SEC-C motif domain-containing p... 60.1 2e-06
gi|333978065|ref|YP_004516010.1| SEC-C motif domain-containing p... 57.4 1e-05
gi|108757805|ref|YP_628325.1| hypothetical protein MXAN_0042 [My... 56.6 2e-05
gi|163848371|ref|YP_001636415.1| SecC motif-containing protein [... 54.3 1e-04
gi|258512924|ref|YP_003189181.1| hypothetical protein APA01_4014... 54.3 1e-04
gi|163797760|ref|ZP_02191707.1| hypothetical protein BAL199_1394... 53.9 1e-04
gi|172058433|ref|YP_001814893.1| preprotein translocase, SecA su... 53.5 1e-04
>gi|15607753|ref|NP_215127.1| hypothetical protein Rv0613c [Mycobacterium tuberculosis H37Rv]
gi|15840015|ref|NP_335052.1| hypothetical protein MT0643 [Mycobacterium tuberculosis CDC1551]
gi|31791796|ref|NP_854289.1| hypothetical protein Mb0630c [Mycobacterium bovis AF2122/97]
37 more sequence titles
Length=855
Score = 1658 bits (4294), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 854/855 (99%), Positives = 855/855 (100%), Gaps = 0/855 (0%)
Query 1 VAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
+AEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR
Sbjct 1 MAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
Query 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 120
WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG
Sbjct 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 120
Query 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD
Sbjct 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
Query 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG
Sbjct 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
Query 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA 300
GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA
Sbjct 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA 300
Query 301 AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR 360
AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR
Sbjct 301 AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR 360
Query 361 AARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLAL 420
AARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLAL
Sbjct 361 AARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLAL 420
Query 421 LRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWL 480
LRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWL
Sbjct 421 LRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWL 480
Query 481 YAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFL 540
YAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFL
Sbjct 481 YAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFL 540
Query 541 EVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLR 600
EVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLR
Sbjct 541 EVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLR 600
Query 601 AGQLICARPVPAGDTMVFFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSRRFAPPTLV 660
AGQLICARPVPAGDTMVFFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSRRFAPPTLV
Sbjct 601 AGQLICARPVPAGDTMVFFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSRRFAPPTLV 660
Query 661 NTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVRATLVLD 720
NTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVRATLVLD
Sbjct 661 NTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVRATLVLD 720
Query 721 GDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAGAPD 780
GDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAGAPD
Sbjct 721 GDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAGAPD 780
Query 781 PDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTFPAGAGA 840
PDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTFPAGAGA
Sbjct 781 PDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTFPAGAGA 840
Query 841 RGGMDADRLRTALGL 855
RGGMDADRLRTALGL
Sbjct 841 RGGMDADRLRTALGL 855
>gi|289446169|ref|ZP_06435913.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
gi|289419127|gb|EFD16328.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
Length=855
Score = 1657 bits (4292), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 853/855 (99%), Positives = 854/855 (99%), Gaps = 0/855 (0%)
Query 1 VAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
+AEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR
Sbjct 1 MAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
Query 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 120
WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG
Sbjct 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 120
Query 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD
Sbjct 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
Query 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG
Sbjct 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
Query 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA 300
GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA
Sbjct 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA 300
Query 301 AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR 360
AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR
Sbjct 301 AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR 360
Query 361 AARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLAL 420
AARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERG AL
Sbjct 361 AARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGSAL 420
Query 421 LRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWL 480
LRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWL
Sbjct 421 LRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWL 480
Query 481 YAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFL 540
YAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFL
Sbjct 481 YAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFL 540
Query 541 EVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLR 600
EVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLR
Sbjct 541 EVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLR 600
Query 601 AGQLICARPVPAGDTMVFFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSRRFAPPTLV 660
AGQLICARPVPAGDTMVFFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSRRFAPPTLV
Sbjct 601 AGQLICARPVPAGDTMVFFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSRRFAPPTLV 660
Query 661 NTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVRATLVLD 720
NTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVRATLVLD
Sbjct 661 NTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVRATLVLD 720
Query 721 GDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAGAPD 780
GDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAGAPD
Sbjct 721 GDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAGAPD 780
Query 781 PDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTFPAGAGA 840
PDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTFPAGAGA
Sbjct 781 PDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTFPAGAGA 840
Query 841 RGGMDADRLRTALGL 855
RGGMDADRLRTALGL
Sbjct 841 RGGMDADRLRTALGL 855
>gi|289744327|ref|ZP_06503705.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|289756694|ref|ZP_06516072.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
gi|294996128|ref|ZP_06801819.1| hypothetical protein Mtub2_16910 [Mycobacterium tuberculosis
210]
gi|289684855|gb|EFD52343.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|289712258|gb|EFD76270.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
gi|326905164|gb|EGE52097.1| preprotein translocase subunit SecA [Mycobacterium tuberculosis
W-148]
gi|339293652|gb|AEJ45763.1| hypothetical protein CCDC5079_0573 [Mycobacterium tuberculosis
CCDC5079]
gi|339297293|gb|AEJ49403.1| hypothetical protein CCDC5180_0566 [Mycobacterium tuberculosis
CCDC5180]
Length=855
Score = 1657 bits (4291), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 853/855 (99%), Positives = 854/855 (99%), Gaps = 0/855 (0%)
Query 1 VAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
+AEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR
Sbjct 1 MAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
Query 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 120
WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG
Sbjct 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 120
Query 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD
Sbjct 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
Query 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG
Sbjct 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
Query 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA 300
GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA
Sbjct 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA 300
Query 301 AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR 360
AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR
Sbjct 301 AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR 360
Query 361 AARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLAL 420
AARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLAL
Sbjct 361 AARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLAL 420
Query 421 LRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWL 480
LRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWL
Sbjct 421 LRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWL 480
Query 481 YAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFL 540
YAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFL
Sbjct 481 YAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFL 540
Query 541 EVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLR 600
EVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLR
Sbjct 541 EVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLR 600
Query 601 AGQLICARPVPAGDTMVFFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSRRFAPPTLV 660
AGQLICARPVPAGDTMVFFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSRRFAPPTLV
Sbjct 601 AGQLICARPVPAGDTMVFFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSRRFAPPTLV 660
Query 661 NTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVRATLVLD 720
NTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVRATLVLD
Sbjct 661 NTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVRATLVLD 720
Query 721 GDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAGAPD 780
GDTLRVE NSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAGAPD
Sbjct 721 GDTLRVEANSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAGAPD 780
Query 781 PDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTFPAGAGA 840
PDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTFPAGAGA
Sbjct 781 PDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTFPAGAGA 840
Query 841 RGGMDADRLRTALGL 855
RGGMDADRLRTALGL
Sbjct 841 RGGMDADRLRTALGL 855
>gi|148821815|ref|YP_001286569.1| hypothetical protein TBFG_10624 [Mycobacterium tuberculosis F11]
gi|253797550|ref|YP_003030551.1| hypothetical protein TBMG_00621 [Mycobacterium tuberculosis KZN
1435]
gi|254549571|ref|ZP_05140018.1| hypothetical protein Mtube_03760 [Mycobacterium tuberculosis
'98-R604 INH-RIF-EM']
9 more sequence titles
Length=855
Score = 1656 bits (4289), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 853/855 (99%), Positives = 854/855 (99%), Gaps = 0/855 (0%)
Query 1 VAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
+AEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR
Sbjct 1 MAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
Query 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 120
WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPI TLCEHEEYGRLADGSAARIVLAG
Sbjct 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPIITLCEHEEYGRLADGSAARIVLAG 120
Query 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD
Sbjct 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
Query 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG
Sbjct 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
Query 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA 300
GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA
Sbjct 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA 300
Query 301 AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR 360
AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR
Sbjct 301 AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR 360
Query 361 AARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLAL 420
AARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLAL
Sbjct 361 AARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLAL 420
Query 421 LRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWL 480
LRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWL
Sbjct 421 LRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWL 480
Query 481 YAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFL 540
YAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFL
Sbjct 481 YAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFL 540
Query 541 EVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLR 600
EVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLR
Sbjct 541 EVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLR 600
Query 601 AGQLICARPVPAGDTMVFFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSRRFAPPTLV 660
AGQLICARPVPAGDTMVFFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSRRFAPPTLV
Sbjct 601 AGQLICARPVPAGDTMVFFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSRRFAPPTLV 660
Query 661 NTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVRATLVLD 720
NTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVRATLVLD
Sbjct 661 NTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVRATLVLD 720
Query 721 GDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAGAPD 780
GDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAGAPD
Sbjct 721 GDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAGAPD 780
Query 781 PDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTFPAGAGA 840
PDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTFPAGAGA
Sbjct 781 PDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTFPAGAGA 840
Query 841 RGGMDADRLRTALGL 855
RGGMDADRLRTALGL
Sbjct 841 RGGMDADRLRTALGL 855
>gi|340625636|ref|YP_004744088.1| hypothetical protein MCAN_06151 [Mycobacterium canettii CIPT
140010059]
gi|340003826|emb|CCC42955.1| hypothetical protein MCAN_06151 [Mycobacterium canettii CIPT
140010059]
Length=855
Score = 1655 bits (4286), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 853/855 (99%), Positives = 854/855 (99%), Gaps = 0/855 (0%)
Query 1 VAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
+AEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR
Sbjct 1 MAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
Query 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 120
WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG
Sbjct 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 120
Query 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD
Sbjct 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
Query 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG
Sbjct 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
Query 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA 300
GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA
Sbjct 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA 300
Query 301 AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR 360
AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR
Sbjct 301 AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR 360
Query 361 AARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLAL 420
AARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLAL
Sbjct 361 AARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLAL 420
Query 421 LRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWL 480
LRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWL
Sbjct 421 LRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWL 480
Query 481 YAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFL 540
YAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFL
Sbjct 481 YAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFL 540
Query 541 EVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLR 600
EVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLR
Sbjct 541 EVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLR 600
Query 601 AGQLICARPVPAGDTMVFFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSRRFAPPTLV 660
AGQLICARPVPAGDTMVFFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSRRFAPPTLV
Sbjct 601 AGQLICARPVPAGDTMVFFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSRRFAPPTLV 660
Query 661 NTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVRATLVLD 720
NTEGDSLAICEASVRV DPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVRATLVLD
Sbjct 661 NTEGDSLAICEASVRVGDPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVRATLVLD 720
Query 721 GDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAGAPD 780
GDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAGAPD
Sbjct 721 GDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAGAPD 780
Query 781 PDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTFPAGAGA 840
PDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTFPAGAGA
Sbjct 781 PDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTFPAGAGA 840
Query 841 RGGMDADRLRTALGL 855
RGGMDADRLRTALGL
Sbjct 841 RGGMDADRLRTALGL 855
>gi|308374058|ref|ZP_07667701.1| putative SEC-C motif containing protein [Mycobacterium tuberculosis
SUMu006]
gi|308343213|gb|EFP32064.1| putative SEC-C motif containing protein [Mycobacterium tuberculosis
SUMu006]
Length=638
Score = 1182 bits (3057), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 610/611 (99%), Positives = 611/611 (100%), Gaps = 0/611 (0%)
Query 1 VAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
+AEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR
Sbjct 1 MAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
Query 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 120
WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG
Sbjct 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 120
Query 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD
Sbjct 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
Query 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG
Sbjct 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
Query 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA 300
GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA
Sbjct 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA 300
Query 301 AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR 360
AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR
Sbjct 301 AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR 360
Query 361 AARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLAL 420
AARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLAL
Sbjct 361 AARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLAL 420
Query 421 LRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWL 480
LRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWL
Sbjct 421 LRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWL 480
Query 481 YAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFL 540
YAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFL
Sbjct 481 YAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFL 540
Query 541 EVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLR 600
EVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLR
Sbjct 541 EVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLR 600
Query 601 AGQLICARPVP 611
AGQLICARPVP
Sbjct 601 AGQLICARPVP 611
>gi|289752655|ref|ZP_06512033.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
gi|289693242|gb|EFD60671.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
Length=641
Score = 990 bits (2559), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 560/615 (92%), Positives = 563/615 (92%), Gaps = 9/615 (1%)
Query 1 VAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
+AEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR
Sbjct 1 MAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
Query 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 120
WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG
Sbjct 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 120
Query 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD
Sbjct 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
Query 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG
Sbjct 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
Query 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA 300
GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA
Sbjct 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA 300
Query 301 AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR 360
AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGG G +
Sbjct 301 AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGGGRAGPAHRDAGAQGAA 360
Query 361 AARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLAL 420
V VR ALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLAL
Sbjct 361 RGGVCVRCCSRSALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLAL 420
Query 421 LRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWL 480
LRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWL
Sbjct 421 LRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWL 480
Query 481 YAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFL 540
YAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFL
Sbjct 481 YAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFL 540
Query 541 EVRGSLLPDDERLLAE----QWLLVERSVFEVEH-VQPGEGVIVRDVR-TGDTHEVHERA 594
EVRGSLLPDDERLL W +VEH GVIVRDVR VHERA
Sbjct 541 EVRGSLLPDDERLLCRANGCSW---SGRCSKVEHRATLARGVIVRDVRPRRHPMRVHERA 597
Query 595 ASRQLRAGQLICARP 609
ASRQLRAGQLICARP
Sbjct 598 ASRQLRAGQLICARP 612
>gi|315446130|ref|YP_004079009.1| SEC-C motif-containing protein,tetratricopeptide repeat protein
[Mycobacterium sp. Spyr1]
gi|315264433|gb|ADU01175.1| SEC-C motif-containing protein,tetratricopeptide repeat protein
[Mycobacterium sp. Spyr1]
Length=841
Score = 981 bits (2537), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 525/858 (62%), Positives = 630/858 (74%), Gaps = 21/858 (2%)
Query 1 VAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
+A D T +ARIL E+GPL EDDIA RL ++G+ +PD VL L E + PA LVD+
Sbjct 1 MAAQLDPTDVLARILVENGPLREDDIAHRLREAGIRNPDDVLPQLLNEIDCPAVPLVDES 60
Query 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 120
WVWLP L+AGR+ THR+ A E HD+L VTPDLDP+T LCE EE+ RLADG+ + L
Sbjct 61 WVWLPELMAGRILTHRVDAQELAHDILIVTPDLDPLTHLCEFEEFARLADGTPISVALPA 120
Query 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
+DE++L+ RG+P + +D GGAL+L PGTL+ L AAG+LVG+RLT AGL L R+ TA
Sbjct 121 FDEDVLDERGVPPDMVDDGGALILPPGTLSDLDVAAGELVGLRLTPAGLTLTRV-TAETS 179
Query 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
SVGA LA +DP+EP F +AVW+ACV D A FTEPV PL EI D H L D LAP
Sbjct 180 PSVGAALAATLDPEEPVAFDSAVWSACVADTALFTEPVEPLSEIADAHNLVRRLDVLAPA 239
Query 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEA--TDPDELPRDVLA 298
F++D WRF+ CE+++ R+D+ + A+AL TL+ ++E MS LL +D +E+P
Sbjct 240 EFDYDRWRFDIDCEVMSRRYDIAEDAALALRTLVSIYEQMSQLLTMPLSDDEEVPDGDAP 299
Query 299 TAAETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKV 358
T A +L+ + GA LADP LAE+L+AETVG + GAAALGL E LEP+V
Sbjct 300 TPALAGYR-------ELVAEFGAELADPHLAEVLLAETVGRNPDGAAALGLFAETLEPQV 352
Query 359 PRAARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGL 418
PR ARVA RWLR+VAL+RIGD+EAAERELL AE+MDT+WPLPL DLARIASDRGD ER L
Sbjct 353 PRRARVAFRWLRSVALERIGDIEAAERELLTAETMDTDWPLPLYDLARIASDRGDVERAL 412
Query 419 ALLRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVD 478
ALL RAG PD PL +L HR + R D+GRNE CWCGSGRKYKKCHLG+E L L +RV
Sbjct 413 ALLHRAGAAPDDPLFTVLRSHRPEARPDVGRNEPCWCGSGRKYKKCHLGKEQLSLPDRVG 472
Query 479 WLYAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAE 538
WLY+KA+QHAL W GL+AE++YER R+ D+ DE DP+V+DAVLFEG A A+
Sbjct 473 WLYSKAAQHALISGWRGLVAELNYERNRHHDAPDE------AIDPVVIDAVLFEGEALAD 526
Query 539 FLEVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQ 598
FL VRG LLPDDER LAEQWLL ERS+FEVE V PG G++VRDVRTGD HEV ER ASRQ
Sbjct 527 FLAVRGPLLPDDERSLAEQWLLSERSLFEVEEVHPGRGLVVRDVRTGDVHEVRERTASRQ 586
Query 599 LRAGQLICARPVPAGD-TMVFFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSRRFAPP 657
L A QLIC R +P GD T FGG+EPVALH+R+ L+ELLD+ P PV +VA LSRRFAPP
Sbjct 587 LTARQLICTRLLPIGDGTAQLFGGVEPVALHDRSALMELLDEGPAPVEIVAFLSRRFAPP 646
Query 658 TLVNTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVRATL 717
L NTEGD L ICEA VRV D A ++ ALDG YDRV+ E P+W EHVT DGM R+RAT+
Sbjct 647 MLTNTEGDPLMICEAVVRVSDSARMEAALDGTYDRVEDAELPQWFEHVTTDGMDRIRATV 706
Query 718 VLDGDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAG 777
VL+GDTLRVETNSE RMDRVL TL RLDP M V+DD R P+ + RE AA +MP T G
Sbjct 707 VLEGDTLRVETNSEERMDRVLETLERLDPGMQVVDDTRTPMDDPREMAA---KMPATAKG 763
Query 778 APDPDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTFPAG 837
A DP+ P++AA L+ I DYET WLD+ IPALDGHTPRQAADDPTRR DLI+LLD+FP
Sbjct 764 AVDPEDPQVAALLDAMILDYETKWLDESIPALDGHTPRQAADDPTRRPDLIRLLDSFPTD 823
Query 838 AGARGGMDADRLRTALGL 855
AG R M+ADRLR ALGL
Sbjct 824 AG-RHAMNADRLRAALGL 840
>gi|120402155|ref|YP_951984.1| SecC motif-containing protein [Mycobacterium vanbaalenii PYR-1]
gi|119954973|gb|ABM11978.1| SEC-C motif domain protein [Mycobacterium vanbaalenii PYR-1]
Length=864
Score = 966 bits (2496), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 531/863 (62%), Positives = 621/863 (72%), Gaps = 24/863 (2%)
Query 1 VAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
VA D T+A+A IL EHGPL +D IA RL + GVADP+ +L L E + PA LVD+
Sbjct 17 VAAQLDPTKALATILIEHGPLPKDAIAHRLREMGVADPEDLLPRLLNEIDCPAVPLVDES 76
Query 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 120
WVWLP LLAGR+FTHR+ A E HD+L VTPDLD IT LCE E Y RL DGS + L
Sbjct 77 WVWLPALLAGRIFTHRVTAQELAHDVLLVTPDLDAITHLCEFEPYARLVDGSPVSVALPA 136
Query 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
+DE++L+ RG+P E +D GGAL+L PGTL LG GDLVG+RL GL L ++ TA
Sbjct 137 FDEDVLDERGVPPEMVDDGGALVLPPGTLRGLGVVDGDLVGLRLAPEGLTLAQV-TADPS 195
Query 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
+VGA LA ++PDEP F +AVWTACV A FTEP PL EI HGL D LAP
Sbjct 196 PAVGAALAATLNPDEPVSFDSAVWTACVAHSALFTEPTPPLSEIAADHGLEARGDLLAPA 255
Query 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA 300
GF+FD WRFE CE+L R+DLD +DA+A+ L ++E MS L+ LP D
Sbjct 256 GFDFDRWRFELDCEVLKRRYDLDDDDALAVRALKAIYEQMSQLITM-----LPADEDFAD 310
Query 301 AETATETGS-------DSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEM 353
D +L+ ++GA LADP LAE+LVAET+G D GAAALGL E
Sbjct 311 DADEDAAAEDAPVPPLDGYQELVAEMGAELADPRLAEVLVAETLGRDPDGAAALGLFAET 370
Query 354 LEPKVPRAARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGD 413
LEP+VPR ARVA RWLRAVAL+R+GD+E+AERELLAAESMDT+WPLPL DLARIASDRGD
Sbjct 371 LEPQVPRRARVAFRWLRAVALERMGDIESAERELLAAESMDTDWPLPLYDLARIASDRGD 430
Query 414 AERGLALLRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPL 473
ERGLALLRRAG +PD PL+ +L R + R D+GRNE CWCGSGRKYKKCHLG+E L
Sbjct 431 VERGLALLRRAGADPDDPLLEMLSSFRGEARPDVGRNEPCWCGSGRKYKKCHLGKEQPAL 490
Query 474 AERVDWLYAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEG 533
ERV WLY+KA+QHAL W L+ E+ YER ++ D+ DE ADPLV+DA LFEG
Sbjct 491 PERVGWLYSKAAQHALMSGWRALIVELDYERNQFHDAPDE------TADPLVVDAALFEG 544
Query 534 GAFAEFLEVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHER 593
GAF +FL VRG LLP+DER LAEQWLLV+RS+FEV V+PG GV VRDVRTGD HEV ER
Sbjct 545 GAFEDFLAVRGPLLPEDERSLAEQWLLVDRSLFEVGQVRPGHGVTVRDVRTGDIHEVQER 604
Query 594 AASRQLRAGQLICARPVPAG-DTMVFFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSR 652
ASR L++GQLIC R +PAG DT FFGG+EPVALH+R LI LLD+ PDPV LVA LS
Sbjct 605 TASRHLKSGQLICTRVLPAGDDTAQFFGGLEPVALHQRDALIALLDEGPDPVDLVAFLSL 664
Query 653 RFAPPTLVNTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLR 712
RFAPPTL NTEGD + ICEA+V V D A IQ ALD YDRV+ EPP+W EHVT GM R
Sbjct 665 RFAPPTLTNTEGDPMIICEATVHVGDAASIQSALDDTYDRVEDAEPPKWFEHVTTHGMER 724
Query 713 VRATLVLDGDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMP 772
+RATL LDG+TLRVETNSE RMDRVLAT+ RLDPAM VL+D R + + R A+A +MP
Sbjct 725 IRATLALDGETLRVETNSEERMDRVLATVARLDPAMQVLEDTREYVDDPR---AMAARMP 781
Query 773 VTGAGAPDPDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLD 832
T GA +PD PE+AA L+ I DYE WLD+PIPALDGHTPRQAADDPTRR DLI+LLD
Sbjct 782 ETAKGAIEPDDPEIAALLDAMILDYEAKWLDEPIPALDGHTPRQAADDPTRRPDLIRLLD 841
Query 833 TFPAGAGARGGMDADRLRTALGL 855
+FPA AG R GM DRLR ALGL
Sbjct 842 SFPADAG-RHGMSVDRLRAALGL 863
>gi|254230949|ref|ZP_04924276.1| hypothetical protein TBCG_00608 [Mycobacterium tuberculosis C]
gi|124600008|gb|EAY59018.1| hypothetical protein TBCG_00608 [Mycobacterium tuberculosis C]
Length=910
Score = 934 bits (2415), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 500/501 (99%), Positives = 500/501 (99%), Gaps = 0/501 (0%)
Query 355 EPKVPRAARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDA 414
EPKVPRAARVA RWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDA
Sbjct 410 EPKVPRAARVAGRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDA 469
Query 415 ERGLALLRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLA 474
ERGLALLRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLA
Sbjct 470 ERGLALLRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLA 529
Query 475 ERVDWLYAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGG 534
ERVDWLYAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGG
Sbjct 530 ERVDWLYAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGG 589
Query 535 AFAEFLEVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERA 594
AFAEFLEVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERA
Sbjct 590 AFAEFLEVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERA 649
Query 595 ASRQLRAGQLICARPVPAGDTMVFFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSRRF 654
ASRQLRAGQLICARPVPAGDTMVFFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSRRF
Sbjct 650 ASRQLRAGQLICARPVPAGDTMVFFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSRRF 709
Query 655 APPTLVNTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVR 714
APPTLVNTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVR
Sbjct 710 APPTLVNTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVR 769
Query 715 ATLVLDGDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVT 774
ATLVLDGDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVT
Sbjct 770 ATLVLDGDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVT 829
Query 775 GAGAPDPDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTF 834
GAGAPDPDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTF
Sbjct 830 GAGAPDPDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTF 889
Query 835 PAGAGARGGMDADRLRTALGL 855
PAGAGARGGMDADRLRTALGL
Sbjct 890 PAGAGARGGMDADRLRTALGL 910
Score = 545 bits (1405), Expect = 1e-152, Method: Compositional matrix adjust.
Identities = 277/278 (99%), Positives = 278/278 (100%), Gaps = 0/278 (0%)
Query 1 VAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
VAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR
Sbjct 4 VAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 63
Query 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 120
WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG
Sbjct 64 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 123
Query 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD
Sbjct 124 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 183
Query 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG
Sbjct 184 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 243
Query 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHE 278
GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLH+
Sbjct 244 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHD 281
>gi|118473383|ref|YP_885675.1| hypothetical protein MSMEG_1285 [Mycobacterium smegmatis str.
MC2 155]
gi|118174670|gb|ABK75566.1| tetratricopeptide repeat family protein [Mycobacterium smegmatis
str. MC2 155]
Length=801
Score = 870 bits (2248), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 505/849 (60%), Positives = 585/849 (69%), Gaps = 54/849 (6%)
Query 8 TQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDRWVWLPTL 67
T AVARILAE GPL D+I R L SG P+ V+ L + P LVDDRWVWLPT+
Sbjct 5 TDAVARILAEQGPLHTDEIERLLQASGEPVPEPVVDELSM----PVGMLVDDRWVWLPTV 60
Query 68 LAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAGYDEELLE 127
L GRVFTHRL A E HDML DLDP++ L +EY RLADGS +A YD+ELLE
Sbjct 61 LDGRVFTHRLSAHEVAHDMLDAAVDLDPVSDLFHLDEYLRLADGSPVSFAVADYDDELLE 120
Query 128 RRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGADTSVGARL 187
RGIP E G +LL PGTLA L AA GDLVG+RLT GL LE I T D +G RL
Sbjct 121 DRGIPLELAGESGVVLLAPGTLAALKAAEGDLVGLRLTDQGLALETIETV-VDADIGNRL 179
Query 188 AELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPGGFNFDAW 247
AE++ DEP F AA T C +DP F E APL E++ + GL + D ++AP GF+F W
Sbjct 180 AEVLPGDEPTFVDAAALTLCAEDPTVFVEATAPLSEVIREAGLAYSDGFIAPAGFDFGRW 239
Query 248 RFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATAAETATET 307
RFE C A H LDP+DAVAL TLI E +++ D D LP
Sbjct 240 RFETACHRSADTHGLDPDDAVALQTLIMALEQLTV-----DADSLP-------------- 280
Query 308 GSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPRAARVAVR 367
LL GAAL +P++A+ LV ETV G +L LTE LE +VPR AR AVR
Sbjct 281 -------LLRRAGAALENPVVADALVEETVDAGRGSPESLSRLTEALEAQVPRPARAAVR 333
Query 368 WLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLALLRRAGTE 427
WLRA AL+R GD+ AAERELLAAE+MDTEWPL L+DLA IASDRGDAER LALLRRAG
Sbjct 334 WLRATALERAGDIAAAERELLAAETMDTEWPLTLVDLAHIASDRGDAERALALLRRAGFP 393
Query 428 PDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWLYAKASQH 487
PDHP V+ L+R+ PR DLGRNE CWCGSGRKYKKCHLG E L L ER WLY+KA+QH
Sbjct 394 PDHPNVQFLQRYLVAPRPDLGRNEPCWCGSGRKYKKCHLGNEQLSLEERAAWLYSKAAQH 453
Query 488 ALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFLEVRGSLL 547
W G+L E++ ER RYAD D DA+A A++DPLV+DA+L EG AFA+FL VRG LL
Sbjct 454 VSETHWHGMLLELALERSRYAD-DLHDAIAEAMSDPLVMDALLHEGDAFADFLRVRGPLL 512
Query 548 PDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLRAGQLICA 607
PDDER LAEQWLLV+RSVFEV+ V+PGE V VRDVRTGD HEV ER ASR+++ G+L+CA
Sbjct 513 PDDERALAEQWLLVDRSVFEVQAVRPGETVTVRDVRTGDRHEVRERLASREVKPGELLCA 572
Query 608 RPVPAGDTMVFFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSRRFAPPTLVNTEGDSL 667
R +P G M FFGGIE V+L ER LIELLD PD VTLVA L+RRFAPPTL NTEGD L
Sbjct 573 RVLPTGSIMQFFGGIERVSLGERDELIELLDSRPDEVTLVAALTRRFAPPTLTNTEGDLL 632
Query 668 AICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVRATLVLDGDTLRVE 727
+CEA+VR DPA ALD VY R D ++PP W EHV G ++RA+L LDGDTLRVE
Sbjct 633 MVCEAAVRFADPA----ALDAVYVRAD-DDPPHWFEHVP--GKPQIRASLKLDGDTLRVE 685
Query 728 TNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRN-TREAAALAEQMPVTGAGAPDPDSPEL 786
TNSE RMDRVLA L RLDPAMTVL++ RRP+ T + L E PD PE+
Sbjct 686 TNSEERMDRVLAELGRLDPAMTVLEESRRPISEVTPPSRELLE-----------PDDPEM 734
Query 787 AAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTFPAGAGARGGMDA 846
AA+EEF+RDYET WLD+ IPAL+G TPRQAADDPTRR DLIKLLD+FPA GM+A
Sbjct 735 IAAMEEFMRDYETRWLDESIPALNGLTPRQAADDPTRRGDLIKLLDSFPA---TERGMNA 791
Query 847 DRLRTALGL 855
DRLR ALGL
Sbjct 792 DRLRAALGL 800
>gi|240169386|ref|ZP_04748045.1| hypothetical protein MkanA1_08738 [Mycobacterium kansasii ATCC
12478]
Length=665
Score = 832 bits (2149), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 435/654 (67%), Positives = 505/654 (78%), Gaps = 8/654 (1%)
Query 1 VAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
VA+A ++ +ARIL EHGPLS +DIA RLL+ V DPD ++ E + ARQLVD++
Sbjct 4 VADAVGESETLARILTEHGPLSAEDIAARLLERDVTDPDTLVDRFLDELDCAARQLVDEK 63
Query 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 120
WVWLP +LAGRVFTHRL A+E +D L VTPDLDP+T LCEHE+Y RLADGS A VLA
Sbjct 64 WVWLPAVLAGRVFTHRLCAEELAYDALNVTPDLDPVTALCEHEQYQRLADGSPAGAVLAD 123
Query 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
+D+ELLERRGIPDEA+DP G LLL PGTL LG + GDL GVRLT GLV+ER+ +
Sbjct 124 FDDELLERRGIPDEAVDPAGTLLLAPGTLGALGLSEGDLAGVRLTEEGLVVERVTEVASA 183
Query 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
VGARLA +D +P +F AAVWT C ++P FTEP+ PL EI+D HGL +WLAPG
Sbjct 184 QGVGARLAATLDAGQPTYFDAAVWTVCAEEPQLFTEPLPPLSEIVDDHGLARHGEWLAPG 243
Query 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA 300
GF+F AW F+ CE LA RH+LDPNDAV LY L+ LH+ +++LL+A P + LA A
Sbjct 244 GFDFGAWHFKRGCEALAERHELDPNDAVTLYVLVTLHDQIAVLLDAGVA---PEEALAAA 300
Query 301 AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR 360
E A TG D +DL+G+ GAALADPLLAELLVAET+GTD G A LGL E+LE KVPR
Sbjct 301 TEGA--TGPD--MDLVGEFGAALADPLLAELLVAETIGTDRVGTAGLGLFAEVLESKVPR 356
Query 361 AARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLAL 420
ARVA RWLRAVAL+R+GD+EAAEREL+AAE+M+ +WPLPL DLARIASDRGD ERGL L
Sbjct 357 PARVACRWLRAVALERLGDIEAAERELVAAETMNPDWPLPLFDLARIASDRGDVERGLTL 416
Query 421 LRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWL 480
LRRAG EPD+PLV LLE +RA+PRRD+GRN+ CWCGSGRKYKKCHLGRE LPL +R WL
Sbjct 417 LRRAGAEPDYPLVELLEMYRAEPRRDVGRNDLCWCGSGRKYKKCHLGREQLPLDDRARWL 476
Query 481 YAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFL 540
YAKA QHAL W LL EV+ ER R+A DD DAL AL DPLV+DAVLFEGGAF EFL
Sbjct 477 YAKAIQHALVSGWNDLLIEVADERSRHA-GDDPDALNTALGDPLVIDAVLFEGGAFEEFL 535
Query 541 EVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLR 600
E+RGSLLPDDERLLA+QWL V RSVFEVE VQ G V VRD+RTGDTH+V E AA RQL+
Sbjct 536 EIRGSLLPDDERLLAQQWLSVPRSVFEVERVQRGYSVTVRDLRTGDTHQVREHAAGRQLK 595
Query 601 AGQLICARPVPAGDTMVFFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSRRF 654
GQL+CAR +PAGDTM FFGG+EPVALHER LI LLD EPD VTLVA LS RF
Sbjct 596 TGQLVCARALPAGDTMRFFGGVEPVALHERDRLINLLDTEPDAVTLVAYLSLRF 649
>gi|289568551|ref|ZP_06448778.1| hypothetical protein TBJG_01055 [Mycobacterium tuberculosis T17]
gi|289542305|gb|EFD45953.1| hypothetical protein TBJG_01055 [Mycobacterium tuberculosis T17]
Length=468
Score = 815 bits (2106), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/431 (99%), Positives = 429/431 (99%), Gaps = 0/431 (0%)
Query 1 VAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
+AEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR
Sbjct 1 MAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
Query 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 120
WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG
Sbjct 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 120
Query 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD
Sbjct 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
Query 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG
Sbjct 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
Query 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA 300
GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA
Sbjct 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA 300
Query 301 AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR 360
AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR
Sbjct 301 AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR 360
Query 361 AARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLAL 420
AARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLAL
Sbjct 361 AARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLAL 420
Query 421 LRRAGTEPDHP 431
LRRAGTEP P
Sbjct 421 LRRAGTEPRPP 431
>gi|298524103|ref|ZP_07011512.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|298493897|gb|EFI29191.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
Length=468
Score = 737 bits (1902), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/388 (99%), Positives = 387/388 (99%), Gaps = 0/388 (0%)
Query 1 VAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
+AEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR
Sbjct 1 MAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
Query 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 120
WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEY RLADGSAARIVLAG
Sbjct 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYCRLADGSAARIVLAG 120
Query 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD
Sbjct 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
Query 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG
Sbjct 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
Query 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA 300
GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA
Sbjct 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA 300
Query 301 AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR 360
AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR
Sbjct 301 AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR 360
Query 361 AARVAVRWLRAVALDRIGDVEAAERELL 388
AARVAVRWLRAVALDRIGDVEAAERELL
Sbjct 361 AARVAVRWLRAVALDRIGDVEAAERELL 388
>gi|289749112|ref|ZP_06508490.1| hypothetical protein TBDG_03748 [Mycobacterium tuberculosis T92]
gi|289689699|gb|EFD57128.1| hypothetical protein TBDG_03748 [Mycobacterium tuberculosis T92]
Length=378
Score = 709 bits (1831), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/370 (99%), Positives = 370/370 (100%), Gaps = 0/370 (0%)
Query 1 VAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
+AEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR
Sbjct 1 MAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
Query 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 120
WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG
Sbjct 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 120
Query 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD
Sbjct 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
Query 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG
Sbjct 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
Query 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA 300
GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA
Sbjct 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA 300
Query 301 AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR 360
AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR
Sbjct 301 AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR 360
Query 361 AARVAVRWLR 370
AARVAVRWLR
Sbjct 361 AARVAVRWLR 370
>gi|289760736|ref|ZP_06520114.1| conserved hypothetical protein [Mycobacterium tuberculosis GM
1503]
gi|289708242|gb|EFD72258.1| conserved hypothetical protein [Mycobacterium tuberculosis GM
1503]
Length=331
Score = 648 bits (1671), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 330/331 (99%), Positives = 331/331 (100%), Gaps = 0/331 (0%)
Query 525 VLDAVLFEGGAFAEFLEVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRT 584
+LDAVLFEGGAFAEFLEVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRT
Sbjct 1 MLDAVLFEGGAFAEFLEVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRT 60
Query 585 GDTHEVHERAASRQLRAGQLICARPVPAGDTMVFFGGIEPVALHERAVLIELLDDEPDPV 644
GDTHEVHERAASRQLRAGQLICARPVPAGDTMVFFGGIEPVALHERAVLIELLDDEPDPV
Sbjct 61 GDTHEVHERAASRQLRAGQLICARPVPAGDTMVFFGGIEPVALHERAVLIELLDDEPDPV 120
Query 645 TLVAQLSRRFAPPTLVNTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEH 704
TLVAQLSRRFAPPTLVNTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEH
Sbjct 121 TLVAQLSRRFAPPTLVNTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEH 180
Query 705 VTNDGMLRVRATLVLDGDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREA 764
VTNDGMLRVRATLVLDGDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREA
Sbjct 181 VTNDGMLRVRATLVLDGDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREA 240
Query 765 AALAEQMPVTGAGAPDPDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRR 824
AALAEQMPVTGAGAPDPDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRR
Sbjct 241 AALAEQMPVTGAGAPDPDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRR 300
Query 825 ADLIKLLDTFPAGAGARGGMDADRLRTALGL 855
ADLIKLLDTFPAGAGARGGMDADRLRTALGL
Sbjct 301 ADLIKLLDTFPAGAGARGGMDADRLRTALGL 331
>gi|306781550|ref|ZP_07419887.1| hypothetical protein TMBG_03471 [Mycobacterium tuberculosis SUMu002]
gi|308325711|gb|EFP14562.1| hypothetical protein TMBG_03471 [Mycobacterium tuberculosis SUMu002]
Length=328
Score = 630 bits (1625), Expect = 3e-178, Method: Compositional matrix adjust.
Identities = 325/325 (100%), Positives = 325/325 (100%), Gaps = 0/325 (0%)
Query 1 VAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 60
VAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR
Sbjct 4 VAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDR 63
Query 61 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 120
WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG
Sbjct 64 WVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAG 123
Query 121 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 180
YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD
Sbjct 124 YDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGAD 183
Query 181 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 240
TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG
Sbjct 184 TSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPG 243
Query 241 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA 300
GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA
Sbjct 244 GFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATA 303
Query 301 AETATETGSDSLVDLLGDIGAALAD 325
AETATETGSDSLVDLLGDIGAALAD
Sbjct 304 AETATETGSDSLVDLLGDIGAALAD 328
>gi|333989271|ref|YP_004521885.1| hypothetical protein JDM601_0631 [Mycobacterium sp. JDM601]
gi|333485239|gb|AEF34631.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=508
Score = 514 bits (1324), Expect = 3e-143, Method: Compositional matrix adjust.
Identities = 285/482 (60%), Positives = 337/482 (70%), Gaps = 4/482 (0%)
Query 6 DATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDRWVWLP 65
DA +A IL EHGP S+DDIA RL ++G+ADPD V+ L E PAR L D RWVWLP
Sbjct 6 DAAPTLAAILTEHGPASQDDIADRLHEAGIADPDTVIDELLDEFSCPARPLPDGRWVWLP 65
Query 66 TLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAGYDEEL 125
+LAGRVFTHRL A+E HD+L V+PDLDPIT LC+ ++Y LADGS +VL YD+EL
Sbjct 66 AVLAGRVFTHRLTAEELTHDLLAVSPDLDPITELCD-DDYPELADGSPVSLVLPRYDDEL 124
Query 126 LERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGADTSVGA 185
LE RGIP E +D GALLL PGTLA LG AAGDL+G+RLT G+ LE + TA D ++G
Sbjct 125 LEERGIPLELVDEPGALLLAPGTLAGLGLAAGDLLGMRLTDKGIALEPV-TATMDPTLGD 183
Query 186 RLAELVDPDE-PAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPGGFNF 244
RLA LVD DE P+ F + VW AC DPA F P+APL EI D HGL DWLAP GF+F
Sbjct 184 RLAALVDVDESPSRFASVVWAACAVDPALFNTPLAPLGEIADAHGLARSGDWLAPAGFDF 243
Query 245 DAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATAAETA 304
D WRFE +C LA + LD +DA L TL+++H M+ + + P + A A T
Sbjct 244 DRWRFERKCARLADEYGLDDDDAFVLTTLVEIHGQMARIFDMA-PGAEDDEEGAADAPTD 302
Query 305 TETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPRAARV 364
D+LG++GAALA+P+LA L+ E GAAALG+L E L+PKVPR+ARV
Sbjct 303 AWPADGPYADILGELGAALANPVLALSLMEEATHEGRRGAAALGILAESLQPKVPRSARV 362
Query 365 AVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLALLRRA 424
A WL+A+A + +GDV RELLAAES D +WPL LL LARIASDRGDAE GL LLRRA
Sbjct 363 ATHWLQAMACEGLGDVAGCARELLAAESSDPDWPLALLSLARIASDRGDAEAGLGLLRRA 422
Query 425 GTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWLYAKA 484
G DHPLVRLLE+HRAQPR D+GRNE CWCGSGRKYKKCHLG E L L ERV WLYAKA
Sbjct 423 GVGSDHPLVRLLEQHRAQPRTDVGRNEPCWCGSGRKYKKCHLGNEQLSLTERVRWLYAKA 482
Query 485 SQ 486
Q
Sbjct 483 HQ 484
>gi|308374057|ref|ZP_07667700.1| hypothetical protein TMFG_03279 [Mycobacterium tuberculosis SUMu006]
gi|308343212|gb|EFP32063.1| hypothetical protein TMFG_03279 [Mycobacterium tuberculosis SUMu006]
Length=239
Score = 467 bits (1202), Expect = 4e-129, Method: Compositional matrix adjust.
Identities = 238/239 (99%), Positives = 239/239 (100%), Gaps = 0/239 (0%)
Query 617 VFFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSRRFAPPTLVNTEGDSLAICEASVRV 676
+FFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSRRFAPPTLVNTEGDSLAICEASVRV
Sbjct 1 MFFGGIEPVALHERAVLIELLDDEPDPVTLVAQLSRRFAPPTLVNTEGDSLAICEASVRV 60
Query 677 DDPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVRATLVLDGDTLRVETNSEPRMDR 736
DDPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVRATLVLDGDTLRVETNSEPRMDR
Sbjct 61 DDPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVRATLVLDGDTLRVETNSEPRMDR 120
Query 737 VLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAGAPDPDSPELAAALEEFIRD 796
VLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAGAPDPDSPELAAALEEFIRD
Sbjct 121 VLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAGAPDPDSPELAAALEEFIRD 180
Query 797 YETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTFPAGAGARGGMDADRLRTALGL 855
YETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTFPAGAGARGGMDADRLRTALGL
Sbjct 181 YETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTFPAGAGARGGMDADRLRTALGL 239
>gi|226359545|ref|YP_002777323.1| hypothetical protein ROP_01310 [Rhodococcus opacus B4]
gi|226238030|dbj|BAH48378.1| hypothetical protein [Rhodococcus opacus B4]
Length=832
Score = 458 bits (1179), Expect = 2e-126, Method: Compositional matrix adjust.
Identities = 347/859 (41%), Positives = 421/859 (50%), Gaps = 55/859 (6%)
Query 9 QAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPAR----QLVDDRWVWL 64
+A +L EHGPL D+ ARRL VA+ L + TE+ L D R L
Sbjct 17 EAAMSLLREHGPLHPDEWARRL----VAEGHGYLADMEELTEYIGHPRLGYLADGRSAAL 72
Query 65 PTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAGYDEE 124
LL GRV THRL E +L PDL P+ + + RI+ D++
Sbjct 73 DALLDGRVLTHRLTEMEISSGILDANPDLMPLLPFDDDDPAA-----GGLRILFRDLDDD 127
Query 125 LLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGADTS-V 183
+ + RG D ALLLEP L G GDLV + T L L A +
Sbjct 128 VFDERGALDADWPADAALLLEPDALT--GLRPGDLVALTFTGGVLRLTAAENPPAPAPDL 185
Query 184 GARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPGGFN 243
A L LV D P +W DDP+ T P PL E++ G + D +A GF+
Sbjct 186 TAALTGLVTEDRPEPLDGVIWQLMADDPSLLTAPTTPLGELITAAGYVCDGDDIAAAGFD 245
Query 244 FDAWRFENRCELLAFRHDL--DPNDAV-ALYTLIKLHETMSLLLEATDPDELPRDVLATA 300
F A R + +A H L D DAV A LI + LE T D+ V A A
Sbjct 246 FAAHRGKAHMATVAAAHHLTDDQTDAVLAFLALIGV-------LERTPDDQRAAAVDAVA 298
Query 301 AETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPR 360
A GD A LA P A E T G L +L + PR
Sbjct 299 AN-------------FGDRFAGLAHPNAARAAFGEAYATAHAGTDTLRSAAAVLRDRGPR 345
Query 361 AARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLAL 420
WL A + G AER A ++D W L LAR ASDRGDA R + L
Sbjct 346 RIAPTAHWLAGKAAELDGWTADAERHYERALAVDPNWDEALEALARFASDRGDAVRAIGL 405
Query 421 LRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWL 480
L R P+ LL+ R DLGRN+ CWCGSGRKYK CHLG L +R WL
Sbjct 406 LDRVEGAYREPMYDLLQSFLPVDRPDLGRNDRCWCGSGRKYKACHLGTAEHSLEQRAGWL 465
Query 481 YAKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFL 540
Y KA A +W LL +S + R +DD AL AL DPLV D V+FE GAFA F+
Sbjct 466 YQKAGSFAQGIEWRPLL--LSLAQIRSVHNDDPFALYHALDDPLVADVVMFECGAFARFV 523
Query 541 EVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLR 600
RG LLP DE LLA+QW L ERSV EVE V+PGEG+ +RDVRTGD HEV ER ASRQLR
Sbjct 524 AERGVLLPADELLLAQQWSLTERSVHEVETVRPGEGLTLRDVRTGDRHEVTERTASRQLR 583
Query 601 AGQLICARPVPAGDTMVFFGGIEPVALHERAVLIELLD-DEPDPVTLVAQLSRRFAPPTL 659
G CAR VPAG T FGGIEP+A +R LIELLD + DP LV LS RFAPP L
Sbjct 584 VGDFFCARVVPAGSTTQIFGGIEPIAPGQRGQLIELLDSNATDPEELVEFLSARFAPPRL 643
Query 660 VNTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVRATLVL 719
+ +G + C A + D GI+ L + D + RW N + + T
Sbjct 644 ITPDGHPMVACRAVFEIADTTGIRRKLSRRFGAADND---RWTWTEQNSVLGVLTLTRST 700
Query 720 DGDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAGAP 779
D L E +EPR + ++ + DP + + R P AA L Q GA
Sbjct 701 DRWELEAEAMNEPRFESLIDAVRAADPGGRLREQTRTP------AAELIAQTQDNGAHPH 754
Query 780 ---DPDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTFPA 836
DP+ PE+AA L+E IR YE WLD+ IPAL GHTPRQ A DPTRR +LI+LLD+FP
Sbjct 755 QPVDPEDPEIAATLDEHIRRYEQQWLDEAIPALGGHTPRQCAADPTRRDELIRLLDSFPQ 814
Query 837 GAGARGGMDADRLRTALGL 855
G M A RLR ALGL
Sbjct 815 -QDRPGAMSAHRLREALGL 832
>gi|226365319|ref|YP_002783102.1| hypothetical protein ROP_59100 [Rhodococcus opacus B4]
gi|226243809|dbj|BAH54157.1| hypothetical protein [Rhodococcus opacus B4]
Length=650
Score = 327 bits (838), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 251/653 (39%), Positives = 325/653 (50%), Gaps = 50/653 (7%)
Query 14 ILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLV-DDRWVWLPTLLAGRV 72
+L E GPLS+ ++ L D+G D ++ + E + P ++ DDR V L LLAGRV
Sbjct 35 LLRERGPLSDRELTVALADAGWGGVDELIEYVE-EFDAPLLGILPDDRKVALDVLLAGRV 93
Query 73 FTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAGYDEELLE----- 127
THRL A+E D+ V PD +YG L ++ + G++ L+
Sbjct 94 LTHRLTAEEIAADV--VEPD-----------DYGSLLHLASGEPDVDGFEVVFLDDEANE 140
Query 128 --RRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGADTSVGA 185
RG L L GTLA + GDL+ + T +G+ L+ +G AD A
Sbjct 141 LAARGGESANWSDDEVLALPRGTLAH--RSPGDLLAMIATDSGVRLDFVGGPVADAPELA 198
Query 186 RLAELVDPDEPAF-FPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPGGFNF 244
+ VW VDDPAAFT P PL +I++ G +AP GF+F
Sbjct 199 LRLTRRLRESVVIDLEEEVWHLLVDDPAAFTVPGLPLADIVEAAGFDRSGQLVAPRGFDF 258
Query 245 DAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATAAETA 304
+A+ + + + + A+A+ TL+ L ATA E
Sbjct 259 EAYGRDLMVGVYGDELGVPRDAALAVATLVSL---------------------ATALEED 297
Query 305 TETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPRAARV 364
E + ++ ALADP + E+ E D L L + L PR +
Sbjct 298 GEEDIQATFFARPELYTALADPAVMEVAAQELFDLDIDPEVLL-LTAQRLLASGPREVKA 356
Query 365 AVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLALLRRA 424
A W+ A + G AE A +D + L L DLAR ASDRGDA RGL+LL R
Sbjct 357 AASWIAGRATEMQGFPTQAEDHYGHALVLDGAFDLALFDLARFASDRGDAVRGLSLLNRM 416
Query 425 GTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWLYAKA 484
PL +LE + PR LGRN CWCGSGRKYK CHLG+ L+ER WLY KA
Sbjct 417 AAGDAEPLHAVLEHFQPTPRPGLGRNHPCWCGSGRKYKTCHLGKGDHALSERAAWLYQKA 476
Query 485 SQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFLEVRG 544
HA W + E + R + D AL AL DPLV D LFEGGAFA+F+E RG
Sbjct 477 KLHAQELGWRDQIVEYAEIRSEFWAGDA--ALFQALEDPLVTDVALFEGGAFADFVECRG 534
Query 545 SLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLRAGQL 604
LLP DE LA QW VERS+ EVE V+PGEG+ +RD+RTGD ++ E ASR L G +
Sbjct 535 DLLPPDEFALARQWQEVERSLHEVEEVRPGEGLTLRDLRTGDRRDIREVTASRMLHVGNM 594
Query 605 ICARPVPAGDTMVFFGGIEPVALHERAVLIELLDDE-PDPVTLVAQLSRRFAP 656
ICAR VPAGDT FGGIEP++ R L+ LDDE DP LV LS RFAP
Sbjct 595 ICARVVPAGDTWQIFGGIEPISQDRRPSLLAALDDETTDPADLVEILSERFAP 647
>gi|325676580|ref|ZP_08156258.1| tetratricopeptide repeat family protein [Rhodococcus equi ATCC
33707]
gi|325552758|gb|EGD22442.1| tetratricopeptide repeat family protein [Rhodococcus equi ATCC
33707]
Length=623
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 231/624 (38%), Positives = 310/624 (50%), Gaps = 40/624 (6%)
Query 6 DATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDRWVWLP 65
D T A L GP + +D A+RL+D+G + + L L D R L
Sbjct 9 DLTTAAIADLRASGPSTGEDWAQRLVDAGHGSLPEMTEFVELLDHPSVVLLADGRNAVLD 68
Query 66 TLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAGYDEEL 125
TLL GRVFTHRL E +L PDL P+ GR AD S R++ YD +
Sbjct 69 TLLEGRVFTHRLSGGEIESGLLHADPDLAPVVM----SALGR-ADESV-RVLFLDYDADE 122
Query 126 LERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGADTS--- 182
L GI DE G LL GT A G ++GDL+ V + A G + + +AG D +
Sbjct 123 LAALGIADEDFPDGPVLLF--GTEALQGFSSGDLIAVTVGAGGSL--ELSSAGGDVADVP 178
Query 183 -VGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPGG 241
+ RL +V D W DD A P PL E+++ G E D++A G
Sbjct 179 DMADRLDRIVGADNADNLETVAWQLLADDDALCATPTMPLGELIEAAGYECEGDYIAARG 238
Query 242 FNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATAA 301
F+FDA ++A H+L P++A A+ + ++L + D +LP +VLA
Sbjct 239 FDFDAHHLAAHIAMVAREHELHPDEASAVVSFVQLVGIVH-----DDELDLP-EVLARVC 292
Query 302 ETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPRA 361
E A D A L DP A ++ + AL + PR
Sbjct 293 EDA-------------DSVAGLEDPAAAAAVLDLVNAVEDDYIPALYTTASAVVGAGPRR 339
Query 362 ARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLALL 421
A+ + WL +A D +GDV AER A +MD EW L +LA+IASDRGDA+RGL+LL
Sbjct 340 AKASGHWLAGMAADTLGDVLEAERHFADAAAMDEEWTPALFELAQIASDRGDAQRGLSLL 399
Query 422 RRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWLY 481
R D L +L R +LGRN+ CWCGSGRKYK CHLG+ L +R WLY
Sbjct 400 GRIDGGQDERLYDVLTRFAPAEHPELGRNDKCWCGSGRKYKVCHLGKADATLDDRASWLY 459
Query 482 AKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFLE 541
KA+ +A S L+ + + R A DDEDA+A A +PLV+D LFEGG F F+
Sbjct 460 EKATLYAQSTVLFDLV--LGLAQRRAAHWDDEDAVARAFDEPLVIDTALFEGGLFRLFVA 517
Query 542 VRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLRA 601
RG LLP DER LA++WL V RSV EV V G +RD+ T ++ V + A +
Sbjct 518 RRGVLLPADERELADRWLRVRRSVHEVVSVD-GSTATLRDLATDESATVDDGARA----V 572
Query 602 GQLICARPVPAGDTMVFFGGIEPV 625
G+L+CAR VP G+ GG+E V
Sbjct 573 GELLCARVVPTGERTQILGGVEQV 596
>gi|312138728|ref|YP_004006064.1| hypothetical protein REQ_12860 [Rhodococcus equi 103S]
gi|311888067|emb|CBH47379.1| hypothetical protein REQ_12860 [Rhodococcus equi 103S]
Length=623
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 231/624 (38%), Positives = 311/624 (50%), Gaps = 40/624 (6%)
Query 6 DATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDRWVWLP 65
D T A L GP + +D A+RL+D+G + + L L D R L
Sbjct 9 DLTTAAIADLRASGPSTGEDWAQRLVDAGHGSLPEMTEFVELLDHPSVVLLADGRNAVLD 68
Query 66 TLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAGYDEEL 125
TLL GRVFTHRL E +L PDL P+ GR AD S R++ YD +
Sbjct 69 TLLEGRVFTHRLSGGEIESGLLHADPDLAPVVM----SALGR-ADESV-RVLFPDYDADE 122
Query 126 LERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGADTS--- 182
L GI DE G LL GT A G ++GDL+ V + A G + + +AG D +
Sbjct 123 LAALGIADEHFPDGPVLLF--GTEALQGFSSGDLIAVTVGAGGSL--ELSSAGGDVADVP 178
Query 183 -VGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPGG 241
+ RL +V D W DD A P PL E+++ G E D++A G
Sbjct 179 DMADRLDRIVGADNADNLETVAWQLLADDDALCATPTMPLGELIEAAGYECEGDYIAARG 238
Query 242 FNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATAA 301
F+FDA ++A H+L P++A A+ + ++L + + D +LP +VLA
Sbjct 239 FDFDAHHLAAHIAMVAREHELHPDEASAVVSFVQL-----VGIVHDDELDLP-EVLARVG 292
Query 302 ETATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPRA 361
E A D A L DP A ++ + AL + PR
Sbjct 293 EDA-------------DSVAGLEDPAAAAAVLDLVNAVEDDYIPALYTTASAVVGAGPRR 339
Query 362 ARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLALL 421
A+ + WL +A D +GDV AER A +MD EW L +LA+IASDRGDA+RGL+LL
Sbjct 340 AKASGHWLAGMAADTLGDVLEAERHFADAAAMDEEWTPALFELAQIASDRGDAQRGLSLL 399
Query 422 RRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWLY 481
R D L +L R +LGRN+ CWCGSGRKYK CHLG+ L +R WLY
Sbjct 400 GRIDGGRDERLYDVLTRFAPAEHPELGRNDKCWCGSGRKYKVCHLGKADATLDDRASWLY 459
Query 482 AKASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFLE 541
KA+ +A S L+ + + R A DDEDA+A A +PLV+D LFEGG F F+
Sbjct 460 EKATLYAQSTVLFDLV--LGLAQRRAAHWDDEDAVARAFDEPLVIDTALFEGGLFRLFVA 517
Query 542 VRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLRA 601
RG LLP DER LA++WL V RSV EV V G +RD+ T ++ V + A +
Sbjct 518 RRGVLLPADERELADRWLRVRRSVHEVVSVD-GSTATLRDLATDESATVDDGARA----V 572
Query 602 GQLICARPVPAGDTMVFFGGIEPV 625
G+L+CAR VP G+ GG+E V
Sbjct 573 GELLCARVVPTGERTQILGGVEQV 596
>gi|226308702|ref|YP_002768662.1| hypothetical protein RER_52150 [Rhodococcus erythropolis PR4]
gi|226187819|dbj|BAH35923.1| hypothetical protein RER_52150 [Rhodococcus erythropolis PR4]
Length=640
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 217/653 (34%), Positives = 325/653 (50%), Gaps = 36/653 (5%)
Query 9 QAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDRWVWLPTLL 68
+A +L EHGP + D L D+G D + + + L ++R+ L TL
Sbjct 13 EAAMELLREHGPQTAADWGTLLADAGHGTADDMAEFVEYVEDPLLGYLSEERYAALDTLF 72
Query 69 AGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAGYDEELLER 128
RV THRL E +L PDL + + ++ + +V G D++LL
Sbjct 73 EHRVLTHRLTEAEIKSGVLDANPDLMMLRVFLDRDD----DEIHGISVVHRGIDDDLLAD 128
Query 129 RGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTA-----GADTSV 183
RGI D LL+ TLA +AGDL+G+ + L L I G S+
Sbjct 129 RGIEDPEFPHDEGFLLDTDTLAE--CSAGDLIGLFVADRELTLCTIPEVLDPAPGFGESL 186
Query 184 GARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPGGFN 243
G LA++ A VW +D+P+ F +P APL E+L+ G + D++A GF+
Sbjct 187 GTVLADI----GADTLDAIVWQLMLDEPSLFRQPTAPLGEMLEAAGYVRDGDYVAVDGFD 242
Query 244 FDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATAAET 303
F+A+ NR +L R +L P +A ++ + + M+ EA + ++ A
Sbjct 243 FEAYHLANRARMLEVRENLHPEEAASVIAFVDV--VMAAKAEAVED-------VSDWARK 293
Query 304 ATETGSDSLVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPRAAR 363
+ + + AA A LL+EL ++V L + L K R +
Sbjct 294 QIKEDPQTFAGIAEPPAAAAALELLSELDDHDSV---------LHSVATALAEKGSRRVK 344
Query 364 VAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLALLRR 423
A WL A DR+G + AE A +D +W + DLA +A+DR D R LALL R
Sbjct 345 PAAHWLAGKASDRLGKILVAEASYETAHDLDPDWTPAIFDLALLAADRSDVTRALALLGR 404
Query 424 AGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWLYAK 483
L +L+R+ +LGRN+ CWCGSGRK+K CHLG+ + +R +WLY K
Sbjct 405 IEGGESEVLHEVLQRYAPAEHPELGRNDKCWCGSGRKFKVCHLGKSEVTFDDRANWLYVK 464
Query 484 ASQHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFLEVR 543
A + + ++ + +++ R +S E+AL A+ L +DA LF+ G FA F+E R
Sbjct 465 AQLFSRTPEYFDAVFDLALIRAEQFNS--EEALTLAVEGGLAVDAALFDAGIFAAFVERR 522
Query 544 GSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLRAGQ 603
G LLP DER LA+QWL + RSV++V +PG+ V + DVR G +V + S L G
Sbjct 523 GELLPTDERELADQWLSIPRSVYKVASTEPGQLVTLSDVRNGHLVKVTDEWGSVNLEPGT 582
Query 604 LICARPVPAGDTMVFFGGIEPVALHERAVLIELLDD-EPDPVTLVAQLSRRFA 655
L+CAR +PAG TM FGGIEP+A ++ LI+L++ + +P LV LSR A
Sbjct 583 LVCARVLPAGSTMRTFGGIEPLAAEDKRELIQLIESVDTEPGQLVEFLSRGLA 635
>gi|229488928|ref|ZP_04382794.1| tetratricopeptide repeat family protein [Rhodococcus erythropolis
SK121]
gi|229324432|gb|EEN90187.1| tetratricopeptide repeat family protein [Rhodococcus erythropolis
SK121]
Length=636
Score = 291 bits (746), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 214/651 (33%), Positives = 313/651 (49%), Gaps = 32/651 (4%)
Query 9 QAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDRWVWLPTLL 68
+A +L EHGP + D L D+G D + + + L ++R+ L TL
Sbjct 9 EAAMELLREHGPQTAADWGTLLADAGHGTADDMAEFVEYVEDPLLGYLSEERYAALDTLF 68
Query 69 AGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAGYDEELLER 128
RV THRL E +L PDL + + ++ + +V G D++LL
Sbjct 69 EHRVLTHRLTEAEIESGVLDANPDLMMLRVFLDRDD----DEIHGISVVHRGIDDDLLAD 124
Query 129 RGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERI-GTAGADTSVGARL 187
RGI D LL+ TLA +AGDL+G+ + L L I G L
Sbjct 125 RGIEDPEFPHDEGFLLDTDTLAE--CSAGDLIGLFVADRELTLCTIPEVLDPAPGFGEHL 182
Query 188 AELVDPDEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPGGFNFDAW 247
A ++ A VW +D+P+ F +P APL E+L+ G + D++A GF+F+A+
Sbjct 183 ATVLADIGADTLDAIVWQLMLDEPSLFRQPTAPLGEMLEASGYVRDGDYVAVDGFDFEAY 242
Query 248 RFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATAAETATET 307
NR +L R +L P +A ++ + DV+ A E
Sbjct 243 HLANRARMLEVRENLHPEEAASVIAFV--------------------DVVMAAKAEGVED 282
Query 308 GSDSLVDLLGDIGAALADPLLAELLVAETVGTDSG--GAAALGLLTEMLEPKVPRAARVA 365
SD + + A A A + L + L K R + A
Sbjct 283 VSDWARKQIKEDPQAFAGLAEPPAAAAALELLSELDDHDSVLHSVATALAEKGSRRVKPA 342
Query 366 VRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLALLRRAG 425
WL A DR+G + AAE A +D +W + DLA +A+DR D R LALL R
Sbjct 343 AHWLAGKASDRLGKILAAEASYETAHDLDPDWTPAIFDLALLAADRSDVTRALALLGRIE 402
Query 426 TEPDHPLVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWLYAKAS 485
L +L+R+ +LGRN+ CWCGSGRK+K CHLG+ + +R WLY KA
Sbjct 403 GGESEVLHEVLQRYAPAEHPELGRNDKCWCGSGRKFKVCHLGKSEVTFDDRASWLYVKAQ 462
Query 486 QHALSGDWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFLEVRGS 545
+ + ++ + +++ R +S E+AL A+ L +DA LF+ G FA F+E RG
Sbjct 463 LFSRTPEYFDAVFDLALIRAEQFNS--EEALTLAVEGGLAVDAALFDAGIFAAFVERRGE 520
Query 546 LLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLRAGQLI 605
LLP DER LA+QWL + RSV++V +PG+ V + DVR G +V + S L G L+
Sbjct 521 LLPADERELADQWLSIPRSVYKVASTEPGQLVTLSDVRNGHLVKVTDEWGSLNLEPGTLV 580
Query 606 CARPVPAGDTMVFFGGIEPVALHERAVLIELLDDE-PDPVTLVAQLSRRFA 655
CAR +PAG TM FGGIEP+A+ ++ LI+L++ E +P LV LSR A
Sbjct 581 CARVLPAGSTMRTFGGIEPLAVEDKKELIQLIESEDTEPGQLVEFLSRGLA 631
>gi|229489113|ref|ZP_04382979.1| tetratricopeptide repeat family protein [Rhodococcus erythropolis
SK121]
gi|229324617|gb|EEN90372.1| tetratricopeptide repeat family protein [Rhodococcus erythropolis
SK121]
Length=630
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 210/628 (34%), Positives = 299/628 (48%), Gaps = 39/628 (6%)
Query 14 ILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDRWVWLPTLLAGRVF 73
+L E GPLS ++ A L + G A A L +E L + ++V L T+L G VF
Sbjct 19 VLREQGPLSLEEWATHLEEYGTATELA--DVLEYLSEPMLGYLPNGKYVALDTVLEGLVF 76
Query 74 THRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAGYDEELLERRGIPD 133
THRL E D+L +PDL+PI + D A R+ LA +E ER P
Sbjct 77 THRLSEVEIASDILDASPDLEPILAFGD--------DDGAIRVALA--EEAASERGAAP- 125
Query 134 EAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGADTSVGARLAELVDP 193
++L GTL A GDLVG+ + L R+ + + L EL +
Sbjct 126 --CRSRRVVVLPAGTLGE--CADGDLVGLAVEDGTLAF-RLVEIEDEPDLAPALGELFEE 180
Query 194 DEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPGGFNFDAWRFENRC 253
D + W ++DP+ FT PVAPL +I + G HE + LA GF+FDA+ + R
Sbjct 181 DGVEALDSVCWQLLIEDPSLFTVPVAPLGQIFEVAGYEHERELLARRGFDFDAYDLQIRT 240
Query 254 ELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATAAETATETGSDSLV 313
L+A +DL ++AV+ + L + D D +A A DS V
Sbjct 241 ALVASTYDLTHDEAVSAVAFVDLADRGYTDAVIADFD------IADWAHRHVSAAPDSFV 294
Query 314 DLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPRAARVAVRWLRAVA 373
L D GAA+A + D A L L L + PR+ R A WL A
Sbjct 295 SL-ADPGAAVA------VFDLGFRNQDPVTDAILEALASELAERGPRSVRAAAHWLAGKA 347
Query 374 LDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLALLRRAGTEPDHPLV 433
+DR+G V AE A ++ W L +LA+ ASDRGDA R L+LL R + L
Sbjct 348 VDRLGRVLEAEAHYEKALLAESGWGPALFELAQFASDRGDATRALSLLGRIDGGTEENLY 407
Query 434 RLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWLYAKASQHALSGDW 493
+L+ +LGRN+ CWCGSGRKYK CHLG+ + WLY KA A + ++
Sbjct 408 AVLQDFVPADHPELGRNDKCWCGSGRKYKVCHLGKADESVKADGRWLYKKACLFAFASEF 467
Query 494 TGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFLEVRGSLLPDDERL 553
++ + + +++ +AA + D LD LFEGG FAEFL R LLP+ E +
Sbjct 468 VDIV--TGLDDLENENLSEDELIAATIFDGSALDVALFEGGIFAEFLARRSELLPEAEVV 525
Query 554 LAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLRAGQLICARPVPAG 613
A QWL + RSV+EV GV++ D + +T V A S + + G +I AR + A
Sbjct 526 TAAQWLGIRRSVYEVIETSD-TGVVLLDRGSKETVTV---AHSVEGKPGDVISARVLTA- 580
Query 614 DTMVFFGGIEPVALHERAV-LIELLDDE 640
T F G+ +A +A ++E+L E
Sbjct 581 PTGAFAVGVVVMATESQAARVLEVLASE 608
>gi|226308703|ref|YP_002768663.1| hypothetical protein RER_52160 [Rhodococcus erythropolis PR4]
gi|226187820|dbj|BAH35924.1| hypothetical protein RER_52160 [Rhodococcus erythropolis PR4]
Length=636
Score = 234 bits (597), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 202/629 (33%), Positives = 296/629 (48%), Gaps = 40/629 (6%)
Query 14 ILAEHGPLSEDDIARRLLDSGVADPDAVLRALRLETEWPARQLVDDRWVWLPTLLAGRVF 73
+L E GPLS ++ A L + G A A L +E L + ++V L T+L G VF
Sbjct 24 VLREQGPLSLEEWATHLEEYGTATELA--DVLEYLSEPMLGYLPNGKYVALDTVLEGLVF 81
Query 74 THRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAGYDEELLERRGIPD 133
THRL E D+L +PDL+PI + E+ A R+ LA EE RG D
Sbjct 82 THRLSELEIASDILDASPDLEPILAFGDGED-------GAIRVALA---EEAAVERG--D 129
Query 134 EAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAGLVLERIGTAGADTSVGARLAELVDP 193
++L GTL A GDL+G+ + L R+ + + L EL +
Sbjct 130 APFRSRRVVVLPAGTLGE--CADGDLIGLAVEDGRLAF-RLVEIEDEPDLAPALGELFEE 186
Query 194 DEPAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPGGFNFDAWRFENRC 253
D + W ++DP+ FT PVAPL EI + G HE + LA GF+FDA+ + R
Sbjct 187 DGVEALDSVCWQLLLEDPSLFTVPVAPLGEIFEAAGYEHERELLARRGFDFDAYDLQIRT 246
Query 254 ELLAFRHDLDPNDAVALYTLIKLHE--TMSLLLEATDPDELPRDVLATAAETATETGSDS 311
L+A +DL ++AVA + L + ++ D + R ++ A +
Sbjct 247 ALVASTYDLTHDEAVAAVAFVDLADRGYTDAVIADFDIVDWARRHVSAAPDFFVSLADPG 306
Query 312 LVDLLGDIGAALADPLLAELLVAETVGTDSGGAAALGLLTEMLEPKVPRAARVAVRWLRA 371
+ D+G DP+ +L A L L + PR+ R A WL
Sbjct 307 AAVAVFDLGFRNQDPVTDAILEA---------------LASELAERGPRSVRAAAHWLAG 351
Query 372 VALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGLALLRRAGTEPDHP 431
A+DR+G V AE A ++ W L +LA+ ASDRGDA R L+LL R +
Sbjct 352 KAVDRLGRVLEAEAHYEKALLAESGWGPALFELAQFASDRGDATRALSLLGRIDGGSEEN 411
Query 432 LVRLLERHRAQPRRDLGRNEACWCGSGRKYKKCHLGREALPLAERVDWLYAKASQHALSG 491
L +L+ +LGRN+ CWCGSGRKYK CHLG+ + WLY KA A +
Sbjct 412 LYAVLQDFVPADHPELGRNDKCWCGSGRKYKVCHLGKADESVKADGRWLYKKACLFAFAS 471
Query 492 DWTGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFLEVRGSLLPDDE 551
++ ++ + A+ +++ +AA + D LD LFEGG FA+FL R LLP+ E
Sbjct 472 EFVDVV--TGLDDLENANLSEDELIAATIFDGSALDVALFEGGIFADFLARRSELLPEAE 529
Query 552 RLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLRAGQLICARPVP 611
+ A QWL V RSV+EV GV++ D + +T V + G LI AR +
Sbjct 530 VVTAAQWLGVRRSVYEVIETSD-TGVVLLDRVSEETVTVGHPVEGKP---GDLISARLLT 585
Query 612 AGDTMVFFGGIEPVALHERAVLIELLDDE 640
A + G + + + A ++E+L E
Sbjct 586 APTGALAVGVVVMSSESQAARVLEVLASE 614
>gi|333989270|ref|YP_004521884.1| SecC motif-containing protein [Mycobacterium sp. JDM601]
gi|333485238|gb|AEF34630.1| SecC motif-containing protein [Mycobacterium sp. JDM601]
Length=174
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 103/173 (60%), Positives = 123/173 (72%), Gaps = 2/173 (1%)
Query 501 SYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFLEVRGSLLPDDERLLAEQWLL 560
YER RY + + +D +AAA DPL +D +LFEGGAFAEFLE+RG LLPDDER LAEQWL
Sbjct 4 GYERIRYFEDEVDDLVAAAKTDPLAIDTMLFEGGAFAEFLELRGELLPDDERQLAEQWLG 63
Query 561 VERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLRAGQLICARPVPAGDTMVFFG 620
V RSVFEVE V+PG GV VR+VRT + +V ER R L++GQLIC+R +P GD FFG
Sbjct 64 VPRSVFEVEQVRPGHGVTVRNVRTDEVFDVTER--RRALQSGQLICSRALPTGDGFAFFG 121
Query 621 GIEPVALHERAVLIELLDDEPDPVTLVAQLSRRFAPPTLVNTEGDSLAICEAS 673
GI+PVA ER L+ LLDDEPDP+ LV LSR AP L + GD L A+
Sbjct 122 GIDPVAPQERDELLALLDDEPDPMELVGFLSRWLAPSELDDEYGDPLGTARAN 174
>gi|111022811|ref|YP_705783.1| hypothetical protein RHA1_ro05848 [Rhodococcus jostii RHA1]
gi|110822341|gb|ABG97625.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=205
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/94 (63%), Positives = 67/94 (72%), Gaps = 1/94 (1%)
Query 564 SVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLRAGQLICARPVPAGDTMVFFGGIE 623
SV EVE V+PGEG+ +RD+RTGD H + ER ASRQL G ICAR VPAGDT FGGIE
Sbjct 109 SVHEVEEVRPGEGLTLRDLRTGDLHAIRERTASRQLHVGNPICARVVPAGDTWQIFGGIE 168
Query 624 PVALHERAVLIELLDDE-PDPVTLVAQLSRRFAP 656
P+A RA L+E+LDDE DP LV LS RF P
Sbjct 169 PIAPAHRAPLLEMLDDETTDPADLVVILSERFVP 202
>gi|111025241|ref|YP_707661.1| hypothetical protein RHA1_ro08459 [Rhodococcus jostii RHA1]
gi|110824220|gb|ABG99503.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=190
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 49/106 (47%), Positives = 66/106 (63%), Gaps = 3/106 (2%)
Query 730 SEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAGAPDPDSPELAAA 789
+EPR + ++ T+ DP + + R P E A A++ + DP PE+AAA
Sbjct 2 NEPRFESLVDTVAAADPGARLREQTRTP---AAELIAQAQENSFRPSQPVDPTEPEIAAA 58
Query 790 LEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTFP 835
L+E IR YE WLD+ IPAL GHTPR+ A DPTRR DLI+LLD++P
Sbjct 59 LDEHIRGYEQQWLDEAIPALGGHTPRECAADPTRRDDLIRLLDSYP 104
>gi|333920963|ref|YP_004494544.1| Tetratricopeptide repeat family protein [Amycolicicoccus subflavus
DQS3-9A1]
gi|333483184|gb|AEF41744.1| Tetratricopeptide repeat family protein [Amycolicicoccus subflavus
DQS3-9A1]
Length=538
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 148/572 (26%), Positives = 225/572 (40%), Gaps = 109/572 (19%)
Query 68 LAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEHEEYGRLADGSAARIVLAGYDEELLE 127
GRV TH L A E +D++ PD+ P E+G
Sbjct 5 FGGRVPTHVLSAAEIRNDVVVTIPDIAPFM------EFG--------------------- 37
Query 128 RRGIPDEAIDPGGALLLEPGTLATLGAAAGDLVGVRLTAAG---LVLERIGTAGADTSVG 184
E + L L GTLA+ A GD+VGV + AG L ++R +
Sbjct 38 ----GGECVT---HLYLRRGTLASF--APGDIVGVVTSHAGAEILRVDRDELTRPRADLR 88
Query 185 ARLAELVDPDE--PAFFPAAVWTACVDDPAAFTEPVAPLREILDQHGLTHEDDWLAPGGF 242
R+ V E PA + V C DDP P PL E+L ++ ++ + D + F
Sbjct 89 TRVRRFVRDREGAPAGLHSLVAYLCRDDPEFCRAPSLPLSELLARYRVSWDGDRVGLDQF 148
Query 243 NFDAWRFENRCELLAFRHDLDPNDAVALYTLIKLHETMSLLLEATDPDELPRDVLATAAE 302
+F+ +R + R E L H +D ++A A+ T+ H + L +L DV+ AA+
Sbjct 149 DFEKYRNDERIEFLVAAHGMDVDEAAAVATM---HNRVRGALGGLGTADLHSDVIRCAAK 205
Query 303 TATETGSDSLVDLLGDIGAALAD----PLLAELLVAETVGTDSGGAAALGLLTEMLEPKV 358
+ + G I L D + +LL G AA G+L +
Sbjct 206 VESAGAA------YGMIAPWLHDEAQRSRILDLLTVSVAQAPHGQAA--GVLAWVCG--- 254
Query 359 PRAARVAVRWLRAVALDRIGDVEAAERELLAAESMDTEWPLPLLDLARIASDRGDAERGL 418
A+ + D I + REL EW + A +A DRG+ +
Sbjct 255 ------AIAAAEGMTADVIAFAQRCIREL-------PEWGPGVEFAAEVAVDRGEFDEAQ 301
Query 419 ALLRRAGTEPDHPLVRLLERHRAQPRRDLGRNEACWCGS--GRKYKKCHLGREALPLAE- 475
+L PRR + + C S G ++ L A P++E
Sbjct 302 RILT------------------GLPRR---KRKDCLPHSVLGALMRELEL---APPISEI 337
Query 476 -RVDWLYAKASQHALSGDWTGLLAEVSYERFRYADSD-----DEDALAAALADPLVLDAV 529
LY+K + S D + +A + + DSD D+ L+ +ADPLV+ V
Sbjct 338 DTAKGLYSKLLRFTRS-DPSARMALIEVAQALRQDSDQVPLPDDSELSGLIADPLVVSCV 396
Query 530 LFEGGAFAEFLEVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHE 589
+ EGG FL RG LLP +ER L E+W V R +F V+ V + ++ GD
Sbjct 397 VVEGGMLHHFLRTRGGLLPSNERALLERWAGVRRRIFVVDAVGTSDNLLALTSLDGDVTR 456
Query 590 VHERAASR-QLRAGQLICARPVPA--GDTMVF 618
+ A R ++ G + A VP G + VF
Sbjct 457 CYVAAELRGKVVQGGSVLAWSVPLLRGASAVF 488
>gi|116624812|ref|YP_826968.1| SecC motif-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116227974|gb|ABJ86683.1| SEC-C motif domain protein [Candidatus Solibacter usitatus Ellin6076]
Length=890
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 126/458 (28%), Positives = 177/458 (39%), Gaps = 68/458 (14%)
Query 447 LGRNEACWCGSGRKYKKCHLGREALPLAERVDWLYAKASQHALSGDWTGLLAEVSYERFR 506
+GRN+ CWCGSG+KYKKCHL A A A +S+ L LA E
Sbjct 449 VGRNDPCWCGSGKKYKKCHL--RADEEAHLTGARPAASSEEPLPLRMLKGLARWHKE--- 503
Query 507 YADSDDEDALAAALADPLVLDAVLFEGGAFAEFL-------EVRGSLL------------ 547
AD + A+ D F+ AF +FL E R +L+
Sbjct 504 -ADRTRAQEMFFGTAEDKARDETEFD--AFVQFLLHDFRDAETRRTLIEHFLDEHGPRLS 560
Query 548 PDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLRAGQLICA 607
P D R +AE ++E+ V+ G G+ VRDV G T V + +SR+
Sbjct 561 PKD-RAVAESMRDSRFGLYEIFKVEKGRGIHVRDVFDGATFFVEDITSSRECVKDDCALL 619
Query 608 RPVPAGDTMVFFGGIEPVALHERAVLIELLDDE-----PDPVTLVAQLS----------- 651
R + G VA + E + E P S
Sbjct 620 RVELRDGRYMLSGNGTAVARELLGEMKEFVSAESKAAGKSPAEFARANSALLRRHYLELH 679
Query 652 -RRFAPPTLVNTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPR-------WIE 703
RRF +VN+EGD L A V D + AL + + V EEP + W+E
Sbjct 680 ARRFENLRVVNSEGDELEFWTAEYEVLDRPALILALRSLAELV--EEPSKDSAAHFGWME 737
Query 704 HVTNDGMLRVRATLVLDGDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTRE 763
+G V ++ + LR+ET S + L P + DR + +
Sbjct 738 --PGEGPRSVHGSIEVTETRLRLETTSLKYRELGRGMLEYNAPRLLKHLGDRLTSVDDLK 795
Query 764 AAALAEQMPVTGAGAPDP-DSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPT 822
+AL +GA P P S E AA++++ + ++W + +PAL G TPRQA
Sbjct 796 RSAL------SGARGPTPLPSEEERAAIQQYKAQHYSTWPNIALPALKGQTPRQAMRTNA 849
Query 823 RRADLIKLLDTF--PAGAGARGG---MDADRLRTALGL 855
R L LL AR G D + LR LGL
Sbjct 850 GREALRNLLRDMEHQEARSARSGEVPYDFNILRRDLGL 887
>gi|253699919|ref|YP_003021108.1| SEC-C motif domain protein [Geobacter sp. M21]
gi|251774769|gb|ACT17350.1| SEC-C motif domain protein [Geobacter sp. M21]
Length=457
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 115/445 (26%), Positives = 180/445 (41%), Gaps = 67/445 (15%)
Query 447 LGRNEACWCGSGRKYKKCHLGREALPLAER----------VDWL---YAKASQHALSGDW 493
+GRN+ C CGSG+K+KKC + +E R +DWL Y+ A+ ++
Sbjct 5 IGRNDFCPCGSGKKFKKCCMVKEQDAEVRRREEKTAVPRTLDWLSERYSNEVAEAVHAEF 64
Query 494 TGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFLEVRGSLLPDDERL 553
G L + +R D + + + + L+ D + EVRG + P E L
Sbjct 65 YGGLEDEELDRLNELSRDFQQMIFINVGEWLINDGSI----------EVRGQVTPVKEIL 114
Query 554 L----------AEQWL--LVERSV--FEVEHVQPGEGVIVRDV-RTGDTH-EVHERAASR 597
L A WL L ERS+ +EV V PGEG+ + D+ R + V ER ASR
Sbjct 115 LGPGGPLYTAAARNWLERLGERSLSLYEVVRVTPGEGIELIDLLRPAEPPVWVVERTASR 174
Query 598 QLRAGQLICARPVPAGDTMVFFGGIEPVALHE----RAVLIELL------DDEPDPVTLV 647
+ + R V +V G P E R +++++ DD+ L
Sbjct 175 TVVPHDIFGTRLVRTDSGLVMSGAAYPFTREEGLACRDHILQVMKMPGWTDDQFRDAALA 234
Query 648 --------AQLSRRFAPPTLVNTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPP 699
+ ++ R P + ++ G+++ + RV D ++ L D V+G+
Sbjct 235 MITTSWLSSLVAERPMPKLVDSSTGNAIMLTTDRYRVKDWVALEKVLAAQPD-VEGDRSE 293
Query 700 RWIEHVTND-GMLRVRATLVLDGDT-LRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRP 757
W+ + M R A+ G T L V + D L R+ A V++ R
Sbjct 294 GWVRFAQIEREMQRSLASCNPKGATSLEVFCRTVELADETRQWLERV--AAGVVEFKIRE 351
Query 758 LRNTREAAALAEQMPVTGAGAPDPDSPEL-AAALEEFIRDYETSWLDQPIPALDGHTPRQ 816
L + R A AG S EL L E +R+ +W ++PIPAL TPR
Sbjct 352 LVDPRSEKA----RDFAAAGPKKEPSLELDNEVLNELMRNIYANWTEEPIPALGNKTPRA 407
Query 817 AADDPTRRADLIKLLDTFPAGAGAR 841
A R +I LL ++ R
Sbjct 408 AIKTEKGRRAVIDLLHSYENNEARR 432
>gi|31790367|gb|AAP58624.1| hypothetical protein [uncultured Acidobacteria bacterium]
Length=517
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 127/516 (25%), Positives = 190/516 (37%), Gaps = 107/516 (20%)
Query 433 VRLLERHRAQPRRDL------GRNEACWCGSGRKYKKCHLGREALPLAERVDWLYAKASQ 486
+ L + QP + L GRN+ C CGSG+KYKKC L + E D+ Y + Q
Sbjct 15 FQFLLSTQVQPSKCLSTMIKTGRNDPCPCGSGKKYKKCCLVPD-----EDSDFQYRRFRQ 69
Query 487 HALSGDWTGLLAEV------------------SYERFRYADSDDEDALAAALADPLVLDA 528
+GL+ ++ + + + D D L P L
Sbjct 70 IH-----SGLIPKLMTFAFEIIEAEVVEEAWKEFNDYEAVEDFDPDGPLNVLFMPWFLFN 124
Query 529 VLFE----GGAFAE-------FLEVRGSLLPDDERLLAEQWLLVERSVFEVEHVQPGEGV 577
+ E G E FL R + + DE +L + +++E V+PG G+
Sbjct 125 WIIELKPAGRTRVEETTIAELFLLDRKADISADEEMLLRSSIRCPYTLYEAVEVRPGVGM 184
Query 578 IVRDVRTGDTHEVHERAASRQLRAGQLI-CA------------------RPVPAGDTMVF 618
+ D+ TH V E +AS L+ G++I CA RP G+
Sbjct 185 TLFDLLRRITHVVVEHSASETLKRGEIIYCATTQVAGFRSNVGMGPYALRPTAKGEVFAL 244
Query 619 FGGIEPVALHERAVLIELLDDEPDPVTL-VAQLSRRFAPPTLVNTEGDSLAICEASVRVD 677
I E L + E D L + L FAPP+L NT+GD +
Sbjct 245 RKWIVNGIGSEEIRTEHLHEFEQDIRGLYLDTLKGMFAPPSLANTDGDPFLPQKLYF--- 301
Query 678 DPAGIQGALDGVYDRVDGEEPPRWIEHVTNDGMLRVRATL-------------------- 717
D A G+ G+E +E T D L V+A +
Sbjct 302 DLKSADLAFQGLKSLAGGDEND-LLEQATLDNGLIVKAEIPWLGGSEEARSRLGGPVLLG 360
Query 718 --VLDGDTLRVETNSEPRM------------DRVLATLTRLDPAMTVLDDDRRPLRNTRE 763
+D + L VE NS+ R D T ++P + +++
Sbjct 361 LLKIDQERLIVEVNSKQRAELIRGLVEDRLGDTATYKTTLIEPMESRVNEMWNAAAAGSS 420
Query 764 AAALAEQMPVTGAGAPDPDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTR 823
+++ E T + D PE+ A +EE R + SW D P+PAL+ TPR+AA
Sbjct 421 SSSDPEDRQHTDLSSYDQTQPEIMAMMEEVARQHWESWFDLPVPALNDMTPREAAQTEEG 480
Query 824 RADLIKLL----DTFPAGAGARGGMDADRLRTALGL 855
R L LL +T A D LR LG+
Sbjct 481 RELLESLLLFYENTQSDSAANVLNADIPALRRELGM 516
>gi|316932200|ref|YP_004107182.1| hypothetical protein Rpdx1_0815 [Rhodopseudomonas palustris DX-1]
gi|315599914|gb|ADU42449.1| hypothetical protein Rpdx1_0815 [Rhodopseudomonas palustris DX-1]
Length=472
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 98/356 (28%), Positives = 139/356 (40%), Gaps = 93/356 (26%)
Query 564 SVFEVEHVQPGEGVIVRD-VRTGDTHEVHERAASRQLRAGQLICARPVPAGDTMVFFGGI 622
S++EV V RD +R G+ V ER+A++ L+ I AR V G + GG+
Sbjct 106 SLYEVSDVVRDTSFRARDLIRGGEPVLVSERSATQTLKCWDRIAARIVQVGSKVQISGGV 165
Query 623 EPVALHERAVLI----------------------ELLDDEPDP------------VTLVA 648
P LI E DD+ D + LVA
Sbjct 166 LPFEREVAEALITAFNQLGTLSIEEQRELAEEAGEEFDDDFDGEAALAALAPAERLRLVA 225
Query 649 QLSRRF------------APPTLVNTEGDSLAICEASVRVDDPAGIQG------------ 684
+ F P L N EGD L +CE S R+ GI G
Sbjct 226 PMFSSFWLIDAIDRIESARLPELRNAEGDELLLCEVSFRL--GTGITGDEIVRCLQARPE 283
Query 685 -------ALDGVYDRVDGEEPPR--------WIEHVTNDGMLRVRATLVLDGDTLRVETN 729
+ V R G P E + +DGML++ L+L+ DTL + N
Sbjct 284 FRPTSATSWSWVGQRERGGTAPDDDSPDETLMFETLQDDGMLQL-GELLLEDDTLVLCVN 342
Query 730 SEPRMDRVLATLTR-LDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAGAPDPDS---PE 785
S+ R DR A LT L P + + PL T A+ E + + A P PD E
Sbjct 343 SQQRCDRGCALLTEILGPRVGL------PLIRTE---AVEEMLESSRAAMPTPDEIPEHE 393
Query 786 LAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRA---DLIKLLDTFPAGA 838
A + + + + LD+P+ LDG TPRQA +D + RA D +KL++ A +
Sbjct 394 RRAVVHDHLERHYRETLDRPVAMLDGQTPRQAVEDESGRAKVVDWLKLIENRTAKS 449
>gi|338531363|ref|YP_004664697.1| hypothetical protein LILAB_08540 [Myxococcus fulvus HW-1]
gi|337257459|gb|AEI63619.1| hypothetical protein LILAB_08540 [Myxococcus fulvus HW-1]
Length=901
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 84/306 (28%), Positives = 135/306 (45%), Gaps = 39/306 (12%)
Query 554 LAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLRAGQLICARPVPAG 613
LA W SVFEVE V+ G+G+ +RD+ + EV ER+ + Q+ L+ +PA
Sbjct 89 LAASWC----SVFEVEEVRLGQGLRLRDLVLDEVLEVRERSLTTQVTRHDLVAGWVMPAE 144
Query 614 DTMVFFGGIE--PVALHERAVL-----IELLDDEPDPVTLVAQLSRRFAP---------- 656
D + G I P +L + V+ L + V + +RR AP
Sbjct 145 DHLELLGAIMAVPRSLRQHVVIAARQAFAALQPPAEDVAGRRRQARRLAPLLFTRVLELF 204
Query 657 ---PTLVNTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEH----VTNDG 709
L+N +G+ L +C A V PA ++ L ++R E R+I V+
Sbjct 205 TADQPLLNADGEPLRLCTARFHVRHPAKVEARLRQ-HERFLREGEGRYIWEGPAPVSPVS 263
Query 710 MLRVRATLVLDGDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDR-RPLRNTREAAALA 768
V L + G++L ++T+S R+++ A L L A ++D P+++T+ A
Sbjct 264 DPVVWGILTMKGESLTLDTHSAQRLEKGKAVLAELLGAEAEHEEDTLGPVQSTQGAPG-- 321
Query 769 EQMPVTGAGAPDPDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLI 828
P AGAP PELA L E I + L + +PA G + P RA ++
Sbjct 322 ---PSLPAGAP----PELADTLAELIAQRARAELVRGVPAWGGKSASDLVRTPEGRARVL 374
Query 829 KLLDTF 834
+ L +
Sbjct 375 EWLKDW 380
>gi|153006122|ref|YP_001380447.1| SecC motif-containing protein [Anaeromyxobacter sp. Fw109-5]
gi|152029695|gb|ABS27463.1| SEC-C motif domain protein [Anaeromyxobacter sp. Fw109-5]
Length=669
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 115/431 (27%), Positives = 163/431 (38%), Gaps = 70/431 (16%)
Query 449 RNEACWCGSGRKYKKCHL-------------GREALPLAERVDWLYAKASQHALSGDWTG 495
RN C CGSG+KYKKCHL G EA L + H L T
Sbjct 206 RNAPCPCGSGKKYKKCHLAEEEAREAAARGTGLEAEEARAHARRLTERDPIHGLDERITA 265
Query 496 LLAEVSYERFRYADSDDED-ALAAALADPLVLDAVL------FEG----GAFAEFLEVRG 544
++ R R+ D D AL A D A+L + G A +LE RG
Sbjct 266 --DALALARRRWGREFDPDGALLAIGLDYQSTQALLGWSSGHYRGPDDRTALDLYLEERG 323
Query 545 SLLPDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLRAGQL 604
L + R L E S EV +PG + +RD+ G V E++ASR +R +
Sbjct 324 RALDAEGRALVEAQRRAWFSYHEVVSAEPGRTITLRDLLAGGERTVEEKSASRTVRPRDV 383
Query 605 ICARPVPAGDTMVFFGG-IEPVALHE------------RAVLIELLDDEPDPVTLVAQLS 651
+ AR + G + G + P+ E R ++ D+ T L
Sbjct 384 LLARIIDLGSRAILAGCYLRPLPPREGDEARRRLRSAVRVRAAKVPADKLREATAGGTLF 443
Query 652 RRFAP----------PTLVNTEGDSLAIC--EASVRVDDPAGIQGAL----DGVYDR--V 693
R + P L NT+G L + V + AL D D V
Sbjct 444 RIWLEIVDAADARPLPNLQNTDGADLILTVDRFDVAAAKAHEVVAALLDLPDARRDEGGV 503
Query 694 DGEEPPRWIEHVTNDGML--RVRATLVLDGDTLRVETNSEPRMDRVLATLTRLDPAMTVL 751
DG ++ G+L + +L+G LR+ETNS R DR+ ++ A+
Sbjct 504 DGAITVSFVREGNAKGVLPTTLIGRAILEGSLLRLETNSMQRADRLRRLVSERLGALA-- 561
Query 752 DDDRRPLRNTREAAALAEQMPVTGAGAPDPDSPELAAALEEFIR----DYETSWLDQPIP 807
R A +A G A + + E +R ++ SWLD+ IP
Sbjct 562 -----SFRIREHADPVAHLAEGAGRSARPAAPEPMPPEVLEVVRRMQAEHYRSWLDEEIP 616
Query 808 ALDGHTPRQAA 818
AL G TPR+AA
Sbjct 617 ALGGLTPREAA 627
>gi|254451191|ref|ZP_05064628.1| conserved hypothetical protein [Octadecabacter antarcticus 238]
gi|254452834|ref|ZP_05066271.1| conserved hypothetical protein [Octadecabacter antarcticus 238]
gi|198265597|gb|EDY89867.1| conserved hypothetical protein [Octadecabacter antarcticus 238]
gi|198267240|gb|EDY91510.1| conserved hypothetical protein [Octadecabacter antarcticus 238]
Length=454
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 80/318 (26%), Positives = 129/318 (41%), Gaps = 55/318 (17%)
Query 564 SVFEVEHVQPGEGVIVRDVRTGDTHEV--HERAASRQLRAGQLICARPVPAGDTMVFFGG 621
S++EV +V+ GE ++++D+ GD V E++A+R L+ I R + GDT V G
Sbjct 115 SLYEVSNVKLGESMVLKDL-VGDRERVTVFEKSATRSLKQWDRIAVRVIAEGDTHVISGA 173
Query 622 IEPVALHERAVLIE--------------------LLDDEP--DPVTLVAQLSRRFAP--P 657
+ + L E L+D P L L + + P
Sbjct 174 LLAFSAEAVEFLFEGLRAAMKLKGNAPLQLTTRQLMDCAPVFTSAWLFTTLPKAMSSGIP 233
Query 658 TLVNTEGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEP--PRWIEHV-------TND 708
L N++GD + + R+ P +Q + D V G P PR+ +
Sbjct 234 ELCNSDGDDVMFHDLRFRLA-PGVLQKEIAACLDDVKGFVPEGPRFWNWLALRNTPKKGS 292
Query 709 GML--------RVRATLVLDGDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRN 760
GM+ V TL L G +L V+ NS R +++ A + +RPL
Sbjct 293 GMMLDTEMTGRTVLGTLELKGKSLLVQVNSAARAEKIAALVIEATGKRL-----KRPLTA 347
Query 761 TREA-AALAEQMPVTGAGAPDPDSPELAAAL-EEFIRDYETSWLDQPIPALDGHTPRQAA 818
R ++E+ T D P + + +++ + LD P+PALDG +PRQA
Sbjct 348 IRTVEQVMSEERAETSLEGADEIPPHIEKQITHDYMDKHYRETLDAPLPALDGKSPRQAV 407
Query 819 DDPTRR---ADLIKLLDT 833
R D +KLL+
Sbjct 408 RSAAGREKVVDWLKLLEN 425
>gi|257094679|ref|YP_003168320.1| SEC-C motif domain-containing protein [Candidatus Accumulibacter
phosphatis clade IIA str. UW-1]
gi|257047203|gb|ACV36391.1| SEC-C motif domain protein [Candidatus Accumulibacter phosphatis
clade IIA str. UW-1]
Length=469
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 122/466 (27%), Positives = 188/466 (41%), Gaps = 68/466 (14%)
Query 447 LGRNEACWCGSGRKYKKCHLGREA----------LPLAER-VDWLYAKASQHALSGDWTG 495
+GRN+ C CGSG+KYK+C A AER +DWL K +
Sbjct 5 IGRNDPCPCGSGKKYKQCCANSPADFVEPERKGHAGAAERAIDWLMNKHRKAVSVAITER 64
Query 496 LLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGAFAEFLEVRGSLLPDDERLLA 555
L E+S E +++D++ ++ + + +L EG E L V G P E LL
Sbjct 65 LFDELSPEEEEALNANDQETWSSIQRN--ATEWLLAEG----EIL-VHGEPRPVSEYLLG 117
Query 556 ----------EQWL--LVER--SVFEVEHVQPGEGVIVRDVRTGDTHE--VHERAASRQL 599
+W+ L ER +++V V PG+ + + D + V ER+ S+
Sbjct 118 PGGPLFTVDQRRWITQLAERPLRLYDVTDVVPGQQLTLCDSLDVEAPPIIVRERSGSQAA 177
Query 600 RAGQLICARPVP-------AGDTMVFFGGIEPVALHERAVLIELLD----DEPDPVTLVA 648
G I R + +G F P + L D D P ++ +
Sbjct 178 LLGVQIGVRIMAVDGHYELSGAIYAFSHLTGPAVAARIREAMHLFDGQGSDLPHLLSSII 237
Query 649 Q---LSRRFAP---PTLVNT-EGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRW 701
Q L++ FAP P + G+ + + RV D A + +L D V+G+ W
Sbjct 238 QRQWLTQFFAPLPMPAFRDAYSGEPMLLITDHYRVQDWAALTQSLSAQND-VEGDRDSAW 296
Query 702 IEHV-TNDGMLRVRATLVLDGDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRN 760
+ +G R AT+ ++ ++ + + A RL + R R
Sbjct 297 NRLIDCKEGQTRSVATINVEKSPNKITVFYKTQR---YADEGRLWFESVAGNAVRFLSRE 353
Query 761 TREAAALAEQMPV----TGAGAPDPDSPE-LAAALEEFIRDYETSWLDQPIPALDGHTPR 815
+ A L MP AGA SPE LA +E +R+ W D+PIPAL G TPR
Sbjct 354 LSDPAGLLRSMPAGQRAKPAGAGLDLSPEALAEVVESTLREMYAKWSDEPIPALAGKTPR 413
Query 816 QAADDPT---RRADLIKLLDTFPAGAGARGG---MDADRLRTALGL 855
QA + P R LI++ + A+ G + D L ALG+
Sbjct 414 QAINTPAGLERVKGLIRMYEASEKRQAAQQGRRTISFDFLWQALGI 459
>gi|148655739|ref|YP_001275944.1| SecC motif-containing protein [Roseiflexus sp. RS-1]
gi|148567849|gb|ABQ89994.1| SEC-C motif domain protein [Roseiflexus sp. RS-1]
Length=286
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 61/193 (32%), Positives = 87/193 (46%), Gaps = 24/193 (12%)
Query 447 LGRNEACWCGSGRKYKKCHLGREALPLAER------VDWLYAK----------ASQHALS 490
+GRN+ C CGSG+KYK+C L RE AE+ VD L K +AL
Sbjct 4 VGRNDPCPCGSGKKYKQCCLPREEAARAEQLRLRRSVDTLLPKIIDAARAIPEVVPNALQ 63
Query 491 GDWTGLLAEVSYERFRYADSDD-EDALAAALADPLVLDAVLFEGGAFAEFLEVRGSLL-- 547
W G Y + A+ DD ED A L D L +G E L S L
Sbjct 64 RYWNG-----KYAPEQLAELDDLEDRGADRFLTWLAFDYRLDDGQTLVERLAADDSALDL 118
Query 548 PDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLRAGQLICA 607
+ ER L QW V + V+ ++ G+ + V D+ + ++ + + AASR+L G ++
Sbjct 119 SEPERQLLPQWAGVGLRAWVVDTIRKGQEIEVHDLLSEQSYVIADSAASRRLATGDVVVG 178
Query 608 RPVPAGDTMVFFG 620
+PAG V G
Sbjct 179 HLLPAGAKRVIGG 191
>gi|156743381|ref|YP_001433510.1| SecC motif-containing protein [Roseiflexus castenholzii DSM 13941]
gi|156234709|gb|ABU59492.1| SEC-C motif domain protein [Roseiflexus castenholzii DSM 13941]
Length=284
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 62/193 (33%), Positives = 86/193 (45%), Gaps = 24/193 (12%)
Query 447 LGRNEACWCGSGRKYKKCHLGREALPLAER------VDWLYAKASQ----------HALS 490
+GRN+ C CGSG+KYK CHL E AE+ VD L K AL
Sbjct 4 IGRNDPCPCGSGKKYKHCHLPIEEAARAEQLRLRRAVDTLMPKVIDAARMAPEVVPDALQ 63
Query 491 GDWTGLLAEVSYERFRYADSDD-EDALAAALADPLVLDAVLFEGGAFAEFL-EVRGSL-L 547
W G Y + A+ DD E+ A L D +G E L E +L L
Sbjct 64 RFWNG-----KYATEQLAELDDLENRGADRFLTWLAFDYRFDDGRTLVERLAEDPVALDL 118
Query 548 PDDERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLRAGQLICA 607
+ ER L QW V F V+ V+ G+ + V D+ + + + + AASR+L G ++
Sbjct 119 SEPERQLLPQWTGVGLRAFVVDVVRKGQHIEVHDLLSEQPYAIADSAASRRLAPGDVVVG 178
Query 608 RPVPAGDTMVFFG 620
+PAG+ V G
Sbjct 179 HLLPAGEKQVIGG 191
>gi|146279553|ref|YP_001169711.1| hypothetical protein Rsph17025_3536 [Rhodobacter sphaeroides
ATCC 17025]
gi|145557794|gb|ABP72406.1| hypothetical protein Rsph17025_3536 [Rhodobacter sphaeroides
ATCC 17025]
Length=452
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 90/340 (27%), Positives = 132/340 (39%), Gaps = 76/340 (22%)
Query 564 SVFEVEHVQPGEGVIVRDVRTG-DTHEVHERAASRQLRAGQLICARPVPAGDTMVFFGGI 622
S+ EV V PG+ + +RD+ TG + V E++A+R L+ I AR VP D V GG+
Sbjct 107 SLHEVSDVVPGQSMALRDLLTGGEPVTVREKSATRSLKQWDRIVARVVPVRDHHVIAGGV 166
Query 623 EPVALHERAVLIELLDD------EPDPVTLVAQLSRRFAP-------------------P 657
P A +L L D +P V QL R AP P
Sbjct 167 LPFAAEAVEMLFGGLRDALRLRKTAEPRLTVDQL-RHCAPIFSGAWFFTHLPGLLNPQAP 225
Query 658 TLVNTEGDSLAICEASVRVDDPAGIQGALDGVYDRV-----DGEEPPRW----------- 701
L NT+G+ L E P QG + RV DG + W
Sbjct 226 HLTNTDGEELEFHELHFPF-APRVAQGQVAAALSRVPDLSRDGSKSWGWLARTKPAAGKK 284
Query 702 --------IEHVTNDGMLRVRATLVLDGDTLRVETNSEPRMDRVLATLTR-----LDPAM 748
+E + G V ++ + G L + NS+ R R A + L P +
Sbjct 285 KEDAPGLALETFSEGGT--VLGSMEMKGKALILRVNSKERAARGEAMIMAAAGDLLRPPL 342
Query 749 TVLDDDRRPLRNTREAAA----LAEQMPVTGAGAPDPDSPELA-AALEEFIRDYETSWLD 803
T + + +R+ R+A AE++P PELA L++ + + LD
Sbjct 343 TTIQTVEQAMRD-RDARGGPKDAAEEIP-----------PELARQILQDHLDRHYRDTLD 390
Query 804 QPIPALDGHTPRQAADDPTRRADLIKLLDTFPAGAGARGG 843
QPIP L G +PRQA + R ++ L + G
Sbjct 391 QPIPVLGGKSPRQAVRSASGRRKVVDWLKYLENSSAQNEG 430
>gi|309791557|ref|ZP_07686056.1| SecC motif-containing protein [Oscillochloris trichoides DG6]
gi|308226417|gb|EFO80146.1| SecC motif-containing protein [Oscillochloris trichoides DG6]
Length=309
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 53/184 (29%), Positives = 80/184 (44%), Gaps = 16/184 (8%)
Query 445 RDLGRNEACWCGSGRKYKKCHL------GREALPLAERVDWLYAKASQHALS--GDWTGL 496
+ LGRN+ C CGSGRKYK+CHL E L L + D L K + A S +
Sbjct 8 KKLGRNDPCHCGSGRKYKECHLIIEEAARSEQLLLRQAQDSLLPKIIEAAQSVPEQFPEA 67
Query 497 LAEVSYERFRYADSDD----EDALAAALADPLVLDAVLFEGGAFAEFLEV---RGSLLPD 549
A ++ + D ED A D +G E L G+ D
Sbjct 68 FARFWENKYTFEQMSDLDSVEDRGAERFLTWFAFDFRQQDGQTLIEQLNTAADAGTFEVD 127
Query 550 D-ERLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLRAGQLICAR 608
ER L EQW V + V ++ G+G+++RD+ +V + A+++L G+++
Sbjct 128 PYERRLLEQWRTVRLRPYIVTEIRKGKGMLLRDLLGEQEFDVTDYNAAKRLEVGEVVVGH 187
Query 609 PVPA 612
PA
Sbjct 188 LTPA 191
>gi|339483667|ref|YP_004695453.1| SEC-C motif domain-containing protein [Nitrosomonas sp. Is79A3]
gi|338805812|gb|AEJ02054.1| SEC-C motif domain protein [Nitrosomonas sp. Is79A3]
Length=459
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 106/439 (25%), Positives = 172/439 (40%), Gaps = 59/439 (13%)
Query 447 LGRNEACWCGSGRKYKKCHLGREALPLAER----------VDWL---YAKASQHALSGDW 493
+GRN+ C CGSG+KYK+C A + E+ + WL + KA A+
Sbjct 5 IGRNDPCPCGSGKKYKQCCADTSAAIVEEKKGHDGAVERALSWLMDKHRKAVHIAIEEMI 64
Query 494 TGLLAEVSYERFRYADSDDEDALAAALADPLVLDAVLFEGGA---FAEFLEVRGS-LLPD 549
L++ E D + + L+ + + G +E+L +G L
Sbjct 65 FDGLSDEEREILEAQDKQTWQGIQLNATEWLLAEGHILVKGEHRRVSEYLLGQGGPLFTV 124
Query 550 DERLLAEQWLLVER--SVFEVEHVQPGEGVIVRDVRTGDT--HEVHERAASRQLRAGQLI 605
D+R Q L +R ++EV V PG+ + + D + V+E++ S+ + G LI
Sbjct 125 DQRRWIAQ--LADRPLRLYEVTDVIPGKQMTLCDALNTEALPITVYEKSGSQASQIGMLI 182
Query 606 CARPVPAGDTMVFFGGIEPVALHERAVLI----ELLD-------DEPDPVTLVAQ---LS 651
R + G P + + LI E +D D PD ++ + + L
Sbjct 183 GLRIMEVDGHFELSGAGYPFSHLKAQDLIAQIHEAMDQFNKRQKDFPDFLSFMIRRKWLE 242
Query 652 RRFAP---PTLVNT-EGDSLAICEASVRVDDPAGIQGALDGVYDRVDGEEPPRWIEHV-T 706
+ +AP PT+++ G+ + + RV D + +L D V G+ W V
Sbjct 243 QFYAPMPMPTMMDAYSGEPMLLITDHYRVKDWEALTQSLSSQSD-VQGDRKSGWDRLVDC 301
Query 707 NDGMLRVRATLVLDGD----TLRVETNSEPRMDRVLATLTRLDPAMTVLDDDRRPLRNTR 762
DG R T+ ++ TL +T S R D + R
Sbjct 302 EDGATRATVTINIEKTANKITLFYKTQSYADKGRPWFEAIAGDAVQFI-------SRELS 354
Query 763 EAAALAEQMPVTGAGAP---DPDSPE--LAAALEEFIRDYETSWLDQPIPALDGHTPRQA 817
+ + MPV P +PD P A +E+ I +W D+ I AL G TPRQA
Sbjct 355 DPKGMMANMPVNQKAKPRAAEPDIPPEVYADIIEKTIYRVYANWADESIQALGGKTPRQA 414
Query 818 ADDPTRRADLIKLLDTFPA 836
P + LL ++ A
Sbjct 415 IKTPAGLERVKGLLRSYEA 433
>gi|333978065|ref|YP_004516010.1| SEC-C motif domain-containing protein [Desulfotomaculum kuznetsovii
DSM 6115]
gi|333821546|gb|AEG14209.1| SEC-C motif domain protein [Desulfotomaculum kuznetsovii DSM
6115]
Length=499
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 47/177 (27%), Positives = 78/177 (45%), Gaps = 13/177 (7%)
Query 453 CWCGSGRKYKKCHLGREALPLAERVDWLYAKASQHALSGDW--TGLLAE---VSYERF-- 505
C CGSG+ Y+KC E + E+ W A G++ + A+ + E++
Sbjct 6 CPCGSGKSYRKCCGVGEKVIFLEQYRWRRAGQELRRKLGEFADSQFFAQEALKAQEKYLS 65
Query 506 ----RYADSDDEDALAAALADPLVLDAVLFEGGAFAEFLEVRGSLLPDDERLLAEQWLLV 561
D DDE + + + D VL G E L ++ LLA+ W
Sbjct 66 CLDPELVDRDDEFTMERCF-EWFIFDYVLPNGSTIIETFRQNSDLSEREQTLLAD-WAAA 123
Query 562 ERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLRAGQLICARPVPAGDTMVF 618
S++EV V P +GV++RD+ +VH+ A+ +L+ G ++ R + GD F
Sbjct 124 RISLYEVLQVLPRKGVVLRDLLQKKELKVHDINAAVELQPGTILLMRILKVGDEYEF 180
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 29/93 (32%), Positives = 43/93 (47%), Gaps = 6/93 (6%)
Query 769 EQMPVTGAGAPDPDSPELAAALEEFIRD-YETSWLDQPIPALDGHTPRQA---ADDPTRR 824
E P D S +A + E I D Y W+D+P+PAL G TPR+A A+ R
Sbjct 253 EYFPSVNDDLFDRISARIAQQITEAILDEYYDRWIDKPVPALGGKTPREACRTAEGRARL 312
Query 825 ADLIKLLDTFPAGAGARG--GMDADRLRTALGL 855
++ + L+ +G D ++R LGL
Sbjct 313 EEMFRELELVETSRELKGEPHYDVQKVRRKLGL 345
>gi|108757805|ref|YP_628325.1| hypothetical protein MXAN_0042 [Myxococcus xanthus DK 1622]
gi|108461685|gb|ABF86870.1| hypothetical protein MXAN_0042 [Myxococcus xanthus DK 1622]
Length=891
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 81/310 (27%), Positives = 125/310 (41%), Gaps = 54/310 (17%)
Query 554 LAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLRAGQLICARPVPAG 613
LA W SVFEVE V+ G+G+ +RD+ + EV ER+ + Q+ LI + +P
Sbjct 89 LAASWC----SVFEVEEVRLGQGLRLRDLVLDEVLEVKERSLTTQVARYDLIASWVIPTE 144
Query 614 DTMVFFGGIEPV-----------ALHERAVLIELLDDEPDPVTLVAQLSRRFAP------ 656
D + GGI + A H A DD P + +RR AP
Sbjct 145 DHLELVGGIVAIPRPLREHVVVAARHAFATHQPPADDAPG----RRRQARRLAPFLFTRV 200
Query 657 -------PTLVNTEGDSLAICEASVRVDDPAGIQGAL---DGVYDRVDGEEPPRWIEHVT 706
L+N E + L +C A R+ PA ++ L DG +G H
Sbjct 201 LELLTTERPLLNFEDEPLRLCTARFRIRHPAKVEEHLRRHDGFTREGEG--------HYN 252
Query 707 NDG-MLRVRATLVLDGDTLRVETNSEPRMDRVLATLTRLDPAMTVLDDDR-RPLRNTREA 764
+G V +L ++G L + T+S R+++ A L L A ++D P+++ R
Sbjct 253 WEGPRAAVWGSLTVEGKVLVLTTHSAQRLEKGKALLEELLGAEAEHEEDTVGPVQSVRGE 312
Query 765 AALAEQMPVTGAGAPDPDSPELAAALEEFIRDYETSWLDQPIPALDGHTPRQAADDPTRR 824
A A PD P+LA A + L + IPA G + R
Sbjct 313 PARA---------LPDDAPPQLADAFALMLAQRAREELSRGIPAWGGRSASDLMRSTEGR 363
Query 825 ADLIKLLDTF 834
A +++ L +
Sbjct 364 AQVLEWLKDW 373
>gi|163848371|ref|YP_001636415.1| SecC motif-containing protein [Chloroflexus aurantiacus J-10-fl]
gi|222526294|ref|YP_002570765.1| SEC-C motif domain-containing protein [Chloroflexus sp. Y-400-fl]
gi|163669660|gb|ABY36026.1| SEC-C motif domain protein [Chloroflexus aurantiacus J-10-fl]
gi|222450173|gb|ACM54439.1| SEC-C motif domain protein [Chloroflexus sp. Y-400-fl]
Length=279
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/181 (29%), Positives = 78/181 (44%), Gaps = 15/181 (8%)
Query 447 LGRNEACWCGSGRKYKKCHLGRE------ALPLAERVDWLYAKASQHALSGDWTGL---L 497
LGRN+ C CGSGRKYK CHL E L L D L + A + L
Sbjct 9 LGRNDPCHCGSGRKYKDCHLRIEEEWRSQQLRLRNAQDQLLQRILAKATEAEAAELQTAF 68
Query 498 AEVSYERFRYADSDDEDALAAALADPLV----LDAVLFEGGAFAEFL--EVRGSLLPDDE 551
+R+++A + + AD + D +G E + +++ S L E
Sbjct 69 DRYWQQRYQFAQLAELNQREGYGADRFMVWFAFDYRRSDGQTLVEQMVHQMQESDLSPLE 128
Query 552 RLLAEQWLLVERSVFEVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLRAGQLICARPVP 611
R L W+ V ++VE + P G +RD+ TG+ + + AS +L ++I VP
Sbjct 129 RQLLPTWVNVRLRPYQVERLHPNAGATLRDLLTGEALILADSHASLRLELAEVIVGHLVP 188
Query 612 A 612
Sbjct 189 V 189
>gi|258512924|ref|YP_003189181.1| hypothetical protein APA01_40140 [Acetobacter pasteurianus IFO
3283-01]
gi|256634827|dbj|BAI00802.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-01]
gi|256637882|dbj|BAI03850.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-03]
6 more sequence titles
Length=456
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 84/342 (25%), Positives = 133/342 (39%), Gaps = 65/342 (19%)
Query 564 SVFEVEHVQPGEGVIVRDV-RTGDTHEVHERAASRQLRAGQLICARPVPAGDTMVFFGGI 622
S++EV ++PG+ ++ RD+ R GD VHE A+R L I AR V + + GG+
Sbjct 104 SLYEVSDIKPGQSLMARDLLRGGDPVLVHEGTATRTLEQWDRIAARLVLSDGKTILAGGL 163
Query 623 --------EPVALHERAVL-------------IELLDDEPDPVTL------VAQLSRRFA 655
E +A H VL + L + TL + ++R+
Sbjct 164 LAYSRGACEDLATHLYKVLRKRRGKAEFPKVDTQTLRELAPMFTLTWLFRTLEDMARQME 223
Query 656 PPTLVNTEGDSLA---ICEASVRVDDPAGIQGALDGVY---------------------D 691
P L N +G+ L +C + + LDG+
Sbjct 224 GPALFNGDGEDLVFHEVCFPLAKGVTQKTVADVLDGIMALRPENRSFWNWLKEPMSDPVR 283
Query 692 RVDGEEPPRWIEHVTNDGMLRVRATLVLDGDTLRVETNSEPRMDRVLATLTRLDPAMTVL 751
+ +E R++ +DG + V TL L G LR++ NS+ R +R A L + +
Sbjct 284 KAGRDENGRFLRTEMDDGAI-VLGTLDLKGRQLRLQVNSKERAERGRAMLQ-----VGLG 337
Query 752 DDDRRPLRNTREAAALAEQMPVTGAG-APDPDSP--ELAAALEEFIRDYETSWLDQPIPA 808
D P A E G +P+ P E A + + + + LD+P+PA
Sbjct 338 DLVHAPFTQIMTPAQAMEDRGTHGREVSPELQIPPEEEARIIGQMLERHYRQVLDEPVPA 397
Query 809 LDGHTPRQAADDPTRRADLIKLL----DTFPAGAGARGGMDA 846
L TPRQA + R + L +T G+ G M A
Sbjct 398 LGDLTPRQAVQTASGRKKVAIWLKDIENTTVRAQGSGGAMAA 439
>gi|163797760|ref|ZP_02191707.1| hypothetical protein BAL199_13940 [alpha proteobacterium BAL199]
gi|159176980|gb|EDP61544.1| hypothetical protein BAL199_13940 [alpha proteobacterium BAL199]
Length=468
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 86/372 (24%), Positives = 145/372 (39%), Gaps = 90/372 (24%)
Query 551 ERLLAEQWLLVER----SVFEVEHVQPGEGVIVRD-VRTGDTHEVHERAASRQLRAGQLI 605
E++ A+++L R S++EV + PG+ + VRD +R GD V E+ S I
Sbjct 92 EKVPAKRYLTAIRDSVISLYEVVDLDPGKAMTVRDMIRGGDPVTVEEKLGSESAARWDRI 151
Query 606 CARPVPAGDTMVFFGGIEPVALHERAVLIELLDD----------------------EPDP 643
AR V + F GG+ ++ + + + ++ P+
Sbjct 152 AARLVTVNNKPCFTGGMLLLSHEASSKFMAVFEETARVFRTKLRREAKKQGESPEISPEA 211
Query 644 VT----------------LVAQLSRRFAP-PTLVNTEGDSLAICEASVRVD-DPAGIQGA 685
V L L + AP P + NT+GD + E + D A +
Sbjct 212 VKALLLQSSGARLFTQAWLTDALGQINAPLPEMRNTDGDKIMFSEVRFPISGDEAKLVAV 271
Query 686 LDGVYDRVDGEEP--PRWIEH---------------------VTNDGMLRVRATLVLDGD 722
+DG+ D ++ P W H V + G + + V +G
Sbjct 272 IDGI-DYIERNAPVEASWTWHGRGSPSQRMAAKKREGLTFQSVDDSGRTSLGSIEVKNG- 329
Query 723 TLRVETNSEPRMDR---VLAT--LTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAG 777
L + TNS R ++ +LA+ + + P + +D R L ++ G+
Sbjct 330 ALLLSTNSRERAEKGRDLLASHLSSLIGPPLISHEDIDRALERSK------------GSQ 377
Query 778 APDPDS--PELAAAL-EEFIRDYETSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTF 834
+ D D PE+AA + ++ D+ LD P+P LDG +PRQAA RA +I+ L
Sbjct 378 SSDKDDIPPEIAAQIIHNYLDDHYRRTLDDPLPFLDGKSPRQAAKTKNGRAQVIEWLKRL 437
Query 835 PAGAGARGGMDA 846
R D+
Sbjct 438 ENSEHRRATTDS 449
>gi|172058433|ref|YP_001814893.1| preprotein translocase, SecA subunit [Exiguobacterium sibiricum
255-15]
gi|171990954|gb|ACB61876.1| preprotein translocase, SecA subunit [Exiguobacterium sibiricum
255-15]
Length=839
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 19/31 (62%), Positives = 23/31 (75%), Gaps = 1/31 (3%)
Query 440 RAQPRRDLGRNEACWCGSGRKYKKCHLGREA 470
R P +GRN+ CWCGSG+KYK CH GR+A
Sbjct 810 RKNPNEQIGRNDPCWCGSGKKYKNCH-GRQA 839
Lambda      K        H
0.319       0.137    0.408
Gapped
Lambda      K        H
0.267       0.0410   0.140
Effective search space used: 2083166670236
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40