BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2560
Length=325
Score E
Sequences producing significant alignments: (Bits) Value
gi|15609697|ref|NP_217076.1| hypothetical protein Rv2560 [Mycoba... 629 2e-178
gi|31793743|ref|NP_856236.1| hypothetical protein Mb2590 [Mycoba... 628 5e-178
gi|298526034|ref|ZP_07013443.1| conserved hypothetical protein [... 626 1e-177
gi|289448209|ref|ZP_06437953.1| proline and glycine rich membran... 626 2e-177
gi|15842098|ref|NP_337135.1| hypothetical protein MT2637 [Mycoba... 625 2e-177
gi|254551611|ref|ZP_05142058.1| putative proline and glycine ric... 474 1e-131
gi|308380402|ref|ZP_07669184.1| proline and glycine rich membran... 452 4e-125
gi|183982179|ref|YP_001850470.1| proline and glycine rich transm... 323 3e-86
gi|54289545|gb|AAV32079.1| putative membrane protein [Mycobacter... 323 3e-86
gi|118617370|ref|YP_905702.1| proline and glycine rich transmemb... 323 3e-86
gi|240170719|ref|ZP_04749378.1| proline and glycine rich transme... 204 2e-50
gi|108801112|ref|YP_641309.1| hypothetical protein Mmcs_4148 [My... 183 4e-44
gi|296170788|ref|ZP_06852360.1| proline and glycine rich transme... 182 8e-44
gi|126436950|ref|YP_001072641.1| hypothetical protein Mjls_4379 ... 171 1e-40
gi|118466180|ref|YP_882618.1| hypothetical protein MAV_3436 [Myc... 168 1e-39
gi|145222632|ref|YP_001133310.1| hypothetical protein Mflv_2044 ... 168 1e-39
gi|336461500|gb|EGO40368.1| putative integral membrane protein [... 161 2e-37
gi|254775882|ref|ZP_05217398.1| hypothetical protein MaviaA2_146... 159 7e-37
gi|342858621|ref|ZP_08715276.1| proline and glycine rich transme... 158 1e-36
gi|254821818|ref|ZP_05226819.1| hypothetical protein MintA_17932... 158 1e-36
gi|296140939|ref|YP_003648182.1| hypothetical protein Tpau_3258 ... 152 5e-35
gi|120405625|ref|YP_955454.1| hypothetical protein Mvan_4673 [My... 150 2e-34
gi|41407169|ref|NP_960005.1| hypothetical protein MAP1071c [Myco... 146 4e-33
gi|262200958|ref|YP_003272166.1| integral membrane protein-like ... 144 2e-32
gi|169627754|ref|YP_001701403.1| hypothetical protein MAB_0651c ... 140 3e-31
gi|296140940|ref|YP_003648183.1| hypothetical protein Tpau_3259 ... 120 3e-25
gi|343925912|ref|ZP_08765427.1| hypothetical protein GOALK_050_0... 118 1e-24
gi|326773565|ref|ZP_08232848.1| proline and glycine rich transme... 113 5e-23
gi|296130052|ref|YP_003637302.1| hypothetical protein Cfla_2212 ... 112 7e-23
gi|333918550|ref|YP_004492131.1| hypothetical protein AS9A_0879 ... 111 2e-22
gi|336320349|ref|YP_004600317.1| hypothetical protein Celgi_1230... 107 4e-21
gi|229820238|ref|YP_002881764.1| integral membrane protein [Beut... 105 7e-21
gi|269956984|ref|YP_003326773.1| hypothetical protein Xcel_2197 ... 101 2e-19
gi|226303835|ref|YP_002763793.1| hypothetical protein RER_03460 ... 98.2 1e-18
gi|344043844|gb|EGV39531.1| hypothetical protein CgS9114_12717 [... 95.9 8e-18
gi|145296517|ref|YP_001139338.1| hypothetical protein cgR_2428 [... 95.5 1e-17
gi|19553719|ref|NP_601721.1| hypothetical protein NCgl2434 [Cory... 94.7 2e-17
gi|54025618|ref|YP_119860.1| hypothetical protein nfa36480 [Noca... 94.4 2e-17
gi|226363515|ref|YP_002781297.1| hypothetical protein ROP_41050 ... 93.2 5e-17
gi|118468565|ref|YP_889514.1| hypothetical protein MSMEG_5268 [M... 92.8 6e-17
gi|332670859|ref|YP_004453867.1| integral membrane protein [Cell... 80.1 4e-13
gi|119714909|ref|YP_921874.1| hypothetical protein Noca_0661 [No... 79.7 5e-13
gi|111021155|ref|YP_704127.1| proline rich protein [Rhodococcus ... 79.3 7e-13
gi|334337464|ref|YP_004542616.1| proline rich protein [Isopteric... 76.6 5e-12
gi|256832776|ref|YP_003161503.1| integral membrane protein [Jone... 76.3 6e-12
gi|325068575|ref|ZP_08127248.1| hypothetical protein AoriK_12171... 74.3 2e-11
gi|312137830|ref|YP_004005166.1| integral membrane protein [Rhod... 73.2 6e-11
gi|23009702|ref|ZP_00050654.1| COG5473: Predicted integral membr... 72.8 7e-11
gi|325676070|ref|ZP_08155752.1| YjbE family integral membrane pr... 72.0 1e-10
gi|326382958|ref|ZP_08204648.1| proline and glycine rich transme... 70.5 4e-10
>gi|15609697|ref|NP_217076.1| hypothetical protein Rv2560 [Mycobacterium tuberculosis H37Rv]
gi|148662399|ref|YP_001283922.1| putative proline and glycine rich transmembrane protein [Mycobacterium
tuberculosis H37Ra]
gi|308232174|ref|ZP_07664019.1| proline and glycine rich membrane protein [Mycobacterium tuberculosis
SUMu001]
8 more sequence titles
Length=325
Score = 629 bits (1622), Expect = 2e-178, Method: Compositional matrix adjust.
Identities = 325/325 (100%), Positives = 325/325 (100%), Gaps = 0/325 (0%)
Query 1 MSQPPEHPGNPADPQGGNQGAGSYPPPGYGAPPPPPGYGPPPGTYLPPGYNAPPPPPGYG 60
MSQPPEHPGNPADPQGGNQGAGSYPPPGYGAPPPPPGYGPPPGTYLPPGYNAPPPPPGYG
Sbjct 1 MSQPPEHPGNPADPQGGNQGAGSYPPPGYGAPPPPPGYGPPPGTYLPPGYNAPPPPPGYG 60
Query 61 PPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGL 120
PPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGL
Sbjct 61 PPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGL 120
Query 121 VVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDI 180
VVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDI
Sbjct 121 VVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDI 180
Query 181 ADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRS 240
ADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRS
Sbjct 181 ADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRS 240
Query 241 TSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRK 300
TSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRK
Sbjct 241 TSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRK 300
Query 301 LSGGQVVEAVRPAPPVGWPPGPQLA 325
LSGGQVVEAVRPAPPVGWPPGPQLA
Sbjct 301 LSGGQVVEAVRPAPPVGWPPGPQLA 325
>gi|31793743|ref|NP_856236.1| hypothetical protein Mb2590 [Mycobacterium bovis AF2122/97]
gi|121638445|ref|YP_978669.1| putative proline and glycine rich transmembrane protein [Mycobacterium
bovis BCG str. Pasteur 1173P2]
gi|148823756|ref|YP_001288510.1| hypothetical protein TBFG_12581 [Mycobacterium tuberculosis F11]
57 more sequence titles
Length=325
Score = 628 bits (1619), Expect = 5e-178, Method: Compositional matrix adjust.
Identities = 324/325 (99%), Positives = 325/325 (100%), Gaps = 0/325 (0%)
Query 1 MSQPPEHPGNPADPQGGNQGAGSYPPPGYGAPPPPPGYGPPPGTYLPPGYNAPPPPPGYG 60
MSQPPEHPGNPADPQGGNQGAGSYPPPGYGAPPPPPGYGPPPGTYLPPGYNAPPPPPGYG
Sbjct 1 MSQPPEHPGNPADPQGGNQGAGSYPPPGYGAPPPPPGYGPPPGTYLPPGYNAPPPPPGYG 60
Query 61 PPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGL 120
PPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGL
Sbjct 61 PPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGL 120
Query 121 VVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDI 180
VVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDI
Sbjct 121 VVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDI 180
Query 181 ADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRS 240
ADGKPVTIATFFRPRNLGLVLVTGLLIVA+TFIGGLLCVIPGLIFGFVAQFAVAFAVDRS
Sbjct 181 ADGKPVTIATFFRPRNLGLVLVTGLLIVALTFIGGLLCVIPGLIFGFVAQFAVAFAVDRS 240
Query 241 TSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRK 300
TSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRK
Sbjct 241 TSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRK 300
Query 301 LSGGQVVEAVRPAPPVGWPPGPQLA 325
LSGGQVVEAVRPAPPVGWPPGPQLA
Sbjct 301 LSGGQVVEAVRPAPPVGWPPGPQLA 325
>gi|298526034|ref|ZP_07013443.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|298495828|gb|EFI31122.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
Length=325
Score = 626 bits (1615), Expect = 1e-177, Method: Compositional matrix adjust.
Identities = 323/325 (99%), Positives = 324/325 (99%), Gaps = 0/325 (0%)
Query 1 MSQPPEHPGNPADPQGGNQGAGSYPPPGYGAPPPPPGYGPPPGTYLPPGYNAPPPPPGYG 60
MSQPPEHPGNPADPQGGNQGAGSYPPPGYGAPPPPPGYGPPPGTYLPPGYNAPPPPPGYG
Sbjct 1 MSQPPEHPGNPADPQGGNQGAGSYPPPGYGAPPPPPGYGPPPGTYLPPGYNAPPPPPGYG 60
Query 61 PPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGL 120
PPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGL
Sbjct 61 PPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGL 120
Query 121 VVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDI 180
VVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDI
Sbjct 121 VVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDI 180
Query 181 ADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRS 240
DGKPVTIATFFRPRNLGLVLVTGLLIVA+TFIGGLLCVIPGLIFGFVAQFAVAFAVDRS
Sbjct 181 TDGKPVTIATFFRPRNLGLVLVTGLLIVALTFIGGLLCVIPGLIFGFVAQFAVAFAVDRS 240
Query 241 TSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRK 300
TSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRK
Sbjct 241 TSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRK 300
Query 301 LSGGQVVEAVRPAPPVGWPPGPQLA 325
LSGGQVVEAVRPAPPVGWPPGPQLA
Sbjct 301 LSGGQVVEAVRPAPPVGWPPGPQLA 325
>gi|289448209|ref|ZP_06437953.1| proline and glycine rich membrane protein [Mycobacterium tuberculosis
CPHL_A]
gi|289421167|gb|EFD18368.1| proline and glycine rich membrane protein [Mycobacterium tuberculosis
CPHL_A]
Length=325
Score = 626 bits (1614), Expect = 2e-177, Method: Compositional matrix adjust.
Identities = 323/325 (99%), Positives = 324/325 (99%), Gaps = 0/325 (0%)
Query 1 MSQPPEHPGNPADPQGGNQGAGSYPPPGYGAPPPPPGYGPPPGTYLPPGYNAPPPPPGYG 60
MSQPPEHPGNPADPQGGNQGAGSYPPPGYGAPPPPPGYGPPPGTYLPPGYNAPPPPPGYG
Sbjct 1 MSQPPEHPGNPADPQGGNQGAGSYPPPGYGAPPPPPGYGPPPGTYLPPGYNAPPPPPGYG 60
Query 61 PPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGL 120
PPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGL
Sbjct 61 PPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGL 120
Query 121 VVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDI 180
VVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDI
Sbjct 121 VVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDI 180
Query 181 ADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRS 240
ADGKPVTIATFFRPRNLGLVLVT LLIVA+TFIGGLLCVIPGLIFGFVAQFAVAFAVDRS
Sbjct 181 ADGKPVTIATFFRPRNLGLVLVTELLIVALTFIGGLLCVIPGLIFGFVAQFAVAFAVDRS 240
Query 241 TSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRK 300
TSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRK
Sbjct 241 TSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRK 300
Query 301 LSGGQVVEAVRPAPPVGWPPGPQLA 325
LSGGQVVEAVRPAPPVGWPPGPQLA
Sbjct 301 LSGGQVVEAVRPAPPVGWPPGPQLA 325
>gi|15842098|ref|NP_337135.1| hypothetical protein MT2637 [Mycobacterium tuberculosis CDC1551]
gi|13882380|gb|AAK46949.1| hypothetical protein MT2637 [Mycobacterium tuberculosis CDC1551]
Length=325
Score = 625 bits (1613), Expect = 2e-177, Method: Compositional matrix adjust.
Identities = 323/325 (99%), Positives = 324/325 (99%), Gaps = 0/325 (0%)
Query 1 MSQPPEHPGNPADPQGGNQGAGSYPPPGYGAPPPPPGYGPPPGTYLPPGYNAPPPPPGYG 60
MSQPPEHPGNPADPQGGNQGAGSYPPPGYGAPPPPPGYG PPGTYLPPGYNAPPPPPGYG
Sbjct 1 MSQPPEHPGNPADPQGGNQGAGSYPPPGYGAPPPPPGYGXPPGTYLPPGYNAPPPPPGYG 60
Query 61 PPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGL 120
PPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGL
Sbjct 61 PPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGL 120
Query 121 VVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDI 180
VVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDI
Sbjct 121 VVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDI 180
Query 181 ADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRS 240
ADGKPVTIATFFRPRNLGLVLVTGLLIVA+TFIGGLLCVIPGLIFGFVAQFAVAFAVDRS
Sbjct 181 ADGKPVTIATFFRPRNLGLVLVTGLLIVALTFIGGLLCVIPGLIFGFVAQFAVAFAVDRS 240
Query 241 TSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRK 300
TSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRK
Sbjct 241 TSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRK 300
Query 301 LSGGQVVEAVRPAPPVGWPPGPQLA 325
LSGGQVVEAVRPAPPVGWPPGPQLA
Sbjct 301 LSGGQVVEAVRPAPPVGWPPGPQLA 325
>gi|254551611|ref|ZP_05142058.1| putative proline and glycine rich transmembrane protein [Mycobacterium
tuberculosis '98-R604 INH-RIF-EM']
gi|294994329|ref|ZP_06800020.1| proline and glycine rich transmembrane protein [Mycobacterium
tuberculosis 210]
gi|297635171|ref|ZP_06952951.1| proline and glycine rich transmembrane protein [Mycobacterium
tuberculosis KZN 4207]
gi|297732163|ref|ZP_06961281.1| proline and glycine rich transmembrane protein [Mycobacterium
tuberculosis KZN R506]
gi|313659497|ref|ZP_07816377.1| proline and glycine rich transmembrane protein [Mycobacterium
tuberculosis KZN V2475]
Length=245
Score = 474 bits (1219), Expect = 1e-131, Method: Compositional matrix adjust.
Identities = 243/245 (99%), Positives = 245/245 (100%), Gaps = 0/245 (0%)
Query 81 VGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGLVVALSDRATTAYTNTSGVSS 140
+GDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGLVVALSDRATTAYTNTSGVSS
Sbjct 1 MGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGLVVALSDRATTAYTNTSGVSS 60
Query 141 ESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDIADGKPVTIATFFRPRNLGLV 200
ESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDIADGKPVTIATFFRPRNLGLV
Sbjct 61 ESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDIADGKPVTIATFFRPRNLGLV 120
Query 201 LVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASIETVGSNIGG 260
LVTGLLIVA+TFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASIETVGSNIGG
Sbjct 121 LVTGLLIVALTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASIETVGSNIGG 180
Query 261 SVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSGGQVVEAVRPAPPVGWPP 320
SVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSGGQVVEAVRPAPPVGWPP
Sbjct 181 SVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSGGQVVEAVRPAPPVGWPP 240
Query 321 GPQLA 325
GPQLA
Sbjct 241 GPQLA 245
>gi|308380402|ref|ZP_07669184.1| proline and glycine rich membrane protein [Mycobacterium tuberculosis
SUMu011]
gi|308361581|gb|EFP50432.1| proline and glycine rich membrane protein [Mycobacterium tuberculosis
SUMu011]
Length=304
Score = 452 bits (1162), Expect = 4e-125, Method: Compositional matrix adjust.
Identities = 254/254 (100%), Positives = 254/254 (100%), Gaps = 0/254 (0%)
Query 72 THLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGLVVALSDRATTA 131
THLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGLVVALSDRATTA
Sbjct 51 THLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGLVVALSDRATTA 110
Query 132 YTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDIADGKPVTIATF 191
YTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDIADGKPVTIATF
Sbjct 111 YTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDIADGKPVTIATF 170
Query 192 FRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASI 251
FRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASI
Sbjct 171 FRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASI 230
Query 252 ETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSGGQVVEAVR 311
ETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSGGQVVEAVR
Sbjct 231 ETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSGGQVVEAVR 290
Query 312 PAPPVGWPPGPQLA 325
PAPPVGWPPGPQLA
Sbjct 291 PAPPVGWPPGPQLA 304
>gi|183982179|ref|YP_001850470.1| proline and glycine rich transmembrane protein [Mycobacterium
marinum M]
gi|183175505|gb|ACC40615.1| proline and glycine rich transmembrane protein [Mycobacterium
marinum M]
Length=377
Score = 323 bits (827), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 156/252 (62%), Positives = 198/252 (79%), Gaps = 12/252 (4%)
Query 57 PGYGPPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGA 116
PG+G P PP FSVG+AISW+WNRFTQNA+ LVVP++ Y + L+AV G
Sbjct 113 PGFGGPAKPP------------FSVGEAISWAWNRFTQNAMALVVPIVIYGLILSAVGGV 160
Query 117 TAGLVVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTG 176
GL A SDR +T YT+ G +SE+V++TM+P A IVMF+GY+A+FA+VL+MHAGI TG
Sbjct 161 MVGLFFAFSDRTSTTYTDAYGNTSETVNMTMSPLASIVMFIGYLAVFAVVLFMHAGITTG 220
Query 177 CLDIADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFA 236
CLDIADGKPVTI +FF+PRNLG+V++TGLL++ +T IG +LC++PGLIFGF+AQFA+ A
Sbjct 221 CLDIADGKPVTIGSFFKPRNLGMVILTGLLVIVLTAIGSVLCIVPGLIFGFLAQFAIIAA 280
Query 237 VDRSTSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVY 296
VDRS SPIDS+K+S TV + +G + LSWL Q VL GELLCFVGML+G+PVA+LI Y
Sbjct 281 VDRSLSPIDSIKSSFATVRAELGNTALSWLVQYAVVLAGELLCFVGMLVGVPVASLIQTY 340
Query 297 TYRKLSGGQVVE 308
T+RKLSGGQVVE
Sbjct 341 TWRKLSGGQVVE 352
>gi|54289545|gb|AAV32079.1| putative membrane protein [Mycobacterium marinum]
Length=377
Score = 323 bits (827), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 156/252 (62%), Positives = 198/252 (79%), Gaps = 12/252 (4%)
Query 57 PGYGPPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGA 116
PG+G P PP FSVG+AISW+WNRFTQNA+ LVVP++ Y + L+AV G
Sbjct 113 PGFGGPAKPP------------FSVGEAISWAWNRFTQNAMALVVPIVIYGLILSAVGGV 160
Query 117 TAGLVVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTG 176
GL A SDR +T YT+ G +SE+V++TM+P A IVMF+GY+A+FA+VL+MHAGI TG
Sbjct 161 MVGLFFAFSDRTSTTYTDAYGNTSETVNMTMSPLASIVMFIGYLAVFAVVLFMHAGITTG 220
Query 177 CLDIADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFA 236
CLDIADGKPVTI +FF+PRNLG+V++TGLL++ +T IG +LC++PGLIFGF+AQFA+ A
Sbjct 221 CLDIADGKPVTIGSFFKPRNLGMVILTGLLVIVLTAIGSVLCIVPGLIFGFLAQFAIIAA 280
Query 237 VDRSTSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVY 296
VDRS SPIDS+K+S TV + +G + LSWL Q VL GELLCFVGML+G+PVA+LI Y
Sbjct 281 VDRSLSPIDSIKSSFATVRAELGNTALSWLVQYAVVLAGELLCFVGMLVGVPVASLIQTY 340
Query 297 TYRKLSGGQVVE 308
T+RKLSGGQVVE
Sbjct 341 TWRKLSGGQVVE 352
>gi|118617370|ref|YP_905702.1| proline and glycine rich transmembrane protein [Mycobacterium
ulcerans Agy99]
gi|118569480|gb|ABL04231.1| proline and glycine rich transmembrane protein [Mycobacterium
ulcerans Agy99]
Length=368
Score = 323 bits (827), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 156/252 (62%), Positives = 198/252 (79%), Gaps = 12/252 (4%)
Query 57 PGYGPPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGA 116
PG+G P PP FSVG+AISW+WNRFTQNA+ LVVP++ Y + L+AV G
Sbjct 104 PGFGGPAKPP------------FSVGEAISWAWNRFTQNAMALVVPIVIYGLILSAVGGV 151
Query 117 TAGLVVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTG 176
GL A SDR +T YT+ G +SE+V++TM+P A IVMF+GY+A+FA+VL+MHAGI TG
Sbjct 152 MVGLFFAFSDRTSTTYTDAYGNTSETVNMTMSPLASIVMFIGYLAVFAVVLFMHAGITTG 211
Query 177 CLDIADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFA 236
CLDIADGKPVTI +FF+PRNLG+V++TGLL++ +T IG +LC++PGLIFGF+AQFA+ A
Sbjct 212 CLDIADGKPVTIGSFFKPRNLGMVILTGLLVIVLTAIGSVLCIVPGLIFGFLAQFAIIAA 271
Query 237 VDRSTSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVY 296
VDRS SPIDS+K+S TV + +G + LSWL Q VL GELLCFVGML+G+PVA+LI Y
Sbjct 272 VDRSLSPIDSIKSSCATVRAELGNTALSWLVQYAVVLAGELLCFVGMLVGVPVASLIQTY 331
Query 297 TYRKLSGGQVVE 308
T+RKLSGGQVVE
Sbjct 332 TWRKLSGGQVVE 343
>gi|240170719|ref|ZP_04749378.1| proline and glycine rich transmembrane protein [Mycobacterium
kansasii ATCC 12478]
Length=247
Score = 204 bits (519), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 122/231 (53%), Positives = 157/231 (68%), Gaps = 10/231 (4%)
Query 81 VGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIG---ATAGLVVALSDRATTAYTNTSG 137
+G+AISW+WN+FT+N LVVP++ Y + +AAVIG A A + Y +
Sbjct 1 MGEAISWAWNKFTKNVAALVVPLVIYGLTMAAVIGIPLAIAFATAQTTTTTVVEYDYSYH 60
Query 138 VSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDIADGKPVTIATFFRPRNL 197
+S + I+ +GYIALF +V YMHAG+LTGCLDIADGKPV+I TFF+PRN+
Sbjct 61 TTSAE----FSAIGWILTIIGYIALFFVVAYMHAGLLTGCLDIADGKPVSIGTFFKPRNV 116
Query 198 GLVLVTGLLIVAVTFIGGLLCVIPG-LIFGFVAQFAVAFAVDRSTSPIDSVKASIETVGS 256
G V++T L+ I L C I G L+ F AQFA+AF VD+S SPI+S+KASI TV
Sbjct 117 GAVVLTSFLLAVGAMI--LSCTIVGPLVLAFFAQFAIAFVVDKSLSPIESIKASIATVRG 174
Query 257 NIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSGGQVV 307
+G S LSWL Q AVL+GEL C VGM++G+PVAAL+ VYTYRKL+GGQVV
Sbjct 175 ELGSSALSWLVQYAAVLIGELACLVGMVVGVPVAALVQVYTYRKLTGGQVV 225
>gi|108801112|ref|YP_641309.1| hypothetical protein Mmcs_4148 [Mycobacterium sp. MCS]
gi|119870253|ref|YP_940205.1| hypothetical protein Mkms_4223 [Mycobacterium sp. KMS]
gi|108771531|gb|ABG10253.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119696342|gb|ABL93415.1| conserved hypothetical protein [Mycobacterium sp. KMS]
Length=317
Score = 183 bits (464), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 110/256 (43%), Positives = 153/256 (60%), Gaps = 14/256 (5%)
Query 53 PPPPPGYGPPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAA 112
P P G PP G P GY SVG+A SW+WN+F +NAV L+V LAY + +
Sbjct 70 PVRPQGGYPPAGFGPGGY---------SVGEAFSWAWNKFGKNAVPLLVATLAYGLIIIV 120
Query 113 VIGATAGLVVALSDRATTAY-TNTSGVS-SESVDITMTPAAGIVMFLGYIALFALVLYMH 170
+ T L A+ +T Y ++ SG S ++D +PA IV F+G++ + +
Sbjct 121 IQALTNTLSAAVDPGDSTNYMSDGSGFEFSYTID---SPAGIIVAFIGWLISLVVAAAVQ 177
Query 171 AGILTGCLDIADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQ 230
+ L G LDIADG+ V+I +FFRPRN+G V++ GL++ +T +G LLCVIPGLI +
Sbjct 178 SAYLGGMLDIADGREVSIGSFFRPRNIGSVIIAGLIVGVITTVGFLLCVIPGLIASIMLM 237
Query 231 FAVAFAVDRSTSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVA 290
F V +DR+ +PI++VK S + N G L+WL + V VG LLC VG+L+ PVA
Sbjct 238 FTVVSLLDRNLAPIEAVKTSFDISKGNFGSVFLAWLVMVVTVFVGALLCGVGLLVAAPVA 297
Query 291 ALIHVYTYRKLSGGQV 306
LI VYTYR L+GGQV
Sbjct 298 TLILVYTYRVLTGGQV 313
>gi|296170788|ref|ZP_06852360.1| proline and glycine rich transmembrane protein [Mycobacterium
parascrofulaceum ATCC BAA-614]
gi|295894603|gb|EFG74340.1| proline and glycine rich transmembrane protein [Mycobacterium
parascrofulaceum ATCC BAA-614]
Length=316
Score = 182 bits (461), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 101/231 (44%), Positives = 138/231 (60%), Gaps = 17/231 (7%)
Query 81 VGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGLVVALSDRATTAYTNTSGVSS 140
V D SW+WN FT+NAV L+VP L Y + +A AG ++ LS T +G +S
Sbjct 79 VLDGFSWAWNTFTRNAVALIVPTLVYGLLIA-----VAGGLITLSQNMT------AGTTS 127
Query 141 ESVDITMT-----PAAGIVMFLGYIALFALVLYMHAGILTGCLDIADGKPVTIATFFRPR 195
+ D T T P G++ LGY+ +A+ + A L+GCLD+ADG+ VTI +FFRPR
Sbjct 128 DDYDFTFTTNLTAPGYGLLA-LGYLVAYAVSAFAQAAFLSGCLDLADGRAVTIGSFFRPR 186
Query 196 NLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASIETVG 255
N+G+V + LL+ +T I C IPGL+ G QF F +DRS S I +S G
Sbjct 187 NVGMVFLAVLLVEVLTSIASAACFIPGLVLGIFTQFTALFVIDRSESAIKGFTSSFSLAG 246
Query 256 SNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSGGQV 306
SN ++L WL + +VG LLC VG+L+ PVA+L+ VYTYR+LSGGQV
Sbjct 247 SNFVNALLLWLIVFASAIVGFLLCGVGLLVAAPVASLLIVYTYRRLSGGQV 297
>gi|126436950|ref|YP_001072641.1| hypothetical protein Mjls_4379 [Mycobacterium sp. JLS]
gi|126236750|gb|ABO00151.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=229
Score = 171 bits (434), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 99/228 (44%), Positives = 143/228 (63%), Gaps = 5/228 (2%)
Query 81 VGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGLVVALSDRATTAY-TNTSGVS 139
+G+A SW+WN+F +NAV L+V LAY + + + T L A+ +T Y ++ SG
Sbjct 1 MGEAFSWAWNKFGKNAVPLLVATLAYGLIIIVIQALTNTLSAAVDPGDSTNYMSDGSGFE 60
Query 140 -SESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDIADGKPVTIATFFRPRNLG 198
S ++D +PA IV F+G++ + + + L G LDIADG+ V+I +FFRPRN+G
Sbjct 61 FSYTID---SPAGIIVAFIGWLISLVVAAAVQSAYLGGMLDIADGREVSIGSFFRPRNIG 117
Query 199 LVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASIETVGSNI 258
V++ GL++ +T +G LLCVIPGLI + F V +DR+ +PI++VK S + N
Sbjct 118 SVIIAGLIVGVITTVGFLLCVIPGLIASIMLMFTVVSLLDRNLAPIEAVKTSFDISKGNF 177
Query 259 GGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSGGQV 306
G L+WL + V VG LLC VG+L+ PVA LI VYTYR L+GGQV
Sbjct 178 GSVFLAWLVMVVTVFVGALLCGVGLLVAAPVATLILVYTYRVLTGGQV 225
>gi|118466180|ref|YP_882618.1| hypothetical protein MAV_3436 [Mycobacterium avium 104]
gi|118167467|gb|ABK68364.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=325
Score = 168 bits (426), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 96/231 (42%), Positives = 144/231 (63%), Gaps = 12/231 (5%)
Query 79 FSVGDAISWSWNRFTQNAVTLVVPVLAYAVAL---AAVIGATAGLVVALSDRATTAYTNT 135
FSVG+A W+WN FT+N V L+VP L Y V L + +IG + + + +T T
Sbjct 79 FSVGEAFGWAWNAFTKNPVALIVPTLVYLVVLGGASTLIGLSQDVGTSGGGSGDDYFTFT 138
Query 136 SGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDIADGKPVTIATFFRPR 195
+ ++ + + + LGY+ + + + + L+GCLD+ADG+PVTI +FF+PR
Sbjct 139 ANLNGGGMAL---------LVLGYLVAYLVGAFAQSAYLSGCLDLADGRPVTIGSFFKPR 189
Query 196 NLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASIETVG 255
N G+V + LL+ +T I LC +PGLI G AQF +A+A+DRS S + ++ +S TV
Sbjct 190 NFGMVFLAALLVGILTSIASALCFLPGLILGLFAQFTIAYAIDRSESAVKALSSSFSTVT 249
Query 256 SNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSGGQV 306
+N+ ++L WLA+ V+VG L C VG+L+ PVAAL+ +Y YRKLSGGQV
Sbjct 250 ANLANALLVWLAEFALVVVGALACGVGLLLAAPVAALVGIYAYRKLSGGQV 300
>gi|145222632|ref|YP_001133310.1| hypothetical protein Mflv_2044 [Mycobacterium gilvum PYR-GCK]
gi|315443097|ref|YP_004075976.1| integral membrane protein [Mycobacterium sp. Spyr1]
gi|145215118|gb|ABP44522.1| integral membrane protein-like protein [Mycobacterium gilvum
PYR-GCK]
gi|315261400|gb|ADT98141.1| predicted integral membrane protein [Mycobacterium sp. Spyr1]
Length=335
Score = 168 bits (425), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 92/227 (41%), Positives = 136/227 (60%), Gaps = 4/227 (1%)
Query 80 SVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGLVVALSDRATTAYTNTSGVS 139
+G A SWS+N+F++NAV L+VP L YA+ VIG ++ L+ YT+ SG
Sbjct 108 DIGAAFSWSFNKFSKNAVPLIVPTLVYAL----VIGVLGAVIFGLASLFPADYTSYSGAD 163
Query 140 SESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDIADGKPVTIATFFRPRNLGL 199
+ + M PAA I++FLG I LF + + A + G LDIA+G+ V +FF+PRN+G
Sbjct 164 GAGMSLDMGPAATIILFLGLIMLFVVGGAISAAYMAGVLDIANGQQVEFGSFFKPRNIGA 223
Query 200 VLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASIETVGSNIG 259
V++ L++ T IG +LC++PGLI A F F VDR+ S ID +KASI +N
Sbjct 224 VVIASLIVGIATSIGYVLCIVPGLIVSIFALFTTVFIVDRNLSAIDGIKASIAVAKANFL 283
Query 260 GSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSGGQV 306
L+WL + VG +C++G+++ +P+A L VY YR L+GG V
Sbjct 284 QVFLTWLIFNVLISVGSFVCYIGLIVTVPLAVLYMVYAYRTLTGGYV 330
>gi|336461500|gb|EGO40368.1| putative integral membrane protein [Mycobacterium avium subsp.
paratuberculosis S397]
Length=303
Score = 161 bits (407), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 92/231 (40%), Positives = 141/231 (62%), Gaps = 12/231 (5%)
Query 79 FSVGDAISWSWNRFTQNAVTLVVPVLAYAVAL---AAVIGATAGLVVALSDRATTAYTNT 135
FSVG+A W+WN FT+N V L+VP L Y V L + +IG + + + +T T
Sbjct 73 FSVGEAFGWAWNAFTKNPVALIVPTLVYLVVLGGASTLIGLSQDVGTSGGGSGDDYFTFT 132
Query 136 SGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDIADGKPVTIATFFRPR 195
+ ++ + + + LGY+ + + + + L+GCLD+ADG+PVTI +FF+PR
Sbjct 133 ANLNGGGMAL---------LVLGYLVAYLVGAFAQSAYLSGCLDLADGRPVTIGSFFKPR 183
Query 196 NLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASIETVG 255
N G+V + LL+ +T + LC +PGLI G AQF + +A+DRS S + ++ +S TV
Sbjct 184 NFGMVFLAALLVGILTSVASALCFLPGLILGLFAQFTIPYAIDRSESAVKALSSSFSTVT 243
Query 256 SNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSGGQV 306
+N ++L WLA+ V+VG + C G+L+ PVAAL+ +Y YRKLSGGQV
Sbjct 244 ANFANALLVWLAEFALVVVGAVACGAGLLLAAPVAALVGIYAYRKLSGGQV 294
>gi|254775882|ref|ZP_05217398.1| hypothetical protein MaviaA2_14600 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=245
Score = 159 bits (401), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 92/229 (41%), Positives = 140/229 (62%), Gaps = 12/229 (5%)
Query 81 VGDAISWSWNRFTQNAVTLVVPVLAYAVAL---AAVIGATAGLVVALSDRATTAYTNTSG 137
+G+A W+WN FT+N V L+VP L Y V L + +IG + + + +T T+
Sbjct 1 MGEAFGWAWNAFTKNPVALIVPTLVYLVVLGGASTLIGLSQDVGTSGGGSGDDYFTFTAN 60
Query 138 VSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDIADGKPVTIATFFRPRNL 197
++ + + + LGY+ + + + + L+GCLD+ADG+PVTI +FF+PRN
Sbjct 61 LNGGGMAL---------LVLGYLVAYLVGAFAQSAYLSGCLDLADGRPVTIGSFFKPRNF 111
Query 198 GLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASIETVGSN 257
G+V + LL+ +T I LC +PGLI G AQF + +A+DRS S + ++ +S TV +N
Sbjct 112 GMVFLAALLVGILTSIASALCFLPGLILGLFAQFTIPYAIDRSESAVKALSSSFSTVTAN 171
Query 258 IGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSGGQV 306
++L WLA+ V+VG L C VG+L+ PVAAL+ +Y YRKLSGGQV
Sbjct 172 FANALLVWLAEFALVVVGALACGVGLLLAAPVAALVGIYAYRKLSGGQV 220
>gi|342858621|ref|ZP_08715276.1| proline and glycine rich transmembrane protein [Mycobacterium
colombiense CECT 3035]
gi|342134325|gb|EGT87505.1| proline and glycine rich transmembrane protein [Mycobacterium
colombiense CECT 3035]
Length=216
Score = 158 bits (400), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 73/154 (48%), Positives = 108/154 (71%), Gaps = 0/154 (0%)
Query 154 VMFLGYIALFALVLYMHAGILTGCLDIADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFI 213
++ LGY+ + + + + L+GCLD+ DG+PVTI +FF+PRN G+V + LL+ +T I
Sbjct 41 LLILGYLVAYLVGAFAQSAFLSGCLDLTDGRPVTIGSFFKPRNFGMVFLAALLVGILTSI 100
Query 214 GGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASIETVGSNIGGSVLSWLAQLTAVL 273
+LC +PGLI G AQF + +A+DRS PI ++ +S TV +N G ++L WL ++ AV+
Sbjct 101 ASMLCFLPGLILGIFAQFTIPYAIDRSEQPIKALTSSFSTVAANFGNALLVWLVEVAAVI 160
Query 274 VGELLCFVGMLIGIPVAALIHVYTYRKLSGGQVV 307
VG L C VG+L+ +PVAAL+ +Y YRK SGGQVV
Sbjct 161 VGFLACGVGVLVAVPVAALVGIYAYRKFSGGQVV 194
>gi|254821818|ref|ZP_05226819.1| hypothetical protein MintA_17932 [Mycobacterium intracellulare
ATCC 13950]
Length=223
Score = 158 bits (400), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 86/213 (41%), Positives = 127/213 (60%), Gaps = 10/213 (4%)
Query 97 VTLVVPVLAYAVALAAVIGATAGLVVALSDRATTAYTNTSGVSSESVDITMTPAAG--IV 154
+ L+VP L Y + + G+ AL + + T T+G + T G +
Sbjct 1 MALIVPALVYGILI--------GVASALVGLSQSVGTTTTGSDDDYFTFTANLNGGGMTL 52
Query 155 MFLGYIALFALVLYMHAGILTGCLDIADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIG 214
+ LGY+ + + + A L+GCLD+ADG+PVT+ +FF+PRN G+V + LL+ +T I
Sbjct 53 LILGYLVAYLVGAFAQAAFLSGCLDLADGRPVTVGSFFKPRNFGMVFLAALLVGILTSIA 112
Query 215 GLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLV 274
LC +PGLI G AQF + FA+DRS PI ++ +S TV +N G ++L WL ++ +V
Sbjct 113 SALCFLPGLILGIFAQFTIPFAIDRSEQPIKALTSSFSTVTANFGNALLVWLVEVALFVV 172
Query 275 GELLCFVGMLIGIPVAALIHVYTYRKLSGGQVV 307
G L C VG+L+ PVA+LI +Y YRK SGGQVV
Sbjct 173 GALACGVGLLVAAPVASLIGIYAYRKFSGGQVV 205
>gi|296140939|ref|YP_003648182.1| hypothetical protein Tpau_3258 [Tsukamurella paurometabola DSM
20162]
gi|296029073|gb|ADG79843.1| conserved hypothetical protein [Tsukamurella paurometabola DSM
20162]
Length=295
Score = 152 bits (385), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 107/321 (34%), Positives = 158/321 (50%), Gaps = 46/321 (14%)
Query 1 MSQPPEHPGNPADPQGG---------------NQGAGSYPPPGYGAPPPPPGYGPPPGTY 45
M+QPP +PG D QGG G P G PPP GY P PG
Sbjct 1 MTQPPNNPG---DNQGGFPPPQDPQQPGVPPQQPGGYPPPQGAQGFPPPAGGYQPAPG-- 55
Query 46 LPPGYNAPPPPPGYGPPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLA 105
GY AP P Y SVGDA SW+WN+FT+NA L+ +LA
Sbjct 56 ---GYGAPQVQPQY--------------------SVGDAFSWAWNKFTKNAWPLIGAMLA 92
Query 106 YAVALAAVIGATAGLVVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFAL 165
+A+ +A ++ + V +L+ T + G + D +T + +V +G + L
Sbjct 93 FAIIMA-IVSSLVYWVFSLTVTNVQDVTYSDGTEGPTFD--LTGWSYVVGIIGVAVIIYL 149
Query 166 VLYMHAGILTGCLDIADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIF 225
L + A TG LDIADG+ VT+ +FF+PRN G +L ++G ++ ++PG++
Sbjct 150 ALLIQASYTTGVLDIADGRKVTVGSFFKPRNFGSAAGAAILTTLAIYVGLIIFIVPGIVL 209
Query 226 GFVAQFAVAFAVDRSTSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLI 285
F ++V FAVD++ ++KAS V SN G S+L+ G +LC++G L+
Sbjct 210 AFFLAYSVLFAVDKNIGGGGALKASWNAVKSNAGNSILTTFLAGLVAAAGAVLCYIGALV 269
Query 286 GIPVAALIHVYTYRKLSGGQV 306
P+ L+ VY YR L+GGQV
Sbjct 270 TGPLGQLVQVYAYRTLTGGQV 290
>gi|120405625|ref|YP_955454.1| hypothetical protein Mvan_4673 [Mycobacterium vanbaalenii PYR-1]
gi|119958443|gb|ABM15448.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=323
Score = 150 bits (380), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 100/254 (40%), Positives = 151/254 (60%), Gaps = 15/254 (5%)
Query 58 GYGPPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGAT 117
GY PP G PP +SVGDA +W+WN+F++NA+ L+V L + + + A + A
Sbjct 83 GY-PPVGGPPA----------YSVGDAFNWAWNKFSKNAMPLIVATLVFGIVVIA-LQAI 130
Query 118 AGLVVALSDRATTAY-TNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTG 176
+V AL T+Y + SG S T A IV +G+ + + + L G
Sbjct 131 INIVQALVSPGDTSYIADDSGFSFSYA--TTGVAGTIVAIVGWFLSLIVTAAIQSAFLGG 188
Query 177 CLDIADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFA 236
DIA+G+ V + +FFRPRN+G V++ GL++ +T +G LC++PG+I F+ F
Sbjct 189 IFDIANGQQVAVGSFFRPRNVGNVIIAGLIVGVITTVGLFLCIVPGVIASFLLMFTTIAV 248
Query 237 VDRSTSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVY 296
+DR+ +P+D++K+S ET +N+G +L+WLA + V VG LLC VG+L+ P+AALI VY
Sbjct 249 LDRNLAPMDAIKSSFETSKNNVGPVLLTWLASVAVVFVGALLCGVGLLVAAPLAALILVY 308
Query 297 TYRKLSGGQVVEAV 310
YR L+GG V AV
Sbjct 309 AYRTLNGGFVAPAV 322
>gi|41407169|ref|NP_960005.1| hypothetical protein MAP1071c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41395520|gb|AAS03388.1| hypothetical protein MAP_1071c [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=319
Score = 146 bits (369), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 93/231 (41%), Positives = 141/231 (62%), Gaps = 12/231 (5%)
Query 79 FSVGDAISWSWNRFTQNAVTLVVPVLAYAVAL---AAVIGATAGLVVALSDRATTAYTNT 135
FSVG+A W+WN FT+N V L+VP L Y V L + +IG + + + +T T
Sbjct 73 FSVGEAFGWAWNAFTKNPVALIVPTLVYLVVLGGASTLIGLSQDVGTSGGGSGDDYFTFT 132
Query 136 SGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDIADGKPVTIATFFRPR 195
++ + + + LGY+ + + + + L+GCLD+ADG+PVTI +FF+PR
Sbjct 133 PNLNGGGMAL---------LVLGYLVAYLVGAFAQSAYLSGCLDLADGRPVTIGSFFKPR 183
Query 196 NLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASIETVG 255
N G+V + LL+ +T + LC +PGLI G AQF + +A+DRS S + ++ +S TV
Sbjct 184 NFGMVFLAALLVGILTSVASALCFLPGLILGLFAQFTIPYAIDRSESAVKALSSSFSTVT 243
Query 256 SNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSGGQV 306
+N ++L WLA+ V+VG + C VG+L+ PVAAL+ +Y YRKLSGGQV
Sbjct 244 ANFANALLVWLAEFALVVVGAVACGVGLLLAAPVAALVGIYAYRKLSGGQV 294
>gi|262200958|ref|YP_003272166.1| integral membrane protein-like protein [Gordonia bronchialis
DSM 43247]
gi|262084305|gb|ACY20273.1| integral membrane protein-like protein [Gordonia bronchialis
DSM 43247]
Length=250
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 88/252 (35%), Positives = 148/252 (59%), Gaps = 15/252 (5%)
Query 61 PPPGPPPPGY---PTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLA-YAVALAAVIGA 116
PP G PPGY PT + VG+A W+W +F N +++P LA +A+AL ++ A
Sbjct 2 PPAGAVPPGYGADPTKVD-----VGEAFGWAWGKFKNNVGVMILPGLAVFALALVVLLIA 56
Query 117 TAGLVVALSDRATTAYTNT-SGVSSESVDITMTPAAGIVMFLGYIALFAL-VLYMHAGIL 174
+ A S TT T SG + V T A G ++ + LF + +LY+ A I+
Sbjct 57 ----IFATSIFGTTETTTIGSGEYATDVQSTTLGAGGTILLILVQLLFYIGLLYLQASII 112
Query 175 TGCLDIADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVA 234
+G + +A+G+P++ A+F P G V+ T +L+ + IG +LC+IPGLI F QF+V
Sbjct 113 SGAIRVANGEPISAASFLVPIRFGPVIGTAILVGIIVAIGSVLCIIPGLIAIFFLQFSVV 172
Query 235 FAVDRSTSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIH 294
+D++ SPI+++KAS E + +G S+++ L VLVG ++C++G+++ P+A L +
Sbjct 173 ATIDKALSPIEAMKASFELAKAKVGDSLITLLVTYAIVLVGAIICYIGLIVAAPLAQLFY 232
Query 295 VYTYRKLSGGQV 306
V+ +R+L+G +
Sbjct 233 VHCWRRLNGAAI 244
>gi|169627754|ref|YP_001701403.1| hypothetical protein MAB_0651c [Mycobacterium abscessus ATCC
19977]
gi|169239721|emb|CAM60749.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=327
Score = 140 bits (353), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 95/247 (39%), Positives = 136/247 (56%), Gaps = 22/247 (8%)
Query 79 FSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGLVVALSDRATTAYTNTSGV 138
FS G++ SWSW + ++ T + P L + +A IG G+V A+ A+ T+TSG
Sbjct 85 FSAGESWSWSWAQVSKRFGTFIPPYLVWFLA----IGLPVGIVYAIL-MASLPQTSTSGY 139
Query 139 SSESVDITMTPAAG--------IVMFLGYIALFALVLYMHAGILTGCLDIADGKPVTIAT 190
S G +M L Y +FA+ LY+ A +++ LD+ADGKPV+ T
Sbjct 140 GGNSRSSYSYSYEGPELSGGAIAIMILLYAVVFAVSLYVGACLISANLDVADGKPVSFGT 199
Query 191 FFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKAS 250
FFR R GL + LL+ IG LL +I G+IFGF AQ+AV FA+DR P+D++KAS
Sbjct 200 FFRARGFGLYVGAALLVGVGVLIGSLL-IIGGVIFGFFAQYAVFFAIDRGLGPVDALKAS 258
Query 251 IETVGSNIGGSVLSWLAQLTAVLVGELLCFV----GMLIGIPVA----ALIHVYTYRKLS 302
+ V N+G +++ +L L G L F+ G +I P A LIHVYTYR+L+
Sbjct 259 FQLVKDNLGQALVVFLITLGVAFGGFALTFITCGLGGIIAYPAAGALTGLIHVYTYRRLT 318
Query 303 GGQVVEA 309
GG + A
Sbjct 319 GGTIAPA 325
>gi|296140940|ref|YP_003648183.1| hypothetical protein Tpau_3259 [Tsukamurella paurometabola DSM
20162]
gi|296029074|gb|ADG79844.1| conserved hypothetical protein [Tsukamurella paurometabola DSM
20162]
Length=314
Score = 120 bits (301), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 79/236 (34%), Positives = 130/236 (56%), Gaps = 23/236 (9%)
Query 79 FSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGLVVALSDRATTAYTNTSGV 138
F++GD SW+WN+FT+NA L++ + V L ++ LV A+ Y G
Sbjct 89 FNLGDGFSWAWNKFTKNAANLILAL----VVLGIIVSIVGFLVSAI-------YGALFGQ 137
Query 139 SSESVDITMTPAAGIVMFLGYIALFALVLYM-HAGILTGCLDIADGKPVTIATFFRPRNL 197
+++ T+ + G + + + +V Y+ A +G LDIADGK + +FF+PRN+
Sbjct 138 TADDGSYTVYYSPGTLQGAVFTLITGIVAYIAQAAYFSGVLDIADGKQIGFGSFFKPRNV 197
Query 198 G-------LVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKAS 250
G LV V L+ + ++G LL +I G F+A F + VDR S +D VK +
Sbjct 198 GQVALVSVLVSVVNALLSFIPYVGSLLSIIVG----FIAAFTLLVVVDRGVSAVDGVKQA 253
Query 251 IETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSGGQV 306
+E + +IG ++++++ V+ G +LC VGML+ +P+AAL+ V YR +SG QV
Sbjct 254 VEVIQKDIGNAIVAYIIAGLLVIAGAILCGVGMLVTVPLAALLMVNAYRLISGAQV 309
>gi|343925912|ref|ZP_08765427.1| hypothetical protein GOALK_050_02070 [Gordonia alkanivorans NBRC
16433]
gi|343764263|dbj|GAA12353.1| hypothetical protein GOALK_050_02070 [Gordonia alkanivorans NBRC
16433]
Length=322
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 78/225 (35%), Positives = 124/225 (56%), Gaps = 6/225 (2%)
Query 80 SVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGLVVALSDRATTAYTNTSGVS 139
VG+A SW++N+F N +++P L + AA+I V Y N G
Sbjct 94 DVGEAFSWAFNKFKNNVGAMILPGLVVLLLGAALIAVGFSAVALFGTTERVDYGN--GYY 151
Query 140 SESVDITMTPAAGIVMF-LGYIALFALVLYMHAGILTGCLDIADGKPVTIATFFRPRNLG 198
E + G V+F L Y+ +LY+ A I++G + +A+G+PVT +F P G
Sbjct 152 YEETSLGF---GGSVLFGLVYLVFILGLLYIQASIISGAVRVANGEPVTAKSFLTPIRFG 208
Query 199 LVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASIETVGSNI 258
V+ T +L+ +T IG LC+IPG+I F F+V +D+S SPI+++K S E S +
Sbjct 209 PVVGTAILVGIITGIGYALCIIPGIIAMFFLMFSVVATIDKSLSPINAMKNSFELTKSKV 268
Query 259 GGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSG 303
G S+++ L LVG L+C+VG+++ PVA L V+ +R+L+G
Sbjct 269 GDSIITLLVTYAINLVGVLVCYVGLIVAAPVAQLFLVHCWRRLNG 313
>gi|326773565|ref|ZP_08232848.1| proline and glycine rich transmembrane protein [Actinomyces viscosus
C505]
gi|326636795|gb|EGE37698.1| proline and glycine rich transmembrane protein [Actinomyces viscosus
C505]
Length=327
Score = 113 bits (282), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 107/349 (31%), Positives = 151/349 (44%), Gaps = 76/349 (21%)
Query 5 PEHPGNPAD-PQGGN--QGAGSYPPPGYGAPPPPPGYGPP----PGTYLPPGYNAPPPP- 56
P++PG P D Q G QGAG+ PGYG P PGY P PG PGY A P P
Sbjct 10 PQYPGYPDDGSQAGGVPQGAGT---PGYG---PQPGYDPQVGSVPGYGAQPGYGAGPDPQ 63
Query 57 --PGYGPPPG------PPPPGYPTHLQSSG-----------------------------F 79
PGYGP PG P P P + G
Sbjct 64 QQPGYGPQPGATQGYGPQPAAGPDYASQPGAVPGYGPQPGMGAGGGMPPYPPGAMGGAPL 123
Query 80 SVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGLVVALSDRATTAYTNTSGVS 139
SVGD +SW+W++F +NA+ LVV + GL LS A+ +G
Sbjct 124 SVGDGMSWAWSKFKENALILVVGM---------------GLWTVLSSFTVEAHYTVNG-- 166
Query 140 SESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDIADGKPVTIATFFRPRNLGL 199
E + G + L I LFA ++ H I +A G+P+ F N G
Sbjct 167 -EEHGFGLGVPFGTYIALA-IGLFASIVTTHMAI-----KVATGRPLAWGDLFTFPNFGA 219
Query 200 VLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASIETVGSNIG 259
L+ L T +G LLC +PG+I F+ ++V F VD+ I +KAS T+ S++G
Sbjct 220 SLLAAFLTWLATSVGSLLCAVPGIIAAFLFHYSVYFTVDKGMDGIAGMKASWATLSSHVG 279
Query 260 GSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSGGQVVE 308
LA + ++G + +G L+ +P+ L+ Y+Y ++ G VV
Sbjct 280 ELFPFALAGVGLYILGA-VTLIGWLVTVPLVMLLSAYSYVRIQGYDVVR 327
>gi|296130052|ref|YP_003637302.1| hypothetical protein Cfla_2212 [Cellulomonas flavigena DSM 20109]
gi|296021867|gb|ADG75103.1| conserved hypothetical protein [Cellulomonas flavigena DSM 20109]
Length=311
Score = 112 bits (280), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 86/291 (30%), Positives = 137/291 (48%), Gaps = 31/291 (10%)
Query 23 SYPPPGYGAPPPPPG--YGPPPGTYLPPG--YNAPPPPPGYGPPPGPPPPGYPTHLQSSG 78
+YPPP YG P P G Y PP Y PG Y P P YG P PP +
Sbjct 43 AYPPPAYGTPADPSGGAYPPPASPYGQPGQPYGQPGQP--YGQPYTPP---------AGQ 91
Query 79 FSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGLVVALSDRATTAYTNTSGV 138
+G SW++++F Q+ V+ LA+ +A V G+V + +
Sbjct 92 VDIGAGFSWAFSKFGQHWAAFVLGGLAWFAVIAVVFAIGLGIV-----------GGAAAL 140
Query 139 SSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDIADGKPVTIATFFRPRNLG 198
+ +S G+ +F I L L++ A + L +ADG+P+++ F + G
Sbjct 141 TGDSSAGGFGATLGLAVFFAIILL--LLVLFSAAFVKAALKVADGRPISVGDLFDTSHAG 198
Query 199 LVLVTGLLIVAVTFIGGLLCVIPGLIF---GFVAQFAVAFAVDRSTSPIDSVKASIETVG 255
++V LL A + L+ I + GF A +AV +DR+ ID+++ S
Sbjct 199 QLVVLALLYGAAGLVASLIPFIGQIALIAVGFFAFYAVVSIIDRNLGAIDAIRTSFSLQT 258
Query 256 SNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSGGQV 306
++G +L ++ VG L+C VG+L+ +PV AL+ VY YR+L+GGQ+
Sbjct 259 RDLGTGILVYVVVGLVSWVGSLVCGVGVLVSLPVGALLTVYAYRRLTGGQI 309
>gi|333918550|ref|YP_004492131.1| hypothetical protein AS9A_0879 [Amycolicicoccus subflavus DQS3-9A1]
gi|333480771|gb|AEF39331.1| Hypothetical membrane protein [Amycolicicoccus subflavus DQS3-9A1]
Length=310
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 98/300 (33%), Positives = 156/300 (52%), Gaps = 28/300 (9%)
Query 7 HPGNPADPQGGNQGAGSYPPPGYGAPPPPPGYGPPPGTYLPPGYNAPPPPPGYGPPPGPP 66
PG +P GG G YP G P G PPPG YL PG N P P YGP G
Sbjct 25 QPGATPEP-GGYPPQGEYPTAGGSLSP---GAKPPPGNYLDPGENPPTP---YGPIKGSG 77
Query 67 PPGYPTHLQSSG--FSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGLVVAL 124
G + ++ F +GDA+ ++WN++ N + +L + GL+VA
Sbjct 78 GKGKLRYRGAADVTFDIGDALRFAWNKYVNNVGAWIGFLL---------LSLVFGLMVAF 128
Query 125 SDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDI-ADG 183
A+ + +G + + ++ AA +G + A+++ + A I+ G LD AD
Sbjct 129 P--ASMIFLAPAGEPDRNPLLVVSLAA-----VGIAIIVAVLIVLSAAIVRGALDESADE 181
Query 184 KPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSP 243
+P + F R N+ +L+ + + A+T G LLCV+PGLI GF++ F V F VD++ +
Sbjct 182 RP-ALRDFLRLTNISQILLATVTVAALTLAGLLLCVVPGLIVGFLSMFTVHFVVDQNQNA 240
Query 244 IDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSG 303
I+++K+S TV N+G +L +A V++G ++ VG LI IPV+A+ Y YR+++G
Sbjct 241 IEALKSSWRTVIDNVGPLLLLTVACYLIVVLGTVV-IVGFLITIPVSAIALAYAYRRVTG 299
>gi|336320349|ref|YP_004600317.1| hypothetical protein Celgi_1230 [Cellvibrio gilvus ATCC 13127]
gi|336103930|gb|AEI11749.1| hypothetical protein Celgi_1230 [Cellvibrio gilvus ATCC 13127]
Length=260
Score = 107 bits (266), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 80/258 (32%), Positives = 123/258 (48%), Gaps = 40/258 (15%)
Query 59 YGPPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYA----------- 107
YG PG G V +A W W +FT+N +++ +L Y
Sbjct 23 YGSAPG------------QGVDVVEAFKWGWKKFTENVSPILLAILGYVVAIAVVVVIWY 70
Query 108 VALAAVIGATAGLVVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVL 167
V LAAV T+ +V D +V + P V+F+G + VL
Sbjct 71 VILAAVFLKTSDDIVIHDD--------------GTVSMGSGPNFLAVLFVGALTTLVAVL 116
Query 168 Y---MHAGILTGCLDIADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLI 224
M AG + G L +A G+ +T FF+ +NL V++T LL+ +T +G L +PG+
Sbjct 117 LVSIMQAGFVQGALRLARGEALTPDAFFKFKNLPGVVLTSLLVAILTAVGCALFYLPGIA 176
Query 225 FGFVAQFAVAFAVDRSTSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGML 284
QF + +A+DR P+D+VKAS E V +N+ + L++L + A VG L C +G L
Sbjct 177 AALFLQFTLYYAIDRGLGPVDAVKASFELVKNNLATAGLTFLGLIVANAVGSLACGIGAL 236
Query 285 IGIPVAALIHVYTYRKLS 302
+ +PV L Y YR+L+
Sbjct 237 VALPVGLLAQAYVYRRLT 254
>gi|229820238|ref|YP_002881764.1| integral membrane protein [Beutenbergia cavernae DSM 12333]
gi|229566151|gb|ACQ80002.1| integral membrane protein [Beutenbergia cavernae DSM 12333]
Length=452
Score = 105 bits (263), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 76/237 (33%), Positives = 120/237 (51%), Gaps = 22/237 (9%)
Query 70 YPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGLVVALSDRAT 129
Y + +Q++ D SW W +FT+N TLV+ L + + +AA++ + +++ + A
Sbjct 228 YASQVQAT-----DGFSWGWKKFTENWGTLVLAQLLWGLIIAALVILWSFIIIGIGRAAA 282
Query 130 TAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVL----YMHAGILTGCLDIADGKP 185
SG ++E A ++ F G + LF V+ G++ G L+IA+GKP
Sbjct 283 G-----SGSATED-------AFSVLGFFGTVVLFFAVIAGAFLSQIGMVHGYLEIANGKP 330
Query 186 VTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPID 245
VT+ FF +N+G L LLI + +G + ++ GLI F A + + F VD+ ID
Sbjct 331 VTLKDFFTFKNVGAALGATLLIALASMVGSFI-IVGGLIVLFFALYVIWFIVDQRRGAID 389
Query 246 SVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLS 302
+KA I +N G + L L L A VG LC +G LI P+ L Y YR+L
Sbjct 390 GIKAGINLSANNFGQTALLLLLVLVANAVGSALCGIGTLISAPLGNLATTYMYRRLQ 446
>gi|269956984|ref|YP_003326773.1| hypothetical protein Xcel_2197 [Xylanimonas cellulosilytica DSM
15894]
gi|269305665|gb|ACZ31215.1| hypothetical protein Xcel_2197 [Xylanimonas cellulosilytica DSM
15894]
Length=313
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 83/260 (32%), Positives = 127/260 (49%), Gaps = 24/260 (9%)
Query 56 PPGYGPPPGPP--PPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAV 113
PPGYG P PP P + ++GDA+S++W +F QN + V L + A +
Sbjct 70 PPGYGQYASMPTAPPATPYGVAPPTLTIGDALSFAWAKFRQNWASWVAFALIFVAATVLL 129
Query 114 IGATAGLVVALSDRATTAYTNTSGVSSESVDITMTPAA-GIVMFLGYIALFALVLYMHAG 172
+ V +DRA G D T AA G+++ G ++ A + HA
Sbjct 130 VLPATLQAVDAADRAVD-----RGEVFTMDDFRFTAAATGLMVLGGLLSYVAQAMAWHA- 183
Query 173 ILTGCLDIADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQ-- 230
L ADG ++A F R LG+ ++TG++I + G++ IP FG +A
Sbjct 184 ----ALREADGARPSLAQFVAARRLGVAVLTGIVIAVAS---GIVAFIP---FGSIAWQI 233
Query 231 ---FAVAFAVDRSTSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGI 287
FA+AF VDRS SP ++ S TVG N G + L L L+G L VG+L+ +
Sbjct 234 FTVFAIAFVVDRSLSPFAAIAESFRTVGRNFGSVFVLLLTLLGINLLGFLALGVGLLVTL 293
Query 288 PVAALIHVYTYRKLSGGQVV 307
P++ L Y +R+++GG +V
Sbjct 294 PLSVLALTYAFRRITGGTIV 313
>gi|226303835|ref|YP_002763793.1| hypothetical protein RER_03460 [Rhodococcus erythropolis PR4]
gi|226182950|dbj|BAH31054.1| hypothetical membrane protein [Rhodococcus erythropolis PR4]
Length=204
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 74/227 (33%), Positives = 118/227 (52%), Gaps = 26/227 (11%)
Query 81 VGDAISWSWNRFTQNAVTLV-VPVLAYAVALAAVIGATAGLVVALSDRATTAYTNTSGVS 139
+G AI++ WN+F NA+ + + ++A+ +A + GA G YTNT S
Sbjct 1 MGAAITYGWNKFKDNALVWIGISIIAFLIA-GLIQGAFNGF----------DYTNTE-FS 48
Query 140 SESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDIADGKPVTIATFFRPRNLGL 199
+ S+ + A +GYI + A L G L DG TFF+ N+G
Sbjct 49 ALSIVGGLVTA-----IVGYI--------IQAAFLRGALSELDGIKPAFGTFFQFTNIGA 95
Query 200 VLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASIETVGSNIG 259
V++ G L+ T++G +LC+IPG+I F+ + + F VD++ I +K+S SN+G
Sbjct 96 VVLGGFLVAVATYVGLVLCIIPGIIAAFLLYYTLTFIVDKNQDAISGIKSSYALTSSNVG 155
Query 260 GSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSGGQV 306
+L LA + ++G LLC +G+L+ PVA + Y YR L+GG V
Sbjct 156 TLILLALALIGINIIGALLCGIGLLVTAPVALIASTYAYRVLTGGHV 202
>gi|344043844|gb|EGV39531.1| hypothetical protein CgS9114_12717 [Corynebacterium glutamicum
S9114]
Length=297
Score = 95.9 bits (237), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 91/313 (30%), Positives = 145/313 (47%), Gaps = 30/313 (9%)
Query 2 SQPPEHPGNPADPQGGN-QGAGSYPPPGYGAPPPPPGYGPPPGTYLPPGYNAPPPPPGYG 60
SQ P N + Q GN G +Y P YGAP YG P G G+NA P
Sbjct 7 SQYPGDDNNNWNSQFGNLSGEQNYGQP-YGAP-----YGQPYGQPFDQGFNAYSSPI--- 57
Query 61 PPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAY-AVALAAVIGATAG 119
PP P P +S F +G +W F V+ L Y AV L +
Sbjct 58 PPEVPQPSMQEAQWRS--FDLGTVFGQAWKGFAATWQAWVLSTLIYFAVILVLMFAWIIP 115
Query 120 LVVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVL--YMHA-GILTG 176
+V L+ AT++ ++++ ++ AAG F G++ + LV ++++
Sbjct 116 MVGVLA--ATSSGSDSAAIA----------AAGGTSFFGFVLMIVLVFISFIYSLNCYRN 163
Query 177 CLDIADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFA 236
+ G+ +TI +FF+ + LG L +L+ V FIG +L +IPG+I V FAV A
Sbjct 164 AARVVRGEQITIQSFFKMKGLGKALGIYILVNIVIFIGMILLLIPGIIAAVVLVFAVPVA 223
Query 237 VD-RSTSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHV 295
R S D+ AS + V N+G ++L +L +G + +GML+ P+ L++
Sbjct 224 FQLRDASIGDAFSASWKVVSKNVGQTILLFLVIFVLSFLGSAVI-IGMLVTTPLTFLLYA 282
Query 296 YTYRKLSGGQVVE 308
Y ++ SGG +++
Sbjct 283 YAFQTASGGPIMQ 295
>gi|145296517|ref|YP_001139338.1| hypothetical protein cgR_2428 [Corynebacterium glutamicum R]
gi|140846437|dbj|BAF55436.1| hypothetical protein [Corynebacterium glutamicum R]
Length=297
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 92/313 (30%), Positives = 142/313 (46%), Gaps = 30/313 (9%)
Query 2 SQPPEHPGNPADPQGGN-QGAGSYPPPGYGAPPPPPGYGPPPGTYLPPGYNAPPPPPGYG 60
SQ P N + Q GN G +Y P YGAP YG P G G+NA P
Sbjct 7 SQYPGDDNNNWNSQFGNLSGEQNYGQP-YGAP-----YGQPYGQPFDQGFNAYSSPI--- 57
Query 61 PPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAY-AVALAAVIGATAG 119
PP P P +S F +G +W F V+ L Y AV L +
Sbjct 58 PPEVPQPSMQEAQWRS--FDLGTVFGQAWKGFAATWQAWVLSTLIYFAVILVLMFAWIIP 115
Query 120 LVVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILT---G 176
+V L+ AT++ ++++ ++ AAG F G++ + LV L
Sbjct 116 MVGVLA--ATSSGSDSAAIA----------AAGGTSFFGFVLMIVLVFISFVYSLNCYRN 163
Query 177 CLDIADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFA 236
+ G+ +TI +FF+ + LG L +L+ V FIG +L +IPG+I V FAV A
Sbjct 164 AARVVRGEQITIQSFFKMKGLGKALGIYILVNIVIFIGMILLLIPGIIAAVVLVFAVPVA 223
Query 237 VD-RSTSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHV 295
R S D+ AS + V N+G ++L +L +G + +GML+ P+ L++
Sbjct 224 FQLRDASIGDAFSASWKVVSKNVGQTILLFLVIFVLSFLGSAVI-IGMLVTTPLTFLLYA 282
Query 296 YTYRKLSGGQVVE 308
Y ++ SGG +++
Sbjct 283 YAFQTASGGPIMQ 295
>gi|19553719|ref|NP_601721.1| hypothetical protein NCgl2434 [Corynebacterium glutamicum ATCC
13032]
gi|62391360|ref|YP_226762.1| hypothetical protein cg2777 [Corynebacterium glutamicum ATCC
13032]
gi|21325292|dbj|BAB99913.1| Hypothetical membrane protein [Corynebacterium glutamicum ATCC
13032]
gi|41326701|emb|CAF21183.1| putative membrane protein [Corynebacterium glutamicum ATCC 13032]
Length=297
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 94/309 (31%), Positives = 136/309 (45%), Gaps = 22/309 (7%)
Query 2 SQPPEHPGNPADPQGGN-QGAGSYPPPGYGAPPPPPGYGPPPGTYLPPGYNAPPPPPGYG 60
SQ P N + Q GN G +Y P YGAP YG P G G+NA P
Sbjct 7 SQYPGDDNNNWNSQFGNPSGEQNYGQP-YGAP-----YGQPYGQPFDQGFNAYSSPI--- 57
Query 61 PPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGL 120
PP P P +S F +G +W FT V+ L Y L ++ A +
Sbjct 58 PPEVPQPSMQEAQWRS--FDLGTVFGQAWKGFTATWQAWVLSALIYFAVLLVLM--FAWI 113
Query 121 VVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDI 180
+ +S A T +SG S+S I T F+ I L + +
Sbjct 114 LPMVSVLAAT----SSG--SDSAAIAATGGTSFFGFMLMIVLAFISFVYSLNCYRNAARV 167
Query 181 ADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVD-R 239
G+ +TI +FF+ + LG L +LI V FIG +L +IPG+I V FAV A R
Sbjct 168 VRGEQITIQSFFKMKGLGKALGIYILINIVIFIGMILLLIPGIIAAVVLIFAVPVAFQLR 227
Query 240 STSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYR 299
S D+ AS + V N+G +L LA +G + +GML+ P+ L++ Y ++
Sbjct 228 DASIGDAFSASWKAVSKNVGQVILLELAIFALSFLGSAVI-IGMLVTTPLTFLLYAYAFQ 286
Query 300 KLSGGQVVE 308
SGG +++
Sbjct 287 TASGGPIMQ 295
>gi|54025618|ref|YP_119860.1| hypothetical protein nfa36480 [Nocardia farcinica IFM 10152]
gi|54017126|dbj|BAD58496.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=272
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 85/262 (33%), Positives = 123/262 (47%), Gaps = 31/262 (11%)
Query 52 APPPPPGYGPPPGPPPP----GYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYA 107
AP P P YGPP P GY + VG+AIS+ +F N + P LA
Sbjct 38 APTPGPQYGPPGSAPADQPVYGYQQLAAPTTLDVGNAISYGLEKFRSN----MAPWLA-V 92
Query 108 VALAAVIGATAGLVVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVL 167
A+ VI T LVV T P + + + L ++A+ +
Sbjct 93 TAVGVVIYLTFLLVVQ----------------------TFEPNSLLSLVLLFLAVMVGLW 130
Query 168 YMHAGILTGCLDIADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGF 227
+ A ++ G L DG +FF+ N G VL+T LL T++G LCV+PGL G
Sbjct 131 LLQAAMVRGALHETDGVKPVFGSFFQVLNAGNVLLTALLAFLGTWLGLALCVLPGLAVGV 190
Query 228 VAQFAVAFAVDRSTSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGI 287
+ F++ F VD+ PID+++AS V N +L L+ + L+G L C +G+L
Sbjct 191 LCMFSLHFVVDQDLGPIDAIRASAMLVARNPVQVLLLALSVVVITLLGLLACGIGVLFAG 250
Query 288 PVAALIHVYTYRKLSGGQVVEA 309
PV L Y YR L+GG++V A
Sbjct 251 PVCVLAVTYAYRGLTGGRLVPA 272
>gi|226363515|ref|YP_002781297.1| hypothetical protein ROP_41050 [Rhodococcus opacus B4]
gi|226242004|dbj|BAH52352.1| hypothetical membrane protein [Rhodococcus opacus B4]
Length=222
Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 71/228 (32%), Positives = 115/228 (51%), Gaps = 21/228 (9%)
Query 79 FSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGLVVALSDRATTAYTNTSGV 138
FSVGDAI + WN+F NA+ + +L +AAVI LV G
Sbjct 14 FSVGDAIGYGWNKFKDNALIWIGILL-----IAAVIQVVLNLVFG-------------GF 55
Query 139 SSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDIADGKPVTIATFFRPRNLG 198
S+ S M+ A + +G I + ++A ++ G L DG +FF+ N+G
Sbjct 56 STSS---DMSAAFSVWRIIGTIVTTIVGYLINAALVRGALHEVDGNKPAFGSFFQFTNVG 112
Query 199 LVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASIETVGSNI 258
+++ ++I T IG +L +IPGLI F+ + + F +D++ I +K+S + N+
Sbjct 113 AIIIASVIIGVATTIGFVLLIIPGLIVIFLTWWTLQFVIDQNEDAITGIKSSFRVISQNV 172
Query 259 GGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSGGQV 306
G +L LA + +VG +LC VG+L+ IP+ + Y YR L+G V
Sbjct 173 GPVLLLALALVGINIVGAILCGVGLLVSIPITIIASTYAYRVLTGRYV 220
>gi|118468565|ref|YP_889514.1| hypothetical protein MSMEG_5268 [Mycobacterium smegmatis str.
MC2 155]
gi|118169852|gb|ABK70748.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=377
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 76/287 (27%), Positives = 128/287 (45%), Gaps = 78/287 (27%)
Query 59 YGPPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATA 118
+G P G P G P +SVGDA SW+WN+F+++AV ++VP L + + A + G
Sbjct 119 FGQPGGYAPVGAPGF--GGAYSVGDAFSWAWNKFSKHAVEMIVPALVFGLVYAILQGIVN 176
Query 119 GLVVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCL 178
G+ + A+T+TS S++ +++M G+V +G I
Sbjct 177 GI--------SGAFTSTS--SADGFELSMATGGGVVSIIGAII----------------- 209
Query 179 DIADGKPVTIAT------------------------FFRPRNLGLVLVTGLLIVAVTFIG 214
I T FF+PRN+G V++ +++ + F+
Sbjct 210 -------TLIVTAVIQAAYISGVLEIANGQPVTIGSFFKPRNVGDVIIATVIVGVINFVV 262
Query 215 GLLCVIPGLI---FGFV---------AQFAVAF------AVDRSTSPIDSVKASIETVGS 256
+ + PG + FV A AV F +DR+ S +D+VK S E +
Sbjct 263 AAILLFPGFFVPGYLFVGVPVLLIASAIIAVLFLFTTVAVLDRNLSGVDAVKTSFELSKA 322
Query 257 NIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSG 303
N G ++ + +L G + C +G+L+ P+ ALI VY +R+L+G
Sbjct 323 NFGTVFITAVVIFLLLLAGAIACGIGLLVAYPLVALIEVYAFRRLTG 369
>gi|332670859|ref|YP_004453867.1| integral membrane protein [Cellulomonas fimi ATCC 484]
gi|332339897|gb|AEE46480.1| integral membrane protein [Cellulomonas fimi ATCC 484]
Length=334
Score = 80.1 bits (196), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 70/233 (31%), Positives = 119/233 (52%), Gaps = 23/233 (9%)
Query 81 VGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGLVVALSDRATTAYTNTSGVSS 140
+G+A+S+ W +FT N + V V +AAV + GLV + +GV+
Sbjct 110 IGEAMSYGWGKFTTNG-GVFVAAALIWVVVAAVAVSLVGLV----------FGGLAGVTD 158
Query 141 ESVDITMTPAAGIVMFLGYI---ALFALVLYM-HAGILTGCLDIADGKPVTIATFFRPRN 196
D + AG+ + G+I A+F L Y+ A + L++ G+P +A FF
Sbjct 159 PDGD--GSGLAGVGLSFGWIVVNAVFWLAAYLVQAAFVRVSLNLTYGRPARLADFFSFER 216
Query 197 LGLVLVTGLLIVAVTFIGGLLCVIP--GLIF----GFVAQFAVAFAVDRSTSPIDSVKAS 250
G V++T LL+ V + L+ IP G + F+ F + F +D+ SP+D++++S
Sbjct 217 PGPVVLTALLLAGVNLVVSLVSWIPLIGWLLPAAVNFLLLFTLWFVIDKDLSPVDALRSS 276
Query 251 IETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSG 303
++ V +N+G ++L +L + G LC VG+LI +PV + Y YR+L G
Sbjct 277 VQLVTANLGTTILFYLLGFLVLAAGAALCGVGLLIALPVVLVATSYLYRRLLG 329
>gi|119714909|ref|YP_921874.1| hypothetical protein Noca_0661 [Nocardioides sp. JS614]
gi|119535570|gb|ABL80187.1| conserved hypothetical protein [Nocardioides sp. JS614]
Length=282
Score = 79.7 bits (195), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 64/228 (29%), Positives = 117/228 (52%), Gaps = 19/228 (8%)
Query 83 DAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGLVVALSDRATTAYTNTSGVSSES 142
+A+S+ W +F N +++ + VAL V ++ AL+ A+ + N S
Sbjct 60 NALSYGWAKFQANTAQIILSAVVLVVALVVVAVLGTFVMNALTTDASCSVQNGS------ 113
Query 143 VDITMTPAAGIVMFLGYIALFAL-------VLYMHA-GILTGCLDIADGKPVTIATFFRP 194
+T G F G + L +L V ++ G++ L++ G+P A +
Sbjct 114 --LTCDDGTG---FFGRLILQSLLSAVLLVVAWIIGAGLVRASLNVTAGRPFLFADVIKT 168
Query 195 RNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASIETV 254
NLG V+V ++I TF+G +LC +PGL+ GF + + F +D++ +P+D++KAS+ V
Sbjct 169 DNLGSVVVASVIIAVATFVGTILCYLPGLVVGFATSYTLFFIIDKNMAPVDAIKASVLFV 228
Query 255 GSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLS 302
N+ +++ ++ VG ++C VG L+ +PV L YTY+ L+
Sbjct 229 KDNLAATIVWYIVGGLVAAVGFVICVVGALVSVPVVLLGTAYTYKTLN 276
>gi|111021155|ref|YP_704127.1| proline rich protein [Rhodococcus jostii RHA1]
gi|110820685|gb|ABG95969.1| possible proline rich protein [Rhodococcus jostii RHA1]
Length=338
Score = 79.3 bits (194), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 67/231 (30%), Positives = 110/231 (48%), Gaps = 27/231 (11%)
Query 79 FSVGDAISWSWNRFTQNAV---TLVVPVLAYAVALAAVIGATAGLVVALSDRATTAYTNT 135
FSVGDAI + WN+F NA+ +++ V L V G
Sbjct 130 FSVGDAIGYGWNKFKDNALIWIGILLIAAIIQVVLNLVFG-------------------- 169
Query 136 SGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDIADGKPVTIATFFRPR 195
G S+ S M+ A + +G I + ++A ++ G L DG +FF+
Sbjct 170 -GFSTSS---DMSAAFSVWRIIGTIVTTIVGYLINAALVRGALHEVDGNKPAFGSFFQFT 225
Query 196 NLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASIETVG 255
N+ +++ ++I IG +L +IPGLI F+ + + F +D++ I +K+S +
Sbjct 226 NVAAIIIASVIIGVAATIGFVLLIIPGLIVIFLTWWTLQFVIDQNEDAITGIKSSFRVIS 285
Query 256 SNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSGGQV 306
N+G +L LA + +VG LLC VG+L+ IP+ + Y YR L+G V
Sbjct 286 QNVGPVLLLALALVGINIVGALLCGVGLLVSIPITIIASTYAYRVLTGRYV 336
>gi|334337464|ref|YP_004542616.1| proline rich protein [Isoptericola variabilis 225]
gi|334107832|gb|AEG44722.1| proline rich protein [Isoptericola variabilis 225]
Length=325
Score = 76.6 bits (187), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 87/277 (32%), Positives = 130/277 (47%), Gaps = 31/277 (11%)
Query 35 PPGYGPPPGTYLPPGYNAPPPPPGYGPPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQ 94
PP YG Y P G+ PPP G+ P GP Y + VG A SW+W F +
Sbjct 66 PPAYG----QYAPEGW--APPPAGHDPYGGP---AYGQAPDAGAVRVGTAFSWAWASFGR 116
Query 95 NAVTLVVPVLAY-AVALAAVIGATAGL---VVALSD-RATTAYTNTSGVSSESVDITMTP 149
+A + L A+A+AA T L V L D A A NT ++E T+
Sbjct 117 SAGAWIGATLVLGAIAMAASWLLTPSLRDTVTNLGDPAALDAVVNTPVSTTE----TLLS 172
Query 150 AAGIVMFLGYIALFALVLYMHAGILTGCLDIADGKPVTIATFFRPRNLGLVLVTGLLIVA 209
A G ++ LFAL ++TG L + FF RNL VLV GL+ A
Sbjct 173 ALGSLV---NTVLFAL-------LVTGALAATRKGTASFGDFFALRNLAGVLVYGLITAA 222
Query 210 VTFIGGLLCVIPG---LIFGFVAQFAVAFAVDRSTSPIDSVKASIETVGSNIGGSVLSWL 266
++F+ L + G L+ F A+ F +D+ I ++++S+ V N+G +++ L
Sbjct 223 ISFVLSFLPFLGGVLQLVVSFFLAAAIFFVIDKEQDAITAIRSSVRLVSRNLGTVLITVL 282
Query 267 AQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSG 303
+ VG LL VG+L+ +P+A L+ + YR+L G
Sbjct 283 LAVVVTFVGALLLVVGLLVAVPIAVLLGAHVYRRLVG 319
>gi|256832776|ref|YP_003161503.1| integral membrane protein [Jonesia denitrificans DSM 20603]
gi|256686307|gb|ACV09200.1| integral putative membrane protein [Jonesia denitrificans DSM
20603]
Length=299
Score = 76.3 bits (186), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 98/319 (31%), Positives = 136/319 (43%), Gaps = 39/319 (12%)
Query 2 SQPPEHPGNPADPQGGNQGAGS-----YPPPGYGAPPPPPGYGPPPGTYLPPGYNAPPPP 56
+ P P N P G G Y PGYG PP G P G P
Sbjct 4 NNDPTQPENQQPPTSGQPGPTEQPQYPYQQPGYGQQPPAQPQGNPYDGQQQYGQPGAQQP 63
Query 57 PGYGPPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGA 116
G YP S+GF +GDA SW WN+F NA + ++ Y + L V
Sbjct 64 GYGQQGYGQQGASYPHQNNSAGFPIGDAFSWGWNKFKDNAGAFIGGMVIYGLILLIV--- 120
Query 117 TAGLVVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGY--IALFALVLYMHAGIL 174
T + G S+ S D G +M LG+ + LF+LV+ A
Sbjct 121 ------------TIIMSVVLGASAASGD-------GGLMALGFGGLILFSLVVGALALAA 161
Query 175 TG-----CLDIADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCV--IPGLIFGF 227
L +A G+ +T+A FF NLG ++ LLI GLL I G+I F
Sbjct 162 GALFAKVALKVAAGQKLTLADFFDFSNLGQAIIVSLLIAVAN---GLLAWTGIAGIIISF 218
Query 228 VAQFAVAFAVDRSTSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGI 287
FA+ FA+D++ ID++KAS +N ++L + + V VG LL VG+LI
Sbjct 219 FTIFALYFALDKNMGAIDAIKASATLAMNNFVPTLLLLVFVMLLVFVGALLLGVGLLITT 278
Query 288 PVAALIHVYTYRKLSGGQV 306
PV+ L + Y++L G V
Sbjct 279 PVSLLAIAWVYKRLIGESV 297
>gi|325068575|ref|ZP_08127248.1| hypothetical protein AoriK_12171 [Actinomyces oris K20]
Length=205
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 65/233 (28%), Positives = 107/233 (46%), Gaps = 33/233 (14%)
Query 80 SVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGLVVALSDRATTAYTNTSGVS 139
SVGD +SW+W++F NA+ LVV +A+ LS+ + +G
Sbjct 2 SVGDGLSWAWSKFKDNALILVVGFGVWAI---------------LSNLGFDSRVELNG-- 44
Query 140 SESVDITMTPAAGIVMFLGYIA----LFALVLYMHAGILTGCLDIADGKPVTIATFFRPR 195
E + + F GY+A LF+ ++ + L +A G+ + F
Sbjct 45 -EEYGFSYG-----IPFWGYVAPVVRLFSAIVAANM-----SLKVASGRQLEWNDIFSFP 93
Query 196 NLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASIETVG 255
N G L+ L T +G LLC IPG+I F+ ++V F VD+ I +KAS T+
Sbjct 94 NFGASLLASFLTAVATGVGLLLCFIPGIIMAFLLYYSVYFTVDKGVDGIAGMKASWATLS 153
Query 256 SNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSGGQVVE 308
S++G L + +G + +G L+ +P+ AL+ Y+Y ++ G VV
Sbjct 154 SHVGELFPFALTGVGLYFIGG-ITLIGWLVTVPLVALLSAYSYVRIQGYDVVR 205
>gi|312137830|ref|YP_004005166.1| integral membrane protein [Rhodococcus equi 103S]
gi|311887169|emb|CBH46478.1| putative integral membrane protein [Rhodococcus equi 103S]
Length=331
Score = 73.2 bits (178), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 71/246 (29%), Positives = 119/246 (49%), Gaps = 31/246 (12%)
Query 58 GYGPPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQN-AVTLVVPVLAYAVALAAVIGA 116
GYG P PPP S +VGDA+S+ WNR+ N V + + +A+ +++ +
Sbjct 104 GYGQRPAGPPP--------SQVTVGDALSYGWNRYKANPGVWIGILAVAFLISVVVSLPF 155
Query 117 TAGLVVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTG 176
+ G S+R +++ + S I IV +L + A ++ G
Sbjct 156 SFG-----SNRDIEDWSDLATSSFSVWQIIGNVVTAIVGYL-----------ISAALIRG 199
Query 177 CLDIADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFA 236
L DG+P +FF +N+G +++ L+ +T +G +L VIPGLI F+ + + F
Sbjct 200 ALHEVDGRPPAFGSFFEFKNVGAIIIASFLVGLMTAVGFVLLVIPGLILMFLTWWTLEFV 259
Query 237 VDRSTSPIDSVKASIETVG---SNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALI 293
VD+ I ++K+S SN G +L + ++G LLC VG+L+ IPV+ +
Sbjct 260 VDQDQDAITAIKSSFR---AISSNWGTLLLLAITLFFLNVLGVLLCVVGLLVTIPVSIIA 316
Query 294 HVYTYR 299
Y YR
Sbjct 317 STYAYR 322
>gi|23009702|ref|ZP_00050654.1| COG5473: Predicted integral membrane protein [Magnetospirillum
magnetotacticum MS-1]
Length=199
Score = 72.8 bits (177), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 53/179 (30%), Positives = 90/179 (51%), Gaps = 13/179 (7%)
Query 126 DRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDIADGKP 185
D + T + SG++ SV + + +GY+ + A G LD ADG+
Sbjct 30 DYSDTNFAALSGITFTSVVLGLVGTV-----IGYL--------ITAFFTRGALDEADGRR 76
Query 186 VTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPID 245
+A FFR N+ VL+ L++ +++IG LCV+PGL + F A+D+ I
Sbjct 77 PDVAAFFRIGNVVNVLLAALIVGVLSYIGLFLCVLPGLAVLLFSAFVYYVALDQGVDAIT 136
Query 246 SVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSGG 304
+++ S V N G L LA + ++G + C +G+ + IP++ + Y YR+L+GG
Sbjct 137 AIRTSFSLVAKNFGQVFLLLLALVGINILGAIPCGLGLFVTIPLSYVTVGYAYRRLTGG 195
>gi|325676070|ref|ZP_08155752.1| YjbE family integral membrane protein [Rhodococcus equi ATCC
33707]
gi|325553110|gb|EGD22790.1| YjbE family integral membrane protein [Rhodococcus equi ATCC
33707]
Length=215
Score = 72.0 bits (175), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/185 (28%), Positives = 93/185 (51%), Gaps = 17/185 (9%)
Query 80 SVGDAISWSWNRFTQN-AVTLVVPVLAYAVALAAVIGATAGLVVALSDRATTAYTNTSGV 138
+VGDA+S+ WNR+ N V + + +A+ +++ + + G S+R +++ +
Sbjct 2 TVGDALSYGWNRYKANPGVWIGILAVAFLISVLVSLPFSFG-----SNRDIEDWSDLATS 56
Query 139 SSESVDITMTPAAGIVMFLGYIALFALVLYMHAGILTGCLDIADGKPVTIATFFRPRNLG 198
S I IV GY+ + A ++ G L DG+P +FF +N+G
Sbjct 57 SFSVWQIIGNVVTAIV---GYL--------ISAALIRGALHEVDGRPPAFGSFFEFKNVG 105
Query 199 LVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASIETVGSNI 258
+++ L+ +T +G +L VIPGLI F+ + + F VD+ I ++K+S + SN
Sbjct 106 AIIIASFLVGLMTAVGFVLLVIPGLILMFLTWWTLEFVVDQDQDAITAIKSSFRAISSNW 165
Query 259 GGSVL 263
G +L
Sbjct 166 GTLLL 170
>gi|326382958|ref|ZP_08204648.1| proline and glycine rich transmembrane protein [Gordonia neofelifaecis
NRRL B-59395]
gi|326198548|gb|EGD55732.1| proline and glycine rich transmembrane protein [Gordonia neofelifaecis
NRRL B-59395]
Length=253
Score = 70.5 bits (171), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 60/233 (26%), Positives = 108/233 (47%), Gaps = 2/233 (0%)
Query 80 SVGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGLVVALSDRATTAYTNTSGVS 139
+ A+ WSW +F + +++VP L V +A V+ V R + +
Sbjct 19 DIAAALRWSWGQFRAHPWSMIVPGLISTV-MAFVLTLIGQWVSVNRPRIYFSDLRHHVIF 77
Query 140 SESVDITMTPAAGIVM-FLGYIALFALVLYMHAGILTGCLDIADGKPVTIATFFRPRNLG 198
S + A IV+ + Y + LY ++G + A G+ + F P +
Sbjct 78 SRVFEDPKFDAKTIVIGLILYFVSMNVTLYFQNCTVSGAIRAARGESIGPKAFLVPMHFR 137
Query 199 LVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASIETVGSNI 258
+ T + +G + +IP LI + QF++ A+ T PIDSVKAS + +
Sbjct 138 NTVRTVTIACVGLILGAIAFIIPALIVLYFWQFSILIAIGTETGPIDSVKASQRITRTRV 197
Query 259 GGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSGGQVVEAVR 311
S+L+ L ++VG L+ FVG+++ P+AAL+ + +R++ G + + VR
Sbjct 198 VASLLTLLVCGGLIVVGFLVYFVGLIVAGPLAALVQAHCFRQIMGLPIEQPVR 250
Lambda K H
0.319 0.141 0.444
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 577149478596
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40