BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv0313 Length=128 Score E Sequences producing significant alignments: (Bits) Value gi|15607454|ref|NP_214827.1| hypothetical protein Rv0313 [Mycoba... 252 1e-65 gi|340625344|ref|YP_004743796.1| hypothetical protein MCAN_03151... 250 6e-65 gi|306782941|ref|ZP_07421263.1| hypothetical protein TMCG_02996 ... 249 7e-65 gi|240173239|ref|ZP_04751897.1| hypothetical protein MkanA1_2826... 220 5e-56 gi|118619804|ref|YP_908136.1| hypothetical protein MUL_4741 [Myc... 206 9e-52 gi|254821944|ref|ZP_05226945.1| hypothetical protein MintA_18572... 206 1e-51 gi|118466301|ref|YP_883971.1| hypothetical protein MAV_4847 [Myc... 205 2e-51 gi|342859209|ref|ZP_08715863.1| hypothetical protein MCOL_10033 ... 204 5e-51 gi|296167685|ref|ZP_06849931.1| conserved hypothetical protein [... 203 8e-51 gi|15828357|ref|NP_302620.1| hypothetical protein ML2518 [Mycoba... 201 3e-50 gi|145224018|ref|YP_001134696.1| hypothetical protein Mflv_3432 ... 176 7e-43 gi|167967752|ref|ZP_02550029.1| hypothetical protein MtubH3_0682... 165 2e-39 gi|118469083|ref|YP_887988.1| hypothetical protein MSMEG_3685 [M... 160 6e-38 gi|120404165|ref|YP_953994.1| hypothetical protein Mvan_3186 [My... 158 2e-37 gi|226359688|ref|YP_002777466.1| hypothetical protein ROP_02740 ... 158 3e-37 gi|111017181|ref|YP_700153.1| hypothetical protein RHA1_ro00159 ... 157 4e-37 gi|108799855|ref|YP_640052.1| hypothetical protein Mmcs_2889 [My... 156 1e-36 gi|333988957|ref|YP_004521571.1| hypothetical protein JDM601_031... 145 3e-33 gi|262203256|ref|YP_003274464.1| hypothetical protein Gbro_3369 ... 142 2e-32 gi|343924790|ref|ZP_08764329.1| hypothetical protein GOALK_026_0... 140 6e-32 gi|258654171|ref|YP_003203327.1| hypothetical protein Namu_4046 ... 135 2e-30 gi|296140779|ref|YP_003648022.1| hypothetical protein Tpau_3092 ... 125 3e-27 gi|54026723|ref|YP_120965.1| hypothetical protein nfa47490 [Noca... 109 1e-22 gi|167966670|ref|ZP_02548947.1| hypothetical protein MtubH3_0079... 98.2 4e-19 gi|326381421|ref|ZP_08203115.1| hypothetical protein SCNU_00680 ... 85.5 2e-15 gi|218670352|ref|ZP_03520023.1| 4-alpha-glucanotransferase prote... 35.8 2.3 gi|323498828|ref|ZP_08103812.1| putative glycosyltransferase [Vi... 35.0 3.7 gi|115373877|ref|ZP_01461169.1| YvcK [Stigmatella aurantiaca DW4... 34.7 4.7 >gi|15607454|ref|NP_214827.1| hypothetical protein Rv0313 [Mycobacterium tuberculosis H37Rv] gi|15839698|ref|NP_334735.1| hypothetical protein MT0327 [Mycobacterium tuberculosis CDC1551] gi|31791492|ref|NP_853985.1| hypothetical protein Mb0321 [Mycobacterium bovis AF2122/97] 76 more sequence titlesLength=128 Score = 252 bits (644), Expect = 1e-65, Method: Compositional matrix adjust. Identities = 127/128 (99%), Positives = 128/128 (100%), Gaps = 0/128 (0%) Query 1 VGDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSRRSRPAP 60 +GDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSRRSRPAP Sbjct 1 MGDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSRRSRPAP 60 Query 61 ETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVL 120 ETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVL Sbjct 61 ETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVL 120 Query 121 DDPVDEAQ 128 DDPVDEAQ Sbjct 121 DDPVDEAQ 128 >gi|340625344|ref|YP_004743796.1| hypothetical protein MCAN_03151 [Mycobacterium canettii CIPT 140010059] gi|340003534|emb|CCC42655.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059] Length=128 Score = 250 bits (638), Expect = 6e-65, Method: Compositional matrix adjust. Identities = 126/128 (99%), Positives = 127/128 (99%), Gaps = 0/128 (0%) Query 1 VGDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSRRSRPAP 60 +GDYGPFGFDPDEFDRVIREGSEGLRDA ERIGRFLSSSGAGTGWSAIFEDLSRRSRPAP Sbjct 1 MGDYGPFGFDPDEFDRVIREGSEGLRDACERIGRFLSSSGAGTGWSAIFEDLSRRSRPAP 60 Query 61 ETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVL 120 ETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVL Sbjct 61 ETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVL 120 Query 121 DDPVDEAQ 128 DDPVDEAQ Sbjct 121 DDPVDEAQ 128 >gi|306782941|ref|ZP_07421263.1| hypothetical protein TMCG_02996 [Mycobacterium tuberculosis SUMu003] gi|308332219|gb|EFP21070.1| hypothetical protein TMCG_02996 [Mycobacterium tuberculosis SUMu003] Length=128 Score = 249 bits (637), Expect = 7e-65, Method: Compositional matrix adjust. Identities = 126/128 (99%), Positives = 127/128 (99%), Gaps = 0/128 (0%) Query 1 VGDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSRRSRPAP 60 +G YGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSRRSRPAP Sbjct 1 MGAYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSRRSRPAP 60 Query 61 ETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVL 120 ETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVL Sbjct 61 ETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVL 120 Query 121 DDPVDEAQ 128 DDPVDEAQ Sbjct 121 DDPVDEAQ 128 >gi|240173239|ref|ZP_04751897.1| hypothetical protein MkanA1_28261 [Mycobacterium kansasii ATCC 12478] Length=128 Score = 220 bits (561), Expect = 5e-56, Method: Compositional matrix adjust. Identities = 110/127 (87%), Positives = 116/127 (92%), Gaps = 0/127 (0%) Query 1 VGDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSRRSRPAP 60 +G GPFGFDPD+FDRVIREGSEGLRDAFERI RFL+ G +GWS IFEDL RR RPAP Sbjct 1 MGSNGPFGFDPDDFDRVIREGSEGLRDAFERINRFLTGPGDRSGWSMIFEDLGRRRRPAP 60 Query 61 ETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVL 120 ETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVL Sbjct 61 ETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVL 120 Query 121 DDPVDEA 127 DDP D++ Sbjct 121 DDPADDS 127 >gi|118619804|ref|YP_908136.1| hypothetical protein MUL_4741 [Mycobacterium ulcerans Agy99] gi|183980592|ref|YP_001848883.1| hypothetical protein MMAR_0564 [Mycobacterium marinum M] gi|118571914|gb|ABL06665.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99] gi|183173918|gb|ACC39028.1| conserved hypothetical protein [Mycobacterium marinum M] Length=128 Score = 206 bits (524), Expect = 9e-52, Method: Compositional matrix adjust. Identities = 106/128 (83%), Positives = 111/128 (87%), Gaps = 0/128 (0%) Query 1 VGDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSRRSRPAP 60 +GD G FGFDPD+FDRVIREGSEGLRDAFERIGRF+S G WS IFEDL R RPAP Sbjct 1 MGDNGGFGFDPDDFDRVIREGSEGLRDAFERIGRFVSGPGDRPAWSMIFEDLGHRRRPAP 60 Query 61 ETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVL 120 ETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTD +RKVRFLPYGIAVSVL Sbjct 61 ETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDSRRKVRFLPYGIAVSVL 120 Query 121 DDPVDEAQ 128 DD E + Sbjct 121 DDSSGEPE 128 >gi|254821944|ref|ZP_05226945.1| hypothetical protein MintA_18572 [Mycobacterium intracellulare ATCC 13950] Length=128 Score = 206 bits (523), Expect = 1e-51, Method: Compositional matrix adjust. Identities = 101/127 (80%), Positives = 113/127 (89%), Gaps = 0/127 (0%) Query 1 VGDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSRRSRPAP 60 +G++G FGFDPDEFDR+IREGSEGLRD FER+ RF+++ G +GWS IFED+SRR RPAP Sbjct 1 MGEHGGFGFDPDEFDRMIREGSEGLRDVFERVSRFVAAPGGRSGWSTIFEDMSRRPRPAP 60 Query 61 ETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVL 120 ETAGEAG+GVWAIYTVD GGARVEQVYATELDALRANKDN DPKRKVRFLPYGIAVSVL Sbjct 61 ETAGEAGEGVWAIYTVDDGGGARVEQVYATELDALRANKDNLDPKRKVRFLPYGIAVSVL 120 Query 121 DDPVDEA 127 DD +E Sbjct 121 DDDSEET 127 >gi|118466301|ref|YP_883971.1| hypothetical protein MAV_4847 [Mycobacterium avium 104] gi|254777284|ref|ZP_05218800.1| hypothetical protein MaviaA2_21806 [Mycobacterium avium subsp. avium ATCC 25291] gi|118167588|gb|ABK68485.1| conserved hypothetical protein [Mycobacterium avium 104] gi|336460238|gb|EGO39141.1| hypothetical protein MAPs_42660 [Mycobacterium avium subsp. paratuberculosis S397] Length=128 Score = 205 bits (522), Expect = 2e-51, Method: Compositional matrix adjust. Identities = 100/122 (82%), Positives = 112/122 (92%), Gaps = 0/122 (0%) Query 1 VGDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSRRSRPAP 60 +G+ G FGFDPDEFDR+IREGSEGLR+ FER+ +F+++ GA TGWS++FEDLSRR RPAP Sbjct 1 MGEQGGFGFDPDEFDRMIREGSEGLREVFERVSKFVAAPGARTGWSSLFEDLSRRGRPAP 60 Query 61 ETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVL 120 ETAGEAGDGVWAIYTVDA GGA VEQV+ATELDALRANKDN DPKRKVRFLPYGIAVSVL Sbjct 61 ETAGEAGDGVWAIYTVDAGGGAHVEQVFATELDALRANKDNVDPKRKVRFLPYGIAVSVL 120 Query 121 DD 122 DD Sbjct 121 DD 122 >gi|342859209|ref|ZP_08715863.1| hypothetical protein MCOL_10033 [Mycobacterium colombiense CECT 3035] gi|342133450|gb|EGT86653.1| hypothetical protein MCOL_10033 [Mycobacterium colombiense CECT 3035] Length=137 Score = 204 bits (518), Expect = 5e-51, Method: Compositional matrix adjust. Identities = 99/126 (79%), Positives = 113/126 (90%), Gaps = 0/126 (0%) Query 1 VGDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSRRSRPAP 60 +G++ FGFDPDEFDR+IREGSEGLRD FER+ +F+++ G TGWSA+FEDLSRR R AP Sbjct 1 MGEHSGFGFDPDEFDRMIREGSEGLRDVFERVSKFVAAPGGRTGWSALFEDLSRRPRSAP 60 Query 61 ETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVL 120 ETAGEAG+GVWAIYTV+A+GGARVEQVYATELDALRANKDN DPKRKVRFLPYGIAV VL Sbjct 61 ETAGEAGEGVWAIYTVEANGGARVEQVYATELDALRANKDNVDPKRKVRFLPYGIAVGVL 120 Query 121 DDPVDE 126 DD +E Sbjct 121 DDDAEE 126 >gi|296167685|ref|ZP_06849931.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295897098|gb|EFG76711.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length=133 Score = 203 bits (516), Expect = 8e-51, Method: Compositional matrix adjust. Identities = 99/122 (82%), Positives = 110/122 (91%), Gaps = 0/122 (0%) Query 1 VGDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSRRSRPAP 60 +G+YG FGFDPDEFDR+IREGSEGLRD FER+ +F+++ GA GWS+ FEDLSRR RP P Sbjct 1 MGEYGAFGFDPDEFDRMIREGSEGLRDVFERVSKFVAAPGARPGWSSFFEDLSRRPRPEP 60 Query 61 ETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVL 120 ETAGEAGDGVWAIYTVD+ GGARVEQVYATELDALRANKDN DP RKVRFLPYGIAVSVL Sbjct 61 ETAGEAGDGVWAIYTVDSGGGARVEQVYATELDALRANKDNVDPMRKVRFLPYGIAVSVL 120 Query 121 DD 122 D+ Sbjct 121 DN 122 >gi|15828357|ref|NP_302620.1| hypothetical protein ML2518 [Mycobacterium leprae TN] gi|221230834|ref|YP_002504250.1| hypothetical protein MLBr_02518 [Mycobacterium leprae Br4923] gi|13093787|emb|CAC32049.1| conserved hypothetical protein [Mycobacterium leprae] gi|219933941|emb|CAR72617.1| conserved hypothetical protein [Mycobacterium leprae Br4923] Length=130 Score = 201 bits (512), Expect = 3e-50, Method: Compositional matrix adjust. Identities = 99/128 (78%), Positives = 108/128 (85%), Gaps = 0/128 (0%) Query 1 VGDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSRRSRPAP 60 +G+Y FGFDPD+FDR+I+EGSEGLRDAFERI RF+ G T WSAIFEDLSRR+RPA Sbjct 1 MGEYSAFGFDPDDFDRLIKEGSEGLRDAFERISRFVGGPGVRTAWSAIFEDLSRRARPAQ 60 Query 61 ETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVL 120 ETA EAGDGVWAIYTV DG ARVEQVYATELDALRANK+N DPKRKVRFLPYGIAVSVL Sbjct 61 ETADEAGDGVWAIYTVTGDGAARVEQVYATELDALRANKNNVDPKRKVRFLPYGIAVSVL 120 Query 121 DDPVDEAQ 128 D + Q Sbjct 121 DSHQESTQ 128 >gi|145224018|ref|YP_001134696.1| hypothetical protein Mflv_3432 [Mycobacterium gilvum PYR-GCK] gi|315444354|ref|YP_004077233.1| hypothetical protein Mspyr1_27690 [Mycobacterium sp. Spyr1] gi|145216504|gb|ABP45908.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK] gi|315262657|gb|ADT99398.1| hypothetical protein Mspyr1_27690 [Mycobacterium sp. Spyr1] Length=143 Score = 176 bits (447), Expect = 7e-43, Method: Compositional matrix adjust. Identities = 85/119 (72%), Positives = 101/119 (85%), Gaps = 2/119 (1%) Query 5 GPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSRRSRPA--PET 62 GPFGFDP++ DR++RE EGLRDA E +GRF+++ G TGWS +F++ SR +RP PET Sbjct 6 GPFGFDPEDLDRMVREAGEGLRDALEGLGRFVNTPGERTGWSVLFDEFSRGTRPRTRPET 65 Query 63 AGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVLD 121 GEAGDGVWAI+TVD +GGAR+EQVY TELDALRANKDNTDP R+VRFLPYGIAVSVLD Sbjct 66 TGEAGDGVWAIFTVDGEGGARIEQVYPTELDALRANKDNTDPTRRVRFLPYGIAVSVLD 124 >gi|167967752|ref|ZP_02550029.1| hypothetical protein MtubH3_06828 [Mycobacterium tuberculosis H37Ra] Length=89 Score = 165 bits (417), Expect = 2e-39, Method: Compositional matrix adjust. Identities = 83/87 (96%), Positives = 83/87 (96%), Gaps = 0/87 (0%) Query 42 GTGWSAIFEDLSRRSRPAPETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDN 101 G G IFEDLSRRSRPAPETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDN Sbjct 3 GNGLVGIFEDLSRRSRPAPETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDN 62 Query 102 TDPKRKVRFLPYGIAVSVLDDPVDEAQ 128 TDPKRKVRFLPYGIAVSVLDDPVDEAQ Sbjct 63 TDPKRKVRFLPYGIAVSVLDDPVDEAQ 89 >gi|118469083|ref|YP_887988.1| hypothetical protein MSMEG_3685 [Mycobacterium smegmatis str. MC2 155] gi|118170370|gb|ABK71266.1| conserved hypothetical protein [Mycobacterium smegmatis str. MC2 155] Length=143 Score = 160 bits (405), Expect = 6e-38, Method: Compositional matrix adjust. Identities = 79/119 (67%), Positives = 93/119 (79%), Gaps = 2/119 (1%) Query 5 GPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSRRSRP--APET 62 GPFGFDP++FDRV RE EGLRDA ++I R ++SG G + + ++ SR SRP PET Sbjct 5 GPFGFDPEDFDRVAREAGEGLRDALDQISRMFTTSGERAGLAGLLDEFSRFSRPRTEPET 64 Query 63 AGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVLD 121 GE GDGVWAIYTVD +GGA +EQV+ TELDALRAN NTDP R+VRFLPYGIAVSVLD Sbjct 65 TGEKGDGVWAIYTVDDEGGAHIEQVFPTELDALRANASNTDPSRRVRFLPYGIAVSVLD 123 >gi|120404165|ref|YP_953994.1| hypothetical protein Mvan_3186 [Mycobacterium vanbaalenii PYR-1] gi|119956983|gb|ABM13988.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1] Length=145 Score = 158 bits (400), Expect = 2e-37, Method: Compositional matrix adjust. Identities = 80/119 (68%), Positives = 91/119 (77%), Gaps = 13/119 (10%) Query 5 GPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSRRSRP--APET 62 GPFGFDP++F+RV+RE EGLR+A R GWS +F++ SR +RP PET Sbjct 6 GPFGFDPEDFERVVREAGEGLREALGR-----------AGWSTLFDEFSRGARPRTQPET 54 Query 63 AGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVLD 121 GEAGDGVWAIYT DA GGA +EQVY TELDALRANKDN DPKR+VRFLPYGIAVSVLD Sbjct 55 TGEAGDGVWAIYTTDATGGAHIEQVYPTELDALRANKDNVDPKRRVRFLPYGIAVSVLD 113 >gi|226359688|ref|YP_002777466.1| hypothetical protein ROP_02740 [Rhodococcus opacus B4] gi|226238173|dbj|BAH48521.1| hypothetical protein [Rhodococcus opacus B4] Length=134 Score = 158 bits (399), Expect = 3e-37, Method: Compositional matrix adjust. Identities = 79/123 (65%), Positives = 91/123 (74%), Gaps = 2/123 (1%) Query 1 VGDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSR--RSRP 58 +GD PFGFDPD+ DRVIRE E L+ +RI RFL + + W+ +F D +R R+ P Sbjct 1 MGDNRPFGFDPDDLDRVIREAGEELQGVKDRIVRFLEQADSQIPWTGVFADFARPPRAAP 60 Query 59 APETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVS 118 PET GE GDGVWAIYTVD DG ARVEQVYATELDALRA+K N DP R VRFLPYG+ VS Sbjct 61 KPETTGETGDGVWAIYTVDGDGVARVEQVYATELDALRAHKHNIDPHRSVRFLPYGVTVS 120 Query 119 VLD 121 VLD Sbjct 121 VLD 123 >gi|111017181|ref|YP_700153.1| hypothetical protein RHA1_ro00159 [Rhodococcus jostii RHA1] gi|110816711|gb|ABG91995.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length=134 Score = 157 bits (398), Expect = 4e-37, Method: Compositional matrix adjust. Identities = 81/129 (63%), Positives = 95/129 (74%), Gaps = 2/129 (1%) Query 1 VGDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSRRSRPA- 59 +GD PFGFDPD+ DRVIRE E L+ +RI RFL + + W+ +F D +R R A Sbjct 1 MGDNRPFGFDPDDLDRVIREAGEELQGVKDRIVRFLDQADSQIPWTGVFADFARSPRGAA 60 Query 60 -PETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVS 118 PETAGE GDGVWAIYT+D +G ARVEQVYATELDALRA+K NTDP R VRFLPYG+ VS Sbjct 61 KPETAGETGDGVWAIYTLDDEGVARVEQVYATELDALRAHKHNTDPHRSVRFLPYGVTVS 120 Query 119 VLDDPVDEA 127 VLD P + A Sbjct 121 VLDQPEEAA 129 >gi|108799855|ref|YP_640052.1| hypothetical protein Mmcs_2889 [Mycobacterium sp. MCS] gi|119868965|ref|YP_938917.1| hypothetical protein Mkms_2933 [Mycobacterium sp. KMS] gi|126435498|ref|YP_001071189.1| hypothetical protein Mjls_2919 [Mycobacterium sp. JLS] gi|108770274|gb|ABG08996.1| hypothetical protein Mmcs_2889 [Mycobacterium sp. MCS] gi|119695054|gb|ABL92127.1| conserved hypothetical protein [Mycobacterium sp. KMS] gi|126235298|gb|ABN98698.1| conserved hypothetical protein [Mycobacterium sp. JLS] Length=125 Score = 156 bits (394), Expect = 1e-36, Method: Compositional matrix adjust. Identities = 80/123 (66%), Positives = 92/123 (75%), Gaps = 9/123 (7%) Query 1 VGDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSRRSRP-- 58 + + GPFGFDPD+ DRV RE EG +GRF ++SG GW A+ ++L R RP Sbjct 1 MSNNGPFGFDPDDLDRVAREALEG-------VGRFFTTSGERAGWGALIDELGRFGRPRT 53 Query 59 APETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVS 118 PET GE GDGVWAIYTVDADG A +EQV+ TELDALRANK+NTDP RKVRFLPYGIAVS Sbjct 54 EPETTGETGDGVWAIYTVDADGDAHIEQVFPTELDALRANKNNTDPTRKVRFLPYGIAVS 113 Query 119 VLD 121 VLD Sbjct 114 VLD 116 >gi|333988957|ref|YP_004521571.1| hypothetical protein JDM601_0317 [Mycobacterium sp. JDM601] gi|333484925|gb|AEF34317.1| conserved hypothetical protein [Mycobacterium sp. JDM601] Length=127 Score = 145 bits (365), Expect = 3e-33, Method: Compositional matrix adjust. Identities = 76/126 (61%), Positives = 91/126 (73%), Gaps = 5/126 (3%) Query 5 GPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGT--GWSAIFEDLSRRSRPAPET 62 G FGFD ++ DR++RE EGLR AF+R F + G G GW I DL++ +R P T Sbjct 5 GAFGFDSEDLDRMMREAGEGLRAAFDR---FSGAFGPGNRAGWGGILADLAQAARSGPPT 61 Query 63 AGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVLDD 122 GE GDGVWAIYTV DG ARVEQVYATE+DALRA++ N DP RKVRFLPYGIAV VLDD Sbjct 62 TGETGDGVWAIYTVGDDGAARVEQVYATEIDALRASQRNPDPGRKVRFLPYGIAVGVLDD 121 Query 123 PVDEAQ 128 D+++ Sbjct 122 SADQSE 127 >gi|262203256|ref|YP_003274464.1| hypothetical protein Gbro_3369 [Gordonia bronchialis DSM 43247] gi|262086603|gb|ACY22571.1| hypothetical protein Gbro_3369 [Gordonia bronchialis DSM 43247] Length=145 Score = 142 bits (357), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 76/122 (63%), Positives = 91/122 (75%), Gaps = 5/122 (4%) Query 5 GPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFED--LSRRSRPAPET 62 GPF F P++FDR RE S+GLRD F ++ F +GAG WSA+F+D R R PET Sbjct 11 GPFNFGPEDFDRFAREASDGLRDVFGKL--FEGQAGAGA-WSALFDDGRTRTRRRAEPET 67 Query 63 AGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVLDD 122 G+ G GVWA++ +D DGGARVEQV+ATELDALRAN+ NTDP R+VRFLPYGIAVS LD Sbjct 68 TGDTGSGVWAVFVIDDDGGARVEQVFATELDALRANQANTDPARRVRFLPYGIAVSALDT 127 Query 123 PV 124 PV Sbjct 128 PV 129 >gi|343924790|ref|ZP_08764329.1| hypothetical protein GOALK_026_00610 [Gordonia alkanivorans NBRC 16433] gi|343765297|dbj|GAA11255.1| hypothetical protein GOALK_026_00610 [Gordonia alkanivorans NBRC 16433] Length=173 Score = 140 bits (353), Expect = 6e-32, Method: Compositional matrix adjust. Identities = 72/117 (62%), Positives = 87/117 (75%), Gaps = 6/117 (5%) Query 12 DEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSR---RSRPAPETAGEAGD 68 ++FDR RE EGLRD F G+ A +S F++ +R R++P PETAGE G Sbjct 50 EDFDRFAREAGEGLRDVF---GKLFEGQAAPGAFSMFFDEAARGRTRTQPRPETAGETGS 106 Query 69 GVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVLDDPVD 125 GVWA++ VD DGGARVEQVYATELDALRANK+NTDP+RKVRFLPYGIAVS LD+ +D Sbjct 107 GVWAVFVVDEDGGARVEQVYATELDALRANKNNTDPRRKVRFLPYGIAVSALDEALD 163 >gi|258654171|ref|YP_003203327.1| hypothetical protein Namu_4046 [Nakamurella multipartita DSM 44233] gi|258557396|gb|ACV80338.1| hypothetical protein Namu_4046 [Nakamurella multipartita DSM 44233] Length=134 Score = 135 bits (339), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 74/122 (61%), Positives = 84/122 (69%), Gaps = 1/122 (0%) Query 7 FGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAG-TGWSAIFEDLSRRSRPAPETAGE 65 FGFDPD+ DR + LR A + R L++SG G S+ SRRS P PET GE Sbjct 6 FGFDPDDLDRFFPGAGDQLRGALGQFARMLNASGEGRGAGSSAGFGGSRRSAPEPETTGE 65 Query 66 AGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVLDDPVD 125 G+GVW IYTVDADG ARVEQV+ATEL+ALRANK NTD R VRFLPYGI V+VLD PV Sbjct 66 TGEGVWMIYTVDADGDARVEQVFATELEALRANKHNTDSNRSVRFLPYGIPVTVLDSPVT 125 Query 126 EA 127 A Sbjct 126 SA 127 >gi|296140779|ref|YP_003648022.1| hypothetical protein Tpau_3092 [Tsukamurella paurometabola DSM 20162] gi|296028913|gb|ADG79683.1| conserved hypothetical protein [Tsukamurella paurometabola DSM 20162] Length=117 Score = 125 bits (313), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 67/122 (55%), Positives = 84/122 (69%), Gaps = 8/122 (6%) Query 2 GDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTG-WSAIFEDLSRRSRPAP 60 G+ PFGF P EFD + RE A + +GR + + G + ++F++ RR+R P Sbjct 3 GNENPFGFGPGEFDDIARE-------ARDMLGRIVGGAAGGRDVFGSLFDEAGRRTRREP 55 Query 61 ETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVL 120 ETAGEAGDGVWAI DGGARVEQV+ TE++ALRA++ NTD RKVRFLPYGIAVS L Sbjct 56 ETAGEAGDGVWAIIATADDGGARVEQVFKTEIEALRAHQHNTDATRKVRFLPYGIAVSAL 115 Query 121 DD 122 DD Sbjct 116 DD 117 >gi|54026723|ref|YP_120965.1| hypothetical protein nfa47490 [Nocardia farcinica IFM 10152] gi|54018231|dbj|BAD59601.1| hypothetical protein [Nocardia farcinica IFM 10152] Length=130 Score = 109 bits (273), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 59/126 (47%), Positives = 81/126 (65%), Gaps = 7/126 (5%) Query 5 GPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSS--SGAGTGWSAIFEDL--SRRSRPAP 60 GPFG DP++F+R +RE LRD + G +L + G +++ SR +RP Sbjct 5 GPFGIDPEDFERALREAGTELRDLLGKAGVYLDRVDHASVAGLTSLLAQFVPSRPARPPE 64 Query 61 ---ETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAV 117 E GE+G GVW IY +D G ARV+QV+ +EL+ALRA++DNTD +R+VRFLPYG+ Sbjct 65 PEGEITGESGSGVWVIYRLDDGGEARVDQVFPSELEALRAHRDNTDERRRVRFLPYGVPA 124 Query 118 SVLDDP 123 SVLD P Sbjct 125 SVLDAP 130 >gi|167966670|ref|ZP_02548947.1| hypothetical protein MtubH3_00798 [Mycobacterium tuberculosis H37Ra] Length=49 Score = 98.2 bits (243), Expect = 4e-19, Method: Compositional matrix adjust. Identities = 48/49 (98%), Positives = 49/49 (100%), Gaps = 0/49 (0%) Query 1 VGDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIF 49 +GDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIF Sbjct 1 MGDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIF 49 >gi|326381421|ref|ZP_08203115.1| hypothetical protein SCNU_00680 [Gordonia neofelifaecis NRRL B-59395] gi|326199668|gb|EGD56848.1| hypothetical protein SCNU_00680 [Gordonia neofelifaecis NRRL B-59395] Length=117 Score = 85.5 bits (210), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 53/113 (47%), Positives = 68/113 (61%), Gaps = 8/113 (7%) Query 9 FDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSR-RSRPAPETAGEAG 67 F PD+FDR RE EGLR + L + A W+ I +R R+ P P E Sbjct 7 FGPDDFDRFAREAGEGLRKLLRQA---LDNPRATATWADIAASATRPRTSPEPARDVEVV 63 Query 68 D---GVWAIYTVDADGGARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAV 117 D GVWAI D DGGARVE+V+++ELDALRAN+ NTDP R V+FL +G+ + Sbjct 64 DAEPGVWAIVR-DDDGGARVERVFSSELDALRANQTNTDPSRTVQFLEFGVEI 115 >gi|218670352|ref|ZP_03520023.1| 4-alpha-glucanotransferase protein [Rhizobium etli GR56] Length=353 Score = 35.8 bits (81), Expect = 2.3, Method: Compositional matrix adjust. Identities = 18/50 (36%), Positives = 27/50 (54%), Gaps = 2/50 (4%) Query 8 GFDPDEFDRVIREGSEGLRDA--FERIGRFLSSSGAGTGWSAIFEDLSRR 55 +DP +FD + +G + LR FE + +++ GAG GW DL RR Sbjct 52 AYDPRDFDAFVAQGGDSLRRHALFECLSLSMAARGAGAGWQKWPSDLQRR 101 >gi|323498828|ref|ZP_08103812.1| putative glycosyltransferase [Vibrio sinaloensis DSM 21326] gi|323316110|gb|EGA69137.1| putative glycosyltransferase [Vibrio sinaloensis DSM 21326] Length=372 Score = 35.0 bits (79), Expect = 3.7, Method: Compositional matrix adjust. Identities = 22/65 (34%), Positives = 30/65 (47%), Gaps = 3/65 (4%) Query 66 AGDGVWAIYTVDADGGARVEQVYATELDALRANKDNTD---PKRKVRFLPYGIAVSVLDD 122 GD A+ VD D A+ E D + A D P+RK + LP+G+ VS+ Sbjct 140 CGDDFSALAGVDHDVVAQHENKLVASADLILAASDKLCKKFPQRKTQLLPHGVDVSLFQT 199 Query 123 PVDEA 127 PV A Sbjct 200 PVSRA 204 >gi|115373877|ref|ZP_01461169.1| YvcK [Stigmatella aurantiaca DW4/3-1] gi|310820920|ref|YP_003953278.1| hypothetical protein STAUR_3663 [Stigmatella aurantiaca DW4/3-1] gi|115369143|gb|EAU68086.1| YvcK [Stigmatella aurantiaca DW4/3-1] gi|309393992|gb|ADO71451.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length=359 Score = 34.7 bits (78), Expect = 4.7, Method: Compositional matrix adjust. Identities = 21/61 (35%), Positives = 34/61 (56%), Gaps = 4/61 (6%) Query 22 SEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSRRSRPAPETAGEAGDGVWAIYTVDADGG 81 +E L+ +R R ++ G GTG + L+RR+ P P G+AG + A+ T+ DGG Sbjct 28 NELLQPQVDRPTRIVAM-GGGTGLPVVLRGLARRAMPKP---GDAGVDITAVVTMSDDGG 83 Query 82 A 82 + Sbjct 84 S 84 Lambda K H 0.316 0.138 0.413 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 128801298864 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40