BLASTP 2.2.25+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           15,229,318 sequences; 5,219,829,388 total letters

Query= Rv2203
Length=230

                                                                     Score    E
Sequences producing significant alignments:                         (Bits)  Value

gi|15609340|ref|NP_216719.1| hypothetical protein Rv2203 [Mycoba...  468     3e-130
gi|183983238|ref|YP_001851529.1| hypothetical protein MMAR_3247 ...  328     3e-88
gi|240168264|ref|ZP_04746923.1| hypothetical protein MkanA1_0305...  308     3e-82
gi|118618845|ref|YP_907177.1| hypothetical protein MUL_3560 [Myc...  306     2e-81
gi|342859914|ref|ZP_08716567.1| hypothetical protein MCOL_13583 ...  284     9e-75
gi|41408041|ref|NP_960877.1| hypothetical protein MAP1943 [Mycob...  275     3e-72
gi|15827395|ref|NP_301658.1| hypothetical protein ML0872 [Mycoba...  263     2e-68
gi|296166075|ref|ZP_06848521.1| conserved hypothetical protein [...  263     2e-68
gi|254823306|ref|ZP_05228307.1| hypothetical protein MintA_25484...  258     6e-67
gi|108800267|ref|YP_640464.1| hypothetical protein Mmcs_3301 [My...  177     1e-42
gi|126435890|ref|YP_001071581.1| hypothetical protein Mjls_3312 ...  172     3e-41
gi|118468790|ref|YP_888548.1| hypothetical protein MSMEG_4271 [M...  151     8e-35
gi|120404538|ref|YP_954367.1| hypothetical protein Mvan_3567 [My...  151     9e-35
gi|315443879|ref|YP_004076758.1| hypothetical protein Mspyr1_227...  150     1e-34
gi|145223533|ref|YP_001134211.1| hypothetical protein Mflv_2946 ...  149     2e-34
gi|169629045|ref|YP_001702694.1| hypothetical protein MAB_1958c ...  98.2    8e-19
gi|134113779|ref|XP_774474.1| hypothetical protein CNBG1200 [Cry...  36.2    4.2
gi|58269800|ref|XP_572056.1| hypothetical protein CNG03570 [Cryp...  35.8    5.2

>gi|15609340|ref|NP_216719.1| hypothetical protein Rv2203 [Mycobacterium tuberculosis H37Rv]
 gi|15841694|ref|NP_336731.1| hypothetical protein MT2259 [Mycobacterium tuberculosis CDC1551]
 gi|31793382|ref|NP_855875.1| hypothetical protein Mb2226 [Mycobacterium bovis AF2122/97]
 82 more sequence titles
Length=230

 Score = 468 bits (1204), Expect = 3e-130, Method: Compositional matrix adjust.
 Identities = 230/230 (100%), Positives = 230/230 (100%), Gaps = 0/230 (0%)

Query  1    MPGPHSPNPGVGTNGPAPYPEPSSHEPQALDYPHDLGAAEPAFAPGPADDAALPPAAYPG  60
            MPGPHSPNPGVGTNGPAPYPEPSSHEPQALDYPHDLGAAEPAFAPGPADDAALPPAAYPG
Sbjct  1    MPGPHSPNPGVGTNGPAPYPEPSSHEPQALDYPHDLGAAEPAFAPGPADDAALPPAAYPG  60

Query  61   VPPQVSYPKRRHKRLLIGIVVALALVSAMTAAIIYGVRTNGANTAGTFSEGPAKTAIQGY  120
            VPPQVSYPKRRHKRLLIGIVVALALVSAMTAAIIYGVRTNGANTAGTFSEGPAKTAIQGY
Sbjct  61   VPPQVSYPKRRHKRLLIGIVVALALVSAMTAAIIYGVRTNGANTAGTFSEGPAKTAIQGY  120

Query  121  LNALENRDVDTIVRNALCGIHDGVRDKRSDQALAKLSSDAFRKQFSQVEVTSIDKIVYWS  180
            LNALENRDVDTIVRNALCGIHDGVRDKRSDQALAKLSSDAFRKQFSQVEVTSIDKIVYWS
Sbjct  121  LNALENRDVDTIVRNALCGIHDGVRDKRSDQALAKLSSDAFRKQFSQVEVTSIDKIVYWS  180

Query  181  QYQAQVLFTMQVTPAAGGPPRGQVQGIAQLLFQRGQVLVCSYVLRTAGSY  230
            QYQAQVLFTMQVTPAAGGPPRGQVQGIAQLLFQRGQVLVCSYVLRTAGSY
Sbjct  181  QYQAQVLFTMQVTPAAGGPPRGQVQGIAQLLFQRGQVLVCSYVLRTAGSY  230

>gi|183983238|ref|YP_001851529.1| hypothetical protein MMAR_3247 [Mycobacterium marinum M]
 gi|183176564|gb|ACC41674.1| conserved protein [Mycobacterium marinum M]
Length=229

 Score = 328 bits (842), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 174/231 (76%), Positives = 192/231 (84%), Gaps = 3/231 (1%) Query 1 MPGPHSPNPGVGTNGPAPYPEPSSHEPQALDYPHDLGAAEPAFAPGPADDAALPPAAYPG 60 M GPH NP V T GP PYPE +EP L+YPHDLG AEPAF PAD P +YPG Sbjct 1 MAGPHPSNPAVSTEGPMPYPENGPNEP--LEYPHDLGGAEPAFGTPPADGPPRLPVSYPG 58 Query 61 VPPQVS-YPKRRHKRLLIGIVVALALVSAMTAAIIYGVRTNGANTAGTFSEGPAKTAIQG 119 +PP Y RR +RLLIG ++ALALV A+TA I+YGVRTNG N++GT SEG AKTAIQG Sbjct 59 LPPAPGGYRNRRPRRLLIGTLLALALVGALTATIVYGVRTNGTNSSGTLSEGAAKTAIQG 118 Query 120 YLNALENRDVDTIVRNALCGIHDGVRDKRSDQALAKLSSDAFRKQFSQVEVTSIDKIVYW 179 YLNALE+RDVD IVRNALCGI+DGV+DKRSDQALAKLSSDAFRKQFSQ +VTSIDKIVYW Sbjct 119 YLNALEHRDVDVIVRNALCGIYDGVKDKRSDQALAKLSSDAFRKQFSQADVTSIDKIVYW 178 Query 180 SQYQAQVLFTMQVTPAAGGPPRGQVQGIAQLLFQRGQVLVCSYVLRTAGSY 230 SQYQAQVLFTMQVTPA GGPP+GQVQGIAQLLFQRGQ++VCSYVLRTAG Y Sbjct 179 SQYQAQVLFTMQVTPATGGPPKGQVQGIAQLLFQRGQIMVCSYVLRTAGQY 229 >gi|240168264|ref|ZP_04746923.1| hypothetical protein MkanA1_03057 [Mycobacterium kansasii ATCC 12478] Length=222 Score = 308 bits (790), Expect = 3e-82, Method: Compositional matrix adjust. 
Identities = 168/233 (73%), Positives = 187/233 (81%), Gaps = 14/233 (6%) Query 1 MPGPHSPNPGVGTNGPA---PYPEPSSHEPQALDYPHDLGAAEPAFAPGPADDAALPPAA 57 M G H PNP VGT G PYP EP L+YPH+ + FAP PA A Sbjct 1 MAGQHPPNPAVGTEGAREVRPYPPTGPDEP--LEYPHEAADRQTGFAPAPA-------AG 51 Query 58 YPGVPPQVSYPKRRHKRLLIGIVVALALVSAMTAAIIYGVRTNGANTAGTFSEGPAKTAI 117 YPG+P SYP+RR KRLLIG ++ALAL+ A+TAAI+YGVRTNG NT GTF+E AKTAI Sbjct 52 YPGMPG--SYPRRRPKRLLIGTLLALALIGALTAAIVYGVRTNGTNTGGTFTEASAKTAI 109 Query 118 QGYLNALENRDVDTIVRNALCGIHDGVRDKRSDQALAKLSSDAFRKQFSQVEVTSIDKIV 177 QGYLNALE+RDVD IVRNALCGI+DGV+D+RSDQALAKLSSDAFRKQFSQ +VTSIDK+V Sbjct 110 QGYLNALEHRDVDVIVRNALCGIYDGVKDRRSDQALAKLSSDAFRKQFSQADVTSIDKVV 169 Query 178 YWSQYQAQVLFTMQVTPAAGGPPRGQVQGIAQLLFQRGQVLVCSYVLRTAGSY 230 YWSQYQAQVLFTMQVTPA GGPP+GQVQGIAQLLFQRGQ+LVCSYVLRTAG Y Sbjct 170 YWSQYQAQVLFTMQVTPATGGPPKGQVQGIAQLLFQRGQILVCSYVLRTAGQY 222 >gi|118618845|ref|YP_907177.1| hypothetical protein MUL_3560 [Mycobacterium ulcerans Agy99] gi|118570955|gb|ABL05706.1| conserved protein [Mycobacterium ulcerans Agy99] Length=213 Score = 306 bits (783), Expect = 2e-81, Method: Compositional matrix adjust. 
Identities = 163/214 (77%), Positives = 181/214 (85%), Gaps = 3/214 (1%) Query 18 PYPEPSSHEPQALDYPHDLGAAEPAFAPGPADDAALPPAAYPGVPPQ-VSYPKRRHKRLL 76 PYPE +EP L+YPHDLG AEPAF PAD P +YPG+PP Y RR KRLL Sbjct 2 PYPENGPNEP--LEYPHDLGGAEPAFGVPPADGPPRLPVSYPGLPPAPGGYRNRRPKRLL 59 Query 77 IGIVVALALVSAMTAAIIYGVRTNGANTAGTFSEGPAKTAIQGYLNALENRDVDTIVRNA 136 IG ++ALALV A+TA I+YGVRTNG N++GT SEG AKTAIQGYLNALE+RDVD IVRNA Sbjct 60 IGTLLALALVGALTATIVYGVRTNGTNSSGTLSEGAAKTAIQGYLNALEHRDVDVIVRNA 119 Query 137 LCGIHDGVRDKRSDQALAKLSSDAFRKQFSQVEVTSIDKIVYWSQYQAQVLFTMQVTPAA 196 LCGI++GV+DKRSDQALAKLSSDAFRKQFSQ +VTSID+IVYWSQYQAQVLFTMQVTPA Sbjct 120 LCGIYNGVKDKRSDQALAKLSSDAFRKQFSQADVTSIDEIVYWSQYQAQVLFTMQVTPAT 179 Query 197 GGPPRGQVQGIAQLLFQRGQVLVCSYVLRTAGSY 230 GGPP GQVQGIAQLLFQRGQ++VCSYVLRTAG Y Sbjct 180 GGPPNGQVQGIAQLLFQRGQIMVCSYVLRTAGQY 213 >gi|342859914|ref|ZP_08716567.1| hypothetical protein MCOL_13583 [Mycobacterium colombiense CECT 3035] gi|342133046|gb|EGT86266.1| hypothetical protein MCOL_13583 [Mycobacterium colombiense CECT 3035] Length=225 Score = 284 bits (726), Expect = 9e-75, Method: Compositional matrix adjust. 
Identities = 146/230 (64%), Positives = 173/230 (76%), Gaps = 5/230 (2%) Query 1 MPGPHSPNPGVGTNGPAPYPEPSSHEPQALDYPHDLGAAEPAFAPGPADDAALPPAAYPG 60 M GPHSPN VG GP P PS P L++P D + P FA A A Sbjct 1 MAGPHSPNHTVGGEGPTP---PSESPP--LEFPDDPNSGGPGFASAAQAGPATANYAGQP 55 Query 61 VPPQVSYPKRRHKRLLIGIVVALALVSAMTAAIIYGVRTNGANTAGTFSEGPAKTAIQGY 120 P P+R + L++G+ +A+ALV+ +T AI+YGVRTNGANT TFSEG A+TAIQGY Sbjct 56 PAPVPYPPQRSKRGLIVGVALAIALVAVLTVAIVYGVRTNGANTGATFSEGAARTAIQGY 115 Query 121 LNALENRDVDTIVRNALCGIHDGVRDKRSDQALAKLSSDAFRKQFSQVEVTSIDKIVYWS 180 L+ALE+RD+D I RNALCG++DGV+DKRSDQALAKLSSDAFRKQFS+VE+TSIDKIVY S Sbjct 116 LDALEHRDIDEIARNALCGLYDGVQDKRSDQALAKLSSDAFRKQFSEVELTSIDKIVYLS 175 Query 181 QYQAQVLFTMQVTPAAGGPPRGQVQGIAQLLFQRGQVLVCSYVLRTAGSY 230 QYQAQ LFTM+V+P +GGP GQVQGIAQLLFQRGQ++VCSYVLRT GSY Sbjct 176 QYQAQALFTMRVSPVSGGPMHGQVQGIAQLLFQRGQIMVCSYVLRTGGSY 225 >gi|41408041|ref|NP_960877.1| hypothetical protein MAP1943 [Mycobacterium avium subsp. paratuberculosis K-10] gi|118466864|ref|YP_881492.1| hypothetical protein MAV_2288 [Mycobacterium avium 104] gi|254774960|ref|ZP_05216476.1| hypothetical protein MaviaA2_09849 [Mycobacterium avium subsp. avium ATCC 25291] gi|41396396|gb|AAS04260.1| hypothetical protein MAP_1943 [Mycobacterium avium subsp. paratuberculosis K-10] gi|118168151|gb|ABK69048.1| conserved hypothetical protein [Mycobacterium avium 104] gi|336461927|gb|EGO40780.1| hypothetical protein MAPs_25980 [Mycobacterium avium subsp. paratuberculosis S397] Length=225 Score = 275 bits (704), Expect = 3e-72, Method: Compositional matrix adjust. 
Identities = 148/230 (65%), Positives = 176/230 (77%), Gaps = 5/230 (2%) Query 1 MPGPHSPNPGVGTNGPAPYPEPSSHEPQALDYPHDLGAAEPAFAPGPADDAALPPAAYPG 60 M GPHSPN VG GP P PS +P L++P A + +A P A P Sbjct 1 MAGPHSPNHTVGGQGPTP---PSESQP--LEFPDHPNAGDTGYAAAPQAPPGSANYAGPP 55 Query 61 VPPQVSYPKRRHKRLLIGIVVALALVSAMTAAIIYGVRTNGANTAGTFSEGPAKTAIQGY 120 P P+R +RL++G+ +A+ALV+ MT AI+YGVRTNGANT TFSEG AKTAIQGY Sbjct 56 PAPAPYPPRRSKRRLIVGLALAVALVAVMTVAIVYGVRTNGANTGATFSEGAAKTAIQGY 115 Query 121 LNALENRDVDTIVRNALCGIHDGVRDKRSDQALAKLSSDAFRKQFSQVEVTSIDKIVYWS 180 L+ALE+RD+D I RNALCG++DGV+DKRSDQALAKLSSDAFRKQFS+V+VTSIDKIVY S Sbjct 116 LDALEHRDIDEIARNALCGLYDGVQDKRSDQALAKLSSDAFRKQFSEVQVTSIDKIVYLS 175 Query 181 QYQAQVLFTMQVTPAAGGPPRGQVQGIAQLLFQRGQVLVCSYVLRTAGSY 230 QYQAQ LF+M+V+P +GGP RGQVQGIAQLLFQRGQ++VCSYVLRT GSY Sbjct 176 QYQAQALFSMRVSPVSGGPARGQVQGIAQLLFQRGQIMVCSYVLRTGGSY 225 >gi|15827395|ref|NP_301658.1| hypothetical protein ML0872 [Mycobacterium leprae TN] gi|221229872|ref|YP_002503288.1| hypothetical protein MLBr_00872 [Mycobacterium leprae Br4923] gi|13092945|emb|CAC31253.1| putative membrane protein [Mycobacterium leprae] gi|219932979|emb|CAR70967.1| putative membrane protein [Mycobacterium leprae Br4923] Length=171 Score = 263 bits (672), Expect = 2e-68, Method: Compositional matrix adjust. 
Identities = 123/170 (73%), Positives = 148/170 (88%), Gaps = 0/170 (0%) Query 61 VPPQVSYPKRRHKRLLIGIVVALALVSAMTAAIIYGVRTNGANTAGTFSEGPAKTAIQGY 120 +PP VSYP+RR KRL+I ++VA+ALV+AMTA IIYGVRTNG+ T GTFSE AKTAI+ Y Sbjct 2 LPPAVSYPRRRSKRLIISVLVAIALVAAMTAVIIYGVRTNGSKTGGTFSEVTAKTAIEDY 61 Query 121 LNALENRDVDTIVRNALCGIHDGVRDKRSDQALAKLSSDAFRKQFSQVEVTSIDKIVYWS 180 L ALE +++TI RNALCG++D VRD+R DQALA+LSSDAFRKQFSQVE+TSID+IVYWS Sbjct 62 LKALEQSNINTIARNALCGMYDSVRDQRPDQALAQLSSDAFRKQFSQVELTSIDQIVYWS 121 Query 181 QYQAQVLFTMQVTPAAGGPPRGQVQGIAQLLFQRGQVLVCSYVLRTAGSY 230 YQAQVLFTM+ +PA GGP R Q+QGIAQLL++R QVLVCSY+LRTA S+ Sbjct 122 PYQAQVLFTMRTSPATGGPKRRQIQGIAQLLYRRNQVLVCSYMLRTADSH 171 >gi|296166075|ref|ZP_06848521.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295898570|gb|EFG78130.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length=218 Score = 263 bits (672), Expect = 2e-68, Method: Compositional matrix adjust. Identities = 152/231 (66%), Positives = 181/231 (79%), Gaps = 14/231 (6%) Query 1 MPGPHSPNPGVGTNGPAPYPEPSSHEPQALDYPHDLGAAEPAFAPGPADDAALPPAAYPG 60 M GPHSPN VG GP+ P+ E Q L++P D A + GPA+ A PP Sbjct 1 MAGPHSPNHTVGGGGPSGPQPPA--EAQPLEFPTDPRAVDV----GPANYAGQPP----- 49 Query 61 VPPQVSYPKRRHKR-LLIGIVVALALVSAMTAAIIYGVRTNGANTAGTFSEGPAKTAIQG 119 P + YP+RR KR L++GI++A+ALV+A+T AI+YGVRTNGANT TFSEG AKTAIQG Sbjct 50 --PSMPYPQRRSKRRLVVGILLAVALVAALTVAIVYGVRTNGANTGATFSEGAAKTAIQG 107 Query 120 YLNALENRDVDTIVRNALCGIHDGVRDKRSDQALAKLSSDAFRKQFSQVEVTSIDKIVYW 179 YL+AL++RD+ I RNALCG++D VRDKRSDQALAKLSSDAFRKQFSQVEVTS+DKIVY Sbjct 108 YLDALDHRDIPEIERNALCGMYDAVRDKRSDQALAKLSSDAFRKQFSQVEVTSVDKIVYL 167 Query 180 SQYQAQVLFTMQVTPAAGGPPRGQVQGIAQLLFQRGQVLVCSYVLRTAGSY 230 SQYQAQ LFTM+ PAAGGP RG++QGIAQLLFQRG+++VCSYVLRT GSY Sbjct 168 SQYQAQALFTMKAAPAAGGPLRGELQGIAQLLFQRGEIMVCSYVLRTGGSY 218 >gi|254823306|ref|ZP_05228307.1| hypothetical protein MintA_25484 [Mycobacterium intracellulare ATCC 13950] Length=152 Score = 258 bits (658), Expect = 6e-67, Method: 
Compositional matrix adjust. Identities = 119/152 (79%), Positives = 138/152 (91%), Gaps = 0/152 (0%) Query 79 IVVALALVSAMTAAIIYGVRTNGANTAGTFSEGPAKTAIQGYLNALENRDVDTIVRNALC 138 + +A+ALV+ MT AI+YGVRTNGANT TFSEG AKTAIQGYL+ALE+RD++ I RNALC Sbjct 1 MALAIALVAVMTVAIVYGVRTNGANTGTTFSEGAAKTAIQGYLDALEHRDINEIARNALC 60 Query 139 GIHDGVRDKRSDQALAKLSSDAFRKQFSQVEVTSIDKIVYWSQYQAQVLFTMQVTPAAGG 198 G++D V+DKRSDQALAKLSSDAFRKQFS+VE+TSIDKIVY SQYQAQ LFTM+V+P +GG Sbjct 61 GLYDAVQDKRSDQALAKLSSDAFRKQFSEVEITSIDKIVYLSQYQAQALFTMRVSPVSGG 120 Query 199 PPRGQVQGIAQLLFQRGQVLVCSYVLRTAGSY 230 P RGQVQGIAQLLFQRGQ++VCSYVLRT GSY Sbjct 121 PMRGQVQGIAQLLFQRGQIMVCSYVLRTGGSY 152 >gi|108800267|ref|YP_640464.1| hypothetical protein Mmcs_3301 [Mycobacterium sp. MCS] gi|119869395|ref|YP_939347.1| hypothetical protein Mkms_3363 [Mycobacterium sp. KMS] gi|108770686|gb|ABG09408.1| putative conserved membrane protein [Mycobacterium sp. MCS] gi|119695484|gb|ABL92557.1| putative conserved membrane protein [Mycobacterium sp. KMS] Length=212 Score = 177 bits (448), Expect = 1e-42, Method: Compositional matrix adjust. Identities = 107/205 (53%), Positives = 134/205 (66%), Gaps = 17/205 (8%) Query 33 PHDLGAAEPAFAPGPADDAALPPAAYPG-VPPQVSYPKRRHKRLLIGIVVALALVSAMTA 91 PH GA +P PGP P+AYPG +PP V YPKR KRLL V +V+ + Sbjct 18 PHQ-GAPQPY--PGPV------PSAYPGMLPPPVQYPKRGRKRLLW-AVAVAVVVATLVG 67 Query 92 AIIYGVRTNGA-NTAGTFSEGPAKTAIQGYLNALENRDVDTIVRNALCGIHDGVRDKRSD 150 +I+ R +GA AG ++ A+TAIQGYL+AL N D + I R+ALCG+ D V++KRSD Sbjct 68 VVIFATRDDGAPQAAGPLTDASARTAIQGYLDALSNGDDEEIARHALCGLFDAVKEKRSD 127 Query 151 QALAKLSSDAFRKQFSQVEVTSIDKIVYWSQYQAQVLFTMQVTPAAGG-----PPRGQVQ 205 ALA L+ DAFRKQFS+ EVTSID IV WS +QAQVLFTM+V PA G PP + Q Sbjct 128 LALAGLAGDAFRKQFSRAEVTSIDTIVPWSSHQAQVLFTMRVAPARGSARGQQPPNEEEQ 187 Query 206 GIAQLLFQRGQVLVCSYVLRTAGSY 230 +AQLL + +VLVCSY+LRT Y Sbjct 188 AVAQLLIRDNEVLVCSYLLRTGSQY 212 >gi|126435890|ref|YP_001071581.1| hypothetical protein Mjls_3312 [Mycobacterium sp. 
JLS] gi|126235690|gb|ABN99090.1| putative conserved membrane protein [Mycobacterium sp. JLS] Length=212 Score = 172 bits (437), Expect = 3e-41, Method: Compositional matrix adjust. Identities = 108/205 (53%), Positives = 135/205 (66%), Gaps = 17/205 (8%) Query 33 PHDLGAAEPAFAPGPADDAALPPAAYPG-VPPQVSYPKRRHKRLLIGIVVALALVSAMTA 91 PH GA +P PGP P+AYPG +PP V YPKR KRLL VV +V+ + Sbjct 18 PHQ-GAPQPY--PGPV------PSAYPGMLPPPVQYPKRGRKRLLW-AVVVAVVVATLVG 67 Query 92 AIIYGVRTNGA-NTAGTFSEGPAKTAIQGYLNALENRDVDTIVRNALCGIHDGVRDKRSD 150 +I+ R +GA AG ++ A+TAIQGYL+AL N D + I R+ALCG+ D V++KRSD Sbjct 68 VVIFATRDDGAPQAAGPLTDASARTAIQGYLDALSNGDDEEIARHALCGLFDAVKEKRSD 127 Query 151 QALAKLSSDAFRKQFSQVEVTSIDKIVYWSQYQAQVLFTMQVTPAAGG-----PPRGQVQ 205 ALA L+ DAFRKQFS+ EVTSID IV WS +QAQVLFTM+V PA G PP + Q Sbjct 128 LALAGLAGDAFRKQFSRAEVTSIDTIVPWSSHQAQVLFTMRVAPARGSARGQQPPNEEEQ 187 Query 206 GIAQLLFQRGQVLVCSYVLRTAGSY 230 +AQLL + +VLVCSY+LRT Y Sbjct 188 AVAQLLIRDNEVLVCSYLLRTGSQY 212 >gi|118468790|ref|YP_888548.1| hypothetical protein MSMEG_4271 [Mycobacterium smegmatis str. MC2 155] gi|118170077|gb|ABK70973.1| conserved hypothetical protein [Mycobacterium smegmatis str. MC2 155] Length=217 Score = 151 bits (381), Expect = 8e-35, Method: Compositional matrix adjust. 
Identities = 91/186 (49%), Positives = 125/186 (68%), Gaps = 11/186 (5%) Query 55 PAAYPG-VPPQVSYPKRRHKRLLIGIVVALALVSAMTAAIIYGVRTNGANTAGTFSEGPA 113 P+AYPG +PP V YPKRR ++ V+A+A+++ + A++ R++G+ + G +E A Sbjct 33 PSAYPGMLPPPVPYPKRRRWPKVLAAVLAVAVLAGVVTAVVQVTRSSGSES-GVVTETQA 91 Query 114 KTAIQGYLNALENRDVDTIVRNALCGIHDGVRDKRSDQALAKLSSDAFRKQFSQVEVTSI 173 + AIQ YL+AL + D +T+ R+ +CG+ D VRD+++D A+A L+ D FRKQF VEVTSI Sbjct 92 RDAIQEYLSALIDADDETVARHTMCGLFDAVRDRKADLAVASLAGDTFRKQFGNVEVTSI 151 Query 174 DKIVYWSQYQAQVLFTMQVTPAAGG-----PPRGQVQGIAQLLFQR----GQVLVCSYVL 224 DKIV WS QAQVLFTM+V PA PP + QG+AQLL + VLVCSYVL Sbjct 152 DKIVPWSTTQAQVLFTMRVAPARSSSRGQRPPAEEQQGVAQLLVDKTDGGDDVLVCSYVL 211 Query 225 RTAGSY 230 RT G Y Sbjct 212 RTGGQY 217 >gi|120404538|ref|YP_954367.1| hypothetical protein Mvan_3567 [Mycobacterium vanbaalenii PYR-1] gi|119957356|gb|ABM14361.1| putative conserved membrane protein [Mycobacterium vanbaalenii PYR-1] Length=213 Score = 151 bits (381), Expect = 9e-35, Method: Compositional matrix adjust. Identities = 102/220 (47%), Positives = 126/220 (58%), Gaps = 31/220 (14%) Query 18 PYPEPSSHEPQALDYPHDLGAAEPAFAPGPADDAALPPAAYPG-VPPQVSYPKRRHKRLL 76 YPEP + PQ YP LG A YPG +PP V YPKRR ++ Sbjct 18 MYPEP--YPPQT--YPP-LG------------------ATYPGTLPPPVQYPKRRRPWII 54 Query 77 IGIVVALALVSAMTAAIIY--GVRTNGANTAGTFSEGPAKTAIQGYLNALENRDVDTIVR 134 + L A + G R A +G +E A+TAIQ YL+AL N DV+T+ R Sbjct 55 AVVAAVAVLAVVGVVAAVALTGSRDEAA-PSGALTEASAQTAIQDYLDALTNADVETVAR 113 Query 135 NALCGIHDGVRDKRSDQALAKLSSDAFRKQFSQVEVTSIDKIVYWSQYQAQVLFTMQVTP 194 + LCG++D V ++RSD ALA LSSDAFRKQ+ EVTSIDK+V S QAQVLFTM+VTP Sbjct 114 HTLCGLYDAVNERRSDLALANLSSDAFRKQYESAEVTSIDKMVLSSPSQAQVLFTMRVTP 173 Query 195 AAGG----PPRGQVQGIAQLLFQRGQVLVCSYVLRTAGSY 230 A G P + Q +AQLL +VLVCSY+ RTAG Y Sbjct 174 AGGSSRNQPQQADEQAVAQLLSIDDEVLVCSYLPRTAGQY 213 >gi|315443879|ref|YP_004076758.1| hypothetical protein Mspyr1_22780 [Mycobacterium sp. Spyr1] gi|315262182|gb|ADT98923.1| hypothetical protein Mspyr1_22780 [Mycobacterium sp. 
Spyr1] Length=206 Score = 150 bits (380), Expect = 1e-34, Method: Compositional matrix adjust. Identities = 100/223 (45%), Positives = 129/223 (58%), Gaps = 27/223 (12%) Query 16 PAPYPEPSSHEPQALDYPHDLGAAEPAFAPGPADDAALPPAAYPG-VPPQVSYPKRRHKR 74 P PYP+ S+ P A YP PGP YPG +PP V YPKRR + Sbjct 3 PEPYPQ-QSYPPYAAPYPDG--------GPGPV---------YPGALPPPVQYPKRRRRP 44 Query 75 LLIGIVVALALVSA---MTAAIIYGVRTNGANTAGTFSEGPAKTAIQGYLNALENRDVDT 131 +I V A+V+ +T + R + T GT +E A+ AIQ YL+AL + D + Sbjct 45 WIIVAVALAAVVAVGAVITGVTLAAGREDQGAT-GTLTETSARAAIQDYLDALTDGDDER 103 Query 132 IVRNALCGIHDGVRDKRSDQALAKLSSDAFRKQFSQVEVTSIDKIVYWSQYQAQVLFTMQ 191 + R+ LCG+ D V+++RSD ALA LSSDAFRKQ+ EVTSIDK+V S QAQVLFTM+ Sbjct 104 VARHTLCGLFDAVKERRSDLALANLSSDAFRKQYDSAEVTSIDKMVRSSPTQAQVLFTMR 163 Query 192 VTPAAG----GPPRGQVQGIAQLLFQRGQVLVCSYVLRTAGSY 230 V PA+G P Q +AQ+L +VLVCSY+ RTAG Y Sbjct 164 VVPASGSSRNAPQEADEQAVAQVLSIDDEVLVCSYLPRTAGQY 206 >gi|145223533|ref|YP_001134211.1| hypothetical protein Mflv_2946 [Mycobacterium gilvum PYR-GCK] gi|145216019|gb|ABP45423.1| hypothetical protein Mflv_2946 [Mycobacterium gilvum PYR-GCK] Length=223 Score = 149 bits (377), Expect = 2e-34, Method: Compositional matrix adjust. 
Identities = 100/223 (45%), Positives = 128/223 (58%), Gaps = 27/223 (12%) Query 16 PAPYPEPSSHEPQALDYPHDLGAAEPAFAPGPADDAALPPAAYPG-VPPQVSYPKRRHKR 74 P PYP+ S+ P A YP PGP YPG +PP V YPKRR Sbjct 20 PEPYPQ-QSYPPYAAPYPDG--------GPGPV---------YPGALPPPVQYPKRRRSP 61 Query 75 LLIGIVVALALVSA---MTAAIIYGVRTNGANTAGTFSEGPAKTAIQGYLNALENRDVDT 131 +I V A+V+ +T + R + T GT +E A+ AIQ YL+AL + D + Sbjct 62 WIIVAVALAAVVAVGAVITGVTLAAGREDQGAT-GTLTETSARAAIQDYLDALTDGDDER 120 Query 132 IVRNALCGIHDGVRDKRSDQALAKLSSDAFRKQFSQVEVTSIDKIVYWSQYQAQVLFTMQ 191 + R+ LCG+ D V+++RSD ALA LSSDAFRKQ+ EVTSIDK+V S QAQVLFTM+ Sbjct 121 VARHTLCGLFDAVKERRSDLALANLSSDAFRKQYDSAEVTSIDKMVRSSPTQAQVLFTMR 180 Query 192 VTPAAG----GPPRGQVQGIAQLLFQRGQVLVCSYVLRTAGSY 230 V PA+G P Q +AQ+L +VLVCSY+ RTAG Y Sbjct 181 VVPASGSSRNAPQEADEQAVAQVLSIDDEVLVCSYLPRTAGQY 223 >gi|169629045|ref|YP_001702694.1| hypothetical protein MAB_1958c [Mycobacterium abscessus ATCC 19977] gi|169241012|emb|CAM62040.1| Conserved hypothetical protein [Mycobacterium abscessus] Length=180 Score = 98.2 bits (243), Expect = 8e-19, Method: Compositional matrix adjust. Identities = 51/122 (42%), Positives = 69/122 (57%), Gaps = 5/122 (4%) Query 109 SEGPAKTAIQGYLNALENRDVDTIVRNALCGIHDGVRDKRSDQALAKLSSDAFRKQFSQV 168 SE + IQ YL+A+ D T+ RNA CG++D VRDK +D + + ++ F F Sbjct 64 SEAKIEQTIQAYLDAMARLDTVTLARNAGCGLYDAVRDKDTDDTIVRANAQQFVATFGTA 123 Query 169 EVTSIDKIVYWSQYQAQVLFTMQVTPAAGGPPRGQVQGIAQLLFQRGQVLVCSYVLRTAG 228 V SIDKIVY+SQYQ +VLFT P QG A+LL G++ VCS +R A Sbjct 124 TVKSIDKIVYFSQYQLKVLFTATSQKRKDAP-----QGQAELLLNEGKIYVCSAYMRGAN 178 Query 229 SY 230 +Y Sbjct 179 AY 180 >gi|134113779|ref|XP_774474.1| hypothetical protein CNBG1200 [Cryptococcus neoformans var. neoformans B-3501A] gi|50257112|gb|EAL19827.1| hypothetical protein CNBG1200 [Cryptococcus neoformans var. neoformans B-3501A] Length=912 Score = 36.2 bits (82), Expect = 4.2, Method: Compositional matrix adjust. 
 Identities = 40/173 (24%), Positives = 63/173 (37%), Gaps = 38/173 (21%)

Query  16   PAPYPEPSSHEPQALDYPHDLGAAEPAFAPGPADDAALPPAAYPGVPPQVSY-----PKR  70
            P+PY   +SH P    YP++    E + AP PA   + P A+ P  PP++       P
Sbjct  389  PSPYRSHNSHTPHR--YPNEHHLHENSHAPSPAHTQSFPSASLPYAPPELLRAPPLGPSL  446

Query  71   RHKRLLIGIVVALALVSAMTAAIIYGVRTNGANTAGTFSEGPAKTAIQGYLNALENRDVD  130
                  +GIV+   L   +     Y  R       GT+ E P               ++
Sbjct  447  AQDIWAVGIVLHALLTGKLPFFDPYDPRLQMKILRGTWEEPP---------------NLG  491

Query  131  TIVRNALCGIHDGVRDKRSDQALAKLSSDAFRKQFSQVEVTSIDKIVYWSQYQ  183
                  L G  DG R++R                ++ V+V   D ++ W + Q
Sbjct  492  KEWLECLKGCLDGDRERR----------------WTIVKVKQSDAVLGWKEVQ  528

>gi|58269800|ref|XP_572056.1| hypothetical protein CNG03570 [Cryptococcus neoformans var. neoformans JEC21]
 gi|57228292|gb|AAW44749.1| hypothetical protein CNG03570 [Cryptococcus neoformans var. neoformans JEC21]
Length=912

 Score = 35.8 bits (81), Expect = 5.2, Method: Compositional matrix adjust.
 Identities = 40/173 (24%), Positives = 63/173 (37%), Gaps = 38/173 (21%)

Query  16   PAPYPEPSSHEPQALDYPHDLGAAEPAFAPGPADDAALPPAAYPGVPPQVSY-----PKR  70
            P+PY   +SH P    YP++    E + AP PA   + P A+ P  PP++       P
Sbjct  389  PSPYRSHNSHTPHR--YPNEHHLHENSHAPSPAHTQSFPSASLPYAPPELLRAPPLGPSL  446

Query  71   RHKRLLIGIVVALALVSAMTAAIIYGVRTNGANTAGTFSEGPAKTAIQGYLNALENRDVD  130
                  +GIV+   L   +     Y  R       GT+ E P               ++
Sbjct  447  AQDIWAVGIVLHALLTGRLPFFDPYDPRLQMKILRGTWEEPP---------------NLG  491

Query  131  TIVRNALCGIHDGVRDKRSDQALAKLSSDAFRKQFSQVEVTSIDKIVYWSQYQ  183
                  L G  DG R++R                ++ V+V   D ++ W + Q
Sbjct  492  KEWLECLKGCLDGDRERR----------------WTIVKVKQSDAVLGWKEVQ  528

Lambda      K        H
   0.317    0.134    0.402

Gapped
Lambda      K        H
   0.267    0.0410   0.140

Effective search space used: 300567788510

  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
    Posted date:  Sep 5, 2011  4:36 AM
  Number of letters in database: 5,219,829,388
  Number of sequences in database:  15,229,318

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
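
The bit scores, E-values, and effective search space reported above can be loosely cross-checked against each other. The sketch below is not part of the BLAST output. It assumes the standard relation E ≈ N · 2^(−S'), where S' is the bit score and N is the effective search space, and it reads the per-hit statistics line with a regular expression. The names `expected_hits`, `parse_score_line`, `SCORE_LINE`, and `EFFECTIVE_SEARCH_SPACE` are hypothetical helpers introduced only for illustration. Because the reported bit scores are rounded and composition-based adjustment is in use, the estimate should be expected to agree with the reported E-value only to within a small factor.

```python
import re

# Effective search space, copied from the report footer.
EFFECTIVE_SEARCH_SPACE = 300567788510

# Matches the per-hit statistics line, e.g.
# "Score = 328 bits (842), Expect = 3e-88, Method: ..."
SCORE_LINE = re.compile(
    r"Score = ([\d.]+) bits \((\d+)\),\s+Expect = ([\d.eE+-]+)"
)


def expected_hits(bit_score, search_space=EFFECTIVE_SEARCH_SPACE):
    """Approximate E-value from a bit score, assuming E = N * 2**(-S')."""
    return search_space * 2.0 ** (-bit_score)


def parse_score_line(text):
    """Return (bit_score, raw_score, reported_evalue), or None if absent."""
    match = SCORE_LINE.search(text)
    if match is None:
        return None
    return float(match.group(1)), int(match.group(2)), float(match.group(3))


if __name__ == "__main__":
    line = ("Score = 328 bits (842), Expect = 3e-88, "
            "Method: Compositional matrix adjust.")
    bits, raw, reported = parse_score_line(line)
    estimate = expected_hits(bits)
    # Print the reported and estimated values side by side for comparison.
    print(f"bits={bits} raw={raw} reported={reported:g} estimated={estimate:.1e}")
```

Running the same comparison on the two Cryptococcus hits (36.2 and 35.8 bits) gives estimates of order 1, in line with their reported E-values of 4.2 and 5.2, which is why those hits are not considered significant.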