BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv3643 Length=63 Score E Sequences producing significant alignments: (Bits) Value gi|15610779|ref|NP_218160.1| hypothetical protein Rv3643 [Mycoba... 128 4e-28 gi|340628605|ref|YP_004747057.1| hypothetical protein MCAN_36531... 125 3e-27 gi|15843255|ref|NP_338292.1| hypothetical protein MT3746 [Mycoba... 69.7 1e-10 gi|118619389|ref|YP_907721.1| hypothetical protein MUL_4218 [Myc... 68.2 4e-10 gi|255325881|ref|ZP_05366973.1| phage integrase family protein [... 39.7 0.17 gi|260904973|ref|ZP_05913295.1| phage integrase family protein [... 38.5 0.37 gi|311742638|ref|ZP_07716447.1| integrase [Aeromicrobium marinum... 38.5 0.37 gi|269956368|ref|YP_003326157.1| integrase family protein [Xylan... 37.4 0.74 gi|269955602|ref|YP_003325391.1| integrase family protein [Xylan... 37.0 0.90 gi|260904954|ref|ZP_05913276.1| phage integrase family protein [... 37.0 0.99 gi|116669908|ref|YP_830841.1| phage integrase family protein [Ar... 35.0 3.3 gi|306818603|ref|ZP_07452326.1| possible integrase [Mobiluncus m... 33.9 7.3 gi|269978227|ref|ZP_06185177.1| phage integrase family protein [... 33.9 7.3 >gi|15610779|ref|NP_218160.1| hypothetical protein Rv3643 [Mycobacterium tuberculosis H37Rv] gi|31794813|ref|NP_857306.1| hypothetical protein Mb3667 [Mycobacterium bovis AF2122/97] gi|121639556|ref|YP_979780.1| hypothetical protein BCG_3701 [Mycobacterium bovis BCG str. Pasteur 1173P2] 64 more sequence titlesLength=63 Score = 128 bits (321), Expect = 4e-28, Method: Compositional matrix adjust. Identities = 62/63 (99%), Positives = 63/63 (100%), Gaps = 0/63 (0%) Query 1 VERSIGLEAAAQQAGHSGSEITRRHYVERSVTVPDYTAALDEYSRPIRAFRPLKSNRPGD 60 +ERSIGLEAAAQQAGHSGSEITRRHYVERSVTVPDYTAALDEYSRPIRAFRPLKSNRPGD Sbjct 1 MERSIGLEAAAQQAGHSGSEITRRHYVERSVTVPDYTAALDEYSRPIRAFRPLKSNRPGD 60 Query 61 IPT 63 IPT Sbjct 61 IPT 63 >gi|340628605|ref|YP_004747057.1| hypothetical protein MCAN_36531 [Mycobacterium canettii CIPT 140010059] gi|340006795|emb|CCC45984.1| hypothetical protein MCAN_36531 [Mycobacterium canettii CIPT 140010059] Length=63 Score = 125 bits (313), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 61/63 (97%), Positives = 62/63 (99%), Gaps = 0/63 (0%) Query 1 VERSIGLEAAAQQAGHSGSEITRRHYVERSVTVPDYTAALDEYSRPIRAFRPLKSNRPGD 60 +ERSIGL AAAQQAGHSGSEITRRHYVERSVTVPDYTAALDEYSRPIRAFRPLKSNRPGD Sbjct 1 MERSIGLGAAAQQAGHSGSEITRRHYVERSVTVPDYTAALDEYSRPIRAFRPLKSNRPGD 60 Query 61 IPT 63 IPT Sbjct 61 IPT 63 >gi|15843255|ref|NP_338292.1| hypothetical protein MT3746 [Mycobacterium tuberculosis CDC1551] gi|13883612|gb|AAK48106.1| hypothetical protein MT3746 [Mycobacterium tuberculosis CDC1551] Length=33 Score = 69.7 bits (169), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 32/33 (97%), Positives = 33/33 (100%), Gaps = 0/33 (0%) Query 31 VTVPDYTAALDEYSRPIRAFRPLKSNRPGDIPT 63 +TVPDYTAALDEYSRPIRAFRPLKSNRPGDIPT Sbjct 1 MTVPDYTAALDEYSRPIRAFRPLKSNRPGDIPT 33 >gi|118619389|ref|YP_907721.1| hypothetical protein MUL_4218 [Mycobacterium ulcerans Agy99] gi|118571499|gb|ABL06250.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99] Length=51 Score = 68.2 bits (165), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 33/39 (85%), Positives = 37/39 (95%), Gaps = 0/39 (0%) Query 1 VERSIGLEAAAQQAGHSGSEITRRHYVERSVTVPDYTAA 39 VER+IGLEA+A+QAGHS SEITRRHYVE+SV VPDYTAA Sbjct 13 VERAIGLEASARQAGHSSSEITRRHYVEQSVPVPDYTAA 51 >gi|255325881|ref|ZP_05366973.1| phage integrase family protein [Corynebacterium tuberculostearicum SK141] gi|255297093|gb|EET76418.1| phage integrase family protein [Corynebacterium tuberculostearicum SK141] Length=394 Score = 39.7 bits (91), Expect = 0.17, Method: Compositional matrix adjust. Identities = 18/44 (41%), Positives = 29/44 (66%), Gaps = 0/44 (0%) Query 1 VERSIGLEAAAQQAGHSGSEITRRHYVERSVTVPDYTAALDEYS 44 +E G+E AA+Q GH+ IT R+YV+R+ D+TA L+ ++ Sbjct 345 IENVDGVEEAARQLGHASPAITGRYYVKRAADAGDHTAVLEMFA 388 >gi|260904973|ref|ZP_05913295.1| phage integrase family protein [Brevibacterium linens BL2] Length=393 Score = 38.5 bits (88), Expect = 0.37, Method: Compositional matrix adjust. Identities = 16/28 (58%), Positives = 23/28 (83%), Gaps = 0/28 (0%) Query 1 VERSIGLEAAAQQAGHSGSEITRRHYVE 28 ++R GLEAA++QAGHS + +T RHYV+ Sbjct 338 LQRVRGLEAASEQAGHSDTAVTSRHYVQ 365 >gi|311742638|ref|ZP_07716447.1| integrase [Aeromicrobium marinum DSM 15272] gi|311314266|gb|EFQ84174.1| integrase [Aeromicrobium marinum DSM 15272] Length=274 Score = 38.5 bits (88), Expect = 0.37, Method: Compositional matrix adjust. Identities = 16/32 (50%), Positives = 19/32 (60%), Gaps = 0/32 (0%) Query 8 EAAAQQAGHSGSEITRRHYVERSVTVPDYTAA 39 EAA GH+ IT RHY+ R + VPDY A Sbjct 225 EAAMHHLGHTSKVITERHYINRKLVVPDYREA 256 >gi|269956368|ref|YP_003326157.1| integrase family protein [Xylanimonas cellulosilytica DSM 15894] gi|269305049|gb|ACZ30599.1| integrase family protein [Xylanimonas cellulosilytica DSM 15894] Length=422 Score = 37.4 bits (85), Expect = 0.74, Method: Compositional matrix adjust. Identities = 14/28 (50%), Positives = 23/28 (83%), Gaps = 0/28 (0%) Query 1 VERSIGLEAAAQQAGHSGSEITRRHYVE 28 V+R++G++ AA+ GH+ +EITR HY+E Sbjct 347 VDRAVGIDLAAELLGHTTTEITRVHYIE 374 >gi|269955602|ref|YP_003325391.1| integrase family protein [Xylanimonas cellulosilytica DSM 15894] gi|269304283|gb|ACZ29833.1| integrase family protein [Xylanimonas cellulosilytica DSM 15894] Length=401 Score = 37.0 bits (84), Expect = 0.90, Method: Compositional matrix adjust. Identities = 20/46 (44%), Positives = 27/46 (59%), Gaps = 0/46 (0%) Query 1 VERSIGLEAAAQQAGHSGSEITRRHYVERSVTVPDYTAALDEYSRP 46 VER+ G + AA+ GH+ S IT+ HY+ER V TA + E P Sbjct 346 VERAAGADLAAELLGHTSSSITKEHYIERDDHVDARTAEILEALGP 391 >gi|260904954|ref|ZP_05913276.1| phage integrase family protein [Brevibacterium linens BL2] Length=405 Score = 37.0 bits (84), Expect = 0.99, Method: Compositional matrix adjust. Identities = 13/29 (45%), Positives = 21/29 (73%), Gaps = 0/29 (0%) Query 15 GHSGSEITRRHYVERSVTVPDYTAALDEY 43 GHSG+ +T HYV ++ T PD ++ LD++ Sbjct 368 GHSGTGVTSAHYVAKAATAPDMSSILDQF 396 >gi|116669908|ref|YP_830841.1| phage integrase family protein [Arthrobacter sp. FB24] gi|116610017|gb|ABK02741.1| phage integrase family protein [Arthrobacter sp. FB24] Length=374 Score = 35.0 bits (79), Expect = 3.3, Method: Compositional matrix adjust. Identities = 13/31 (42%), Positives = 20/31 (65%), Gaps = 0/31 (0%) Query 13 QAGHSGSEITRRHYVERSVTVPDYTAALDEY 43 Q GHS +TR+HYV+++ PD T L+ + Sbjct 339 QLGHSSENVTRKHYVQKTHEAPDNTVLLEAF 369 >gi|306818603|ref|ZP_07452326.1| possible integrase [Mobiluncus mulieris ATCC 35239] gi|304648776|gb|EFM46078.1| possible integrase [Mobiluncus mulieris ATCC 35239] Length=396 Score = 33.9 bits (76), Expect = 7.3, Method: Compositional matrix adjust. Identities = 16/44 (37%), Positives = 24/44 (55%), Gaps = 0/44 (0%) Query 1 VERSIGLEAAAQQAGHSGSEITRRHYVERSVTVPDYTAALDEYS 44 ++R ++ AA GHS EITR +YVE+ PD L ++ Sbjct 346 IKRESSMKDAAAVLGHSSPEITRTYYVEKENIAPDMRQVLARFA 389 >gi|269978227|ref|ZP_06185177.1| phage integrase family protein [Mobiluncus mulieris 28-1] gi|269933736|gb|EEZ90320.1| phage integrase family protein [Mobiluncus mulieris 28-1] Length=396 Score = 33.9 bits (76), Expect = 7.3, Method: Compositional matrix adjust. Identities = 16/44 (37%), Positives = 24/44 (55%), Gaps = 0/44 (0%) Query 1 VERSIGLEAAAQQAGHSGSEITRRHYVERSVTVPDYTAALDEYS 44 ++R ++ AA GHS EITR +YVE+ PD L ++ Sbjct 346 IKRESSMKDAAAVLGHSSPEITRTYYVEKENIAPDMRQVLARFA 389 Lambda K H 0.314 0.131 0.372 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 131230491224 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40