BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv1507A Length=167 Score E Sequences producing significant alignments: (Bits) Value gi|15840970|ref|NP_336007.1| hypothetical protein MT1555.1 [Myco... 343 4e-93 gi|289442957|ref|ZP_06432701.1| hypothetical protein TBLG_00060 ... 340 3e-92 gi|340626525|ref|YP_004744977.1| hypothetical protein MCAN_15271... 335 1e-90 gi|144899028|emb|CAM75892.1| conserved hypothetical protein [Mag... 39.3 0.20 gi|144899029|emb|CAM75893.1| Hemerythrin [Magnetospirillum gryph... 38.1 0.49 gi|149640945|ref|XP_001514678.1| PREDICTED: thioredoxin-related ... 37.0 0.90 gi|320594178|gb|EFX06581.1| c2h2 finger domain containing protei... 35.8 2.0 >gi|15840970|ref|NP_336007.1| hypothetical protein MT1555.1 [Mycobacterium tuberculosis CDC1551] gi|57116879|ref|YP_177648.1| hypothetical protein Rv1507A [Mycobacterium tuberculosis H37Rv] gi|148822729|ref|YP_001287484.1| hypothetical protein TBFG_11539 [Mycobacterium tuberculosis F11] 39 more sequence titlesLength=167 Score = 343 bits (881), Expect = 4e-93, Method: Compositional matrix adjust. Identities = 167/167 (100%), Positives = 167/167 (100%), Gaps = 0/167 (0%) Query 1 MQSGQNILAKVCNLIEQSRLSSTRCLQFRITNTSRPRQLRWSEFKRFCDIFNMVLGKARM 60 MQSGQNILAKVCNLIEQSRLSSTRCLQFRITNTSRPRQLRWSEFKRFCDIFNMVLGKARM Sbjct 1 MQSGQNILAKVCNLIEQSRLSSTRCLQFRITNTSRPRQLRWSEFKRFCDIFNMVLGKARM 60 Query 61 GRDPGRPVRDERRIVSCEIIASDHIGLAAARLLAKRYRGRSVSGFVLMIKSASVHEIDSW 120 GRDPGRPVRDERRIVSCEIIASDHIGLAAARLLAKRYRGRSVSGFVLMIKSASVHEIDSW Sbjct 61 GRDPGRPVRDERRIVSCEIIASDHIGLAAARLLAKRYRGRSVSGFVLMIKSASVHEIDSW 120 Query 121 SSPSVAMSIGVALCSYPHYAAARTSPPNRDWGEDTTRSRPVTGLLAG 167 SSPSVAMSIGVALCSYPHYAAARTSPPNRDWGEDTTRSRPVTGLLAG Sbjct 121 SSPSVAMSIGVALCSYPHYAAARTSPPNRDWGEDTTRSRPVTGLLAG 167 >gi|289442957|ref|ZP_06432701.1| hypothetical protein TBLG_00060 [Mycobacterium tuberculosis T46] gi|289569535|ref|ZP_06449762.1| hypothetical protein TBJG_03725 [Mycobacterium tuberculosis T17] gi|289750069|ref|ZP_06509447.1| hypothetical protein TBDG_02796 [Mycobacterium tuberculosis T92] gi|289415876|gb|EFD13116.1| hypothetical protein TBLG_00060 [Mycobacterium tuberculosis T46] gi|289543289|gb|EFD46937.1| hypothetical protein TBJG_03725 [Mycobacterium tuberculosis T17] gi|289690656|gb|EFD58085.1| hypothetical protein TBDG_02796 [Mycobacterium tuberculosis T92] Length=167 Score = 340 bits (873), Expect = 3e-92, Method: Compositional matrix adjust. Identities = 166/167 (99%), Positives = 166/167 (99%), Gaps = 0/167 (0%) Query 1 MQSGQNILAKVCNLIEQSRLSSTRCLQFRITNTSRPRQLRWSEFKRFCDIFNMVLGKARM 60 MQSGQNILAKVCNLIEQSRLSSTRCLQFRITNTSRPRQLRWSEFKRFCDIFNMVLGKARM Sbjct 1 MQSGQNILAKVCNLIEQSRLSSTRCLQFRITNTSRPRQLRWSEFKRFCDIFNMVLGKARM 60 Query 61 GRDPGRPVRDERRIVSCEIIASDHIGLAAARLLAKRYRGRSVSGFVLMIKSASVHEIDSW 120 GRDPGRPVRDERRIVSCEIIASDHIGLAAARL AKRYRGRSVSGFVLMIKSASVHEIDSW Sbjct 61 GRDPGRPVRDERRIVSCEIIASDHIGLAAARLPAKRYRGRSVSGFVLMIKSASVHEIDSW 120 Query 121 SSPSVAMSIGVALCSYPHYAAARTSPPNRDWGEDTTRSRPVTGLLAG 167 SSPSVAMSIGVALCSYPHYAAARTSPPNRDWGEDTTRSRPVTGLLAG Sbjct 121 SSPSVAMSIGVALCSYPHYAAARTSPPNRDWGEDTTRSRPVTGLLAG 167 >gi|340626525|ref|YP_004744977.1| hypothetical protein MCAN_15271 [Mycobacterium canettii CIPT 140010059] gi|340004715|emb|CCC43859.1| hypothetical protein MCAN_15271 [Mycobacterium canettii CIPT 140010059] Length=167 Score = 335 bits (859), Expect = 1e-90, Method: Compositional matrix adjust. Identities = 164/167 (99%), Positives = 165/167 (99%), Gaps = 0/167 (0%) Query 1 MQSGQNILAKVCNLIEQSRLSSTRCLQFRITNTSRPRQLRWSEFKRFCDIFNMVLGKARM 60 MQSGQNILAKVCNLIEQSRLSSTRCLQFRITNTSRPRQLR SEFKRFCDIFNMVLGKARM Sbjct 1 MQSGQNILAKVCNLIEQSRLSSTRCLQFRITNTSRPRQLRRSEFKRFCDIFNMVLGKARM 60 Query 61 GRDPGRPVRDERRIVSCEIIASDHIGLAAARLLAKRYRGRSVSGFVLMIKSASVHEIDSW 120 GRDPGRPVRDERRIVSCEIIASDHIGLAAARLLAKRYRGRSVSG VLMIKSASVHEIDSW Sbjct 61 GRDPGRPVRDERRIVSCEIIASDHIGLAAARLLAKRYRGRSVSGLVLMIKSASVHEIDSW 120 Query 121 SSPSVAMSIGVALCSYPHYAAARTSPPNRDWGEDTTRSRPVTGLLAG 167 SSPSVA+SIGVALCSYPHYAAARTSPPNRDWGEDTTRSRPVTGLLAG Sbjct 121 SSPSVAISIGVALCSYPHYAAARTSPPNRDWGEDTTRSRPVTGLLAG 167 >gi|144899028|emb|CAM75892.1| conserved hypothetical protein [Magnetospirillum gryphiswaldense MSR-1] Length=396 Score = 39.3 bits (90), Expect = 0.20, Method: Compositional matrix adjust. Identities = 19/57 (34%), Positives = 31/57 (55%), Gaps = 1/57 (1%) Query 3 SGQNILAKVCNLIE-QSRLSSTRCLQFRITNTSRPRQLRWSEFKRFCDIFNMVLGKA 58 +G AKV NL+ +L R LQ R + P+ L+W+ ++ +FN++L KA Sbjct 4 AGDGFEAKVANLVAVTQKLDVQRILQVRRLTMADPQSLKWASARQLDLVFNVILAKA 60 >gi|144899029|emb|CAM75893.1| Hemerythrin [Magnetospirillum gryphiswaldense MSR-1] Length=424 Score = 38.1 bits (87), Expect = 0.49, Method: Compositional matrix adjust. Identities = 23/65 (36%), Positives = 35/65 (54%), Gaps = 3/65 (4%) Query 10 KVCNLIEQSR-LSSTRCLQFRITNTSRPRQLRWSEFKRFCDIFNMVLGKA--RMGRDPGR 66 KV NLI +R L + L R + P L+W+ + +F+++LGKA R+G D R Sbjct 14 KVSNLINATRHLEVPQILLLRRLTMADPESLKWANIRELDVVFSVILGKAVERLGVDALR 73 Query 67 PVRDE 71 RD+ Sbjct 74 QARDK 78 >gi|149640945|ref|XP_001514678.1| PREDICTED: thioredoxin-related transmembrane protein 4-like [Ornithorhynchus anatinus] Length=320 Score = 37.0 bits (84), Expect = 0.90, Method: Compositional matrix adjust. Identities = 27/99 (28%), Positives = 43/99 (44%), Gaps = 12/99 (12%) Query 38 QLRWSEFKRFCDIFNMVLGKARMGRDPGRPVRDERRIVSCEIIASDHIGLAAARLLAKRY 97 +L W F + DI ++ +GK + ++PG R + A D I +RY Sbjct 52 ELEWETFAKSGDILDISVGKVDVTQEPGLSGRFFVTTLPTIFHAKDGI--------FRRY 103 Query 98 RGRSVS----GFVLMIKSASVHEIDSWSSPSVAMSIGVA 132 RG +S ++L K +V + W SPS G+A Sbjct 104 RGPGISKDLQNYILEKKWEAVEPVAGWKSPSSITMSGMA 142 >gi|320594178|gb|EFX06581.1| c2h2 finger domain containing protein [Grosmannia clavigera kw1407] Length=1109 Score = 35.8 bits (81), Expect = 2.0, Method: Compositional matrix adjust. Identities = 16/35 (46%), Positives = 25/35 (72%), Gaps = 0/35 (0%) Query 100 RSVSGFVLMIKSASVHEIDSWSSPSVAMSIGVALC 134 SV GF+LM+++A V+E++ +SP VA+SI C Sbjct 550 ESVDGFMLMLETACVYEVEVGNSPMVALSIVCLRC 584 Lambda K H 0.323 0.134 0.408 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 127548590676 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40