BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0239
Length=77
Score E
Sequences producing significant alignments: (Bits) Value
gi|15607380|ref|NP_214753.1| hypothetical protein Rv0239 [Mycoba... 157 4e-37
gi|196231347|ref|ZP_03130206.1| hypothetical protein CfE428DRAFT... 56.6 1e-06
gi|336288124|gb|AEI30374.1| conserved hypothetical protein [uncu... 38.9 0.26
gi|313246088|emb|CBY35044.1| unnamed protein product [Oikopleura... 36.2 1.5
gi|291296006|ref|YP_003507404.1| CopG domain-containing protein ... 36.2 1.6
gi|320450298|ref|YP_004202394.1| CopG domain-containing protein ... 35.4 2.9
gi|297623625|ref|YP_003705059.1| CopG family transcriptional reg... 33.9 7.8
>gi|15607380|ref|NP_214753.1| hypothetical protein Rv0239 [Mycobacterium tuberculosis H37Rv]
gi|15839620|ref|NP_334657.1| hypothetical protein MT0253 [Mycobacterium tuberculosis CDC1551]
gi|31791417|ref|NP_853910.1| hypothetical protein Mb0245 [Mycobacterium bovis AF2122/97]
73 more sequence titles
Length=77
Score = 157 bits (398), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 77/77 (100%), Positives = 77/77 (100%), Gaps = 0/77 (0%)
Query 1 MIRTQVQLPDELYRDAKRVAHEHEMTLAEVVRRGLEHMVRIYPRRDAASDTWQPPTPRRL 60
MIRTQVQLPDELYRDAKRVAHEHEMTLAEVVRRGLEHMVRIYPRRDAASDTWQPPTPRRL
Sbjct 1 MIRTQVQLPDELYRDAKRVAHEHEMTLAEVVRRGLEHMVRIYPRRDAASDTWQPPTPRRL 60
Query 61 GPFRASEETWRELANEA 77
GPFRASEETWRELANEA
Sbjct 61 GPFRASEETWRELANEA 77
>gi|196231347|ref|ZP_03130206.1| hypothetical protein CfE428DRAFT_3371 [Chthoniobacter flavus
Ellin428]
gi|196224683|gb|EDY19194.1| hypothetical protein CfE428DRAFT_3371 [Chthoniobacter flavus
Ellin428]
Length=83
Score = 56.6 bits (135), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/76 (41%), Positives = 42/76 (56%), Gaps = 1/76 (1%)
Query 1 MIRTQVQLPDELYRDAKRVAHEHEMTLAEVVRRGLEHMVRIYPRRDAASDTWQPPTPRRL 60
M RTQ+QLPDELY+ K A + E++LAEV RRG+E + YP + W+ P
Sbjct 1 MTRTQIQLPDELYQRVKAFAEQRELSLAEVARRGIELFLSRYPETPESGREWKLPCVDG- 59
Query 61 GPFRASEETWRELANE 76
G + E R +A E
Sbjct 60 GGLKVPLEQLRSIAAE 75
>gi|336288124|gb|AEI30374.1| conserved hypothetical protein [uncultured microorganism]
Length=96
Score = 38.9 bits (89), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 16/46 (35%), Positives = 29/46 (64%), Gaps = 0/46 (0%)
Query 2 IRTQVQLPDELYRDAKRVAHEHEMTLAEVVRRGLEHMVRIYPRRDA 47
++T VQ+PD L+ +A++VA+ TL ++ GL +V + RR+
Sbjct 1 MKTTVQIPDSLFEEARKVANRERTTLKALIEEGLRRIVSQHKRRNG 46
>gi|313246088|emb|CBY35044.1| unnamed protein product [Oikopleura dioica]
Length=293
Score = 36.2 bits (82), Expect = 1.5, Method: Composition-based stats.
Identities = 22/76 (29%), Positives = 36/76 (48%), Gaps = 4/76 (5%)
Query 1 MIRTQVQLPDELYRDAKRVAHEHEMTLAEVVRRGLEHMVRIYPRRDAASDTWQPPTPRRL 60
M T + LY D+ E+E++L E +R + I +R +S+ Q P P +
Sbjct 69 MAHTHSLPKNNLYDDSTDFRREYELSLVEAIRGEMSSRGLIAKKRSDSSEAEQKPKPDK- 127
Query 61 GPFRASEETWRELANE 76
++ + RELANE
Sbjct 128 ---KSFSKRLRELANE 140
>gi|291296006|ref|YP_003507404.1| CopG domain-containing protein DNA-binding domain-containing
protein [Meiothermus ruber DSM 1279]
gi|290470965|gb|ADD28384.1| CopG domain protein DNA-binding domain protein [Meiothermus ruber
DSM 1279]
Length=82
Score = 36.2 bits (82), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 14/39 (36%), Positives = 28/39 (72%), Gaps = 0/39 (0%)
Query 1 MIRTQVQLPDELYRDAKRVAHEHEMTLAEVVRRGLEHMV 39
M+RTQ+QL + ++ + +AH +++AE VRR +++M+
Sbjct 1 MVRTQIQLEEAQWQQLREIAHREGISIAEAVRRAVDNML 39
>gi|320450298|ref|YP_004202394.1| CopG domain-containing protein DNA-binding domain-containing
protein [Thermus scotoductus SA-01]
gi|320150467|gb|ADW21845.1| CopG domain protein DNA-binding domain protein [Thermus scotoductus
SA-01]
Length=79
Score = 35.4 bits (80), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 26/36 (73%), Gaps = 0/36 (0%)
Query 1 MIRTQVQLPDELYRDAKRVAHEHEMTLAEVVRRGLE 36
M+RTQVQL +E R + +A E ++LAE+VRR +E
Sbjct 1 MVRTQVQLTEEQARRLRALAREEGVSLAEMVRRAVE 36
>gi|297623625|ref|YP_003705059.1| CopG family transcriptional regulator [Truepera radiovictrix
DSM 17093]
gi|297164805|gb|ADI14516.1| transcriptional regulator, CopG family [Truepera radiovictrix
DSM 17093]
Length=82
Score = 33.9 bits (76), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 24/58 (42%), Positives = 33/58 (57%), Gaps = 6/58 (10%)
Query 2 IRTQVQLPDELYRDAKRVAHEHEMTLAEVVRRGLEHMVRIYPRRDAASDTWQPP-TPR 58
+RT ++L DEL R+AKR+A E TL V+ L + RR A D +PP TP+
Sbjct 1 MRTTIRLDDELLREAKRLAAETNQTLTAVIEEALRERL---ARRKGARD--RPPFTPK 53
Lambda K H
0.320 0.132 0.404
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 130175841596
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40