GenBank accession
WMM35793.1 [GenBank]
Protein name
tail spike protein
RBP type
TF (evidence: GenBank, probability 1.00)
TF (evidence: RBPdetect, probability 0.72)
Protein sequence
MFVHDIKTGQQHELIHVEPKVNDDITGKKDLSFSITLTEYNQIPFNALVGRNFIIIDEVRYKKQQYFINTPTIKQEGALLTKDITATHIYSFRAIKHVVHETIEGTKTLNEALKHAVKGSEITFTIMPDAKEIDAKKLEGFGNKKTSELMDEIISTFGVEIIPDNTHLYIYKKAGKEIVKRLDNLSNLTSLQITTSEDNTTTRVKGYGKLKEDKDILGDQSIPYDSKTGTWTYNSSLKADYTKKIGATFSFSFTGTGFKFKTLVSKLGGKWEFKIGDQTKTISVYKDSAPTEKEFDIIRGLDSKTYKVVATFKSRDSNNPNTKGTKKVDPVMYLLRGNIIGVYRTFKNEDEKYIFPPVTYVHPEEEKFLINGQPSWAEPVTDDSIKTKDDMIKLLKTKVNPYAEVSYDADYVELLDQALADIEEPVMAGDTIRVYADTPLNGITFDGKLRATGASYNPLRPEQPSDLTIDGKRKSRVDMEIEEKKRAKNQEQAIKNYQNQLATGLAEITQIKQSLATAQPSQQTTYTFSIQFLNGEWSVSYGEGFASLESGILSLNTDDDYTIQYVTGDANFIMKEAGYSLYVDDVDVNKINITLYKDGKLSDPLGVPDGSKVKILIVGQK
Physico-chemical properties
protein length: 621 AA
molecular weight: 69841.19430 Da
isoelectric point: 5.88752
aromaticity: 0.09340
hydropathy: -0.50757
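As a rough illustration of how sequence-derived values such as these can be computed, the sketch below calculates length, aromaticity, and mean hydropathy from a protein string. It assumes that aromaticity is the fraction of F, W, and Y residues and that hydropathy is the mean Kyte-Doolittle value (GRAVY). Whether the record used exactly these definitions is an assumption, and the function names are illustrative only.

```python
# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    seq = seq.upper()
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


def summarize(seq: str) -> dict:
    """Collect the simple sequence-derived properties in one dictionary."""
    return {
        "length": len(seq),
        "aromaticity": aromaticity(seq),
        "hydropathy": gravy(seq),
    }


if __name__ == "__main__":
    # First ten residues of the protein sequence in this record, as a demo input.
    print(summarize("MFVHDIKTGQ"))
```

Passing the full 621-residue sequence to summarize should give values comparable to the ones listed, provided the assumed definitions match those used by the database.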

Domains

Domains [InterPro]
G3DSA:3.55.50.40 | STR | 88–170
IPR010572 | ENZ | 97–479
Sequence track: WMM35793.1, residues 1–621
Architecture
ENZ 9-87 | STR 88-355 | ENZ 356-479
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
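The architecture string above can be turned into structured segments for downstream checks. The following is a minimal sketch, assuming the "CLASS start-end" segments separated by "|" seen in this record; the function names are hypothetical and not part of any database API.

```python
import re

# One segment looks like "ENZ 9-87"; a hyphen or an en dash is accepted.
SEGMENT = re.compile(r"([A-Z]+)\s+(\d+)\s*[-\u2013]\s*(\d+)")


def parse_architecture(text: str) -> list:
    """Parse 'ENZ 9-87 | STR 88-355 | ...' into (class, start, end) tuples."""
    segments = []
    for part in text.split("|"):
        part = part.strip()
        if not part:  # tolerate a trailing separator
            continue
        match = SEGMENT.fullmatch(part)
        if match is None:
            raise ValueError(f"unrecognized segment: {part!r}")
        segments.append((match.group(1), int(match.group(2)), int(match.group(3))))
    return segments


def is_contiguous(segments: list) -> bool:
    """True if each segment starts right after the previous one ends."""
    return all(b[1] == a[2] + 1 for a, b in zip(segments, segments[1:]))


def covered_residues(segments: list) -> int:
    """Total number of residues assigned to any segment (1-based, inclusive)."""
    return sum(end - start + 1 for _, start, end in segments)


segments = parse_architecture("ENZ 9-87 | STR 88-355 | ENZ 356-479 |")
```

For this record the three segments are contiguous and span residues 9 to 479 of the 621-residue protein, leaving both termini unassigned.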

Taxonomy

Name | Taxonomy ID | Lineage
Phage: Bacillus phage vB_BteM-A9Y [NCBI] | 2945959 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host: Bacillus tequilensis [NCBI] | 227866 | cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Bacillales

Coding sequence (CDS)

GenBank protein accession
WMM35793.1 [NCBI]
GenBank nucleotide accession
ON528935.2 [NCBI]
CDS location
range 27556 -> 29421
strand +
CDS
ATGTTTGTACATGACATTAAAACCGGTCAACAGCATGAACTCATTCACGTTGAGCCCAAAGTGAACGATGATATCACTGGGAAAAAGGATTTGTCTTTCTCAATTACTTTAACTGAATACAATCAGATTCCTTTCAATGCTTTAGTCGGAAGGAATTTTATTATTATCGACGAGGTGCGGTATAAAAAGCAGCAGTATTTTATTAACACGCCCACTATTAAACAAGAAGGGGCCTTGCTGACAAAAGACATAACAGCTACGCATATCTATTCCTTTAGGGCGATCAAGCATGTTGTTCATGAAACTATTGAAGGTACAAAAACTCTCAATGAAGCACTTAAGCACGCAGTAAAAGGCAGTGAAATCACATTTACAATTATGCCTGATGCGAAGGAGATCGACGCTAAAAAGCTGGAGGGTTTCGGCAATAAAAAGACTTCCGAGCTGATGGATGAAATCATTTCCACCTTTGGAGTGGAGATCATCCCGGATAACACTCATTTATACATTTACAAAAAAGCCGGCAAAGAGATTGTAAAAAGGCTGGATAACCTTTCGAATCTCACGTCTTTACAGATTACAACAAGTGAAGATAACACGACAACGCGGGTGAAAGGATACGGGAAGCTCAAGGAAGATAAAGACATCCTGGGCGATCAGTCCATTCCCTACGATTCCAAAACAGGCACTTGGACGTATAATAGTTCATTAAAAGCAGATTACACCAAGAAAATAGGAGCCACGTTTTCTTTTTCCTTCACAGGGACAGGCTTTAAATTTAAAACCCTAGTGTCAAAGCTGGGTGGTAAATGGGAATTTAAGATAGGCGATCAGACGAAAACCATATCTGTCTATAAAGATTCAGCCCCGACAGAAAAAGAGTTTGATATCATTCGCGGCCTGGACAGTAAAACTTATAAGGTAGTGGCTACCTTTAAAAGCAGGGACAGCAATAACCCTAATACAAAAGGCACAAAGAAAGTCGACCCGGTCATGTATCTTCTGCGCGGCAACATTATCGGGGTGTACAGAACTTTTAAGAATGAGGATGAAAAGTATATCTTTCCACCAGTCACCTATGTTCACCCGGAAGAAGAAAAGTTTCTAATCAATGGGCAGCCATCCTGGGCGGAACCGGTCACGGATGATTCAATCAAGACAAAGGATGACATGATTAAGCTGCTTAAAACCAAAGTCAATCCTTACGCAGAGGTGTCCTATGATGCCGACTATGTGGAATTGTTAGATCAGGCCTTGGCTGATATAGAAGAGCCGGTTATGGCAGGGGACACCATTCGTGTATATGCTGACACGCCTCTAAACGGAATTACATTTGATGGGAAGCTGAGAGCAACAGGGGCTTCATATAACCCACTGAGACCAGAACAGCCTTCTGACCTAACAATTGACGGGAAACGAAAAAGCCGGGTAGACATGGAAATTGAAGAGAAAAAGCGTGCAAAGAATCAGGAACAGGCGATCAAGAATTATCAAAATCAATTGGCCACCGGGTTGGCTGAAATCACTCAGATTAAGCAGAGCTTGGCTACTGCACAACCGTCTCAGCAGACCACATATACTTTCTCTATTCAATTCTTAAATGGTGAATGGTCTGTGTCTTATGGCGAAGGCTTTGCTTCCTTGGAATCTGGCATCCTTTCTTTGAATACAGACGATGATTACACCATCCAGTATGTGACCGGTGATGCTAATTTTATTATGAAAGAGGCGGGCTACTCACTTTATGTTGATGATGTTGACGTAAACAAGATTAATATAACCCTATATAAAGATGGTAAACTCAGTGATCCTCTTGGAGTTCCAGACGGATCGAAAGTAAAAATCCTCATAGTAGGACAAAAATAG
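The CDS coordinates and the protein length can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates (the GenBank convention) and the standard genetic code; the helper names are illustrative. It computes the CDS length from the range and translates a nucleotide string up to the first stop codon.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive coordinate range."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a CDS in frame, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)


n_nt = cds_length(27556, 29421)   # 1866 nucleotides
n_codons = n_nt // 3              # 622 codons, including the stop codon
first_residues = translate("ATGTTTGTACATGACATTAAA")  # first 21 nt of the CDS
```

The range 27556 -> 29421 gives 1866 nucleotides, that is 622 codons: 621 amino acids plus the terminal TAG stop codon, which agrees with the stated protein length.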

Genome Context

Tertiary structure

PDB ID
b47371e58831cd9c0bdae4f7edbc18cc2447926529d6d43200037e73674ee8e5
Source ESMFold
Method ESMFold
Resolution 0.8752
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
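The confidence bands above can be applied to per-residue pLDDT scores with a small helper. This is a sketch only: the legend does not say how scores of exactly 90, 70, or 50 are binned, so assigning them to the lower band is an assumption, and the function name is hypothetical.

```python
def classify_plddt(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Assumption: scores exactly on a boundary fall into the lower band.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


def band_counts(scores: list) -> dict:
    """Count how many residues fall into each confidence band."""
    counts = {"Very high": 0, "High": 0, "Low": 0, "Very low": 0}
    for score in scores:
        counts[classify_plddt(score)] += 1
    return counts


# Illustrative scores only, not taken from the model in this record.
example = band_counts([95.2, 88.0, 71.5, 64.3, 42.0])
```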