Genbank accession
UGO54080.1 [GenBank]
Protein name
tail fiber protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
TF
Evidence UniProt/TrEMBL
Probability 1,00
Protein sequence
MADKSPVKISQLPEPTGTGINDSDIFAASRITGADKLETQRITVSELRKLMDYGNAFSDLNAAIAATVKDQQFYVFVDDSKEFVYRYVNMGGIASPVLSADGTPVKEPTKNLLRNLGDIKSQDGFGLVGAVTSFDALRKLTPRAAGWRVYLAGYYEGTTLGSGFFVSKSGVATDDQGVIAVVNGSYYWERVLETNESIPMDYYGIRPGDDISDKLNFAVIAAKVRNIVDISLPGTPFDKPFILSKKVDIDLTDGKVIFIKGACNKIGTIIQHRFDGTAFYFHRNHVSSKNFWNTGGIENVTITCHTDYQQTATAAAIQISDTWGFIVNRVRILNFKAGNGIVVKNETAWTEGTKITDVDIRATQNGVLFTRDVTNDKNTNSFFGTYFNQYSFQAGTGKAGTSAIRVGDAATDAANKTCVLYGSEIQFRYWAEGGGANFGLMVSSSGQVTSGNTLIIPDGVGLSANDTITATPLKCIAVRGNGTYLNNTTIEPYQGRMNCFKVKDINFALEALLSFEKGVGVNSTLFKGRPVIDPKGLRFYTYQEFTGDEIKDGFSVGMTNLPVGTRLRVVLRYSTTDNDDNSRISSYIVSVGGAGMFTTVAPENIAILNNVTTTKSAVSGTDVVTDVGINDGRSRLNDWVDTATSPRVVNGNATQTNKDGYGTNARNFRIVVPANPAATATLKLGVEVEFI
Physico‐chemical
properties
protein length:691 AA
molecular weight: 74745,13140 Da
isoelectric point:6,21809
aromaticity:0,09696
hydropathy:-0,15412

Domains

Domains [InterPro]
DC_0717
ATT
99–220
UGO54080.1
1 691
Architecture
ATT
STR
ATT 3-220 | STR 221-691
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
UGO54080.1
1 691
Domain Start End Length (AA) Confidence
N-terminal 1 212 212 0,9944
Central domain 213 505 294 0,9931
C-terminal 506 691 185 0,9829
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-212
Central
213-505
C-terminal
506-691

Taxonomy

  Name Taxonomy ID Lineage
Phage Serratia phage vB_SmaM_Haymo
[NCBI]
2902685 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
UGO54080.1 [NCBI]
Genbank nucleotide accession
OL539468.1 [NCBI]
CDS location
range 112576 -> 114651
strand +
CDS
ATGGCCGATAAATCACCAGTTAAAATATCTCAGTTACCCGAGCCTACCGGGACTGGGATTAACGATAGCGACATCTTCGCGGCATCCCGTATAACGGGAGCAGATAAGTTAGAGACTCAACGTATCACTGTCAGCGAATTGCGTAAGCTGATGGATTACGGTAATGCATTCTCTGACTTGAACGCAGCTATTGCAGCAACCGTTAAAGATCAACAGTTTTACGTATTCGTTGATGACTCGAAAGAGTTCGTTTACCGTTACGTGAACATGGGCGGAATTGCTTCCCCAGTTTTAAGTGCCGATGGTACACCTGTTAAAGAACCAACTAAAAACTTACTGCGTAATCTTGGCGATATTAAAAGTCAAGATGGTTTCGGTTTAGTTGGGGCAGTGACCTCTTTCGATGCCCTGCGTAAGTTAACACCTCGTGCTGCCGGCTGGCGAGTCTATCTTGCAGGTTATTACGAAGGGACTACTTTAGGTTCCGGATTCTTCGTTTCTAAATCTGGTGTAGCTACTGATGACCAAGGTGTCATTGCCGTTGTTAACGGTAGCTATTATTGGGAACGTGTACTGGAAACTAACGAAAGTATCCCAATGGATTATTACGGTATCCGTCCTGGTGATGATATTTCCGATAAACTCAACTTTGCAGTTATTGCAGCTAAAGTTCGGAATATCGTAGATATCTCTTTGCCAGGGACACCTTTCGACAAACCATTCATCCTTTCTAAAAAAGTAGATATCGATCTCACCGATGGAAAGGTCATCTTTATTAAAGGTGCATGTAACAAGATCGGTACAATCATTCAACATAGGTTCGATGGAACCGCATTCTATTTCCACCGCAACCATGTGTCCAGTAAGAACTTCTGGAACACTGGCGGTATTGAGAATGTTACAATCACCTGCCATACTGACTATCAACAGACAGCAACTGCGGCTGCAATCCAGATTTCAGATACCTGGGGCTTTATTGTAAACCGTGTTCGTATCCTTAACTTCAAAGCCGGCAACGGTATCGTCGTTAAGAATGAGACAGCATGGACTGAAGGGACTAAGATCACCGACGTCGATATTCGAGCCACTCAAAACGGCGTGCTGTTTACCAGGGATGTAACCAACGATAAGAACACCAACTCGTTCTTCGGTACTTACTTCAATCAATATAGTTTCCAGGCCGGTACTGGTAAGGCCGGTACTTCCGCTATTCGTGTAGGCGATGCAGCTACAGACGCCGCTAACAAAACATGTGTCCTCTACGGTTCTGAAATTCAGTTCCGCTATTGGGCAGAAGGTGGCGGCGCTAACTTCGGACTGATGGTTTCTAGTAGTGGCCAGGTTACATCAGGCAACACGCTAATTATCCCAGATGGTGTTGGGTTGAGTGCTAATGATACCATCACCGCTACTCCGCTTAAATGTATCGCAGTACGTGGTAATGGGACCTACCTGAACAACACGACAATCGAACCTTACCAGGGGAGAATGAACTGCTTCAAGGTTAAGGATATTAACTTTGCACTGGAAGCTCTGCTCAGTTTTGAGAAAGGTGTCGGCGTCAACAGTACCTTGTTCAAAGGTCGTCCTGTTATCGATCCTAAAGGCCTGCGGTTCTATACCTACCAAGAGTTCACTGGGGATGAAATCAAAGATGGATTCTCAGTTGGGATGACTAACCTTCCGGTAGGTACTCGTCTACGTGTTGTGTTACGCTACTCTACTACCGACAATGATGATAACAGCCGTATATCTTCGTACATCGTTTCTGTTGGTGGTGCTGGTATGTTCACTACAGTAGCTCCGGAGAATATCGCTATTCTTAATAATGTCACAACGACTAAGTCAGCAGTTAGTGGGACGGATGTTGTTACTGATGTAGGGATTAACGATGGGCGTTCACGCCTTAACGATTGGGTGGATACTGCGACAAGTCCACGCGTCGTTAACGGTAACGCAACGCAGACTAACAAAGACGGCTACGGAACCAACGCACGTAACTTCCGCATCGTTGTACCAGCTAACCCGGCTGCTACAGCTACCCTGAAACTGGGTGTTGAAGTGGAATTCATTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
6a78d5ba74a6a556e12c16bfc92c90947f02275ea6405b52cdd33e9fc57a1536
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6687
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50