Genbank accession
ANM47214.1 [GenBank]
Protein name
tail fibers protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,94
Protein sequence
MKTQFNQPQGSTSRETNKEAIARVYGIKKSDVAYVNNGLIVDDYKILYDKATQTSYYVGNATGNIQSWSVTSKYISITTNFGVFPCFKASVSKGIIPTISDLRNLRFDEVDQTVEVFEHTVDQNSGGGIWYCYSLNNPNSLIDDNGCQIINNYGQVIRRKDVKSLYSDMFGLVPGGDFDSVIQNMFKASRTFNIEEAWICHMGRDNPKPRSNGGNVFDLSDGMSFYVKGVGGGRFGASIVHAGNNICMRFRRDYNTSKEFWISGGVEGLRIVGQGDSFSGTNSYVNATAIEISDLWGANLNHIFISGYTGNSSGSAISLYNETGWTEGTELNDIVIRQSVNGLWLHRNPVSGSGATDSFFKTVGNLDINAGVSGTAINFIKIGDGTSAGKCLLYGHDIMLTGWMSNGSWHNGILVTNYSTCLNGKFRLNFDGYGISSSASSEVIHLIRLGGSESLFDCDVVNTSGQTDGYPLSMLKLLMNSCVYLDDSTVFDSSIIKGRPLVRAKGLTVRYTGTFTQAECISGMSYQLNGLVPGTKLKVTLRSWGNNSKYEAVTQYWDVEVRGTDLPCIVKPTLASGTSLTTSNVSGGVINGVTTAKAQFMQTASTSADAVTQVNTTTANALASATLTTSQLQINSTFNTSLTLTNGQANNGISYAANSGRKINIVLPADADATQPMPYTVEIEVL
Physico‐chemical
properties
protein length:686 AA
molecular weight: 74114,07710 Da
isoelectric point:5,96647
aromaticity:0,09475
hydropathy:-0,16837

Domains

Domains [InterPro]
G3DSA:3.30.2020.50
ATT
1–93
G3DSA:3.30.2020.50
ATT
1–95
ANM47214.1
1 686
Architecture
ATT
STR
ATT 1-95 | STR 160-686
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
ANM47214.1
1 686
Domain Start End Length (AA) Confidence
N-terminal 1 178 178 0,9930
Central domain 179 485 308 0,9832
C-terminal 486 686 200 0,9641
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-178
Central
179-485
C-terminal
486-686

Taxonomy

  Name Taxonomy ID Lineage
Phage Serratia phage vB_Sru_IME250
[NCBI]
1852640 Uroviricota > Caudoviricetes > Pantevenvirales > Taipeivirus > Taipeivirus IME250
Host Serratia rubidaea
[NCBI]
61652 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
ANM47214.1 [NCBI]
Genbank nucleotide accession
KX147096.1 [NCBI]
CDS location
range 82640 -> 84700
strand +
CDS
ATGAAAACGCAATTTAACCAACCCCAAGGCTCCACGTCCCGCGAAACGAATAAGGAAGCGATTGCTCGGGTATATGGTATTAAGAAAAGCGACGTCGCTTATGTTAACAACGGATTAATTGTTGATGATTACAAAATTTTGTACGATAAAGCTACACAAACATCTTATTATGTGGGGAATGCCACGGGAAATATACAAAGTTGGTCAGTAACTTCAAAATATATTTCCATAACAACAAATTTTGGCGTTTTCCCTTGTTTCAAAGCTTCTGTGTCCAAAGGGATAATACCTACCATTTCCGATTTAAGGAATTTAAGATTTGATGAAGTGGATCAGACCGTAGAAGTTTTTGAACATACGGTAGATCAAAACTCTGGTGGTGGTATTTGGTATTGTTATTCCTTGAATAATCCTAATTCTTTAATAGATGACAATGGTTGTCAAATTATTAACAATTATGGGCAGGTTATAAGAAGAAAAGATGTTAAAAGCCTTTATTCTGACATGTTTGGCTTAGTCCCTGGAGGGGATTTCGACTCTGTAATCCAAAATATGTTCAAAGCTTCTAGAACATTTAATATAGAAGAAGCATGGATTTGTCATATGGGCAGAGATAATCCAAAACCAAGAAGTAATGGTGGTAATGTTTTTGACCTGTCTGACGGCATGAGTTTTTATGTTAAAGGCGTCGGCGGCGGAAGATTCGGCGCTTCTATAGTACATGCAGGTAATAATATATGCATGAGATTTAGGCGGGACTATAATACTTCTAAAGAGTTTTGGATAAGCGGCGGGGTAGAAGGTCTCAGGATCGTAGGACAAGGAGATTCGTTCAGCGGAACGAATTCCTATGTTAATGCCACGGCTATTGAAATATCTGACCTCTGGGGTGCCAATTTAAATCATATTTTTATTAGTGGCTATACTGGTAATTCATCCGGTTCGGCCATCAGTTTATATAATGAAACTGGATGGACAGAAGGCACAGAATTGAACGATATCGTTATCCGACAATCTGTGAATGGTTTATGGCTGCACCGAAACCCGGTTTCTGGAAGTGGTGCGACAGATTCTTTCTTCAAAACGGTGGGGAATTTAGATATAAATGCAGGGGTTTCAGGCACTGCTATCAATTTTATAAAAATTGGTGATGGTACTTCAGCAGGTAAATGCCTTTTGTACGGACATGACATTATGCTAACCGGATGGATGAGTAATGGTTCTTGGCACAATGGCATTTTGGTGACGAATTACAGTACTTGCCTCAATGGTAAATTCCGTTTAAACTTTGATGGTTATGGTATTAGTTCTTCCGCTTCTTCTGAAGTAATCCATTTAATAAGACTTGGTGGTTCTGAATCATTATTTGATTGTGATGTCGTGAACACATCAGGACAAACAGATGGATACCCTCTTAGTATGTTGAAACTGTTGATGAATTCTTGTGTGTATTTAGATGACTCAACTGTTTTCGATTCGTCAATTATTAAGGGGCGTCCTTTGGTCAGGGCGAAAGGGCTTACAGTACGTTATACGGGGACATTTACCCAAGCTGAATGCATATCTGGTATGTCATACCAACTCAATGGGCTTGTTCCGGGCACTAAATTAAAAGTAACGTTGAGATCTTGGGGGAATAATTCTAAATATGAAGCAGTTACCCAATATTGGGATGTTGAAGTAAGAGGTACTGATTTGCCATGTATTGTTAAACCGACTTTGGCATCAGGGACATCTTTAACAACAAGTAATGTGAGCGGCGGTGTTATCAACGGAGTTACGACAGCAAAAGCGCAATTTATGCAAACTGCCAGCACTTCTGCTGACGCAGTGACTCAAGTAAATACAACCACAGCGAACGCTTTGGCATCAGCAACGTTAACAACATCCCAACTACAAATTAATAGTACATTCAATACAAGTTTGACTTTGACTAATGGTCAGGCTAACAATGGAATATCATACGCTGCAAACTCTGGAAGGAAAATTAATATTGTTCTTCCAGCTGATGCTGATGCAACTCAACCAATGCCGTACACAGTTGAAATAGAGGTATTATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
072b24fc25f6538368b31223e29194d85a822b37daf2e362575ac475aa67372e
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6221
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50