GenBank accession: YP_009842427.1
Protein name: tail fiber protein

RBP type  Evidence        Probability
TSP       DepoScope       1.00
TF        GenBank         1.00
TF        Phold           1.00
TSP       RBPdetect       0.91
TSP       RBPdetect2      0.94
TF        UniProt/TrEMBL  1.00

Protein sequence
MLQSVKISELPSADTLTEDDLIVIDQPDDTKKATLFQVLNHLEDVVEQSTLVTLAQPDGFKNIGQCANLTVLRSLQFSSVGKQVFLKEHTTGQNAGGGIWYCHSLTSDSSYVDDNGCQIINDYGQVIRRKDVKELTSSYFGLQAGDTIDTVLTNMYKASRTFNIYEAKIENPGFDKGYVLQGGLRFYCGDKPFYIHSYSIGTLRGPNIWHTGNNVGVTFSRFKEDGTSQQAWSGGGIKGFRIWGAASYLVQGNTGADSTPVRLSDMWQGEACDLWVTGYTGNTNGAVVSLYNEFAWTEGALVENIMVRQSLRGLTFLRKHGTTATDSFFRTVADISFNAGVSGQSTQVMVVGDGTAAGECLVYGHDIKLTQWMSAGSWHDIVRLEDYSIIAETGVIKIVADGYGISKTTVPATEVVHSINVRGLNARFRSRVENWSNQAGGWGLDFLNIIFQSSMYTNAMTFYESDFDALPTINPVGMKVRFNGTFTVAERQSGKVYTLNGLIPGMTLKVKLTSRNGNDLNDAVVQEWKVFVRSTNLPCIVVPMSGSANIATTDGLAVTNTSPVQTATFLKTVTPTQARNFIGQNYGLTVKNANDDNSLSYALNSGRKIRFVLPANPTATTTTPYSVEIEVL
Physico-chemical properties
protein length: 632 AA
molecular weight: 69069.83490 Da
isoelectric point: 5.68585
aromaticity: 0.09810
hydropathy: -0.15949
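
The record does not say which tool produced these values. As a minimal sketch, assuming aromaticity is the fraction of F, W, and Y residues and hydropathy is the mean Kyte-Doolittle score (GRAVY), two of the properties could be recomputed from the protein sequence as shown below. The function names are illustrative and not part of the database. Under that assumption, an aromaticity of 0.09810 over 632 residues would correspond to 62 aromatic residues.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W, or Y (assumed definition)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First 20 residues of the protein sequence above, used as a short demo.
    prefix = "MLQSVKISELPSADTLTEDD"
    print(len(prefix), round(aromaticity(prefix), 5), round(gravy(prefix), 5))
```

Passing the full 632-residue sequence instead of the prefix would give values directly comparable to the ones listed.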

Domains

Domains [InterPro]
DC_0471 (ATT): residues 4–179
IPR059934 (RBD): residues 65–124
Sequence: YP_009842427.1, residues 1–632
Architecture
ATT 4-179 | STR 180-632
Legend: ATT, STR, RBD, CBM, LEC, ENZ, CHP, LNK, TAS, TTP, UNK, Unmapped

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout (YP_009842427.1, residues 1-632)
Domain          Start  End  Length (AA)  Confidence
N-terminal      1      148  148          0.9941
Central domain  149    455  308          0.9726
C-terminal      456    632  176          0.9795
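
The coordinate convention of the segmentation is not stated. As a minimal sketch, assuming 1-based inclusive coordinates (length = end - start + 1), the check below recomputes each domain length and confirms that the three domains tile residues 1 to 632 without gaps or overlaps. Under that assumption the recomputed lengths are 148, 307, and 177, so the central and C-terminal entries each differ by one from the listed 308 and 176, although both sets sum to 632. The record alone does not show whether the boundaries or the lengths are the intended values. The names `DOMAINS`, `inclusive_length`, and `tiles_protein` are illustrative.

```python
# (name, start, end, listed_length) as given in the segmentation table.
DOMAINS = [
    ("N-terminal", 1, 148, 148),
    ("Central domain", 149, 455, 308),
    ("C-terminal", 456, 632, 176),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive residue range."""
    return end - start + 1


def tiles_protein(domains, protein_length: int) -> bool:
    """True if the domains cover 1..protein_length with no gaps or overlaps."""
    expected_start = 1
    for _name, start, end, _listed in domains:
        if start != expected_start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1


if __name__ == "__main__":
    for name, start, end, listed in DOMAINS:
        computed = inclusive_length(start, end)
        flag = "" if computed == listed else "  <- differs from listed value"
        print(f"{name}: computed {computed}, listed {listed}{flag}")
    print("tiles 1..632:", tiles_protein(DOMAINS, 632))
```
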
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal: 1-148
Central: 149-455
C-terminal: 456-632

Taxonomy

Phage: Proteus phage Mydo
  Taxonomy ID: 2483610
  Lineage: Uroviricota > Caudoviricetes > Vequintavirinae > Mydovirus mydo >
Host: Proteus mirabilis
  Taxonomy ID: 584
  Lineage: cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

GenBank protein accession: YP_009842427.1
GenBank nucleotide accession: NC_048741.1
CDS location: 115854 -> 117752, strand +
CDS
ATGCTACAGAGTGTTAAAATTTCGGAGTTGCCAAGTGCGGACACTCTAACAGAAGATGACCTGATCGTAATTGATCAACCAGATGATACTAAGAAGGCAACTTTATTTCAGGTTTTAAATCATCTTGAAGATGTCGTTGAGCAGTCAACACTGGTGACTTTGGCCCAACCTGACGGGTTTAAAAATATTGGTCAGTGTGCTAACCTCACTGTCTTAAGGTCTTTGCAGTTTTCATCTGTTGGCAAACAAGTGTTTCTGAAAGAACATACTACTGGACAGAATGCTGGCGGAGGTATTTGGTATTGCCATTCCCTGACGTCCGATAGTTCATACGTCGATGATAATGGATGCCAGATTATTAATGATTATGGACAGGTCATCCGCCGTAAGGATGTTAAAGAGTTAACCTCAAGCTATTTCGGGTTACAGGCTGGCGATACTATCGACACTGTTCTCACCAACATGTACAAGGCTTCGCGTACATTTAACATTTATGAAGCAAAAATTGAGAACCCCGGATTTGATAAAGGGTATGTATTGCAAGGTGGACTGAGGTTCTATTGTGGGGATAAGCCTTTTTATATCCACTCCTATTCCATAGGTACTTTGCGCGGTCCAAACATTTGGCATACTGGTAATAATGTTGGGGTTACGTTCTCTCGTTTCAAAGAAGATGGTACGTCACAACAGGCATGGTCTGGGGGAGGAATCAAAGGCTTCAGAATTTGGGGTGCTGCGTCTTATCTTGTACAAGGTAATACCGGTGCAGATTCTACACCAGTTCGCCTGTCGGATATGTGGCAAGGAGAGGCCTGTGACTTGTGGGTTACTGGATATACAGGTAACACCAACGGTGCAGTGGTATCTCTGTATAACGAATTTGCGTGGACGGAAGGCGCTCTGGTTGAAAACATCATGGTCCGTCAATCCCTACGAGGACTTACATTTTTGCGTAAGCACGGGACAACGGCAACGGATTCATTTTTCAGAACAGTAGCTGATATTTCATTCAATGCTGGCGTGTCAGGTCAGTCAACTCAGGTAATGGTTGTAGGTGACGGGACTGCTGCTGGTGAGTGTCTTGTGTACGGGCACGACATTAAACTGACTCAGTGGATGAGCGCCGGGTCATGGCATGATATTGTTCGTCTTGAGGACTACAGCATTATCGCGGAGACTGGCGTTATTAAAATAGTCGCGGATGGGTATGGTATATCCAAAACCACCGTGCCAGCGACAGAGGTTGTTCACTCCATTAACGTCCGCGGACTCAATGCCAGGTTCAGGAGTAGAGTGGAAAACTGGTCAAATCAGGCGGGTGGTTGGGGATTGGACTTCCTGAACATCATCTTCCAGTCCAGCATGTACACTAACGCCATGACTTTTTATGAGTCTGACTTTGATGCGCTGCCAACCATTAACCCTGTCGGGATGAAGGTGCGATTTAACGGCACGTTCACTGTTGCCGAGCGGCAGTCTGGAAAGGTTTACACCCTCAACGGGCTAATTCCAGGGATGACGCTGAAAGTCAAGCTGACATCTCGCAACGGTAACGACCTCAACGACGCCGTTGTTCAGGAGTGGAAGGTATTTGTTCGCAGCACTAACTTACCTTGTATAGTCGTTCCTATGTCAGGTAGTGCAAACATCGCAACTACCGATGGGTTGGCTGTTACCAACACCAGTCCAGTACAAACAGCGACGTTCCTTAAAACAGTGACACCTACTCAAGCCAGAAATTTTATTGGTCAGAACTATGGCCTGACAGTTAAGAATGCGAACGATGACAACAGCCTGTCATATGCACTTAACTCGGGACGTAAGATTCGTTTTGTTCTTCCGGCTAACCCAACTGCAACTACAACCACTCCATACTCTGTTGAGATAGAGGTACTTTAA
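
The CDS coordinates, the CDS itself, and the protein sequence can be cross-checked against each other. The sketch below assumes the range is 1-based and inclusive and uses the standard codon assignments (which the bacterial translation table shares). The span 115854 to 117752 is then 1899 nt, that is 632 codons plus one stop codon. The helper names are illustrative, and only the first and last few codons of the CDS are translated here to keep the example short.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def span_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive range."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a plus-strand CDS codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    print(span_length(115854, 117752))      # nucleotides covered by the CDS
    print(translate("ATGCTACAGAGTGTTAAA"))  # first six codons of the CDS
    print(translate("GAGGTACTTTAA"))        # last four codons of the CDS
```

Translating the full CDS string and dropping the trailing `*` should reproduce the 632-residue protein sequence if the record is internally consistent.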

Genome Context

Tertiary structure

PDB ID: d5a8b650118266cd4f8c1a7c9dc44ee44a9c769fd16c7f1b4440821840df37c1
Source: ESMFold
Method: ESMFold
Resolution: 0.6886
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
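
The confidence legend maps per-residue pLDDT scores to four bands. A minimal sketch of that mapping follows. The legend uses strict inequalities and does not say where scores of exactly 90, 70, or 50 fall, so placing them in the lower band is an assumption made here, and `plddt_band` is an illustrative name.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Assumption: a score exactly on a threshold (90, 70, 50) is placed in
    the lower band, since the legend leaves boundary values unspecified.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.2, 81.0, 63.5, 42.0):
        print(score, plddt_band(score))
```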