GenBank accession
YAF79240.1
Protein name
long tail fiber protein distal subunit
RBP type
Type Evidence Probability
TSP DepoScope 1.00
TSP RBPdetect 0.80
TF RBPdetect2 0.75
Protein sequence
MNPGELAINLADQYLLTKNDSGAIINLSCPPVYDRDVTMAGKVKGNNYILSKTANYLEDQTARDLNYFGAFRTNGLDGFLELTLNVPHSSGVQHGRGFTFQYGHTGSRVETYGYNREGQKAFSYKMYHEGDKPTPGELNVYSKQEVDRMFVKNVKLATVPGDTVEGYFKLATAMIPQNGRSVFFRIHGGNGYNVTAYDQVDIVEIVIRSGNNRPKGVNVIAYRRNTNKSFDVLAVNTSGDNYDIYVKYQRYTDNVIVEFGKSVDVNLVVHDVPDFVAERPAGDNVIGGRAVTLFNTENKRGVLSFDDNTQNSYDIVHLSNDKGTGRKYIRKFRSNYNEMIWHETVQGSSYRLATGSTDAQEIMTIESSSSIAGTHKGNIISGRMLLNGGSNAITLRRPAGQSNHIAFQDNRTGDITRQGWIGYANADTDVFEWYSDVGGSSIRQHIDGQIEFQTGNMKRVYTNGQFISLYADGFRTVYGNYGSWWRNDGSNVYLMSTKSGDAMGPWNTFRPFIYSLANGNVTLGGSDAGNHLMRLNNENRQVEINATTHLGSRLYWERSEGSASRFFIKNWGNGTSRAQVWELADETGYHVYSQRNDAGVLQFRTAGNFETGGDAQVNGTLRVTNVLRTDNQIQIYRDNNKELWFKDANGTNRGVIWGDSTGQMRIRNYNSGEFDHVFQTGMIRLERGYANGSDRGLIRGEVQGGAWSEWKTRAAGLLVDCPAAQTSAYNVWKATKWGLDHIAAMGVHVPSGVITNAMARLHIHTTNFDFNASGDFQAGRNGNFNDVYIRSDARLKINKEEYKENATDKVNRLTVYTYDKVKSLTDRTVIAHEVGIIAQDLEKELPEAVTTSKVGDPDKPEEILTISNSAVNALLIKAFQEMSEELKVVKAELAELKKN
Physico-chemical properties
protein length: 899 AA
molecular weight: 100322.24100 Da
isoelectric point: 6.72544
aromaticity: 0.10234
hydropathy: -0.55106
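Two of the values above can be derived from the protein sequence alone. The sketch below assumes that aromaticity is the fraction of F, W and Y residues and that hydropathy is the mean Kyte-Doolittle score (GRAVY). The page does not state which definitions it used, and the function names are illustrative.

```python
# Kyte-Doolittle hydropathy scale. Assumption: the page does not name its scale.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First 20 residues of the protein sequence, used only as a short demo input.
    prefix = "MNPGELAINLADQYLLTKND"
    print(f"aromaticity: {aromaticity(prefix):.5f}")
    print(f"hydropathy:  {gravy(prefix):.5f}")
```

Passing the full 899-residue sequence instead of the prefix gives values that can be compared with the ones listed above.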

Domains

Domains [InterPro]
DC_0932 STR 7–553
IPR048390 ATT 460–512
DC_0000 STR 496–899
Architecture
STR 7-459 | ATT 460-512 | STR 513-899
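The architecture line above is consistent with resolving the three overlapping InterPro hits into non-overlapping segments. One rule that reproduces it, offered as an assumption rather than the database's documented method, is to assign each residue to the shortest hit that covers it and then merge runs of identical labels. The function name `collapse_hits` is hypothetical.

```python
def collapse_hits(hits, protein_length):
    """Resolve overlapping (label, start, end) hits into non-overlapping segments.

    Coordinates are 1-based and inclusive. Where hits overlap, the shortest
    hit wins (an assumed priority rule, not taken from the source page).
    """
    labels = [None] * (protein_length + 1)  # index 0 is unused
    for pos in range(1, protein_length + 1):
        covering = [h for h in hits if h[1] <= pos <= h[2]]
        if covering:
            labels[pos] = min(covering, key=lambda h: h[2] - h[1])[0]

    segments = []
    for pos in range(1, protein_length + 1):
        label = labels[pos]
        if label is None:
            continue  # residue not covered by any hit
        if segments and segments[-1][0] == label and segments[-1][2] == pos - 1:
            # Extend the current run of the same label.
            segments[-1] = (label, segments[-1][1], pos)
        else:
            segments.append((label, pos, pos))
    return segments


if __name__ == "__main__":
    hits = [("STR", 7, 553), ("ATT", 460, 512), ("STR", 496, 899)]
    print(" | ".join(f"{l} {s}-{e}" for l, s, e in collapse_hits(hits, 899)))
```

Under this rule, residues 1-6 stay unassigned because no hit covers them.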

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
Domain Start End Length (AA) Confidence
N-terminal 1 224 224 0.1326
Central domain 225 423 199 0.4607
C-terminal 424 899 476 0.7312
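Segment lengths can be cross-checked against the boundaries. The sketch below assumes the Start and End columns are 1-based and inclusive, so each length is end - start + 1, and it checks that the segments cover the 899-residue protein without gaps or overlaps. The helper names are illustrative.

```python
def inclusive_length(start: int, end: int) -> int:
    """Number of residues in a 1-based, inclusive interval."""
    return end - start + 1


def tiles_protein(segments, protein_length: int) -> bool:
    """True if the segments cover 1..protein_length with no gaps or overlaps."""
    expected_start = 1
    for _name, start, end in segments:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1


if __name__ == "__main__":
    segments = [
        ("N-terminal", 1, 224),
        ("Central domain", 225, 423),
        ("C-terminal", 424, 899),
    ]
    for name, start, end in segments:
        print(name, start, end, inclusive_length(start, end))
    print("tiles full protein:", tiles_protein(segments, 899))
```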
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1-224
Central 225-423
C-terminal 424-899

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage ETE131P06 3459904 Viruses >
Host Escherichia coli 562 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

GenBank protein accession
YAF79240.1
GenBank nucleotide accession
PX104378
CDS location
range 69241 -> 71940
strand +
CDS
ATGAATCCGGGTGAGTTGGCGATCAACTTAGCCGACCAATATTTGCTTACTAAAAACGACTCAGGTGCTATTATCAATTTAAGTTGTCCTCCGGTGTATGACCGCGATGTTACAATGGCTGGTAAAGTTAAAGGTAATAATTATATCTTAAGTAAAACCGCTAACTATCTGGAAGATCAGACAGCGCGAGATCTTAACTACTTTGGTGCTTTCCGTACCAATGGTCTTGATGGTTTTCTAGAACTAACGCTAAACGTTCCTCATTCTTCTGGTGTTCAGCATGGTCGAGGATTTACTTTCCAGTATGGTCACACTGGATCGCGAGTAGAAACCTATGGCTATAATAGAGAAGGTCAAAAAGCATTCAGCTATAAAATGTATCACGAAGGTGATAAACCAACTCCAGGAGAATTGAACGTTTATAGCAAACAAGAAGTAGATAGAATGTTTGTTAAGAACGTTAAACTTGCTACAGTTCCTGGTGATACAGTTGAAGGCTATTTTAAATTAGCAACTGCAATGATTCCACAAAATGGTCGTAGTGTATTTTTCCGTATTCATGGTGGTAACGGATATAACGTTACTGCATACGATCAAGTTGATATTGTAGAAATTGTTATTCGCAGTGGAAATAATCGCCCTAAAGGTGTTAACGTTATTGCATATCGCCGAAATACAAACAAATCATTTGATGTTTTGGCTGTTAATACTTCTGGTGATAACTATGATATCTATGTGAAATATCAGCGTTACACTGATAACGTTATTGTTGAATTTGGTAAAAGTGTTGATGTTAATCTGGTAGTTCATGACGTTCCAGATTTTGTTGCTGAACGTCCTGCTGGTGATAATGTTATTGGCGGTCGCGCGGTAACTCTTTTCAACACCGAAAATAAACGTGGTGTGTTGAGTTTTGACGATAACACACAAAATAGCTATGATATTGTTCACTTGAGTAATGATAAAGGTACTGGACGAAAATATATTCGTAAATTCCGTAGCAACTATAATGAAATGATCTGGCATGAGACTGTTCAAGGTTCCAGTTATCGTCTGGCTACTGGTAGCACTGATGCTCAGGAAATTATGACTATTGAATCTAGTAGCTCAATTGCTGGAACTCATAAAGGTAATATTATTTCTGGTCGTATGCTGTTAAATGGCGGATCTAATGCCATTACACTACGCCGACCTGCTGGTCAATCTAATCATATTGCGTTTCAAGATAATCGTACTGGAGATATTACCCGTCAAGGATGGATCGGTTATGCAAACGCCGATACTGACGTTTTTGAATGGTATAGTGATGTAGGTGGCAGTTCTATTCGTCAACATATCGACGGACAGATCGAATTTCAGACAGGTAACATGAAGCGAGTTTATACCAACGGTCAATTCATTTCTTTATATGCTGATGGCTTCCGTACTGTATATGGTAACTATGGATCATGGTGGCGAAATGACGGTAGTAACGTTTATCTGATGAGCACAAAATCAGGCGACGCTATGGGTCCGTGGAATACCTTTAGGCCGTTCATATACAGTCTCGCCAATGGAAACGTTACTCTAGGTGGTAGCGATGCTGGCAATCATTTAATGCGTCTCAATAATGAAAATCGTCAAGTAGAAATAAATGCAACAACACATTTAGGGTCTCGATTATATTGGGAACGTAGCGAAGGATCGGCATCTAGATTTTTTATTAAGAATTGGGGTAATGGAACTTCTCGCGCTCAAGTGTGGGAACTTGCTGACGAGACAGGTTATCATGTATATTCTCAGCGAAACGACGCGGGTGTTCTTCAATTCCGTACTGCTGGTAATTTTGAGACTGGTGGTGATGCTCAAGTAAACGGGACTTTACGTGTCACCAATGTTCTTCGAACAGATAACCAAATTCAGATTTATCGCGATAACAACAAAGAACTCTGGTTTAAGGATGCTAATGGTACTAACCGTGGTGTTATCTGGGGTGATAGCACAGGTCAAATGCGCATTCG
AAACTATAACAGCGGAGAATTTGACCATGTTTTCCAGACTGGTATGATTCGCTTAGAACGTGGGTATGCAAACGGTAGCGACCGTGGTTTGATTCGCGGAGAAGTACAAGGTGGTGCTTGGTCAGAATGGAAAACTCGCGCTGCTGGCTTATTGGTTGACTGTCCAGCTGCTCAAACTTCCGCATATAACGTATGGAAAGCGACCAAATGGGGTTTAGACCACATCGCGGCAATGGGCGTTCATGTTCCTAGTGGTGTTATTACTAATGCTATGGCACGTCTTCATATTCATACCACAAACTTTGACTTTAACGCATCTGGTGATTTCCAAGCTGGTCGCAATGGTAACTTTAACGATGTTTACATTCGCTCTGATGCTCGCCTGAAAATCAATAAGGAAGAGTATAAGGAGAATGCCACCGATAAAGTTAATCGCTTGACGGTATACACCTATGACAAGGTTAAATCTTTAACCGACCGTACTGTCATTGCTCATGAAGTCGGTATTATTGCTCAGGATCTTGAAAAAGAATTGCCGGAAGCAGTAACAACTTCTAAGGTCGGCGATCCTGATAAGCCAGAAGAGATCTTAACAATTTCTAACTCTGCTGTCAACGCTCTTTTAATTAAGGCGTTTCAGGAAATGAGCGAAGAATTGAAAGTCGTTAAAGCTGAACTAGCGGAACTTAAAAAGAATTAA
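The CDS coordinates and the protein length can be checked against each other. The sketch below assumes the range is 1-based and inclusive and that the CDS ends with one stop codon, and it translates with the standard codon assignments. The function names are illustrative, and only the first and last few codons of the CDS are used as demo input.

```python
from itertools import product

# Standard genetic code, codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive CDS range."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate complete codons of a plus-strand CDS; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    n_nt = cds_length(69241, 71940)
    print("CDS length (nt):", n_nt)
    print("codons excluding stop:", n_nt // 3 - 1)
    print(translate("ATGAATCCGGGTGAGTTGGCGATCAACTTA"))  # first 10 codons
    print(translate("AAAAAGAATTAA"))  # last 4 codons, including the stop
```

If the sequence is read from a wrapped source, line breaks need to be removed before translation so that codons are not split.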

Genome Context

Tertiary structure

PDB ID
b0ec2ef6c84c3adc5d19b46743acf7234696d89a5232406558280983101ab13e
Source ColabFold
Method ColabFold
Resolution 0.8131
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
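The confidence bands above map a per-residue pLDDT score to a label. A minimal sketch of that mapping follows. The legend does not say which band the exact values 90, 70 and 50 belong to, so assigning them to the lower band is an assumption here, and `plddt_band` is a hypothetical helper name.

```python
def plddt_band(score: float) -> str:
    """Return the confidence band for a pLDDT score on the 0-100 scale.

    Boundary values (exactly 90, 70 or 50) fall into the lower band,
    which is an assumption: the legend leaves them unspecified.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for value in (95.2, 81.0, 63.5, 42.0):
        print(value, plddt_band(value))
```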

Literature

Title: A machine learning approach to predict strain-specific phage-host interactions
Authors: Camejo,P.Y., Leon,L.E., Rojas,F., Ossa,A., Hurtado,R., Tichy,D., Pieringer,C., Pino,M., Mora-Uribe,P., Ulloa,S., Norambuena,R., Tobar-Calfucoy,E., Aguilera,M., Rojas-Martinez,V., Cifuentes,O., Sabag,A., Cifuentes,N., San Martin,D., Infante,C., Cifuentes,P. and Pieringer,H.
Source: GenBank