GenBank accession
YP_010675100.1 [GenBank]
Protein name
tail fiber protein
RBP type
Type Evidence Probability
TF GenBank 1.00
TF Phold 1.00
TSP DepoScope 1.00
TSP RBPdetect 0.91
TSP RBPdetect2 0.95
Protein sequence
MSHNTGNDLESSDILDFEDNCENLDVLMNTDQDHWTDRLGNKRPTIDYALRMAGFTPAGFDFTTGGVLKNGDRNKCVFNQADQTWYSWSGDLPYNVIAGSVPGEGWKVVNRHSLIIAREALRRTYQEVGLNLVEGSFEQGGTLVNSNDVLLHETSSKVYAGEVGSYPAGTEPVGATWIDMSAVSVRDQLSVNNGASMIGTSVGRTIENRLSDTIKVIDYKTLGRTYPEAIATCIAYAKELQASGKSVKISFENCDAFITTTVGFVLDFPVIWDQCKTRITGDGSIPTLLKCTAADIWLCNFSLRGVGDGDNTRLLWMSGAPSFRIQNGDVSNGKEGWWCDGGSYGGFIDVFTVNNNVSRGIYCDVVTSAEHSWRDVYVGLSKSYLTDSTVGIEMYSSSVIDSGGYHWYNVLVVNNTAAGAKAGKGVYLNGSNFQGITPIHWVGGGADGFIKANGKTGVNIRNWAQVKCVNVWTSNCEINSADSPQWIGGNCSTGFTLKGTIKDPAFIAPRCGEAVDGAYLLDPALTITGFIREYNTITSLITNNLTKWIFYTRNSLVKTTSTESSFTAERYSDQTGLTKFYRRINSVGENTFLKSDFTDHTVMRQNGQLYVPGGYAPFTGVHQMRSSEYIDNGKVVIFVNSEQIVDRLTVYNSELEVDEDVAVVTEGEVKLCNEDDSLLFAGIVVESQLIGDYYLVTVAASGDNSMPQLDCVAVDGDFSIGDLLSTSATGLLKRYEGNDLRIPCLRVKGVRNGKAYGYFL
Physico-chemical properties
protein length: 760 AA
molecular weight: 82994.79790 Da
isoelectric point: 4.86106
aromaticity: 0.10132
hydropathy: -0.18408
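
Two of the descriptors above can be approximated from the sequence alone. The sketch below is a minimal illustration, assuming that aromaticity is the fraction of F, W and Y residues and that hydropathy is the mean Kyte-Doolittle value (GRAVY). The page does not state which definitions it uses, and the function names are hypothetical.

```python
# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y); an assumed definition."""
    seq = seq.upper()
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY)."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 20 residues of the protein sequence listed above, used as a toy input.
fragment = "MSHNTGNDLESSDILDFEDN"
print(len(fragment), round(aromaticity(fragment), 5), round(gravy(fragment), 5))
```

Passing the full 760-residue sequence instead of `fragment` would give values to compare against the aromaticity and hydropathy listed above, provided the same definitions apply.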

Domains

Domains [InterPro]

No domain annotations available.


Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
[Figure: domain layout of YP_010675100.1, residues 1 to 760, showing the N-terminal, Central and C-terminal domains]
Domain Start End Length (AA) Confidence
N-terminal 1 224 224 0.9895
Central domain 225 552 329 0.9871
C-terminal 553 760 207 0.8711
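
As a quick consistency check on the table above, the sketch below recomputes each span from its Start and End values, assuming 1-based inclusive coordinates (an assumption; the page does not state its convention). Under that assumption the central domain spans 328 residues and the C-terminal domain 208, so the listed lengths of 329 and 207 may reflect a different boundary convention. The `domains` list and helper names are illustrative.

```python
# (name, start, end) taken from the Start and End columns of the table.
domains = [
    ("N-terminal", 1, 224),
    ("Central domain", 225, 552),
    ("C-terminal", 553, 760),
]


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive residue range."""
    return end - start + 1


def tiles_protein(domains, protein_length: int) -> bool:
    """True if the domains cover 1..protein_length with no gaps or overlaps."""
    expected_start = 1
    for _name, start, end in domains:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1


for name, start, end in domains:
    print(name, span_length(start, end))
print("covers 1-760:", tiles_protein(domains, 760))
```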
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1-224
Central 225-552
C-terminal 553-760

Taxonomy

  Name Taxonomy ID Lineage
Phage Aeromonas phage pAEv1810 [NCBI] 2908744 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Aeromonas sp. [NCBI] 647 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Aeromonadales

Coding sequence (CDS)

GenBank protein accession
YP_010675100.1 [NCBI]
GenBank nucleotide accession
NC_070999.1 [NCBI]
CDS location
range 180899 -> 183181
strand +
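
The CDS range can be related to the protein length with simple arithmetic. The sketch below is a minimal illustration, assuming the range uses 1-based inclusive genome coordinates and that the final codon is a stop codon that is not translated; the function names are hypothetical.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive genomic range."""
    return end - start + 1


def expected_protein_length(start: int, end: int) -> int:
    """Residue count implied by the range, excluding one terminal stop codon."""
    nucleotides = cds_length(start, end)
    if nucleotides % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return nucleotides // 3 - 1


# Coordinates from the CDS location above.
print(cds_length(180899, 183181))               # 2283 nt
print(expected_protein_length(180899, 183181))  # 760 residues
```

The result of 760 residues agrees with the protein length reported under the physico-chemical properties.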
CDS
ATGAGTCATAACACTGGTAATGATTTAGAATCTTCAGATATCTTAGATTTTGAAGATAACTGTGAAAACTTAGATGTTTTAATGAACACTGATCAAGACCACTGGACAGATCGTCTTGGTAACAAACGTCCTACAATTGATTACGCTTTGCGTATGGCTGGATTCACTCCTGCTGGTTTTGATTTCACTACAGGCGGTGTTTTAAAGAATGGTGATAGAAACAAGTGTGTATTTAACCAAGCTGATCAAACTTGGTATAGTTGGTCAGGAGATCTTCCTTATAATGTAATTGCTGGAAGTGTTCCTGGAGAAGGTTGGAAAGTTGTCAATAGGCATTCCTTAATCATAGCAAGAGAGGCACTTCGTAGAACTTATCAAGAAGTTGGATTAAATCTGGTTGAAGGTTCTTTTGAACAAGGTGGCACCCTTGTAAATAGTAATGATGTGTTACTCCACGAAACTAGTAGTAAGGTATATGCTGGTGAGGTAGGATCATATCCAGCTGGAACAGAACCTGTTGGGGCTACTTGGATAGATATGTCAGCTGTGTCTGTGCGGGATCAGTTATCTGTCAATAACGGCGCCTCGATGATTGGCACCTCGGTAGGCCGGACTATTGAAAACAGACTATCAGACACCATTAAAGTTATTGACTATAAAACCCTTGGTAGGACATACCCAGAGGCTATTGCGACCTGTATAGCCTACGCAAAGGAGCTACAAGCCTCTGGGAAGTCGGTAAAAATATCCTTCGAAAATTGCGATGCTTTTATAACAACGACAGTGGGGTTTGTTCTCGACTTCCCCGTTATCTGGGACCAGTGTAAAACTAGGATTACTGGCGATGGTTCTATACCCACCCTTTTAAAATGTACAGCTGCTGACATTTGGTTGTGTAACTTTAGCCTTCGCGGGGTGGGTGACGGCGACAACACTAGACTACTGTGGATGAGTGGCGCGCCTAGTTTCCGCATTCAGAACGGTGACGTCAGTAATGGTAAGGAGGGCTGGTGGTGTGATGGTGGATCTTACGGTGGATTCATTGATGTCTTCACGGTCAACAACAATGTGTCACGCGGGATATACTGTGATGTTGTGACCAGCGCAGAACATTCGTGGCGTGACGTGTACGTTGGGTTGTCGAAGTCATACCTAACCGACTCTACAGTTGGCATTGAAATGTACTCAAGCAGTGTGATCGATTCTGGCGGGTATCACTGGTACAACGTGTTAGTCGTAAATAACACGGCTGCTGGCGCGAAAGCTGGTAAAGGTGTTTATCTGAATGGGTCAAATTTCCAAGGCATTACTCCGATTCATTGGGTTGGTGGGGGCGCTGATGGTTTTATCAAAGCTAACGGTAAGACTGGCGTGAACATCAGAAACTGGGCGCAGGTTAAGTGCGTTAACGTCTGGACATCTAACTGCGAAATCAACAGCGCAGATAGCCCTCAGTGGATTGGTGGTAATTGTTCAACTGGTTTTACCCTCAAGGGTACTATTAAAGACCCAGCATTCATCGCCCCACGTTGCGGCGAGGCGGTAGATGGAGCTTATCTGCTAGACCCAGCGCTCACGATAACTGGATTTATCAGAGAGTATAATACAATCACCTCTCTAATAACCAATAACCTTACTAAATGGATTTTTTATACGAGAAACTCGCTTGTAAAAACCACATCAACAGAAAGCTCTTTTACAGCCGAAAGGTACAGTGATCAGACTGGGTTAACCAAGTTTTACCGTCGCATAAATTCTGTCGGTGAAAACACTTTCCTTAAAAGCGATTTCACCGACCACACAGTAATGCGACAAAATGGCCAGTTGTATGTCCCAGGTGGGTATGCGCCGTTCACCGGTGTTCACCAGATGCGTTCTAGTGAGTACATTGATAATGGGAAAGTTGTCATATTTGTGAATTCAGAACAAATTGTTGATCGTCTAACCGTATACAACTCTGAATTAGAAGTCGATGAGGATGTCGCTGTAGTGACAGAGGG
TGAGGTAAAGCTGTGTAACGAGGATGACTCTCTGTTGTTTGCTGGGATTGTAGTTGAGTCACAGCTAATCGGTGACTACTATCTCGTCACTGTTGCAGCATCTGGTGATAACTCAATGCCACAACTCGATTGCGTGGCTGTTGATGGTGATTTCTCAATTGGCGACCTATTATCAACGTCAGCAACAGGTCTGCTTAAGCGGTATGAAGGTAATGATTTGCGGATCCCTTGCTTGCGTGTGAAAGGAGTTAGAAACGGAAAGGCCTATGGGTACTTCCTTTGA

Genome Context

Tertiary structure

PDB ID
da11ebafd493cdc1aeafe3126630b157a87e7700c444668e101a1a2b7ee12d11
Source ESMFold
Method ESMFold
Resolution 0.6667
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
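
The confidence bands above can be applied to per-residue pLDDT scores as follows. This is a minimal sketch: the legend leaves the exact boundary values (90, 70 and 50) unassigned, so placing them in the lower band is an assumption, and `plddt_category` is a hypothetical helper name.

```python
def plddt_category(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Boundary values (exactly 90, 70 or 50) fall into the lower band here;
    the legend does not specify this, so it is an assumption.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


# Illustrative scores only; these are not values from the predicted model.
for score in (95.2, 81.0, 63.5, 42.1):
    print(score, plddt_category(score))
```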