Genbank accession
YP_009818125.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,94
Protein sequence
MNAYSKWFEVLRVGGNYNTMHYFAGTGGDTFDINFAGGYLDKAHVKAISDDGGIELSLTFITKSRVKLSRPIAKGYTVLIFRDTPKVVPLALFRDGAVMNSVNLDRNAKQAVFVAAEMLDRFDVFGSSIENSVDQISEALEIAARAENMANAASETADSAKSDAKTALDMYTLINDSVRKYSGTLVGDGSGTAIPALEDSDLNKQAQAILNRFEELGGPKGFGVVGGISSDVVVKVPSMFSTLQGAFNHLHSTYKQSTSTRKIILIESGHSPMSGIVCKYGDFSNIWIRSEDAVVNIPKLFTGGPNYLTYSNSLIHVEFAAGPVLDCLFNAQDNVGCGYLLGVGSSGNIRPGKGVRYVSQVGLLVSDGSRAIADGAIFDYAVEQGMHVTAFGDVTAAYADFGNCRGGEGSSCASVVRNSRAYLRYAKMNGSLNGYGLRTSSESTVDAYEAEFKGNYRGAMRTTKSHISAPGAVVQKGIDGACLMTNGGSVFIDRMVDETGAAVDISFFPLNSAFNVWNSYGAVFQTSSSVQAFSETGDLKSNLHDWGKGLLFGRVSGSATNEYSGRQSGDSIIKFPGSDTPALMAISRFGVGRAHIAAWDSGSGGYKQWNQLMTQLEPRAGASITLAADNAYDFGTAAFRGRIGYFAQGVQTTSDARLKSDVRPLTPSELRASSAIAKTIGVFTWLAESSDRLHVGTTVQAVIKCLEDEGLDPMQYGFVCHDSWEASEGDGFTEPREAGDIYSLRDHELYKFLVRGLEQRISNLEG
Physico‐chemical
properties
protein length:766 AA
molecular weight: 82085,91800 Da
isoelectric point:5,51335
aromaticity:0,09661
hydropathy:-0,13760

Domains

Domains [InterPro]
IPR005604
ATT
26–121
DC_1478
STR
82–657
IPR030392
CHP
654–711
IPR036388
RBD
654–766
IPR030392
CHP
654–766
YP_009818125.1
1 766
Architecture
ATT
STR
RBD
ATT 26-121 | STR 122-657 | RBD 658-766
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_009818125.1
1 766
Domain Start End Length (AA) Confidence
N-terminal 1 242 242 0,9913
Central domain 243 523 282 0,9802
C-terminal 524 766 242 0,9636
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-242
Central
243-523
C-terminal
524-766

Taxonomy

  Name Taxonomy ID Lineage
Phage Aeromonas phage ZPAH7
[NCBI]
2420320 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Aeromonas hydrophila
[NCBI]
644 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Aeromonadales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_009818125.1 [NCBI]
Genbank nucleotide accession
NC_048133.1 [NCBI]
CDS location
range 4504 -> 6804
strand -
CDS
ATGAACGCATACTCAAAATGGTTTGAAGTCCTCCGTGTTGGGGGCAACTACAACACAATGCATTACTTTGCAGGTACAGGCGGCGACACCTTCGATATCAACTTCGCAGGCGGTTACCTGGACAAGGCGCACGTTAAGGCAATCTCAGACGATGGGGGTATTGAACTTTCCCTAACATTCATCACTAAGAGCCGCGTAAAGCTCTCAAGACCAATCGCAAAGGGATATACCGTTCTCATCTTCAGAGACACTCCGAAGGTAGTACCACTTGCCCTGTTTAGGGATGGTGCTGTGATGAACAGCGTTAATCTTGACAGGAACGCGAAGCAGGCAGTGTTCGTCGCTGCGGAGATGTTAGACCGGTTTGATGTGTTCGGGTCTAGTATCGAGAACTCAGTAGACCAGATTTCTGAGGCACTTGAAATAGCTGCGCGGGCAGAGAACATGGCGAACGCTGCTTCAGAAACGGCGGACTCTGCAAAGTCAGATGCAAAAACAGCACTTGACATGTATACTTTAATCAACGATTCAGTTAGGAAGTATTCCGGCACACTAGTTGGGGATGGTAGCGGAACTGCTATACCTGCACTTGAAGATTCAGACCTTAACAAGCAAGCACAGGCCATACTGAATCGTTTCGAGGAGCTAGGTGGTCCAAAGGGCTTTGGTGTTGTTGGTGGTATTTCGTCTGATGTAGTTGTAAAAGTGCCAAGCATGTTCAGCACACTTCAAGGGGCATTCAACCACTTACATTCGACATATAAACAGTCTACTTCAACACGCAAGATTATACTTATAGAGTCGGGACATTCCCCCATGTCCGGCATAGTTTGTAAGTACGGCGACTTCTCTAATATTTGGATTCGCTCAGAGGATGCTGTTGTTAATATACCTAAGTTGTTTACAGGCGGGCCAAACTACCTTACATATAGCAACTCACTAATCCATGTGGAGTTTGCAGCAGGCCCCGTGTTAGATTGCCTGTTTAATGCCCAGGATAACGTAGGTTGCGGGTACTTACTAGGTGTAGGTTCTTCAGGGAATATACGCCCAGGTAAAGGTGTGCGCTATGTGTCACAGGTAGGGCTTCTGGTATCAGATGGCTCACGGGCCATAGCGGACGGTGCAATATTTGACTATGCTGTAGAGCAGGGTATGCACGTTACAGCATTCGGTGATGTGACAGCGGCATATGCGGACTTCGGAAACTGTAGGGGAGGCGAGGGTTCTTCTTGTGCATCCGTCGTTAGGAATTCCAGGGCATATCTGCGCTACGCTAAAATGAACGGCTCTCTGAATGGTTATGGACTACGGACATCCAGTGAGTCTACAGTTGACGCATATGAAGCAGAGTTTAAAGGGAACTACCGGGGCGCAATGCGTACAACAAAGTCCCACATATCTGCACCCGGCGCTGTTGTGCAGAAGGGGATTGATGGGGCCTGCCTAATGACAAATGGTGGCTCTGTGTTCATAGACAGGATGGTAGATGAGACAGGGGCAGCAGTAGATATATCGTTCTTCCCACTAAATAGTGCATTTAATGTGTGGAACAGTTACGGTGCTGTGTTCCAGACGTCGAGTAGCGTTCAGGCATTTTCTGAAACAGGCGACCTGAAGTCTAACCTACATGATTGGGGTAAAGGGCTTCTGTTTGGACGTGTTAGCGGCTCAGCAACAAACGAGTATAGCGGGCGCCAATCAGGCGACTCAATTATCAAGTTCCCTGGTTCAGATACCCCTGCACTTATGGCAATTAGTCGCTTCGGTGTAGGGCGTGCTCATATAGCTGCGTGGGATTCTGGTTCTGGTGGGTATAAACAGTGGAACCAGCTAATGACTCAGCTTGAGCCAAGAGCAGGTGCCTCTATAACCCTAGCCGCAGATAATGCCTATGATTTCGGAACAGCTGCATTCCGGGGTAGGATTGGGTACTTTGCACAGGGCGTTCAGACAACTTCAGATGCCAGGCTTAAGTCAGATGTCCGCCCCCTCACACCTAGTGAGCTGCGTGCTTCGTCTGCAATTGCAAAGACAATCGGCGTGTTCACGTGGTTGGCTGAATCCAGTGACCGGCTACACGTAGGTACTACAGTGCAGGCTGTAATTAAGTGCCTTGAAGATGAAGGTCTTGACCCAATGCAGTATGGGTTCGTTTGTCATGACTCCTGGGAAGCATCCGAGGGTGACGGGTTCACAGAGCCTCGGGAAGCAGGGGATATCTACTCACTTCGTGACCATGAGCTTTATAAGTTCCTGGTGCGTGGACTTGAACAACGTATAAGTAATTTGGAGGGTTAG

Genome Context

Genome Context

Tertiary structure

PDB ID
4f8216634087c69451411dbdf4eaf74cbffcccefc8f412caf9feb1949b621245
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7468
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50