Genbank accession
XTK85805.1 [GenBank]
Protein name
hypothetical protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
Protein sequence
MIVNPNKSLIPVSVRTNGVGVVLDAFIVNDGNIQAGGSGPSKYEYSLVDIALNGRWPRSNGILNPDIPKPDDDQDDRIDWESKIAQAKREIAIELAKYDEWAAAYVKAHTNKRGVVHGETKENVGMGKKDNFPMGTLQEQHDGLLDNVFVSPSGMRTLIESRLRIDDRTYIPSGVFPVNASGILGDVPMWSYDCEIGELAQSPTNPIEHLGETPFEFSTPTGLLIFPSINNSPVQGRYTQPANGTLPTVYTPWGGTKVRNYNGRVDTRRTRPSFIRGWGLGPYRGDLVKKPSALFDNNAIYYMEGDRVKIRSFNKIELPFDTAEGIYGSTSLLDGIVQYSAEFLYNLKSGIHNRPYTGETGNVPHLLFQYEASKLAGVDLEIVAGPPLHAPEHVVNSSVTYRGDVTMVPAHGKIKVVELVPQVTSIAIPFKELLNFGPADIQLWYDNLNQKLAKKISFGWRNRMRMIGSVRIPLGWYNKDKTKYWNGYIDFEIEFTPNETTRTYRTDIITNGPWTLPKQTLDANWDLVGEGMFKAFEPTTANDPLHPLCMSGIFEPRGGHYKTYTLYNRQYIGFYEHDLTKALEFNDLGTKKPNSINTFNYVTQGTINTDGIYGDHLRHIPIKVADAGRLVTYLTQIRDARNRYRWAFVNVDNDHEVVQNRPYGNYIGPTVLSMSWANETVLTVPSFLIENDDLSSDMNINNMVFNTSNHFSNYLSISIAENREVAIVTGKTVSIDQSVIDWVKEFGGNWFKVNTVFFYFKGNLYWLNQCLNANEYPDNGKDCFYGIIKNCELIYDADGNAVVINKNPILDSVTINSTNVMVKSSLEIKQSDIIGLDRFESQDIYLMKSGEVGNEETWNVMINVGPFNNFYVHFDLRRNHQNNVTTFKPSTTMVDPVFPWDPAKGFQIDYDKVVKYGKDIPHRLHINFQSPVMLTKGMWRFCKTPNDYCFHTRRSGQVKVAGGVMGTFRGVCTHPVGSVTTITGKNTVIQKPVGIRLSDSVPYDEIFAKQNGDNIDLVSYVKQGEQVTERAVPVGWINGGEFSYYDPYGWRNAGFPVIDGVRQNFFGNGNSFPTFFGKPGSGVPVNRFFLTNTVTTFLWDTYLGRIIPTIASRNVEITINGTVYNTNGAQSFIIPNNHVNVVTITIKYLETIKWAPGLTNLKTIGVEVMSLDFSGSTNFTIEAPLPKRIYSLKGLLKGATGSSYPGLATWDTSNVVDISELLMNTTNFNTPLTWVLTNCRNMNQFAKNSRKYDQPITEAMNLVNVIDMSEAFMGAWVFNSTITGKFTRCKTIARMFASSRLFNKSLNTAHFPAVTDIDGLFQDAYVYDTSINDIDIPSVEKASNVLKLARVFNKPISVSWNQCKDVSGFLSGTVLFNSSVEMTLPVCYKWSSFLENAKVFNTALPSWFFPYGSDVSSMLAGTEKFNQPITFNFSRVQKAGSFLKGSKLFQQDLPNCKFTLAEDVSYFFANTSYNGQIPGWEFATDDTIEVDASGMFNNNPAFNRDISDWNTIRFVLLTAMFDGATVFNQDVGKWNYSHVRSLSRFAKNAKAFNPDLSKIDVSKVEDFSEMVYGASEFQSDVSLWDVQNGLNFTRTFSDVPKFNSEQADWRPRKAVVMDYMYAGALIYNRDLRLWSVENVTSHVNFDQNCPAWVLPRPRWPA
Physico‐chemical
properties
protein length:1661 AA
molecular weight: 187021,13360 Da
isoelectric point:6,22662
aromaticity:0,12161
hydropathy:-0,29862

Domains

Domains [InterPro]
IPR005046
STR
1290–1379
XTK85805.1
1 1661
Architecture
STR
RBD
STR
RBD
STR
RBD
STR 1-1379 | RBD 1380-1390 | STR 1391-1480 | RBD 1481-1490 | STR 1491-1581 | RBD 1582-1659 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
XTK85805.1
1 1661
Domain Start End Length (AA) Confidence
N-terminal 1 182 182 0,9645
Central domain 183 794 613 0,9013
C-terminal 795 1661 866 0,0136
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-182
Central
183-794
C-terminal
795-1661

Taxonomy

  Name Taxonomy ID Lineage
Phage Aeromonas phage AhC3_1
[NCBI]
3411828 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Aeromonas hydrophila
[NCBI]
644 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Aeromonadales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
XTK85805.1 [NCBI]
Genbank nucleotide accession
PV158447 [NCBI]
CDS location
range 206802 -> 211787
strand +
CDS
ATGATTGTCAATCCGAATAAATCACTTATCCCCGTTTCAGTAAGAACGAATGGGGTAGGGGTGGTGTTGGATGCGTTCATTGTGAACGACGGAAATATCCAAGCAGGTGGAAGTGGGCCTAGTAAATACGAATACTCACTAGTTGACATTGCACTAAATGGAAGATGGCCGAGATCGAACGGGATACTCAACCCGGATATCCCGAAACCTGATGACGACCAGGACGATCGTATCGATTGGGAAAGTAAGATCGCACAGGCTAAGAGAGAAATTGCGATTGAACTTGCTAAATACGACGAGTGGGCTGCTGCTTATGTTAAGGCACACACAAATAAGCGTGGTGTTGTACATGGGGAGACAAAGGAAAACGTCGGGATGGGGAAGAAAGACAACTTCCCGATGGGAACTCTCCAAGAACAACACGACGGACTACTTGACAACGTATTTGTTTCGCCAAGTGGGATGAGGACACTGATCGAATCACGTTTAAGAATCGATGACCGCACCTACATCCCTTCTGGGGTATTCCCAGTGAATGCTAGTGGGATTTTGGGTGATGTTCCCATGTGGTCTTACGACTGCGAGATTGGAGAGCTTGCGCAGTCTCCTACTAACCCAATTGAACATCTTGGTGAAACTCCCTTCGAGTTCTCAACCCCAACAGGACTTTTGATTTTCCCAAGCATTAACAACTCGCCCGTGCAAGGACGGTATACCCAACCCGCCAATGGAACACTTCCGACGGTTTATACTCCTTGGGGTGGTACAAAGGTAAGAAATTACAACGGTCGTGTAGATACGCGTCGAACCAGACCTTCCTTTATTCGTGGTTGGGGATTGGGTCCATATCGCGGGGACCTAGTTAAGAAGCCTTCCGCACTTTTTGACAACAATGCAATTTACTACATGGAAGGAGACCGTGTTAAGATTCGTAGTTTCAACAAGATTGAACTCCCCTTTGATACTGCCGAAGGTATTTATGGCAGTACCTCTCTTCTTGATGGAATCGTTCAATATAGCGCAGAATTTCTTTACAATTTGAAGTCGGGTATTCATAACCGCCCATATACCGGGGAAACCGGGAACGTCCCCCACCTTCTTTTCCAATACGAAGCAAGCAAGTTGGCTGGAGTAGATTTGGAAATTGTTGCCGGCCCACCCCTCCATGCTCCTGAACATGTCGTCAATAGTTCAGTAACTTATCGTGGTGACGTCACAATGGTTCCTGCTCATGGGAAGATAAAGGTGGTTGAACTCGTTCCTCAGGTCACCAGTATCGCAATCCCCTTTAAAGAGCTTCTAAACTTTGGGCCAGCAGATATACAACTCTGGTACGACAATCTTAACCAGAAGTTAGCCAAGAAGATTTCATTTGGCTGGCGCAACCGCATGAGAATGATCGGTTCAGTTCGGATCCCGTTGGGTTGGTACAATAAGGATAAAACCAAGTACTGGAATGGTTATATTGACTTTGAAATTGAATTTACCCCTAATGAGACTACCAGGACATATCGGACAGACATTATCACTAATGGACCTTGGACTCTCCCTAAACAAACTTTGGATGCTAACTGGGATTTGGTGGGGGAAGGGATGTTCAAAGCCTTTGAACCGACCACCGCTAATGACCCACTCCACCCGTTGTGTATGTCAGGGATCTTTGAACCACGAGGTGGCCACTACAAAACGTACACGTTGTATAACCGCCAGTATATCGGGTTTTATGAACACGACTTGACTAAAGCGCTGGAGTTCAATGATCTCGGTACTAAGAAACCGAACAGTATTAATACGTTTAACTACGTTACCCAAGGGACAATTAATACAGACGGGATTTATGGGGATCACCTTCGACACATCCCGATTAAAGTAGCCGATGCAGGTAGGTTGGTAACTTATTTGACACAAATCAGGGACGCTAGAAACCGTTATCGTTGGGCATTTGTAAATGTTGATAATGATCACGAAGTTGTTCAGAATAGACCGTATGGGAACTACATCGGCCCTACTGTGCTCAGTATGAGTTGGGCAAATGAAACAGTATTGACAGTGCCGTCTTTCCTGATCGAGAATGACGACTTGTCTTCTGACATGAACATTAACAACATGGTCTTTAATACCAGCAACCATTTCAGCAATTACCTCTCTATCTCCATTGCCGAGAACAGGGAGGTTGCAATAGTCACTGGGAAGACGGTTTCTATTGACCAAAGCGTGATCGACTGGGTGAAAGAGTTCGGTGGTAACTGGTTTAAAGTTAACACGGTGTTCTTCTACTTTAAAGGCAACCTTTATTGGCTGAACCAGTGTCTGAATGCTAACGAATATCCCGATAACGGGAAAGATTGTTTCTACGGGATTATTAAGAACTGTGAATTGATTTATGATGCTGATGGGAACGCAGTCGTCATCAACAAGAACCCGATCCTTGATTCCGTGACAATCAACAGTACTAACGTGATGGTTAAAAGTTCCTTGGAAATTAAGCAGTCTGACATTATTGGTTTGGACCGATTTGAATCCCAGGACATTTACCTAATGAAGTCAGGGGAGGTAGGGAATGAAGAAACGTGGAACGTGATGATTAACGTCGGACCGTTCAACAACTTCTACGTCCACTTTGATCTTCGACGTAATCACCAAAACAACGTTACCACGTTTAAACCAAGCACGACAATGGTTGACCCTGTATTCCCTTGGGATCCTGCAAAGGGGTTCCAGATTGACTACGACAAAGTGGTCAAGTACGGAAAGGATATTCCACATCGACTTCATATTAACTTCCAGTCCCCCGTGATGTTGACTAAAGGTATGTGGCGGTTTTGTAAAACGCCCAATGACTATTGTTTCCATACCAGACGGTCTGGCCAGGTTAAAGTAGCAGGTGGGGTAATGGGGACCTTCCGTGGGGTTTGTACACACCCTGTTGGATCGGTGACTACCATCACCGGAAAGAATACGGTGATTCAGAAACCAGTCGGGATACGTCTAAGTGACTCTGTTCCTTATGATGAGATTTTTGCAAAACAGAATGGCGATAACATCGATCTCGTTAGTTATGTTAAACAAGGCGAACAGGTCACAGAACGAGCGGTCCCAGTCGGTTGGATTAATGGCGGGGAATTTTCCTACTATGATCCGTATGGGTGGAGAAATGCTGGGTTTCCAGTGATTGATGGTGTCCGACAAAATTTCTTTGGTAACGGTAATTCGTTCCCAACGTTCTTCGGTAAGCCTGGTTCCGGTGTTCCTGTGAACAGATTCTTCTTGACAAATACAGTCACTACGTTCCTGTGGGACACGTATTTGGGAAGGATAATTCCGACGATAGCAAGTCGGAACGTCGAAATTACTATTAACGGCACTGTTTACAACACCAACGGCGCTCAGTCATTCATCATTCCCAATAACCATGTCAACGTGGTCACCATCACGATCAAGTACTTGGAGACGATTAAGTGGGCGCCCGGGTTAACCAACTTGAAGACCATCGGGGTCGAAGTGATGAGCCTGGACTTCTCAGGTTCGACTAACTTCACCATTGAAGCTCCCTTGCCTAAACGGATTTACTCGTTGAAAGGGTTACTGAAAGGTGCCACGGGATCAAGTTATCCGGGACTAGCAACTTGGGATACCAGTAACGTCGTTGATATATCCGAATTGTTGATGAACACCACCAATTTCAATACCCCATTGACGTGGGTTCTCACCAACTGTCGGAATATGAACCAGTTCGCTAAAAACAGTCGGAAATACGACCAACCCATCACTGAGGCGATGAACTTGGTTAACGTTATCGACATGTCGGAGGCGTTTATGGGTGCTTGGGTTTTTAACAGTACGATCACGGGTAAGTTCACTCGTTGTAAAACAATCGCTAGGATGTTTGCTAGCAGTCGGTTGTTTAACAAGTCGTTGAACACTGCCCACTTCCCAGCGGTCACTGACATTGATGGACTTTTCCAAGACGCTTACGTATACGACACGTCTATTAACGACATAGATATCCCTTCCGTAGAGAAAGCATCTAACGTACTGAAATTGGCAAGGGTGTTCAATAAACCAATTTCAGTCTCTTGGAATCAGTGTAAAGATGTTTCTGGTTTCTTGAGTGGGACCGTCTTGTTCAACAGTTCTGTGGAAATGACATTACCGGTTTGCTATAAGTGGAGTAGCTTCTTGGAGAACGCTAAGGTATTTAACACGGCACTCCCAAGTTGGTTCTTCCCTTATGGCAGTGACGTCAGTTCCATGTTGGCAGGGACAGAGAAATTTAACCAACCAATCACGTTTAATTTCTCCCGTGTTCAAAAGGCAGGATCTTTCCTCAAAGGAAGTAAGTTATTCCAGCAAGATCTCCCTAACTGTAAATTTACTTTGGCCGAAGATGTCTCGTACTTTTTTGCAAACACGTCTTACAACGGCCAAATACCGGGATGGGAATTCGCTACCGACGACACTATTGAGGTAGATGCTTCTGGGATGTTTAATAACAACCCAGCGTTCAACCGAGATATCAGTGACTGGAATACAATAAGGTTTGTTCTATTGACCGCGATGTTCGACGGAGCCACTGTGTTCAACCAGGACGTCGGGAAGTGGAACTATAGTCACGTCAGATCACTGAGTCGATTTGCTAAGAATGCGAAAGCATTTAACCCAGATCTGAGTAAGATAGATGTGAGTAAGGTGGAGGACTTCTCTGAAATGGTTTATGGTGCATCTGAGTTCCAGAGCGACGTTTCTCTCTGGGATGTTCAGAATGGACTGAACTTTACCAGGACATTTTCAGATGTTCCCAAATTCAACTCTGAACAAGCTGACTGGAGACCACGTAAAGCAGTGGTGATGGATTACATGTACGCTGGCGCGCTCATCTACAACCGAGATCTTCGTCTCTGGTCAGTAGAGAACGTAACAAGTCATGTGAACTTTGACCAAAACTGTCCCGCATGGGTGCTTCCGCGCCCACGTTGGCCTGCATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
2fffb606a33cd041e245ae7d163b94f7e86cbf517db864dfffce1e4b50ab00b7
ColabFold
Source ColabFold
Method ColabFold
Resolution 0,6697
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50