UniProt accession
A0A1W6DXJ6 [UniProt]
Protein name
Tail fiber protein
RBP type Evidence Probability
TSP DepoScope 1.00
TF GenBank 1.00
TF Phold 1.00
TF RBPdetect 0.88
TF RBPdetect2 0.96
TF UniProt/TrEMBL 1.00
Protein sequence
MALYRTGTAAMDAQGVITGTGTKWREPLSLIRTGATIVFLDQPVIKLAVISEIVSNTEMKAISTDGAVVPDGKYVILLNDSLTVDGMAQDVAETLRYYQSKETVIEEAIEFFKNFDLKTLQDLVNKVNADAQQVANDKAATEQLKNETQQIKDSAVSETQQIKDAAVNETNQIKADTDAIKTQTQQIKDSAVSEIGSIKNESVNARDAAKESQLAAEQSKIAADSAKAGAETARDEARQWAQQVNPENLLHKDQNLNDLANKDLAREALKVEAVNSVKGQNPGDYNSFRNPTWTYELRIANNGEWRVARNSDNSTSALSVGAGGTGAENIEGARKNFRVQGVFCITSETPGDFNSIKSPDGKLNMLVANNGVWGVQDDYGNIKPLHIERGGTGAQSIGQARSNFGIGETDIPVFRGISLTEKNSANSGILYLLNKNAEGVQISYSRVYNEIQGGIAKATIQVTREGGDTNYYQFDESGNALNYNSITIGRGIGNALGSNSIVIGDTDTGFRQNGDGILQAIADNQVMFAFTKSSNVAYRTIQSFTPEDARFAYVEGARRGGANCFIGGHVEGGAFTAWRDRAAGMLVELPSDDVAVNVVKVVRWGGDWAFGIDVARYGAGGCETHFNVRGAVYGFNDAGYASAVQWVNTSDIRLKANLKEIESAKEKVKSIKGYTYFKRNNLDEDEYSFYSEEAGVIAQDVQTVLPEAVYKISDSEYLGVSYGGVTALLVNAINEMIDDSDKQNETIQKQQDEINELKNEVAEMKKMIEEMQSMFIQIAK
Physico‐chemical properties
protein length: 780 AA
molecular weight: 84843.18960 Da
isoelectric point: 4.81133
aromaticity: 0.07436
hydropathy: -0.41885

Domains

Domains [InterPro]
ID Class Range
DC_0313 STR 1–780
Coil Unmapped 117–137
IPR030392 CHP 650–747
IPR030392 CHP 650–710
Architecture
STR 1-780
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout (A0A1W6DXJ6, residues 1–780)
Domain Start End Length (AA) Confidence
N-terminal 1 351 351 0.9863
Central domain 352 550 200 0.2844
C-terminal 551 780 229 0.8359
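The Length (AA) column can be cross-checked against the Start and End columns. The sketch below assumes the coordinates are 1-based and inclusive, which the source does not state; the helper names are illustrative only. Under that assumption the central domain computes to 199 residues and the C-terminal domain to 230, one off from the tabulated 200 and 229, although both versions sum to the full 780 residues. This may simply reflect a different boundary convention in the segmentation tool.

```python
# Domain rows as tabulated above: (name, start, end, listed length).
# Assumption: coordinates are 1-based and inclusive.
domains = [
    ("N-terminal", 1, 351, 351),
    ("Central domain", 352, 550, 200),
    ("C-terminal", 551, 780, 229),
]


def inclusive_length(start: int, end: int) -> int:
    """Number of residues in a 1-based, inclusive range."""
    return end - start + 1


def check_domains(rows):
    """Return (name, listed length, computed length) for every row."""
    return [
        (name, listed, inclusive_length(start, end))
        for name, start, end, listed in rows
    ]


for name, listed, computed in check_domains(domains):
    flag = "" if listed == computed else "  <- differs from table"
    print(f"{name}: listed {listed}, computed {computed}{flag}")
```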
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring: N-terminal 1-351, Central 352-550, C-terminal 551-780

Taxonomy

  Name Taxonomy ID Lineage
Phage Citrobacter phage CF1 DK-2017 [NCBI] 2267237 Uroviricota > Caudoviricetes > Drexlerviridae > Tlsvirus > Tlsvirus DK2017
Host Citrobacter freundii [NCBI] 546 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Genbank protein accession
ARK07609.1 [NCBI]
Genbank nucleotide accession
KY694971 [NCBI]
CDS location
range 13391 -> 15733
strand -
CDS
ATGGCTTTATATCGCACTGGAACCGCCGCGATGGACGCGCAAGGCGTTATAACTGGTACTGGCACAAAATGGCGTGAACCTTTGTCATTAATCCGCACTGGTGCAACCATTGTATTTCTTGATCAGCCAGTTATTAAACTTGCTGTTATTAGTGAGATTGTTAGCAACACTGAAATGAAAGCAATTAGCACTGATGGCGCTGTTGTTCCTGATGGAAAATATGTGATCCTGTTAAATGATTCTTTGACTGTTGACGGAATGGCGCAAGATGTAGCGGAAACGCTGCGTTATTACCAAAGTAAAGAAACGGTAATTGAGGAAGCAATTGAATTCTTTAAAAACTTCGATCTGAAAACATTGCAAGATTTAGTTAACAAGGTTAATGCTGATGCTCAACAGGTAGCAAACGACAAAGCGGCAACTGAGCAACTAAAAAATGAAACGCAACAGATTAAGGATTCTGCTGTATCTGAGACTCAACAGATTAAGGATGCTGCTGTCAATGAAACCAATCAGATTAAGGCGGACACTGACGCAATAAAAACGCAGACGCAACAAATAAAAGATAGCGCCGTTTCTGAAATTGGATCAATTAAAAACGAGTCTGTTAATGCTCGCGATGCAGCAAAGGAATCACAACTTGCCGCTGAACAATCAAAGATTGCCGCCGACTCTGCAAAGGCTGGCGCTGAGACTGCGCGTGATGAAGCTCGCCAATGGGCACAACAAGTTAACCCTGAAAACCTTCTGCATAAGGATCAGAATCTTAATGATCTTGCTAATAAAGATTTAGCAAGGGAGGCGCTGAAAGTTGAGGCTGTTAATTCAGTTAAGGGGCAAAATCCAGGTGATTATAACTCATTTCGTAACCCTACATGGACTTATGAATTACGGATCGCAAACAACGGCGAGTGGCGAGTTGCAAGAAATAGCGATAACAGCACATCCGCTCTTTCTGTTGGTGCTGGCGGCACTGGTGCGGAAAATATTGAAGGGGCAAGAAAAAACTTTCGTGTGCAAGGTGTTTTTTGCATTACTAGCGAAACACCAGGAGATTTTAACTCCATAAAATCACCTGATGGAAAACTAAATATGCTTGTAGCTAACAATGGTGTGTGGGGTGTTCAAGATGATTATGGAAACATCAAACCATTACACATCGAACGAGGAGGCACTGGCGCTCAAAGCATAGGGCAAGCAAGGAGTAATTTTGGCATTGGTGAAACTGACATTCCTGTTTTTAGAGGAATAAGCCTAACAGAAAAAAACTCTGCAAACTCCGGCATACTTTATTTGTTAAACAAAAATGCAGAAGGAGTACAGATTTCGTATTCAAGGGTTTACAACGAAATTCAGGGTGGTATTGCAAAGGCAACTATTCAAGTAACTAGAGAAGGTGGCGACACAAACTATTATCAGTTTGATGAAAGTGGCAACGCGCTGAATTACAACTCAATAACAATCGGTAGGGGGATTGGTAATGCTCTTGGTAGTAACTCTATAGTAATTGGAGATACTGATACTGGATTTAGGCAAAATGGAGATGGCATTCTTCAAGCTATTGCTGATAATCAAGTGATGTTTGCATTTACAAAGAGTAGCAATGTTGCATACAGAACTATTCAATCATTTACTCCAGAAGATGCACGTTTTGCTTATGTTGAAGGTGCTAGGAGGGGTGGTGCGAATTGCTTTATAGGTGGACATGTTGAGGGTGGTGCTTTTACTGCATGGCGTGATCGCGCCGCTGGTATGCTTGTTGAACTTCCAAGCGATGATGTAGCCGTTAACGTTGTTAAGGTTGTAAGGTGGGGTGGTGATTGGGCTTTTGGTATAGATGTTGCCAGATATGGCGCTGGAGGTTGCGAAACTCATTTTAATGTAAGGGGCGCTGTTTATGGGTTTAACGATGCTGGTTATGCGTCTGCTGTGCAATGGGTTAACACTTCCGATATTCGCCTGAAAGCAAACCTAAAAGAGATTGAAAGCGCTAAAGAAAA
AGTGAAATCAATAAAAGGTTACACTTATTTTAAGCGCAACAATCTTGATGAAGATGAATATTCTTTCTATTCGGAGGAGGCTGGTGTAATAGCTCAAGACGTGCAAACTGTTTTACCGGAAGCGGTTTACAAGATTTCAGATTCAGAATATTTAGGTGTTAGCTATGGCGGTGTAACTGCTCTTTTGGTTAACGCAATTAATGAAATGATTGATGATTCAGATAAGCAGAATGAAACCATTCAGAAGCAACAAGATGAAATCAATGAGCTAAAAAATGAAGTAGCAGAAATGAAAAAGATGATCGAGGAAATGCAATCAATGTTTATTCAGATTGCTAAGTAA
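The CDS coordinates can be reconciled with the reported protein length by simple arithmetic. The sketch below assumes the range is 1-based and inclusive and that the final codon is a stop codon (the sequence above ends in TAA); the function names are illustrative. Because the feature lies on the minus strand, the coding sequence shown corresponds to the reverse complement of the forward genomic strand over that range, so a small reverse-complement helper is included.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive range."""
    return end - start + 1


def expected_protein_length(nt_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return nt_length // 3 - 1


# Complement table for unambiguous DNA bases only.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Reverse complement, as needed for a minus-strand feature."""
    return seq.translate(_COMPLEMENT)[::-1]


nt = cds_length(13391, 15733)
print(nt, expected_protein_length(nt))
```

With the coordinates listed for this entry, the range spans 2343 nt, i.e. 781 codons, which matches the 780 AA protein plus one stop codon.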

Genome Context

Gene Ontology

GO ID Description Category Evidence (source)
GO:0098015 virus tail Cellular Component IEA:UniProtKB-KW (UniProt)

Tertiary structure

PDB ID
14150a3d0bc7597d1ae749edfb6f23fb97612257d647d1597b2b7e726b209f99
ESMFold
Source ESMFold
Method ESMFold
Resolution 0.6614
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
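The confidence bands above can be expressed as a small lookup. This is a minimal sketch with a hypothetical function name; it assumes pLDDT is on a 0 to 100 scale and, since the legend uses strict inequalities on both sides, that a score exactly on a boundary (90, 70 or 50) falls into the lower band.

```python
def plddt_band(score: float) -> str:
    """Map a per-residue pLDDT score (0-100) to its confidence band."""
    if not 0.0 <= score <= 100.0:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        # Assumption: exactly 90 is placed here, not in "Very high".
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for s in (95.2, 81.0, 63.5, 42.0):
    print(s, plddt_band(s))
```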