UniProt accession
A0A077SK37 [UniProt]
Protein name
Tail fibrer protein GpS
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,89
TF
Evidence RBPdetect2
Probability 0,96
Protein sequence
MNDVTVVTSVTYPSPESLALVADVQYHEPYLSAALNRKFRGIVDPGFYAGFLPKPGGGMNLLITSVDGDKTAGAASVDIGEFYQVTIQQRKDISLALSAGKKYAIVLKGRYLLGEDTYQVNTASHIHAAEFVARTYTDSYQLGDGELLVCTVNIPAGVSAITQEMIDTSERINRTIGIDISDSVTSTRSDVAASSLAVKKAYDLAKSKYTAQDASITQKGLVQLSSATNSDSETMAATPKAVKSIKDLADTKAPIESPSLTGTPSAPTAAQGTNSTQIANTAFVKAAITALINGAPGTLDTLKEIAAAINNDPNFSTTVNNALALKAPLASPALTGIPTAPTAAQGTNNTQIATTAYVRAAISALVGSSPEALDTLNELAAALGNDPNFATTMTNALAGKQPLDATLTALAGLATGANKLPYFTGKDTVAQTDLTSVGRDILAKTSVLAVIQYLGLRELGTSGEKIPLLSTANTWSARQTFNGGITGALTGNADTATKLKTAININGVRFDGSTNISIPTITSRGRVTALTGTTQGAATGLQMYEAYNNGYPTTYGNVLHLKGAASTGEGELLIGWSGTNGAHAPAFIRSKRDITAAAWSEWAQIYTSKDSVPGVNTKGNQDTSGNAATATKLQTACTINGVSFDGSKNIELTAADLNLEQTVELAAGALQKNQNGADIPGKDTFTKNIGACRAYSAWLNIGGDSQVWTTAQFISWLESQGAFNHPYWMCKGSWAYANNKVITDTGCGNICLAGAVVEVIGTRGAMTIRVTTPSTSSGGGITNAQFTYINHGDAYAPGWRRDYNTKNQQPAFALGQTGSRVANDKAVGWNWNSGVYDADISGASTLILHFNMNAGSCPAVQFRVNYKNGGIFYRSARDGYGFEANWSEFYTTTRKPSAGDVGAYTQAECNSRFITGIRLGGLSSVQTWNGPGWSDRSGYVVTGSVNGNRDELIDTTQARPIQYCINGTWYNAGSI
Physico‐chemical
properties
protein length:975 AA
molecular weight: 102226,79430 Da
isoelectric point:6,30853
aromaticity:0,07692
hydropathy:-0,16267

Domains

Domains [InterPro]
DC_0070
STR
50–330
IPR005068
STR
174–212
IPR051934
Unmapped
326–611
A0A077SK37
1 975
Architecture
STR
RBD
STR 50-845 | RBD 907-974 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage RCS47
[NCBI]
1590550 Uroviricota > Caudoviricetes > Punavirus >
Host Escherichia coli
[NCBI]
562 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
CDN90784.1 [NCBI]
Genbank nucleotide accession
FO818745 [NCBI]
CDS location
range 39592 -> 42519
strand -
CDS
ATGAATGACGTTACAGTTGTCACATCGGTTACTTACCCATCACCCGAGTCGTTGGCTCTGGTGGCTGATGTGCAATACCACGAACCATATCTGTCAGCCGCGCTAAACCGAAAATTCAGGGGGATTGTTGACCCGGGATTTTATGCCGGTTTCTTACCTAAGCCTGGCGGTGGGATGAACCTGTTAATCACCTCAGTGGATGGTGATAAAACCGCAGGCGCGGCGTCGGTGGATATTGGTGAATTCTACCAGGTAACTATTCAGCAACGTAAGGATATTTCTCTTGCACTTAGTGCAGGCAAGAAATATGCAATTGTGCTGAAGGGAAGATACCTCCTTGGAGAAGATACCTATCAGGTGAATACCGCGTCACATATTCATGCGGCTGAATTTGTTGCCAGAACCTATACCGATTCATATCAGTTAGGAGATGGGGAGCTGCTTGTTTGTACGGTGAATATCCCTGCTGGTGTATCTGCCATTACCCAGGAGATGATTGATACATCCGAGCGTATCAACCGCACGATCGGCATTGATATTTCAGACTCTGTAACCAGTACCAGAAGTGATGTTGCTGCAAGTTCGCTGGCAGTTAAAAAAGCCTACGATCTGGCGAAAAGCAAGTATACGGCGCAGGATGCAAGCATAACGCAAAAGGGATTAGTTCAGCTCAGTAGCGCAACTAACAGCGACAGCGAAACAATGGCGGCTACCCCTAAAGCTGTTAAGTCTATAAAAGATCTGGCTGATACCAAAGCGCCAATAGAAAGCCCGAGTCTGACAGGAACGCCAAGCGCGCCGACGGCAGCGCAAGGTACAAACAGCACGCAGATAGCAAATACAGCCTTTGTTAAGGCAGCTATAACTGCACTTATCAACGGTGCACCTGGCACACTGGATACACTTAAAGAAATAGCTGCTGCGATCAATAACGACCCGAATTTCAGCACAACTGTCAACAATGCTCTGGCCCTTAAAGCGCCTTTAGCAAGTCCTGCATTAACGGGAATACCTACTGCGCCTACCGCTGCACAGGGTACGAATAACACGCAGATTGCTACGACCGCTTATGTAAGAGCTGCCATATCCGCATTGGTTGGTTCATCACCAGAAGCTCTTGATACCCTGAATGAGCTTGCCGCAGCACTTGGTAATGACCCGAACTTTGCGACAACAATGACAAATGCGCTGGCAGGCAAACAGCCTCTGGATGCAACTTTAACCGCGCTCGCTGGCCTTGCGACTGGTGCAAACAAACTGCCTTATTTCACCGGTAAGGATACGGTAGCGCAGACTGATTTAACGTCAGTCGGTCGCGATATTCTGGCCAAAACAAGCGTTCTTGCTGTTATCCAATACCTTGGTTTAAGAGAACTCGGTACCAGTGGTGAAAAGATCCCCCTGTTGAGCACGGCTAACACATGGAGTGCACGCCAGACTTTTAACGGCGGGATCACCGGGGCGCTGACAGGGAACGCCGACACCGCGACGAAATTAAAAACAGCCATAAACATTAATGGCGTCAGATTCGATGGTTCTACGAACATTTCGATACCAACAATTACGTCTAGAGGACGCGTTACTGCGCTTACCGGTACAACGCAAGGTGCTGCTACTGGATTGCAGATGTATGAGGCATACAACAATGGTTATCCGACGACTTACGGGAATGTACTTCACCTGAAGGGAGCTGCATCCACTGGTGAAGGCGAGTTGCTCATTGGCTGGAGTGGCACAAATGGCGCTCATGCACCAGCTTTCATTCGATCCAAAAGAGATATCACTGCTGCGGCATGGTCCGAATGGGCACAGATCTATACGTCAAAAGATTCCGTTCCCGGCGTTAATACCAAAGGGAATCAGGACACCTCTGGTAATGCGGCTACAGCGACCAAATTGCAGACGGCGTGTACTATCAACGGTGTCTCGTTTGACGGTTCTAAAAATATTGAGCTAACGGCGGCAGATTTAAATCTTGAGCAAACTGTAGAATTAGCCGCAGGAGCATTACAGAAAAACCAGAACGGCGCAGATATTCCGGGAAAAGATACCTTTACCAAAAATATTGGTGCCTGCCGCGCATATAGCGCATGGCTGAATATTGGTGGCGATAGTCAGGTCTGGACAACCGCGCAATTTATTTCGTGGCTGGAGAGTCAGGGAGCATTTAACCATCCTTACTGGATGTGCAAAGGCTCATGGGCTTATGCAAATAATAAGGTCATTACAGATACAGGTTGCGGAAATATTTGTCTTGCAGGTGCTGTGGTGGAAGTTATTGGCACTCGCGGCGCAATGACCATACGCGTTACTACGCCGAGCACGTCCAGCGGTGGCGGAATTACTAACGCTCAATTCACTTATATTAATCATGGTGATGCTTATGCTCCTGGCTGGCGAAGAGACTACAACACGAAAAACCAGCAGCCTGCATTTGCTTTAGGGCAAACAGGAAGCAGGGTTGCAAATGATAAAGCTGTTGGCTGGAACTGGAATAGCGGCGTTTATGATGCAGATATCAGTGGCGCATCGACATTAATCCTCCACTTCAATATGAATGCGGGGAGTTGCCCTGCTGTACAGTTCCGCGTGAATTATAAGAACGGCGGTATCTTTTATCGTTCAGCGCGTGATGGTTATGGCTTTGAAGCTAACTGGTCAGAGTTTTACACCACAACCCGCAAACCCTCTGCGGGGGATGTTGGTGCATATACGCAGGCAGAATGTAACTCAAGGTTTATTACAGGTATTCGCCTGGGCGGTCTGTCATCTGTTCAGACATGGAATGGTCCCGGCTGGTCTGACAGGTCAGGTTATGTCGTTACAGGTTCAGTTAACGGAAACCGTGATGAATTAATTGATACAACTCAGGCAAGGCCAATTCAGTATTGCATTAATGGGACGTGGTATAACGCGGGGAGTATTTAA

Genome Context

Genome Context

Gene Ontology

Description Category Evidence (source)
GO:0098024 virus tail, fiber Cellular Component IEA:UniProtKB-KW (UniProt)
GO:0046718 symbiont entry into host cell Biological Process IEA:UniProtKB-KW (UniProt)
GO:0019062 virion attachment to host cell Biological Process IEA:UniProtKB-KW (UniProt)

Tertiary structure

PDB ID
ff2679394f32c9912914bfeadd65928a7785316e777cc3d7b33a67dce2d58fc1
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6295
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50