GenBank accession
QLF80463.1 [GenBank]
Protein name
tail fiber
RBP type
TF (evidence: GenBank, probability: 1.00)
TF (evidence: RBPdetect, probability: 0.56)
Protein sequence
MPPAIIGAAIAIGASAAAAAGWIATGVALAIGMTATVAGALLTKTPNTNFNAYKGQQERKQVLRAAAAARTVVYGTTVASGVMTFAEEQPGDQDEDELLHMALVVASHPLEGIGDIWLGDDLIQTFEQYVSWEFHNDRQTVDPFMRDNCPSWKDDMIGKGIAWLRISFKFNAEKFPSGLPNIKLLKRGRRVYDPRSGYTVFSDNAALVILDYFRTYLKRSDEHINWEQFKEAANICDEYVSNADGSTERRYRINGEFEVDEAPAKILDAMLQACGGELTYIAGKHGLLVGAYYGPATMTLDESCISGDIKIIPETSYKERTNTITGTFIDPKQNYVEADFPPVVVKEWVEKDGAEITQDMDFRFVTSEYQAQRLSNIILRRKRVGRTIEIPCNMKGYKFRPGMYVKVTISQIGMKNVEMRVTKWSFNPKGGVDITLRQDFLEMWDDAIGKPMDRPDLVDLPSGGVAQPQNLQYQILQISDVVQGVLSWTNMGQVAYNRVAVRQGGVTIWTAQVPGQSVRVTGLLRGAYTAHVQAVSHGGAISPEAYLEFNIQAPPPPSSVEVQNGYFSLTLIPKISELTSVSTQFDFWTSGETKLSSTNTDVVERDATREGMGTVWQKNNLLNDHVYYWYVRTINAFGSSSFVEVEARVWAETGQIIEIMDKEFQETETFKNLMETVQKVDDGVTELSKVVEGQASKIDSVQTSVGDVVVTVQQQSQALADLNGKLSAQWGQKVQIDSNGNKYVAGMQLGIEGSGGQFQSYFMVSADNFAVYNHVVGAAQLAFAIKNGQAFLRDGFIENGSITNAKIGNVIQSNNWNGNDQGWAITKDGYCVFNNVTVRGTVVANAGQFGFNGNGGITIDSNGIRVPLSGGGVVIVGRW
Physico-chemical properties
protein length: 879 AA
molecular weight: 96578.80090 Da
isoelectric point: 5.13122
aromaticity: 0.09556
hydropathy: -0.20796
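
The record does not say how the aromaticity and hydropathy values were derived. A minimal sketch, assuming aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle score (GRAVY), is shown below. Both definitions are assumptions, as are the function names. Under the first assumption the listed aromaticity would correspond to about 84 of the 879 residues.

```python
# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = set("FWY")  # assumed definition of "aromatic" residues


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W or Y."""
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 22 residues of the protein sequence in this record, as a short demo.
peptide = "MPPAIIGAAIAIGASAAAAAGW"
print(len(peptide), round(aromaticity(peptide), 5), round(gravy(peptide), 5))
```

Running the same two functions on the full 879-residue sequence would show whether these assumed definitions reproduce the listed values.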

Domains

Domains [InterPro]
DC_0187 STR 1–840
IPR057587 ATT 556–661
IPR000727 STR 660–722
IPR015406 RBD 711–848
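
Several of the annotated ranges above overlap. The sketch below lists the pairwise overlaps, assuming the coordinates are 1-based and inclusive. The tuple layout and the `overlap` helper are illustrative, not part of the source database.

```python
from itertools import combinations

# (identifier, type, start, end) as listed in this record; 1-based inclusive (assumed).
domains = [
    ("DC_0187", "STR", 1, 840),
    ("IPR057587", "ATT", 556, 661),
    ("IPR000727", "STR", 660, 722),
    ("IPR015406", "RBD", 711, 848),
]


def overlap(a_start: int, a_end: int, b_start: int, b_end: int) -> int:
    """Number of residues shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a_end, b_end) - max(a_start, b_start) + 1)


for (id_a, _, s_a, e_a), (id_b, _, s_b, e_b) in combinations(domains, 2):
    shared = overlap(s_a, e_a, s_b, e_b)
    if shared:
        print(f"{id_a} and {id_b} share {shared} residues")
```

For example, the ATT range 556–661 and the STR range 660–722 share two residues (660 and 661).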
QLF80463.1 (residues 1–879)
Architecture
STR 1-555 | ATT 556-661 | STR 662-848
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
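
The architecture string can be checked against the protein length with a few lines of code. This is a sketch under the assumption that segment coordinates are 1-based and inclusive. `parse_architecture` is a hypothetical helper that tolerates a trailing separator.

```python
def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Parse 'LABEL start-end | LABEL start-end' into (label, start, end) tuples."""
    segments = []
    for chunk in text.split("|"):
        chunk = chunk.strip()
        if not chunk:  # skip the empty piece left by a trailing '|'
            continue
        label, span = chunk.split()
        start, end = (int(x) for x in span.split("-"))
        segments.append((label, start, end))
    return segments


architecture = "STR 1-555 | ATT 556-661 | STR 662-848"
protein_length = 879  # from the physico-chemical properties of this record

segments = parse_architecture(architecture)
mapped = sum(end - start + 1 for _, start, end in segments)
unmapped = protein_length - mapped
print(segments)
print(mapped, unmapped)
```

The three segments cover 848 residues, which leaves 31 residues (849 to 879) outside the mapped architecture.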

Taxonomy

Phage: Escherichia phage vB_EcoS_SP8 [NCBI]
Taxonomy ID: 2750858
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host: No host information

Coding sequence (CDS)

GenBank protein accession
QLF80463.1 [NCBI]
GenBank nucleotide accession
MT682705.1 [NCBI]
CDS location
range 1000 -> 3639
strand -
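
As a consistency check, the CDS range can be compared with the protein length. The sketch assumes the range is 1-based and inclusive (the usual GenBank convention) and that the final codon is a stop codon.

```python
start, end = 1000, 3639   # CDS range from this record
protein_length = 879      # from the physico-chemical properties

cds_length = end - start + 1          # inclusive coordinates (assumed)
codons, remainder = divmod(cds_length, 3)

print(cds_length)                     # 2640 nucleotides
print(codons, remainder)              # 880 codons, no leftover bases
print(codons - 1 == protein_length)   # True: 879 amino acids plus one stop codon
```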
CDS
ATGCCACCAGCAATAATTGGAGCGGCAATTGCAATTGGTGCATCTGCTGCGGCAGCGGCGGGATGGATTGCGACAGGTGTGGCTTTAGCTATCGGTATGACCGCAACCGTAGCTGGCGCACTATTAACTAAGACACCAAATACAAATTTCAATGCTTATAAAGGTCAACAGGAGCGTAAACAAGTATTACGTGCCGCGGCAGCAGCCAGAACTGTCGTATATGGTACAACTGTAGCTTCTGGAGTCATGACTTTTGCAGAAGAACAGCCCGGTGACCAAGATGAAGATGAACTTCTCCATATGGCGTTAGTTGTTGCGTCACATCCATTGGAAGGAATTGGAGATATTTGGCTTGGGGATGACCTTATTCAAACTTTCGAACAATATGTCAGCTGGGAATTTCATAACGACCGTCAAACTGTAGACCCGTTTATGCGGGACAATTGTCCTTCCTGGAAGGATGATATGATTGGTAAAGGTATTGCTTGGCTTCGTATTAGTTTCAAATTTAATGCTGAAAAATTCCCTTCTGGTTTACCAAATATCAAACTTCTAAAACGAGGACGACGTGTATATGACCCTAGATCTGGCTATACTGTGTTCAGTGATAACGCGGCATTAGTTATACTGGATTATTTCCGAACATATTTGAAACGTAGCGACGAACATATTAACTGGGAGCAGTTCAAAGAAGCTGCGAATATATGTGACGAATATGTTTCTAACGCAGATGGTTCCACGGAGCGACGCTATAGAATCAATGGAGAATTCGAAGTAGATGAAGCACCTGCTAAAATCCTTGATGCAATGTTGCAGGCTTGCGGCGGGGAATTGACATATATAGCCGGTAAGCACGGATTATTGGTCGGTGCATACTATGGTCCAGCAACAATGACTTTGGACGAAAGTTGTATTTCCGGTGATATTAAAATCATCCCGGAGACATCCTATAAGGAACGAACTAACACAATCACTGGTACATTCATAGATCCTAAACAGAACTATGTTGAAGCCGACTTCCCTCCCGTTGTGGTCAAAGAATGGGTGGAAAAAGATGGTGCAGAAATAACCCAGGATATGGATTTCCGATTTGTAACAAGCGAATATCAAGCCCAACGTTTATCCAATATTATTTTACGTCGTAAACGTGTTGGTCGCACAATTGAAATTCCTTGTAACATGAAAGGTTATAAGTTTCGACCAGGAATGTATGTAAAAGTTACTATTTCTCAGATTGGAATGAAAAATGTCGAGATGCGAGTAACTAAATGGTCTTTCAACCCTAAAGGTGGAGTAGACATTACTCTACGACAAGATTTCCTGGAAATGTGGGATGATGCTATCGGTAAACCGATGGACCGCCCAGATCTTGTAGATTTACCAAGTGGCGGTGTTGCTCAACCACAAAATCTACAGTACCAGATTTTGCAGATCAGCGATGTGGTACAGGGTGTGTTATCCTGGACTAATATGGGGCAAGTGGCGTACAACCGCGTAGCCGTGCGACAGGGCGGTGTAACTATTTGGACCGCACAAGTCCCAGGACAAAGTGTTAGAGTTACTGGTCTGTTACGTGGTGCGTATACAGCGCACGTCCAAGCTGTGTCACACGGTGGGGCAATATCCCCTGAGGCTTATCTTGAATTTAATATTCAAGCTCCACCACCACCTTCTTCTGTCGAAGTACAAAACGGTTATTTTTCTTTAACACTTATTCCAAAAATTTCTGAACTTACTAGCGTCAGCACTCAGTTCGATTTCTGGACATCAGGTGAAACAAAATTATCATCCACCAATACGGATGTCGTTGAACGAGACGCTACTCGTGAAGGTATGGGTACTGTATGGCAAAAGAACAATTTGCTTAATGACCACGTTTATTACTGGTATGTTCGCACAATTAATGCCTTCGGATCGTCTTCGTTTGTTGAAGTGGAAGCCCGAGTATGGGCGGAAACTGGGCAAATCATTGAAATCATGGATAAGGAGTTCCAGGAGAC
TGAAACTTTCAAAAACCTCATGGAGACTGTACAGAAAGTCGACGATGGTGTTACAGAATTGTCTAAAGTTGTAGAAGGTCAAGCCTCCAAGATAGATTCAGTGCAGACTAGCGTTGGCGACGTTGTGGTTACAGTTCAACAACAATCGCAAGCATTAGCAGATCTTAATGGTAAACTTTCTGCTCAATGGGGGCAAAAAGTTCAGATTGATAGTAACGGCAACAAATATGTCGCCGGTATGCAGTTAGGTATAGAAGGTAGCGGAGGTCAATTCCAATCTTATTTCATGGTGAGTGCGGATAACTTTGCTGTTTACAACCATGTAGTTGGAGCTGCTCAACTTGCGTTTGCTATTAAAAACGGTCAGGCGTTCTTGAGAGATGGTTTTATTGAAAATGGTTCAATTACAAATGCTAAGATAGGTAATGTGATTCAGTCCAATAACTGGAATGGGAACGATCAAGGGTGGGCAATTACAAAAGACGGTTATTGCGTATTTAATAACGTTACCGTCAGAGGGACGGTCGTTGCTAACGCTGGTCAATTTGGTTTCAACGGAAATGGAGGTATTACAATTGATAGTAATGGTATCCGCGTTCCGTTATCCGGTGGGGGTGTTGTCATAGTAGGACGCTGGTGA
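
The CDS above is given in coding orientation (it starts with ATG), so it can be translated directly. A minimal sketch using the standard genetic code follows. The assumption that this phage uses the standard table is mine, and `translate` is an illustrative helper. Whitespace is stripped first, so a sequence wrapped over several lines is handled.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS in this record.
print(translate("ATGCCACCAGCAATAATTGGA"))  # MPPAIIG
```

The result matches the first seven residues of the protein sequence listed in this record.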

Genome Context

Tertiary structure

PDB ID
62403e77495f8db937d9a9e29e21c8b8af6fbb0c3545c099bd5e374057bba21d
Source ESMFold
Method ESMFold
Resolution 0.7916
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
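
The confidence bands can be applied to per-residue pLDDT scores as sketched below. The legend uses strict inequalities and does not say where scores of exactly 90, 70 or 50 belong. Placing them in the lower band is an assumption, and `plddt_band` is a hypothetical helper.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend."""
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"  # boundary values fall into the lower band (assumption)


for score in (95.2, 81.0, 63.5, 42.0):
    print(score, plddt_band(score))
```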

Literature

Title: Complete genome sequences of eight phages infecting swine Enterotoxigenic Escherichia coli
Authors: Ferreira,A., Oliveira,H., Silva,D., Almeida,C., Burgan,J., Azered,J. and Oliveira,A.
Date: 2020-09-03
PMID: (none listed)
Source: GenBank