GenBank accession: QXV82767.1
Protein name: lateral tail fiber protein
RBP type
Type  Evidence    Probability
TF    GenBank     1.00
TSP   DepoScope   1.00
TSP   RBPdetect   0.83
TF    RBPdetect2  0.76
Protein sequence
MADIKVVRIESLPATTAVTEDDYLVVQQPDLTRRVKIGDVVHVDGTVSHVISFKEGGKLNGPMDFAYFEEEDLYLRWKGEFPHTVPALSSPYSDGGITDAAWMVYTDPSLREELESTIGASMIMTAEGQSVQDVMDVTVQTANDAKALAQRVDFGTVHTVGDVIHLDNFVGPAVIEEGRTTNYPSVAAGEKFLNGVVSRRDTTTVDGIFRGATTGAMYTIAVTNGVATTKRIALRDEFKRLEANNPTRTVIRAGDDLNTAGYLQLDATGRWGMWNQQTASWQPLAIEQGGTGARDAAGVRINIGAFYKQRAALEPNFNINNLTGNQDGVYYQPMTAYATEANGYPAGSGAGHLIVWQNNANGGTGCRQEYYPFSNVDVWYLRTYQANTNQWTAWQPMVRPRNDDTFRSHIGLGKNNSPAFGHLYLAQYSGDVKSASGILHGDKYNTDGVLEHGYRIYSEVRNDNKAWLTIHLHKGAKGSETHRYLGFREDGVLDCPKYMQVGDLNGQLANWGLGEWIRSSGAERGFWGSKKAAKMVIWDGGMDESGNGTLEWGVYNNRKAKWEPLPQAAGGTGATTLADAQNLFKVPIAAGVKDFLTLPRTAGMEDGKYYPIIVRTDPYYAPSTGTDITIVTRSSSGNDPMNCATLQCHYRTGGWTDRGDSFYGVVNFYQNEKALLGMIAPTRGKQEYVAFYVEARAFPVSIYASRNVVEVFTREQDYQVGAVTDNQDGVKFVAPLQSADLNLAILGDNNTNTRPIVDFKGTSGFYTGGGTQWHYIGTAERYAVMSKMNMPKVELWADGLDYLCYGSPRKALFSNAGFQCASDGTGDLTNGTFTSKCGNGAGLQGQAEFRSTPEAAQVIVRDVVGTAHRFYNFNKDGTFSAPGGFVCHTGADWNNQFGPNNPSKIIAGNVNGPQDTMVVGGLSVGFSGNYAFQIAGRQSNLYTRSIEGGNHHPWNKAMQHRGQGLGSSDLNTYHGLWEGIYHQPANNDATYDRHYPVNEAGTLIVLQNNANNGNGCVQEYITYHGDRFTRYGNMVKQVFTWGPWTQTGGNGVSFRYGGTTPEGEIPTPELKVYISGSHVTNNGGNMPANAAHMYYWGNGSTGDRPNVLEFKVLDESQSNWVWHCGTLPDKSRYLSVNGAVNCTSVNQSSDRDLKDNIAVIPDALEAIRKMKGYTYTLKENGMPYAGVIAQEVLEALPEAVSSFVQRKEIPNPDQDGTPLITEERFYSVDYAAVTGLLVQVCREQDDKITALEEQVKKLTEVVTGLQEKLK
Physico-chemical properties
protein length: 1270 AA
molecular weight: 139157.15870 Da
isoelectric point: 5.56638
aromaticity: 0.10315
hydropathy: -0.44583
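
Values such as aromaticity and hydropathy can be recomputed from the protein sequence. The sketch below assumes aromaticity means the fraction of F, W and Y residues and hydropathy means the mean Kyte-Doolittle score (GRAVY). The page does not state which definitions it uses, so both choices and the function names are assumptions made for illustration.

```python
# Kyte-Doolittle hydropathy scale (published per-residue values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y); assumed definition."""
    seq = seq.upper()
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues; assumed definition."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First nine residues of the protein sequence listed on this page.
    prefix = "MADIKVVRI"
    print(len(prefix), aromaticity(prefix), round(gravy(prefix), 3))
```

Passing the full 1270-residue sequence to both functions gives values that can be compared with the ones listed above, provided the page uses the same definitions.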

Domains

Domains [InterPro] (QXV82767.1, residues 1-1270)
DC_1918           ATT  1-438
G3DSA:2.10.10.80  ATT  47-117
Architecture
ATT 1-438 | STR 685-1270
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: an N-terminal domain, a central domain, and a C-terminal domain.

Domain Layout (QXV82767.1, residues 1-1270)
Domain          Start  End   Length (AA)  Confidence
N-terminal      1      235   235          0.9701
Central domain  236    434   200          0.1709
C-terminal      435    1270  835          0.8397
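
The segmentation coordinates can be used to cut the protein sequence into its domains. The following is a minimal sketch that assumes Start and End are 1-based, inclusive residue positions. The page does not state its coordinate convention, so lengths computed this way may differ by one from the Length column. The `Domain` type and helper names are hypothetical.

```python
from typing import NamedTuple


class Domain(NamedTuple):
    name: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive


# Start/End values copied from the segmentation table on this page.
DOMAINS = [
    Domain("N-terminal", 1, 235),
    Domain("Central domain", 236, 434),
    Domain("C-terminal", 435, 1270),
]


def domain_length(d: Domain) -> int:
    """Number of residues covered under the inclusive-coordinate assumption."""
    return d.end - d.start + 1


def extract(seq: str, d: Domain) -> str:
    """Return the subsequence for a domain (converts to 0-based slicing)."""
    return seq[d.start - 1 : d.end]


if __name__ == "__main__":
    for d in DOMAINS:
        print(d.name, domain_length(d))
```

Calling `extract(protein_sequence, DOMAINS[0])` on the full sequence would return the N-terminal segment under this convention.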
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal: 1-235
Central: 236-434
C-terminal: 435-1270

Taxonomy

       Name                         Taxonomy ID  Lineage
Phage  Escherichia phage MaxTheCat  2852029      Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host   Escherichia coli K-12        83333        Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia

Coding sequence (CDS)

GenBank protein accession: QXV82767.1
GenBank nucleotide accession: MZ501094.1
CDS location: range 22240 -> 26052, strand +
CDS
ATGGCTGATATTAAAGTTGTCAGAATTGAATCTCTTCCTGCCACTACCGCAGTGACAGAGGATGATTACCTGGTTGTTCAACAACCAGACCTGACCCGTCGTGTAAAAATTGGCGATGTTGTCCATGTTGATGGGACTGTTTCTCATGTAATCTCCTTTAAGGAAGGTGGTAAGTTAAACGGCCCAATGGATTTTGCCTACTTCGAAGAGGAAGACCTCTACCTGCGTTGGAAAGGCGAATTTCCACACACTGTTCCTGCACTGTCTTCACCGTACTCCGATGGTGGGATTACTGACGCTGCATGGATGGTATATACTGACCCGTCCTTAAGGGAGGAGCTGGAATCTACCATTGGCGCATCTATGATCATGACAGCCGAAGGGCAGTCTGTCCAGGATGTAATGGATGTTACTGTTCAGACAGCTAACGATGCCAAAGCACTTGCACAGCGTGTAGATTTTGGTACAGTGCACACTGTAGGCGATGTTATTCACCTTGATAACTTTGTTGGCCCTGCAGTTATTGAAGAGGGAAGAACTACCAACTACCCTTCTGTAGCAGCTGGTGAAAAATTTCTGAACGGTGTGGTATCTCGTCGTGATACTACAACTGTTGATGGTATTTTCCGTGGCGCTACCACCGGAGCCATGTACACCATTGCAGTAACCAACGGTGTAGCAACAACGAAAAGAATAGCACTTCGTGATGAATTTAAACGCCTGGAGGCAAACAACCCAACAAGAACAGTTATCCGTGCTGGAGATGATTTAAACACGGCAGGGTACTTGCAGCTCGATGCAACAGGCCGTTGGGGTATGTGGAACCAACAAACAGCTTCGTGGCAACCTCTTGCTATAGAGCAAGGCGGCACAGGGGCTAGAGATGCCGCTGGAGTCCGTATCAATATCGGTGCTTTCTATAAGCAGCGTGCAGCCCTTGAGCCAAATTTCAATATAAATAACTTGACTGGTAATCAGGATGGTGTATACTACCAGCCGATGACTGCTTATGCAACTGAGGCAAATGGTTACCCTGCAGGTTCTGGTGCTGGTCACCTGATTGTTTGGCAGAACAATGCTAACGGCGGTACAGGTTGTCGTCAGGAATACTATCCATTCTCTAACGTAGATGTTTGGTATTTGAGAACCTATCAGGCCAATACAAACCAGTGGACTGCATGGCAGCCAATGGTCAGACCTCGTAACGATGATACCTTCAGATCTCATATCGGCCTTGGTAAAAACAACTCGCCAGCCTTCGGGCACCTTTACTTAGCTCAATACTCTGGAGATGTTAAATCGGCCTCAGGTATTCTCCATGGAGATAAATATAACACTGATGGTGTTCTTGAGCATGGATATAGGATCTATTCTGAGGTAAGAAACGACAATAAGGCTTGGTTGACAATCCACCTGCACAAAGGCGCAAAAGGATCTGAAACTCATAGATATTTAGGCTTCCGTGAGGATGGGGTCTTAGATTGTCCTAAATATATGCAGGTTGGTGATCTGAACGGTCAGCTGGCAAACTGGGGACTTGGAGAATGGATCCGCAGTTCAGGAGCAGAAAGAGGTTTCTGGGGGTCCAAGAAAGCCGCCAAGATGGTTATCTGGGATGGTGGGATGGACGAATCCGGTAACGGCACTCTGGAATGGGGTGTTTATAACAACCGGAAGGCCAAGTGGGAACCTCTACCTCAAGCCGCAGGTGGCACCGGGGCTACAACTTTAGCAGATGCTCAGAATCTGTTCAAAGTTCCTATAGCAGCAGGTGTAAAAGACTTCTTAACACTGCCAAGAACCGCAGGGATGGAAGATGGGAAATACTACCCAATCATCGTTAGAACAGATCCGTATTACGCCCCTTCGACAGGTACAGATATTACTATAGTCACCAGGTCTTCATCTGGAAATGACCCTATGAACTGTGCCACCCTGCAGTGTCACTATAGAACTGGCGGCTGGACGGACAGGGGAGACTCTTTCTACGGGGTAGTAAA
TTTCTACCAGAATGAAAAAGCACTTCTTGGGATGATTGCTCCAACAAGGGGTAAACAAGAGTATGTTGCTTTCTATGTGGAGGCTCGTGCTTTCCCTGTCAGCATATACGCAAGTAGAAATGTTGTCGAAGTGTTTACCAGAGAGCAAGATTACCAGGTTGGGGCTGTAACAGACAATCAGGATGGGGTTAAGTTCGTTGCACCTCTTCAGTCAGCAGATTTGAATCTTGCTATTCTTGGAGATAATAACACCAATACCAGACCTATTGTTGACTTCAAAGGTACTTCAGGGTTCTACACTGGTGGCGGCACACAGTGGCACTATATAGGCACTGCTGAACGCTATGCTGTGATGAGCAAAATGAATATGCCAAAAGTCGAGCTATGGGCAGACGGTCTTGACTATTTGTGCTACGGCAGTCCTAGAAAGGCGTTATTCTCTAATGCAGGATTCCAGTGTGCATCAGATGGGACAGGTGATCTGACTAACGGTACGTTTACTTCTAAATGCGGTAATGGTGCTGGTCTCCAAGGTCAGGCAGAGTTCAGATCTACTCCAGAAGCTGCTCAAGTTATTGTTCGTGATGTTGTAGGCACAGCTCATAGATTCTACAACTTCAACAAAGATGGCACTTTCTCAGCTCCGGGCGGTTTTGTATGCCACACGGGTGCAGACTGGAACAACCAGTTTGGGCCTAACAACCCGTCAAAAATAATAGCTGGTAATGTTAACGGCCCACAAGATACCATGGTTGTAGGCGGGTTATCTGTAGGATTCTCGGGAAACTACGCTTTCCAGATTGCTGGCAGACAAAGTAACTTATACACCAGGTCTATAGAAGGTGGTAATCACCATCCCTGGAATAAGGCCATGCAACACAGGGGTCAAGGTCTTGGTTCTTCAGACCTTAACACCTATCACGGATTGTGGGAAGGTATTTACCATCAGCCTGCAAATAATGATGCAACATATGATCGTCATTATCCCGTAAATGAGGCCGGAACACTTATTGTGTTGCAAAATAACGCCAATAACGGTAATGGGTGTGTTCAAGAGTATATTACCTACCATGGCGATAGATTCACACGTTATGGGAATATGGTGAAACAGGTCTTCACTTGGGGCCCGTGGACCCAAACAGGTGGTAATGGTGTTAGCTTCAGGTATGGAGGAACAACCCCAGAAGGGGAAATACCAACCCCAGAGTTAAAGGTGTATATTTCTGGAAGCCATGTTACTAATAACGGTGGCAACATGCCTGCTAATGCCGCCCATATGTATTATTGGGGTAACGGCAGCACAGGAGATCGTCCAAACGTGTTAGAGTTTAAGGTTCTTGACGAGTCTCAATCAAATTGGGTATGGCATTGCGGTACCTTGCCGGATAAATCAAGATATCTTTCTGTAAACGGCGCTGTGAATTGCACATCTGTCAATCAGAGTTCTGACAGAGACCTGAAAGATAACATAGCTGTTATTCCTGATGCGTTAGAAGCTATCCGGAAGATGAAGGGTTATACTTACACTCTTAAAGAGAACGGTATGCCTTATGCAGGTGTTATAGCACAAGAAGTGCTTGAAGCTCTGCCGGAAGCGGTTAGCTCTTTCGTGCAAAGAAAGGAAATCCCAAACCCTGATCAAGATGGAACTCCTTTGATAACAGAGGAAAGATTCTACAGCGTGGATTACGCAGCTGTAACAGGCTTGCTTGTCCAGGTTTGCAGGGAACAGGATGATAAAATAACCGCCCTTGAGGAGCAGGTTAAAAAATTGACAGAGGTTGTTACTGGGTTGCAAGAGAAATTGAAGTAA
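
The CDS coordinates and the protein length can be cross-checked: the range 22240 -> 26052 spans 3813 nucleotides, i.e. 1271 codons, which matches 1270 residues plus a stop codon. The sketch below shows this check and a simple translation routine. It assumes inclusive coordinates and the standard amino-acid assignments of the genetic code; the function names are illustrative and not part of any tool referenced on this page.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides, assuming 1-based inclusive coordinates."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a + strand CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks in the sequence
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    n = cds_length(22240, 26052)
    print(n, n // 3 - 1)  # nucleotides, residues excluding the stop codon
    print(translate("ATGGCTGATATTAAAGTTGTCAGAATT"))  # first nine codons of the CDS
```

Because the CDS above is printed over two lines, the lines need to be joined before translation; `translate` strips whitespace for that reason.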

Genome Context

Tertiary structure

PDB ID
c2704304c7a37a3c2617856cc1df516a75afd4089eb1e80c930a232f20a13114
Source: ESMFold
Method: ESMFold
Resolution: 0.5553
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
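
The confidence bands above map a per-residue pLDDT score to a category. A minimal sketch of that mapping follows. The legend leaves the exact boundary values (90, 70, 50) unassigned, so placing each boundary in the lower band is an assumption, and the function name is hypothetical.

```python
def plddt_category(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence bands in the legend.

    Boundary values (exactly 90, 70 or 50) fall into the lower band here;
    the legend does not specify this, so it is an assumption.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.2, 81.0, 63.4, 42.0):
        print(score, plddt_category(score))
```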

Literature

Title: Systematic exploration of Escherichia coli phage-host interactions with the BASEL phage collection
Authors: Maffei, E., Shaidullina, A., Burkolter, M., Heyer, Y., Estermann, F., Druelle, V., Sauer, P., Willi, L., Michaelis, S., Hilbi, H., Thaler, D.S. and Harms, A.
Date: 2021
PMID: not listed
Source: GenBank