Protein
- GenBank accession
- QKN86080.1 [GenBank]
- Protein name
- tail fiber protein
- RBP type
- TF
- Protein sequence
MSTIWPEGKRDQVILKGMPKPAADAYEGYQSAAAAAKDIYAHTKGELVKTERTLKMQARQLGPTMKKYLPDAVTKRIDKWSKSDQLDYQNYDPNQALMDRELGDVFSGVPKDADEQRQLQDRMVEDKLRDSIKEMRADAMHQTMIGMAKDINLLTGLSRGVFLNIERKKLELQYRTLFAIQDIVKMKQSEFDRNTPALEAIVKNTALPDYAKEDFSEVRWANVKRQAAEWMNPLRYADGFMDMIRENTKKKISGIFGEGRGLLESVLGMGVEDDFGMSDSSSLTAERRKTNARDKATAWGSGFLAKKLLGPQIEKLQKWTREEMEKNPEVMKRLQKGAFTFGNLSSISNSAIAGETQGPLADLFRVLNELGIVQPLNREKAFLDERNGETLSRSAKFDRKAYLSLVEVIPAWLAEINKSVRRGYGEHADLEYDITSRGFVDRKVVGNRVRKAVANDEQRLRLQNSINSTVDFVDRGKTLSQKDRQHLADYIESRASQGRAFDVEAILKDPMHLHRYMPGNAAERIKEALQGHSDSLVGGSNELSNELARKISTVQSSITQRQAIIDEAVNIYGERALRDAGIFNYDAKSDTFGVDKDLSDPYTLFNDLAMGKTRSGRALTREQEIQRKLQNGSALGDYLRRMNQGVNGGADDTSLPPALRGGGKGRGMSPRQLAAVLYGETSTNFVELLSQRNRGEEAPRNNFDGIIEAIRGNNNSDTLQKILEHVRSMDEEGVLLASLAGGAGSGDEEMGPPRPGGGSGGGKRRRIIIGEDGLIRRWGGVLFDTAAGIGGFAKRGVKGAWNKLNQFGGWARGKVAGMGGGEGPGFLTRMRGLISGSVRGGFEAVSSFGKGLLGIRDVYDDHGNVVLQGARLEAGEYYQVIDGKMVQLKTLDDIKLGSDIVDSAGNLILAAADLAAAGKLRYYKGGKIQALTQGLASKIGLGFNKVAKLPKRFLDFLSPKAGSIVGKIKDWLNEIPEGEEKTRLQKAFNRVTSLPGKVLGFAKRGVDRLKDMITDNPLTRWWKGRKDGGGGGFSLFSSTGKKTNHILIRIYKLLNQRLPGDPEDESWTEEMEKGVGGGGSTIGRAVRGAYDRAKASLSERFGGRWSRTKAFFGRGRDRLRGWFSGFRGRAGDMLDGYRGARHDIATRYEVERRLAGRDDDVAEFYRSHLNAKGGLSGRKVYGDAKEDLETARDAAGRVINKGKNAAKSAGARLFERLDRMIGLQEMSWFNTMRESVSRAGGDDGIIRTMFAKFGKRNKPPEGDEKRDYLNFFKRWREKRKEKKEKAQGSKGKSGGLWDMVKSLPIIGPIVSILGTVGNILGSITKWGVLKPVGLLGKAAWNVGKFAVTRLAAPAVSAVATAASAVVTAVGWPAILIGGAIAAAGYAAYRIATTTYTQYLDKMRLAQYGFRDYDKWSSDDGAKARYLEDALREYVSYAEDGQASLRGLSGKDVQKLAEGFGINVEEKGEMLAFQAFMLQRFIPIYLRWITALKSMPNSIQLADVGDAKKVSKEDMLTLFNKMKMTKDAKAFSSLTDPRKVNQGFFSKAWDVVTFTPKEFLSGEEVMEVQNEVERAIKSRMDDKKARKYGMAPAVEGIKSAGVDEAINKLGQLDNERNKNLAKVEGWEDGTEQVQIQVDWNAVLDQKDMNAMESVRWKTYGFTTIDNATRTLITVFEKNVIKDIDVKTASYKGDWKKAIASMVPDAIGTPKEDRLKRWFFDRFLPVFMTYLVGVKRYLPTADPLNLKLTGGYLYEISLMMSTAYSLKGGIRQSVWEVNINPLGGEANTNPSSIKAELETLKLLSKEADLAVRNMIKAIRNNGKRARWKDRNKNRSSLEVTDEDEEDSNISSGDSLSSDGARASGYIPSGTSGGVPGNLGQVVDAVGGVRNYAAMTTGSSSINLSDVKDGDYKSLAEKYPIEMLGRKGALNVPNIKALITDAANMMGVPPAVALAMAKAESGFNYTAKNPYASASGLFQFIDGTWDGMMKGYSRKFGIPRVNQ
MDPWANAILGVQFIRDNIQQAQRDLGGKAPPPAVAYLYHFLGAGGGKKFLEAWKRNPNMAASSAPGITSAILRGNANVFYSNGRIRSVDGVIQELNRRMGAISANEVAADPSKTKDMVAGLSPNSPTNPAAAMGAPAANDPSLSPADNLPADNANRRDDALTQKGAMAAQDAMATAAGNVGPAAPTPTTGGSGTSDASTTAETVASQAAAEGLSATDVAKVKAGAEAQVNAAARPVAAPTSDATASTPTLNGDPIDVQQLKVLIQSRDYLKEIRDILKSNPRAANDTRGGSIQQAANVAPPGSAARRQEITQPTPSLNVSRKAS
- Physico-chemical properties
  - protein length: 2324 AA
  - molecular weight: 253565.66800 Da
  - isoelectric point: 9.49260
  - aromaticity: 0.07229
  - hydropathy: -0.48021
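The page does not say how these values were computed. As a rough sketch, assuming aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle score (GRAVY), both can be computed from the sequence as follows. The function names are illustrative, and only the first ten residues of the protein are used here, so the output is not expected to match the full-length values listed above.

```python
# Kyte-Doolittle hydropathy scale (an assumption: the page does not name its scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = set("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr."""
    return sum(residue in AROMATIC for residue in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[residue] for residue in seq) / len(seq)


# First ten residues of the protein sequence, used only as a small input.
prefix = "MSTIWPEGKR"
print(len(prefix), aromaticity(prefix), round(gravy(prefix), 2))
```

Passing the full 2324-residue sequence instead of `prefix` would give values comparable to the ones on this page, provided the same definitions were used.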
Domains
Domains [InterPro]
| Domain ID | Type | Residues |
|---|---|---|
| DC_0124 | STR | 1–2025 |
| G3DSA:1.10.530.10 | RBD | 1927–2101 |
| IPR023346 | STR | 1933–2021 |
| IPR008258 | ENZ | 1935–2023 |
| cd00254 | ENZ | 1947–2025 |

Sequence axis: 1–2324
Architecture
STR 1-2025 | RBD 2026-2101 | RBD 2103-2324
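The architecture string can be checked against the 2324-residue protein length. The sketch below parses the segments and lists any residues not assigned to a segment. The helper names are illustrative and not part of the database.

```python
def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Parse 'STR 1-2025 | RBD 2026-2101' into (label, start, end) tuples."""
    segments = []
    for part in text.split("|"):
        label, span = part.split()
        start, end = span.split("-")
        segments.append((label, int(start), int(end)))
    return segments


def uncovered_ranges(segments, length: int) -> list[tuple[int, int]]:
    """Return 1-based inclusive residue ranges not covered by any segment."""
    gaps = []
    next_expected = 1
    for _label, start, end in sorted(segments, key=lambda s: s[1]):
        if start > next_expected:
            gaps.append((next_expected, start - 1))
        next_expected = max(next_expected, end + 1)
    if next_expected <= length:
        gaps.append((next_expected, length))
    return gaps


architecture = "STR 1-2025 | RBD 2026-2101 | RBD 2103-2324"
print(uncovered_ranges(parse_architecture(architecture), 2324))  # [(2102, 2102)]
```

For this entry the only unassigned position is residue 2102, which sits between the two RBD segments.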
Legend: ATT, STR, RBD, CBM, LEC, ENZ, CHP, LNK, TAS, TTP, UNK, Unmapped
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Escherichia phage vB_EcoM_EC001 [NCBI] | 2739754 | Uroviricota > Caudoviricetes > Chimalliviridae > Seoulvirus SPN3US > |
| Host | Escherichia coli [NCBI] | 562 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales |
Coding sequence (CDS)
GenBank protein accession: QKN86080.1 [NCBI]
GenBank nucleotide accession: MN445185.1 [NCBI]
CDS location: range 204748 -> 211722, strand -
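As a consistency check, the CDS range can be compared with the stated protein length of 2324 AA. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention) and that the range includes the stop codon. The function name is illustrative.

```python
def protein_length_from_cds(start: int, end: int) -> int:
    """Protein length implied by an inclusive CDS range that ends in a stop codon."""
    n_nucleotides = end - start + 1
    if n_nucleotides % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Subtract one codon for the stop codon, which encodes no residue.
    return n_nucleotides // 3 - 1


print(protein_length_from_cds(204748, 211722))  # 2324
```

The range spans 6975 nucleotides, or 2325 codons including the stop, which agrees with the 2324-residue protein.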
CDS
TTGTCAACTATCTGGCCAGAAGGGAAACGCGACCAAGTTATCCTCAAGGGGATGCCTAAGCCGGCAGCTGATGCATACGAGGGCTATCAGTCAGCGGCAGCTGCAGCAAAGGACATTTACGCCCATACGAAAGGGGAGCTTGTGAAGACCGAGCGCACGCTGAAGATGCAAGCGCGCCAACTCGGTCCGACGATGAAGAAATACTTGCCGGATGCAGTTACTAAGCGCATCGACAAGTGGTCAAAATCTGACCAGTTAGATTACCAGAACTACGATCCGAATCAGGCATTGATGGATCGTGAACTGGGCGACGTATTCTCAGGTGTGCCAAAGGATGCCGATGAACAACGTCAGCTGCAGGACCGCATGGTAGAGGATAAGCTCCGCGACAGCATCAAAGAGATGCGTGCGGATGCAATGCACCAAACCATGATTGGGATGGCGAAAGACATTAACCTGCTAACCGGGTTAAGCCGTGGGGTATTCTTAAACATCGAGCGTAAGAAACTCGAACTGCAATACCGTACCCTGTTTGCCATCCAAGACATCGTGAAGATGAAGCAGTCGGAGTTCGATCGTAATACGCCTGCTTTAGAAGCCATCGTGAAGAACACGGCATTGCCGGATTACGCGAAGGAAGATTTCTCAGAAGTCCGTTGGGCGAACGTTAAACGCCAGGCTGCGGAATGGATGAACCCGTTGCGTTATGCTGACGGGTTTATGGACATGATTCGCGAAAATACCAAGAAGAAGATTTCTGGTATATTTGGCGAAGGTCGTGGTTTGTTAGAGTCTGTTCTCGGCATGGGTGTTGAAGACGATTTCGGCATGAGCGACAGCTCTTCGTTGACTGCTGAACGTCGCAAAACAAATGCTCGTGATAAAGCAACTGCGTGGGGTAGCGGATTCCTTGCGAAGAAACTACTCGGCCCACAAATCGAGAAACTCCAGAAATGGACTCGTGAGGAGATGGAGAAAAATCCAGAGGTCATGAAACGCCTGCAGAAAGGCGCGTTCACCTTTGGTAATCTTTCCTCTATCTCGAACTCAGCTATCGCAGGCGAAACGCAAGGGCCGTTGGCGGATTTGTTCCGGGTATTGAATGAACTCGGTATTGTCCAACCGCTAAACCGCGAGAAAGCCTTCCTTGACGAGCGTAATGGCGAAACGTTAAGCCGTTCGGCGAAGTTTGACCGTAAAGCATATCTTAGCCTGGTTGAAGTTATTCCTGCCTGGCTGGCAGAGATTAACAAATCTGTCCGTCGTGGCTATGGCGAACATGCCGACCTGGAATATGACATCACCAGCCGTGGTTTCGTAGACCGCAAAGTGGTCGGTAACCGCGTACGCAAGGCGGTTGCAAACGATGAACAACGCCTGCGTCTCCAAAATTCCATTAACAGCACTGTGGATTTTGTTGACCGCGGCAAAACGCTCTCGCAGAAGGATCGACAGCATCTCGCCGACTATATCGAGTCGCGAGCCTCGCAGGGCCGCGCATTCGACGTAGAAGCGATTCTGAAAGACCCGATGCATCTTCATCGGTATATGCCAGGAAACGCCGCAGAACGCATTAAGGAAGCGTTACAGGGGCATTCTGATAGCTTGGTCGGCGGTAGCAACGAATTAAGTAACGAGCTGGCCCGCAAAATCTCCACAGTGCAGTCATCTATCACCCAACGTCAGGCAATTATCGACGAGGCGGTGAATATTTACGGTGAGCGAGCATTGCGCGATGCGGGTATCTTTAACTACGACGCGAAGAGTGACACTTTCGGCGTGGATAAAGACCTGTCCGATCCCTATACCTTGTTTAATGACTTGGCAATGGGTAAGACACGCAGCGGTCGTGCGTTGACTCGTGAGCAGGAGATTCAACGTAAGCTGCAAAACGGTTCGGCGTTAGGCGACTATCTGCGCCGAATGAACCAAGGGGTAAATGGTGGTGCCGATGACACGTCGTTGCCGCCGGCTTTACGTGGCGGTGGTAAAGGCCGCGG
AATGTCGCCTCGGCAGTTAGCCGCAGTTCTCTACGGTGAAACCTCGACTAACTTTGTTGAATTGTTGAGTCAACGTAACCGCGGCGAAGAAGCGCCACGGAATAACTTTGACGGCATCATCGAAGCGATTCGGGGTAATAACAACAGTGACACCCTCCAGAAAATTCTGGAACACGTCAGAAGTATGGACGAAGAAGGGGTTCTCCTGGCTTCGTTAGCAGGGGGTGCTGGTTCTGGTGACGAAGAGATGGGACCACCTCGTCCTGGTGGTGGTAGCGGTGGCGGTAAACGCCGTCGTATTATCATCGGTGAAGATGGGCTTATTCGTCGTTGGGGTGGTGTGTTGTTCGACACCGCAGCAGGAATCGGTGGCTTTGCGAAACGTGGTGTTAAGGGTGCCTGGAATAAACTGAACCAGTTCGGTGGCTGGGCACGCGGTAAAGTCGCAGGAATGGGTGGCGGAGAAGGTCCTGGTTTCTTAACCCGGATGCGTGGCCTTATCAGCGGTAGTGTCCGTGGTGGCTTTGAAGCGGTAAGCTCTTTCGGTAAAGGACTACTGGGTATCCGTGACGTTTACGATGACCACGGTAATGTTGTTCTGCAAGGTGCACGTCTGGAAGCTGGAGAATACTATCAGGTCATTGATGGTAAAATGGTTCAGCTGAAAACGCTGGACGACATCAAGCTGGGGAGTGATATTGTTGACTCCGCAGGTAATCTGATATTAGCAGCCGCTGACTTAGCCGCTGCGGGTAAACTCCGCTACTATAAAGGCGGGAAAATCCAAGCACTGACCCAAGGTCTGGCCAGTAAGATTGGCTTAGGCTTTAATAAGGTGGCTAAGCTACCGAAAAGGTTCCTGGATTTCCTGTCACCGAAAGCGGGCAGCATCGTTGGTAAGATTAAGGACTGGCTGAACGAGATTCCGGAAGGCGAAGAAAAGACACGTCTCCAGAAAGCGTTCAACCGTGTAACGAGCTTACCGGGTAAAGTGCTCGGTTTTGCGAAACGTGGTGTCGATCGTCTGAAAGACATGATTACCGACAACCCACTGACGCGTTGGTGGAAAGGCCGTAAAGATGGTGGGGGCGGTGGTTTCAGTCTCTTCTCATCAACCGGCAAGAAAACCAACCACATCCTTATCCGTATCTATAAACTGTTGAACCAACGTTTACCGGGAGATCCGGAAGACGAAAGTTGGACAGAGGAAATGGAGAAGGGCGTCGGGGGTGGCGGTAGTACGATCGGTCGTGCAGTACGCGGGGCGTATGATCGCGCGAAAGCGTCACTGTCTGAACGTTTTGGTGGACGTTGGTCAAGAACAAAAGCGTTCTTTGGTCGTGGACGTGACCGCCTGCGTGGTTGGTTCAGTGGTTTCCGAGGCCGGGCTGGGGATATGCTCGATGGATATCGTGGAGCACGACACGACATTGCTACACGTTACGAAGTTGAACGTCGGTTGGCTGGACGCGACGATGATGTTGCTGAGTTTTATCGCAGCCATTTGAACGCGAAAGGTGGCCTTTCTGGCCGTAAAGTCTACGGCGATGCGAAGGAAGACCTTGAGACCGCCCGTGATGCAGCAGGGAGGGTTATTAACAAAGGGAAGAATGCCGCCAAGTCTGCAGGCGCAAGGTTATTCGAACGCTTGGACCGAATGATTGGCTTGCAAGAGATGTCATGGTTTAACACCATGCGCGAATCAGTATCCCGTGCAGGTGGTGACGATGGCATTATTCGCACCATGTTTGCGAAGTTCGGTAAACGTAACAAGCCGCCTGAAGGTGATGAAAAACGCGATTACCTTAACTTCTTCAAACGTTGGCGTGAGAAACGTAAGGAGAAGAAAGAGAAAGCGCAGGGCTCCAAAGGGAAATCTGGCGGTCTGTGGGATATGGTGAAAAGCTTACCTATCATCGGTCCCATCGTCAGTATCTTGGGAACTGTGGGTAATATACTCGGTTCTATCACGAAATGGGGCGTGTTAAAACCAGTAGGTC
TTTTAGGTAAGGCTGCGTGGAATGTCGGTAAGTTCGCAGTAACCCGTTTGGCGGCACCCGCGGTGTCCGCTGTGGCAACTGCGGCATCTGCCGTCGTGACTGCGGTGGGTTGGCCGGCAATTCTTATCGGTGGTGCGATCGCTGCGGCTGGTTACGCCGCGTACAGGATTGCAACGACTACCTATACCCAGTACCTGGATAAGATGCGTTTAGCTCAGTACGGTTTCCGTGATTACGATAAGTGGTCGTCGGACGACGGGGCGAAAGCGCGTTACTTAGAAGACGCATTGCGTGAGTATGTGTCTTACGCGGAAGATGGCCAAGCCAGTCTCCGCGGGCTAAGTGGAAAAGACGTTCAGAAGCTAGCTGAAGGGTTCGGTATTAACGTTGAGGAAAAAGGCGAGATGTTGGCCTTCCAAGCGTTTATGCTTCAGCGCTTTATTCCGATTTATCTCCGCTGGATTACGGCACTGAAGTCAATGCCGAACAGTATCCAGTTGGCTGACGTCGGCGATGCGAAGAAAGTATCGAAAGAGGACATGCTGACTCTCTTCAACAAAATGAAGATGACTAAGGATGCAAAAGCGTTCTCTTCGTTAACAGACCCACGCAAAGTCAACCAAGGTTTCTTCTCGAAAGCTTGGGACGTTGTAACCTTTACACCGAAGGAGTTCTTGAGCGGTGAAGAGGTTATGGAAGTGCAGAATGAAGTCGAGCGTGCAATCAAGTCCAGAATGGACGATAAGAAAGCACGTAAGTACGGAATGGCACCGGCCGTGGAGGGGATTAAGTCCGCGGGCGTTGATGAAGCTATCAACAAACTTGGACAGTTGGATAACGAACGTAATAAAAATCTGGCGAAGGTAGAGGGTTGGGAAGACGGAACAGAGCAGGTTCAGATTCAGGTAGACTGGAATGCGGTGCTCGACCAGAAAGACATGAATGCGATGGAATCGGTGCGTTGGAAGACTTACGGCTTTACCACCATCGATAACGCCACCCGCACGTTGATCACGGTATTCGAGAAGAACGTCATCAAAGACATTGACGTGAAAACCGCAAGCTACAAAGGAGATTGGAAGAAAGCAATCGCCTCGATGGTTCCAGACGCTATCGGTACACCGAAAGAAGATCGTTTGAAGCGGTGGTTCTTTGACCGTTTCTTACCGGTATTCATGACGTATTTGGTTGGGGTGAAGCGTTACCTGCCAACAGCCGATCCGCTGAACTTGAAGCTGACTGGTGGTTACCTGTACGAAATCAGCTTGATGATGTCAACAGCCTACAGCTTGAAAGGTGGTATAAGACAGTCGGTGTGGGAAGTGAATATCAACCCATTAGGGGGTGAGGCTAACACGAACCCATCGTCTATTAAAGCCGAGCTGGAAACGCTGAAGCTGTTGTCAAAAGAAGCTGACCTTGCCGTGCGTAACATGATTAAAGCCATTAGAAATAATGGCAAGCGTGCGCGCTGGAAGGATCGTAACAAGAACCGCAGTTCTCTGGAAGTCACCGATGAAGATGAAGAAGACTCGAACATCAGTTCGGGAGATTCTTTGTCATCTGACGGTGCTCGCGCCTCAGGCTACATTCCATCGGGTACGAGTGGTGGTGTGCCGGGTAACTTGGGTCAAGTCGTCGATGCGGTTGGTGGTGTGCGGAACTACGCTGCAATGACCACTGGTTCGTCTTCGATTAACCTGAGTGATGTGAAAGACGGTGATTATAAGTCACTGGCTGAAAAATACCCGATAGAAATGTTGGGTAGAAAGGGTGCGTTGAACGTTCCGAATATCAAAGCATTGATTACCGATGCGGCGAACATGATGGGCGTGCCACCTGCAGTGGCGTTAGCAATGGCTAAGGCGGAGTCCGGATTTAACTACACCGCTAAAAACCCGTATGCTTCGGCGTCTGGGTTGTTCCAGTTTATTGACGGTACGTGGGACGGGATGATGAAGGGGTATTCGCGGAAGTTCGGTATTCCGCGTGTTAACCAG
ATGGACCCGTGGGCGAATGCTATATTGGGTGTACAGTTCATTCGTGACAACATCCAACAAGCACAGCGTGACCTGGGTGGTAAAGCACCACCTCCAGCCGTGGCTTATCTGTATCACTTCCTGGGTGCGGGCGGCGGTAAGAAATTCCTGGAAGCATGGAAGCGTAATCCGAATATGGCGGCATCGAGTGCTCCTGGGATTACATCCGCAATATTGAGAGGGAATGCCAACGTCTTCTACAGCAACGGTCGTATACGTAGCGTGGATGGGGTTATTCAGGAACTGAACCGCCGTATGGGCGCAATTTCTGCCAACGAAGTCGCTGCCGATCCGAGTAAGACGAAGGATATGGTTGCAGGCTTGTCGCCTAATTCACCAACCAACCCGGCAGCAGCAATGGGTGCACCGGCCGCTAATGATCCGAGTCTGTCGCCAGCAGATAACCTGCCGGCAGATAATGCTAATCGTCGTGATGACGCATTGACGCAGAAAGGGGCCATGGCGGCACAAGATGCAATGGCTACCGCCGCAGGTAATGTAGGACCAGCAGCACCTACACCAACTACAGGTGGTTCCGGAACATCCGATGCCTCAACAACAGCCGAAACCGTAGCGTCGCAAGCTGCAGCAGAAGGATTGTCTGCAACGGATGTTGCTAAAGTGAAAGCAGGCGCGGAAGCGCAGGTTAACGCAGCAGCTCGTCCAGTTGCCGCCCCAACTTCTGATGCTACAGCATCGACGCCAACGCTGAATGGTGACCCGATAGATGTTCAGCAGCTCAAGGTACTGATTCAGTCTCGCGATTACTTGAAAGAGATTCGTGATATTTTGAAATCAAATCCGAGAGCGGCAAACGACACACGGGGTGGTAGTATCCAGCAAGCGGCAAATGTGGCTCCTCCGGGCTCAGCTGCACGCAGGCAGGAAATAACCCAACCGACACCGTCGTTAAACGTAAGTCGGAAAGCAAGCTAA
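The CDS above begins with TTG, while the protein sequence begins with M. TTG normally encodes leucine, but when it serves as an alternative start codon the first residue is methionine. A minimal sketch of the translation follows, using the standard codon table on the first 30 nucleotides only. The function name and the rule of forcing the first residue to M are assumptions made for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS, reading the first codon as Met and stopping at a stop codon."""
    usable = len(cds) - len(cds) % 3
    protein = []
    for index in range(0, usable, 3):
        amino_acid = CODON_TABLE[cds[index:index + 3]]
        if amino_acid == "*":
            break
        # Assumption: the initiator codon is always read as methionine.
        protein.append("M" if index == 0 else amino_acid)
    return "".join(protein)


cds_prefix = "TTGTCAACTATCTGGCCAGAAGGGAAACGC"  # first 30 nt of the CDS
print(translate(cds_prefix))  # MSTIWPEGKR
```

The result matches the first ten residues of the protein sequence listed on this page.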
Genome Context
Tertiary structure
PDB ID
4aa3fc971e6a54cfadbf1fe3a313f19e78c80b8ce640f16505121500a9cd3120
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
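The confidence bands above can be applied to per-residue pLDDT scores with a small helper. This is a sketch with an illustrative function name. The page uses strict inequalities and does not say where scores of exactly 90, 70 or 50 belong, so placing them in the lower band is an assumption.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used on this page."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"  # assumption: exactly 90 falls here
    if score > 50:
        return "Low"  # assumption: exactly 70 falls here
    return "Very low"  # assumption: exactly 50 falls here


for score in (95.2, 81.0, 63.5, 42.0):
    print(score, plddt_band(score))
```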