UniProt accession
A0AA49INA0 [UniProt]
Protein name
Tail fiber protein
RBP type | Evidence | Probability
TF | UniProt/TrEMBL | 1.00
TF | GenBank | 1.00
TF | RBPdetect | 0.91
TF | RBPdetect2 | 0.96
Protein sequence
MADYKLSQLNSIDTIRSEDLLHIRVKKRPEMLGDEDRRMTYQDFLASFKLERFVQIAGSTMTGDLGIVKLLYGGKAVFDPTDSSEITIGDILKTFKINSNGLKLTIADASRSATVYHTLNKPSPNELGMRTNEENDARYARLAVTNNFSIRQAIVTDGEQLTLKKATNAGASYIQGRDSNNQQTWYVGQGNANSTNVYLYNHATGAMLTLDATAAFNKSLRITGQVQPSDFSNLDARYFTQTVANQRFAQLAGNNTFTGANTFRTMSVISNAAAITLQCASANESLYLLAKDYEGGNLWYFGKGSSGDGVTIYNYTTNTSLVLDAAGVSANKNVRITGQVQPSDWANIDSRYIPAATLSTIARTNGSNTFTAKQVINTNGEALVLQGKSTAALYIRGKDYDGTNRWYVGNDDGSDTVSLYNYKTGKSLSIGSEVSINANLVIGGQVQPSNFTNFDARYYGKAEADSKYFWSAFRASKTEAAGGVAWDQPSGIYLETKSSGQNRLVLHMASNAKALTPSAQLRFESKNGGMWYRSARDTLGFEVEWSKVYTEAQKPTPSEIGAYTKAETDQKIAEAISNSTDLNKIYPVGIVTWFNSNVNPNTALPGLTWTYLNNGVGRTIRIAAANGSDVATTGGSDSVTLSVGNLPSHTHSFSSTTSSFDYGTKATNTTGNHAHDRGNMEISGVVGWFRSDNSAFYTAGGAFQMGSAQASHGFTGSDFTYGAAADFHASRTWTGVTNTTGNHAHTVGIGAHSHTVSGNTGGTGSGSAFSVTNQFYKLMAWVRTA
Physico-chemical properties
protein length: 785 AA
molecular weight: 84584.34970 Da
isoelectric point: 8.44840
aromaticity: 0.09936
hydropathy: -0.38854
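The record does not say which tool produced these values. As a minimal sketch, assuming that aromaticity is the fraction of Phe, Trp and Tyr residues and that hydropathy is the mean Kyte-Doolittle score (GRAVY), the two quantities could be recomputed from the protein sequence as follows. The function names are illustrative, and the input is only the first ten residues of the sequence above, so the printed numbers are not expected to match the full-length values.

```python
# Kyte-Doolittle hydropathy scale, one value per standard amino acid.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr (assumed definition)."""
    seq = seq.upper()
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Illustrative input: only the first ten residues of the record's sequence.
prefix = "MADYKLSQLN"
print(len(prefix), round(aromaticity(prefix), 5), round(gravy(prefix), 5))
```

Passing the complete 785-residue sequence instead of `prefix` would allow a direct comparison with the listed aromaticity and hydropathy, provided the assumed definitions are the ones the database uses.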

Domains [InterPro]

G3DSA:6.20.80.10 | STR | 157–218
IPR048388 | ATT | 158–247
Architecture (A0AA49INA0, residues 1–785)
ATT 1-264 | ATT 272-355 | RBD 356-378 | ATT 379-466 | RBD 467-785
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
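
The architecture string can be checked mechanically. The sketch below parses the `CLASS start-end` segments and lists residue ranges not covered by any segment, assuming 1-based inclusive coordinates and a protein length of 785 AA as given in this record. The helper names `parse_architecture` and `unmapped_gaps` are hypothetical and not part of any database API.

```python
import re
from typing import List, Tuple

Segment = Tuple[str, int, int]  # (domain class, start, end), 1-based inclusive

ARCHITECTURE = "ATT 1-264 | ATT 272-355 | RBD 356-378 | ATT 379-466 | RBD 467-785"


def parse_architecture(text: str) -> List[Segment]:
    """Split a 'CLASS start-end | ...' string into typed segments."""
    segments = []
    for part in text.split("|"):
        match = re.fullmatch(r"\s*([A-Z]+)\s+(\d+)\s*[-–]\s*(\d+)\s*", part)
        if match is None:
            raise ValueError(f"unrecognised segment: {part!r}")
        segments.append((match.group(1), int(match.group(2)), int(match.group(3))))
    return segments


def unmapped_gaps(segments: List[Segment], length: int) -> List[Tuple[int, int]]:
    """Return residue ranges in 1..length not covered by any segment."""
    gaps = []
    cursor = 1  # next residue that still needs to be covered
    for _, start, end in sorted(segments, key=lambda seg: seg[1]):
        if start > cursor:
            gaps.append((cursor, start - 1))
        cursor = max(cursor, end + 1)
    if cursor <= length:
        gaps.append((cursor, length))
    return gaps


segments = parse_architecture(ARCHITECTURE)
print(segments)
print(unmapped_gaps(segments, 785))
```

Under these assumptions the only uncovered stretch is the short span between the first two ATT segments, which would correspond to the "Unmapped" entry in the legend.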

Taxonomy

Name | Taxonomy ID | Lineage
Phage: Escherichia phage vB_EfeM_pEP20 [NCBI] | 3043517 | Uroviricota > Caudoviricetes > Andersonviridae > Felixounavirus > Felixounavirus pEP20
Host: Escherichia fergusonii [NCBI] | 564 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

GenBank protein accession
WHM53049.1 [NCBI]
GenBank nucleotide accession
OQ743992 [NCBI]
CDS location
range 49459 -> 51816
strand -
CDS
ATGGCAGATTACAAATTGAGTCAATTAAACTCAATCGACACAATTCGTTCAGAAGACCTATTGCACATCAGAGTTAAGAAAAGACCTGAAATGTTGGGCGACGAAGACCGTAGAATGACCTATCAAGACTTCTTAGCTTCCTTCAAACTAGAAAGGTTTGTTCAGATTGCTGGGAGCACTATGACGGGGGATTTAGGTATTGTCAAGTTACTTTACGGTGGTAAGGCTGTATTTGACCCAACAGATTCTTCTGAGATTACTATTGGGGATATTTTAAAGACTTTTAAAATTAACTCAAATGGTCTTAAACTCACTATTGCAGATGCTTCAAGGTCGGCAACTGTTTATCATACTTTGAATAAACCAAGCCCTAATGAACTTGGTATGAGAACTAATGAAGAGAATGATGCAAGATATGCAAGGCTTGCTGTCACAAATAACTTCAGCATTAGGCAAGCAATCGTAACTGATGGTGAGCAGTTAACTCTTAAGAAGGCAACCAATGCAGGTGCTTCATATATTCAAGGTAGAGATTCTAATAACCAGCAAACTTGGTATGTTGGACAGGGAAATGCAAACAGTACTAATGTATATTTGTACAATCATGCTACTGGTGCAATGTTAACCCTTGATGCAACAGCAGCATTTAATAAGTCACTTAGAATCACTGGTCAAGTCCAACCTTCAGATTTCTCTAACTTAGATGCTAGATACTTTACTCAGACAGTCGCTAATCAGAGATTTGCACAGTTAGCTGGTAATAACACTTTTACAGGTGCTAATACATTTAGGACTATGAGTGTTATCTCGAATGCTGCTGCTATCACATTGCAATGTGCATCAGCAAACGAATCCCTGTATCTACTTGCAAAAGATTATGAAGGCGGGAACTTATGGTATTTTGGTAAAGGTAGCTCAGGTGATGGTGTAACTATCTACAACTACACCACTAACACCTCTTTGGTCTTAGATGCTGCTGGAGTATCAGCAAATAAAAATGTCAGAATCACTGGTCAAGTTCAACCTTCAGATTGGGCTAACATCGACTCTAGATATATTCCGGCAGCAACATTAAGTACGATTGCAAGAACTAATGGAAGTAACACGTTCACTGCAAAGCAGGTTATTAACACTAATGGTGAAGCTCTCGTGTTACAAGGAAAATCAACCGCCGCACTTTACATTAGGGGTAAGGATTATGATGGAACGAATAGGTGGTATGTTGGCAATGACGACGGTAGTGACACTGTTAGTTTGTATAACTATAAAACTGGTAAAAGCCTAAGCATTGGTTCCGAAGTATCAATTAATGCAAACTTAGTAATTGGTGGTCAAGTACAACCTAGCAACTTCACAAACTTTGATGCAAGGTATTACGGTAAAGCCGAAGCCGATTCAAAGTATTTCTGGTCAGCATTCAGAGCTTCGAAGACCGAGGCTGCTGGTGGTGTGGCTTGGGACCAACCCTCAGGAATCTATCTTGAGACAAAATCTAGCGGGCAAAACCGTTTAGTCTTGCACATGGCCTCTAATGCGAAAGCTCTAACACCATCTGCACAATTAAGATTTGAATCGAAAAATGGTGGGATGTGGTACAGGTCAGCCAGAGACACACTTGGCTTTGAGGTTGAATGGTCTAAGGTTTACACTGAAGCTCAGAAACCTACCCCTTCAGAGATTGGTGCATACACTAAAGCTGAAACTGACCAGAAGATTGCAGAAGCAATTAGTAACTCTACAGACCTGAATAAAATCTATCCAGTAGGTATTGTGACGTGGTTTAACAGTAATGTTAACCCTAATACAGCACTTCCTGGTCTAACTTGGACGTACCTGAATAATGGTGTTGGTAGAACTATTAGAATTGCAGCAGCAAACGGTTCAGATGTTGCTACAACTGGGGGTTCAGACTCTGTAACGTTATCTGTTGGTAACTTACCTTCACACACGCATAGTTTCTCTTCTACCACTTCATCATTTGATTATGGTACTAAGGCGAC
AAATACGACTGGCAACCATGCTCACGATAGAGGTAACATGGAGATTTCTGGTGTGGTTGGTTGGTTCAGAAGTGATAACAGTGCATTCTACACTGCGGGCGGTGCGTTTCAGATGGGTAGTGCTCAGGCTTCACATGGCTTCACAGGCTCTGACTTCACTTACGGTGCTGCTGCTGATTTCCACGCGTCAAGAACTTGGACAGGTGTAACCAACACAACTGGCAACCACGCGCACACTGTTGGTATTGGTGCTCACAGCCACACAGTTAGCGGTAACACAGGTGGTACTGGTTCTGGTTCAGCATTTAGTGTTACTAACCAGTTCTATAAGTTAATGGCTTGGGTAAGAACTGCTTAA
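
The CDS coordinates and the protein length can be cross-checked with a few lines of code. This is a sketch under two assumptions that the record does not state: the range is 1-based and inclusive (the usual GenBank convention), and the CDS is printed in coding orientation (it begins with ATG and ends with TAA) even though the feature lies on the minus strand. The codon table is the standard one, whose elongation codons match bacterial and phage table 11. The function names are illustrative.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based inclusive range (assumed convention)."""
    return end - start + 1


def expected_protein_length(start: int, end: int) -> int:
    """Number of residues encoded, excluding the terminal stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return n // 3 - 1


def translate(cds: str) -> str:
    """Translate a coding-orientation CDS, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(cds_length(49459, 51816), expected_protein_length(49459, 51816))
# First ten codons of the CDS listed above.
print(translate("ATGGCAGATTACAAATTGAGTCAATTAAAC"))
```

With the range from this record the computed protein length agrees with the 785 AA listed under the physico-chemical properties, and the translated prefix agrees with the start of the protein sequence.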

Tertiary structure

PDB ID 855d9b867d474e1785a5ec529532e8809fa5a8222e1ddb1d61cd80c5a36973b8
Source ESMFold
Method ESMFold
Resolution 0.6983
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
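
The confidence bands can be expressed as a small lookup. The legend uses strict inequalities and does not say where values of exactly 90, 70 or 50 belong, so this sketch assumes they fall into the lower band. The function name `plddt_band` is hypothetical.

```python
def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT score (0-100) to the legend's confidence band.

    Assumption: boundary values (exactly 90, 70, 50) go to the lower band,
    because the legend only gives strict inequalities.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (95.2, 81.0, 63.5, 42.0):
    print(score, plddt_band(score))
```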