Genbank accession
WFG41753.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1,00
Protein sequence
MAKRGKDGFDDIDLDRLDDWDDFGEPPRPKDEKKRSPILSTLNVARKSALSTIWPEGKRDQVILKGMPKPAADAYEGYQSAAAAAKDIYAHTKGELEKTERTLKMQARQLGPTMKKYLPDAVTKRFDKWSKSDQLDYQNYDPNQALMDRELGDVFSGVPKDADEQRQLQDRMVEDKLRDSIKEMRADAMHQTMIGMAKDINLLTGLSRGVFLNIERKKLELQYRTLFAIQDIVKMKQSEFDRNTPALEAIVKNTALPDYAKEDFSEVRWANVKRQAAEWMNPLRYADGFMDMIRENTKKKISGIFGEGRGLLESVLGMGVEDDFGMSDSSSLTAERRKTNARDKATAWGSGFLAKKLLGPQIEKLQKWTREEMEKNPEVMKRLQKGAFTFGNLSSISNSAIAGETQGPLADLFRVLNELGIVQPLNREKAFLDERNGETLSRSAKFDRKAYLSIVEVIPAWLAEINKSVRRGYGEHADLEYDITSRGFVDRKVVGNRVRKAVANDEQRLRLQNSINSTVDFVDRGKTLSQKDRQHLADYIESRASQGRAFDVEAILKDPMHLHRYMPGNSAERIKEALQGHSDSLVGGSNELSNELARKISTVQSSITQRQAIIDEAVNIYGERALRDAGIFNYDAKSDTFGVDKDLSDPYTLFNDLAMGKTRSGRALTRDQEIQRKLQNGSALGDYLRRMNQGASGGADDTSLPPALRGGGKGRGMSPRQLAAVLYGETSTNFVELLSQRNRGEEAPRNNFDGIIEAIRGNNNSDTLQKILEHVRSMDEEGVLLASLAGGAGSGDEEMGPPRPGGGGGGGKRRRIIIGEDGLIRRWGGVLFDTATGIGGFAKRGVKGAWNKLNQFGGWARGKIAGMGGGEGPGFLTRMRGLISGSVRGGFEAVSSFGKGLLGIRDIYDDHGNVVLQGARLEAGEYYQVIDGKMVQLKTLDDIKLGSDIVDSAGNLVLAAADLAAAGKLRYYKGGKIQALTQGLASKIGLGFNKVAKLPKRFLDFLSPKAGSIVGKIKDWLNEIPEGEEKTRLQKAFNRVTSLPGKVLGFAKRGVDRLKDMITDNPLTRWWKGRKDGGGGGFSLFSSTGKKTNHILIRIYKLLNQRLPGDPEDESWTEEMEKGVGGGGSKIGRAVRGAFDRAKSSLSERFGGRWSRTKAFFGRGRDRLRGWFSGFRGRAGDMLDGYRGARHDIATRYEVERRLAGRSDDVADFYRSHLNAKGGISGRKVYGDAKDDLETARDAAGRVINKGKNAAKSAGARLFERLDRMIGLQEMSWFNTMRESVSRAGGDDGIIRTMFAKFGKRNKPPESDEKRDYFNFFKRWREKRKEKKEKAQGSKGKSGGLWDMVKGLPIIGPIVSILGTVGSILGSITKWGVLKPVGLLGKAAWNVGKFAVTRLAAPAVSAVATAASAVVTAVGWPAILIGGAIAAAGYAAYKIATTTYTQYLDKMRLAQYGFRDYDKWSSDDGAKARYLEDALREYVSYAEDGQASLRGLSGKDVQKLAEGFGINVEEKGEMLAFQAFMLQRFIPIYLRWITALKSMPNSIQLADVGDAKKVSKEDMLTLFNKMKMTKDAKAFSSLTDPRKVNQGFFSKAWDVVTFTPKEFLSGEEVMEVQNEVERAIKFRMDDKKARKYGMAPAVEGIKSAGVDEAINKLGQLDNERNKNLAKVEGWEDGTEQVQIQVDWNAVLDQKDMNAMESVRWKTYGFTTIDNATRTLITVFEKNVIKDIDVKTASYKGDWKKAIASMVPDAIGTPKEDRLKRWFFDRFLPVFMTYLVGVKRYLPTADPLNLKLTGGYLYEISLMMSTAYSLKGGIRQSVWEVNINPLGGDANTNPSSIKAELETLKLLSKEADLAVRNMIKAIKNNGKRARWKDRNKNRSSLEVTGEDEEDSNISSGDSLSSDGARASGYIPSGTSGGVPGNLGQVVDAVGGVRNYAAMTTGSSSINLSDVKDGDYKSLAEKYPIEMLGRKGALNVPNIKALITDAANMMGVPPAVALAMAKAESGFNYTAKNPYASASGLFQFVNGTWDGMMKGYSRKFGIPRVNQMDPWANAILGVQFIRDNIQQAQRDLGGKAPPPAVAYLYHFLGAGGGKKFLEAWKRNPNMAASSAPGITSAILRGNANVFYSNGRIRSVDGVIQELNRRMGAISANEVAADPSKTKDMVAGLSPNSPTNPAAAMGAPAANDPSLSPADNLPADNANRRDDALTQKGAMAAQDAMATAAGNVGPAAPTPTTGGTGTSDASTTAETVASQAAAEGFSATDVAKVKAGAEAQVNAAARPVAAPTSDATASTPTLNGDPIDVQQLKVLIQSRDYLKEIRDILKSNPRAANDTRGGSIQQAANVAPPGSAARRQEITQPTPSLNVSRKAS
Physico‐chemical
properties
protein length:2373 AA
molecular weight: 259113,76540 Da
isoelectric point:9,49350
aromaticity:0,07375
hydropathy:-0,49663

Domains

Domains [InterPro]
DC_0124
STR
1–2078
G3DSA:1.10.530.10
RBD
1976–2150
IPR023346
STR
1982–2070
IPR008258
ENZ
1984–2072
cd00254
ENZ
1996–2074
WFG41753.1
1 2373
Architecture
STR
RBD
RBD
STR 1-2078 | RBD 2079-2150 | RBD 2152-2373
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Salmonella phage MET_P1_082_240
[NCBI]
3032418 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WFG41753.1 [NCBI]
Genbank nucleotide accession
OQ383623.1 [NCBI]
CDS location
range 214929 -> 222050
strand +
CDS
ATGGCAAAAAGAGGCAAAGACGGTTTCGATGATATCGATTTAGATAGACTCGACGACTGGGATGATTTCGGTGAACCACCCCGTCCCAAAGACGAGAAAAAACGAAGCCCGATACTCAGTACCCTGAATGTTGCCAGGAAGTCGGCGTTGTCAACTATCTGGCCAGAAGGGAAACGCGACCAAGTTATCCTCAAGGGGATGCCTAAGCCGGCAGCTGATGCATACGAGGGTTATCAGTCAGCGGCAGCTGCAGCAAAGGACATTTACGCCCATACGAAAGGGGAGCTTGAGAAGACCGAGCGCACGCTGAAGATGCAAGCGCGTCAACTCGGTCCGACAATGAAGAAATACTTGCCGGATGCAGTTACTAAACGCTTCGACAAGTGGTCAAAATCTGACCAGTTAGATTACCAGAACTACGATCCGAATCAGGCATTGATGGATCGTGAACTGGGCGACGTATTCTCAGGTGTGCCAAAGGACGCCGATGAGCAACGTCAGCTGCAGGATCGGATGGTAGAGGACAAGCTCCGTGACAGCATCAAAGAGATGCGCGCGGATGCAATGCACCAAACCATGATTGGGATGGCGAAAGATATTAACCTGTTAACCGGGTTAAGCCGTGGGGTATTCTTAAACATCGAGCGTAAGAAGCTCGAATTGCAATACCGCACCCTGTTCGCCATTCAAGACATCGTGAAGATGAAGCAGTCGGAGTTCGATCGTAACACGCCTGCTTTAGAAGCCATCGTGAAGAACACGGCACTGCCGGATTACGCGAAGGAAGATTTCTCAGAAGTCCGTTGGGCGAACGTTAAACGCCAGGCTGCGGAATGGATGAACCCATTGCGTTATGCTGACGGGTTTATGGACATGATTCGCGAAAATACCAAGAAGAAAATTTCTGGTATATTTGGCGAAGGTCGTGGTTTGTTAGAGTCTGTTCTCGGCATGGGTGTTGAAGACGATTTCGGCATGAGCGACAGCTCTTCGTTGACTGCCGAACGTCGCAAAACAAATGCTCGCGATAAAGCAACTGCGTGGGGTAGCGGATTCCTTGCGAAGAAACTACTCGGTCCACAAATCGAGAAACTCCAGAAATGGACTCGTGAGGAGATGGAGAAAAATCCAGAGGTCATGAAACGCCTGCAGAAAGGCGCGTTCACCTTTGGTAATCTTTCCTCTATCTCGAACTCAGCTATCGCAGGCGAAACGCAAGGACCGTTGGCGGATTTGTTCCGGGTATTGAATGAACTCGGTATTGTCCAACCGTTAAACCGCGAGAAAGCTTTCCTTGATGAGCGTAATGGCGAAACGTTAAGCCGTTCGGCGAAGTTTGACCGTAAAGCGTATCTTAGCATAGTTGAAGTTATTCCTGCCTGGCTGGCAGAGATTAACAAATCTGTTCGTCGTGGCTATGGCGAACATGCCGACCTGGAATACGACATCACCAGTCGTGGTTTCGTAGACCGCAAAGTGGTCGGTAATCGTGTACGAAAAGCGGTTGCAAATGATGAGCAGCGTCTGCGTCTTCAGAATTCCATTAATAGCACTGTGGATTTTGTTGACCGTGGCAAAACACTCTCGCAGAAGGATCGTCAGCATCTTGCCGACTATATTGAGTCGCGTGCCTCGCAGGGCCGTGCATTCGACGTAGAAGCAATTTTGAAAGACCCGATGCATCTTCATCGGTATATGCCAGGAAACTCTGCAGAACGCATTAAGGAAGCGTTACAGGGGCATTCTGATAGCCTGGTCGGTGGTAGCAACGAATTAAGTAACGAACTAGCTCGCAAAATCTCCACAGTGCAGTCATCTATCACCCAACGTCAGGCAATTATCGACGAGGCGGTGAATATTTACGGCGAGCGAGCACTGCGTGATGCGGGTATCTTTAACTACGACGCGAAGAGTGACACCTTCGGCGTGGATAAAGACCTGTCCGATCCCTATACCTTGTTTAATGACTTGGCAATGGGTAAGACACGCAGCGGGCGTGCATTGACTCGTGACCAGGAGATTCAACGTAAGCTGCAAAATGGTTCGGCGTTAGGCGACTATCTGCGCCGAATGAACCAAGGGGCAAGTGGTGGTGCCGATGACACGTCGTTACCGCCGGCTTTACGTGGCGGTGGTAAAGGTCGTGGAATGTCGCCTCGGCAGTTAGCCGCAGTTCTCTACGGTGAAACCTCGACTAACTTTGTTGAGTTGCTGAGTCAACGTAACCGTGGCGAAGAAGCACCACGGAATAACTTTGACGGTATCATCGAAGCGATTCGGGGTAACAACAACAGTGACACCCTCCAGAAAATTCTGGAACACGTCAGAAGCATGGACGAAGAAGGGGTTCTCCTGGCTTCGTTAGCAGGGGGTGCTGGTTCTGGCGATGAAGAAATGGGTCCACCTCGCCCTGGCGGTGGTGGCGGTGGTGGTAAACGCCGTCGTATTATCATCGGTGAGGATGGGCTTATTCGTCGTTGGGGTGGTGTGTTGTTCGACACCGCAACAGGAATCGGTGGCTTTGCAAAACGTGGTGTTAAGGGTGCCTGGAATAAACTGAACCAGTTCGGTGGCTGGGCACGCGGTAAGATCGCAGGGATGGGAGGTGGCGAAGGTCCTGGTTTCTTAACCCGGATGCGTGGCCTTATCAGTGGTAGTGTCCGTGGGGGCTTTGAAGCAGTAAGCTCTTTCGGTAAAGGACTACTGGGTATCCGCGACATTTACGATGACCACGGTAACGTTGTTCTGCAGGGCGCTCGCCTGGAAGCTGGGGAATATTATCAGGTTATTGATGGTAAGATGGTTCAGTTGAAAACACTGGACGACATCAAACTGGGGAGCGACATTGTTGACTCCGCAGGTAATCTGGTATTAGCGGCCGCTGACCTAGCCGCTGCCGGTAAACTCCGCTACTATAAAGGCGGGAAAATCCAAGCACTGACCCAAGGTCTGGCTAGTAAGATTGGTTTAGGCTTTAATAAGGTGGCTAAGCTACCGAAACGGTTCCTGGATTTCCTGTCACCAAAAGCGGGCAGCATCGTTGGTAAGATTAAGGACTGGCTGAACGAAATTCCGGAAGGCGAAGAAAAAACACGTCTCCAGAAAGCGTTCAATCGCGTAACGAGCTTGCCGGGTAAAGTGCTCGGCTTTGCGAAGCGTGGTGTTGACCGTCTGAAAGACATGATTACCGACAACCCACTAACGCGTTGGTGGAAAGGCCGTAAAGATGGTGGGGGTGGTGGTTTCAGTCTCTTCTCATCAACCGGCAAGAAAACCAACCACATCCTTATCCGTATCTATAAACTGTTGAACCAACGTTTACCGGGCGATCCGGAAGACGAAAGTTGGACAGAAGAAATGGAGAAAGGTGTCGGTGGGGGCGGAAGTAAGATTGGCCGTGCTGTACGTGGTGCGTTCGATCGTGCGAAATCATCGTTGTCTGAGCGATTCGGTGGTCGTTGGTCAAGAACAAAAGCGTTCTTTGGCCGTGGCCGCGATCGGTTACGTGGTTGGTTCAGTGGTTTTCGAGGCCGTGCGGGGGATATGCTCGATGGTTATCGCGGAGCACGACACGACATTGCTACACGCTACGAAGTTGAACGTCGATTAGCGGGCCGCAGTGATGACGTCGCCGATTTCTATCGCAGCCATTTGAACGCGAAAGGTGGCATTTCTGGCCGTAAAGTGTACGGTGATGCAAAGGATGACCTTGAAACTGCCCGCGATGCGGCAGGTCGTGTTATCAACAAAGGGAAGAACGCAGCCAAGTCCGCAGGTGCAAGGTTATTCGAACGTTTAGACCGAATGATTGGGCTGCAAGAGATGTCGTGGTTTAACACCATGCGCGAATCTGTATCTCGTGCAGGTGGTGACGATGGCATTATTCGCACCATGTTTGCGAAGTTCGGTAAACGTAATAAGCCACCTGAAAGTGACGAAAAACGCGACTACTTTAACTTCTTCAAACGTTGGCGTGAGAAACGTAAGGAGAAGAAAGAGAAAGCGCAGGGTTCCAAAGGGAAATCCGGCGGTCTGTGGGATATGGTGAAAGGTTTACCTATCATCGGTCCTATCGTCAGTATCCTGGGAACTGTGGGGAGCATACTCGGCTCTATCACGAAATGGGGCGTGTTAAAACCCGTAGGTCTTTTAGGTAAGGCTGCGTGGAACGTTGGTAAGTTCGCAGTAACCCGTTTGGCGGCACCCGCGGTGTCCGCTGTGGCAACTGCAGCGTCTGCCGTCGTGACCGCAGTGGGTTGGCCGGCAATCCTCATCGGTGGTGCGATCGCCGCGGCAGGTTACGCCGCGTACAAGATTGCAACGACTACCTATACCCAGTACCTGGATAAGATGCGTTTAGCTCAGTACGGTTTCCGTGATTACGATAAGTGGTCGTCGGACGACGGGGCGAAGGCGCGTTACTTAGAAGACGCATTGCGTGAGTATGTGTCTTACGCGGAAGATGGTCAAGCCAGTCTCCGCGGACTAAGTGGAAAAGACGTTCAGAAGTTAGCTGAAGGGTTCGGTATCAACGTCGAGGAAAAAGGCGAGATGTTGGCCTTCCAGGCGTTCATGCTTCAGCGCTTCATTCCGATATATCTCCGCTGGATTACGGCACTGAAGTCAATGCCGAACAGTATCCAGTTGGCTGACGTCGGCGATGCGAAGAAAGTATCGAAAGAGGACATGCTGACACTCTTCAACAAAATGAAGATGACTAAGGATGCAAAAGCGTTCTCTTCGTTAACAGACCCACGCAAAGTCAACCAAGGCTTCTTCTCGAAAGCCTGGGACGTTGTAACCTTTACACCGAAGGAGTTTTTGAGCGGTGAAGAGGTTATGGAAGTGCAGAATGAAGTCGAGCGTGCGATCAAGTTCAGAATGGACGATAAGAAAGCACGTAAGTACGGAATGGCACCGGCCGTGGAGGGGATTAAGTCCGCGGGCGTTGATGAAGCTATCAACAAGCTCGGCCAGCTGGATAACGAACGTAACAAAAATCTGGCGAAGGTAGAGGGTTGGGAAGACGGAACAGAGCAGGTTCAGATTCAGGTAGACTGGAACGCGGTACTTGACCAGAAAGACATGAATGCGATGGAGTCGGTACGTTGGAAGACTTATGGTTTTACCACCATCGACAACGCCACACGCACGTTGATTACGGTATTCGAGAAAAACGTCATCAAAGACATTGACGTGAAAACCGCAAGCTACAAAGGCGACTGGAAGAAAGCTATCGCTTCGATGGTTCCAGACGCTATCGGCACACCGAAAGAAGATCGTTTGAAGCGGTGGTTCTTTGACCGTTTCTTACCGGTATTCATGACGTACTTGGTTGGGGTGAAACGCTACCTGCCAACAGCCGATCCGCTGAACTTGAAACTGACCGGCGGTTACCTGTACGAAATCAGTTTGATGATGTCAACAGCCTACAGCTTGAAAGGCGGTATAAGACAGTCGGTGTGGGAAGTGAACATCAACCCATTAGGGGGCGATGCTAACACTAATCCGTCGTCTATTAAAGCCGAACTGGAAACGCTGAAGCTGTTGTCAAAAGAAGCTGACCTTGCCGTGCGTAACATGATTAAAGCCATTAAAAATAATGGCAAGCGTGCACGCTGGAAGGATCGTAACAAGAACCGCAGTTCTCTGGAAGTCACCGGTGAAGATGAAGAAGACTCGAACATCAGTTCGGGAGATTCTTTGTCATCCGACGGTGCTCGCGCCTCAGGCTACATTCCATCGGGCACGAGTGGCGGTGTGCCGGGTAACTTAGGTCAAGTCGTCGATGCGGTCGGTGGTGTGCGGAACTACGCTGCAATGACCACTGGTTCGTCTTCGATTAACCTGAGTGATGTGAAAGACGGTGATTATAAGTCACTGGCTGAAAAATACCCGATAGAAATGTTGGGTAGAAAGGGTGCGTTGAACGTTCCGAATATCAAAGCATTGATTACCGATGCAGCGAACATGATGGGCGTGCCACCTGCAGTGGCGTTAGCAATGGCTAAGGCGGAGTCCGGTTTTAACTACACCGCTAAAAACCCGTATGCTTCGGCGTCTGGGTTGTTCCAGTTTGTTAACGGCACGTGGGACGGGATGATGAAGGGGTATTCGCGGAAGTTTGGTATTCCGCGTGTTAACCAAATGGACCCGTGGGCGAATGCTATATTGGGTGTACAGTTCATTCGTGACAACATCCAACAAGCACAGCGTGACCTGGGTGGTAAAGCACCACCTCCAGCCGTGGCTTATCTGTACCACTTCTTGGGTGCAGGCGGCGGTAAGAAATTCCTGGAAGCATGGAAGCGTAATCCGAATATGGCGGCATCGAGTGCTCCTGGGATTACATCCGCAATACTGAGAGGGAATGCCAACGTTTTCTACAGCAACGGTCGTATACGTAGCGTAGACGGGGTTATTCAGGAACTGAACCGCCGCATGGGCGCAATTTCCGCCAACGAAGTCGCTGCCGATCCGAGCAAGACGAAGGATATGGTTGCAGGATTGTCGCCTAACTCACCAACCAACCCGGCAGCAGCAATGGGTGCACCGGCCGCTAATGACCCGAGTCTGTCGCCAGCAGATAACCTGCCGGCAGATAATGCTAATCGTCGTGATGATGCATTGACGCAGAAAGGGGCCATGGCGGCACAAGATGCAATGGCTACCGCCGCAGGCAATGTGGGACCAGCAGCGCCTACACCAACTACAGGTGGTACCGGAACATCCGATGCCTCAACAACTGCTGAGACTGTAGCGTCGCAAGCTGCAGCAGAAGGATTCTCTGCGACGGATGTTGCTAAAGTGAAAGCAGGCGCGGAAGCGCAGGTTAACGCAGCAGCACGTCCAGTTGCCGCTCCAACTTCTGATGCCACAGCATCGACGCCAACATTGAATGGTGACCCGATAGATGTTCAGCAGCTCAAGGTACTGATTCAATCTCGCGATTACTTGAAAGAGATTCGTGATATTTTGAAATCAAATCCAAGAGCGGCAAACGACACGCGTGGTGGTAGTATCCAGCAAGCGGCAAATGTGGCTCCTCCGGGCTCAGCTGCACGTAGGCAGGAGATAACCCAACCGACGCCGTCTTTAAACGTAAGTCGGAAAGCAAGCTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
3bfc41fcd1a402353d76d2826343d14c9584f6c0ddb95ea609c5f62175c5ea75
ColabFold
Source ColabFold
Method ColabFold
Resolution 0,4441
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50