GenBank accession
XAN61542.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1.00
Protein sequence
MAKRGKDGFDDIDLDRLDDWDDFGEPPRPKDEKKRSPILSTLNVARKSALSTIWPEGKRDQVILKGMPKPAADAYEGYQSAAAAAKDIYAHTKGELEKTERTLKMQARQLGPTMKKYLPDAVTKRFDKWSKSDQLDYQNYDPNQALMDRELGDVFSGVPKDADEQRQLQDRMVEDKLRDSIKEMRADAMHQTMIGMAKDINLLTGLSRGVFLNIERKKLELQYRTLFAIQDIVKMKQSEFDRNTPALEAIVKNTALPDYAKEDFSEVRWANVKRQAAEWMNPLRYADGFMDMIRENTKKKISGIFGEGRGLLESVLGMGVEDDFGMSDSSSLTAERRKTNARDKATAWGSGFLAKKLLGPQIEKLQKWTREEMEKNPEVMKRLQKGAFTFGNLSSISNSAIAGETQGPLADLFRVLNELGIVQPLNREKAFLDERNGETLSRSAKFDRKAYLSLVEVIPAWLAEINKSVRRGYGEHADLEYDITSRGFVDRKVVGNRVRKAVANDEQRLRLQNSINSTVDFVDRGKTLSQKDRQHLADYIESRASQGRAFDVEAILKDPMHLHRYMPGNSAERIKEALQGHSDSLTGGSNELSNELARKISTVQSSITQRQAIIDEAVNIYGERALRDAGIFNYDAKSDTFGVDKDLSDPYTLFNDLAMGKTRSGRALTRDQEIQRKLQNGSALGDYLRRMNQGASGGADDTSLPPALRGGGKGRGMSPRQLAAVLYGETSTNFVELLSQRNRGEEAPRNNFDGIIEAIRGNNNSDTLQKILEHVRSMDEEGVLLASLAGGAGSGDEEMGPPRPGGGSGGGKRRRIIIGEDGLIRRWGGVLFDTAAGIGGFAKRGVKGAWNKLNQFGGWARGKVAGMGGGEGPGFLTRMRGLISGSVRGGFEAVSSFGKGLLGIRDIYDDHGNVVLQGARLEAGEYYQVIDGKMVQLKTLDDIKLGSDIVDSAGNLVLAAADLAAAGKLRYYKGGKIQALTQGLASKIGLGFNKVAKLPKRFLDFLSPKAGSIVGKIKDWLNEIPEGEEKTRLQKAFNRVTSLPGKVLGFAKRGVDRLKDMITDNPLTRWWKGRKDGGGGGFSLFSSTGKKTNHILIRIYKLLNQRLPGDPEDESWTEEMEKGVGGGGSTIGRAVRGAYDRAKASLSERFGGRWSRTKAFFGRGRDRLRGWFSGFRGRAGDMLDGYRGARHDIATRYEVERRLAGRDDDVAEFYRSHLNAKGGLSGRKVYGDAKEDLETARDAAGRVINKGKNAAKSAGARLFERLDRMIGLQEMSWFNTMRESVSRAGGDDGIIRTMFAKFGKRNKPPEGDEKRDYFNFFKRWREKRKEKKEKAQGSKGKSGGLWDMVKGLPIIGPIVSILGTVGNILGSITKWGVLKPVGLLGKAAWNVGKFAVTRLAAPAVSAVATAASAVVTAVGWPAILIGGAIAAAGYAAYKIATTTYTQYLDKMRLAQYGFRDYDKWSSDDGAKARYLEDALREYVSYAEDGQASLRGLSGKDVQKLAEGFGINVEEKGEMLAFQAFMLQRFIPIYLRWITALKSMPNSIQLADVGDAKKVSKEDMQTLFNKMKMTKDAKAFSSLTDPRKVNQGFFSKAWDVVTFTPKEFLSGEEVMEVQNEVERAIKFRMDDKKARKYGMAPAVEGIKSAGVDEAINKLGQLDNERNKNLAKVEGWEDGTEQVQIQVDWNAVLDQKDMNAMESVRWKTYGFTTIDNATRTLITVFEKNVIKDIDVKTASYKGDWKKAIASMVPDAIGTPKEDRLKRWFFDRFLPVFMTYLVGVKRYLPTADPLNLKLTGGYLYEISLMMSTAYSLKGGIRQSVWEVNINPLGGDANTNPSSIKAELETLKLLSKEADLAVRNMIKAIKNNGKRARWKDRNKNRSSLEVTGEDEEDSNISSGDSLSSDGARASGYIPSGTSGGVPGNLGQVVDAVGGVRNYAAMTTGSSSINLSDVKDGDYKSLAEKYPIEMLGRKGALNVPNIKALITDAANMMGVPPAVAL
AMAKAESGFNYTAKNPYASASGLFQFVNGTWDGMMKGYSRKFGIPRVNQMDPWANAILGVQFIRDNIQQAQRDLGGKAPPPAVAYLYHFLGAGGGKKFLEAWKRNPNMAASSAPGITSAILRGNANVFYSNGRIRSVDGVIQELNRRMGAISANEVAADPSKTKDMVAGLSPNSPTNPAAAMGAPAANDPSLSPADNLPADNANRRDDALTQKGAMAAQDAMATAAGNVGPAAPTPTTGGTGTSDASTTAETVASQAAAEGLSATDVAKVKAGAEAQVNAAARPVAAPTSDATASTPTLNGDPIDVQQLKVLIQSRDYLKEIRDILKSNPRAANDTRGGSIQQAANVPPPGSAARRQEITQPTPSLNVSRKAS
Physico-chemical properties
protein length: 2373 AA
molecular weight: 259134.69860 Da
isoelectric point: 9.47371
aromaticity: 0.07332
hydropathy: -0.50400
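
The aromaticity and hydropathy values listed above can be recomputed from the protein sequence. The record does not state how they were calculated, so the sketch below assumes the common definitions: aromaticity as the fraction of F, W and Y residues, and hydropathy as the mean Kyte-Doolittle value (often called GRAVY). The function names are illustrative, and the short input is only the first ten residues of the sequence, so its output is not expected to match the whole-protein values.

```python
# Kyte-Doolittle hydropathy scale, one value per standard amino acid.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y). Assumed definition."""
    seq = seq.upper()
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues. Assumed definition."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First ten residues of XAN61542.1, used only as a small demonstration.
    fragment = "MAKRGKDGFD"
    print(f"aromaticity: {aromaticity(fragment):.5f}")
    print(f"hydropathy:  {gravy(fragment):.5f}")
```

Running the same functions on the full 2373-residue sequence should give values comparable to those in the record if the database uses these definitions; a mismatch would suggest a different scale or normalisation.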

Domains

Domains [InterPro]
ID                 Type  Range
DC_0124            STR   1–2073
G3DSA:1.10.530.10  RBD   1976–2150
IPR023346          STR   1982–2070
IPR008258          ENZ   1984–2072
cd00254            ENZ   1996–2074
XAN61542.1               1–2373
Architecture
STR 1-2073 | RBD 2074-2150 | RBD 2152-2373
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
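
The architecture line above packs the segment types and residue ranges into one string. As a minimal sketch (the helper names are hypothetical, not part of the database), it can be parsed into tuples and checked against the protein length of 2373 residues to find positions that no segment covers.

```python
import re

# One segment looks like "STR 1-2073"; a hyphen or an en dash is accepted.
SEGMENT = re.compile(r"^\s*([A-Z]+)\s+(\d+)\s*[-–]\s*(\d+)\s*$")


def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Split 'TYPE start-end | TYPE start-end' into (type, start, end) tuples."""
    segments = []
    for part in text.split("|"):
        match = SEGMENT.match(part)
        if match is None:
            raise ValueError(f"unrecognised segment: {part!r}")
        segments.append((match.group(1), int(match.group(2)), int(match.group(3))))
    return segments


def uncovered(segments: list[tuple[str, int, int]], length: int) -> list[tuple[int, int]]:
    """Return 1-based inclusive residue ranges not covered by any segment."""
    gaps = []
    position = 1  # next residue that still needs to be covered
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start > position:
            gaps.append((position, start - 1))
        position = max(position, end + 1)
    if position <= length:
        gaps.append((position, length))
    return gaps


if __name__ == "__main__":
    segs = parse_architecture("STR 1-2073 | RBD 2074-2150 | RBD 2152-2373")
    print(segs)
    print(uncovered(segs, 2373))
```

For this record the check reports a single unassigned residue, 2151, between the two RBD segments.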

Taxonomy

       Name                           Taxonomy ID  Lineage
Phage  Salmonella phage 1252 [NCBI]   2999047      Viruses > unclassified bacterial viruses >
Host   Salmonella enteritidis [NCBI]  149539       cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

GenBank protein accession
XAN61542.1 [NCBI]
GenBank nucleotide accession
PP695294.1 [NCBI]
CDS location
range 157530 -> 164651
strand -
CDS
ATGGCAAAAAGAGGCAAAGACGGTTTCGATGATATCGATTTAGATAGACTCGACGACTGGGATGATTTCGGTGAACCACCCCGTCCCAAAGACGAGAAAAAACGAAGCCCGATACTCAGTACCCTGAATGTTGCCAGGAAGTCGGCGTTGTCAACTATCTGGCCAGAAGGGAAACGCGACCAAGTTATCCTCAAGGGGATGCCTAAGCCGGCAGCTGATGCATACGAGGGCTATCAGTCAGCGGCAGCTGCAGCAAAGGACATTTACGCCCATACGAAAGGGGAGCTTGAGAAGACCGAGCGCACGCTGAAGATGCAAGCGCGTCAACTCGGTCCGACAATGAAGAAATACTTGCCGGATGCAGTTACTAAACGCTTCGACAAGTGGTCAAAATCTGACCAGTTAGATTACCAGAACTACGATCCGAATCAGGCATTGATGGATCGTGAACTGGGCGACGTATTCTCAGGTGTGCCAAAGGACGCCGATGAGCAACGTCAGCTGCAGGATCGCATGGTAGAGGACAAGCTCCGTGACAGCATCAAAGAGATGCGCGCGGATGCAATGCACCAAACCATGATTGGGATGGCGAAAGATATTAACCTGTTAACCGGGTTAAGCCGTGGGGTATTCTTAAACATCGAGCGTAAGAAGCTCGAATTGCAATACCGCACCCTGTTCGCCATTCAAGACATCGTGAAGATGAAGCAGTCGGAGTTCGATCGTAACACGCCTGCTTTAGAAGCCATCGTGAAGAACACGGCACTGCCGGATTACGCGAAGGAAGATTTCTCAGAAGTCCGTTGGGCGAACGTTAAACGCCAGGCTGCGGAATGGATGAACCCATTGCGTTATGCTGACGGGTTTATGGACATGATTCGCGAAAATACCAAGAAGAAAATTTCTGGTATATTTGGCGAAGGTCGTGGTTTGTTAGAGTCTGTTCTCGGCATGGGTGTTGAAGACGATTTCGGCATGAGCGACAGCTCTTCGTTGACTGCCGAACGTCGCAAAACAAATGCTCGCGATAAAGCAACTGCATGGGGTAGCGGATTCCTTGCGAAGAAACTACTCGGCCCACAAATCGAGAAACTCCAGAAATGGACTCGTGAGGAGATGGAGAAAAATCCAGAGGTCATGAAACGCCTGCAGAAGGGCGCGTTCACCTTTGGTAATCTTTCCTCTATCTCGAACTCAGCTATCGCAGGCGAAACGCAAGGACCGTTGGCGGATTTGTTCCGGGTACTGAATGAACTCGGTATTGTCCAACCGTTAAACCGTGAGAAAGCCTTCCTCGATGAGCGTAATGGCGAAACGTTAAGTCGCTCGGCGAAGTTTGACCGTAAAGCGTATCTTAGCCTGGTTGAAGTTATTCCTGCCTGGCTGGCAGAGATTAACAAATCTGTTCGTCGTGGCTATGGCGAACATGCCGATCTGGAATACGACATCACCAGTCGCGGTTTCGTAGACCGCAAAGTGGTCGGTAACCGTGTACGCAAAGCGGTTGCAAACGATGAGCAGCGCCTGCGTCTCCAAAATTCCATTAACAGCACTGTGGATTTTGTTGACCGTGGCAAAACGCTCTCGCAGAAGGATCGACAGCATCTTGCCGACTATATCGAGTCACGTGCCTCGCAGGGCCGCGCATTCGACGTAGAAGCGATTCTGAAAGACCCGATGCATCTTCATCGGTATATGCCAGGAAACTCTGCAGAACGCATTAAGGAAGCGTTACAGGGGCATTCAGACAGTCTTACTGGTGGTAGCAACGAATTAAGTAACGAACTGGCTCGCAAAATCTCCACAGTGCAGTCATCTATCACCCAACGTCAGGCAATTATCGACGAGGCGGTGAATATTTACGGCGAGCGAGCGTTGCGTGATGCGGGTATCTTTAACTACGACGCGAAGAGTGACACCTTCGGCGTGGATAAAGACCTGTCCGATCCCTATACCTTGTTTAATGACTTGGCGATGGGTAAGACACGCAGCGGGCGTGC
ATTGACTCGTGACCAGGAGATTCAACGTAAGCTGCAAAATGGTTCGGCGTTAGGCGACTATCTGCGCCGAATGAACCAAGGGGCAAGTGGTGGTGCCGATGACACGTCGTTACCGCCGGCTTTACGTGGCGGTGGTAAAGGCCGCGGAATGTCGCCTCGGCAGCTAGCCGCGGTTCTCTACGGTGAAACCTCGACTAACTTTGTTGAATTGTTGAGTCAACGTAACCGTGGCGAAGAAGCACCACGGAATAACTTTGACGGTATCATCGAAGCGATTCGGGGTAACAACAACAGTGACACCCTCCAGAAAATTCTGGAACACGTCAGAAGTATGGACGAAGAAGGGGTTCTCCTGGCTTCGTTAGCAGGGGGTGCTGGTTCTGGCGATGAAGAAATGGGCCCGCCTCGCCCTGGCGGTGGTAGCGGTGGCGGTAAACGCCGTCGTATTATCATCGGTGAGGATGGGCTTATTCGTCGTTGGGGTGGTGTGTTGTTCGACACCGCAGCAGGAATCGGTGGCTTTGCGAAACGTGGTGTTAAGGGTGCCTGGAATAAACTGAACCAGTTCGGTGGCTGGGCACGCGGTAAAGTCGCAGGAATGGGTGGCGGAGAAGGTCCTGGTTTCTTAACCCGGATGCGTGGCCTTATCAGCGGTAGTGTCCGTGGGGGCTTTGAAGCAGTAAGCTCTTTCGGTAAAGGACTACTGGGTATCCGCGACATTTACGATGACCACGGTAATGTTGTTCTGCAGGGCGCACGCCTGGAAGCTGGGGAATACTACCAGGTTATTGATGGTAAGATGGTTCAGTTGAAAACACTGGACGACATCAAACTGGGGAGTGATATTGTTGACTCCGCAGGTAATCTGGTATTAGCGGCCGCTGACTTAGCCGCAGCCGGTAAACTCCGCTACTATAAAGGCGGGAAAATCCAAGCGCTGACCCAAGGTCTGGCCAGTAAGATTGGTTTAGGCTTTAATAAGGTGGCTAAGCTACCGAAACGGTTCCTGGATTTCCTGTCACCAAAAGCGGGCAGCATCGTTGGTAAGATTAAGGACTGGCTGAACGAGATTCCGGAAGGCGAAGAAAAGACACGTCTCCAGAAAGCGTTCAACCGTGTAACGAGCTTACCGGGTAAAGTGCTCGGTTTTGCGAAACGTGGTGTCGATCGTCTGAAAGACATGATTACCGACAACCCACTGACGCGTTGGTGGAAAGGCCGTAAAGATGGTGGGGGTGGTGGTTTCAGTCTCTTCTCATCAACCGGCAAGAAAACCAACCACATCCTTATCCGTATCTATAAACTGTTGAACCAACGTTTACCGGGAGATCCGGAAGACGAAAGTTGGACAGAGGAAATGGAGAAGGGCGTCGGGGGTGGCGGTAGTACGATCGGTCGTGCAGTACGCGGGGCGTATGATCGTGCGAAAGCGTCATTGTCTGAACGTTTTGGTGGACGTTGGTCAAGAACAAAAGCGTTCTTTGGTCGTGGACGTGACCGCCTGCGTGGTTGGTTCAGTGGTTTCCGAGGCCGGGCTGGGGATATGCTCGATGGATATCGTGGAGCACGACACGACATTGCTACACGTTACGAAGTTGAACGTCGGTTGGCTGGACGCGATGACGATGTTGCTGAGTTTTATCGCAGCCATTTGAACGCGAAAGGTGGCCTTTCTGGCCGTAAAGTCTACGGCGATGCGAAGGAAGACCTTGAGACCGCCCGTGATGCAGCAGGGAGGGTTATCAACAAAGGGAAGAATGCCGCCAAGTCAGCAGGTGCAAGGTTATTCGAACGCTTGGACCGAATGATTGGCTTGCAAGAGATGTCGTGGTTTAACACCATGCGCGAATCAGTATCCCGTGCAGGTGGCGACGATGGCATTATTCGTACCATGTTTGCGAAGTTCGGTAAACGTAACAAGCCGCCTGAAGGTGATGAAAAACGCGACTACTTTAACTTCTTCAAACGTTGGCGTGAGAAACGTAAGGAGAAGAAAGAGA
AAGCGCAGGGCTCCAAAGGGAAATCTGGCGGTCTGTGGGATATGGTGAAAGGCTTACCTATCATCGGTCCCATCGTCAGTATCTTGGGAACTGTGGGTAATATACTCGGTTCTATCACGAAATGGGGCGTGTTAAAACCCGTAGGTCTTTTAGGTAAGGCTGCGTGGAACGTTGGTAAGTTCGCAGTAACCCGTTTGGCGGCACCCGCGGTGTCCGCTGTGGCAACTGCGGCATCTGCCGTCGTGACAGCAGTGGGCTGGCCGGCAATTCTTATCGGTGGCGCGATCGCTGCGGCAGGTTACGCCGCGTACAAGATCGCAACGACTACCTATACCCAGTACCTGGATAAGATGCGTTTAGCTCAGTACGGTTTCCGTGATTACGATAAGTGGTCGTCGGACGACGGGGCGAAGGCGCGTTACTTAGAAGACGCATTGCGTGAGTATGTGTCTTACGCGGAAGATGGCCAAGCCAGTCTCCGCGGACTAAGTGGAAAAGACGTTCAGAAGTTAGCTGAAGGGTTCGGTATCAACGTTGAGGAAAAAGGCGAGATGTTGGCCTTCCAAGCGTTTATGCTTCAGCGCTTCATTCCGATTTATCTCCGCTGGATTACGGCACTGAAATCGATGCCGAACAGTATCCAGTTGGCTGACGTCGGCGATGCGAAGAAAGTATCGAAAGAGGACATGCAGACACTCTTCAACAAAATGAAGATGACTAAGGATGCAAAAGCGTTCTCTTCGTTAACTGACCCACGCAAAGTCAACCAAGGTTTCTTCTCGAAAGCTTGGGACGTTGTAACCTTTACACCGAAGGAGTTTTTGAGCGGTGAAGAGGTTATGGAAGTGCAGAATGAAGTTGAGCGTGCAATCAAGTTCAGAATGGACGATAAGAAAGCACGTAAGTACGGAATGGCACCGGCTGTAGAGGGGATTAAGTCCGCGGGCGTTGATGAAGCTATCAACAAACTTGGACAGTTGGATAACGAACGTAATAAAAATCTGGCGAAGGTAGAGGGTTGGGAAGACGGAACAGAGCAGGTTCAGATTCAGGTAGACTGGAATGCGGTGCTCGACCAGAAAGACATGAATGCGATGGAGTCGGTACGTTGGAAGACTTATGGTTTTACCACCATCGACAACGCCACACGCACGTTGATTACGGTATTCGAGAAAAACGTCATCAAAGACATTGACGTGAAAACCGCAAGCTACAAAGGTGATTGGAAGAAAGCAATCGCCTCAATGGTTCCAGACGCTATCGGCACACCGAAAGAAGATCGTTTGAAGCGGTGGTTCTTTGACCGTTTCTTACCGGTATTCATGACGTACTTGGTTGGGGTGAAACGCTACCTGCCAACAGCTGATCCGCTGAACTTGAAACTGACCGGCGGTTACCTGTACGAAATCAGTTTGATGATGTCAACAGCTTATAGCTTGAAGGGCGGTATAAGACAGTCGGTGTGGGAAGTGAATATCAACCCATTAGGGGGCGATGCTAACACGAACCCGTCGTCTATCAAAGCCGAGCTGGAAACGCTGAAGCTGTTATCAAAAGAAGCTGACCTTGCCGTGCGTAACATGATTAAAGCCATTAAAAATAATGGCAAGCGTGCACGCTGGAAGGATCGTAACAAGAACCGCAGTTCTCTGGAAGTCACCGGTGAAGATGAAGAAGACTCGAACATCAGTTCGGGAGATTCTTTGTCATCTGACGGTGCTCGCGCCTCAGGCTACATTCCATCGGGCACGAGTGGCGGTGTGCCGGGTAACTTAGGTCAAGTCGTCGATGCGGTTGGTGGTGTGCGGAACTACGCTGCAATGACCACTGGTTCGTCTTCGATTAACCTGAGTGATGTGAAAGACGGTGATTATAAGTCACTGGCTGAAAAATACCCGATAGAAATGTTGGGTAGAAAGGGTGCGTTGAACGTTCCGAATATCAAAGCATTGATTACCGATGCAGCGAACATGATGGGCGTGCCACCTGCAGTGGCGTTA
GCAATGGCTAAGGCGGAGTCCGGATTTAACTACACCGCTAAAAACCCGTATGCTTCGGCATCTGGGTTGTTCCAGTTTGTTAACGGCACGTGGGACGGGATGATGAAGGGGTATTCGCGGAAGTTTGGTATTCCGCGTGTTAACCAGATGGACCCGTGGGCGAATGCTATTTTGGGTGTACAGTTCATTCGTGACAACATCCAACAAGCACAACGTGACCTGGGTGGTAAAGCACCACCTCCAGCCGTGGCTTATCTGTACCACTTCCTGGGTGCAGGCGGCGGTAAGAAATTCCTGGAAGCATGGAAGCGTAATCCGAATATGGCGGCATCAAGTGCTCCTGGGATTACATCTGCAATACTGAGAGGGAATGCCAACGTCTTCTACAGCAACGGTCGTATACGTAGCGTAGACGGGGTTATTCAGGAACTGAACCGCCGCATGGGTGCAATTTCCGCCAACGAAGTCGCTGCCGATCCGAGCAAGACGAAGGATATGGTTGCAGGCTTGTCGCCTAATTCACCAACCAACCCGGCAGCAGCAATGGGTGCACCGGCCGCTAATGACCCGAGTCTGTCGCCAGCAGATAACCTGCCAGCAGATAATGCTAATCGTCGTGATGACGCATTGACGCAGAAAGGGGCCATGGCGGCACAAGATGCAATGGCTACCGCCGCAGGTAATGTAGGACCAGCAGCACCTACACCAACCACAGGCGGTACCGGAACATCCGATGCCTCAACAACGGCCGAAACCGTAGCGTCGCAAGCTGCAGCAGAAGGATTGTCTGCGACGGATGTTGCTAAAGTGAAAGCAGGCGCGGAAGCGCAGGTTAACGCAGCAGCTCGTCCAGTTGCCGCTCCAACTTCTGATGCTACAGCATCGACGCCAACGCTGAATGGTGACCCGATAGATGTTCAGCAGCTCAAGGTACTGATTCAGTCTCGCGATTACTTGAAAGAGATTCGTGATATTTTGAAATCAAATCCAAGAGCGGCAAACGACACACGGGGTGGTAGTATCCAGCAAGCGGCAAATGTGCCTCCTCCTGGCTCAGCTGCACGTAGGCAGGAAATAACCCAACCGACACCGTCGTTAAACGTAAGTCGGAAAGCAAGCTAA
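
The CDS coordinates, strand and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates (the GenBank convention) and that the CDS includes its stop codon, which is consistent with the sequence above ending in TAA. The function names are illustrative. Because the strand is minus, the coding sequence is the reverse complement of the genome's plus strand over this range; the CDS as displayed already starts with ATG, so it appears to be shown in coding orientation.

```python
# Translation table for complementing DNA bases (upper and lower case).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based inclusive range."""
    return end - start + 1


def expected_protein_length(start: int, end: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by the range, excluding the stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError(f"CDS length {n} is not a multiple of 3")
    codons = n // 3
    return codons - 1 if includes_stop else codons


def reverse_complement(dna: str) -> str:
    """Reverse complement, as needed to read a minus-strand feature."""
    return dna.translate(COMPLEMENT)[::-1]


if __name__ == "__main__":
    start, end = 157530, 164651
    print(cds_length(start, end))               # nucleotides in the CDS
    print(expected_protein_length(start, end))  # amino acids, stop excluded
```

With the coordinates from this record the range spans 7122 nucleotides, that is 2374 codons, which agrees with the stated protein length of 2373 AA plus one stop codon.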

Genome Context


Tertiary structure

PDB ID
36f312ab7fa16fd2322abaf6fbf4b4ac1b9a09b7a589e7603e11d386d8b14d6d
Source ColabFold
Method ColabFold
Resolution 0.4635
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
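
The confidence legend above maps a per-residue pLDDT score to one of four bands. A minimal sketch of that mapping follows; `plddt_band` is a hypothetical helper name. The legend uses strict inequalities, so it does not say where scores of exactly 90, 70 or 50 belong. Here they are assigned to the lower band, which is an assumption.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Boundary values (exactly 90, 70 or 50) fall into the lower band;
    the legend leaves this unspecified, so this is an assumption.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for value in (95.2, 81.0, 63.5, 41.7):
        print(value, plddt_band(value))
```

Applied to the per-residue scores of a predicted model, this gives the same colour classes that structure viewers typically use for pLDDT.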