GenBank accession
XLG28162.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1.00
Protein sequence
MPIQVGTVGPLKRRYRGAQEVQRVYRGSNLIWQKPTGSLMPPVDGATKPFVVMYIGSSTTEGTIGATDEHHNYVNHFSAMVVTHANNGVSTEKVIKQNSGSATRPTTNGIWFYNAGIGGRVASNYVDATVQSLASDLSPNLCIHMIGSNDYANQVTPASYQADIQSAINALNARGASGVQHILVHSYIRLDITAAQTYEWPAYKARLDALAAANSNVVSIDVSDLFANRGVFKGSNDPQNLVHDGDNIHATDAGYKFLAESMAVKLGMNLRRGEKIWELDPDALSLADGTAVDSLPSTAGTLEQTPATQTGANRPTLVHNSANGRKTLKFDGVDDSIAATFPKSYGLPLTMFAVVRTNSGGTSTRPIYSRVSTAHKGYIYAFDVETSAGNSNFTSISNSPTSKKSFAMRDDLYQVIAVVFRAADDQTVYISDKYGKSNETQAPDLTHGPFISSMRLGSNTGLNLFSPMNLAYARLYQGELTQAEIETQLDALGSRFNVPITYTPPVVYPEPATGPVTYQNDFSSNTTTSGQDLAGIGEGWVDQVNSTGQMHLYAGRLVPSGMGSYRTMSNYYRNAPLSSKYHYARAIIHAAPAASDPKAGVGLVVRHSGTQAIMCRLSTDGTWQVYAGTSLSGTNTVQTDPTYLGSGQIGDVVNGGDEMIVTVEKDDKVRIYLNNREITGGGVDVSSIAAGLNTGVLITMTNAGELRGFKTGVLTDQPAPVAYIEDTFVSVPATSTTANFVVPLPPNRQVGDLMILHGAVTNVTSPSTAPTGWSVVQRRTTTTSGAPRPAIYSRICDGTEASTVALNFGTKANTAAASAGILLFRNTPRTGQIINISANGGGTATTAHATPAITTTASAEVGRSMVVRMVFSSLAGAAPASAWTWPSPMTEMIERAADTNAINDADFGASWARPTAVGSQAAVTATHYASRTYGSLSFAINSIAIT
Physico-chemical properties
protein length: 946 AA
molecular weight: 100154.71010 Da
isoelectric point: 6.55947
aromaticity: 0.07611
hydropathy: -0.16839
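
The aromaticity and hydropathy values above can be recomputed from the protein sequence. The sketch below assumes the record uses the usual definitions: aromaticity as the fraction of F, W and Y residues, and hydropathy as the mean Kyte-Doolittle score (GRAVY). The function names are illustrative and not part of the record.

```python
# Kyte-Doolittle hydropathy scale, one value per standard amino acid.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W or Y (assumed definition)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First ten residues of the protein sequence, used only as a short demo input.
    demo = "MPIQVGTVGP"
    print(aromaticity(demo), gravy(demo))
```

An aromaticity of 0.07611 over 946 residues corresponds to 72 aromatic residues (72 / 946), which is consistent with this definition.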

Domains

Domains [InterPro]
IPR051532 Unmapped 6–264
IPR036514 STR 37–266
SSF52266 STR 48–263
cd00229 ENZ 52–263
IPR001087 ENZ 55–261
XLG28162.1 1–946
Architecture
STR 37–266 | STR 275–910
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Gordonia phage ScarletRaider [NCBI] 3251017 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

GenBank protein accession
XLG28162.1 [NCBI]
GenBank nucleotide accession
PQ184837.1 [NCBI]
CDS location
range 21635 -> 24475
strand +
CDS
GTGCCAATCCAGGTAGGAACCGTCGGGCCGCTGAAGCGTCGTTACAGGGGCGCTCAGGAAGTCCAGCGTGTTTACCGAGGTAGCAATCTCATCTGGCAGAAGCCAACCGGATCACTCATGCCGCCAGTGGACGGCGCGACGAAACCGTTCGTCGTCATGTACATCGGCAGCTCGACGACCGAGGGCACCATCGGTGCGACTGACGAACACCACAACTACGTCAACCACTTCTCGGCAATGGTGGTTACGCACGCCAACAATGGTGTGTCGACGGAGAAGGTGATCAAGCAGAACTCGGGCAGTGCCACTCGTCCAACGACCAACGGCATCTGGTTCTACAACGCGGGCATCGGTGGTCGAGTCGCGTCGAACTATGTGGACGCCACAGTCCAGAGTCTCGCCTCCGACCTGTCGCCCAACTTGTGCATCCACATGATCGGCTCGAACGATTACGCAAACCAAGTAACACCTGCGTCGTACCAGGCCGACATTCAGTCCGCGATCAACGCCCTCAATGCACGCGGTGCTTCGGGGGTCCAGCACATTCTCGTGCACTCCTACATCCGCCTGGACATCACGGCGGCACAGACGTACGAGTGGCCTGCGTACAAGGCGCGGTTGGATGCACTGGCAGCGGCCAACTCCAACGTGGTCTCCATCGACGTGTCGGACCTGTTCGCAAACCGAGGCGTGTTCAAGGGCAGTAACGATCCACAGAACCTCGTGCACGACGGTGACAACATTCACGCCACCGATGCGGGGTACAAATTCCTCGCTGAGTCCATGGCCGTGAAGCTGGGGATGAACCTCCGTCGCGGCGAGAAAATCTGGGAACTCGATCCCGACGCTCTGTCCCTTGCTGACGGCACAGCAGTCGATTCCCTGCCCAGTACCGCAGGAACGCTCGAACAGACGCCAGCGACGCAGACAGGGGCCAACAGACCCACTCTGGTACACAACTCCGCGAACGGTCGCAAGACCCTGAAGTTCGATGGTGTCGATGACTCGATCGCAGCAACGTTCCCGAAGTCGTATGGTCTGCCGCTGACGATGTTTGCTGTCGTGCGTACCAACTCCGGTGGTACATCCACACGTCCGATCTACTCGCGTGTGTCCACCGCGCACAAGGGATACATCTACGCGTTCGATGTGGAGACGAGTGCAGGCAACTCGAACTTTACTTCGATCAGTAACTCGCCCACGTCCAAGAAGTCGTTCGCGATGCGGGACGATCTCTATCAAGTCATCGCCGTTGTGTTCCGAGCGGCCGACGATCAGACGGTGTACATCAGCGACAAGTACGGGAAGTCGAACGAGACGCAGGCACCGGACCTGACACATGGTCCGTTCATCTCGTCCATGCGCCTTGGTTCCAATACCGGCTTGAATTTGTTCTCTCCCATGAACCTTGCGTACGCACGCCTGTACCAAGGCGAACTGACACAGGCCGAGATCGAGACGCAGCTCGATGCATTGGGATCGCGGTTCAATGTGCCGATCACCTACACACCTCCGGTCGTGTACCCGGAACCTGCAACAGGTCCCGTGACTTACCAAAATGACTTCTCCTCGAACACGACGACCAGCGGCCAAGACCTTGCGGGTATCGGTGAGGGTTGGGTCGACCAGGTCAACTCGACCGGACAGATGCATCTCTACGCGGGGCGACTCGTGCCATCCGGCATGGGTTCGTACCGCACGATGAGCAACTACTACCGCAACGCACCTCTGTCGTCGAAGTACCACTACGCACGTGCCATCATTCACGCCGCTCCCGCAGCGTCGGACCCTAAGGCTGGTGTGGGTCTGGTTGTCCGTCACTCAGGCACACAGGCCATCATGTGCCGCCTTTCCACGGACGGCACGTGGCAGGTGTACGCCGGTACATCTCTCAGCGGTACGAACACGGTGCAGACCGATCCGACCTACCTTGGCTCGGGGCAGATTGGTGATGTCGTAAACGGTGGCGACGAAATGATCGTGACGGTGGAGAAGGATGA
CAAGGTCCGTATCTACCTGAACAACCGCGAGATCACTGGCGGCGGTGTCGATGTGTCATCGATCGCGGCGGGTCTCAATACTGGTGTCCTGATCACCATGACGAACGCCGGTGAGTTGCGTGGGTTCAAGACTGGCGTTCTGACCGACCAGCCTGCTCCTGTGGCGTACATCGAAGACACCTTTGTGTCTGTCCCGGCAACGTCTACCACCGCGAACTTCGTTGTGCCCCTGCCTCCGAACAGGCAAGTGGGCGATCTCATGATCTTGCACGGAGCAGTCACCAACGTCACTTCGCCATCAACAGCTCCGACAGGCTGGTCTGTTGTTCAGCGCCGTACGACAACAACTTCAGGTGCACCGCGACCAGCTATCTACTCGCGTATCTGTGATGGCACTGAAGCGTCAACAGTCGCACTGAACTTTGGTACCAAGGCAAACACGGCAGCAGCCTCGGCAGGGATACTGTTGTTCCGCAACACTCCGCGTACCGGACAGATCATCAATATCAGCGCAAACGGTGGCGGTACAGCAACCACAGCACACGCGACTCCTGCGATCACTACTACAGCATCGGCAGAGGTCGGCCGATCGATGGTAGTGCGCATGGTCTTTTCGAGCCTGGCTGGTGCAGCACCGGCAAGTGCATGGACATGGCCTAGCCCAATGACTGAGATGATTGAGCGTGCAGCAGACACCAATGCGATCAACGACGCGGACTTTGGAGCATCTTGGGCACGCCCCACCGCAGTTGGTTCTCAGGCAGCAGTGACCGCAACGCATTACGCAAGTCGCACATACGGATCACTGTCGTTCGCTATCAACTCGATAGCGATCACATAG
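
The CDS coordinates can be cross-checked against the protein length. The sketch below assumes the range is 1-based and inclusive (the GenBank convention) and that the CDS includes the terminal stop codon; the function names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a CDS given 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(start: int, end: int) -> int:
    """Number of amino acids encoded, assuming the CDS ends with a stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError(f"CDS length {n} is not a multiple of 3")
    return n // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    print(cds_length(21635, 24475))               # 2841
    print(expected_protein_length(21635, 24475))  # 946
```

The range 21635 -> 24475 spans 2841 nt, or 947 codons: 946 amino acids plus the stop codon (TAG), in agreement with the listed protein length of 946 AA. The CDS starts with GTG, an alternative start codon in the bacterial genetic code that is translated as methionine at the initiation position.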

Genome Context

Tertiary structure

PDB ID e884b4b2b6da3dc413c2c9e5a0d97b96b8e0fd318de80c20821bd1715ef2717f
Source ColabFold
Method ColabFold
Resolution 0.7981
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
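
The pLDDT bands listed above can be applied to per-residue scores as follows. This is a minimal sketch: the function name is illustrative, and because the legend uses strict inequalities, assigning the exact boundary values (90, 70, 50) to the lower band is an assumption.

```python
def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT score (0-100) to the confidence bands above.

    Boundary values (exactly 90, 70 or 50) fall into the lower band here;
    the legend does not specify this, so it is an assumption.
    """
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.2, 81.0, 63.5, 42.1):
        print(score, plddt_band(score))
```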