Genbank accession
QEQ94133.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect2
Probability 0,88
Protein sequence
MAVSTTPRFGVETYSAGTDPHPGRTKFNERMLAFEALAAIATQGATGSRPSAGKGRAFYWDTTVDRLYFDNGTAWKEVTTNGAGGPGAAIVPGAAAVEGTSSRSARADHTHSLPLATAAVNGAMAAADKAKLDAASAAPTPNTLVMLDANGRTQVAAPSAGGDTTNKTYVDAQVATRATTAHSHAAADISSGVLAAARLPASTSAAAGSMSAADKAKLDAASAAATPSTLVLLDGNGRAAVANPSLAGEIANKGYVDTQVATRATTAHTHAWADITSGVPTTFTPAAHVHAAADVSSGVFAAARLPAATGAAQGAMSAADKTKLDGATALATPSTLVMLDAAGRAAVAAPSGGGDIANKTYVDGQVATKANTTHTHTWGQITGAPATYAPSAHTHDWFDLTGVPTASRTQSGVMSASAYALLYDATAAYATSNIVMRDANGNIEINRPVQDIDGANKLYVDDQMNAAKAGKLDVSVFTAAISTSASARAIRSPNLGSYMTFYDNGVVESPAIYNTNAASGSGFRAVWVNNTGGLGYNLSSEKFKTNIQPYEVPLEVLDKIEPKRFQYKENVAEMGEDAPFRVNFIAEDLHDAGLTEYVSYDENGTERENCQTINESLMVNALWSFAQQQQSQIKAMQNQLNEMAK
Physico‐chemical
properties
protein length:645 AA
molecular weight: 66469,80040 Da
isoelectric point:5,62617
aromaticity:0,06512
hydropathy:-0,18558

Domains

Domains [InterPro]
DC_0757
STR
234–284
IPR030392
CHP
539–597
QEQ94133.1
1 645
Architecture
STR
STR 1-645
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Arthrobacter phage Mordred
[NCBI]
2601685 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QEQ94133.1 [NCBI]
Genbank nucleotide accession
MN204499.1 [NCBI]
CDS location
range 23708 -> 25645
strand +
CDS
ATGGCCGTTTCAACAACACCCCGCTTTGGCGTAGAGACTTATAGCGCTGGAACTGACCCGCACCCCGGGCGAACCAAGTTCAACGAGCGCATGCTGGCGTTCGAGGCGCTGGCAGCTATCGCCACGCAAGGCGCCACGGGCTCGCGCCCGTCTGCCGGTAAAGGCCGCGCGTTCTACTGGGACACGACCGTCGACAGGCTCTATTTCGACAACGGCACAGCCTGGAAGGAAGTCACCACCAACGGCGCTGGTGGCCCGGGCGCGGCGATTGTCCCGGGCGCTGCGGCGGTTGAGGGCACGTCGTCCCGTTCCGCTCGCGCTGACCACACCCACAGCTTGCCGTTGGCCACGGCTGCGGTGAACGGTGCCATGGCGGCTGCTGATAAGGCCAAGCTTGACGCTGCCAGCGCTGCCCCCACCCCCAACACTTTGGTGATGCTGGACGCCAACGGGCGCACTCAGGTGGCTGCCCCGTCCGCCGGCGGCGACACCACGAACAAGACGTATGTTGACGCCCAAGTTGCCACGAGGGCGACCACGGCCCACAGCCACGCGGCTGCGGACATTTCCAGCGGTGTCTTGGCTGCGGCTCGCTTGCCTGCTTCCACCTCCGCTGCTGCGGGTTCCATGTCCGCCGCTGACAAAGCCAAGTTGGACGCAGCGTCGGCAGCGGCCACCCCATCAACGCTTGTCCTGTTGGACGGCAACGGACGCGCCGCCGTCGCCAACCCCAGTCTTGCCGGGGAAATCGCAAACAAGGGATACGTTGACACCCAAGTTGCCACAAGGGCCACGACGGCGCACACGCACGCTTGGGCGGACATTACCAGCGGCGTGCCCACCACCTTTACCCCCGCCGCGCACGTCCATGCGGCTGCTGACGTGTCCAGCGGCGTCTTTGCGGCTGCCCGTCTTCCCGCAGCGACCGGCGCGGCACAAGGGGCCATGAGCGCAGCGGACAAGACCAAGCTGGACGGCGCCACCGCGCTGGCCACCCCGTCCACACTGGTGATGCTGGACGCTGCCGGACGTGCCGCCGTGGCTGCCCCATCGGGCGGCGGCGACATTGCAAACAAGACCTATGTGGACGGTCAGGTTGCCACCAAGGCCAACACCACCCACACCCACACATGGGGCCAAATCACGGGCGCGCCGGCGACCTATGCCCCGTCCGCGCACACGCATGATTGGTTCGACCTCACGGGCGTGCCCACGGCGTCCCGCACCCAGTCCGGCGTCATGTCAGCTTCTGCTTACGCGCTGCTCTACGACGCCACGGCGGCTTACGCCACGTCCAACATTGTCATGCGTGACGCGAACGGCAACATCGAGATTAACCGACCGGTTCAGGACATTGACGGCGCTAACAAGCTCTACGTTGACGACCAGATGAACGCGGCCAAGGCCGGAAAGCTGGACGTGTCGGTGTTCACCGCCGCGATTTCCACCAGTGCCAGCGCTCGCGCGATCCGGTCCCCTAACCTGGGCTCCTACATGACGTTCTACGACAACGGCGTGGTTGAGTCCCCGGCGATCTACAACACCAACGCCGCGAGCGGTTCCGGCTTCCGTGCTGTCTGGGTCAACAACACTGGCGGTTTGGGCTATAACCTGTCGTCGGAAAAGTTCAAGACCAACATTCAGCCCTACGAGGTGCCGCTGGAAGTCTTGGACAAGATCGAACCTAAGCGGTTCCAGTACAAAGAGAACGTGGCCGAAATGGGCGAGGACGCCCCGTTCCGGGTCAACTTCATTGCGGAAGACCTCCACGACGCCGGCCTAACCGAATATGTCAGCTACGACGAAAACGGCACCGAGCGCGAGAACTGCCAAACGATCAACGAGTCGTTGATGGTCAACGCGCTGTGGAGCTTCGCACAGCAACAGCAAAGCCAGATCAAGGCCATGCAGAACCAACTCAACGAAATGGCCAAGTGA

Genome Context

Genome Context

Tertiary structure

PDB ID
172869cc50b0f45ae8119db31fdae189a30dc10e9e2082e3fbd0c10f14ca4a4e
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6722
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50