GenBank accession
XTK83704.1 [GenBank]
Protein name
tail protein
RBP type
TF
Evidence RBPdetect2
Probability 0.92
Protein sequence
MQKFETNCLWAMKSGDGLILYREAREICRSNGVIRLRKKVRKLASSGSFSGSIRDGHYAVRVDWTQTKNVSENTSTITCRIYLVNDWSLSINSRNNNTMTIDGKAQNFTSPSIGSTGTHLLGTVSQKVTHGSDGGKAISMSVVFKIQATLSGTFYDSITANTTVTLDSIPRASSVTASNAELGKASVIQISRASSAFTHTLVYAFGGATGTIVSKTASTSVTWNPPISLANQIPKAVTGTCTITCTTYNGSTNIGSKTCTLTLSVPASIKPSITSLTAARVDGEVPSAWGIYVQTKSKATLKINGAAGSYGSNITAYSITGGGYTSTASSFTTGFLNTEGTITFSAVVTDSRGRTSEEKTVSITVVPYSPPSFINSTSQRCLSNGAVNEDGTYVRSVINFKFASCGGKNTASGTIHYKRTTAESWIAAGAFVSGAAVVFGGGNISTEYSFDVRYTLKDMFSSVSIQEIISTAAVVMDFKRGGKGVAVGKVAEDENVFEVSEDWDVRVYGKLLKDYIQQFAKTIYPVGSIYMSVVNTNPSQFFGGTWVAWGAGRVPVGINTGDGNFNTVEKIGGSSVVTLTANQMPSHTHTFTGNATTTGSAGGHTHNIGRDTDGGAGSSRYTVHGSGVSGADATAPTSNAGSHTHSLTPKGTIAKAGGGASHSNLQPYIVCYMWKRTA
Physico-chemical properties
protein length: 678 AA
molecular weight: 71159.71220 Da
isoelectric point: 9.37823
aromaticity: 0.08260
hydropathy: -0.12743
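
The record does not say which tool produced these values. As a minimal sketch, assuming the conventional definitions (aromaticity as the fraction of F, W and Y residues; hydropathy as the mean Kyte-Doolittle score, often called GRAVY), length, aromaticity and hydropathy could be recomputed from the protein sequence roughly as follows. The function names are illustrative. Molecular weight and isoelectric point are left out because they depend on the mass table and pKa set used.

```python
# Kyte-Doolittle hydropathy scale (per-residue values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W or Y (assumed definition)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


def summarize(seq: str) -> dict:
    """Collect the three sequence-only properties, rounded to 5 decimals."""
    return {
        "length": len(seq),
        "aromaticity": round(aromaticity(seq), 5),
        "hydropathy": round(gravy(seq), 5),
    }


if __name__ == "__main__":
    # Only the first ten residues of the protein are used here for brevity;
    # pass the full sequence to compare against the values in this record.
    print(summarize("MQKFETNCLW"))
```

Whether the full sequence reproduces the listed figures exactly depends on the definitions the database actually used, which the record does not state.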

Domains

Domains [InterPro]
Domain ID | Class | Residues
DC_1448 | ATT | 1–55
IPR008577 | STR | 60–539
DC_2084 | STR | 480–678
SSF88874 | STR | 521–671
XTK83704.1 | full-length protein | 1–678
Architecture
ATT 1-55 | STR 56-678
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
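
The record does not describe how the individual domain hits are condensed into the architecture line. One plausible approach, sketched here under that assumption, is to merge overlapping hits of the same class into a single segment. The `merge_hits` helper is hypothetical; the hit coordinates are the ones listed in this record.

```python
from collections import defaultdict

# (class, start, end) for each domain hit in this record; 1-based, inclusive.
HITS = [("ATT", 1, 55), ("STR", 60, 539), ("STR", 480, 678), ("STR", 521, 671)]


def merge_hits(hits):
    """Merge overlapping or directly adjacent hits of the same class."""
    by_class = defaultdict(list)
    for cls, start, end in hits:
        by_class[cls].append((start, end))

    segments = []
    for cls, spans in by_class.items():
        spans.sort()
        cur_start, cur_end = spans[0]
        for start, end in spans[1:]:
            if start <= cur_end + 1:  # overlaps or abuts the current segment
                cur_end = max(cur_end, end)
            else:
                segments.append((cls, cur_start, cur_end))
                cur_start, cur_end = start, end
        segments.append((cls, cur_start, cur_end))

    # Order segments along the protein, N-terminus first.
    return sorted(segments, key=lambda seg: seg[1])


if __name__ == "__main__":
    print(" | ".join(f"{c} {s}-{e}" for c, s, e in merge_hits(HITS)))
    # prints: ATT 1-55 | STR 60-678
```

Plain merging gives STR 60-678, whereas the record reports STR 56-678, so the database evidently also assigns the uncovered residues 56-59 to the STR segment. The sketch does not attempt to reproduce that step.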

Taxonomy

Role | Name | Taxonomy ID | Lineage
Phage | Faecalimonas phage NatCom_26447 [NCBI] | 3403528 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host | Faecalimonas sp. [NCBI] | 2005356 | cellular organisms > Bacteria > Bacillati > Bacillota > Clostridia > Lachnospirales

Coding sequence (CDS)
GenBank protein accession
XTK83704.1 [NCBI]
GenBank nucleotide accession
PV036436 [NCBI]
CDS location
range 13796 -> 15832
strand -
CDS
ATGCAGAAATTCGAAACAAACTGTCTCTGGGCAATGAAGTCCGGGGATGGTTTGATTTTATACCGAGAAGCACGGGAAATTTGTCGATCAAATGGCGTGATCCGTCTTAGAAAGAAGGTGAGAAAGTTGGCATCCAGTGGAAGTTTTTCGGGATCTATCCGTGATGGACATTATGCGGTGCGGGTAGATTGGACGCAGACGAAGAATGTGTCGGAGAATACATCTACAATTACCTGCAGGATTTATCTTGTTAATGACTGGTCATTGTCGATTAACAGCAGAAATAACAATACGATGACGATTGATGGAAAGGCTCAGAATTTCACATCCCCTTCCATCGGTTCAACAGGAACACATTTGTTGGGAACGGTATCTCAGAAAGTCACGCATGGAAGTGACGGTGGGAAGGCAATTTCCATGAGTGTTGTGTTTAAGATTCAGGCAACATTATCAGGAACTTTTTACGATTCTATTACGGCAAATACAACCGTAACTTTGGATTCCATTCCAAGAGCATCTTCCGTGACGGCTTCAAATGCAGAGTTGGGGAAGGCGTCCGTGATACAAATTTCTCGTGCATCTTCTGCGTTTACCCACACACTGGTATACGCTTTTGGCGGTGCGACGGGGACGATTGTTTCAAAAACAGCATCCACCTCGGTTACATGGAATCCACCGATTTCATTGGCAAATCAGATACCAAAAGCGGTTACGGGAACCTGTACGATTACCTGCACGACCTATAACGGAAGCACAAATATTGGAAGTAAGACATGTACATTGACTTTGAGCGTTCCGGCCTCCATTAAGCCATCTATTACCAGTCTGACTGCAGCGAGGGTGGATGGAGAAGTGCCATCTGCATGGGGCATTTATGTACAGACAAAATCAAAAGCAACCTTAAAAATCAATGGTGCAGCAGGTAGCTACGGATCGAATATTACTGCATATTCCATAACAGGCGGAGGTTATACCAGTACGGCATCCAGCTTTACAACAGGGTTTCTGAATACGGAAGGTACGATCACATTCTCTGCTGTTGTTACGGATTCTAGGGGAAGAACTTCAGAAGAGAAGACAGTATCGATTACGGTAGTTCCCTATTCGCCTCCGTCTTTTATCAATTCAACATCGCAAAGATGTTTGAGTAATGGGGCGGTCAATGAGGACGGGACTTATGTACGAAGTGTGATAAATTTTAAATTTGCATCATGTGGGGGAAAGAATACTGCATCTGGAACAATTCATTATAAAAGAACAACAGCAGAGTCATGGATAGCAGCAGGGGCATTTGTATCAGGAGCGGCTGTTGTTTTTGGCGGGGGAAATATTTCTACAGAATATTCCTTTGATGTTCGTTATACCCTGAAAGATATGTTTTCTTCTGTTTCAATACAAGAGATCATTTCTACAGCAGCAGTAGTAATGGATTTTAAACGTGGCGGCAAAGGAGTAGCTGTTGGTAAAGTTGCAGAGGATGAAAATGTGTTTGAAGTGTCAGAAGACTGGGATGTGCGGGTTTATGGAAAACTTCTTAAGGATTATATCCAACAGTTTGCGAAAACAATCTATCCGGTTGGAAGTATTTATATGAGTGTGGTTAATACAAATCCAAGCCAATTTTTTGGAGGAACTTGGGTGGCGTGGGGAGCAGGAAGAGTCCCTGTAGGTATCAATACTGGTGATGGGAATTTTAATACTGTGGAAAAAATAGGTGGTTCTTCTGTGGTAACACTTACTGCAAATCAAATGCCCTCCCACACGCATACTTTTACAGGAAACGCCACAACAACAGGAAGTGCCGGAGGTCATACACACAACATTGGACGTGATACAGACGGAGGTGCAGGGAGCAGCAGATATACCGTGCATGGAAGTGGTGTATCGGGAGCAGATGCGACAGCACCGACAAGCAATGCTGGAAGTCATACGCATTCCCTGACACCAAAGGGGACAATCGCAAAAGCGGGTGGTGGAGCTTCCCATTCGAATTTACAGCC
ATATATTGTCTGCTATATGTGGAAACGTACAGCATAA
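
The CDS coordinates, strand and protein length can be cross-checked with a few lines of code. The sketch below assumes 1-based inclusive coordinates and the standard codon assignments; the helper names and the toy genome in the usage lines are illustrative, not part of the record. Because the CDS shown here begins with ATG, it is taken to be already in coding orientation, so reverse-complementing is only needed when slicing the minus-strand region out of the genome record.

```python
from itertools import product

# Standard codon assignments in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive range."""
    return end - start + 1


def reverse_complement(seq: str) -> str:
    """Reverse complement of an uppercase A/C/G/T sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def extract_cds(genome: str, start: int, end: int, strand: str) -> str:
    """Slice a 1-based inclusive range; reverse-complement on the minus strand."""
    region = genome[start - 1:end]
    return reverse_complement(region) if strand == "-" else region


def translate(cds: str) -> str:
    """Translate complete codons until the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    n = cds_length(13796, 15832)
    print(n, n // 3 - 1)                    # prints: 2037 678
    print(translate("ATGCAGAAATTCGAAACA"))  # prints: MQKFET
    # Toy genome (illustrative) carrying "ATGTAA" on the minus strand.
    print(extract_cds("GGTTACATCC", 3, 8, "-"))  # prints: ATGTAA
```

The range 13796 -> 15832 spans 2037 nt, i.e. 679 codons, which matches the 678-residue protein plus one stop codon. The first six codons translate to MQKFET, the start of the listed protein sequence.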

Genome Context

Tertiary structure

PDB ID
10a64e81e7e827711228da0f5dbe5b9616a6fe5dffe11ecdb6f1835a998379b5
Source ESMFold
Method ESMFold
Resolution 0.8695
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
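
The confidence bands above translate directly into a small classifier for per-residue pLDDT scores. This is a minimal sketch: the function name is illustrative, and because the legend does not say which band the exact boundary values 90, 70 and 50 belong to, the code assigns them to the lower band as an assumption.

```python
from collections import Counter


def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend."""
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"  # boundary values fall into the lower band (assumption)


def band_counts(scores):
    """Count how many residues fall into each confidence band."""
    return Counter(plddt_band(s) for s in scores)


if __name__ == "__main__":
    # Illustrative scores only; they are not taken from this model.
    print(band_counts([95.2, 88.0, 71.5, 64.3, 42.0]))
```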