GenBank accession
CAB4141343.1 [GenBank]
Protein name
Phage tail collar domain containing protein
RBP type
TF | Evidence: RBPdetect | Probability: 0.66
TF | Evidence: RBPdetect2 | Probability: 0.91
Protein sequence
MAYIGNGRTLLVLGSNVRDDITPGYNNPNNSYPQDGPFDKNIFNLSQEVPGGYEGNVYVFRQKYITERLITNSASGEISIPSSAGTTFTITSSNTSIAAALSDIKESIKLYADLNHTLTISGSGIAANNGTFQIVGCTYNGTTITITLNKNGLQASSTESGSYTISRGYSGFWEVLEPEIDYRIVGTGVNLNKQIEFINTAQGGAAMPPQINDKIYVIHKGDATYNLVPSDNSVGPNQLSANLRNFVTDTFTGNGSTNTYTLSQTAVSEKSLLVTVNGVVKEDTTEYTLNVAGTQITFTSGNTPANGAKIRILHLGFSTVSRRIALSPGQVTTAVDPGSITSLELASDSVITSKISDSNVTTQKIANDSINSTKILLTNNTSLRGLSSTSSIIPLLSISPSDETLLNSGGATSLSFNSTKAINFTSTSILPEVDNAVSFGSALKRFTTANFSGAITSGSISTGSISSGAVTTTGNISVTGNISVTGTVDGVDISTKISEIESLLSALVPIGTIAQYSGSSPTATLINDKWMLCDGVTVNRLDYPLLWNLFSDNGTISSPYGNGNGTTTFTLPDLRRRVPIGLGPTDSLGNNDGLLVADRSLNHTHTVPQHAHGLNNHTHSIPAHFHGLGTGSDLQISVASGTHTTNIDISHGHTATSQNNTAGLTANGANANVTFTDPGHSHGGFTGYDNPSHAHTINTTDLNHAHQFDRQGGGRDPADRYNSEMTARGVSQTAHNHDQGTGGDISFGISTAPESSGNQTTTGQFSIAIRNTLNTQNLNHTHTMELASINHRHTITSSGTGASITQTNHTHTVDAHTHPIVVNNLPATTRSDTSGVHTHPTSSFSGRIGLITGGVDGNLNMTSGPSSGNTENSSVLTSGSTLNTPYIILNYIIKVK
Physico‐chemical properties
protein length: 896 AA
molecular weight: 93887.96380 Da
isoelectric point: 5.93373
aromaticity: 0.05580
hydropathy: -0.25681
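
The record does not say how the aromaticity and hydropathy values were computed. A minimal sketch, assuming the common definitions (aromaticity as the fraction of F, W and Y residues; hydropathy as the mean Kyte-Doolittle score, often called GRAVY), is given below. The function names and the 20-residue fragment are illustrative only, and the results need not match the database values to the last digit.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W or Y (assumed definition)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 20 residues of the protein sequence above, used only as a demo input.
fragment = "MAYIGNGRTLLVLGSNVRDD"
print(len(fragment), round(aromaticity(fragment), 5), round(gravy(fragment), 5))
```

Passing the full 896-residue sequence instead of the fragment gives values that can be compared with the aromaticity and hydropathy fields listed above.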

Domains

Domains [InterPro]
DC_1502 | STR | 140–449
DC_1872 | RBD | 375–675
SSF88874 | STR | 504–583
IPR011083 | ATT | 511–579
Architecture (CAB4141343.1, residues 1–896)
STR 140–449 | RBD 450–499 | ATT 500–579 | RBD 580–675
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

Phage: uncultured Caudovirales phage [NCBI]
Taxonomy ID: 2100421
Lineage: Uroviricota > Caudoviricetes > Peduoviridae > Maltschvirus maltsch
Host: No host information

Coding sequence (CDS)

GenBank protein accession
CAB4141343.1 [NCBI]
GenBank nucleotide accession
LR796388 [NCBI]
CDS location
range 127216 -> 129906
strand +
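
As a consistency check, the CDS coordinates can be tied back to the protein length. The sketch below assumes the range is 1-based and inclusive (the GenBank convention) and that the CDS is complete and ends in a single stop codon. The helper names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of an inclusive, 1-based coordinate range."""
    return end - start + 1


def expected_protein_length(start: int, end: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return n // 3 - 1


print(cds_length(127216, 129906))               # 2691
print(expected_protein_length(127216, 129906))  # 896
```

The result, 896 residues, agrees with the protein length given in the physico-chemical properties.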
CDS
ATGGCTTATATAGGAAATGGAAGAACATTATTAGTTTTAGGATCCAATGTTAGGGATGATATAACTCCTGGATACAATAATCCTAATAACTCTTATCCGCAAGATGGTCCTTTCGATAAAAACATATTTAATTTAAGTCAAGAAGTTCCTGGTGGTTATGAAGGAAATGTATATGTTTTTAGACAAAAGTATATTACGGAGCGTTTAATTACAAATTCGGCTTCTGGAGAAATAAGTATCCCCTCTAGTGCAGGAACCACATTTACAATAACATCTTCAAATACATCTATTGCAGCCGCTCTTTCTGATATAAAAGAGTCAATAAAATTATATGCAGATTTGAATCATACTCTAACTATCAGCGGTTCTGGAATTGCAGCAAATAATGGAACATTTCAAATTGTAGGATGTACATATAATGGAACTACAATAACAATAACTCTTAATAAAAACGGACTTCAAGCCAGCTCCACCGAAAGCGGATCATATACAATATCTAGAGGTTATTCTGGGTTTTGGGAAGTATTAGAACCAGAGATAGATTATAGAATCGTAGGAACTGGAGTAAATTTAAATAAACAAATAGAATTCATAAACACTGCTCAGGGTGGCGCAGCAATGCCACCTCAAATAAATGATAAGATTTATGTTATTCACAAAGGTGATGCAACCTATAATTTAGTTCCATCTGACAATTCTGTTGGACCAAATCAATTATCTGCAAATCTTAGAAATTTTGTAACCGACACCTTTACTGGTAATGGTTCTACTAATACATATACACTCAGTCAAACTGCTGTTAGTGAAAAATCTCTGCTTGTTACGGTTAATGGTGTTGTAAAAGAAGATACAACCGAATATACATTGAATGTTGCGGGAACTCAAATAACATTTACTTCAGGAAATACTCCTGCAAATGGAGCGAAAATAAGAATTTTGCATCTTGGCTTTTCAACAGTAAGTAGAAGAATTGCATTATCTCCAGGACAAGTTACCACGGCAGTTGATCCTGGATCTATAACATCATTAGAGTTAGCTTCGGATTCTGTTATTACTTCAAAAATTTCCGATTCAAATGTAACCACTCAAAAGATTGCAAATGATTCTATAAATTCAACCAAAATATTGCTCACAAACAATACTTCGTTAAGAGGATTATCTTCAACCAGTTCTATAATTCCTTTATTGTCCATTAGTCCAAGCGATGAAACTTTATTAAATTCTGGAGGGGCAACTTCCTTATCATTCAATTCAACTAAAGCCATAAACTTTACATCAACTTCAATATTACCGGAAGTAGACAATGCAGTAAGTTTTGGATCAGCACTAAAAAGATTTACAACAGCAAATTTTAGTGGAGCAATAACTTCAGGATCAATTTCTACCGGATCTATCTCTTCTGGTGCAGTTACAACTACTGGAAATATTTCAGTTACCGGAAACATTTCTGTGACTGGAACAGTAGATGGTGTAGACATATCAACAAAGATAAGTGAAATTGAATCATTATTATCCGCTCTTGTTCCCATAGGAACTATAGCTCAATATAGTGGAAGTTCACCAACTGCTACTTTAATAAATGACAAATGGATGTTATGTGATGGTGTCACAGTAAATAGATTAGATTATCCACTTCTATGGAATCTTTTTAGTGATAATGGTACAATTTCTTCTCCATATGGTAATGGAAATGGAACAACAACATTTACTCTACCAGACTTACGCAGAAGAGTTCCCATTGGATTAGGACCAACAGATAGTTTAGGAAATAATGACGGTCTGTTAGTAGCAGATAGAAGTCTAAATCACACGCACACAGTTCCACAACATGCTCATGGATTAAACAATCACACACACAGTATTCCAGCTCACTTTCATGGTCTTGGAACTGGATCGGACTTACAAATAAGTGTTGCTTCTGGAACTCATACTACTAATATAGACATATCGCATGGTCATACTGCAACATCTCAAAATAACACCGCAGGATTAACAGCAAA
TGGGGCTAATGCGAATGTGACATTTACAGATCCTGGGCATAGTCATGGTGGTTTTACCGGATATGATAATCCATCTCATGCTCATACCATAAACACCACTGACTTAAACCATGCACATCAATTTGATCGACAAGGTGGTGGAAGAGATCCCGCAGATAGATACAACAGTGAGATGACCGCCAGAGGTGTGTCACAAACAGCACACAATCACGATCAAGGAACTGGAGGTGATATTTCCTTTGGAATATCAACCGCTCCAGAATCTAGTGGAAATCAAACAACAACGGGTCAATTTTCTATAGCAATTAGAAATACATTAAATACTCAAAATTTAAATCACACGCACACGATGGAGTTAGCTAGTATAAATCATAGACACACTATAACATCAAGTGGTACTGGGGCTTCAATAACTCAAACAAATCACACTCATACTGTGGATGCACACACGCACCCCATTGTAGTAAATAACTTACCAGCAACAACAAGAAGCGACACTTCTGGAGTTCACACACATCCAACTTCATCATTTTCTGGAAGAATAGGATTGATAACCGGTGGAGTTGATGGTAATTTAAATATGACCAGCGGACCAAGTTCTGGAAATACAGAAAATTCTTCAGTTTTAACTTCTGGTTCAACATTAAATACTCCATACATAATCTTAAATTATATAATAAAAGTCAAATAA

Genome Context


Tertiary structure

PDB ID 2e95df563b86b03f1f5365c8847bca4ef518820a5566bf254ccccef5d8225425
Source ColabFold
Method ColabFold
Resolution 0.3034
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
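
The confidence bands above can be applied to per-residue pLDDT scores with a small helper. This is a sketch only: the legend uses strict inequalities and does not assign the boundary values 90, 70 and 50, so placing them in the lower band is an assumption made here. The function name is illustrative.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the legend.

    Assumption: scores exactly on a boundary (90, 70, 50) fall in the lower band.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for s in (95.2, 81.0, 63.4, 42.7):
    print(s, plddt_band(s))
```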