Genbank accession
QIR82330.1 [GenBank]
Protein name
major capsid protein
RBP type
TSP
Evidence DepoScope
Probability 0,93
Protein sequence
MANVFDKIGDIRNDVKRNNFDWSHDNNFTTGLGRIIPVFCTQVPAGSSLRINPTFGLQFMPMMFPVQTKVKAYLSAYRMPLRALWKDYKDWVSSANDQTSQLEPPYMEFLPEFGKGEILGVSGLSDYIGLPVTMIAEEAVPAPSLVSANPDENRRYVGNTVPVYGGTYGVNTTPNTPYSSCASALFNGDYADIYVVRRGFATVSEDTDDSVKTVGMRCTFSFTATGAQASFLLNQLASLETSQVMDKVLGWFAFVDSNSKVRGVASFTGRDVKTSGDESALFVTISSTCTVDAVKPVPSLELNVHVAIANATCFINSDGSKWLYDDKTFNFAGFAQFPISANSCPYYMHGGDNNNALKISAYPYRTYEAIYNAYIRNTKNNPFLINGKPTYNKWIVNDDGGADNAEYVMHYANWASDMFTTAVPSPQQGQAPLVGITTYAESRTLENGHVETTINTALVDEDGNKYKINYESNGEELKNVSYTKLSSDTAVKPLSSLYDLVTSGISINDFRNVNAYQRYLELNMFRGYSYKEIVEGRFDVSIRYDDLNMPEYLGGCTRDVVINPVTQTVQTNSAGTYDGALGSQAGLGLLRGDCENISCYCDEESIVMVLLSVVPMPIYSQALPKYLLYRERLDSFNPEFDHIGFQPIRMSEIAPIQQWIKDDTKMNDVFGYQRPWYEYCQQLDTAHGLFKTDLRNFLINRVFGDVPVLGSEFTTVDESEVNDVFAVTDVSDKILGQIHFDITAKLPISRVVVPKLE
Physico‐chemical
properties
protein length:757 AA
molecular weight: 84282,72590 Da
isoelectric point:4,79354
aromaticity:0,11625
hydropathy:-0,23104

Domains

Domains [InterPro]
IPR037002
ATT
9–195
IPR016184
Unmapped
12–178
IPR003514
STR
14–100
QIR82330.1
1 757
Architecture
ATT
ATT
ATT 9-195 | ATT 309-754 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
QIR82330.1
1 757
Domain Start End Length (AA) Confidence
N-terminal 1 129 129 0,9743
Central domain 130 344 216 0,8694
C-terminal 345 757 412 0,8251
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-129
Central
130-344
C-terminal
345-757

Taxonomy

  Name Taxonomy ID Lineage
Phage Chicken microvirus mg7_6
[NCBI]
2720928 Viruses > Monodnaviria > Sangervirae > Phixviricota > Malgrandaviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QIR82330.1 [NCBI]
Genbank nucleotide accession
MN379637 [NCBI]
CDS location
range 1 -> 2274
strand +
CDS
ATGGCAAATGTTTTTGATAAAATCGGTGACATTCGTAACGATGTAAAACGTAATAACTTCGACTGGTCACACGATAACAATTTCACTACGGGATTAGGTCGTATCATTCCCGTATTTTGTACGCAAGTTCCTGCCGGTTCTTCCTTACGGATTAACCCTACATTTGGCTTACAATTCATGCCTATGATGTTCCCGGTACAGACTAAAGTTAAGGCTTATCTCTCCGCTTATCGTATGCCGTTACGTGCGTTGTGGAAAGATTATAAAGATTGGGTTTCATCTGCAAATGATCAAACTTCACAACTCGAGCCGCCTTATATGGAGTTTCTACCTGAATTTGGTAAAGGGGAAATTCTCGGTGTTTCAGGTCTTTCTGATTACATCGGATTGCCCGTTACTATGATTGCTGAGGAGGCTGTTCCTGCTCCGTCTTTGGTTAGTGCAAACCCTGACGAAAACCGAAGGTATGTGGGTAATACTGTGCCTGTTTATGGTGGTACTTATGGAGTTAATACGACTCCTAATACTCCTTATTCTTCGTGTGCAAGTGCTCTTTTTAATGGAGACTATGCTGATATTTATGTAGTCCGTCGTGGTTTTGCTACCGTTTCTGAGGATACGGATGATTCTGTTAAAACCGTGGGTATGCGTTGTACTTTCTCTTTTACTGCAACTGGTGCTCAGGCTTCTTTTTTGTTGAATCAATTGGCAAGTCTTGAAACCAGCCAAGTCATGGATAAAGTGTTAGGTTGGTTTGCTTTTGTCGATAGTAATAGCAAAGTGCGGGGCGTTGCATCCTTTACGGGTCGTGATGTTAAGACTTCAGGTGATGAATCGGCTTTGTTTGTTACTATATCCTCTACGTGTACCGTGGATGCTGTAAAGCCGGTACCTTCATTAGAGCTTAATGTGCATGTTGCTATTGCCAACGCTACGTGCTTCATTAATTCGGATGGCTCTAAGTGGTTATATGATGACAAAACATTTAATTTTGCCGGCTTTGCTCAATTCCCGATAAGTGCAAATAGTTGTCCGTATTATATGCATGGTGGGGATAATAACAACGCATTGAAGATTTCCGCGTACCCTTATCGTACTTATGAGGCTATCTATAACGCTTATATTCGAAATACTAAGAATAATCCGTTCCTGATTAACGGTAAGCCGACTTACAATAAGTGGATTGTAAATGATGATGGCGGAGCTGATAATGCTGAATACGTTATGCATTATGCTAACTGGGCTTCCGATATGTTTACTACGGCTGTTCCGTCTCCCCAGCAAGGACAAGCACCGCTCGTTGGTATTACTACGTACGCTGAAAGTCGTACTTTAGAAAATGGTCATGTAGAAACTACTATTAATACTGCGTTAGTTGACGAAGATGGAAATAAGTATAAGATAAACTATGAAAGTAACGGTGAAGAACTCAAAAACGTATCATATACGAAGTTGTCCTCTGATACTGCCGTAAAACCGTTATCTTCGTTGTACGATTTAGTTACATCAGGTATTTCTATCAACGACTTCCGTAACGTTAACGCTTACCAACGTTATCTCGAACTTAATATGTTCCGTGGTTATTCATACAAGGAAATAGTCGAAGGTCGTTTTGACGTGTCGATTCGTTACGATGACTTAAATATGCCTGAATATCTTGGAGGTTGTACTCGTGACGTTGTAATCAATCCTGTTACTCAGACAGTACAGACAAATAGCGCTGGTACGTATGACGGTGCGTTAGGTTCTCAGGCTGGTCTCGGCTTACTTCGTGGTGATTGTGAAAATATTAGCTGTTATTGTGATGAGGAATCAATCGTGATGGTGTTGTTGTCCGTAGTTCCTATGCCTATCTACTCTCAGGCTTTGCCTAAATATCTGTTGTATCGCGAACGTTTGGATAGCTTCAATCCTGAATTTGACCACATCGGATTCCAGCCTATTAGGATGTCTGAGATTGCTCCTATTCAGCAATGGATAAAAGATGATACAAAAATGAATGACGTCTTTGGTTATCAACGGCCTTGGTATGAGTATTGTCAGCAGTTAGACACCGCTCACGGTTTATTTAAGACAGACTTAAGAAATTTCTTAATCAATCGTGTTTTTGGTGATGTCCCCGTATTAGGTAGCGAGTTTACAACTGTGGATGAATCGGAAGTTAACGATGTATTTGCGGTAACTGATGTATCTGATAAGATTCTTGGACAAATCCATTTCGATATTACGGCTAAGTTGCCGATTTCACGTGTTGTTGTACCTAAATTAGAATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
af37cd7dd3cc1d4e1a5ca5f941a670cdf870cc1fc480779ea40d484e29f1b70b
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,4335
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Identification of single stranded DNA viruses in chicken tracheal swab swabs Chrzastek,K., Kapczynski,D., Kulkarni,A., Chappell,L., Schmidlin,K. and Varsani,A. 2021-12-16 GenBank