UniProt accession: A0A8A6NLL7 [UniProt]
Protein name: Long tail fiber protein Gp37
RBP type
Type Evidence Probability
TF UniProt/TrEMBL 1.00
TF GenBank 1.00
TSP DepoScope 1.00
TF RBPdetect 0.81
TF RBPdetect2 0.94
Protein sequence
MADLKAGTTVGGNTIWSQANLPLLPSGNTITYKGFKIYTENDKPTKAEIGLGNVTNDAQVKKAGDDMTGNLTLTNNASLTLPRRINFVSDNSSAVYTVATRSDNNAVTDPFPSSETRVYSLLVNTKSHTSDPAGGATLAAHLFNQKASGAGSMFIQVFGPPNKTTGASGSVTGQLLMNGENGQLQMTGSGLTIGMQTVVNGTMFAQQMAIKGDGNKNLFFQDAGGNELSLIYADTGKNLYVRTGGGAYATRFASDGTLVLANHLTVPGQATFNGLGVFNNRASVESDYATQGANTAILRVTTNGDGNAVGDGVTHLGYKDANGRYNHYFRGTGSTFINTKLGLNVNVPSYYNDVPQGTPAQYFLPTVGQGGKNYLRQFRGGNADTIWHETVQGGVWRLATGSTDAQEELQVSTAGYCRTRQEFQSTAVKGDGGQFRAVAGNYGFIIRNDGGNTYFLLTDSGDPYGSWGSLRPITISNGNGVVSITNGANINGSVSFGNAVNFNGTVETNGIGCGTANGLGTTGISLGDNDTGFKQEGDGILNAYANSQRIMRWTTGATANYKQLQVQGVNGPALLLNNTATNQSCYLLITLAGNNGAYFGFGGADDNVSVHNYRLNTTLQLRTSDLYMNRGLYVEGNVNANDVYIRSDIRLKSNLVELKDSLSKIEQLKGYIYDKQSKDADDIVYHRESGLIAQDVEKVLPEAVREDTDTGMLTISPSGINALLVNAINELRERLEAIENKLGA
Physico-chemical properties
protein length: 744 AA
molecular weight: 78999.50920 Da
isoelectric point: 5.73422
aromaticity: 0.08065
hydropathy: -0.33871
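The aromaticity and hydropathy values above can be approximated directly from the protein sequence. The sketch below assumes the common definitions (aromaticity as the fraction of F, W and Y residues; hydropathy as the mean Kyte-Doolittle score, often called GRAVY). The record does not state which tool or scale produced its values, so this is an illustration and not the database's own method. The function names are hypothetical.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are aromatic (F, W or Y)."""
    seq = seq.upper()
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 40 residues of the protein sequence listed in this record,
# used only to keep the example short.
fragment = "MADLKAGTTVGGNTIWSQANLPLLPSGNTITYKGFKIYTE"
print(round(aromaticity(fragment), 5), round(gravy(fragment), 5))
```

Passing the full 744-residue sequence instead of the fragment gives values that can be compared with the ones reported above. Under this definition, an aromaticity of 0.08065 over 744 residues would correspond to about 60 aromatic residues.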

Domains

Domains [InterPro]
Accession Type Range
DC_1202 STR 345–744
G3DSA:6.20.80.10 STR 571–630
IPR030392 CHP 647–742
Architecture
STR 1-427 | ATT 428-475 | STR 476-744
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
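
The architecture string can be parsed into typed segments and checked for full coverage of the 744-residue sequence. This is a minimal sketch that assumes the `TYPE start-end | TYPE start-end` layout shown above with 1-based inclusive coordinates. The helper names `parse_architecture` and `tiles_sequence` are hypothetical and not part of any database API.

```python
import re


def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Split 'STR 1-427 | ATT 428-475' into (type, start, end) tuples."""
    segments = []
    for part in text.split("|"):
        match = re.fullmatch(r"\s*(\w+)\s+(\d+)-(\d+)\s*", part)
        if match is None:
            raise ValueError(f"unrecognized segment: {part!r}")
        segments.append((match.group(1), int(match.group(2)), int(match.group(3))))
    return segments


def tiles_sequence(segments: list[tuple[str, int, int]], length: int) -> bool:
    """True if the segments cover residues 1..length with no gaps or overlaps."""
    expected_start = 1
    for _, start, end in segments:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == length + 1


segments = parse_architecture("STR 1-427 | ATT 428-475 | STR 476-744")
print(segments)
print(tiles_sequence(segments, 744))
```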

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
Domain Start End Length (AA) Confidence
N-terminal 1 459 459 0.1239
Central domain 460 658 200 0.2028
C-terminal 659 744 85 0.9974
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue, 1-459), Central (green, 460-658), C-terminal (pink, 659-744).

Taxonomy

  Name Taxonomy ID Lineage
Phage Enterobacter phage PF-CE2 [NCBI] 2810367 Uroviricota > Caudoviricetes > Pantevenvirales > Pseudotevenvirus > Pseudotevenvirus leb
Host No host information

Coding sequence (CDS)

GenBank protein accession: QTJ24436.1 [NCBI]
GenBank nucleotide accession: MW629017 [NCBI]
CDS location: range 170598 -> 172832, strand +
CDS
ATGGCAGATTTAAAAGCGGGTACTACCGTTGGCGGTAATACTATCTGGAGTCAGGCGAACTTGCCTTTGCTCCCTTCTGGAAACACAATCACATATAAAGGCTTTAAAATCTACACCGAAAACGATAAGCCAACAAAGGCAGAAATCGGATTAGGCAACGTAACGAACGATGCACAGGTGAAGAAAGCTGGCGACGATATGACAGGCAACTTGACATTAACGAACAATGCTTCATTAACGCTCCCTCGTCGCATCAATTTTGTTTCTGATAATTCATCGGCGGTGTATACTGTTGCAACGCGTTCTGATAACAACGCAGTTACAGATCCATTTCCATCATCGGAAACTCGCGTATACAGTTTATTAGTTAATACCAAATCGCATACAAGCGATCCTGCGGGTGGTGCTACTTTAGCCGCACATCTGTTTAACCAGAAAGCATCCGGTGCTGGTTCGATGTTTATACAGGTATTTGGCCCACCTAACAAGACAACAGGTGCTTCTGGATCGGTGACCGGACAACTGTTGATGAACGGGGAAAACGGTCAACTACAAATGACAGGTTCCGGTTTAACGATCGGGATGCAAACTGTTGTTAATGGTACGATGTTCGCTCAACAAATGGCTATCAAAGGCGACGGAAACAAAAACCTGTTCTTCCAGGATGCTGGCGGTAATGAGCTTAGTTTGATCTATGCTGATACAGGAAAGAACTTATATGTTCGTACTGGTGGTGGTGCATACGCAACCAGATTTGCATCTGATGGAACTTTAGTTCTCGCTAACCATCTAACTGTTCCTGGACAGGCTACATTTAATGGTCTGGGTGTATTCAATAACCGTGCTTCCGTCGAAAGTGATTATGCTACTCAAGGAGCAAACACTGCAATACTTCGAGTAACTACAAACGGGGATGGAAATGCTGTTGGTGATGGTGTTACCCACTTAGGCTATAAAGATGCTAACGGAAGATATAACCATTATTTCCGTGGTACTGGTTCGACTTTCATCAACACCAAATTAGGTCTTAACGTAAACGTTCCATCATATTATAATGATGTTCCCCAGGGTACGCCAGCGCAATATTTCTTACCTACTGTTGGACAGGGCGGGAAAAACTATTTGCGTCAGTTCCGAGGCGGTAACGCGGACACAATCTGGCATGAAACTGTTCAGGGTGGTGTGTGGAGACTGGCAACAGGTTCTACCGACGCACAGGAAGAATTACAGGTTTCAACTGCTGGTTATTGTCGTACTCGTCAGGAATTCCAAAGCACAGCAGTAAAAGGCGACGGTGGACAATTCCGAGCGGTTGCTGGCAACTACGGTTTCATTATCCGTAATGATGGTGGTAACACTTATTTCCTTCTGACAGATTCAGGCGATCCATATGGTTCATGGGGTTCATTACGCCCTATTACAATCAGTAACGGTAATGGTGTTGTTTCTATCACAAACGGCGCGAACATTAACGGTTCTGTTAGCTTTGGCAACGCGGTAAACTTCAACGGCACGGTCGAAACAAACGGTATTGGTTGTGGTACTGCAAACGGATTGGGTACGACTGGTATTTCTCTGGGTGATAACGATACAGGTTTCAAACAGGAAGGTGACGGTATTCTCAACGCCTATGCAAATAGCCAGCGTATTATGCGATGGACTACTGGTGCAACTGCTAACTATAAACAGTTACAGGTTCAGGGTGTTAACGGGCCAGCATTACTTCTGAATAACACAGCGACTAACCAATCGTGTTATTTGCTGATTACTTTGGCAGGTAACAACGGCGCATACTTCGGGTTTGGTGGTGCTGATGATAACGTATCGGTACACAACTACCGACTGAATACAACCTTACAATTACGTACTTCGGATCTGTATATGAACCGTGGTCTGTATGTTGAAGGTAACGTAAACGCCAACGATGTGTATATTCGTTCCGATATTCGACTGAAATCTAATTTGGTTGAATTGAAAGATTCATTAAGCAAGATTGAACA
GCTTAAAGGTTATATCTACGATAAGCAATCAAAAGATGCTGATGATATCGTATACCATCGCGAATCCGGTCTGATCGCTCAGGATGTTGAGAAGGTATTGCCGGAAGCAGTGCGCGAAGATACTGATACTGGTATGTTAACCATTTCACCTTCTGGGATCAACGCGCTTCTGGTTAACGCAATCAACGAACTGCGTGAACGTCTGGAAGCAATCGAAAACAAATTAGGGGCTTAA
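
The CDS coordinates can be checked against the protein length with simple arithmetic. The sketch below assumes 1-based inclusive coordinates, as used in GenBank feature locations, and that the final codon is a stop codon (the CDS above ends in TAA). The function names are hypothetical.

```python
def cds_length(start: int, end: int) -> int:
    """Number of nucleotides in a 1-based inclusive range."""
    return end - start + 1


def encoded_residues(start: int, end: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded, excluding the stop codon if present."""
    length = cds_length(start, end)
    if length % 3 != 0:
        raise ValueError(f"CDS length {length} is not a multiple of 3")
    return length // 3 - (1 if has_stop else 0)


nt = cds_length(170598, 172832)
aa = encoded_residues(170598, 172832)
print(nt, aa)  # 2235 744
```

The 2235 nucleotides form 745 codons, of which 744 encode residues, in agreement with the protein length reported for this entry.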

Genome Context

Gene Ontology

GO term Description Category Evidence (source)
GO:0098024 virus tail, fiber Cellular Component IEA:UniProtKB-KW (UniProt)
GO:0003700 DNA-binding transcription factor activity Molecular Function IEA:TreeGrafter (UniProt)
GO:0043565 sequence-specific DNA binding Molecular Function IEA:TreeGrafter (UniProt)
GO:0045893 positive regulation of DNA-templated transcription Biological Process IEA:TreeGrafter (UniProt)
GO:0016540 protein autoprocessing Biological Process IEA:TreeGrafter (UniProt)
GO:0019062 virion attachment to host cell Biological Process IEA:UniProtKB-KW (UniProt)

Tertiary structure

PDB ID
9d26ccf13c8df1f6d52bed3c96699d533d8220a73ede9bf28c9a0d4a15bdb75f
Source: ESMFold
Method: ESMFold
Resolution: 0.6244
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
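
The confidence bands above map a per-residue pLDDT score to a label. A minimal sketch of that mapping follows. The legend does not say how scores of exactly 90, 70 or 50 are handled, so assigning them to the lower band is an assumption here, and `plddt_band` is a hypothetical helper name.

```python
def plddt_band(score: float) -> str:
    """Return the confidence label for a pLDDT score on the 0-100 scale."""
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT out of range: {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    # Assumption: boundary values (90, 70, 50) fall into the lower band.
    return "Very low"


for value in (95.0, 80.0, 60.0, 30.0):
    print(value, plddt_band(value))
```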