Genbank accession
YP_009280109.1 [GenBank]
Protein name
tail fiber protein proximal subunit
RBP type
TF
Evidence UniProt/TrEMBL
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,79
TF
Evidence RBPdetect2
Probability 0,77
Protein sequence
MSLNKSFEATYGLNASGEKVINVAKADKTVLSDGVNVEFFIDQNTIQKYDPTRGYDKNFAIIYDNRIYVSNTIIDEPAGDFDKFKWDTLRVDPSYIQIHETVGNGYNLKSGEYIAAYTNINDLTFNLPKQPIEGDTIYIKEMSGNNGYKFLKVKKTTHNLSWNGALVDSHIITRPFAETLLIFSGSTWNLFQIEHEIVGRIVSTSLTKQKVSSGEKIYRRSSTGPINIELPKHAVHGDIIEFYDIDQMTAINHLFVFVNNQEHSIGNLGQKDFEARTSGTGRLVFDSSVNLWRIWDGDLRTRLKTITDDYDMLANDSVLVFGANNTEERTITINLPKDNAEGDTAEIVLTYMRKGQDVKIKCADNDIIFTEKKLLQFPKRSEYPPEVDWVEVTELVFNGTSDYVPYIKLAYSDKKDKGGWFVQAAIPTVERVDFKNPDRLGVIALATQEQANVDKDSNPEKEIAITPSTLANRTATEKREGIARISTTAEVNQISTATYLDTTIVTPKKLNERTATEDRRGLAELATQEETNKGLDDTTIVTPKKLNDRKASEELSGIAKIVASNGTPGTQRDFAGTGVYDFTNNTDIVTPGAVHELISTENAHGVVYLATETEVIDAPVMDPEFPVVVTPVQLHKKTATETRIGFGKIATQDEVNAGTDDFSYVTSKKLNDRKASETLTGLSRYATQDEFNAGDKELISEPAKIKTFLKDKRLEVNTDSGLTLTGNIWDKATINIKASTETQRGTTTIASQVQVDEGTDHTIIVTPKTLHNKKSTEDKEGIIQVADYDETIAGTVVNKAVSPKNFVNAIRTPKNGLEATTVNRGVVRLPADASVWEGTDKDGSTGTYEHEGFAVSPRELNKALSHYLPIEGRAVDSDKFNGLVEADFVRRNKDQTIEGKITFKETISLEKALTSTSDATFVNTNTDVINIGNGTKGTINFKSTTPWTIEAEDKLKINTVEIEQDGTINLKGINATGVVDALIFNVDGTKVISKDGLNTDIGIKEQQLRLFSKDPDATKINDSTIIVDTNMKDKGKGHFILRNGDTVEGDMTFIKPIRVQKEQMKASVKPVAGSFTSEIKDKAIYDTYPGIAVPVIDPETSLVTDYTYVKGPGLLTQIGDSADFVYQTWIPQALGAEANQFNRTVWTRVYNPVKKDWDEWGRVYNTNNPPTAKDIGAVSTVGSTFETLTINQWLQVGPVRLVPNPVTKTVDFVWVG
Physico‐chemical
properties
protein length:1216 AA
molecular weight: 134600,20180 Da
isoelectric point:5,14634
aromaticity:0,07812
hydropathy:-0,43824

Domains

Domains [InterPro]
DC_1946
ATT
1–209
IPR048391
ATT
1073–1164
YP_009280109.1
1 1216
Architecture
ATT
STR
ATT
STR
ATT 1-209 | STR 343-1072 | ATT 1073-1164 | STR 1165-1199 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_009280109.1
1 1216
Domain Start End Length (AA) Confidence
N-terminal 1 918 918 0,9313
Central domain 919 1117 200 0,2503
C-terminal 1118 1216 98 0,2581
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-918
Central
919-1117
C-terminal
1118-1216

Taxonomy

  Name Taxonomy ID Lineage
Phage Morganella phage vB_MmoM_MP1
[NCBI]
1852628 Uroviricota > Caudoviricetes > Pantevenvirales > Gualtarvirus > Gualtarvirus mp1
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_009280109.1 [NCBI]
Genbank nucleotide accession
NC_031020 [NCBI]
CDS location
range 148037 -> 151687
strand +
CDS
ATGTCACTTAATAAATCTTTCGAAGCTACGTATGGATTAAATGCTTCTGGTGAAAAAGTAATCAACGTAGCAAAAGCAGATAAAACAGTTTTATCTGACGGTGTTAACGTTGAATTTTTTATAGACCAGAACACTATTCAAAAATATGATCCTACACGTGGTTATGACAAAAATTTCGCTATTATATACGATAACAGAATCTATGTGTCAAATACTATCATAGATGAACCTGCAGGCGATTTTGATAAATTCAAATGGGACACCTTACGAGTTGACCCGAGTTATATTCAAATCCACGAAACTGTTGGTAATGGTTATAACCTTAAATCTGGTGAATACATTGCCGCATACACTAATATAAATGATTTAACCTTTAACCTGCCTAAACAGCCGATCGAAGGTGACACGATTTATATTAAAGAAATGTCGGGTAATAACGGTTATAAATTTCTTAAAGTTAAAAAGACCACCCATAATTTATCATGGAATGGTGCATTAGTTGATAGCCATATTATAACACGACCATTCGCCGAAACACTTTTGATTTTCTCAGGTAGTACCTGGAACTTATTCCAGATTGAACATGAAATTGTTGGACGTATTGTATCAACAAGTTTAACAAAGCAAAAAGTTTCATCAGGTGAAAAAATTTACCGAAGATCTTCAACAGGTCCTATCAATATTGAATTACCTAAACATGCAGTTCATGGTGATATTATTGAATTCTATGATATCGATCAAATGACTGCTATTAACCACTTATTTGTGTTTGTTAATAATCAGGAACATTCAATAGGTAATTTAGGACAAAAAGATTTTGAAGCCCGAACTTCTGGTACTGGACGTTTAGTATTTGACTCATCTGTCAATTTATGGCGAATTTGGGATGGTGATCTTCGTACTCGTCTGAAAACTATCACTGATGATTATGATATGCTTGCCAACGATTCAGTATTAGTTTTTGGTGCAAATAATACTGAAGAGCGTACAATTACAATCAATCTTCCAAAAGATAACGCAGAAGGTGATACGGCAGAAATTGTTCTTACTTATATGCGTAAGGGTCAAGATGTAAAAATCAAATGTGCAGACAACGATATTATTTTCACAGAGAAAAAATTACTTCAGTTCCCTAAACGCAGCGAATATCCACCTGAAGTTGATTGGGTGGAAGTCACTGAATTAGTGTTTAACGGAACTTCAGATTATGTACCATATATTAAATTAGCATATTCTGATAAAAAGGACAAAGGTGGTTGGTTTGTACAAGCAGCAATCCCAACAGTCGAACGAGTTGATTTTAAAAATCCAGATCGCTTGGGTGTTATTGCTCTTGCTACTCAGGAACAGGCTAACGTTGATAAAGATAGTAACCCAGAAAAAGAAATAGCGATTACTCCTTCTACTTTGGCTAATCGTACGGCCACTGAAAAGCGTGAAGGTATTGCCCGTATTTCTACTACTGCTGAAGTTAATCAGATTTCTACGGCGACTTATCTTGATACCACTATTGTTACTCCTAAAAAGTTAAATGAACGCACTGCAACAGAAGATCGTCGTGGCCTGGCTGAATTAGCTACCCAGGAAGAAACAAATAAAGGTTTAGACGATACTACTATCGTCACCCCTAAAAAGTTAAACGATCGTAAGGCTTCTGAAGAGCTTTCAGGTATTGCAAAAATCGTGGCCTCAAATGGGACCCCTGGGACTCAACGTGACTTCGCAGGCACAGGCGTATATGATTTCACTAATAACACTGATATCGTCACACCAGGCGCTGTACACGAGCTTATTTCAACTGAAAATGCTCATGGTGTAGTTTATCTGGCAACTGAGACTGAAGTTATTGATGCCCCGGTGATGGATCCTGAATTTCCTGTTGTTGTTACACCAGTTCAGTTACATAAAAAGACTGCTACCGAAACGCGTATTGGTTTTGGTAAAATTGCAACTCAGGATGAAGTTAATGCAGGTACAGATGATTTCTCTTATGTCACATCTAAGAAATTAAATGATCGCAAGGCGTCTGAAACATTAACTGGTTTATCGAGATATGCAACTCAGGATGAATTTAATGCCGGTGATAAAGAATTAATTTCTGAACCTGCAAAAATTAAAACATTCCTGAAAGATAAACGTCTTGAGGTTAATACTGATTCAGGTTTAACTTTGACAGGTAATATCTGGGACAAAGCAACAATTAATATCAAGGCTTCAACTGAAACTCAACGTGGTACGACAACTATCGCGTCACAGGTACAAGTTGATGAAGGAACTGATCATACTATTATTGTTACTCCTAAAACTCTACATAATAAAAAATCAACTGAAGATAAAGAAGGTATTATCCAGGTTGCTGATTATGACGAGACAATAGCCGGTACAGTAGTTAATAAGGCGGTTTCACCTAAGAATTTTGTTAATGCTATCCGTACACCAAAAAATGGTTTAGAAGCAACCACTGTCAATCGTGGTGTTGTTCGTTTACCTGCAGATGCTTCTGTCTGGGAAGGTACTGATAAAGATGGTTCAACTGGTACATACGAACACGAAGGCTTTGCTGTTTCACCTCGTGAATTAAATAAAGCACTCAGTCATTATCTCCCTATCGAAGGAAGAGCGGTTGACAGTGATAAATTTAATGGTCTGGTTGAAGCTGATTTTGTTCGCCGTAATAAAGATCAAACCATCGAAGGTAAAATCACGTTCAAAGAAACGATTTCACTTGAAAAAGCTTTAACAAGTACAAGTGATGCGACCTTTGTTAATACCAATACTGATGTTATTAATATCGGTAACGGAACCAAAGGTACAATTAATTTTAAATCAACTACTCCTTGGACCATCGAGGCAGAAGATAAACTTAAAATTAATACCGTTGAAATTGAGCAAGACGGTACGATTAACCTTAAAGGAATAAACGCGACTGGTGTTGTTGATGCATTGATTTTTAATGTTGACGGTACTAAAGTTATTTCTAAAGACGGTTTAAACACCGATATCGGTATCAAAGAACAACAGCTAAGACTTTTCAGTAAAGATCCGGATGCAACCAAAATAAATGATTCGACTATTATCGTTGATACAAACATGAAAGATAAAGGTAAAGGACACTTTATTTTACGTAACGGTGATACGGTCGAAGGTGATATGACTTTCATTAAGCCTATCAGAGTTCAAAAAGAACAAATGAAAGCTTCTGTTAAACCAGTTGCAGGTTCATTTACTTCTGAGATCAAAGATAAGGCGATATATGATACATATCCAGGAATCGCAGTTCCGGTTATTGATCCTGAAACTTCTTTAGTAACTGATTATACTTATGTCAAAGGTCCAGGTTTATTAACTCAGATCGGTGATTCAGCTGATTTTGTATATCAGACTTGGATTCCACAGGCATTAGGTGCAGAAGCTAACCAGTTCAACAGAACAGTATGGACTCGTGTTTATAATCCGGTTAAAAAAGATTGGGATGAATGGGGACGCGTCTATAATACTAATAATCCGCCAACAGCTAAAGATATTGGTGCGGTATCAACTGTAGGTTCAACTTTCGAAACATTAACCATTAACCAGTGGTTACAAGTTGGTCCTGTCAGATTGGTTCCAAATCCAGTCACTAAAACTGTTGATTTTGTTTGGGTAGGTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
033e09bfe40be02a797450cd89e5036c75b6a832a4a1d40cfbf68483a623ad67
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,5822
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Comparative genomics of Morganella phages MP1 and MP2 define new clades among the T4 and T7-like Viruses Pinto,G., Oliveira,A., Malgorzata,L., Kropinski,A. and Azeredo,J. 2017-04-07 GenBank