UniProt accession
A0A0A0Q3M7 [UniProt]
Protein name
Long tail fiber proximal subunit
RBP type
TF
Evidence UniProt/TrEMBL
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,60
TF
Evidence RBPdetect2
Probability 0,79
Protein sequence
MADLLKPAFRATSGLDAAGEKVINVAKADFDVLSDGVNVDFFIEENTIQQYDPTRGYKEDFAVIYDNRMWISNSEIIKPSGPFASILWRAVRTDPKWIEVTQPVYSLKSGDYVTINSNQRSSDLSLPSDPQDGDYIVVKDIGNNAGYNRQRIIATAQSIIRWGSARSEVLLSKPLSYNIMVFSNRQWQFYETAQEDRGTVITSSSGVFRAQAGDNILRRYTNAEPVRLTLPKYANQGDIIRSVDIDGLGPTYHLIISTFDTTSSIGTVGTHEIEFRTSSDGFLVYDELNKLWVVWDADIKTRLRIIKDDVVLRPNESIMVFGENNAISQTINITLPTSVAIGDTVKIALNYIRKMQTVIIKASPGDEIATDINLLQFPKRSEYPPDAEWVYVTELSFNGDISYTPVVEFSYIENDGKSCWVVAQNVPTIEQVDPKDNNTRKRLGVISLASQAEANVDFENSPLRQQAITPETLANRTATETRRGIARIANTGQVNQDTTFNFQDDIIITPKKLNERTATETRRGVAEISTQAETNAGIDDTTIITPKKLEARRATENMAGIAPLVSTVNTTMAPSRGNPGTNSYDYNEATKIVTPKAMFQAKATNTSQGGVYLALQSEVIAGVTQSGFPNAVVTPETLHAKTSTDSRIGLIEIATQAETDAGTDYTRAVTPKTLNDRKSSEVLTGIARIATQVEFDAGSLDTVISTPLKVKTHFNNSSRTSVVSDSGLVETGTLWDHYTLNIQEASITQRGTLKLSTQAQVDSGTDDTTAITPLKLQRKKSTESTEGIIQLSTQAEVIAGTVSNKAFSPLHYKYIVQQEKSWEATPSRRGYVKLTENALTWAGDNVNGSVANQETFEKTGYAVSPYEMNKALSHYLPIGAKAVDSDKLDGLDSLQFIRRDINQTVDGSLTLTKSTQFQAAITSTSTAVFSGSVTGAGLVSTNGSLSINNGTNTWGITAANNGTTLVLGNSLTLNSNGNASVTGNISSNASIEAKNSYILNGKTIASTITRTPNTLILGDNTQNTVIKTLDASNLVVSDVADYKVLTEKNAKDIVGTNFVKKEGDTMGGKLTVNAPVLPKISETLAMAPLTSANIGFWGAEIVSASIYNTLPGYAVPVMEMSGGQQTGFVDHYEYVKAPGLMTCSGTSIDYIYRTWSPRPQIAQDNHNANTQYISIWDAARGVWGSWGRVYTNSAPPTASEIGAVTNSGSAFDNLTIRDWLQIGNVRIEPDESTQTVKFTWVE
Physico‐chemical
properties
protein length:1242 AA
molecular weight: 135249,41160 Da
isoelectric point:5,14128
aromaticity:0,07407
hydropathy:-0,30161

Domains

Domains [InterPro]
DC_1986
ATT
14–125
IPR048391
ATT
1090–1190
A0A0A0Q3M7
1 1242
Architecture
ATT
STR
ATT
STR
ATT 14-125 | STR 343-1089 | ATT 1090-1190 | STR 1191-1235 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
A0A0A0Q3M7
1 1242
Domain Start End Length (AA) Confidence
N-terminal 1 973 973 0,8867
Central domain 974 1172 200 0,2212
C-terminal 1173 1242 69 0,6757
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-973
Central
974-1172
C-terminal
1173-1242

Taxonomy

  Name Taxonomy ID Lineage
Phage Pectobacterium bacteriophage PM2
[NCBI]
1429794 Uroviricota > Caudoviricetes > Pantevenvirales > Tevenvirinae > Mosugukvirus
Host Pectobacterium
[NCBI]
122277 Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae >

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AHY25232.1 [NCBI]
Genbank nucleotide accession
KF835987 [NCBI]
CDS location
range 153670 -> 157398
strand +
CDS
ATGGCCGACTTATTGAAACCTGCATTTCGTGCAACTTCTGGCCTGGATGCAGCAGGTGAAAAAGTTATCAATGTCGCTAAGGCAGATTTTGATGTATTAAGTGATGGTGTTAACGTTGATTTCTTCATTGAAGAAAATACCATTCAGCAATATGACCCTACAAGAGGGTATAAAGAAGATTTTGCTGTAATTTATGATAACAGAATGTGGATTTCTAACTCAGAAATTATTAAACCTTCAGGACCGTTTGCAAGTATATTATGGCGTGCTGTTAGAACAGACCCAAAATGGATCGAAGTTACCCAACCTGTTTATTCACTTAAATCTGGCGATTATGTTACTATTAATAGTAACCAACGTTCTAGCGATTTATCTTTACCTAGCGATCCACAAGATGGCGATTATATTGTTGTTAAAGATATCGGAAACAACGCTGGATATAATCGTCAAAGAATAATTGCAACAGCCCAAAGTATCATTCGTTGGGGCTCAGCTCGTTCTGAAGTTTTATTATCTAAGCCATTAAGCTATAATATTATGGTTTTCTCTAATCGTCAATGGCAATTCTATGAAACTGCTCAAGAAGATAGAGGAACAGTTATAACTTCAAGTAGTGGTGTATTCAGAGCTCAAGCAGGCGATAATATCTTAAGACGTTATACTAATGCAGAACCAGTAAGATTAACTTTGCCAAAATATGCTAATCAAGGTGATATCATTAGATCTGTTGATATTGATGGGCTTGGTCCAACATATCATTTAATTATTTCTACCTTTGATACTACTTCATCTATTGGAACTGTAGGAACTCACGAAATTGAATTTCGTACTTCTTCTGATGGATTTTTAGTTTACGATGAATTAAATAAGCTCTGGGTTGTCTGGGACGCTGATATTAAAACTCGTCTAAGAATAATTAAAGACGACGTTGTTTTGAGACCTAATGAAAGTATTATGGTCTTCGGTGAAAACAATGCTATTTCGCAAACAATTAATATCACTTTACCTACTAGCGTTGCTATCGGTGATACTGTTAAAATTGCACTGAACTACATCAGAAAAATGCAGACAGTTATTATTAAAGCATCTCCTGGCGATGAAATTGCTACAGACATTAATTTACTTCAGTTCCCTAAAAGGTCTGAGTATCCGCCGGATGCAGAATGGGTATACGTAACTGAATTATCTTTCAATGGTGACATTAGTTATACTCCTGTTGTGGAATTTAGTTATATAGAAAACGATGGTAAAAGTTGTTGGGTTGTGGCTCAAAACGTTCCTACAATAGAACAAGTAGACCCTAAAGATAATAATACTCGTAAGCGTTTAGGTGTTATTTCTTTGGCAAGCCAGGCTGAAGCCAATGTTGATTTTGAAAATTCTCCTTTGAGACAACAAGCTATTACGCCAGAAACACTTGCAAATAGAACTGCAACAGAAACAAGACGTGGTATCGCTCGTATTGCAAATACAGGACAAGTTAATCAAGATACGACATTTAATTTCCAAGATGATATCATTATTACTCCTAAAAAATTAAATGAAAGAACTGCAACAGAAACAAGACGTGGTGTAGCTGAAATCTCTACTCAGGCTGAAACTAATGCTGGCATAGATGATACAACTATTATTACTCCTAAAAAATTAGAAGCGAGAAGAGCTACCGAAAATATGGCAGGTATTGCGCCATTAGTTTCTACTGTTAATACGACCATGGCTCCATCCCGTGGCAATCCAGGAACTAATAGTTATGATTACAATGAAGCTACAAAAATTGTAACTCCGAAAGCTATGTTCCAAGCTAAAGCTACAAATACTTCTCAAGGTGGCGTTTATTTAGCATTGCAGTCTGAAGTTATTGCGGGTGTAACCCAATCAGGTTTTCCTAATGCGGTTGTTACGCCTGAAACATTACATGCAAAAACTTCCACTGATTCTAGAATTGGATTGATTGAAATTGCCACACAGGCAGAAACTGATGCTGGAACGGATTATACAAGAGCGGTTACACCTAAAACTCTTAATGACCGTAAATCTAGTGAAGTGCTAACAGGTATTGCTCGTATTGCAACACAAGTAGAATTTGATGCTGGCTCATTAGATACTGTTATTTCAACTCCGTTGAAAGTTAAAACACATTTTAATAATTCATCTAGAACTTCTGTTGTCAGCGATAGTGGTTTAGTAGAAACAGGGACCTTATGGGACCATTATACACTTAATATCCAGGAAGCAAGTATTACCCAACGCGGGACACTTAAGTTGAGTACCCAGGCCCAGGTTGATTCAGGCACCGATGACACGACTGCAATTACTCCATTAAAATTGCAGAGAAAGAAATCCACTGAAAGCACTGAAGGTATTATTCAGTTATCTACTCAGGCCGAAGTTATTGCTGGAACTGTTTCTAATAAAGCTTTTAGTCCGCTTCATTACAAATATATAGTCCAACAAGAAAAATCTTGGGAAGCGACTCCTTCTAGAAGAGGATATGTAAAATTAACTGAAAACGCTTTAACATGGGCTGGTGATAACGTTAATGGCTCTGTTGCTAATCAAGAAACATTTGAGAAGACAGGCTATGCGGTTTCTCCTTATGAAATGAATAAAGCATTAAGTCATTATTTACCTATTGGTGCAAAAGCAGTTGATTCTGATAAATTAGACGGACTAGATTCGCTTCAGTTTATTAGACGTGATATTAATCAGACTGTTGATGGCTCATTAACCTTGACCAAATCTACACAATTCCAGGCAGCAATAACGTCTACATCTACTGCAGTATTTTCTGGTAGTGTGACTGGTGCTGGTTTAGTGTCAACTAATGGCTCTTTAAGCATTAACAATGGAACTAATACTTGGGGTATTACAGCAGCCAATAATGGAACGACTTTAGTTTTAGGAAATTCACTTACATTAAATTCAAATGGAAATGCTTCTGTAACAGGAAATATTTCTTCTAATGCAAGTATCGAAGCTAAAAATAGTTATATCTTAAACGGTAAAACTATAGCATCAACTATTACCAGAACTCCTAATACTTTAATCTTAGGTGATAATACACAAAATACCGTTATTAAAACTCTTGATGCAAGTAATTTAGTCGTTAGCGATGTGGCAGATTATAAAGTATTGACTGAAAAGAATGCCAAAGATATTGTTGGTACTAATTTTGTCAAAAAAGAAGGCGACACAATGGGTGGTAAACTCACTGTTAACGCGCCGGTATTACCTAAAATATCTGAAACATTGGCAATGGCACCATTAACTTCTGCTAATATAGGATTCTGGGGTGCTGAAATAGTTTCTGCGTCTATTTACAATACATTACCTGGTTATGCTGTTCCTGTTATGGAAATGTCTGGTGGTCAACAGACTGGTTTTGTGGACCATTATGAATATGTTAAAGCTCCAGGATTAATGACTTGTTCAGGAACTTCTATTGATTATATTTACCGCACTTGGTCCCCAAGACCACAAATTGCACAAGATAATCATAATGCTAACACCCAATATATTTCTATTTGGGATGCTGCTAGAGGTGTTTGGGGTTCATGGGGAAGAGTTTATACCAATAGTGCTCCACCTACAGCAAGTGAAATTGGCGCTGTAACTAATTCTGGTTCTGCTTTTGATAACCTTACTATTAGAGATTGGTTGCAAATAGGTAATGTTAGAATTGAACCAGACGAATCCACTCAAACAGTTAAATTTACTTGGGTAGAATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
c7ec4bcff1a524c233dc09a5dd8ff51fcb3ff1bead2d7aa2e4ef23435ac6528b
ColabFold
Source ColabFold
Method ColabFold
Resolution 0,2729
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50