GenBank accession: NP_043502.1 [GenBank]
Protein name: tail fiber protein
RBP type Evidence Probability
TF GenBank 1.00
TF Phold 1.00
TSP RBPdetect 0.88
TF RBPdetect2 0.95
Protein sequence
MASLITPQFERYVAEQTIARGTVQFDEFIFANIPGLNENNLTQHLTIPTSAQIVHRQAVSQSGVINENAVVYSVTIGTEVGDFDFNFIGLINRSKNLLAVAVQTDTVKKIRNKNAVQGNSITRNMLLEFSGAKALTGINVNANTWQIDFTVRLHGLDEKIRLTNRDLYGRAVFFDDSFLVKRKTGNQFTIQPGNAYVEGVRMDLGTEHHLTANSLPCSIYADVVHHCTVTGEYQTEIKYLTQSKADYVDTANRQHYVQILADIDSQGNVTDRRLLSPFWGMNPLTLDDTTENTKDKLGHTHKLPIASIKKRGITKLSSATNSDSETQAATSKAVKTAYDKAVEVKTTAESKVGLRGNESIQGTKSFESKIIGFRGIGVADSQTYANANHLLNMGANDGDGWIEYKKSNRVIGTIRIRANGELSYNNQKIYHAGAKPQFNTDIEGKPNTLAGYGIGNFKVEQGQGDANGYKTDGNYYLASGQNLPENGEWHIEVVSGGATNAVRQIARKANDNKIKTRFFNGSNWSEWKDAGGDGVPIGSVVSFPRAVTNPVGFLKANGTTFNQQTFPDLYRTLGDSNQLPDLTRSDVGMTAYFAVDNIPSGWIAFDSIRSTVTQQNYPELYQYLVDKYSSISNVPLAEDRFIRNTGNGLNIGQTQSDEIKKHVHRVRTHWADSSDSSIFYDKTKTVIDSRLRTATTTDDNLSDNGFMHPLLDTPMATGGDETRPKSLILKLCIKAKNTFDDVQFWVKAFGVVENVGALDAGTLAQNMQALSESVGQKIKENKQSTLLEINNAKADINQKFLQVQESLSQIKTVWQGNVSSGRINISEKCFGKTLILYLQSSESHRLDDNNNIEPVSFEVGAEIEGKSGGGVYLSATHDVTPHYSSGGSRLYGVGVKKFAVYVGRDGTTIEIEDLSNYFVKRIDIR
Physico-chemical properties
protein length: 925 AA
molecular weight: 101999.68940 Da
isoelectric point: 6.72294
aromaticity: 0.08432
hydropathy: -0.44519
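
The aromaticity and hydropathy values can be recomputed from the sequence. The sketch below assumes the common definitions: aromaticity as the fraction of F, W and Y residues, and hydropathy as the mean Kyte-Doolittle score (GRAVY). The page does not say which definitions it uses, so treat both as assumptions. The function names are illustrative, and the demo uses only the first 20 residues of the sequence above.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = set("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y); assumed definition."""
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues; assumed definition."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 20 residues of NP_043502.1, used only as a short demo input.
fragment = "MASLITPQFERYVAEQTIAR"
print(len(fragment), aromaticity(fragment), round(gravy(fragment), 5))
```

Running the same two functions on the full 925-residue sequence gives values to compare with the figures listed above. Agreement would depend on the page using these definitions.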

Domains

Domains [InterPro]
InterPro ID Type Residues
IPR005068 STR 307–348
IPR011083 ATT 538–582
Architecture (NP_043502.1, residues 1-925)
ATT 3-156 | STR 284-365 | STR 462-533 | ATT 534-585 | STR 587-735 | RBD 736-925
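
The architecture string does not cover every residue. A minimal sketch for listing the stretches with no domain assignment follows, assuming 1-based inclusive coordinates and the `LABEL start-end | ...` layout shown above. The helper names `parse_architecture` and `unmapped_gaps` are illustrative, not part of any tool named on this page.

```python
def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Parse 'ATT 3-156 | STR 284-365' into (label, start, end) tuples."""
    segments = []
    for part in text.split("|"):
        label, span = part.split()
        start, end = (int(x) for x in span.split("-"))
        segments.append((label, start, end))
    return segments


def unmapped_gaps(segments, length: int) -> list[tuple[int, int]]:
    """Return inclusive residue ranges not covered by any segment."""
    gaps = []
    pos = 1  # next residue not yet covered
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start > pos:
            gaps.append((pos, start - 1))
        pos = max(pos, end + 1)
    if pos <= length:
        gaps.append((pos, length))
    return gaps


architecture = ("ATT 3-156 | STR 284-365 | STR 462-533 | "
                "ATT 534-585 | STR 587-735 | RBD 736-925")
segments = parse_architecture(architecture)
print(unmapped_gaps(segments, 925))
```

For this record the uncovered ranges are 1-2, 157-283, 366-461 and the single residue 586.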

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: an N-terminal domain, a central domain, and a C-terminal domain.

Domain Layout (NP_043502.1, residues 1-925)
Domain Start End Length (AA) Confidence
N-terminal 1 366 366 0.9755
Central domain 367 565 199 0.0770
C-terminal 566 925 360 0.6759
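
With 1-based inclusive coordinates, a domain's length is `end - start + 1`. The sketch below applies that rule to the start and end positions of the three domains and checks that they tile residues 1 to 925 without gaps or overlaps. The variable and function names are illustrative.

```python
# (name, start, end) taken from the segmentation coordinates; 1-based, inclusive.
domains = [
    ("N-terminal", 1, 366),
    ("Central domain", 367, 565),
    ("C-terminal", 566, 925),
]
PROTEIN_LENGTH = 925


def inclusive_length(start: int, end: int) -> int:
    """Number of residues in an inclusive [start, end] range."""
    return end - start + 1


lengths = {name: inclusive_length(s, e) for name, s, e in domains}

# The domains tile the protein if they start at 1, end at the last residue,
# and each domain begins right after the previous one ends.
tiles = (
    domains[0][1] == 1
    and domains[-1][2] == PROTEIN_LENGTH
    and all(a[2] + 1 == b[1] for a, b in zip(domains, domains[1:]))
)

print(lengths, sum(lengths.values()), tiles)
```

The lengths come out as 366, 199 and 360 residues, which sum to the full protein length of 925.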
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue, 1-366), Central (green, 367-565), C-terminal (pink, 566-925).

Taxonomy

  Name Taxonomy ID Lineage
Phage Haemophilus phage HP1 [NCBI] 2992577 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

GenBank protein accession: NP_043502.1 [NCBI]
GenBank nucleotide accession: NC_001697.1 [NCBI]
CDS location: range 25815 -> 28592, strand +
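
The CDS coordinates can be checked against the protein length. The sketch below assumes the range is 1-based and inclusive and that the final codon is a stop codon. The names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of an inclusive, 1-based genomic range."""
    return end - start + 1


nucleotides = cds_length(25815, 28592)
codons, remainder = divmod(nucleotides, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
residues = codons - 1

print(nucleotides, codons, remainder, residues)
```

The range spans 2778 nucleotides, or 926 codons with no remainder, which matches a 925-residue protein plus one stop codon.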
CDS
ATGGCTAGTTTAATTACGCCACAATTTGAACGCTACGTTGCAGAACAAACTATTGCACGTGGCACAGTACAGTTTGATGAATTTATTTTCGCCAATATTCCAGGTTTAAATGAGAACAATCTTACTCAACATCTTACGATCCCCACATCCGCACAAATTGTACATCGCCAAGCCGTATCGCAAAGTGGTGTGATTAATGAAAATGCCGTTGTGTATTCTGTGACGATTGGTACTGAAGTAGGCGATTTTGATTTCAATTTTATTGGTTTGATTAATCGTTCTAAAAATCTTTTAGCTGTTGCAGTGCAAACGGATACAGTGAAAAAAATCCGAAATAAAAATGCTGTGCAAGGCAACAGTATTACGCGCAATATGCTTTTAGAATTTAGTGGCGCAAAAGCTCTAACGGGCATTAATGTCAATGCGAACACTTGGCAAATTGATTTTACTGTGCGACTACATGGACTTGATGAAAAAATTCGTTTAACCAATCGTGATCTGTATGGCAGAGCAGTATTTTTCGATGATAGTTTTCTGGTTAAACGTAAAACAGGCAATCAATTTACGATTCAACCAGGCAATGCTTATGTTGAAGGCGTTCGTATGGATTTAGGCACAGAGCATCATCTTACTGCTAATAGCTTGCCTTGTTCTATTTATGCGGATGTGGTGCATCATTGCACCGTAACGGGCGAATATCAAACCGAAATTAAGTATCTCACGCAATCAAAAGCGGATTATGTAGATACTGCAAACCGCCAACATTATGTACAAATTCTGGCGGATATTGACAGCCAAGGCAATGTGACAGATCGTCGCTTGCTTTCGCCGTTTTGGGGTATGAATCCGCTCACATTAGATGACACAACCGAAAACACTAAAGATAAACTGGGTCATACGCACAAATTACCGATTGCCAGCATCAAGAAACGAGGCATCACGAAACTAAGCTCCGCCACTAACAGCGACAGCGAAACCCAAGCGGCAACCTCAAAAGCGGTTAAAACCGCCTATGACAAAGCAGTAGAAGTCAAAACTACGGCGGAGAGCAAAGTAGGATTAAGGGGCAATGAATCGATTCAAGGTACCAAAAGTTTTGAATCTAAAATCATTGGGTTTCGTGGCATTGGGGTGGCTGATTCGCAAACTTATGCAAATGCTAATCACCTCTTAAATATGGGGGCAAATGATGGCGACGGCTGGATAGAGTATAAAAAAAGTAACCGAGTTATCGGCACCATTCGTATTCGGGCAAATGGGGAATTGTCATATAACAATCAAAAAATCTATCACGCTGGGGCAAAACCGCAATTTAATACGGATATTGAAGGCAAGCCTAATACACTTGCTGGCTATGGTATTGGAAATTTCAAAGTAGAACAAGGACAAGGCGATGCCAATGGCTATAAAACCGATGGCAATTATTACTTAGCAAGCGGTCAAAATCTACCCGAAAATGGGGAATGGCATATTGAAGTAGTTAGCGGTGGAGCAACGAATGCGGTGCGTCAAATTGCACGTAAAGCAAATGACAACAAAATCAAAACACGCTTTTTTAATGGCTCAAATTGGTCAGAATGGAAAGATGCAGGCGGCGACGGCGTGCCTATTGGTTCCGTAGTGTCATTTCCCCGTGCGGTAACCAATCCCGTTGGTTTTTTAAAAGCCAATGGTACGACATTTAACCAACAAACCTTCCCTGATTTATACCGCACTTTGGGCGACAGCAACCAACTTCCTGATTTAACCCGTAGTGATGTGGGGATGACAGCTTATTTTGCCGTGGATAATATCCCTTCTGGGTGGATTGCCTTTGATAGCATTCGCTCAACAGTCACACAGCAAAATTACCCAGAGTTATATCAATATCTTGTTGATAAATATAGCTCTATTTCAAATGTACCACTTGCGGAAGACCGATTTATTAGAAATACAGGGAATGGGTTAAATATCGGTCAGACACAAAGTGACGAGATTAAAAAGCACGTTCACAGAGTGAG
AACACACTGGGCTGATTCATCTGATAGTAGTATTTTTTATGACAAAACGAAAACTGTTATAGATTCACGATTACGCACTGCAACTACAACCGATGATAATCTCAGTGATAATGGATTTATGCATCCGCTTTTAGATACCCCGATGGCAACAGGTGGAGATGAAACTCGCCCTAAATCGCTTATTCTCAAACTTTGTATAAAAGCAAAAAACACATTTGATGACGTGCAATTTTGGGTTAAAGCATTCGGTGTTGTTGAAAATGTTGGGGCTTTAGATGCGGGTACACTTGCACAAAATATGCAAGCGTTATCTGAGAGTGTTGGACAAAAAATAAAAGAAAATAAACAATCTACTTTACTAGAAATAAACAATGCAAAAGCTGATATAAATCAGAAATTTTTACAGGTGCAAGAGAGTTTATCTCAAATTAAAACAGTGTGGCAAGGTAATGTAAGTTCTGGGCGAATTAATATATCAGAGAAGTGCTTCGGTAAAACGTTAATTTTATATCTTCAATCATCAGAAAGTCATAGGCTTGATGATAATAACAATATTGAACCCGTCAGTTTTGAAGTAGGGGCAGAGATTGAAGGTAAAAGTGGCGGTGGAGTTTATTTGTCTGCTACTCATGACGTAACTCCACACTATTCTTCTGGTGGAAGTCGTTTATATGGTGTAGGGGTCAAGAAATTCGCTGTGTATGTTGGTAGAGACGGTACAACAATAGAGATTGAAGACCTTTCTAATTATTTTGTAAAACGTATCGATATCCGATAA

Tertiary structure

PDB ID: 475b3e8c5b3de1adb0aa57eca91c212f092505075749b7c9422c435b8cc1df82
Source: ESMFold
Method: ESMFold
Resolution: 0.7304
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
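
The confidence bands above can be written as a small lookup. This is a sketch only: the legend does not say which band the exact boundary values 90, 70 and 50 belong to, so the code assumes they fall into the lower band. The function name `plddt_band` is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Map a per-residue pLDDT score (0-100) to a confidence band.

    Assumption: exact boundary values (90, 70, 50) go to the lower band,
    since the legend uses strict inequalities and leaves them unassigned.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected on a 0-100 scale")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for value in (95.0, 80.0, 60.0, 30.0):
    print(value, plddt_band(value))
```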

Literature

Title Authors Date PMID Source
The complete nucleotide sequence of bacteriophage HP1 DNA Esposito,D., Fitzmaurice,W.P., Benjamin,R.C., Goodman,S.D., Waldman,A.S. and Scocca,J.J. 1996 8710508 GenBank
Identification of an HP1 phage protein required for site-specific excision Esposito,D. and Scocca,J.J. 1994 7997180 GenBank
Nucleotide sequence and expression of the gene for the site-specific integration protein from bacteriophage HP1 of Haemophilus influenzae Goodman,S.D. and Scocca,J.J. 1989 2546915 GenBank
Nucleotide sequences and properties of the sites involved in lysogenic insertion of the bacteriophage HP1c1 genome into the Haemophilus influenzae chromosome Waldman,A.S., Goodman,S.D. and Scocca,J.J. 1987 3491821 GenBank
Nucleotide sequence and properties of the cohesive DNA termini from bacteriophage HP1c1 of Haemophilus influenzae Rd Fitzmaurice,W.P., Waldman,A.S., Benjamin,R.C., Huang,P.C. and Scocca,J.J. 1984 6335448 GenBank
Nucleotide sequence of cloned DNA segments of the Haemophilus influenzae bacteriophage HP1c1 Benjamin,R.C., Fitzmaurice,W.P., Huang,P.C. and Scocca,J.J. 1984 6098523 GenBank