Genbank accession
QNN97587.1 [GenBank]
Protein name
hypothetical protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
Protein sequence
MPREDAAQFNGTLSTATYPNRYSTVIPVEAPFYRIGLTLTVTDKTTGTKRKLIEGLDYFLGHYFQELAEAENDAIYGSIMLLNATEVEYELLSVGRQYRIPASEIGKYLVKTDMKDPRNCDWSELMKYPPIISPIDPPKDLEEAILRDEIVKALEDIRLGILERAHELDEAFAEVTDLIYQNGKQVFDDEMYQHHLIKNAHGYTCGDIGALPVLGKAVDATKAFGRTLAELVTLMSTSGIQQKHVDMLLDNTLGDLLGRMRVLNNDAITFQTTSGHVISMKGEKFIITSTKPLVLKADQDNNEPGIATEFSAGLNTLWVHSGKAGDKWGAYDAGVLAPVYNSVYLVTPDMVKMYLTSVKILPANAYFKSSDTLKIYGSGKETNKVYMNAELPAATTTVQGLFAITNLSASAASSTAISQKAVTDLKNKLDGYVDDTYTVNGKGFVLDENGQMHLTLTASDFGIEKMNNTGPLEKPVTNALRTVLSGKALSTHTHTIADLDNVPYASTSVTGLAKLWDAIDTTTDKMVTARQGFLLEQKIGTLNDKIATLLPAWTVGGSSYGNNNFLPIPVQGNYEGYTKNQTWQKGLARYEAGKVYSLRNGSNGNPPGDWAIYYAYADVSPSNTLENMQQTSVRYHPAGMSAYPGVNLVAILITGTDAMICLGSDGAYYLVIFDGTIDHAKHHRVVKVYLGDYRNGNAAGVITTAPWVADPSTNDLIVCNDRVYLLRTVLTSGDYFVSMRSIALNELNLGGSNTFVADALTGGILTSGENVYLRQGQKRAPSDALPAPVTQIYADALAKWNNAINFVHGPECNHAIGVDGLKFRVGLTPTVWFASTTGQTINFKQWVSSFVVDCSSMTVTLEGADRFPIVADLTTVRYNGGAEIGIPSRKWGEGQANNRTFATSQDKYIVSQGHAGDRDLMPYVTITTLDDGVSWYEYLSCDYNFTARSFSAVMLNQGRGSIYELGMAYPLTLGANSRLIWLSQPRSRLAIEVEFDPNTTYSGKAGYGPTNNRRLVDSITYENLSQMCHIVTPASPGTELNGWYCTGPETFPYHNVSGTFTIHADRLTLSQAEWDKMKQMIIDGAPAGAANGYASEFILAANNAGKALFSVWFIGIGTATPLTLAQVACTRTVNNVNVLDIYYFTLKPTISNGQVTFPAAILTLFDFYNNLDYGLNYATLGEITGGKTRRPGQPTMVSVDGTQFGLYLPHTLTLRLIGDLGALNYAVGLTLSNGTWTRSSAAHGVGRMAPAHTPVSPAYLAHRDQVMLHNSSLDYVYCGGIATSSRDFVAAPNFSGPTESVVIAGVETAEGWLVYVTEEVNLRFGSSTYKLPTWYIDLSAAFPTNHQNRTFYLHAKVENGTPKYVMDVVQHPDTETELYIGYVKTGSSRVVETNVEHAKRLGAVKQLLEHAAVENKHDVEVGRDAATGRLAPLRKEAMGPLSVDATKGYLDSQLLYTVSDAQRRGKAMVSKRFEGLKTLPAFTQANKSGAFWKHPTTIATVTADENSEATSLGMEWVGIPTPAFAPTSTNLMVVVQSKFVVPGTPGVETTFNVHVAPWEAVDGFFYNVEAEGTDVNIDENPEVLLPSPHTVHHKQHSLPAGVPCTFTFVAAVSSSALETANQHICSWLFTDAGGLPITTDLSDSKIQILPHPGETHGTPVVLNSRSYTFTNVAGAIPVVTTGAQGLFPPPIMSWDGDNLTLTIAHDWVNERGRQQPDIIQVNLMFRT
Physico‐chemical
properties
protein length:1727 AA
molecular weight: 188104,17830 Da
isoelectric point:5,55245
aromaticity:0,09265
hydropathy:-0,14551

Domains

Domains [InterPro]

No domain annotations available.

Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
QNN97587.1
1 1727
Domain Start End Length (AA) Confidence
N-terminal 1 535 535 0,8381
Central domain 536 1294 760 0,8006
C-terminal 1295 1727 432 0,4543
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-535
Central
536-1294
C-terminal
1295-1727

Taxonomy

  Name Taxonomy ID Lineage
Phage Proteus phage 7
[NCBI]
2767546 Uroviricota > Caudoviricetes > Chimalliviridae > Seoulvirus SPN3US >
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QNN97587.1 [NCBI]
Genbank nucleotide accession
MT679221 [NCBI]
CDS location
range 146402 -> 151585
strand -
CDS
ATGCCGAGAGAAGATGCGGCGCAGTTTAACGGTACCCTCAGTACCGCTACTTACCCGAACCGTTATTCGACGGTCATTCCCGTCGAAGCACCGTTCTATCGCATCGGGTTAACGTTAACCGTTACCGATAAAACAACCGGAACCAAACGCAAGTTAATAGAAGGTCTTGATTATTTCTTAGGACATTACTTCCAGGAATTAGCCGAGGCTGAGAACGATGCAATATACGGCAGCATAATGCTGCTGAACGCCACAGAGGTAGAATACGAGTTGTTGTCGGTTGGTCGACAGTATCGCATCCCTGCTTCTGAAATCGGCAAGTATCTGGTTAAGACGGATATGAAAGACCCGCGCAACTGCGATTGGTCTGAGCTGATGAAATATCCGCCGATTATCTCGCCAATCGACCCGCCAAAAGATCTCGAAGAAGCGATTCTTCGCGATGAAATCGTAAAGGCGTTGGAAGATATCCGTCTGGGTATTTTAGAGCGCGCCCACGAACTGGATGAGGCATTTGCCGAAGTTACCGATTTGATTTATCAGAACGGTAAGCAGGTGTTTGATGACGAAATGTATCAACACCACCTGATTAAAAATGCACATGGTTACACCTGCGGTGACATTGGCGCTCTCCCGGTACTCGGCAAAGCCGTAGATGCGACTAAGGCATTCGGACGTACCCTGGCAGAGCTAGTGACGTTAATGTCTACCAGCGGCATCCAACAGAAACACGTTGACATGCTTCTGGATAATACCCTGGGTGATTTACTGGGCCGTATGCGTGTTCTGAATAACGACGCGATTACGTTCCAAACAACGTCTGGTCACGTTATCAGCATGAAGGGTGAGAAGTTCATCATCACCTCGACAAAACCGCTGGTATTAAAAGCTGACCAGGATAACAACGAACCGGGTATCGCTACGGAGTTCAGTGCTGGTCTGAACACGCTTTGGGTTCACTCGGGTAAAGCAGGTGACAAGTGGGGCGCTTATGATGCAGGTGTGCTTGCGCCTGTGTATAACAGCGTGTATCTCGTTACACCGGATATGGTTAAGATGTACTTGACCTCGGTGAAGATACTTCCAGCAAACGCCTACTTCAAGTCCAGCGACACGTTGAAGATTTATGGCTCTGGTAAAGAAACAAACAAAGTTTACATGAATGCGGAACTGCCTGCAGCGACCACAACTGTGCAGGGCTTGTTTGCGATTACTAACCTTTCGGCGTCCGCTGCATCGTCCACAGCGATTTCCCAAAAAGCTGTTACTGACTTGAAGAACAAACTCGACGGATATGTGGACGACACGTATACGGTCAATGGTAAGGGGTTTGTGTTAGACGAAAACGGTCAAATGCATTTAACCTTAACCGCGTCGGATTTCGGTATTGAGAAGATGAACAATACCGGACCGCTGGAGAAACCGGTTACTAATGCGTTGCGCACCGTGCTATCAGGCAAGGCACTCTCAACACACACCCACACCATTGCTGATCTCGATAACGTGCCGTACGCATCCACGTCTGTTACAGGTTTGGCTAAACTGTGGGATGCTATTGACACCACAACAGATAAGATGGTTACTGCCCGCCAGGGTTTCTTGCTCGAACAGAAAATCGGTACACTGAATGACAAGATTGCTACGTTGTTGCCGGCATGGACCGTTGGCGGATCATCGTACGGTAACAATAACTTCTTGCCTATACCTGTCCAGGGGAACTACGAAGGGTACACCAAAAACCAAACCTGGCAAAAAGGACTGGCGCGTTACGAGGCTGGTAAGGTTTACTCGTTGCGCAATGGTTCCAACGGGAACCCGCCAGGTGACTGGGCTATCTATTACGCCTATGCTGATGTGTCGCCAAGCAATACGCTGGAAAATATGCAGCAGACGTCCGTACGTTATCACCCCGCTGGAATGTCGGCGTATCCTGGGGTGAATCTGGTTGCTATTCTGATTACCGGTACAGATGCAATGATCTGCTTAGGCAGTGACGGTGCGTATTACCTGGTCATCTTTGACGGCACCATTGACCATGCAAAACATCATCGTGTTGTTAAAGTTTACTTGGGCGATTACCGAAATGGTAACGCGGCCGGTGTGATTACAACCGCACCATGGGTTGCCGATCCATCAACAAACGATTTGATCGTTTGCAACGATCGTGTTTATCTGCTCCGTACTGTCCTTACCTCCGGTGACTATTTCGTGAGTATGCGTAGCATCGCACTCAACGAACTTAATCTGGGTGGTAGCAACACGTTTGTGGCGGATGCGTTAACAGGCGGGATATTAACGTCGGGTGAAAACGTTTATCTGCGACAGGGACAGAAACGTGCACCTTCCGATGCGCTCCCAGCACCAGTTACACAGATCTACGCAGACGCGTTAGCGAAGTGGAACAATGCAATCAACTTTGTACATGGTCCTGAGTGTAATCACGCGATTGGTGTTGATGGGCTGAAATTTCGTGTTGGTTTAACGCCAACAGTCTGGTTTGCTTCTACTACAGGGCAAACTATTAACTTTAAACAATGGGTGTCAAGCTTTGTTGTTGATTGCAGTTCAATGACGGTAACGCTGGAAGGTGCCGATCGATTCCCTATCGTGGCCGATCTTACCACGGTTCGTTATAACGGCGGCGCTGAGATTGGGATTCCGAGCCGGAAGTGGGGCGAAGGTCAAGCTAATAACCGAACCTTTGCAACATCGCAGGACAAGTATATCGTTTCGCAAGGACATGCAGGCGATCGCGACCTAATGCCTTATGTCACGATTACAACGTTAGATGACGGTGTGAGTTGGTACGAGTACCTGTCGTGCGACTACAACTTCACTGCCAGAAGTTTTAGTGCGGTAATGTTAAACCAAGGACGTGGTTCTATCTACGAGCTGGGTATGGCGTATCCGCTGACTCTGGGAGCGAATTCCAGACTTATCTGGTTGTCACAGCCGCGTTCCCGCTTGGCAATTGAAGTTGAGTTTGATCCGAACACCACTTACTCAGGTAAGGCGGGGTATGGCCCAACTAACAATCGTCGTCTAGTAGATTCTATCACGTATGAGAATCTGTCGCAGATGTGCCACATTGTTACACCGGCGTCGCCTGGTACAGAGTTAAACGGCTGGTATTGCACAGGACCAGAAACATTCCCTTACCACAACGTTTCTGGCACTTTCACGATTCATGCCGATCGGTTGACGCTGTCACAAGCTGAATGGGATAAGATGAAGCAGATGATCATCGACGGAGCACCGGCAGGCGCTGCGAATGGTTATGCTTCGGAATTTATCCTTGCCGCAAATAACGCAGGAAAGGCATTGTTCTCCGTTTGGTTCATCGGCATTGGTACGGCCACACCGCTCACGTTAGCACAAGTAGCATGTACGCGTACTGTGAACAACGTTAACGTGTTGGATATCTATTACTTTACACTGAAACCAACGATCAGCAACGGGCAGGTAACATTCCCCGCAGCGATATTGACATTGTTTGACTTCTATAACAACCTCGACTATGGCTTGAACTATGCTACGTTGGGAGAGATAACAGGTGGAAAAACACGCCGTCCGGGACAACCTACCATGGTGTCGGTAGATGGTACGCAGTTCGGGTTGTATCTTCCACACACGTTAACCTTGCGCCTGATTGGCGACTTGGGTGCGCTTAACTACGCTGTTGGTCTTACGCTGTCTAATGGTACTTGGACGCGTTCTTCTGCCGCACACGGCGTTGGACGTATGGCACCGGCACACACTCCAGTCAGCCCAGCTTATCTGGCTCATCGTGATCAGGTGATGCTACACAACTCATCTCTCGACTACGTCTATTGTGGTGGTATCGCAACGTCGTCGAGAGATTTTGTCGCGGCTCCTAATTTTAGTGGCCCGACTGAATCGGTCGTCATTGCTGGCGTGGAAACAGCCGAAGGTTGGCTGGTATATGTGACTGAGGAAGTTAACCTGCGTTTCGGCTCGAGCACTTATAAGCTGCCTACGTGGTATATTGACCTCAGCGCAGCATTTCCAACAAACCATCAGAACCGGACATTCTACCTGCACGCCAAGGTAGAAAATGGTACACCAAAGTACGTAATGGACGTGGTTCAACATCCAGACACTGAAACGGAACTCTATATCGGGTATGTTAAAACGGGCTCGTCGCGAGTCGTCGAGACTAACGTCGAACATGCTAAGCGATTGGGGGCGGTGAAACAACTGTTGGAACATGCGGCGGTTGAGAATAAGCATGATGTGGAAGTGGGAAGGGATGCAGCAACGGGTAGACTGGCACCATTGCGTAAAGAAGCGATGGGTCCTTTGTCAGTCGATGCTACGAAAGGCTATCTGGATAGTCAGTTGCTTTATACCGTATCTGACGCACAGCGTCGTGGTAAAGCGATGGTGTCCAAACGTTTCGAAGGCTTGAAAACTTTGCCGGCGTTTACTCAGGCAAACAAGTCGGGGGCGTTTTGGAAACATCCTACAACAATTGCGACGGTAACGGCCGATGAGAACAGTGAAGCGACATCGTTGGGTATGGAGTGGGTAGGTATTCCAACGCCCGCTTTCGCTCCGACCTCAACAAACCTGATGGTGGTGGTGCAAAGTAAATTCGTCGTCCCAGGTACACCGGGAGTAGAAACGACCTTTAATGTCCACGTTGCCCCATGGGAGGCGGTGGATGGCTTTTTCTACAACGTCGAAGCGGAAGGTACGGATGTTAACATTGATGAAAATCCAGAAGTGTTATTGCCGTCACCACATACCGTGCACCACAAACAGCACTCTCTGCCAGCAGGCGTGCCCTGCACCTTTACATTTGTCGCGGCGGTAAGTTCTTCAGCGCTCGAAACAGCTAACCAGCATATTTGCTCTTGGCTTTTCACGGACGCTGGGGGACTACCAATTACCACCGATCTTTCGGATAGTAAGATTCAAATCTTACCACATCCGGGAGAGACCCACGGCACACCGGTTGTTCTGAACAGTCGTAGCTATACGTTTACGAATGTGGCAGGTGCAATCCCGGTTGTTACCACAGGTGCTCAGGGTTTATTCCCACCGCCGATCATGTCGTGGGATGGCGATAATCTTACGTTAACGATTGCGCATGATTGGGTGAACGAGAGAGGACGTCAACAGCCTGATATTATCCAGGTCAATCTTATGTTCCGTACCTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
94958a01d3f672990d6fc22ce2b0a8dd2b05cd7594d56ea18faab9f770f2d2fe
ColabFold
Source ColabFold
Method ColabFold
Resolution 0,7177
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Bacteriophages application to control Proteus mirabilis infection Connerton,I.F. 2011-04-20 GenBank