Protein

Genbank accession
URC25481.1 [GenBank]
Protein name
tail tip host specificity protein J
RBP type
TSP
Evidence RBPdetect
Probability 0,83
TF
Evidence RBPdetect2
Probability 0,96
TF
Evidence Phold
Probability 1,00
Protein sequence
MAKYMISGSKGGSKKPYVPKEMEDNLISINKIKVLLAVSDGECDPDFTLRDLYLDDVPVIASDGTVNYEGVTAEYRPGTQTQDYIQGFTDTSSEVTVARDITGDNPYVISVTNKNLSAVRIKILMPVGIKTEDNGDLVGVRVEYAVDMAVDGGSYSEVMRDVIDGKTRSGYDRSRRIDLPKFDERVLIRVKRLTPDSTSSKVTDKIKLQSYAEVVDAKFRYPLTGLVFVEFDSELFPTQIPNISIKKKWKIINVPSNYDPISREYHGSWDGTFKKAWSNNPAWVLYDLVTNQRYGLDQRELGIQIDKWSLYEAGVYCDQKVPDGKGGTEPRYLCDVVIQNQVEAYQLIRDICSIFRGMSFWNGESLSIVIDKPRDPSYVFTNENVINGDFQYTNASEKSMYTQCNVTFDDEQNMYQQDVEGVFDTEAALRFGYNPTSITAIGCTRRSEANRRGRWVLKTNLRSTTVNFATGLEGMIPSIGDVIAIADNFQSSNLTLNLSGRVMEVSGLQVFVPFKVDARPGDFIIINKPDGKPVKRTISKVSADGKTIELNIGFGFDVNPDTVFAIDRTDLALQQYVVTTISKGDDENEFTYSITAVEYDPNKYDEIDYGVNIDDRPTSIVQPDVMAAPENVKISSYSRVVQGVSVETMVVSWDKVPYASLYEMQWRKGDGNWLNTPQTANKEIEVEGIYSGNYQVRVRSVSASGNASPWSKIATATLTGKVGEPGAPINLTASDNEVFGIRVKWGMPEGSGDTAYIELHQSPDGTVENSSLLTLIPYPQYEYWHSTLPAGQVVWYRIRSVDRIGNVSSWTDFVRGMASDDVESVLGDILDKIFDTEAGQEIKENAIDSANKIKDQAQSIIQNALANDADVKWTRVQNGKRKAEYGHALELIANETEARVTQIEELRASIDGEITSSIKTVQEAIATESETRATQIQQLDSKFTKEIDGVRKDTSASISDVRQTITNESEARAQAVQQLDAKFTKEINDLDGVIKTEVEANISEVKQAIANETEARVQADQALTARFGDVESALVEKLDSWASVDSVGAKYAMKLGLTYKGQQYSAGMVMQLSQGSSGLISQILFDANRFAIMTSSTGGTFTLPFVVENNQVFINSLLVKNGSITNAMIGNVIQSNNFVQNQQGWRLDKNGIFENYGSTPGEGATKFTNEGLKVKDANGVLRVEVGRITGSW
Physico‐chemical
properties
protein length:1192 AA
molecular weight: 132089,32580 Da
isoelectric point:4,80371
aromaticity:0,08557
hydropathy:-0,37366

Domains

Domains [InterPro]
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage EC125
[NCBI]
2936944 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
URC25481.1 [NCBI]
Genbank nucleotide accession
ON185586 [NCBI]
CDS location
range 16414 -> 19992
strand +
CDS
ATGGCTAAATATATGATAAGCGGCAGTAAGGGCGGAAGCAAAAAGCCATACGTGCCAAAAGAGATGGAAGATAACCTGATCTCGATAAACAAGATTAAAGTTTTGCTGGCTGTATCTGATGGCGAGTGCGATCCAGATTTCACGTTGCGCGATCTTTATCTTGATGATGTTCCGGTTATTGCCAGCGATGGCACTGTTAACTACGAGGGTGTTACTGCTGAATATCGACCAGGTACGCAGACGCAAGATTACATCCAGGGGTTTACTGACACATCAAGCGAGGTGACAGTTGCAAGAGATATTACCGGAGACAATCCTTATGTTATTTCTGTTACAAATAAAAATCTATCTGCGGTAAGAATAAAGATCCTGATGCCAGTAGGCATTAAAACAGAGGATAACGGCGATCTTGTTGGCGTAAGGGTTGAGTATGCCGTAGATATGGCTGTTGATGGCGGTTCTTATAGCGAGGTTATGAGAGATGTAATTGACGGCAAGACAAGATCAGGATACGACCGCAGCAGAAGGATTGATCTTCCTAAGTTTGATGAGCGCGTTTTAATCCGAGTAAAGCGACTGACTCCAGACAGCACATCTTCAAAGGTGACTGATAAAATCAAGCTGCAAAGTTACGCTGAGGTTGTTGATGCAAAATTCCGCTATCCTCTGACTGGACTTGTATTCGTAGAATTTGACAGCGAATTGTTTCCTACGCAAATCCCTAACATTTCTATAAAAAAGAAATGGAAGATTATTAATGTGCCAAGCAACTATGATCCAATATCAAGAGAATATCACGGGTCATGGGATGGGACTTTTAAAAAAGCGTGGTCAAATAATCCTGCTTGGGTTCTTTATGATCTGGTGACAAATCAGCGTTATGGACTTGATCAGCGAGAGTTAGGAATACAGATCGACAAGTGGAGCTTATACGAGGCTGGCGTTTACTGCGATCAGAAAGTTCCAGACGGTAAAGGCGGCACTGAGCCTCGCTACCTATGCGATGTGGTGATTCAGAATCAAGTTGAGGCTTATCAGCTAATCCGTGACATTTGCTCAATCTTTCGCGGAATGAGTTTTTGGAATGGTGAGAGCTTATCAATCGTGATTGATAAGCCGCGCGATCCATCATACGTGTTTACTAATGAAAACGTCATCAACGGTGATTTTCAGTACACAAACGCAAGCGAAAAAAGCATGTACACGCAGTGTAACGTGACGTTTGACGACGAACAAAACATGTATCAGCAGGACGTAGAGGGGGTTTTTGATACTGAGGCGGCATTACGATTTGGATACAACCCAACAAGCATTACAGCGATCGGGTGTACGCGCAGGAGCGAAGCGAATCGTCGCGGTCGTTGGGTTTTGAAAACAAACCTTAGAAGCACTACTGTAAACTTTGCTACCGGACTAGAGGGGATGATTCCATCAATAGGTGATGTTATTGCTATCGCTGATAATTTTCAGAGTAGCAACCTAACGTTAAACCTATCGGGCCGAGTAATGGAAGTTTCAGGGCTGCAGGTTTTCGTTCCGTTTAAGGTTGATGCTCGCCCTGGTGATTTTATTATCATCAACAAGCCGGACGGCAAGCCAGTTAAGCGCACGATCTCAAAGGTTAGCGCAGACGGAAAAACCATTGAGTTAAATATTGGATTTGGTTTTGATGTTAATCCTGATACTGTTTTTGCGATTGACCGTACTGATCTTGCGTTGCAGCAATACGTTGTGACAACCATCAGCAAGGGTGATGACGAAAACGAGTTTACCTATTCAATCACGGCTGTAGAGTACGATCCGAACAAATACGACGAGATTGATTATGGAGTAAACATTGATGACAGACCGACTTCAATTGTTCAGCCTGACGTGATGGCAGCGCCTGAGAACGTTAAGATCTCATCTTATTCTCGCGTCGTGCAGGGTGTTAGCGTTGAGACTATGGTTGTTTCATGGGATAAGGTTCCTTACGCATCGCTTTATGAAATGCAGTGGCGAAAAGGTGATGGTAACTGGCTGAATACGCCGCAGACCGCTAACAAAGAGATAGAGGTAGAAGGGATTTACTCTGGCAACTACCAAGTAAGGGTGAGATCCGTTTCTGCAAGCGGTAACGCTTCCCCGTGGTCAAAGATTGCAACCGCCACTCTGACAGGTAAAGTTGGCGAGCCAGGAGCGCCGATTAATCTTACGGCTTCTGATAATGAAGTTTTTGGCATTCGTGTCAAATGGGGTATGCCGGAAGGATCAGGCGATACGGCTTACATTGAGCTTCACCAATCGCCAGACGGAACGGTTGAAAACTCAAGTCTGCTTACGCTGATTCCATATCCTCAATATGAGTATTGGCATAGCACGTTACCAGCGGGGCAAGTTGTATGGTATAGAATCCGCAGCGTTGACAGAATAGGCAACGTTTCAAGCTGGACTGACTTTGTTCGCGGCATGGCGTCAGATGACGTTGAATCTGTTTTGGGCGACATTCTGGACAAGATTTTTGATACAGAAGCTGGTCAAGAAATCAAAGAGAACGCCATAGACAGTGCCAATAAAATCAAAGACCAGGCGCAATCAATCATACAGAACGCGTTGGCAAATGATGCAGATGTGAAGTGGACGCGAGTGCAAAACGGAAAGCGCAAGGCTGAATATGGTCATGCTCTTGAGCTTATCGCCAATGAAACAGAAGCGCGCGTAACTCAAATCGAAGAGTTAAGGGCTTCAATTGATGGCGAGATAACATCAAGCATCAAGACAGTGCAGGAGGCAATTGCCACTGAATCAGAGACGCGAGCGACTCAAATTCAGCAGCTTGATTCTAAATTCACAAAAGAAATCGACGGCGTGCGCAAGGATACTTCTGCAAGCATTAGCGATGTAAGGCAGACAATCACTAACGAGTCAGAAGCGCGCGCTCAGGCCGTTCAGCAGCTTGACGCTAAGTTCACGAAAGAGATAAACGACCTTGACGGAGTTATCAAAACAGAAGTCGAGGCTAACATCTCAGAAGTGAAACAGGCGATCGCCAATGAGACAGAGGCAAGGGTTCAGGCTGACCAGGCATTAACAGCACGATTTGGCGACGTTGAATCTGCATTGGTTGAAAAGTTGGATTCTTGGGCGAGCGTTGATTCAGTTGGGGCTAAATACGCTATGAAACTTGGCCTTACTTACAAAGGCCAGCAATACAGCGCAGGAATGGTGATGCAGCTTTCGCAGGGTTCATCCGGCCTTATCTCGCAAATTTTGTTTGATGCTAACAGGTTCGCCATTATGACTAGCTCTACTGGAGGGACTTTTACTTTGCCTTTCGTGGTTGAGAATAATCAGGTTTTCATTAATAGTCTTTTGGTGAAGAACGGTTCAATCACTAATGCGATGATTGGTAATGTGATTCAGTCAAACAACTTTGTTCAAAACCAGCAAGGATGGAGGCTTGATAAAAACGGAATCTTTGAGAATTACGGATCAACGCCAGGAGAAGGAGCTACTAAATTCACCAATGAGGGATTGAAGGTAAAAGATGCAAACGGAGTATTGAGGGTTGAAGTCGGAAGGATTACCGGAAGCTGGTAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
17b594f92469d1ec574bbdda1132e767b21181b269fb03e5378354c1359b3c0c
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7726
Evidence 0,7726

Literature

Title Authors Date PMID Source
Complete genome sequences of 17 Escherichia coli bacteriophages isolated from wastewater, pond water, cow manure and bird feces Vitt,A.R., Ahern,S.J., Gambino,M., Holst Sorensen,M.C. and Brondsted,L. 2022-10-20 GenBank