Protein

Genbank accession
WWD12988.1 [GenBank]
Protein name
tail protein
RBP type
TSP
Evidence RBPdetect
Probability 0,86
TF
Evidence RBPdetect2
Probability 0,96
TF
Evidence Phold
Probability 1,00
Protein sequence
MAKNIITGSKGGSKKPYVPKELEDNLISINKIKILLAVSDGECDADFSLRDLYLDDVPVIADDGTVNYPGVKAEFRPGTQDQEYIQGFTDTSNEVIINRDLTTNTPYVLSVTNKTLSAIRIKMLMPVGIKQEDNGDLVGVTVTYAVDMAVDGDSYKEVMIDTINGKTRTGYDRSRRIDLPKFNERVLLRVRRVTPDSTTSKVTDLIRIQSYAEVIDAKFRYPLTGLVYVEFDSELFPNQIPNISIKKKWKLIQVPSNYDPVSRRYSGTWDGTFKKAWSNNPAWVLYDLVTNQRYGLDQRELGIQIDKWSLYEAGVYCDQMVPDGKGGTEPRYLCDMVIQSQIEAYQLIRDICSIFRGMTFWNGESLSVVVDKPREPSYIFTNDNVINGDFQYTSASEKSMYTQCNVTFDDEQNMYQQDVEGVFDTEAALRFGYNPTSITAIGCTRRSEANRRGRWILKTNLRSQTVNFATGLEGMIPSIGDVIAISDNFLSSNLTLNLSGRVMEVSGLQVFVPFKIDARPGDFIIINRPDGKPVKRTISKVSADGKTIELNVGFGFDVKPDTVFAIDRTDLALQQYVVTGISKGDTEEEFTYSITAVEYDPNKYDEIDYGVNIDDRPISIVQPDIMQAPENVQVSSYSRVVQGVSVETMVVSWDKVPYASLYEMQWRKGDGNWLNTPRTANKETEVEGIYSGNYQVRVRSVSASGSTSAWSKIATASLTGKVGEPGAPVNLTASDNEVFGIRIKWGMPEGSGDTAYIELHQSPDGTVENSSLLTLIPYPQYEYWHSTLPAGQVVWYRIRSVDRIGNVSSWTDFVRGMASDDVESVLGDILDKIFDSEAGQEIKENAIESANKIKDQAQAIIQNALAIDANLKWTRVQNGKRKAEYGHAVQLIANETEARVTQIEELRASIDGEITSSIKTVQEAIATESETRATQIQQLDSKFTKEIDGVRKDTSASISDVRQTITNESEARAQAVQQLDAKFTKEINDLNDAIKVDVEASISEVRQAIANEEEARVQADQALTARFGDVESALVEKLDSWASVDSVGAKYAMKLGLTYKGQQYSAGMVMQLSQGSSGLISQILFDANRFAIMTSSTGGTFTLPFVVENNQVFINSLLVKNGSITNAMIGNVIQSNNFVQNQQGWRLDKNGIFENYGSTTGEGATKFTNEGLKVKDANGVLRVEVGRITGSW
Physico‐chemical
properties
protein length:1192 AA
molecular weight: 132372,93910 Da
isoelectric point:4,84185
aromaticity:0,08473
hydropathy:-0,35176

Domains

Domains [InterPro]
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage WaterSpirit
[NCBI]
3098282 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WWD12988.1 [NCBI]
Genbank nucleotide accession
OR896833 [NCBI]
CDS location
range 16136 -> 19714
strand +
CDS
ATGGCTAAAAATATTATAACTGGCAGTAAGGGTGGGTCGAAAAAACCTTACGTGCCTAAAGAGCTTGAAGATAACCTGATTTCAATAAACAAAATTAAAATCCTGCTTGCCGTATCCGATGGTGAGTGTGACGCGGATTTCTCATTACGTGATCTGTATCTTGATGATGTTCCGGTTATCGCTGACGACGGAACTGTAAACTACCCAGGCGTTAAAGCAGAGTTTAGGCCAGGAACGCAGGATCAGGAATACATTCAGGGTTTTACTGACACGTCAAACGAGGTGATCATCAATCGAGATTTGACTACAAATACGCCATATGTTTTGTCGGTAACGAATAAGACGCTTTCCGCAATACGCATCAAAATGCTAATGCCAGTGGGTATTAAGCAGGAGGATAACGGCGACTTGGTGGGCGTTACCGTTACATATGCAGTTGATATGGCGGTTGATGGTGATTCATACAAAGAAGTAATGATCGACACCATTAACGGCAAGACGCGCACCGGATACGACCGTAGCCGACGCATTGACTTGCCGAAATTCAATGAGCGTGTTTTGCTTCGCGTTCGCCGCGTTACGCCTGACAGCACAACGTCGAAGGTTACAGACCTGATCCGCATACAGAGTTACGCCGAGGTAATTGATGCAAAATTCCGTTACCCTCTGACTGGGCTTGTTTACGTCGAGTTTGACAGCGAACTGTTCCCTAACCAGATCCCTAACATTTCAATTAAAAAGAAATGGAAGTTGATTCAGGTTCCCAGCAATTACGATCCTGTATCTCGTCGGTATTCTGGAACATGGGACGGAACATTTAAAAAAGCGTGGTCAAATAATCCGGCATGGGTTCTTTACGATTTGGTTACTAATCAGCGTTATGGACTTGATCAACGAGAATTGGGAATACAGATCGACAAGTGGAGTCTATACGAGGCTGGGGTGTATTGTGATCAGATGGTTCCAGATGGCAAGGGAGGAACAGAGCCGCGCTACCTATGCGATATGGTGATTCAAAGCCAGATTGAGGCTTATCAGCTTATTCGTGACATTTGCTCAATCTTCCGAGGAATGACTTTTTGGAATGGCGAGAGCTTGTCTGTAGTGGTGGATAAACCGCGCGAGCCATCATACATCTTTACAAACGACAACGTAATTAATGGAGATTTTCAGTACACGAGCGCCAGCGAAAAGAGCATGTACACTCAATGCAACGTTACGTTTGACGATGAGCAAAACATGTATCAACAGGACGTTGAGGGCGTATTTGACACTGAGGCGGCGTTACGTTTTGGCTACAATCCAACAAGTATCACGGCGATTGGTTGTACCCGTCGAAGCGAAGCGAATCGACGCGGACGCTGGATACTGAAAACCAACCTGCGCAGCCAAACGGTAAACTTTGCCACCGGGCTGGAAGGGATGATCCCATCTATCGGTGACGTTATCGCGATTTCTGATAACTTCCTGAGCAGCAACTTAACGCTCAACCTATCAGGGCGCGTAATGGAAGTTTCCGGCTTGCAGGTATTCGTTCCGTTTAAGATTGACGCTAGACCAGGTGATTTCATTATCATCAACAGACCGGACGGAAAACCAGTTAAGCGCACAATCTCAAAAGTCAGCGCGGACGGAAAAACCATTGAGTTAAACGTAGGGTTCGGATTCGACGTAAAACCGGACACCGTATTTGCAATCGACCGAACAGACCTTGCGTTGCAACAGTACGTTGTAACAGGAATCAGCAAGGGTGACACTGAGGAAGAATTTACCTACTCGATTACGGCGGTTGAATACGATCCTAACAAATACGACGAAATTGATTATGGCGTAAACATTGATGATCGTCCGATCTCAATCGTGCAGCCGGACATCATGCAAGCACCGGAAAATGTTCAGGTGTCATCATACTCCCGTGTTGTCCAGGGTGTTAGCGTTGAGACGATGGTTGTGTCGTGGGATAAAGTGCCTTATGCGTCACTGTATGAAATGCAATGGCGAAAAGGTGATGGCAACTGGCTAAATACGCCGCGCACAGCGAACAAAGAGACGGAAGTAGAAGGAATTTATTCAGGGAACTATCAAGTAAGGGTCAGATCTGTTTCAGCTTCCGGCAGTACATCAGCATGGTCGAAGATTGCAACAGCTTCCCTGACTGGTAAAGTTGGCGAGCCAGGCGCGCCAGTTAACTTAACCGCATCAGATAATGAAGTTTTCGGCATTCGTATTAAATGGGGCATGCCAGAAGGAAGCGGAGACACGGCCTACATTGAGCTTCACCAGTCGCCAGACGGAACGGTTGAAAACTCAAGCCTGTTAACGCTGATCCCATACCCGCAATATGAATACTGGCACAGTACACTACCAGCGGGTCAAGTTGTCTGGTATAGAATCCGCAGCGTTGACAGAATCGGCAACGTTTCTAGCTGGACTGATTTTGTTCGTGGCATGGCATCAGATGATGTTGAATCTGTTTTAGGAGATATTCTTGATAAGATTTTTGATAGCGAAGCAGGGCAAGAAATCAAGGAGAACGCCATCGAAAGCGCGAACAAGATCAAAGACCAGGCGCAGGCAATAATCCAGAACGCTTTGGCGATTGACGCTAACTTGAAATGGACGCGAGTACAGAACGGAAAGCGAAAGGCTGAATATGGTCATGCTGTTCAGTTGATCGCAAATGAGACTGAGGCGCGAGTAACTCAAATCGAAGAATTGAGGGCTTCAATTGATGGCGAGATAACATCAAGCATCAAGACAGTGCAGGAGGCAATCGCCACTGAATCAGAGACACGAGCAACTCAAATTCAGCAGCTTGATTCTAAATTCACAAAAGAAATTGACGGCGTGCGAAAGGATACTTCTGCAAGCATTAGCGATGTAAGGCAGACAATCACTAACGAGTCAGAAGCGCGAGCGCAAGCGGTACAACAGCTTGATGCAAAATTCACCAAAGAGATTAACGATCTCAATGATGCAATCAAGGTTGACGTTGAGGCCAGCATTTCCGAAGTGAGACAAGCTATCGCCAACGAGGAGGAAGCGCGAGTACAAGCTGATCAGGCATTAACAGCACGATTTGGAGACGTTGAATCTGCATTGGTTGAAAAGTTGGATTCTTGGGCGAGCGTTGATTCAGTTGGTGCTAAGTACGCTATGAAACTTGGCCTTACTTACAAAGGACAGCAGTACAGCGCAGGAATGGTAATGCAGCTTTCGCAGGGTTCATCCGGTCTTATCTCGCAAATTTTGTTTGATGCTAACAGGTTCGCCATTATGACTAGCTCTACTGGAGGGACTTTCACTTTGCCTTTCGTGGTTGAGAATAATCAGGTTTTCATTAATAGTCTTTTGGTGAAGAACGGATCAATCACTAATGCGATGATTGGTAATGTGATTCAGTCAAACAACTTTGTTCAAAACCAGCAAGGATGGAGGCTTGATAAAAACGGAATCTTTGAGAATTACGGATCAACGACAGGAGAAGGAGCTACTAAATTCACCAATGAGGGATTGAAGGTAAAAGATGCAAACGGAGTATTGAGGGTTGAAGTCGGAAGGATTACCGGAAGCTGGTAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
ffda6b0da798542ea1aba82f6f45b5b8f0bff4ed0b6e184b0939520d39f9e3f0
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7725
Evidence 0,7725

Literature

No literature entries available.