Protein

Genbank accession
WRQ13383.1 [GenBank]
Protein name
tail fiber protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
TF
Evidence Phold
Probability 1,00
Protein sequence
MGYFQMTRNVEELFGGVVTAPHQIPFTYKSNVGGETFLSLPFYPVTGVVTINGGMQVPLDNFEIEGNTLNLGRALSKGDVVYCLFDKILSPEDTAKGIRIYKFQAVGGETEFTPDFTSYGVQSLYIGGEYKTPEIEYSYDSTTGKVSLQTALTAGVWVVAEMSVKQPNISPAFDRSIQEIARSANVKDSEVIVSTDTISLLDGKKVVYDIATQTSYGLPTIPDGSVISSVSAGKLNYNPGDVQVDLLPLEDSFINVINTLGRNDGAKYIGECHSVADLRNTEPTMDGQRIILKQHTAGTLLGGGVFRALIDGTGKTDNNGTVIKTVGGAAWLRVNADRVNPFMFGALGGSNDDTIPVQSCVDSGKATQLTGVHYVSNIQLKYNTSSIYGSGLHYSRLHQLPSATGNCITIKDTCSLIVLDAFGVYGTGAQQGTSFTAGTTGIYVETPSGLSADYPFHTTADPRRDLCISKVHIAGFDEYGLNIDSGNFSVTTDSLLVNHINQVGVRCATTDWTWTNIQVNTCGKQCLVLDGCGNGRIIGGKFIWANWQPYGTVGQFPGITINNSQNMVINGIEVQDCGGNGIEISDSYSISMNGLNTNRNGINANNTFYNIVFNKSDAVINGFVGLNYAANSGSGANSSAGNFQFLSNDCSVTINGVVETGYMGINFIGDNNIINPTNSDLSINGLVNYSKTGLQTMNETPTFDGVSTTPVYVSVPSSVGQVNGLRLSQANKDKLLYSRTAGPEGITMAAVVVPTISGAEVFNFMAIGSGFSDTSNSLHLQLVIDASGKQTIALLLGGDGTTQILSGDLPNDLKLQSGVPYHIAIGAKPGYFWWSILNIQTGKRIRRSFRGAYLAVPFNSIFGLTSSLTFFSDSNAGGDACSGVGAKVYVGMFSSENDYVASRYYNLINPVDPTKLISYRILDSSI
Physico‐chemical
properties
protein length:926 AA
molecular weight: 98798,63870 Da
isoelectric point:4,99753
aromaticity:0,09179
hydropathy:-0,03855

Domains

Domains [InterPro]
WRQ13383.1
1 926
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Salmonella phage vB_SenAc-pSK20
[NCBI]
3093916 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WRQ13383.1 [NCBI]
Genbank nucleotide accession
OR729889.1 [NCBI]
CDS location
range 140348 -> 143128
strand -
CDS
ATGGGGTATTTTCAAATGACCAGAAATGTAGAAGAATTATTCGGCGGCGTAGTCACAGCTCCCCACCAGATTCCTTTCACGTATAAATCAAATGTCGGTGGAGAAACTTTCCTTTCCTTGCCGTTCTATCCTGTCACTGGCGTAGTCACAATCAACGGTGGTATGCAAGTTCCCTTAGACAACTTTGAAATCGAAGGAAATACGTTGAATCTCGGGCGCGCATTGTCCAAAGGCGATGTTGTGTATTGCTTATTCGATAAAATTCTTTCGCCAGAAGATACAGCCAAAGGTATCCGCATATACAAATTTCAGGCCGTAGGAGGTGAAACCGAGTTCACTCCTGATTTCACATCTTATGGAGTCCAATCTCTTTATATCGGTGGCGAGTACAAAACCCCCGAAATTGAATATTCCTATGACAGCACGACAGGAAAAGTATCTTTGCAAACTGCACTGACTGCAGGCGTTTGGGTAGTCGCTGAAATGTCTGTTAAACAACCGAATATCAGTCCGGCGTTTGACCGAAGTATTCAAGAAATCGCCCGTTCTGCTAATGTAAAAGACTCTGAAGTCATCGTTAGTACGGACACCATATCTTTGTTGGATGGGAAGAAAGTTGTTTATGATATAGCGACGCAAACCAGTTATGGTTTACCAACCATTCCTGATGGTTCTGTCATTTCTTCTGTATCTGCTGGGAAATTGAATTACAACCCAGGTGATGTGCAGGTTGATTTGTTGCCTTTAGAAGATTCATTTATTAATGTGATAAACACTCTGGGGCGCAATGATGGTGCCAAGTATATTGGAGAATGCCATTCTGTTGCTGATCTCAGGAATACTGAACCCACTATGGATGGACAACGCATTATTCTTAAGCAACACACTGCGGGTACTCTTCTTGGTGGAGGGGTATTCCGTGCGTTAATTGATGGTACAGGAAAGACTGATAATAACGGTACTGTGATCAAAACTGTTGGCGGCGCGGCATGGTTACGTGTTAATGCTGATAGAGTTAACCCATTCATGTTTGGTGCTTTGGGTGGTTCTAATGATGATACTATTCCAGTACAATCTTGTGTGGATAGTGGTAAGGCCACACAATTAACTGGTGTACATTACGTTAGCAATATCCAGTTAAAATATAATACGTCGTCTATTTATGGGTCTGGATTACATTACTCAAGGTTGCATCAGTTGCCTTCTGCTACTGGGAATTGTATTACCATAAAAGATACATGCTCCCTTATTGTATTAGACGCCTTTGGGGTATATGGCACAGGTGCACAACAAGGCACGTCATTTACTGCGGGCACAACAGGTATCTATGTAGAAACTCCTTCAGGTCTCTCAGCCGATTATCCGTTCCACACTACCGCAGACCCAAGACGCGACTTGTGTATTTCTAAGGTCCATATAGCAGGTTTTGATGAATATGGGTTAAATATTGATAGTGGTAACTTTAGTGTTACTACAGATTCTCTTTTAGTCAACCACATCAATCAGGTGGGTGTCCGTTGTGCTACTACTGATTGGACTTGGACAAATATCCAGGTTAATACCTGCGGTAAACAATGTCTGGTTCTTGATGGTTGTGGTAATGGTCGTATTATTGGCGGTAAATTCATTTGGGCTAACTGGCAACCTTATGGTACAGTAGGACAGTTCCCAGGCATTACTATTAATAACAGCCAGAATATGGTTATTAATGGTATTGAGGTACAAGATTGTGGCGGGAATGGCATTGAGATTAGCGATTCATATTCAATTTCCATGAACGGATTGAACACCAATCGTAACGGCATCAATGCTAACAACACTTTCTACAACATCGTATTTAACAAAAGTGATGCAGTTATCAACGGATTCGTAGGACTCAATTATGCCGCGAATAGTGGTTCAGGTGCTAACTCTAGTGCAGGCAATTTTCAGTTCCTGTCTAATGATTGTAGTGTCACCATTAATGGTGTGGTTGAGACTGGTTATATGGGCATTAACTTTATTGGTGATAACAATATTATCAACCCCACCAATTCCGACCTGAGCATTAACGGATTGGTTAATTATTCCAAGACTGGTTTGCAAACCATGAACGAGACCCCTACATTTGATGGTGTTAGCACTACACCTGTTTATGTAAGTGTCCCATCTTCTGTAGGGCAAGTAAATGGTCTGAGACTATCACAAGCCAACAAAGATAAATTACTGTATTCAAGAACAGCAGGTCCAGAAGGTATTACCATGGCTGCTGTTGTAGTACCCACCATATCTGGAGCTGAAGTATTTAACTTCATGGCCATTGGTTCAGGGTTTAGTGATACATCCAACAGTCTTCATCTTCAATTAGTTATAGACGCTTCTGGAAAACAAACAATTGCTTTGCTATTGGGGGGCGATGGTACAACCCAAATTTTATCTGGGGATTTACCTAACGACCTTAAACTACAAAGTGGTGTACCATATCATATAGCTATTGGTGCTAAACCTGGATATTTCTGGTGGAGTATTCTTAATATTCAGACGGGTAAGAGAATCAGACGGTCATTCCGAGGCGCTTATTTAGCCGTACCATTTAATTCTATATTCGGATTAACTTCTTCATTAACATTCTTCTCGGATAGCAATGCTGGTGGGGATGCTTGTTCTGGGGTGGGCGCTAAAGTGTATGTTGGTATGTTCTCTTCTGAGAACGATTATGTAGCTTCACGATACTACAACCTGATTAATCCTGTAGACCCTACTAAGTTAATTAGTTACCGTATATTGGATTCTTCTATTTAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
ab05130c5154a064bd3bc295be74aa089067ef8263d8d6738b4374802f4a8c09
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7225
Evidence 0,7225

Literature

Title Authors Date PMID Source
Complete genome sequence of Salmonella virus vB_SenAc-pSK1 Kim,Y., Park,S.Y. and Kim,J.H. 2012-09-15 GenBank