Protein

Genbank accession: AVZ45100.1 [GenBank]
Protein name: putative tail fiber protein
RBP type (prediction, evidence, probability):
TSP, DepoScope, 1.00
TSP, RBPdetect, 0.90
TSP, RBPdetect2, 0.73
TF, Phold, 1.00
Protein sequence
MALYPIKSLGAVGVIADQAPTDLAPNAFTNAMNARFVEQRVFKTGGNAPLSYVEEDKDLTPLSFVSMPFDYYSAGNSFLVVGTDKKLYKLTDESLTDISRKVATVTKKASAIIKIYPVVSRIVPKESTITMNFNQTKELEVQVFPEDANNANLTWEVSNPSYASIAVNPTDSKKATLTTLSTEGTLSITVSIEDESVTAQISVNIVDGDTGIFLSQDTITIRRGGTTTLTAISGKTPITWISSNGGALSVTPNANTLTAVLNAMGEGTFTVTADNGSKSATCTVNVIPQIDSISLSQTDVQMDRGTQYVLTATVNPADAPNKAITWTSSNPNIATVSGTSTEATITGLLAGFTEITAVTEEGSRSAVCTVRVNLAGRMLNTRSLAMAASAPLVEEFKEEEEPVVQNEEVVYFMSDSMGIDTSGMAEGNNFFDYSNVFDMEGFARAAENSRAAPLTNVTLDIVEASLDVGEEIVITATAAPEGDYSYQWVVDKSGYVSTTSTTGRSLKLTAVRKGEIKVTCTASQMTQRDYDAFDDYPWYHAVISNCAVATTHYETPQVKEFESEYFTDLPGWGEQTIVDGDGNPSVRKFNWKCERVRAFNNRLFALNMRESNASGVTTHYPLRLRWSNFANENKAPTLWDDYAYDRLTTSDLSANIVGQTEALENGYAGYIDLADSNGSLIDVLPLKDYLFVYTEFETYIGSPTNNTYQPLMFKKLFNDSGILAPECVVEVEGGHFVVTQNDVILHNGASKKSIASNRVKNMLINEVCLVNPLATRVHLHQDKKEVWVMYVGPGEPKESFACTKAAVWNYEFDTWSFRTIPYAQCIGLVDPPVLERGPVWTDFQTITWDDPAIDKLVWRKDATNFRQRITIVGSFLRGFYQVDVGALDYFYDRANDKIIERPLEMRLERTGIDFDNVTNEWNQKHINRFRPQTTGSGTYIFEAGGSQFSNEYGHNHTTKSYTIGVDRHVAVRLNHPYLFYNVIDNDVNSNAAINGLTIEFNVGGRR
Physico‐chemical properties
protein length: 1006 AA
molecular weight: 110891.69280 Da
isoelectric point: 4.80701
aromaticity: 0.09642
hydropathy: -0.24503
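
The record does not say how these descriptors were computed. As a minimal sketch, assuming aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle value per residue (a GRAVY score), two of them can be recomputed from the protein sequence. The function names and the choice of scale are assumptions for illustration, not the database's documented method.

```python
# Kyte-Doolittle hydropathy scale. Assumption: the record does not name its scale.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Residues counted as aromatic (assumption: F, W, Y only).
AROMATIC = set("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W or Y."""
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy per residue."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


def summarize(seq: str) -> dict:
    """Length, aromaticity and hydropathy for a one-letter protein sequence."""
    seq = seq.strip().upper()
    return {
        "length": len(seq),
        "aromaticity": round(aromaticity(seq), 5),
        "hydropathy": round(gravy(seq), 5),
    }
```

Passing the protein sequence above to `summarize` gives values to compare with the listed ones. Any mismatch would suggest the record used a different definition or scale.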

Domains

Taxonomy

Phage name: Escherichia phage EP335 [NCBI]
Phage taxonomy ID: 2070199
Phage lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host: No host information

Coding sequence (CDS)

Genbank protein accession: AVZ45100.1 [NCBI]
Genbank nucleotide accession: MG748548 [NCBI]
CDS location: range 16657 -> 19677, strand +
CDS
ATGGCGTTATACCCTATAAAATCACTTGGGGCTGTCGGTGTTATCGCTGATCAGGCACCAACTGATTTAGCACCTAACGCTTTCACCAACGCTATGAACGCTCGGTTTGTTGAGCAGAGAGTTTTTAAGACGGGGGGCAATGCCCCTCTTTCTTACGTGGAAGAAGACAAAGATCTGACTCCACTCTCTTTTGTCTCCATGCCTTTCGATTATTATAGCGCAGGAAATAGCTTCCTTGTAGTAGGTACAGATAAGAAGTTATATAAACTGACAGATGAAAGCTTAACTGATATCAGTCGTAAAGTTGCTACGGTAACTAAGAAAGCTTCTGCTATCATAAAGATTTATCCAGTGGTCTCAAGGATTGTTCCTAAAGAGAGTACTATCACAATGAACTTTAACCAGACAAAAGAGTTAGAAGTTCAGGTTTTTCCAGAGGATGCTAATAATGCTAATCTGACTTGGGAAGTAAGTAACCCTTCTTATGCCAGTATTGCAGTAAATCCTACAGATTCTAAAAAAGCCACCCTCACTACATTATCTACAGAAGGAACACTGTCCATTACTGTTTCCATTGAAGATGAATCTGTGACAGCTCAAATCTCCGTTAACATTGTTGATGGGGATACGGGTATCTTCTTGAGTCAAGACACAATCACAATTCGAAGAGGTGGTACAACAACTCTTACTGCTATCTCAGGTAAGACTCCTATCACTTGGATTAGTAGCAATGGTGGTGCTTTGTCTGTGACACCCAATGCCAATACACTAACTGCTGTTCTCAATGCCATGGGAGAAGGAACTTTTACTGTTACAGCTGATAATGGCTCTAAGTCTGCTACCTGTACAGTTAATGTGATACCTCAGATTGATTCTATTTCTCTGAGTCAGACAGATGTTCAGATGGATAGAGGGACTCAGTATGTTCTAACTGCAACAGTCAACCCTGCTGATGCTCCTAATAAAGCAATCACTTGGACTTCTTCCAATCCTAATATTGCTACAGTATCAGGGACAAGCACAGAGGCTACAATTACTGGACTTCTGGCTGGGTTTACAGAGATTACAGCAGTAACAGAGGAAGGTAGTCGTTCAGCTGTTTGTACTGTTCGTGTCAACCTAGCAGGTAGAATGCTAAATACCAGAAGCCTAGCGATGGCTGCTAGTGCACCTCTAGTGGAAGAGTTCAAAGAGGAAGAAGAACCAGTTGTGCAGAATGAAGAAGTTGTTTACTTCATGTCTGACTCTATGGGAATTGATACCTCTGGCATGGCTGAAGGCAATAACTTCTTTGACTACTCTAACGTATTTGATATGGAAGGTTTTGCTCGTGCTGCGGAGAACTCAAGAGCTGCTCCTCTGACAAATGTGACACTAGATATTGTTGAAGCTTCTCTAGATGTAGGTGAAGAAATTGTCATAACTGCTACAGCAGCTCCAGAAGGGGATTACTCCTATCAGTGGGTTGTTGACAAGAGTGGTTATGTTTCTACTACCTCAACAACTGGAAGATCTTTGAAACTTACAGCTGTTCGTAAAGGCGAGATTAAAGTTACATGTACGGCGAGTCAGATGACTCAAAGAGACTACGATGCTTTTGATGATTACCCTTGGTATCATGCAGTAATCTCTAACTGTGCAGTAGCGACAACTCACTATGAAACTCCTCAGGTTAAAGAATTCGAATCTGAATACTTTACAGACCTTCCGGGCTGGGGTGAACAAACAATTGTTGATGGTGATGGGAACCCTTCTGTTCGTAAGTTTAACTGGAAGTGCGAAAGGGTTAGAGCTTTTAACAACAGATTGTTTGCTCTGAATATGAGGGAATCTAATGCCTCTGGTGTTACCACTCACTATCCTTTACGTCTTCGCTGGTCTAACTTTGCGAACGAGAACAAGGCTCCTACTTTGTGGGATGATTATGCTTACGATCGACTGACAACTTCTGATCTTTCAGCGAACATTGTTGGGCAGACTGAAGCTCTTGAGAATGGTTA
TGCAGGGTATATTGATCTGGCTGACTCTAACGGTAGTTTGATTGATGTTCTCCCTTTGAAAGATTACTTATTTGTTTACACCGAGTTTGAAACCTACATCGGTTCTCCTACTAACAACACATACCAGCCTCTGATGTTTAAGAAGCTGTTTAACGATTCAGGTATTCTTGCTCCTGAGTGTGTGGTTGAAGTAGAGGGTGGTCACTTTGTTGTAACACAGAACGATGTGATTCTTCATAACGGTGCATCTAAGAAATCTATTGCATCTAACCGTGTCAAGAACATGCTTATTAATGAAGTGTGTTTGGTAAACCCTCTAGCTACTAGAGTTCACTTGCACCAAGATAAGAAAGAAGTTTGGGTCATGTATGTTGGGCCGGGAGAGCCGAAAGAAAGTTTTGCTTGTACGAAGGCTGCGGTCTGGAATTACGAGTTTGATACTTGGTCTTTCCGTACTATCCCGTATGCTCAATGTATTGGTCTTGTTGATCCTCCTGTTCTCGAAAGAGGTCCAGTGTGGACTGACTTCCAAACTATCACTTGGGACGACCCTGCTATTGATAAACTGGTGTGGAGAAAGGATGCAACTAACTTCCGTCAGAGAATTACTATCGTAGGCTCTTTCTTAAGGGGTTTCTATCAAGTAGATGTTGGTGCTTTGGATTATTTCTATGACAGAGCGAATGACAAAATAATAGAGCGCCCTCTGGAAATGAGGTTAGAGAGAACAGGGATTGATTTTGATAACGTCACTAACGAATGGAATCAAAAACACATCAACCGGTTCAGACCTCAGACTACAGGTTCTGGTACGTATATCTTTGAAGCTGGAGGTAGTCAATTCTCTAACGAGTATGGTCACAACCACACAACTAAGAGTTATACGATTGGAGTTGATAGGCACGTAGCTGTGAGACTGAACCATCCATACCTATTCTATAATGTTATAGATAATGATGTTAACAGTAACGCAGCCATAAATGGGCTGACAATAGAGTTTAATGTTGGCGGTCGAAGATAA
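
The CDS coordinates and the protein length can be checked against each other. The sketch below assumes the range 16657 -> 19677 is 1-based and inclusive, as in GenBank feature locations, and that the CDS includes its stop codon (the sequence above ends in TAA). The function names are hypothetical.

```python
def cds_length_nt(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def expected_protein_length(start: int, end: int, has_stop: bool = True) -> int:
    """Residues encoded by the range, excluding the stop codon if present."""
    n = cds_length_nt(start, end)
    if n % 3 != 0:
        raise ValueError(f"CDS length {n} is not a multiple of 3")
    codons = n // 3
    return codons - 1 if has_stop else codons


if __name__ == "__main__":
    print(cds_length_nt(16657, 19677))            # 3021
    print(expected_protein_length(16657, 19677))  # 1006
```

Under these assumptions the range spans 3021 nt, or 1007 codons, which corresponds to 1006 residues plus a stop codon and agrees with the listed protein length of 1006 AA.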

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID: 4853aef1eb2ab4040ab135726b6911b36134690d30c9c9f8453570c4453dff78
ESMFold
Source: ESMFold
Method: ESMFold
Resolution: 0.7689
Evidence: 0.7689

Literature

No literature entries available.