Protein

GenBank accession
AGH57796.1 [GenBank]
Protein name
fiber protein
RBP type
Type Evidence Probability
TSP DepoScope 1.00
TSP RBPdetect 0.88
TF GenBank 1.00
TF Phold 1.00
Protein sequence
MALTRLKNIITSRTGRIIYVNPDDFDASDAIDNRGNSALRPFKSIQRAFLEVARFSYRVGLSNDEFDAFSIMLYPAEYVVDNRPGEVLYTNVAPIDENSNLDLTSPNNVLYKYNSIEGGIIVPRGASLVGTDLRRTKIIPKYVPYPTVFAAKGINTEDQVPSRTSIFKVTGGTYFWQFSFFDGAEEGVYFKPDSVETLAPKFSHHRLTCFEFADGVNSLSTLISQGRVPNSDYSAVPNILERTDLEIYYQKVSKAFASIPDTSGDPAADQIQARVEENRIVGPISDEYRVLQITRNGNTATAVTVDEFDNPRDHGFSVGVNINISGVTGSTGPQSELDASLYNGSFTVTSASGNIFTYQMIQEPTGNAVGSNITVKTEIDTVDSASPYAFNLSLRSVWGMNGMLADGSRATGFKSMVVAQFTGLSLQKDDRAFVRYNESTGNYDVASAGDGAHLDGFAEYRKGWAHRHIVASNDAFIQAVSVFAVGYGSHFTCESGADMSITNSNSNFGNTALRAAGFKAKSFSKDKAGEITHIVPPKALNVISTIASGASGEASITLANDGSVNGVVQGMTVTGDNIGTGATVVSVNTNTRIITLSVVNSDAVNGNVIFGEETSVNWVNIDIQRTKVINQSLAGAGGTPGTRLYLYGYTAPAGAPTTRVQGYTVGARQDGTGVSAIPDKINCLLVAQGATEATVQSAFISPYGPSVSGLAAGVAGSPIQYDSNTYTISGVAGQVGGWYLSVDSVDNDIYTTLSTNTRYNNVNFTPTTFLRRIPDPRDLADRTFRIRLKIDKDKTNPLPRDPLSGYVLQPLNSDSTNYKLDKTFYIYDIEKVQEFERGVSDGLYYITLLCASITPSTSNFNDRKFSQNVNEVYPTFDRDNPVADPNPAVSVADNETIGLVYSTDGATPTPNKDPKRSITKEAIKFLLTDTGWTQPGTTPNFDSVNNRLSNVELTARAGDEEVRKINIRENNDGTVAPIPVEFRRHSILRSGNHTFEYLGFGPGNYSTAFPQTQVETLSADQVKFSQSIKEEAGVAFYSGLNSNGDLFIGNQVINPVTGQITNEDIAQLNVVGEENTTIETFSELVLTDKLTVIGGASNQLESIFAGPVTFQGQVSSTSNLIAKKITYNNQDGTVIKQTLLAPEDALGQPDFTNITGYTTPSDGDLVYNINWTPGKSLGWIYYQGLWKEFGITDTGQIDIQTFVDGDGVDQQHLGFGVAANSTYRANILGNVRIDGNLTTTGTGGISADKYVTRTYTGDGNTLTYNITTFTGGVKHTADSLLVFLNGVAQIGGTNFSVDANGANIIFGAGDAPLSTDTIHIIELPI
Physico-chemical properties
protein length: 1325 AA
molecular weight: 142719.46900 Da
isoelectric point: 4.81099
aromaticity: 0.09283
hydropathy: -0.23419
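
The aromaticity and hydropathy values above are composition-based statistics. The sketch below shows one common way such values are derived, assuming aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle score (GRAVY). Whether this record was computed with exactly these definitions is an assumption, and the function names and the 20-residue N-terminal fragment are illustrative only.

```python
# Kyte-Doolittle hydropathy scale (one value per standard amino acid).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY)."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 20 residues of the protein sequence in this record, used only
# as a short demonstration input (hypothetical variable name).
FRAGMENT = "MALTRLKNIITSRTGRIIYV"

print(len(FRAGMENT), aromaticity(FRAGMENT), gravy(FRAGMENT))
```

Running the same functions on the full 1325-residue sequence is the way to compare against the listed values. Molecular weight and isoelectric point need residue masses and pKa tables and are left out of this sketch.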

Domains

Domains [InterPro]

No domain annotations available.


Taxonomy

  Name Taxonomy ID Lineage
Phage Synechococcus phage KBS-M-1A [NCBI] 889950 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

GenBank protein accession
AGH57796.1 [NCBI]
GenBank nucleotide accession
JF974293 [NCBI]
CDS location
range 19209 -> 23186
strand -
CDS
ATGGCTCTTACTAGACTTAAGAATATTATTACGTCCAGAACTGGACGTATTATCTACGTTAACCCTGACGATTTCGATGCATCTGATGCTATTGATAATAGGGGAAACTCTGCACTTCGTCCGTTTAAGTCTATTCAGAGGGCATTCTTAGAAGTTGCTAGATTTTCGTATCGAGTCGGTCTGTCAAACGACGAATTTGACGCCTTCTCGATTATGCTTTATCCAGCAGAATATGTTGTTGATAACAGACCTGGAGAAGTTCTTTATACAAACGTTGCTCCTATTGATGAAAACTCTAACTTAGATTTAACATCACCAAACAACGTTCTTTACAAATATAATTCTATTGAAGGTGGTATCATTGTTCCTAGAGGTGCTTCCCTCGTTGGTACTGACCTTCGTCGTACAAAAATTATTCCTAAGTATGTTCCCTATCCTACAGTATTTGCTGCAAAGGGAATCAACACAGAAGACCAAGTTCCTTCTAGAACTTCAATCTTTAAGGTAACTGGTGGTACTTATTTCTGGCAATTCTCGTTCTTTGATGGTGCTGAAGAGGGTGTCTATTTCAAACCAGATAGTGTAGAAACACTTGCACCTAAGTTCTCTCACCATAGATTGACATGTTTCGAGTTTGCTGATGGTGTAAACTCTCTTTCAACTCTTATTAGTCAAGGAAGAGTACCAAACTCAGACTATTCTGCAGTCCCTAATATCCTTGAAAGAACTGACTTAGAGATTTACTATCAGAAAGTATCGAAAGCATTTGCATCAATTCCCGATACATCTGGAGACCCTGCAGCAGACCAAATTCAGGCAAGAGTTGAAGAAAACAGAATCGTTGGTCCTATTTCTGACGAATATCGTGTTCTTCAAATTACTCGCAACGGCAATACTGCAACCGCAGTTACTGTTGATGAATTTGATAACCCCAGAGACCATGGATTCTCTGTTGGTGTTAACATTAACATCTCTGGTGTTACTGGTTCTACTGGACCACAATCAGAACTGGATGCTTCCTTGTATAATGGTTCATTCACAGTAACATCTGCGTCTGGTAACATTTTTACCTATCAGATGATTCAAGAACCTACAGGTAATGCCGTTGGTTCTAACATTACAGTTAAGACTGAGATTGATACTGTTGACTCTGCATCACCATATGCGTTCAACTTGTCTCTTAGAAGTGTCTGGGGCATGAATGGAATGCTCGCAGATGGTTCTCGTGCAACTGGTTTCAAATCGATGGTTGTTGCACAGTTTACGGGTCTATCTTTGCAGAAAGATGATAGAGCGTTTGTAAGATATAATGAGTCTACTGGTAACTATGATGTTGCATCTGCTGGTGATGGTGCTCACTTAGACGGTTTTGCTGAGTATCGTAAAGGTTGGGCACACAGACACATTGTTGCATCTAACGACGCATTTATTCAGGCAGTTTCGGTGTTCGCGGTTGGATATGGTTCCCACTTCACTTGTGAGAGTGGTGCTGACATGTCCATTACCAACTCTAACTCCAACTTCGGTAATACCGCTCTTCGTGCTGCTGGATTTAAGGCAAAATCTTTCTCTAAAGATAAAGCAGGAGAGATTACACATATTGTTCCACCTAAAGCACTTAATGTTATTTCAACAATCGCAAGTGGCGCTAGTGGCGAAGCATCAATCACTCTTGCAAATGATGGGTCGGTTAATGGTGTTGTTCAGGGAATGACTGTTACTGGAGACAATATTGGAACTGGAGCAACAGTTGTCTCTGTCAATACAAATACTAGAATTATTACACTTTCGGTAGTTAATTCTGATGCAGTTAATGGCAACGTAATTTTTGGTGAAGAAACTTCTGTTAACTGGGTCAACATTGATATTCAACGTACTAAAGTAATTAACCAGTCTCTTGCTGGTGCTGGTGGTACTCCTGGCACCAGACTTTACCTCTATGGATATACTGCTCCTGCGGGTGCCCCAACAACCAGAGTTCAGGGTTATACAGTTGGCGC
TCGTCAAGATGGTACTGGAGTAAGTGCAATTCCTGATAAAATTAACTGCTTATTGGTTGCCCAAGGTGCAACTGAGGCGACAGTACAATCTGCATTTATTTCACCCTATGGTCCCAGTGTATCTGGTCTCGCTGCTGGTGTTGCTGGTTCACCCATTCAGTACGATAGCAATACCTACACTATTAGTGGAGTTGCTGGACAAGTCGGTGGTTGGTATCTTTCTGTTGATTCTGTAGATAATGATATCTACACAACACTGTCTACTAATACTCGTTATAACAATGTTAACTTTACTCCAACTACATTCTTAAGAAGAATCCCTGACCCTAGAGACCTTGCAGACAGAACATTCCGTATTCGTTTGAAGATTGATAAAGATAAGACTAATCCATTACCCAGAGATCCCCTGAGTGGTTATGTACTACAACCATTGAATAGTGATAGTACAAACTACAAACTTGATAAAACTTTCTACATTTACGATATTGAAAAAGTACAAGAGTTTGAGAGAGGTGTTTCCGATGGACTTTACTACATTACCCTCCTTTGTGCATCTATTACACCTTCAACTTCTAACTTCAACGATAGAAAGTTCTCCCAAAACGTCAACGAAGTCTATCCTACGTTTGACAGAGACAACCCTGTTGCTGACCCTAACCCTGCTGTATCCGTCGCTGACAACGAAACTATCGGTCTAGTATATTCTACTGATGGTGCAACTCCAACTCCAAACAAAGACCCCAAACGTTCTATTACTAAGGAAGCGATTAAGTTTCTTCTGACTGATACTGGTTGGACACAACCAGGTACAACACCCAACTTTGACTCTGTTAATAATAGACTTTCTAACGTTGAACTTACTGCTCGTGCTGGTGATGAAGAAGTTAGAAAGATTAACATCAGAGAGAACAATGATGGAACAGTTGCTCCTATTCCAGTAGAGTTCAGAAGGCACTCCATCTTACGTTCAGGTAACCATACGTTTGAATATCTTGGTTTCGGTCCTGGTAACTATTCTACCGCATTCCCCCAGACTCAGGTAGAAACTCTGAGTGCAGACCAAGTTAAGTTCTCGCAGTCTATTAAGGAAGAAGCAGGTGTTGCTTTCTATTCTGGTTTGAACTCTAACGGTGACCTGTTTATTGGTAACCAGGTTATTAACCCTGTTACTGGTCAAATTACAAACGAAGATATTGCACAACTGAACGTTGTTGGTGAAGAAAATACAACAATTGAAACATTCTCTGAGTTGGTTCTTACCGATAAACTTACGGTTATTGGTGGTGCATCTAACCAGTTGGAATCTATCTTTGCTGGTCCTGTTACTTTCCAAGGTCAAGTATCTTCTACTAGTAACTTAATTGCTAAGAAGATTACCTACAATAACCAAGATGGTACTGTTATTAAACAGACCTTACTTGCACCAGAAGACGCACTGGGTCAACCAGACTTTACTAATATCACTGGATATACCACACCTTCCGATGGTGATTTAGTTTACAACATTAACTGGACACCTGGTAAATCTCTTGGATGGATTTACTATCAAGGTCTTTGGAAAGAGTTTGGAATCACAGATACAGGTCAAATTGATATTCAAACGTTTGTAGATGGTGATGGTGTAGACCAGCAGCATCTTGGTTTTGGTGTTGCTGCTAACAGCACTTATAGAGCAAACATTCTTGGTAATGTTCGAATTGATGGTAACTTAACTACAACTGGTACTGGTGGTATTTCTGCTGACAAGTATGTCACCAGAACATATACTGGTGATGGCAATACTCTCACCTATAATATCACCACATTCACTGGTGGTGTTAAGCATACGGCGGATTCTCTGCTAGTATTCTTAAATGGTGTTGCTCAGATTGGTGGAACTAACTTCAGCGTTGATGCAAATGGTGCCAACATTATCTTCGGTGCTGGCGATGCACCACTCTCTACGGATACGATTCATATCATTGAATTGCCTATCTAA
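
The CDS coordinates, strand and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and the standard codon assignments. All function names are illustrative. The CDS above starts with ATG and ends with TAA, so it is already shown in coding orientation; `reverse_complement` would only be needed when cutting the minus-strand range out of the genome's forward-strand sequence.

```python
from itertools import product

# Standard codon table, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def expected_protein_length(start: int, end: int) -> int:
    """Residue count implied by the range, excluding the stop codon."""
    n = cds_length(start, end)
    if n % 3:
        raise ValueError("CDS length is not a multiple of three")
    return n // 3 - 1


def reverse_complement(seq: str) -> str:
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(cds: str) -> str:
    """Translate a coding-orientation DNA string up to the first stop."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Range from this record: 19209 -> 23186 on the minus strand.
print(cds_length(19209, 23186))               # 3978
print(expected_protein_length(19209, 23186))  # 1325
# First ten codons of the CDS listed above.
print(translate("ATGGCTCTTACTAGACTTAAGAATATTATT"))  # MALTRLKNII
```

The 3978-nucleotide range gives 1326 codons including the stop, which agrees with the 1325 AA protein length listed under the physico-chemical properties.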

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
db6bd3810fa111dc0f93dee3ca03c33ea3915daf9567e76afb789808443fa8e3
ESMFold
Source ESMFold
Method ESMFold
Resolution 0.3858
Evidence 0.3858

Literature

Title The Genome Sequence of Cyanophage KBS-M-1A
Authors Henn,M.R., Lennon,J., Levin,J., Malboeuf,C., Casali,M., Russ,C., Lennon,N., Chapman,S.B., Erlich,R., Young,S.K., Yandava,C., Zeng,Q., Alvarado,L., Anderson,S., Berlin,A., Chen,Z., Freedman,E., Gellesch,M., Goldberg,J., Green,L., Griggs,A., Gujja,S., Heilman,E.R., Heiman,D., Hollinger,A., Howarth,C., Larson,L., Mehta,T., Pearson,M., Roberts,A., Ryan,E., Saif,S., Shea,T., Shenoy,N., Sisk,P., Stolte,C., Sykes,S., White,J., Haas,B., Nusbaum,C. and Birren,B.
Date 2011-09-23
PMID not listed
Source GenBank