Protein

Genbank accession
AIT14459.1 [GenBank]
Protein name
structural protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
Protein sequence
MSINRIRGDGGVIIDSKSFLELPKAPSKETADVIRSGMIRYNKEWKSFEGVLDFEDGTMAYRRFANLDANGKLLTSQLPDSVTSGLQFVGTYNPIPDDIDPPLAQTPLPAPAASLNGDYYVVRGIMDAAQAHYEANNPQTSPVTFTPTNPTGQGNWIEILYYVDSNPNISGAKLVTYAFARIIPASIPSTGHDGLKNLATDADLTKPFASGVPMNQTALSDGDWIIITEDKNIRLRQSRTSISAASVLYDNTVMIANHRQFRTNAGTVQTSIDNVVIECLRRTGDSMYNDGTSGSGRFGVVYGSAAAPALTFNNNPFDPTNNPGNDPAKWSDANTGMFHPADDAIGFSTAGTERIRINNSTLTLYQTTSTPASTPVLRFDNATNTNVGLSASSNIISFSSMNKVQVEFKNGESAFHGNIVVDGTSTLTGDTSASNITASGNLNIQGNTTLGDAASDTITVNGVSTFTANTNFNGTTNKFKNINLLANGIITLESTTNQSTIQLVSSDLKLSMGNYADVSIYDNGTIRTRFNRYGIQLPVLATIDNSVGVDGMIAYSNTERTSMQKVQGQWVPIGSGSVRTDAFTISSWVLSGNYYTLTVTASNIVTAEIQEQVSPGVYTRVEVDSVTFNGTNVVFSIPSTPDVRFDGRTLVTIR
Physico‐chemical
properties
protein length:654 AA
molecular weight: 70163,02970 Da
isoelectric point:4,91603
aromaticity:0,07951
hydropathy:-0,23624

Domains

Domains [InterPro]

No domain annotations available.

Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage 121Q
[NCBI]
1555202 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Escherichia coli
[NCBI]
562 Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AIT14459.1 [NCBI]
Genbank nucleotide accession
KM507819 [NCBI]
CDS location
range 279726 -> 281690
strand -
CDS
ATGAGTATAAATCGAATTAGAGGTGATGGTGGTGTAATTATTGATAGTAAATCTTTTTTAGAGTTACCGAAAGCACCATCAAAAGAAACCGCTGATGTAATAAGATCAGGGATGATTAGATACAATAAGGAATGGAAATCATTTGAAGGTGTTCTGGATTTTGAAGATGGTACTATGGCATATCGCCGCTTTGCAAACTTAGATGCAAATGGTAAACTTCTTACTTCCCAATTACCAGATTCAGTAACTAGTGGTTTACAGTTTGTTGGTACATACAATCCTATCCCAGATGATATTGATCCACCTTTAGCACAAACACCACTACCAGCACCAGCAGCATCACTAAATGGTGATTATTATGTTGTTCGTGGTATTATGGATGCCGCACAAGCCCATTATGAAGCAAATAATCCACAAACAAGTCCTGTTACATTTACACCTACAAATCCAACTGGACAAGGTAATTGGATTGAAATTTTATATTATGTTGACTCTAATCCAAATATATCTGGTGCTAAACTAGTAACATACGCTTTCGCTAGAATTATTCCTGCATCTATTCCTAGTACTGGTCATGATGGTCTTAAAAATTTAGCAACAGATGCTGACTTAACCAAACCATTTGCTTCAGGTGTTCCAATGAACCAAACCGCACTATCAGATGGTGATTGGATTATTATAACTGAAGATAAAAACATACGTCTACGCCAGTCAAGAACAAGCATTTCCGCTGCTTCAGTATTATATGACAATACTGTAATGATTGCAAATCATCGTCAATTTAGGACAAACGCAGGGACCGTTCAAACATCCATAGATAACGTTGTGATTGAGTGTTTACGTAGAACTGGTGATTCCATGTATAATGATGGAACAAGCGGCTCAGGGCGTTTTGGTGTGGTCTATGGTAGTGCAGCAGCACCAGCATTGACATTTAATAACAATCCATTTGATCCAACCAATAACCCAGGTAATGATCCTGCAAAATGGTCAGACGCTAATACTGGTATGTTTCATCCAGCAGATGATGCAATTGGTTTCAGTACTGCTGGAACTGAACGTATTAGAATTAATAATTCAACATTAACATTATACCAAACAACTTCAACCCCAGCATCAACCCCAGTACTAAGATTTGATAATGCAACCAATACTAATGTTGGACTAAGTGCTAGTAGTAATATTATTTCATTTAGTTCAATGAATAAAGTACAAGTTGAATTTAAAAATGGTGAAAGTGCGTTTCATGGTAATATTGTGGTTGATGGTACAAGTACATTGACTGGTGATACATCAGCTAGTAATATCACTGCTAGTGGTAATTTGAACATTCAAGGTAATACTACATTAGGCGATGCTGCATCTGATACAATTACAGTTAATGGAGTTAGTACGTTTACAGCAAACACTAATTTCAATGGAACAACTAATAAATTTAAAAATATAAATCTATTAGCTAATGGTATAATTACATTAGAAAGTACTACTAATCAAAGTACAATCCAATTAGTTAGTTCTGATTTAAAACTATCAATGGGTAATTATGCTGATGTAAGTATCTATGATAATGGAACTATAAGAACACGTTTTAATAGATATGGTATCCAATTACCAGTACTAGCTACTATTGATAACTCTGTTGGCGTTGATGGTATGATTGCATACTCCAATACCGAGCGTACTTCAATGCAAAAAGTTCAAGGTCAATGGGTTCCAATTGGTTCAGGTTCAGTGAGAACCGATGCTTTTACAATTTCATCTTGGGTATTAAGTGGTAACTACTACACCCTTACTGTAACAGCCTCAAACATCGTCACAGCAGAGATACAGGAACAGGTTAGTCCTGGTGTCTATACTCGTGTTGAGGTCGATAGCGTTACGTTTAACGGCACTAACGTAGTGTTTAGTATACCTTCAACTCCTGATGTACGTTTCGACGGAAGAACATTGGTAACAATTAGATAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
b855c20b3c251ce7ddd527f0f8e7f1bb43e05ca4012383cb234a0aaae5a8712a
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,5635
Evidence 0,5635

Literature

No literature entries available.