Protein
View in Explore- Genbank accession
- CAH0786508.1 [GenBank]
- Protein name
- tail fiber protein
- RBP type
-
TFTFTFTF
- Protein sequence
-
MSTKFRTVITTAGAAKLAAATAPGGRKVNITTMAVGDGGGKLPVPDAGQTGLIHEVWRHALNKISQDKRNSNYIIAELVIPPEVGGFWMRELGLYDDAGTLIAVANMAESYKPALAEGSGRSQTCRMVIIVSSVASVALTIDTTTVMATQDYVDDKIAEHEQSRRHPDASLTAKGFTQLSSATNSTSETLAATPKAVKAAYDLANGKYTAQDATTARKGLVQLSSATNSTSETLAATPKAVKTVMDETNKKAPLNSPALTGTPTTPTARQGTNNTQIANTAFVMAAIAALVDSSPDALNTLNELAAALGNDPNFATTMTNALAGKQPKDATLTALAGLATAADRFPYFTGNDVASLATLTKVGRDILAKSTVAAVIEYLGLQETVNRAGNAVQKNGDTLSGGLTFENDSILAWIRNTDWAKIGFKNDADSDTDSYMWFETGDNGNEYFKWRSRQSTTTKDLMNLKWDALYVLVKALFSSEVKISTVNALRIFNSSFGAIFRRSEENLYIIPTRENEGENGDIGPLRPFGINLRTGVVSVGNGARIDGGLALGTNNALGGNSIVLGDNDTGFKQNGDGNLDVYANNVHVMRFVSGSIQSNKTINITGRVNPSDYGNFDSRYVRDIRLGTRVVQTMQKGVMYEKAGHVITGLGIVGEVDGDDPAVFRPIQKYINGTWYNVAQV
- Physico‐chemical
properties -
protein length: 681 AA molecular weight: 72501,54960 Da isoelectric point: 6,37707 aromaticity: 0,06755 hydropathy: -0,21630
Domains
Domains [InterPro]
IPR051934
Unmapped
1–201
Unmapped
1–201
IPR022225
ATT
1–151
ATT
1–151
DC_1371
STR
1–405
STR
1–405
1
681
Architecture
ATT 1-151 | STR 152-404 | ATT 405-528 | RBD 620-681
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Escherichia phage P2_AC1 [NCBI] |
2881011 | Uroviricota > Caudoviricetes > Peduoviridae > Peduovirus AC1 > |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
CAH0786508.1
[NCBI]
Genbank nucleotide accession
OV192280.1
[NCBI]
CDS location
range 18581 -> 20626
strand -
strand -
CDS
ATGAGCACAAAATTCAGAACCGTTATCACCACTGCTGGTGCAGCAAAGCTGGCAGCGGCAACCGCGCCGGGAGGGCGGAAGGTCAACATTACCACGATGGCCGTCGGGGATGGCGGTGGTAAATTGCCTGTCCCGGATGCCGGACAGACCGGGCTTATCCACGAAGTCTGGCGACATGCGCTGAACAAAATCAGCCAGGACAAACGAAACAGTAATTATATTATCGCAGAGCTGGTTATTCCGCCGGAGGTGGGCGGTTTCTGGATGCGTGAGCTTGGCCTGTACGATGATGCTGGAACGTTAATTGCCGTGGCGAACATGGCTGAAAGTTATAAGCCAGCTCTTGCCGAAGGCTCAGGGCGTTCGCAGACCTGCCGCATGGTCATCATCGTCAGCAGTGTGGCCTCAGTGGCGCTGACCATTGACACCACAACGGTGATGGCAACGCAGGATTACGTTGATGACAAAATTGCAGAACATGAACAGTCACGACGTCACCCGGACGCCTCGCTGACCGCCAAAGGTTTTACTCAGTTAAGCAGTGCGACCAACAGCACGTCTGAAACACTCGCCGCAACACCAAAAGCGGTAAAAGCAGCATATGACCTTGCTAACGGGAAATACACTGCACAGGACGCCACCACAGCGCGAAAAGGCCTTGTCCAGCTCAGTAGTGCCACCAACAGCACGTCTGAAACGCTCGCCGCAACACCAAAAGCCGTTAAGACGGTAATGGATGAAACGAACAAGAAAGCGCCATTAAACAGCCCTGCACTGACCGGAACGCCAACGACGCCAACTGCGCGACAGGGAACGAATAATACTCAGATCGCAAACACGGCTTTCGTTATGGCCGCGATTGCCGCCCTTGTAGACTCGTCGCCTGACGCACTGAATACGCTGAACGAGCTGGCGGCGGCGCTGGGCAATGACCCGAATTTTGCTACCACCATGACTAATGCGCTTGCGGGTAAGCAACCGAAAGATGCCACTTTGACGGCGCTGGCGGGGCTTGCTACTGCGGCAGACAGGTTTCCGTATTTTACGGGGAATGATGTTGCCAGCCTGGCGACCCTGACAAAAGTCGGGCGGGATATTCTGGCTAAATCGACCGTTGCCGCCGTTATCGAATATCTCGGTTTACAGGAAACGGTAAACCGAGCCGGGAACGCCGTGCAAAAAAATGGCGATACCTTGTCCGGCGGGCTTACTTTTGAAAACGACTCAATCCTTGCCTGGATTCGAAATACTGACTGGGCAAAGATTGGATTTAAAAATGATGCCGACAGCGATACTGATTCATATATGTGGTTTGAAACAGGTGACAACGGCAATGAATATTTCAAATGGAGAAGTCGCCAGAGCACCACAACAAAAGACCTGATGAATCTTAAATGGGATGCTCTGTATGTTCTTGTTAAAGCCCTTTTCAGCAGTGAAGTAAAAATATCTACAGTCAATGCGCTGAGGATATTTAATTCATCTTTTGGTGCTATTTTTCGCCGTTCTGAAGAAAACCTGTATATCATTCCTACACGAGAAAATGAGGGTGAAAATGGAGATATTGGGCCATTAAGGCCATTCGGCATCAACTTAAGAACAGGAGTTGTGTCTGTTGGTAATGGTGCCAGGATTGATGGCGGGCTGGCACTTGGCACGAATAACGCGTTGGGTGGGAACTCTATTGTTCTTGGTGATAACGACACCGGATTTAAACAAAATGGCGATGGTAATCTGGATGTTTATGCTAATAACGTCCATGTTATGCGCTTTGTTTCCGGAAGCATTCAAAGTAATAAGACCATAAATATTACGGGGCGTGTTAATCCCTCGGATTACGGTAACTTTGATTCCCGCTATGTGAGAGATATCAGACTTGGCACACGTGTTGTCCAGACCATGCAGAAAGGGGTGATGTATGAGAAAGCAGGGCACGTAATTACCGGGCTTGGTATTGTCGGTGAAGTCGATGGTGATGACCCCGCAGTATTCAGGCCAATACAAAAATACATCAATGGCACATGGTATAACGTCGCACAGGTGTAA
Genome Context
Genome Context
Tertiary structure
PDB ID
fa3195fd5c7e209833706c20d808b9e724454a7a605a78374df6ebb2e5dfdc3a
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50