Protein
- Genbank accession
- YP_002274206.1 [GenBank]
- Protein name
- tail fiber protein
- RBP type
- TF
- Protein sequence
MAVKISGVLKDGTGKPVENCTIQLKARRNSATVVVNTVASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMSEDDVRPEALRRFELMVEEVARHAEEAKKNAGEAETSARNAGISASQAEESAANADTSAGEASESARQAAESAASAKQSEEASSSSASAAAQKASESSQSAAEAELSRKTAESAAGNAARDATTATEKARESAESAQSAEQSRIAAEEAVNRIPTVVGPPGPKGEQGPAGPQGPKGDKGERGDTGPVGATGERGPKGETGAAGPVGATGPQGPKGDPGETQIRFRLGPGNIIETNSNGWFPDTDGALITGLTFLDPKDATRVQGFFQHLQVRFGDGPWQDVKGLDEVGSDTGRTGE
- Physico-chemical properties
  - protein length: 389 AA
  - molecular weight: 40000.87820 Da
  - isoelectric point: 4.68049
  - aromaticity: 0.03856
  - hydropathy: -0.65424
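Two of the listed properties can be recomputed from the sequence alone. The sketch below assumes that "aromaticity" means the fraction of F, W and Y residues and that "hydropathy" means the mean Kyte-Doolittle value (GRAVY); the record does not state which definitions it uses, and the function names are illustrative only.

```python
# Kyte-Doolittle hydropathy values per residue (standard published scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W or Y (assumed definition)."""
    seq = seq.upper()
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Toy input only: the first ten residues of the protein sequence above.
example = "MAVKISGVLK"
print(len(example), round(aromaticity(example), 5), round(gravy(example), 5))
```

Under the assumed definition, the full sequence above contains 15 aromatic residues out of 389, and 15/389 ≈ 0.03856 agrees with the listed aromaticity.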
Domains
Domains [InterPro]
| Accession | Category | Residues |
|---|---|---|
| IPR013609 | ATT | 1–133 |
| IPR013609 | ATT | 1–140 |
| IPR008969 | ATT | 4–83 |
| G3DSA:2.60.40.1120 | STR | 5–98 |

Sequence axis: residues 1–389
Architecture
ATT 1-140 | STR 141-312 | RBD 313-378
Legend: ATT, STR, RBD, CBM, LEC, ENZ, CHP, LNK, TAS, TTP, UNK, Unmapped
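The architecture string can be read as a list of labelled, 1-based inclusive residue ranges. As a minimal sketch (the parsing format and the helper names are assumptions, not part of the record), the following splits it into segments and reports which residues of the 389-residue protein are left unmapped:

```python
import re

PROTEIN_LENGTH = 389  # protein length listed in this record
ARCHITECTURE = "ATT 1-140 | STR 141-312 | RBD 313-378"


def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Return (label, start, end) tuples with 1-based inclusive coordinates."""
    return [(m[1], int(m[2]), int(m[3]))
            for m in re.finditer(r"([A-Z]+)\s+(\d+)-(\d+)", text)]


def unmapped_ranges(segments, length):
    """Return the 1-based inclusive ranges not covered by any segment."""
    gaps, cursor = [], 1
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start > cursor:
            gaps.append((cursor, start - 1))
        cursor = max(cursor, end + 1)
    if cursor <= length:
        gaps.append((cursor, length))
    return gaps


segments = parse_architecture(ARCHITECTURE)
for label, start, end in segments:
    print(label, start, end, end - start + 1)  # label, range, segment length
print("unmapped:", unmapped_ranges(segments, PROTEIN_LENGTH))
```

With the values from this record the three segments cover residues 1 to 378, leaving the C-terminal residues 379 to 389 unmapped.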
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Enterobacteria phage YYZ-2008 [NCBI] | 564886 | Uroviricota > Caudoviricetes > Pankowvirus > |
| Host | Escherichia coli O157:H7 [NCBI] | 83334 | Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia |
Coding sequence (CDS)
Genbank protein accession: YP_002274206.1 [NCBI]
Genbank nucleotide accession: NC_011356.1 [NCBI]
CDS location: range 49546 -> 50715, strand +
CDS
ATGGCAGTAAAGATTTCAGGTGTACTGAAAGACGGCACAGGAAAACCGGTAGAGAACTGCACCATTCAACTGAAAGCCAGACGTAACAGCGCCACGGTGGTGGTGAACACGGTGGCCTCTGAAAATCCGGATGAAGCCGGTCGTTACAGCATGGACGTTGAGTACGGTCAGTACAGCGTCATTCTGTTGGTGGAGGGCTTCCCGCCGTCACATGCCGGGACCATCACCGTGTATGAAGATTCTCAACCCGGTACGCTGAATGATTTTCTCGGTGCCATGTCGGAGGATGACGTCCGGCCGGAGGCACTGCGCCGTTTTGAACTGATGGTGGAAGAGGTGGCGCGTCACGCTGAGGAGGCGAAGAAGAATGCCGGAGAGGCGGAGACGTCCGCGAGGAATGCCGGCATATCAGCCAGTCAGGCAGAAGAGAGCGCTGCAAATGCTGACACTTCAGCAGGGGAGGCATCGGAGTCAGCCCGGCAGGCGGCAGAAAGTGCAGCCTCAGCAAAGCAGTCAGAGGAGGCGTCCTCGTCCTCGGCTTCTGCGGCCGCTCAAAAAGCCAGTGAGTCATCACAAAGTGCAGCAGAAGCTGAATTGTCAAGAAAGACGGCAGAAAGTGCAGCCGGTAATGCAGCCAGGGATGCAACGACCGCAACAGAAAAAGCCCGGGAGTCAGCAGAAAGCGCACAGTCAGCGGAACAAAGCAGGATAGCGGCGGAAGAAGCCGTAAACAGAATCCCCACCGTGGTGGGACCTCCCGGGCCAAAGGGGGAACAGGGGCCCGCGGGTCCTCAGGGGCCGAAGGGAGATAAAGGAGAGCGTGGAGACACCGGCCCTGTCGGGGCAACCGGTGAACGGGGACCGAAAGGAGAAACAGGTGCGGCTGGCCCGGTGGGGGCAACCGGACCTCAGGGACCGAAGGGCGACCCGGGGGAGACACAAATACGGTTCCGTCTGGGGCCGGGAAACATTATTGAGACAAACAGCAATGGCTGGTTCCCGGATACAGATGGCGCACTCATCACCGGACTGACCTTTCTTGACCCCAAAGATGCCACACGGGTTCAGGGTTTTTTTCAGCATTTGCAGGTCAGGTTTGGTGACGGGCCGTGGCAGGATGTTAAGGGGCTGGATGAAGTGGGCAGTGATACAGGCAGAACAGGAGAATGA
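The CDS coordinates and the protein length can be cross-checked: an inclusive range of 49546 to 50715 spans 1170 nucleotides, or 390 codons, which is 389 residues plus a stop codon. The sketch below also translates a CDS on the + strand. It assumes the standard codon assignments (the record does not name a translation table), and `cds_length` and `translate` are illustrative helpers.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code in TCAG codon order ("*" = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based inclusive coordinate range."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a + strand CDS, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


n_nt = cds_length(49546, 50715)
print(n_nt, n_nt // 3 - 1)  # nucleotides, residues excluding the stop codon

# First ten codons of the CDS above.
print(translate("ATGGCAGTAAAGATTTCAGGTGTACTGAAA"))
```

Translating the first ten codons reproduces the start of the protein sequence listed in this record (MAVKISGVLK), and the final codons ACA GGA GAA TGA give the C-terminal TGE followed by a stop.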
Genome Context
Tertiary structure
PDB ID: af232b6e11f5feee3630436709f82a8a30c798244225e5e86bd4b09136e7c228
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
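The confidence bands above map a per-residue pLDDT score to a label. A minimal sketch of that mapping follows. The legend leaves the exact boundary values (50, 70, 90) unassigned, so placing them in the lower band is an assumption, and `plddt_band` is a hypothetical helper name.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence label used in the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected to lie in [0, 100]")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"   # assumption: exactly 90 falls here
    if plddt > 50:
        return "Low"    # assumption: exactly 70 falls here
    return "Very low"   # assumption: exactly 50 falls here


for score in (95.2, 81.0, 63.5, 42.0):
    print(score, plddt_band(score))
```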