Protein
View in Explore- UniProt accession
- A0A2R2YB20 [UniProt]
- Protein name
- Tail fiber protein
- RBP type
-
TFTFTFTF
- Protein sequence
-
MADISKIDMTDIWATSGDIVAPDSAKIQAGWGVEVVPRQWWNWFENRQDNNIAYILQKGFPEWDATTQYIINKSYVQRNGIVYRATETTTGVDPTGLVSWVRAFGDYSASSNALGGLTPAANNIPYFTSGTAAGQFPSTAYGRGVSNLANIAAALTYFGAQASNSNLTALSALTASANQLPYFNGGTTMTTTTLTSFGRSLIDDIDAASARATLEVDSASTTADNLAAGLATKQPLNSSLTILATLTPAANKLPYFTGAGAVTTTDLTPFGRSLIDDADASAARTTLGVLSSAETATNLQAGLDTKQPLASNLTAWANLTPVANTLFYWTSGTGVASTSLTSFARTLLGQADALSVRTTIGADNATNLTSGTIPLARIPTALTGVNAETATRLATPRTIQGVAFDGTANISLPVVPRDSATGAATMPAGATSARPASPVVGMMRYNSDNQTFEGYQGGQWATVGGAGLPVGALVPWNVSEASIPFGWLPRSGGLYNRADYPDLWTLIQSLVVSDADWISTPANRGKYSNGDGTTTFRMPDDNGKYDSNGFGAVTLRGHGKNSAGSVGLHQQDQLQNITGSMLSSSAALINISDATGALAANTTSVGARPSPVSAAGYVWTFDASRVARAGTETRMTNTTVIWCTVAAGKVNNIGNIDINVMSTTVNTHTTQIAALQTSKPTGSSAQLSTAWVNFDGTNGTIRGGYNVSSVTRTGVGSYRIFFTVPMTDVNYVPMFSANALASTNQSNQCYPVALQLTYVDVVNRVGDTLVDRAYCFLNVFGGR
- Physico‐chemical
properties -
protein length: 783 AA molecular weight: 81799,97630 Da isoelectric point: 5,28792 aromaticity: 0,08301 hydropathy: -0,05006
Domains
Domains [InterPro]
DC_0043
STR
1–228
STR
1–228
DC_0043
STR
217–303
STR
217–303
SSF88874
STR
467–644
STR
467–644
1
783
Architecture
STR 1-303 | STR 456-466 | ATT 467-544 | STR 545-781 |
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Pseudomonas phage PPSC2 [NCBI] |
2041350 | Uroviricota > Caudoviricetes > Vandenendeviridae > Shenlongvirus > Shenlongvirus PPSC2 |
| Host |
Pseudomonas fluorescens [NCBI] |
294 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Pseudomonadales |
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
ATN92789.1
[NCBI]
Genbank nucleotide accession
MF893340
[NCBI]
CDS location
range 19497 -> 21848
strand +
strand +
CDS
ATGGCTGATATTAGCAAAATTGACATGACAGATATTTGGGCCACTTCCGGTGACATTGTAGCTCCAGATTCTGCAAAGATCCAAGCTGGATGGGGCGTTGAAGTTGTTCCTCGTCAGTGGTGGAACTGGTTTGAAAACCGTCAAGACAACAACATTGCATACATCCTGCAAAAAGGCTTCCCTGAGTGGGATGCAACAACTCAGTACATCATCAACAAAAGTTATGTACAGCGTAATGGGATTGTGTACAGGGCTACAGAGACAACAACAGGAGTTGACCCTACTGGACTTGTTAGCTGGGTACGTGCCTTCGGTGACTACTCCGCGTCCTCTAACGCTTTGGGTGGACTTACACCAGCAGCAAACAATATCCCATACTTCACTTCTGGCACGGCGGCTGGTCAATTTCCGTCCACTGCCTACGGTCGTGGGGTATCCAATCTAGCTAACATTGCCGCTGCTTTGACATACTTCGGGGCACAGGCCAGCAACAGCAACCTGACAGCCCTGTCTGCACTTACCGCATCCGCAAACCAACTTCCGTACTTCAACGGTGGGACTACGATGACAACGACAACGTTGACATCTTTTGGTCGTAGTTTGATTGACGATATCGATGCAGCTTCTGCAAGAGCAACACTTGAGGTTGATAGTGCATCCACAACTGCTGATAATCTGGCAGCTGGACTTGCAACAAAACAACCTCTAAACTCTTCCCTGACAATCCTTGCGACTCTGACTCCAGCTGCTAATAAGCTGCCGTATTTCACTGGTGCTGGTGCTGTTACAACAACAGACCTTACACCTTTTGGCCGTAGCCTTATTGATGACGCAGACGCCAGTGCAGCAAGAACAACACTTGGGGTCTTGTCTTCCGCAGAAACAGCAACAAACTTGCAAGCTGGCTTGGATACTAAGCAACCTCTTGCGTCCAACCTAACAGCTTGGGCTAACCTGACTCCAGTGGCTAACACACTGTTTTACTGGACAAGTGGAACTGGTGTGGCATCTACTTCTCTCACATCCTTCGCCAGAACCCTTCTAGGTCAGGCAGATGCTTTGAGTGTTAGAACTACAATCGGAGCTGATAACGCAACCAACTTGACCTCTGGAACTATTCCTCTGGCAAGGATTCCTACTGCTCTGACAGGCGTTAACGCTGAAACAGCGACACGCTTGGCAACTCCTCGTACAATCCAAGGAGTGGCGTTCGACGGTACAGCGAACATCAGTCTTCCAGTCGTTCCTCGCGATAGTGCAACTGGTGCGGCCACAATGCCAGCTGGGGCTACATCTGCTAGACCAGCAAGTCCTGTTGTTGGTATGATGCGTTACAACAGTGATAACCAGACCTTCGAGGGTTACCAAGGTGGTCAGTGGGCAACAGTTGGCGGTGCTGGTCTTCCAGTTGGGGCTTTGGTTCCTTGGAACGTCTCCGAGGCTTCCATTCCGTTTGGATGGTTGCCACGTAGTGGTGGACTATACAACCGTGCAGATTACCCAGACCTGTGGACTTTGATTCAATCGCTTGTTGTTTCTGATGCTGATTGGATTAGCACACCAGCAAACCGTGGTAAATATTCTAATGGTGACGGTACTACAACATTCCGTATGCCTGATGATAACGGTAAGTACGACTCTAACGGGTTCGGCGCTGTAACTCTCCGTGGTCATGGTAAGAACTCTGCTGGAAGTGTTGGACTACACCAACAAGACCAACTACAGAACATCACTGGTTCTATGCTTTCGTCTTCTGCAGCACTGATTAACATCTCTGACGCTACAGGCGCATTGGCTGCGAACACAACTTCGGTTGGCGCTCGTCCATCTCCAGTTTCTGCTGCGGGCTACGTGTGGACATTTGACGCATCTCGTGTTGCCCGTGCTGGTACTGAAACACGCATGACAAACACCACTGTAATTTGGTGTACTGTTGCTGCTGGTAAAGTGAACAACATTGGTAACATTGATATTAATGTTATGAGTACCACAGTAAACACACACACAACACAGATTGCAGCTCTGCAAACAAGTAAGCCAACAGGATCTTCCGCTCAGCTATCCACAGCTTGGGTAAACTTTGATGGAACTAATGGTACAATCAGAGGTGGCTACAACGTTAGCAGTGTGACGAGAACAGGCGTAGGCAGCTATCGTATCTTCTTTACTGTGCCTATGACTGACGTTAACTATGTTCCAATGTTCAGTGCCAACGCACTAGCTAGCACGAACCAATCCAACCAGTGCTACCCTGTTGCTTTGCAACTCACTTATGTTGACGTTGTTAACAGAGTTGGTGATACACTTGTGGATAGAGCATACTGCTTCCTTAACGTGTTTGGCGGAAGATGA
Genome Context
Genome Context
Tertiary structure
PDB ID
fe77af92f809bc943399ece10233a18a3a6e9940c30212c63f1dfb3e8e80725d
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50