UniProt accession
A0A2R2YB20 [UniProt]
Protein name
Tail fiber protein
RBP type
TF
Evidence UniProt/TrEMBL
Probability 1,00
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,91
TF
Evidence RBPdetect2
Probability 0,80
Protein sequence
MADISKIDMTDIWATSGDIVAPDSAKIQAGWGVEVVPRQWWNWFENRQDNNIAYILQKGFPEWDATTQYIINKSYVQRNGIVYRATETTTGVDPTGLVSWVRAFGDYSASSNALGGLTPAANNIPYFTSGTAAGQFPSTAYGRGVSNLANIAAALTYFGAQASNSNLTALSALTASANQLPYFNGGTTMTTTTLTSFGRSLIDDIDAASARATLEVDSASTTADNLAAGLATKQPLNSSLTILATLTPAANKLPYFTGAGAVTTTDLTPFGRSLIDDADASAARTTLGVLSSAETATNLQAGLDTKQPLASNLTAWANLTPVANTLFYWTSGTGVASTSLTSFARTLLGQADALSVRTTIGADNATNLTSGTIPLARIPTALTGVNAETATRLATPRTIQGVAFDGTANISLPVVPRDSATGAATMPAGATSARPASPVVGMMRYNSDNQTFEGYQGGQWATVGGAGLPVGALVPWNVSEASIPFGWLPRSGGLYNRADYPDLWTLIQSLVVSDADWISTPANRGKYSNGDGTTTFRMPDDNGKYDSNGFGAVTLRGHGKNSAGSVGLHQQDQLQNITGSMLSSSAALINISDATGALAANTTSVGARPSPVSAAGYVWTFDASRVARAGTETRMTNTTVIWCTVAAGKVNNIGNIDINVMSTTVNTHTTQIAALQTSKPTGSSAQLSTAWVNFDGTNGTIRGGYNVSSVTRTGVGSYRIFFTVPMTDVNYVPMFSANALASTNQSNQCYPVALQLTYVDVVNRVGDTLVDRAYCFLNVFGGR
Physico‐chemical
properties
protein length:783 AA
molecular weight: 81799,97630 Da
isoelectric point:5,28792
aromaticity:0,08301
hydropathy:-0,05006

Domains

Domains [InterPro]
DC_0043
STR
1–228
DC_0043
STR
217–303
SSF88874
STR
467–644
A0A2R2YB20
1 783
Architecture
STR
STR
ATT
STR
STR 1-303 | STR 456-466 | ATT 467-544 | STR 545-781 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Pseudomonas phage PPSC2
[NCBI]
2041350 Uroviricota > Caudoviricetes > Vandenendeviridae > Shenlongvirus > Shenlongvirus PPSC2
Host Pseudomonas fluorescens
[NCBI]
294 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Pseudomonadales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
ATN92789.1 [NCBI]
Genbank nucleotide accession
MF893340 [NCBI]
CDS location
range 19497 -> 21848
strand +
CDS
ATGGCTGATATTAGCAAAATTGACATGACAGATATTTGGGCCACTTCCGGTGACATTGTAGCTCCAGATTCTGCAAAGATCCAAGCTGGATGGGGCGTTGAAGTTGTTCCTCGTCAGTGGTGGAACTGGTTTGAAAACCGTCAAGACAACAACATTGCATACATCCTGCAAAAAGGCTTCCCTGAGTGGGATGCAACAACTCAGTACATCATCAACAAAAGTTATGTACAGCGTAATGGGATTGTGTACAGGGCTACAGAGACAACAACAGGAGTTGACCCTACTGGACTTGTTAGCTGGGTACGTGCCTTCGGTGACTACTCCGCGTCCTCTAACGCTTTGGGTGGACTTACACCAGCAGCAAACAATATCCCATACTTCACTTCTGGCACGGCGGCTGGTCAATTTCCGTCCACTGCCTACGGTCGTGGGGTATCCAATCTAGCTAACATTGCCGCTGCTTTGACATACTTCGGGGCACAGGCCAGCAACAGCAACCTGACAGCCCTGTCTGCACTTACCGCATCCGCAAACCAACTTCCGTACTTCAACGGTGGGACTACGATGACAACGACAACGTTGACATCTTTTGGTCGTAGTTTGATTGACGATATCGATGCAGCTTCTGCAAGAGCAACACTTGAGGTTGATAGTGCATCCACAACTGCTGATAATCTGGCAGCTGGACTTGCAACAAAACAACCTCTAAACTCTTCCCTGACAATCCTTGCGACTCTGACTCCAGCTGCTAATAAGCTGCCGTATTTCACTGGTGCTGGTGCTGTTACAACAACAGACCTTACACCTTTTGGCCGTAGCCTTATTGATGACGCAGACGCCAGTGCAGCAAGAACAACACTTGGGGTCTTGTCTTCCGCAGAAACAGCAACAAACTTGCAAGCTGGCTTGGATACTAAGCAACCTCTTGCGTCCAACCTAACAGCTTGGGCTAACCTGACTCCAGTGGCTAACACACTGTTTTACTGGACAAGTGGAACTGGTGTGGCATCTACTTCTCTCACATCCTTCGCCAGAACCCTTCTAGGTCAGGCAGATGCTTTGAGTGTTAGAACTACAATCGGAGCTGATAACGCAACCAACTTGACCTCTGGAACTATTCCTCTGGCAAGGATTCCTACTGCTCTGACAGGCGTTAACGCTGAAACAGCGACACGCTTGGCAACTCCTCGTACAATCCAAGGAGTGGCGTTCGACGGTACAGCGAACATCAGTCTTCCAGTCGTTCCTCGCGATAGTGCAACTGGTGCGGCCACAATGCCAGCTGGGGCTACATCTGCTAGACCAGCAAGTCCTGTTGTTGGTATGATGCGTTACAACAGTGATAACCAGACCTTCGAGGGTTACCAAGGTGGTCAGTGGGCAACAGTTGGCGGTGCTGGTCTTCCAGTTGGGGCTTTGGTTCCTTGGAACGTCTCCGAGGCTTCCATTCCGTTTGGATGGTTGCCACGTAGTGGTGGACTATACAACCGTGCAGATTACCCAGACCTGTGGACTTTGATTCAATCGCTTGTTGTTTCTGATGCTGATTGGATTAGCACACCAGCAAACCGTGGTAAATATTCTAATGGTGACGGTACTACAACATTCCGTATGCCTGATGATAACGGTAAGTACGACTCTAACGGGTTCGGCGCTGTAACTCTCCGTGGTCATGGTAAGAACTCTGCTGGAAGTGTTGGACTACACCAACAAGACCAACTACAGAACATCACTGGTTCTATGCTTTCGTCTTCTGCAGCACTGATTAACATCTCTGACGCTACAGGCGCATTGGCTGCGAACACAACTTCGGTTGGCGCTCGTCCATCTCCAGTTTCTGCTGCGGGCTACGTGTGGACATTTGACGCATCTCGTGTTGCCCGTGCTGGTACTGAAACACGCATGACAAACACCACTGTAATTTGGTGTACTGTTGCTGCTGGTAAAGTGAACAACATTGGTAACATTGATATTAATGTTATGAGTACCACAGTAAACACACACACAACACAGATTGCAGCTCTGCAAACAAGTAAGCCAACAGGATCTTCCGCTCAGCTATCCACAGCTTGGGTAAACTTTGATGGAACTAATGGTACAATCAGAGGTGGCTACAACGTTAGCAGTGTGACGAGAACAGGCGTAGGCAGCTATCGTATCTTCTTTACTGTGCCTATGACTGACGTTAACTATGTTCCAATGTTCAGTGCCAACGCACTAGCTAGCACGAACCAATCCAACCAGTGCTACCCTGTTGCTTTGCAACTCACTTATGTTGACGTTGTTAACAGAGTTGGTGATACACTTGTGGATAGAGCATACTGCTTCCTTAACGTGTTTGGCGGAAGATGA

Genome Context

Genome Context

Tertiary structure

PDB ID
fe77af92f809bc943399ece10233a18a3a6e9940c30212c63f1dfb3e8e80725d
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7518
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50