Genbank accession
UYL86012.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TSP
Evidence DepoScope
Probability 0,98
TSP
Evidence RBPdetect
Probability 0,66
TF
Evidence RBPdetect2
Probability 0,94
Protein sequence
MALDPNINRIKFLRSSTAGAKPTTAAIQPGEIAINLADRTLYSTDGNAIIDIGFGLGGSVNGPINATGQISTNDFLYSKYGFSVNSETADGRGISLYGNGYTASNGLPSYGLAMAATSKYGTFGAVSGSHATYLTTNSGTDRGWIFNYNGTTNVASISGTGIATFARVDAPLNGNANTATKLQSARNINGVLFDGTSDINTPAITDVVSFDNRTVKPSDVRNKAMGVYFTSKAGLNGAADTNYGDFLSLSTYQDGSGGKVNGLYFNKLTREILHYQTDLNSNSWGTPKTIAYTDSSITGNAASATRLQTARTINGTAFDGTANINVNATYSEFIPDGANLNDYKTPGLYYCPTDAGAATQLNLPFSNAYSLFVERHAGIKQTITQYATNKTFIRKFYNGYWDNWRQLAFLDPSDQKFTENITLEKATNAAINVKSTPGSYSELILSNGNKTASVSLTPDGSFILWDSTRQSSFATFTPDGVQSILNSKLLINTPASLNSTGQETFTVTGGESSPIRFKINRGGAITTNSPVTGVSSIPHAISFNWYNTEWQIGNVRDGSTGTVGFGITKFNDTLVWRHDGNTMTNYGNISNTGSISTQGDISSNGNISNTGSISTQGNISSNGNLTTAGSISGDSLITRTGRIGAPTSTYHYLDIGRDGTDITTVGQYGGAFRVVDTAIGKNTFSVDPNTAIFAGRIITKPGTFYQNITDLGNATAAITVPDTVAPKDVTGYVPFIHGSVQTNGSGYRTNVSIGALRGSNTWSSSGAYIAIGGNDNYTTEDFRFISGGYIGTSGGTLNILGTLNAPTLTSTTGSFTTVNTTNINAQGSIVMSAAPGGSGRSGMYTGNGDGASFSTCNIDIGSHWGLGFKDNLGNRNIIFDTRAGNASFKGSIRIGASFADETQLLPTSNQLQILTSGGQARNISTGGVLASDSYADFNKVPTNGIYSKGDIKTAVWMYASTFTGPTGSGDGRFDGNANTATRLQTARTFQITGGITTNAVSFDGQQNVVLTASNVDGSKVSGVVPEAVKAQTLAVTPQKNAKLVASWKGTILQSMTPTLTIVDANTLRVRLADDNPSNRLAVLRFNVKIGTVYHLAFSDTMLPINGTVTFVQTGLTWVEVDLNSPNHGLSGSGNGVNVMAITYSAYGCYFEGSISQIIGTKGPQDSQWAYVLKLNSPTTDATYNLSGSSQDAIWVADKNIWYLNPAQPVITAGAMISPDRLNFFAADTDSSARMRSNMVTAQIWDIV
Physico‐chemical
properties
protein length:1247 AA
molecular weight: 131429,60410 Da
isoelectric point:6,58141
aromaticity:0,09222
hydropathy:-0,21259

Domains

Domains [InterPro]
cd19958
STR
334–408
UYL86012.1
1 1247
Architecture
STR
STR
STR
STR
STR 1-80 | STR 83-731 | STR 820-899 | STR 996-1247
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
UYL86012.1
1 1247
Domain Start End Length (AA) Confidence
N-terminal 1 573 573 0,1623
Central domain 574 828 256 0,5898
C-terminal 829 1247 418 0,5744
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-573
Central
574-828
C-terminal
829-1247

Taxonomy

  Name Taxonomy ID Lineage
Phage Acinetobacter phage vB_AbaM_DP45
[NCBI]
2985295 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Acinetobacter baumannii
[NCBI]
470 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Moraxellales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
UYL86012.1 [NCBI]
Genbank nucleotide accession
OP585103.1 [NCBI]
CDS location
range 148836 -> 152579
strand +
CDS
ATGGCATTAGATCCAAATATTAACAGAATTAAATTTTTACGATCTTCTACTGCTGGAGCTAAACCTACTACCGCAGCGATTCAACCAGGCGAAATTGCTATCAATTTGGCAGATAGAACTCTTTACTCAACCGATGGTAATGCTATTATTGATATCGGTTTTGGCTTGGGTGGAAGTGTCAATGGGCCAATAAATGCAACAGGGCAAATTAGTACCAATGACTTTTTATATTCAAAGTATGGATTTTCTGTAAATTCAGAAACTGCAGATGGACGCGGTATTTCATTATATGGTAATGGATATACCGCATCAAACGGTCTGCCGTCATACGGTCTTGCGATGGCTGCTACAAGTAAATATGGTACATTTGGTGCTGTGTCAGGTTCGCACGCAACTTACCTTACTACAAACTCAGGTACTGACCGTGGTTGGATTTTTAACTATAACGGAACTACCAATGTAGCTTCAATTTCGGGAACTGGCATTGCGACATTTGCTCGTGTAGATGCTCCGTTAAACGGTAATGCTAATACAGCAACTAAACTTCAATCAGCTAGAAACATTAATGGCGTGTTGTTCGACGGCACATCAGATATTAACACACCAGCTATCACTGACGTCGTTTCATTTGATAATAGAACTGTTAAACCTTCTGATGTTAGAAATAAAGCCATGGGTGTTTATTTTACTTCAAAAGCAGGTTTAAATGGCGCGGCTGATACCAATTATGGTGATTTTTTATCTCTAAGCACTTATCAAGATGGCTCCGGAGGAAAGGTAAACGGTTTATATTTCAATAAATTAACTCGAGAAATATTACATTACCAGACAGATTTAAATTCAAATTCGTGGGGAACTCCGAAAACGATTGCGTATACAGATAGTTCTATTACCGGCAATGCTGCATCAGCAACACGCTTACAGACAGCAAGAACTATCAATGGTACTGCATTTGATGGTACTGCGAATATCAATGTCAATGCTACATATTCGGAATTTATTCCTGATGGTGCGAATTTAAATGATTATAAAACTCCAGGGCTATATTATTGTCCTACTGATGCTGGTGCAGCGACTCAATTAAATTTACCGTTTAGTAATGCGTATTCGTTGTTTGTTGAGAGACATGCTGGAATTAAACAGACTATTACCCAATATGCGACAAATAAAACCTTTATTCGTAAATTTTATAATGGCTATTGGGATAATTGGCGTCAATTAGCTTTCTTAGACCCAAGTGATCAGAAATTTACAGAAAATATCACTTTGGAAAAAGCTACTAATGCAGCAATCAACGTTAAGAGTACTCCCGGCAGTTATTCTGAACTTATTTTAAGCAATGGAAATAAAACCGCTTCTGTGAGTCTTACGCCAGACGGTAGTTTTATATTATGGGATTCAACTAGACAGTCGTCATTCGCGACATTTACACCAGATGGTGTACAATCGATATTAAATTCTAAGTTGTTGATTAATACTCCAGCATCATTGAACTCTACTGGCCAAGAAACATTTACTGTTACTGGTGGAGAAAGCTCTCCGATTAGATTTAAAATTAACCGCGGTGGGGCAATCACTACTAACTCGCCTGTTACTGGTGTTTCTTCTATTCCCCATGCTATATCATTTAACTGGTATAATACAGAATGGCAAATAGGTAACGTTCGTGATGGTTCTACCGGTACCGTTGGTTTTGGTATTACTAAATTCAACGATACGTTAGTATGGCGCCATGATGGCAATACTATGACCAACTACGGTAACATTAGTAATACGGGTTCGATCAGTACGCAAGGAGATATTTCATCTAATGGTAACATTAGTAATACAGGTTCGATCAGTACGCAAGGAAATATTTCATCTAATGGTAACTTAACTACCGCAGGTTCAATCAGCGGAGATAGTTTAATCACTAGAACTGGTCGTATCGGTGCGCCAACATCTACGTATCACTATCTCGATATCGGTAGAGACGGTACAGATATTACTACGGTCGGTCAATATGGAGGTGCATTTAGAGTTGTTGATACAGCTATCGGCAAAAATACATTCAGTGTTGACCCAAATACGGCTATTTTCGCTGGAAGAATTATCACGAAGCCTGGTACATTTTATCAAAATATAACAGACTTGGGTAATGCGACTGCAGCTATTACAGTTCCAGATACTGTTGCTCCAAAAGATGTCACTGGATATGTTCCGTTCATCCATGGATCTGTCCAAACGAATGGCTCTGGTTATAGAACTAACGTATCTATTGGTGCTTTGAGAGGTTCTAATACTTGGTCATCTTCGGGTGCTTATATCGCTATCGGTGGAAACGATAACTATACGACAGAAGATTTTAGATTCATTTCTGGTGGTTATATTGGTACAAGTGGCGGTACTTTAAATATCTTAGGTACTTTAAATGCACCTACTTTAACATCTACAACTGGATCTTTCACTACCGTTAATACAACAAATATTAACGCCCAAGGCAGTATTGTGATGAGTGCTGCGCCTGGTGGATCTGGGCGAAGCGGTATGTATACAGGAAATGGTGACGGTGCGTCCTTCTCTACCTGTAATATAGATATAGGTTCTCATTGGGGCTTAGGATTCAAAGATAATTTAGGAAACAGAAATATTATCTTTGATACCCGTGCAGGTAATGCGTCGTTTAAAGGATCTATTAGAATCGGTGCATCTTTTGCTGATGAGACTCAATTATTACCTACATCAAACCAACTTCAAATATTAACTTCAGGCGGTCAAGCTAGAAATATTTCGACTGGTGGTGTATTGGCTTCTGATTCGTACGCTGATTTTAATAAGGTTCCGACAAATGGCATCTACTCAAAAGGTGATATTAAAACCGCCGTATGGATGTATGCATCTACATTCACTGGTCCTACAGGATCAGGCGACGGTCGTTTTGATGGTAACGCAAACACAGCAACTCGTTTGCAAACTGCAAGAACTTTCCAAATTACTGGTGGCATCACAACAAATGCAGTATCATTTGACGGACAACAAAACGTAGTGTTAACTGCAAGCAATGTAGATGGTTCTAAAGTATCTGGCGTGGTTCCTGAGGCAGTTAAAGCGCAAACTCTTGCAGTGACACCGCAAAAAAATGCGAAACTCGTTGCATCATGGAAGGGTACCATATTACAATCTATGACGCCTACACTAACTATTGTTGATGCAAATACACTTCGTGTTAGATTAGCAGACGATAATCCGAGTAACAGATTGGCAGTATTGAGATTTAATGTGAAAATCGGCACAGTGTATCATCTCGCATTTAGTGATACTATGTTGCCGATCAATGGCACAGTAACTTTCGTCCAAACTGGGTTAACATGGGTTGAAGTTGATCTTAATTCACCTAACCACGGGCTATCAGGTTCAGGCAATGGTGTAAATGTTATGGCGATAACATATTCTGCGTATGGTTGTTATTTTGAGGGCTCAATTAGCCAAATCATTGGCACAAAAGGACCACAAGACAGTCAATGGGCATATGTATTGAAGCTCAATTCTCCGACAACTGATGCTACATATAATCTTAGCGGATCTTCGCAAGATGCAATATGGGTTGCAGATAAGAATATTTGGTATCTTAATCCTGCGCAGCCGGTGATAACTGCAGGCGCTATGATTTCTCCTGATAGATTGAATTTCTTCGCTGCTGATACAGATTCTTCAGCAAGAATGCGTTCTAATATGGTAACTGCACAAATCTGGGACATTGTATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
4a19e6f79df55b9eff19fa31b94cc50ac4c8cdbd8d4004f535036cb2c9edf668
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,4629
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50