Genbank accession
YP_010107752.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,90
TF
Evidence RBPdetect2
Probability 0,96
Protein sequence
MSTKFKTVITTAGAAKLAAATVPGGKKVNLSAMAVGDGNGQLPVPDAGQTKLVHEVWRHALNKVSVDNKNKNYIVAELVVPPEVGGFWMRELGLYDDAGTLIAVSNMAESYKPELAEGSGRAQTCRMVIILSNVASVELSIDASTVMATQDYVDDKIAEHEQSRRHPDATLTEKGFTQLSSATNSTSEKQAATPKAVKAAYDNAEKRLQKDQNGGDIPDKDAFLDNVGVTSLTFMKNNGEMPLDADLNTFGPVKAYLGIWSKATSTNATLEKNFPEDNAVGVLEVFAAGNFAGTQRFTTRDGNVYIRRLANKWNGSDGPWGIWRHTQSATRPLSTTIDLNTLGAAEHLGLWRNSSSAIASYERNYPEEGGFAQGMLEILEGGNYGRTQRYTTRRGNMYVRCLAASWDASNPQWEPWLRVGHQSESRYYEGDLNVLTDPGIYSVTGKATNGPMLDTVGATLLGILEVIRRFDGVSVWQRYTTTGKSETTQGRTFERVYAGSKWTEWREVYNSFSLPLNLGIGGAVAKLSILDWQTYDFVPGSLITVRLDNMTNIPDGMDWGVIDGNLINIAVGPSDDSGTGRSMHVWRSTVSKANYRFFMVRISGNPGSRTITTRRVPIIDEAQTWGAKQTFSAGLSGELSGNAATATKLKTARKINNVSFDGTSDINLTPKNIGAFASGKTGDTVANDKAVGWNWSSGAYNATIGGASTLILHFNIGEGSCPAAQFRVNYKNGGIFYRSARDGYGFEADWSEFYTTTRKPTAGDVGALPLSGGQLNGALGIGTSSALGGNSIVLGDNDTGFKQNGDGNLDVYANSVHVMRFVSGSVQSNKTINITGRVNPSDYGNFDSRYVRDVRLGTRVVQTMQKGVMYEKAGHVITGLGIVGEVDGDDPAVFRPIQKYINGTWYNVAQV
Physico‐chemical
properties
protein length:911 AA
molecular weight: 98463,83950 Da
isoelectric point:6,86418
aromaticity:0,08891
hydropathy:-0,35104

Domains

Domains [InterPro]
IPR051934
Unmapped
1–259
DC_0032
ATT
1–110
DC_0032
ATT
103–579
YP_010107752.1
1 911
Architecture
ATT
RBD
ATT 1-579 | RBD 610-911
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Yersinia phage vB_YpM_22
[NCBI]
2736197 Uroviricota > Caudoviricetes > Peduoviridae > Peduovirus YPM22 >
Host Yersinia pestis EV76-CN
[NCBI]
665028 Pseudomonadota > Gammaproteobacteria > Enterobacterales > Yersiniaceae > Yersinia > Yersinia pestis

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_010107752.1 [NCBI]
Genbank nucleotide accession
NC_055844.1 [NCBI]
CDS location
range 15975 -> 18710
strand +
CDS
ATGAGCACGAAATTTAAAACCGTTATCACTACTGCCGGAGCCGCGAAGCTGGCAGCCGCCACTGTTCCCGGCGGGAAAAAAGTAAACCTGTCTGCAATGGCTGTGGGTGACGGTAATGGCCAATTGCCGGTGCCGGATGCCGGTCAGACGAAACTGGTGCATGAAGTCTGGCGTCATGCCCTGAATAAAGTCAGTGTGGATAACAAGAATAAAAACTATATCGTGGCTGAACTGGTTGTTCCGCCAGAAGTGGGCGGCTTCTGGATGCGTGAGCTTGGTCTGTATGACGATGCCGGAACACTGATTGCGGTATCCAACATGGCAGAAAGCTATAAGCCAGAACTGGCTGAAGGCTCCGGACGTGCGCAGACCTGCCGCATGGTTATTATTCTCAGCAACGTGGCGTCCGTTGAGCTGAGTATTGATGCCAGCACAGTGATGGCGACGCAGGATTACGTCGATGACAAAATCGCAGAGCATGAGCAGTCCCGCCGCCATCCTGACGCCACGCTGACAGAAAAAGGTTTTACTCAGTTAAGCAGTGCAACAAACAGCACCAGTGAAAAGCAGGCTGCAACGCCAAAGGCAGTAAAAGCAGCCTATGACAATGCTGAGAAACGTCTGCAGAAAGACCAGAACGGTGGCGATATTCCAGATAAGGACGCTTTTCTGGACAATGTTGGCGTTACCAGCCTGACGTTTATGAAAAACAATGGCGAAATGCCGCTTGATGCTGATCTGAATACATTTGGTCCCGTTAAGGCTTATCTGGGAATCTGGTCTAAAGCTACCTCAACTAACGCAACACTGGAGAAAAATTTCCCGGAAGATAATGCTGTCGGTGTGCTTGAGGTTTTTGCTGCCGGCAATTTTGCAGGTACGCAACGCTTCACCACGAGAGACGGCAATGTATACATACGCAGACTCGCCAATAAGTGGAATGGCTCTGATGGTCCGTGGGGCATATGGCGTCACACTCAATCAGCTACCCGCCCTTTGAGTACGACTATAGACCTGAATACGCTTGGAGCCGCCGAACATCTTGGTTTATGGCGTAACAGTAGCTCGGCTATAGCTTCATATGAACGCAATTATCCAGAGGAAGGCGGCTTTGCTCAGGGGATGCTTGAGATCCTCGAAGGCGGAAATTATGGAAGAACGCAACGTTATACCACTCGCCGTGGAAATATGTACGTCCGCTGCCTTGCGGCAAGCTGGGATGCATCAAATCCGCAGTGGGAACCGTGGTTAAGAGTCGGTCATCAGTCAGAGAGTCGTTATTACGAAGGTGATTTAAATGTTCTAACCGACCCCGGTATTTACAGTGTTACAGGAAAGGCAACAAACGGTCCGATGCTGGACACCGTTGGCGCGACACTACTTGGGATACTGGAAGTAATCAGACGTTTTGATGGTGTGTCTGTCTGGCAGCGTTACACAACCACAGGGAAATCAGAAACCACACAGGGACGCACTTTTGAGCGCGTCTACGCCGGGAGCAAATGGACCGAATGGCGAGAAGTATATAACTCCTTTTCGTTGCCTCTGAATCTGGGCATCGGTGGCGCAGTGGCAAAACTATCCATTCTGGACTGGCAGACCTACGATTTTGTGCCGGGCAGTCTGATAACCGTTCGGCTTGATAATATGACCAACATTCCCGACGGTATGGACTGGGGCGTCATTGATGGCAACCTGATAAACATCGCAGTTGGTCCGAGTGATGATTCCGGTACGGGGCGCTCAATGCATGTATGGCGCAGCACTGTAAGTAAAGCCAACTACCGCTTTTTTATGGTGCGCATTTCAGGAAATCCGGGAAGCCGCACGATCACAACAAGACGAGTACCAATCATTGACGAAGCCCAGACATGGGGCGCGAAACAGACATTCAGTGCTGGCCTTTCTGGTGAACTGTCCGGCAATGCGGCGACAGCAACAAAGCTGAAAACAGCCCGTAAAATTAATAACGTTTCGTTTGATGGAACATCAGATATTAACCTGACGCCGAAAAATATTGGTGCATTTGCTTCAGGAAAAACAGGAGACACCGTTGCGAATGATAAAGCCGTTGGATGGAACTGGAGTAGCGGAGCCTATAACGCAACTATTGGTGGGGCATCAACGTTAATTCTTCATTTTAATATCGGTGAAGGAAGTTGTCCCGCCGCCCAGTTTCGCGTTAATTATAAGAACGGTGGTATTTTTTATCGTTCTGCTCGTGACGGTTACGGATTCGAGGCTGACTGGTCTGAGTTTTATACCACAACGCGAAAACCTACAGCGGGAGATGTCGGTGCACTGCCGTTATCTGGTGGTCAATTGAATGGTGCTCTGGGTATAGGAACATCCAGTGCTCTTGGCGGTAATTCGATTGTTTTGGGTGATAATGACACGGGCTTTAAACAAAATGGTGATGGTAATCTGGATGTTTATGCTAATAGCGTCCATGTTATGCGCTTTGTCTCCGGAAGCGTTCAAAGTAATAAAACCATAAATATTACGGGGCGTGTTAATCCCTCGGATTACGGTAACTTTGATTCCCGCTATGTGAGAGATGTCAGACTTGGCACACGTGTTGTCCAGACCATGCAGAAAGGGGTGATGTATGAGAAAGCAGGGCACGTAATTACCGGGCTTGGTATTGTCGGTGAAGTCGATGGTGATGACCCCGCAGTATTCAGACCAATACAAAAATACATCAATGGCACATGGTATAACGTCGCACAGGTGTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
0919d91d97129afd4015fa9188eaef875ded83ce441386fbb913b9d9956656ad
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7344
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50