Protein
View in Explore- Genbank accession
- CAL9996644.1 [GenBank]
- Protein name
- tail fiber protein
- RBP type
-
TFTF
- Protein sequence
-
MNPEIQPLVENIEILNGARGSGLDRAVLLRDLVDLGLAGTVKTSGGKVRPVTGTPGTGTPGGGTSPTPDPTVEDPVAPTGVYASGGFSNILIAWDSPGYKGHAYANIYRSATDDYSQAVAISQTAANLYSDTVAVGATWYYWIRFVNKNDKEGPIQSTNGVKGTTSAAIGDILDQLKGEIDETFFTPVFNTRLDDTDKDIQQLFGDVTNIDNKFTDKFVQVEDQFVEFDGKFVDVNSIIDELKEAGEIIAEAAMSAAVGVEIESDTRRKITARIEEKQRTIITDQIAMAEDIKTVTATVDDNKAQISINNTALVELDKDTKEAVKTITERLDTQQSNIDDNKASITNQQQTLVSVQDSVNKNTGEIDATKQDVQAVTTRLDKQQSEIDDNKATITTQQQTIITIDGKVTGNADDIKDTADKVEVITQRVDTLTSEVGDNKAQISQTQQTLITIDGKVTDNEKEIGDNAKSIQVVTTRIDQQQSEIDSNKATITSQSQTIIKIDGEVQKNKADVEKAITAASNAQATADGKIDTFFQDSHPANASSGDIWFDTNDGNKQYIYHNGAWIVAQDTAIGDAILAAAGAQATADGKIETFYQPDAPTASAKGDLWVDTDDKNKLYRWSGSAWVDIHDQNIDEIKGDVEVITQRVDTMKSEVDNNTAEITDTKQTLVTIDGKVSDNANDIADANQSITTMTQRLESQQSEIDGNKASITSQAQTIVTIDGKVSQNASDIDTANKSIETITQKQDEQKSELDGAKAVIESNSQTIARIEGELGEDGDNSQEILAEAVMSSAIGVDNEGSTRRKVTARIEKQQRVIMTDQSAMAQELTIITASVADNKAEIKSVNTALVEFDKETEQALKVMTERLDTQKANIDDNTASITTQQQTIVTIDGKVQDNTEEVAKAIASASNAQATADGKIDTFFQDDEPATASEGDIWFDTNSGNKQYIYQSGSWVIAQDTEIGDAIKAAAGAQATADGKIETFYQTTPPTANAEGDLWIDTNNNDRLYRWNSLTWVDIQDKDIHKAIQDAASAQATADGKIDTFFQDGEPQAASEGDLWFDTDNGNKQHVYKNGAWIVAQDTAIGDAILAAATAQSTADGKITTFYVPDAPKAKAVGDLWVDTNDKNKLYRWSGSNWLDIQDGNINEIDGKVTVITERLDQLKSEVDGNTASITTNSQTIIEVNSKAEANESKINVVSQQLTTVESELGDTKSAVSTNSQTIAKMNADGTTAYEAQWGVKASVGDVQAGIGLVAKKNPDGTTTSQCTVLADQFSVGHVNTDGDDETIYPFIVTSEGVYIDTAYIKAATVQELVAGEVIADTVKASASITAPKIKGGTIEIGSNFSVDENGNATTNNIKGNNVHLTGYINATSGTFRGTVYATTGEFKGTVYATDGDFKGTVYANRIVGDVVTANTKKKSNSVGYFDRARVNKPTSKNRTLQFTVMVGLKAKGYRDQEGRFQPSTVEGRLKVTGTYGTRYSQIFSFSTNRSSEESRFFPVNVSIPIPANTTGTVNIYSEKTHSVGETSVVTSAPTTDGIWTAMLFTDGSDLS
- Physico‐chemical
properties -
protein length: 1553 AA molecular weight: 167230,35600 Da isoelectric point: 4,43983 aromaticity: 0,05795 hydropathy: -0,49047
Domains
Domains [InterPro]
IPR013783
STR
79–161
STR
79–161
DC_0308
STR
100–369
STR
100–369
SSF57997
STR
300–510
STR
300–510
DC_0308
STR
354–415
STR
354–415
Coil
Unmapped
366–386
Unmapped
366–386
1
1553
Architecture
STR 79-1091 | RBD 1092-1131 | STR 1132-1233 | RBD 1234-1469 |
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Taxonomy
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
CAL9996644.1
[NCBI]
Genbank nucleotide accession
OZ196028.1
[NCBI]
CDS location
range 14424 -> 19085
strand +
strand +
CDS
ATGAATCCTGAGATTCAGCCGCTTGTAGAAAATATAGAGATACTTAACGGCGCAAGGGGTTCAGGCTTAGATCGAGCCGTGCTACTGCGCGACCTTGTTGATCTCGGTTTAGCCGGTACTGTAAAAACGTCGGGCGGAAAGGTTAGGCCAGTTACCGGAACGCCTGGTACTGGCACGCCTGGCGGTGGTACATCACCAACACCAGACCCAACAGTAGAAGATCCCGTTGCACCTACTGGCGTTTATGCTTCTGGTGGTTTTAGTAACATTCTTATTGCTTGGGATTCCCCAGGGTACAAAGGTCACGCATACGCTAATATTTATCGATCGGCTACTGACGACTATTCACAAGCTGTAGCTATCAGCCAAACGGCTGCGAACCTTTACAGTGATACCGTTGCGGTAGGTGCAACGTGGTACTACTGGATTAGATTTGTAAACAAGAACGATAAAGAAGGTCCGATCCAATCGACTAACGGCGTTAAGGGTACAACCAGTGCCGCTATTGGCGATATATTAGATCAGCTAAAAGGTGAGATTGACGAAACGTTTTTCACGCCTGTATTCAACACTCGACTTGATGACACTGATAAAGACATACAGCAATTATTTGGTGACGTCACAAATATTGATAACAAGTTCACTGACAAGTTTGTACAGGTTGAAGATCAGTTTGTTGAGTTCGATGGTAAATTTGTAGACGTTAACTCAATCATTGATGAGCTTAAAGAGGCTGGCGAGATCATAGCTGAAGCGGCTATGAGTGCGGCGGTAGGCGTTGAGATTGAAAGTGATACACGACGCAAGATTACAGCTCGAATTGAAGAGAAGCAGCGCACTATCATCACTGATCAGATTGCTATGGCCGAAGATATCAAAACGGTTACAGCTACGGTTGACGATAACAAAGCTCAAATAAGCATTAACAACACTGCGCTGGTTGAGCTTGATAAGGACACCAAGGAAGCGGTTAAGACAATTACCGAGCGATTGGATACCCAACAATCAAACATTGACGACAACAAAGCGTCTATCACCAACCAGCAGCAGACACTTGTAAGCGTTCAGGACTCCGTAAACAAAAATACCGGGGAAATAGATGCTACTAAACAAGACGTTCAAGCAGTAACAACGAGGCTTGATAAACAACAGTCTGAGATTGACGATAACAAAGCGACGATCACCACGCAGCAGCAAACCATCATTACCATTGACGGTAAGGTTACTGGTAACGCTGACGACATCAAGGACACCGCCGATAAAGTTGAGGTGATTACACAGCGTGTTGATACGTTAACCTCTGAAGTTGGCGACAACAAAGCACAAATATCTCAAACGCAACAAACGTTAATCACCATTGACGGCAAGGTTACGGACAACGAAAAGGAAATAGGCGATAACGCTAAGTCCATTCAAGTGGTAACTACTCGAATTGATCAGCAGCAATCTGAAATTGATAGCAACAAGGCTACCATTACCAGCCAGTCTCAAACGATCATTAAGATTGATGGTGAAGTTCAGAAGAACAAAGCAGACGTTGAGAAAGCAATTACCGCCGCGTCTAATGCGCAAGCAACTGCAGACGGCAAGATCGATACGTTTTTCCAAGACTCACACCCTGCTAATGCGAGTAGTGGTGACATTTGGTTCGACACTAACGACGGCAATAAGCAATACATTTACCACAACGGTGCTTGGATTGTTGCGCAGGATACTGCTATCGGTGATGCGATTCTTGCGGCGGCAGGAGCACAAGCAACAGCAGACGGTAAGATCGAAACCTTCTACCAGCCAGATGCACCGACGGCAAGCGCTAAGGGCGACTTATGGGTAGACACTGACGATAAAAATAAACTTTATCGTTGGAGCGGATCAGCTTGGGTTGATATTCACGATCAGAACATTGATGAGATCAAAGGTGACGTTGAGGTAATAACGCAACGTGTCGACACAATGAAGTCAGAGGTTGATAACAACACCGCTGAAATCACAGACACAAAGCAAACCTTAGTAACTATTGACGGTAAGGTATCCGATAACGCCAATGACATTGCAGATGCAAATCAAAGCATCACAACAATGACGCAGCGCTTGGAAAGCCAGCAATCAGAAATAGATGGTAACAAGGCCAGCATTACCAGCCAGGCACAAACGATCGTTACTATTGACGGCAAGGTTAGCCAAAACGCTAGCGATATCGACACGGCTAATAAATCGATTGAAACTATCACGCAAAAGCAAGACGAACAGAAGTCCGAGCTTGATGGTGCGAAAGCTGTTATTGAATCTAACTCTCAAACTATCGCTAGAATTGAGGGTGAACTTGGTGAGGATGGAGACAACTCTCAAGAGATACTAGCTGAAGCTGTGATGAGCTCTGCTATTGGTGTTGATAACGAAGGCTCGACACGACGCAAAGTAACCGCACGAATTGAGAAGCAGCAGCGAGTGATCATGACCGATCAATCCGCTATGGCTCAAGAGTTGACGATAATCACGGCAAGCGTCGCAGATAACAAGGCAGAAATTAAAAGCGTAAACACTGCGCTAGTTGAGTTTGACAAAGAGACAGAGCAAGCGCTCAAGGTAATGACTGAGCGACTAGACACGCAAAAGGCAAACATTGACGATAATACGGCCTCTATCACTACTCAGCAGCAAACAATCGTAACCATTGACGGAAAGGTACAGGACAATACTGAAGAAGTAGCCAAGGCTATTGCTTCAGCTTCTAACGCTCAAGCAACAGCTGACGGTAAGATTGATACGTTTTTCCAAGATGACGAGCCAGCAACAGCAAGCGAAGGCGATATTTGGTTCGATACTAATAGCGGTAACAAACAGTATATTTATCAAAGCGGCTCTTGGGTAATCGCTCAAGACACTGAGATCGGAGACGCAATAAAAGCGGCTGCGGGCGCACAAGCTACAGCTGACGGCAAGATTGAAACGTTCTACCAAACGACACCACCGACGGCAAATGCTGAAGGTGATCTATGGATTGATACAAATAACAACGATCGCCTATATCGTTGGAACTCGCTTACCTGGGTTGATATTCAAGACAAGGATATTCACAAGGCCATTCAAGATGCAGCGAGCGCACAAGCTACAGCTGACGGCAAGATTGACACCTTCTTTCAAGATGGTGAGCCGCAAGCAGCAAGCGAAGGTGATCTTTGGTTTGATACTGATAACGGGAACAAGCAGCACGTTTATAAAAACGGCGCTTGGATTGTCGCTCAAGATACAGCTATCGGTGACGCAATCCTTGCAGCAGCTACGGCCCAATCAACTGCAGACGGGAAGATCACAACGTTCTATGTCCCAGATGCACCAAAAGCTAAAGCGGTTGGCGATCTTTGGGTAGACACCAATGACAAGAACAAGCTTTACCGTTGGAGCGGTAGCAACTGGTTAGATATTCAAGACGGCAATATTAACGAGATTGACGGAAAGGTTACGGTAATCACTGAAAGACTGGATCAGTTAAAATCAGAGGTTGACGGAAACACAGCAAGTATCACAACAAACAGTCAAACTATTATTGAGGTAAACAGCAAGGCAGAAGCTAACGAAAGCAAAATAAACGTAGTATCTCAGCAGCTAACAACAGTTGAGAGTGAGTTAGGTGATACTAAGTCGGCGGTTTCTACAAACAGCCAAACTATTGCCAAAATGAATGCAGACGGTACAACGGCATACGAAGCGCAATGGGGAGTTAAAGCAAGTGTTGGTGATGTTCAAGCGGGTATCGGTTTAGTTGCTAAGAAGAACCCGGACGGCACAACCACTTCACAATGTACTGTGCTGGCGGATCAGTTCTCAGTTGGCCATGTAAACACTGACGGGGATGACGAGACAATTTACCCGTTTATCGTTACGTCAGAAGGGGTTTACATTGATACTGCTTACATCAAGGCGGCTACTGTTCAGGAGCTTGTTGCTGGTGAAGTTATTGCTGATACCGTTAAAGCGTCTGCTTCAATCACTGCACCTAAAATAAAAGGCGGCACAATTGAGATAGGCAGTAACTTTAGCGTTGACGAAAACGGCAATGCTACAACGAACAACATTAAAGGCAATAACGTTCATCTTACTGGTTATATCAATGCAACATCCGGCACGTTCAGGGGGACTGTTTACGCGACCACAGGAGAGTTTAAAGGCACTGTTTATGCGACTGATGGTGATTTTAAAGGCACTGTATATGCTAATAGAATTGTCGGTGATGTAGTTACAGCGAATACAAAAAAGAAATCTAATAGTGTGGGGTATTTTGACAGAGCAAGGGTAAATAAACCGACAAGCAAAAATAGAACTTTGCAATTTACGGTTATGGTAGGGCTAAAGGCTAAGGGTTATAGAGATCAAGAGGGCAGATTCCAACCAAGTACCGTAGAGGGTAGGCTTAAAGTTACAGGCACCTACGGGACCAGATATTCTCAAATATTTAGCTTTTCCACAAATAGAAGTTCTGAGGAATCTAGATTTTTTCCTGTAAATGTCAGTATTCCTATTCCTGCAAACACAACCGGCACCGTTAATATATATTCAGAGAAAACACATAGCGTCGGGGAAACTAGCGTAGTGACTAGCGCTCCAACTACTGACGGTATTTGGACAGCAATGTTATTCACAGATGGCAGCGATCTATCATAG
Genome Context
Genome Context
Tertiary structure
PDB ID
9d89a3378f99505c0ca83deb0c78cdddc3abe4505247ce618488eb2adbe5b7db
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50