GenBank accession
QZA70943.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1.00
Protein sequence
MAENVKVDYEDLGDLDLDFDFDFSDDASQKKKGKGGAIREFAAGLWTGAKEEVFTKGTPQRIIRSLLPVSFSPAFDAKDRAARFKDDLYDKVKSNTRNSVNDIKDLTLDALSMYGGKLPDKVLKQIEVWAENKDIDYSVSSTKPSDPSLEDGLETSDSDDYVTMLQQSTLQASQLAVSLHKESMAQTAAVATKQMSLSADSLAFMQGIHRMTARTVGFNEQFTANYQRKSLELQYRSYKLQLSIGKMHERYYKRSLEAMSGLVKNTGLTDFEKMSHGASAREMIRKRALGGLGGGVSQFGNRLFDNVFNVVDQRSQDVNYRASGGIAAARAAMMGAQQARMMGRSLGARDYGQMAGQGLAAMLPMLMSTIARPALAKNDKLNRMGHDLSYYSNAAPGLINGWLRNRQSFDENYDPSAQGNKWYSRIQKRVVNPFLNNTLFQMPASMGNRTRLNNPGVKDLTEPAIYDQMSRRSLVEVIPGLLTKQLAQQTAIANKMGADVTGVKEVHFNHIQGGFTSQKRANVDLRSSIFNRNEFSSAAGSFNTMVDTLDPNNELSPNARVALAMRFAKDAEAGDGFNVNSYLKSSGWGTANKDAVQEITEFLHKRFDTREATGKAKAFGEFQIGDAPELAQMRNNLSNIMQAQSQYMPNVEQSLNTMANSGSRAQLKDMGVIKRVNGQDVFNHEMYWEMMQKFISNPNYRPDREGEDENERGSNVDPGAELGAMGREAMANFKRGVTDQFGNLRGKLTRDQEEAIRDQLAQARANGVDAYNVVKRQLTARYGDVAVGRIVRRLDNSFGQRFTDKINGFSVDGAAASAGNAARAAGQGASDLFGKIKGASLDTSGVKQALMDAASNGKEALRQATTELQRRFGRQAVEEAAEEIAKAGDVVKSSSDVLGSRVAAAVSRTRAGVSSAMDEATESASETAAIVQAQLDNNAILREVVGVLASVRDATAATKDATIAQVTGNPDILKDGITERASWLRGLGKRMGESKAGRIARFSRTVFKHLPNINTPVMTVAKWATIGPAVVGFKATRGLWNMLRDKKRGDAADPDHDGVRNNSVFDLLRRRKQQKLEQDEAKRAHNADDNGKEKEKPTTLFGLVAGLFSSVTGLVSGIKEFGILGGLAKFLGLGWVGDLIGGLGKILAGKKAIDAASDIMDDLGDGDEGDDRRSRRRGRRTGGRGGRGRGGLIRRLAGGTARGVKRLGAGIGRRTAGALRAGVGGNLKTLVKGGGIVTAGLSAFEAYSAYKEGDDAGAAEAVGSGVGGILGGAAMGAAIGSVVPVVGTAIGAVAGGAIGALGGGAIGRSLYGWFNDPGLLQQMRLRQYGVPDNDTDHVSAILKLEAALEPYVKTTDDGYASLDPKAPIAKLASGFVDDPNDRDQVESFAGWFLHRFKPVFLTHKAVAKQVLPSQSFMDLDKSTDEAAKYEIAKRAQQFDDSADHPYTFTGRVFPDLNATDRKQTEALVKDVVDRLRVKASKMTTGKSTSVFATGMSEVASKRELSGDMVKAINPTGDIPGLNTVGHKESSWFTGDRTVVSAGDILGGLLPKSGQPMDDLTALRMKVYGLPTLDVDRVSTLLQLELVMANKVKFTDSGATFDGKASDIYRLVAASFGLNPSSQWSYKSWEPWFTRRFLPAYLSFAGTVYTQTGDTRPTLAVPKMPPELKFAAATAMANAKYTDGVNAISVWTIKTSPWSTGEMNDDASIIQNHLNNLKAQVKQAQYSAEAVKGGVQQNKDGTTEKEWRKDSSGNMVNNQVKTSTGQVYTSQRQVTTYNPETGRIETAYGGGAGQGAGGASGGVDNTGKVGPIKLGPGAQEGARILIRQAVKAGITDKNEIAMLLAQTHLESGGFSKLEENLRYKAETLMKLWPNRFPSLAAAQQLATAGPVAIANAIYGGRMGNDKPGDGWKYRGRGFMQLTGKANYAAASKGMGVDLVSDPDKLSTDPEMAAKSALWYWKSRGGIEDAAKKGDLNTVTKLINGGTHGLAERGQLFKQYTD
LVGTGKFDDIISGKDTSAGAQTDDASTAQTSVQQGGPDSPALSGAKTAAELTAPGASAANAAAAAPVTPPSGQGDAPALKSTTGPSVSATVSAGSSTPNAKDSTASAIDNQASTTPPPGAATTPTLNTPPQMQPTQAQVPNAPQVVAATPAPAQPVMPKESVAALSGSKDHLSSIDDKMTSLVDVLRQFIEMQMQAMSKPAAATQQPASGTGAPAVDFRRKYGNQ
Physico-chemical properties
protein length: 2225 AA
molecular weight: 237638.49490 Da
isoelectric point: 9.23659
aromaticity: 0.06472
hydropathy: -0.42894
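
The aromaticity and hydropathy figures can be approximated from the sequence alone. The following is a minimal sketch, assuming the record uses the common definitions (aromaticity as the fraction of F, W and Y residues; hydropathy as the mean Kyte-Doolittle value, often called GRAVY). The page does not state which definitions it uses, and the function names are illustrative.

```python
# Kyte-Doolittle hydropathy scale for the 20 standard amino acids.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W or Y (assumed definition)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy per residue (assumed definition)."""
    return sum(KD[aa] for aa in seq) / len(seq)


# First 20 residues of the protein sequence above, used only as a short demo.
demo = "MAENVKVDYEDLGDLDLDFD"
print(len(demo), round(aromaticity(demo), 5), round(gravy(demo), 5))
```

Running the same two functions on the full 2225-residue sequence would show whether the assumed definitions reproduce the values listed above.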

Domains

Domains [InterPro]
Domain | Type | Residues
DC_0337 | STR | 1–1909
IPR052354 | Unmapped | 1806–1990
G3DSA:1.10.530.10 | RBD | 1824–2003
IPR023346 | STR | 1831–2005
cd00325 | ENZ | 1834–2010
IPR000726 | ENZ | 1908–1961
QZA70943.1: 1–2225
Architecture
STR 1-2010 | RBD 2011-2225
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
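
The architecture line can be read as a set of labelled residue ranges. The sketch below parses that string and checks that the segments cover the protein from residue 1 to 2225 without gaps or overlaps. The function names are hypothetical, and the format (`LABEL start-end` separated by `|`) is inferred from this one record.

```python
def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Split 'STR 1-2010 | RBD 2011-2225' into (label, start, end) tuples."""
    segments = []
    for part in text.split("|"):
        label, span = part.split()
        start, end = span.split("-")
        segments.append((label, int(start), int(end)))
    return segments


def tiles_protein(segments: list[tuple[str, int, int]], length: int) -> bool:
    """True if the segments cover residues 1..length with no gaps or overlaps."""
    expected = 1
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start != expected or end < start:
            return False
        expected = end + 1
    return expected == length + 1


segments = parse_architecture("STR 1-2010 | RBD 2011-2225")
print(segments, tiles_protein(segments, 2225))
```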

Taxonomy

Phage: Erwinia phage AH06 [NCBI]
Taxonomy ID: 2869570
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host: No host information

Coding sequence (CDS)

GenBank protein accession
QZA70943.1 [NCBI]
GenBank nucleotide accession
MZ501268.1 [NCBI]
CDS location
range 122424 -> 129101
strand -
CDS
ATGGCCGAAAATGTCAAAGTTGATTACGAAGACCTAGGTGATCTTGACCTGGACTTTGACTTCGATTTCAGTGACGACGCCAGTCAGAAGAAGAAGGGTAAGGGCGGGGCTATTCGTGAGTTCGCCGCGGGCCTGTGGACAGGCGCCAAGGAAGAAGTGTTCACCAAAGGCACACCTCAGCGCATCATCCGCTCTCTTCTACCGGTCTCGTTTTCGCCTGCGTTTGATGCAAAAGATCGCGCGGCTAGATTTAAAGATGACCTATACGATAAAGTTAAAAGTAATACGCGTAACTCAGTCAATGACATCAAAGACCTTACGCTTGATGCTTTATCGATGTATGGTGGCAAACTCCCAGATAAAGTTCTGAAACAAATTGAAGTCTGGGCTGAGAACAAAGACATTGATTATTCCGTGTCGAGCACCAAACCGAGTGACCCGAGTCTGGAAGACGGGCTTGAGACAAGTGACAGTGACGACTATGTCACTATGCTTCAGCAGTCTACTTTGCAAGCCAGTCAGTTGGCGGTCTCTTTACATAAAGAGAGCATGGCACAGACCGCAGCAGTCGCCACCAAGCAGATGAGTTTGTCTGCCGATAGCCTGGCCTTCATGCAAGGTATCCACCGCATGACTGCCCGTACGGTTGGTTTCAATGAACAGTTCACCGCAAACTACCAACGCAAGTCGTTAGAGCTGCAGTACCGTTCTTACAAGTTGCAACTCTCTATTGGTAAGATGCATGAGCGGTATTACAAACGTTCATTAGAAGCCATGTCAGGTCTGGTGAAGAACACCGGCTTAACAGACTTCGAGAAAATGTCCCACGGGGCATCTGCACGTGAGATGATTCGTAAACGTGCACTCGGTGGTTTGGGTGGCGGGGTCTCTCAGTTTGGTAACCGTCTGTTTGATAACGTCTTCAATGTTGTCGATCAACGTAGCCAGGATGTGAACTACCGCGCCAGTGGTGGTATCGCCGCTGCCCGTGCGGCAATGATGGGTGCACAGCAAGCCCGTATGATGGGTCGCTCGTTAGGGGCACGTGACTACGGTCAGATGGCGGGGCAAGGTTTAGCGGCGATGCTGCCGATGCTGATGTCGACCATCGCCCGTCCAGCACTGGCAAAGAACGATAAACTGAACCGCATGGGTCATGACCTGAGTTATTACTCGAATGCGGCACCAGGGTTAATCAACGGCTGGTTGCGTAATCGTCAATCATTCGATGAGAACTACGACCCAAGTGCACAGGGTAATAAATGGTACAGTCGTATTCAGAAACGTGTGGTCAACCCGTTCTTGAATAACACCCTGTTCCAGATGCCGGCTTCAATGGGTAACCGGACGCGCCTGAACAACCCAGGTGTTAAGGATCTGACTGAGCCGGCTATCTACGACCAGATGTCTCGTCGTTCATTGGTAGAAGTCATCCCAGGATTACTCACCAAACAACTTGCCCAACAAACGGCGATTGCCAATAAGATGGGTGCGGATGTGACGGGTGTGAAAGAAGTTCACTTCAATCACATCCAAGGTGGGTTCACCTCACAGAAACGCGCCAATGTTGATCTGCGCTCGTCTATCTTTAACCGCAATGAGTTCTCGTCTGCCGCTGGGTCATTCAACACCATGGTAGACACACTCGACCCAAACAACGAGCTTTCACCAAATGCCCGTGTGGCCTTAGCAATGCGTTTTGCTAAAGACGCCGAAGCCGGTGATGGGTTTAATGTTAACAGCTACCTGAAGTCCAGTGGCTGGGGTACAGCGAATAAGGATGCGGTTCAGGAGATTACCGAGTTCCTGCATAAGCGCTTTGATACGCGTGAAGCAACCGGTAAAGCCAAAGCCTTTGGTGAATTCCAAATCGGTGATGCGCCTGAACTAGCACAGATGCGTAACAACCTCTCGAACATCATGCAAGCCCAGTCTCAGTACATGCCAAATGTCGAGCAGTCATTGAACACCATGGCTAACAGTGGCAGTCGTGCACAACT
CAAAGACATGGGTGTCATCAAACGCGTCAATGGTCAAGATGTCTTTAACCATGAGATGTATTGGGAGATGATGCAGAAGTTCATTTCCAATCCTAACTATCGCCCTGATCGTGAAGGAGAGGATGAGAACGAACGCGGCAGTAATGTCGACCCGGGTGCTGAGTTGGGTGCAATGGGTCGTGAGGCCATGGCCAACTTCAAACGGGGTGTGACGGATCAGTTTGGTAACCTGCGTGGTAAACTTACTCGTGATCAAGAGGAGGCAATCCGTGATCAATTGGCACAAGCTCGGGCAAATGGGGTTGATGCTTATAACGTGGTCAAGCGTCAACTTACTGCTCGCTACGGCGATGTGGCTGTGGGTCGTATTGTCCGCCGTTTGGATAACAGCTTTGGTCAGCGATTTACCGATAAGATAAACGGCTTCTCTGTTGATGGCGCTGCTGCTTCAGCAGGCAACGCTGCCCGTGCAGCCGGTCAAGGTGCATCGGATCTCTTTGGTAAAATCAAAGGCGCATCGCTCGATACCTCAGGTGTGAAACAAGCACTGATGGATGCGGCGAGTAACGGTAAAGAAGCATTACGTCAAGCCACCACAGAACTGCAACGTCGCTTTGGTCGTCAGGCCGTAGAAGAAGCTGCAGAAGAGATCGCTAAAGCCGGGGATGTTGTTAAGTCCTCTAGTGATGTCCTTGGGTCTCGTGTTGCTGCTGCTGTCTCCAGAACCCGTGCTGGGGTTTCTTCAGCAATGGATGAAGCAACAGAGTCTGCTTCAGAGACCGCCGCTATCGTACAAGCCCAGCTTGACAACAACGCGATACTGCGTGAGGTCGTTGGGGTGTTGGCATCGGTACGTGATGCAACCGCTGCAACAAAAGATGCGACCATTGCTCAGGTGACCGGTAACCCGGATATCTTGAAAGACGGGATCACTGAACGTGCCAGTTGGTTACGTGGGTTAGGTAAACGCATGGGGGAGAGCAAAGCAGGTCGTATTGCGCGCTTCTCTCGTACGGTATTCAAACACTTACCTAACATTAACACACCGGTGATGACCGTGGCTAAATGGGCAACCATCGGTCCTGCGGTTGTTGGCTTTAAAGCAACCCGTGGATTGTGGAACATGCTGCGTGATAAGAAACGTGGTGATGCTGCTGATCCTGATCATGATGGTGTGCGTAACAACTCGGTGTTCGATCTCCTGCGTCGCCGTAAGCAACAGAAGCTGGAACAAGACGAAGCCAAGCGTGCACACAACGCAGATGACAACGGAAAAGAGAAAGAGAAACCTACCACCCTGTTTGGGTTAGTGGCAGGACTGTTCTCGTCCGTGACTGGGTTGGTCTCGGGGATTAAGGAGTTTGGGATCCTCGGGGGATTGGCTAAGTTCTTGGGTCTTGGTTGGGTCGGTGATCTGATTGGTGGTCTGGGTAAAATCTTGGCCGGTAAGAAAGCCATTGATGCGGCCTCAGACATCATGGATGATCTCGGCGATGGAGATGAAGGTGATGATCGTCGCAGTCGTCGACGGGGTCGCCGTACCGGTGGGCGTGGCGGCCGTGGTCGTGGTGGGTTGATTCGTCGTCTGGCTGGCGGGACTGCTCGCGGTGTCAAAAGACTGGGTGCTGGGATTGGTCGTCGTACTGCAGGTGCATTGCGTGCAGGTGTCGGCGGTAATTTGAAAACCTTGGTCAAAGGTGGCGGAATTGTCACAGCCGGTCTTTCTGCTTTTGAGGCTTACAGTGCTTATAAAGAAGGAGACGATGCCGGTGCAGCTGAAGCAGTAGGTTCAGGTGTTGGCGGTATTCTGGGTGGCGCTGCAATGGGTGCGGCTATCGGTTCAGTTGTGCCGGTTGTCGGAACCGCCATCGGTGCTGTTGCCGGGGGTGCGATTGGTGCGCTCGGTGGTGGTGCGATTGGTCGTTCCTTATACGGCTGGTTCAATGACCCTGGGTTACTCCAGCAAATGCGTCTTCGTCAGTACGGTGTACCAGACAATG
ATACCGATCATGTCTCGGCTATCCTGAAACTTGAAGCAGCATTAGAACCGTATGTCAAGACTACTGATGATGGTTACGCCTCATTGGATCCAAAGGCCCCGATTGCGAAACTGGCATCTGGCTTCGTTGATGATCCTAATGACCGTGATCAGGTGGAGTCATTCGCAGGTTGGTTCCTCCATCGTTTCAAACCGGTCTTCCTGACACACAAAGCCGTGGCCAAGCAAGTTCTACCAAGCCAATCATTCATGGACCTTGATAAGTCAACGGATGAGGCTGCGAAGTATGAGATTGCTAAACGCGCACAACAGTTCGATGATTCTGCTGACCACCCATACACCTTTACCGGACGTGTGTTCCCGGACCTGAATGCAACTGATCGCAAACAGACCGAAGCACTGGTAAAAGATGTGGTCGATCGTTTACGGGTGAAAGCCTCGAAGATGACGACGGGTAAATCCACCTCAGTGTTTGCCACTGGGATGAGTGAAGTGGCGTCTAAGCGTGAGCTTAGTGGCGACATGGTCAAAGCGATCAACCCAACCGGTGACATCCCTGGGTTGAACACCGTGGGGCACAAAGAGTCATCGTGGTTCACTGGTGATCGAACCGTGGTTAGTGCAGGAGACATCTTAGGCGGGTTGTTACCGAAGTCTGGTCAACCGATGGATGACCTGACTGCCTTGCGTATGAAAGTCTATGGCTTGCCTACGCTTGACGTGGATCGTGTTTCAACGCTCCTGCAGTTAGAACTGGTGATGGCGAACAAAGTGAAGTTCACGGACAGCGGTGCGACGTTTGATGGTAAAGCATCTGACATCTACCGTTTAGTGGCTGCTTCCTTTGGATTGAATCCATCCAGTCAGTGGTCGTACAAATCCTGGGAGCCGTGGTTCACTCGTCGCTTCTTGCCAGCGTACTTGTCCTTTGCAGGAACAGTCTACACGCAAACGGGCGATACCCGACCAACCTTAGCTGTGCCGAAGATGCCACCGGAACTGAAGTTTGCTGCTGCAACTGCAATGGCAAATGCGAAGTACACTGATGGGGTTAATGCGATCTCTGTCTGGACTATCAAGACATCTCCGTGGTCTACGGGTGAGATGAATGATGATGCTTCAATTATTCAAAACCATCTGAATAACTTGAAAGCCCAGGTGAAGCAAGCGCAGTATTCGGCTGAAGCAGTGAAAGGCGGTGTTCAGCAAAACAAAGACGGCACCACTGAGAAAGAGTGGCGTAAAGACTCGTCAGGTAACATGGTGAATAACCAAGTGAAGACGAGTACTGGTCAAGTCTATACCTCTCAGCGACAGGTCACTACCTACAACCCAGAAACAGGCCGTATTGAAACAGCGTACGGTGGTGGTGCGGGTCAAGGTGCTGGCGGAGCAAGCGGAGGGGTTGACAATACCGGTAAGGTCGGTCCAATCAAGTTAGGACCAGGTGCACAAGAGGGTGCGCGTATCTTAATCCGTCAGGCTGTGAAAGCGGGGATAACGGATAAGAACGAGATTGCCATGCTCTTGGCACAAACCCACTTGGAGTCCGGTGGCTTTAGTAAGCTTGAGGAGAACCTGCGTTACAAAGCAGAGACCCTCATGAAACTCTGGCCGAATCGTTTCCCAAGTCTTGCAGCAGCACAACAGCTGGCAACAGCGGGTCCTGTCGCAATTGCGAATGCCATCTACGGCGGACGCATGGGGAATGATAAACCGGGGGATGGTTGGAAGTACCGTGGGCGTGGCTTCATGCAGCTGACCGGTAAAGCCAACTACGCCGCAGCCAGTAAAGGAATGGGGGTCGACCTGGTCTCTGATCCAGATAAGCTCTCAACAGATCCTGAGATGGCTGCCAAGTCGGCATTGTGGTATTGGAAGTCCCGTGGCGGGATTGAAGATGCAGCGAAGAAAGGGGATCTGAATACGGTTACCAAACTGATCAATGGCGGGACACACGGTCTTGCTGAACGCGGTCAGTTGTTTAAACAGTATACGGAT
CTGGTTGGGACAGGTAAGTTTGATGACATCATCTCTGGTAAGGATACCTCTGCCGGGGCACAAACGGATGATGCGTCAACTGCTCAAACGAGTGTGCAGCAAGGCGGTCCAGATTCACCTGCGTTGTCCGGGGCGAAGACGGCAGCTGAACTCACTGCACCAGGTGCCTCTGCAGCTAACGCTGCGGCAGCGGCTCCGGTTACCCCGCCGTCAGGTCAGGGTGATGCACCGGCTCTGAAGAGTACCACAGGACCAAGTGTTAGCGCGACCGTCAGTGCAGGATCAAGTACACCGAATGCAAAAGACTCCACCGCATCGGCAATCGACAATCAGGCATCGACGACACCACCTCCGGGTGCAGCAACGACACCAACCCTGAATACCCCACCACAGATGCAACCGACCCAGGCGCAAGTTCCTAATGCCCCTCAGGTTGTTGCAGCTACCCCTGCTCCAGCTCAACCGGTCATGCCGAAAGAGTCTGTGGCTGCCCTGTCTGGTAGCAAGGACCATCTGTCGTCCATTGATGATAAGATGACTTCTTTGGTTGATGTGTTGAGGCAATTCATTGAAATGCAAATGCAGGCAATGAGTAAGCCTGCAGCAGCAACACAACAACCGGCTAGCGGGACAGGCGCACCTGCCGTAGACTTCCGTCGTAAGTACGGTAATCAGTAA
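
The CDS coordinates can be cross-checked against the protein length. The sketch below assumes the range is 1-based and inclusive (the usual GenBank convention, not stated on this page). It also includes a reverse-complement helper, since a minus-strand feature is read from the reverse complement of the genomic slice. The function names are illustrative.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive range (assumed convention)."""
    return end - start + 1


def reverse_complement(dna: str) -> str:
    """Reverse complement of an uppercase DNA string, as needed for strand '-'."""
    return dna.translate(COMPLEMENT)[::-1]


n_nt = cds_length(122424, 129101)
n_codons, remainder = divmod(n_nt, 3)
print(n_nt, n_codons, remainder)  # 6678 2226 0
```

Under that assumption the range spans 6678 nucleotides, or 2226 codons: 2225 amino acids plus the terminal TAA stop codon, which matches the protein length given above.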

Genome Context


Tertiary structure

PDB ID
3771a0cd2c26a12c5c1672a66ab840b04a23ddae0f264dd7dade3faf524d33a7
Source ColabFold
Method ColabFold
Resolution 0.4198
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
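
The confidence legend maps a per-residue pLDDT score to one of four bands. A minimal sketch of that mapping follows. The legend does not say which band the exact boundary values 90, 70 and 50 belong to, so this sketch assumes they fall into the lower band. The function name is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Assumption: scores exactly equal to 90, 70 or 50 go to the lower band.
    """
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for value in (95.2, 81.0, 63.5, 41.9):
    print(value, plddt_band(value))
```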