Genbank accession
QIG66676.1 [GenBank]
Protein name
central tail fiber J
RBP type
TF
Evidence Phold
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,70
TF
Evidence RBPdetect2
Probability 0,95
Protein sequence
MKNSMWHPSGFPRMITGSKGGGKGSSGGGSESPNTLRSKATVRLLDMLGEGEIKGLVNGAKSIYLNETPLVSASGDYNFKGVNWDIRVGLPSQSVMPGASGVEAITNVGAQVKYGAPLTRSLTDPDYDACKVTIRIPALSKADDKGNINGTTLQFVIDIRYQGGAFTTSSGVITLTGKCTSPYDREFYFALPKNPGGASAPWEIRVTRLTPDSDSVKLQNDMYFSSFEGTIEAKFSYPYTAYVGMVVDGEQFNQQIPERKYLIDGRIIKVPSNYTTRLYDVNGNISRNPSYSGVWDGTFKMEWTNNPAWVFYDMVTNDRFGLGDYIDVSQVDKWGLYEIAKYCDEYVPNGIGGVEPRMTFNGVVATKREAYDTLSSMASCFRGMAYWSSGSIVTTQDRPKDPVVLASQSNVVDGQFDRQSTALKSRHTVAMAKFLNPQDFYREDYAIYQDEAGIAKYGYRDTKFEAVGCTSRGQAYRMAKWTVLTELYEKGTTTYQAGLDHAGVRPGDIIAIQDPSLANVEFGGRISGGKPELITNGFFGAGITGWTSSVSSGGAVTWGAGNVTIKGNGTTRSYIEQAITTEVGKTYRMQITPSGGQAVTVWVGTTQGASDLSSTTPSAYSEISFTATGTTTWLRFSRQPTTTVTIDNISVREQTNTPSVINVDTPVTFEGGNLYWLNVQMPDGSIEESNIVIAAYDTPTTQITVSPAFTKTPDAESQWIISRNDLVPELIRCITIKESEPNKYDVMGLQYEPSKFAAVDVDAQFSLINTSDFPTGELGVPTAPLFNEYAYALGQTGTIMALDWSITAPKDPRIGQFEWQYQTRTETGTLNGWQAAGFTYDPIITIQDLEAGLYNFRVRCLGSFGVGVSGWTVSNDILLQGLNLPPATPTQFRISVLGDQSQFTWRVANQLNIRFYEIRHTPNVVDPEWNSAVVLVKEVAASGIQLPTMAGTFLIKSVSFTGVYSLDAAMIQNGVLGQALNAVEVLDEATAGFLGTKGNTLVVSGDLQLKPYNGNFPPLGVYGFNAPVDLGDEYTSRLSATIDIFGLDVRDTIDKWTSLDTVTALSTARSDQWSFELEYATTLDNPVAVVARASTGTYFDADGILKTAANNVPRYNYTPSDLSAVGVLLAEAAGTNLILQSQTFDNASWTKTNLTVTANAIAAVDGTTTADKLTEDTTASAVHQTAQQFAVTSGQSYVGSTFFKAGSAGRLLRVTMGAAFASNCWASFDPVTELVTLGADAVAAGFIKCLDGNYRVWVKGTATSTGNATLYISMQTSTSPVYTGDGTSFIYVWGAQAEQGAAMTSYIPTTTATVTRAAETITSQPTWTDWEQFITRDVNFRAIKFRAKLISNDPAVTPVLRGLTVNVDMPDRIIAGNDIAVPTAGLSIVFDPPFKGLTGLSMASQDMATGDRALITAKSENGFTIRYFNSTGTAVARTFDYNAVGYGVKK
Physico‐chemical
properties
protein length:1450 AA
molecular weight: 156716,68700 Da
isoelectric point:4,91500
aromaticity:0,10345
hydropathy:-0,16407

Domains

Domains [InterPro]
DC_0323
STR
4–657
IPR053171
Unmapped
15–527
IPR055385
ATT
102–232
DC_0129
STR
649–1008
QIG66676.1
1 1450
Architecture
STR
ATT
STR
ATT
STR
STR
STR 4-101 | ATT 102-232 | STR 233-367 | ATT 368-519 | STR 520-1008 | STR 1030-1443 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
QIG66676.1
1 1450
Domain Start End Length (AA) Confidence
N-terminal 1 527 527 0,8961
Central domain 528 905 379 0,2851
C-terminal 906 1450 544 0,5697
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-527
Central
528-905
C-terminal
906-1450

Taxonomy

  Name Taxonomy ID Lineage
Phage Rhizobium phage RHph_TM16
[NCBI]
2509755 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QIG66676.1 [NCBI]
Genbank nucleotide accession
MN988459.1 [NCBI]
CDS location
range 6717 -> 11069
strand +
CDS
GTGAAGAACTCTATGTGGCATCCTTCTGGCTTTCCCCGGATGATCACCGGCTCGAAAGGTGGTGGTAAAGGCTCTTCCGGGGGAGGATCGGAAAGCCCGAACACCCTCCGCTCGAAGGCCACCGTCCGCCTCCTCGACATGCTTGGGGAAGGCGAAATCAAGGGCCTCGTCAACGGGGCCAAGTCGATCTACCTGAACGAGACGCCACTGGTCTCGGCTTCGGGCGACTACAATTTCAAGGGCGTCAACTGGGACATCCGTGTCGGCCTCCCCAGCCAGTCGGTGATGCCGGGAGCCTCGGGCGTCGAAGCCATCACCAACGTCGGTGCCCAAGTGAAATATGGTGCACCGCTGACGCGCTCGCTGACAGACCCAGACTATGACGCCTGCAAGGTGACGATCCGTATCCCGGCGCTCTCCAAGGCTGATGACAAGGGCAACATCAACGGCACGACGCTGCAGTTCGTGATCGACATCCGCTATCAGGGCGGGGCGTTCACCACCTCGTCTGGCGTTATCACCCTGACCGGAAAGTGCACCTCGCCCTATGACCGCGAGTTCTACTTCGCGCTGCCGAAGAACCCGGGCGGGGCATCTGCGCCGTGGGAAATCCGCGTCACCCGCCTGACCCCTGACAGCGACAGCGTCAAGCTCCAGAACGACATGTACTTCTCGTCGTTCGAGGGCACCATCGAAGCCAAGTTCAGCTACCCCTACACCGCTTACGTCGGCATGGTGGTCGATGGCGAGCAGTTCAACCAGCAAATCCCAGAGCGCAAGTACCTGATCGATGGCCGCATCATCAAGGTGCCGTCGAACTACACCACCCGCCTCTACGACGTGAACGGCAACATCAGCCGAAACCCGTCCTACTCGGGGGTCTGGGACGGCACCTTCAAGATGGAGTGGACGAACAACCCCGCGTGGGTGTTCTACGACATGGTGACCAACGACCGCTTCGGACTGGGGGACTACATCGACGTCAGCCAAGTCGATAAGTGGGGCCTCTACGAGATCGCCAAGTACTGCGACGAATACGTGCCGAACGGCATCGGTGGCGTTGAGCCCCGCATGACCTTCAACGGCGTCGTGGCCACCAAGCGCGAAGCCTACGACACCCTGTCGTCCATGGCCTCCTGCTTCCGTGGGATGGCTTACTGGTCCTCGGGCTCGATCGTCACCACACAGGACCGACCGAAGGACCCTGTCGTCCTCGCCTCGCAGTCGAACGTCGTTGACGGCCAGTTCGACCGCCAGAGCACGGCGCTGAAGAGCCGCCACACCGTCGCCATGGCGAAGTTCCTCAATCCGCAGGACTTCTACCGCGAAGACTACGCGATCTATCAGGACGAAGCCGGGATCGCCAAGTACGGCTATCGCGACACCAAGTTCGAGGCTGTCGGCTGCACCTCTCGCGGTCAGGCCTACCGCATGGCCAAGTGGACGGTGCTGACCGAACTCTATGAGAAGGGCACGACCACCTATCAGGCCGGTCTCGATCACGCTGGCGTCCGCCCGGGCGACATCATTGCCATTCAGGACCCGTCGCTGGCCAACGTCGAATTCGGCGGCCGTATTTCTGGGGGCAAACCGGAACTGATCACCAACGGCTTCTTCGGGGCCGGGATCACCGGCTGGACCTCTTCTGTATCCTCGGGTGGCGCGGTCACGTGGGGCGCGGGCAACGTCACCATCAAGGGCAACGGCACGACCCGCAGCTACATCGAGCAGGCGATCACCACGGAAGTCGGCAAGACCTACCGGATGCAGATCACCCCGTCTGGCGGCCAAGCCGTGACCGTCTGGGTGGGAACGACGCAGGGGGCCTCCGACCTGTCGAGCACTACGCCCTCGGCGTACAGTGAAATATCTTTCACTGCCACAGGCACGACGACATGGCTGCGTTTCAGTCGCCAGCCGACCACTACCGTCACCATAGATAACATCTCGGTACGGGAGCAGACCAACACGCCGTCCGTCATCAACGTCGATACGCCGGTCACTTTCGAGGGCGGCAATCTCTACTGGCTGAACGTCCAGATGCCGGATGGCTCGATCGAGGAAAGCAACATCGTCATCGCGGCCTACGACACGCCGACGACACAGATCACAGTCTCCCCGGCGTTTACCAAGACACCGGACGCGGAGAGCCAGTGGATCATATCGCGCAACGACCTCGTGCCCGAACTGATCCGCTGCATCACCATCAAGGAGAGCGAACCGAACAAGTACGACGTCATGGGCCTGCAGTACGAACCTTCGAAGTTCGCGGCTGTGGACGTGGACGCTCAGTTCTCGCTGATCAACACCTCGGACTTCCCGACCGGCGAGCTTGGCGTGCCCACGGCTCCGCTGTTCAACGAATACGCCTACGCCCTCGGCCAGACGGGCACGATCATGGCCCTCGACTGGTCGATCACCGCGCCGAAGGACCCGCGCATTGGCCAGTTCGAATGGCAGTACCAGACCCGCACGGAAACCGGGACCCTGAACGGCTGGCAGGCGGCAGGCTTCACCTATGACCCGATCATCACCATTCAGGACCTCGAAGCCGGTCTCTACAACTTCCGTGTCCGCTGCCTCGGCTCCTTCGGTGTCGGGGTCTCGGGCTGGACGGTGTCGAACGATATCCTGCTGCAGGGCTTGAACCTGCCCCCGGCCACGCCGACCCAGTTCCGCATCTCCGTCCTCGGGGATCAGAGCCAGTTCACGTGGCGGGTGGCCAACCAGTTGAACATCCGCTTCTACGAAATCCGCCACACGCCAAACGTCGTGGACCCGGAGTGGAACTCGGCTGTCGTCCTCGTCAAGGAAGTCGCAGCCTCGGGCATCCAGCTTCCGACCATGGCCGGTACCTTTCTGATCAAGTCGGTGTCGTTCACCGGGGTCTATTCGCTCGATGCGGCGATGATCCAGAACGGCGTCCTCGGGCAGGCGCTGAACGCGGTGGAAGTTCTGGACGAAGCAACCGCAGGCTTCCTCGGCACCAAGGGAAACACTCTCGTGGTATCCGGCGACCTGCAGCTAAAGCCCTACAACGGCAACTTCCCGCCGCTGGGGGTTTACGGCTTCAACGCCCCTGTTGACCTCGGGGACGAATACACCTCGCGCCTGTCGGCCACCATCGACATCTTCGGCCTAGACGTCCGCGACACGATCGACAAGTGGACGTCGCTCGATACGGTGACAGCGCTCTCGACGGCACGATCCGACCAGTGGAGCTTCGAACTCGAATACGCCACGACGCTGGACAACCCCGTCGCCGTGGTCGCTCGTGCCTCGACCGGCACCTACTTCGACGCCGATGGTATTCTGAAGACGGCGGCCAACAACGTTCCGCGCTACAACTACACGCCATCCGATCTGAGTGCCGTTGGCGTGCTGCTGGCGGAAGCGGCAGGGACCAACCTGATCCTGCAGTCGCAGACCTTCGACAACGCTTCGTGGACCAAGACCAACCTGACGGTCACGGCCAATGCTATCGCAGCCGTGGACGGCACGACGACGGCCGACAAACTGACGGAAGACACGACCGCTTCGGCCGTGCACCAGACCGCGCAGCAGTTCGCCGTCACCTCTGGCCAGAGCTACGTAGGGTCCACCTTCTTCAAGGCTGGGAGTGCTGGCCGCCTGTTGCGTGTGACGATGGGAGCGGCGTTTGCCTCCAACTGCTGGGCCTCCTTCGACCCCGTGACAGAACTTGTCACCCTCGGGGCTGACGCCGTAGCTGCAGGCTTCATCAAGTGCCTCGACGGCAACTACCGCGTCTGGGTGAAAGGGACGGCAACGTCTACCGGCAACGCCACCCTGTACATCTCGATGCAGACCTCGACCTCGCCAGTCTACACTGGCGACGGCACGTCCTTCATCTACGTCTGGGGCGCACAGGCCGAACAGGGCGCGGCAATGACGTCTTACATCCCGACGACGACGGCCACGGTGACCCGCGCAGCGGAAACCATCACCTCACAGCCGACGTGGACGGACTGGGAGCAGTTCATCACCCGCGACGTGAATTTCCGGGCGATCAAGTTCCGCGCCAAGCTGATCTCGAATGACCCTGCGGTTACGCCCGTGCTCCGGGGCCTCACGGTGAACGTTGACATGCCTGACCGCATCATCGCAGGAAATGACATCGCGGTTCCAACTGCAGGTCTGTCGATCGTCTTCGATCCTCCCTTCAAGGGCCTCACTGGTCTGTCGATGGCGAGCCAAGACATGGCGACCGGCGACCGCGCACTCATCACTGCGAAGTCTGAGAACGGCTTCACCATTCGCTACTTCAACAGCACAGGAACGGCTGTGGCTAGAACCTTCGACTACAACGCAGTTGGCTACGGAGTGAAGAAATGA

Genome Context

Genome Context

Tertiary structure

PDB ID
d75fc28b4d548dd503c08f2964cd1b361364558e14865d0b28c1acaaeaee6417
ColabFold
Source ColabFold
Method ColabFold
Resolution 0,8560
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Patterns of diversity and host range of bacteriophage communities associated with bean-nodulatin bacteria Vann Cauwenberghe,J., Santamaria,R.I., Bustos,P., Juarez,S. and Gonzalez,V. 1998-01 GenBank