UniProt accession
A0A7U0G9K9 [UniProt]
Protein name
Tail fiber protein
RBP type
TF
Evidence UniProt/TrEMBL
Probability 1,00
TF
Evidence GenBank
Probability 1,00
TF
Evidence RBPdetect
Probability 0,91
TF
Evidence RBPdetect2
Probability 0,96
Protein sequence
MADYKLSQLNSIDTIRSDDLLHVRVKKRPEMLGDEDRRMTYQDFLASFKLERFVQIAGSTMTGDLGIVKLLYGGKAVFDPTGSSEINIGDVLKTFKINANGLKLTIADASRSATVYHTLNKPSPNELGMRTNEENDARYARLAFTNNFSIRQAIATDGEQLTLKKATNEGASFIQGRDANNQQTWYVGQGNANNNSVYLYNHATGSMLTLDATASFNKTLRITGQVQPSDLSNLDARYFTQTVANKKFAQLAGDNTFSGANTFTNLVVKKNANAITLQNTDASTPLYILGKKSDGTNKWYVGTDSEDTRLNIYNYLTGSQVSLGTTIGINKTVQITGQVQPSDFSNIDSRYIPAATLSTIARTNVTNTFSARQVINSDGEALLLKAKTATTLFIRGVDADGTAKWYVGNGDADNNNNLRLFNYKTGKGLTIGSTFEMNATLAITGQVQPSDFSNLDARYYTQSAANSRYMLSSSSGTGTEVGDSDGVTWNAKTGLYNVTGKSGGSTQLVYHMYQGSSSTPSAQLKFNYRNGGFWYRSARDGYGFENTWAKIYTDQDKPTPSDIGAYTKAETDQKIAQAISDSTDLNKIYPVGIVTWFNSNVDPNTALPGLTWTYLNNGVGRTIRIAAANGSDVATTGGSDSVTLSVGNLPSHTHSFSATTSSFDYGTKTSNSTGAHTHTVSGSTNTTGNHQHSVGGRYGGDSIGGKQRVQVSGTNQISSVAGDHSHTVSGTAASAGAHSHTVGIGAHTHTVSGNTGGTGSGSAFSVTNQFYKLMAWVRTA
Physico‐chemical
properties
protein length:780 AA
molecular weight: 83590,99290 Da
isoelectric point:8,89369
aromaticity:0,08718
hydropathy:-0,45282

Domains

Domains [InterPro]
A0A7U0G9K9
1 780
Architecture
ATT
STR
ATT
STR
ATT 1-354 | STR 355-379 | ATT 380-471 | STR 472-776 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Salmonella phage vB Seyj1-1
[NCBI]
2801511 Uroviricota > Caudoviricetes > Andersonviridae > Felixounavirus > Felixounavirus fv1
Host Salmonella paratyphi
[NCBI]
54388 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QQV89222.1 [NCBI]
Genbank nucleotide accession
MW423797 [NCBI]
CDS location
range 49152 -> 51494
strand -
CDS
ATGGCAGATTACAAATTGAGTCAATTGAACTCAATCGACACAATCCGTTCAGATGACCTTCTTCATGTCAGAGTTAAAAAGAGACCTGAAATGCTGGGTGATGAAGACCGTCGAATGACCTATCAAGACTTCTTAGCATCTTTTAAGCTTGAAAGATTTGTTCAGATTGCTGGTAGTACTATGACGGGTGACTTAGGGATTGTCAAGTTACTTTATGGTGGTAAGGCAGTCTTTGACCCAACAGGCTCTTCTGAGATTAATATTGGGGATGTTTTAAAGACTTTTAAAATCAACGCAAATGGTCTTAAACTAACTATTGCAGATGCTTCAAGGTCAGCAACTGTTTACCATACTCTGAATAAGCCAAGTCCTAATGAACTTGGAATGAGAACTAATGAAGAAAATGATGCAAGATATGCAAGACTTGCTTTCACAAATAACTTCAGCATTAGGCAAGCAATTGCCACTGATGGTGAGCAGTTAACTCTTAAGAAAGCAACAAATGAAGGCGCTTCATTTATTCAAGGTAGGGATGCTAATAACCAGCAAACTTGGTATGTTGGGCAGGGAAATGCAAACAATAATAGTGTTTATCTGTACAATCATGCCACTGGCTCAATGTTAACCCTTGATGCAACAGCATCATTTAACAAGACACTAAGAATCACTGGACAAGTTCAACCTTCAGATTTATCTAACTTAGATGCCAGATATTTCACTCAGACAGTTGCTAATAAGAAATTTGCACAGTTAGCTGGGGATAACACTTTTAGTGGTGCTAATACTTTCACTAACCTTGTTGTTAAGAAGAATGCTAATGCTATTACTTTGCAGAATACTGATGCAAGTACACCACTTTACATTCTCGGTAAGAAGTCTGATGGGACAAATAAATGGTATGTTGGTACAGATTCTGAAGATACACGTCTAAACATTTATAACTACCTTACAGGTTCGCAGGTTTCACTAGGTACAACTATTGGTATCAATAAAACCGTGCAAATTACTGGACAAGTTCAACCCTCAGATTTCTCTAACATTGACTCTAGATATATTCCGGCAGCAACATTAAGTACAATTGCAAGAACTAATGTAACTAATACATTCTCTGCACGACAAGTTATCAACTCTGACGGTGAAGCTCTTTTATTAAAAGCAAAAACTGCAACGACATTGTTCATTAGAGGGGTAGATGCTGATGGGACAGCTAAGTGGTATGTTGGTAATGGTGATGCCGATAACAATAACAACTTAAGGCTGTTTAACTACAAAACAGGCAAAGGGCTTACCATTGGTTCAACATTCGAAATGAATGCCACTTTAGCAATCACTGGACAGGTTCAACCTTCAGATTTCTCTAACTTAGATGCCAGATATTATACGCAATCTGCCGCAAACTCTAGGTATATGCTTTCTTCCTCTTCTGGTACTGGTACAGAGGTAGGTGATAGTGATGGTGTTACTTGGAATGCTAAAACTGGTCTGTATAATGTTACAGGAAAATCTGGTGGTTCAACGCAACTGGTCTACCATATGTATCAGGGTAGTAGCTCTACACCATCTGCTCAATTAAAATTTAACTATAGAAATGGTGGTTTTTGGTATAGGTCAGCAAGAGATGGGTATGGATTTGAGAATACTTGGGCTAAAATCTATACAGACCAAGATAAGCCAACCCCTTCAGATATTGGTGCATACACTAAAGCAGAAACTGACCAGAAGATTGCGCAGGCAATTAGTGACTCTACAGACCTGAATAAAATCTATCCAGTAGGTATTGTAACGTGGTTTAACAGTAATGTTGACCCTAATACTGCACTTCCTGGATTAACTTGGACGTACCTGAATAATGGTGTTGGTAGAACTATCAGAATTGCGGCAGCAAACGGTTCAGATGTTGCTACAACTGGCGGTTCAGATTCTGTAACGTTATCTGTTGGTAACTTACCTTCACATACCCACAGCTTCTCAGCTACAACGTCAAGCTTTGACTATGGTACGAAGACATCTAACAGTACTGGTGCTCATACACACACTGTGAGTGGTTCTACCAATACCACTGGTAACCACCAGCATAGTGTAGGTGGTCGTTACGGTGGTGACTCTATCGGTGGTAAACAGCGTGTTCAGGTATCGGGAACAAACCAGATTTCAAGTGTTGCTGGTGACCACTCCCACACTGTATCAGGTACTGCTGCATCTGCTGGCGCTCACTCTCATACAGTAGGTATCGGTGCTCATACCCACACAGTATCTGGTAACACTGGTGGTACAGGTTCTGGTTCAGCATTTAGTGTTACTAACCAGTTCTACAAGCTGATGGCTTGGGTAAGGACTGCTTAA

Genome Context

Genome Context

Gene Ontology

Description Category Evidence (source)
GO:0098024 virus tail, fiber Cellular Component IEA:UniProtKB-KW (UniProt)
GO:0005198 structural molecule activity Molecular Function IEA:InterPro (UniProt)
GO:0019062 virion attachment to host cell Biological Process IEA:UniProtKB-KW (UniProt)

Tertiary structure

PDB ID
fc74f6a95a8168416e11dbcdfa7c2cb2617c30f10622134e737d78e7243ea20a
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7037
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50