Genbank accession
AUR85102.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect2
Probability 0,65
Protein sequence
MLKCTPIKENITVYEGDDCTLEFVSSLGDITGASIRLQARIDTSSRYAIIDVDGVILDSTKGSYSVTLPSGDTLGKASEDSYHYDVKIIKSDGSIHTDRSGMMTIDARTTDLTTTKTTTVYGVDVDQDGNLLRAKAQQVNFLGAGVKLTSNNQGGVDIEIDRSATAPTLALQFAGIYDDIQALNDAVPKPSEHMQAIVIQPSEYYYHVVGSSWQQLAPVGSFHPKYLGAYDTVIDLRVAEPTPDDDSLAIVGTTSKTFYVYESGKWDIVNHADIPGLDARLTTQENKMVTTQKDVSDLQTVSGKNTSDIAALYAPDKATFDAKLDARLVKPEADISALQASTAQIQAAQTSSEKIVTDNTKRLGTAEQHLQTLQSNDQKHDSEISAIGNDITQLQGQVYDGNDISDDDGNSLSDITGLKFVGAQVSDDDGDRTATVTVSPKITVADGQQPGSHSEMGNAIVFEGSQVSADPNDPNVIKVAVHATSHNGITLGDGANASREVQTILFNGHQAYGSGTTAEIHIEFVHFKTIAERDAWTAKFSTHMDFDVIALVDADENGFVAYYKFDAATKVWLEYDAQGVVMSDSNGAIPKNIKTVVFGPGFTIQQAGDQEDAALVTYSETGDGGIDGITVSQDWGNSAESTKVTAVQTMYPIEVNQNQIGDTPLEGNAMLSIDPRAYETQHGNACLVKLDMVQTVAGQHPHAVYMTEEVVPTGEYFNLNAQGRGIDVQDSTGGDTAETGGQMTEVLVKVSFLDTAPEDGVVNVWLEYKDPSDVIAKEILLDVNGNPLAVGRHCNVGDTVGEFIIAGAFMAKATQPLKVMVETEFDVNSKITLDPVNTMVCLNQFSNGYETSVARIEFLRRAAIQITPAMQKFDNKMLSLSDELKGVNQILTVIGVGDAGDTLNEFGIQNLTSVGVKIENDTLTVKDAGSLADFYVDTLIDNTHTRMLRGKEVTSAITIANPDGAFEYEAYKWAGKPDHVTQVYSGRSNDTITINDGWTVIKGVFIAEQVNGNAYGHVLTFDVPEDANNVLILVRPQLEQSPSTLELQEFDWGTSTDFTGYVEIERYKSREDHYRFDEAYGEYALNNTGYQALRYTINNTPSTGNPMPVGVLLKGKAPIEIDNTVNQVSGSIDPAHDGAIKFLKDGEASISKSYNLWDEQGTDNTVTFWDVLIDTDGNETKIPESEKTFTVKANTGAPGVVYSIPAYAVDVETGQRVGGRATSNKVDGAYVQSQNISEYIVQTIVDFKELVADAGDTSDLISAPLNKSLVVDRRYYTFSGNTAQNVNIADLSIPSDVMLKGVTVEVKGVDGFSSEQAEHRYKESTQTLVVHVGGVSSGTIFLEFWSK
Physico‐chemical
properties
protein length:1347 AA
molecular weight: 145627,57980 Da
isoelectric point:4,46046
aromaticity:0,07424
hydropathy:-0,28241

Domains

Domains [InterPro]
AUR85102.1
1 1347
Architecture
ATT
STR
STR
ATT 3-312 | STR 313-402 | STR 459-1347
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Vibrio phage 1.068.O._10N.261.51.F8
[NCBI]
1881311 No lineage information
Host Vibrio lentus
[NCBI]
136468 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Vibrionales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AUR85102.1 [NCBI]
Genbank nucleotide accession
MG592445.1 [NCBI]
CDS location
range 8319 -> 12362
strand +
CDS
ATGCTTAAATGCACGCCAATTAAAGAAAACATAACAGTATATGAAGGTGATGATTGCACACTAGAGTTCGTCTCGTCTCTTGGTGACATCACTGGTGCAAGTATCCGACTGCAAGCCAGAATTGACACTTCTTCACGTTACGCAATTATTGACGTAGACGGTGTCATCCTTGATTCAACTAAAGGTTCGTACTCTGTCACACTCCCTAGTGGTGACACTCTAGGTAAAGCGTCTGAAGATTCTTACCATTACGACGTCAAAATAATCAAAAGCGATGGTTCAATACATACTGACCGCAGCGGTATGATGACGATTGATGCTCGTACCACCGATTTAACCACTACTAAAACCACCACTGTTTACGGGGTGGATGTCGACCAAGACGGTAACCTCCTAAGAGCTAAAGCTCAGCAAGTCAACTTCCTCGGTGCAGGGGTTAAATTAACCTCAAACAACCAGGGTGGTGTTGACATTGAGATTGACCGTTCTGCTACAGCTCCTACCCTGGCTCTACAGTTCGCAGGTATTTACGACGACATTCAAGCTTTGAATGATGCCGTACCTAAACCTAGTGAGCACATGCAAGCCATAGTCATTCAACCTAGTGAGTATTACTATCATGTCGTAGGCAGTTCATGGCAACAGTTAGCACCTGTAGGTTCATTCCACCCTAAATATTTAGGTGCGTATGACACCGTGATTGACCTAAGAGTTGCGGAACCGACACCTGATGATGACTCGTTAGCTATAGTGGGTACAACCAGTAAAACGTTCTATGTGTACGAGTCTGGTAAGTGGGATATTGTGAATCACGCTGACATACCAGGGCTTGACGCTAGGTTGACCACTCAAGAAAATAAAATGGTTACGACTCAAAAAGACGTATCCGACCTACAAACCGTATCAGGTAAGAATACTTCTGACATTGCCGCTTTGTACGCTCCTGATAAAGCCACGTTTGACGCTAAGCTTGATGCACGTCTAGTTAAGCCTGAAGCAGACATAAGTGCGTTACAGGCCAGTACAGCTCAAATTCAAGCCGCGCAGACATCATCTGAGAAGATTGTTACCGACAACACTAAACGACTCGGTACAGCTGAGCAGCACTTACAGACTCTACAGTCGAATGACCAGAAACATGACAGTGAGATATCTGCTATCGGTAACGACATCACTCAGCTTCAGGGTCAGGTTTATGACGGTAACGACATTAGTGACGACGACGGTAACTCATTATCTGACATCACAGGTTTGAAGTTCGTAGGTGCTCAGGTGTCTGACGATGACGGTGATAGAACAGCAACAGTGACAGTGTCACCTAAAATCACTGTGGCCGACGGTCAACAACCCGGTAGTCATTCAGAGATGGGTAACGCGATTGTATTCGAAGGGTCACAAGTGTCAGCTGACCCGAATGACCCGAATGTGATTAAAGTTGCTGTTCACGCAACCAGTCACAATGGAATCACACTGGGTGACGGTGCAAACGCGAGTCGTGAAGTGCAGACTATTCTGTTCAACGGCCATCAAGCATATGGTTCAGGTACTACTGCTGAGATTCACATTGAGTTCGTCCACTTTAAAACGATTGCTGAGCGTGATGCCTGGACTGCTAAATTTAGCACCCATATGGATTTCGATGTCATTGCTTTAGTTGATGCTGACGAGAACGGTTTCGTTGCTTATTATAAGTTTGATGCAGCAACTAAAGTGTGGTTAGAGTATGACGCTCAAGGTGTCGTAATGTCTGACTCTAATGGCGCCATCCCTAAGAACATTAAAACCGTTGTATTTGGTCCAGGGTTTACAATTCAGCAAGCAGGTGACCAAGAAGACGCAGCTCTCGTGACTTACAGTGAAACTGGTGATGGAGGTATTGACGGTATCACTGTTTCTCAAGATTGGGGTAACTCAGCAGAGTCGACTAAGGTGACTGCCGTTCAGACCATGTACCCGATAGAGGTTAACCAGAATCAAATTGGTGACACTCCTCTAGAAGGTAATGCAATGCTTTCTATTGACCCTCGCGCGTATGAAACTCAGCACGGTAACGCGTGTTTAGTTAAGTTAGACATGGTTCAGACGGTAGCAGGACAGCATCCTCACGCTGTTTACATGACTGAAGAGGTTGTCCCAACTGGTGAATACTTCAACCTTAACGCTCAAGGTAGAGGTATCGACGTCCAAGATAGTACGGGCGGTGACACTGCTGAAACTGGCGGTCAAATGACTGAGGTATTAGTTAAGGTGTCATTCCTAGATACTGCACCTGAAGACGGGGTTGTCAACGTATGGTTGGAATACAAAGACCCCTCAGATGTTATCGCTAAAGAAATCTTACTTGATGTGAACGGTAACCCTCTGGCTGTTGGCCGTCACTGCAATGTGGGTGACACTGTTGGTGAATTTATCATTGCTGGTGCATTTATGGCTAAGGCAACTCAACCTCTGAAAGTGATGGTTGAGACTGAGTTTGATGTTAACTCTAAGATTACACTTGACCCTGTTAACACCATGGTTTGCTTGAACCAGTTTAGTAACGGTTATGAGACTTCAGTTGCGCGTATTGAGTTTTTGCGTAGGGCTGCAATTCAGATTACACCTGCAATGCAAAAGTTTGACAATAAGATGCTTTCACTTTCTGATGAACTGAAAGGTGTCAATCAGATATTGACCGTTATCGGAGTTGGTGATGCTGGTGATACACTCAATGAGTTCGGCATTCAAAACCTAACCTCTGTCGGTGTGAAGATTGAAAACGACACTCTCACTGTTAAAGATGCAGGTTCGTTAGCTGACTTCTATGTTGATACGTTGATTGATAATACCCACACTAGAATGCTTCGAGGTAAAGAGGTTACGTCAGCTATCACTATTGCCAACCCTGACGGTGCGTTTGAATATGAGGCGTACAAGTGGGCTGGTAAGCCTGACCACGTTACACAGGTTTACTCAGGTAGAAGTAATGACACAATTACAATTAATGATGGTTGGACTGTCATTAAAGGTGTATTCATTGCCGAACAAGTTAACGGTAATGCTTACGGTCACGTGCTTACTTTTGACGTACCTGAAGACGCGAATAACGTTTTAATCTTGGTTCGTCCTCAACTGGAACAGTCACCAAGCACACTTGAGTTACAAGAGTTTGATTGGGGTACGTCTACAGACTTCACTGGGTACGTTGAGATTGAGCGATATAAGTCTCGTGAAGACCATTACAGATTTGATGAGGCTTATGGTGAGTACGCTTTAAATAACACTGGTTATCAGGCGCTTCGTTACACAATCAACAACACTCCAAGTACAGGTAACCCGATGCCTGTGGGGGTGCTACTGAAAGGTAAGGCTCCAATTGAGATAGACAATACTGTCAATCAAGTTAGCGGTAGTATTGACCCTGCACATGATGGTGCTATCAAGTTCCTTAAAGATGGTGAAGCATCTATTAGTAAGTCGTACAACCTGTGGGATGAGCAAGGTACTGACAATACCGTTACGTTTTGGGATGTGTTGATTGACACTGATGGTAATGAAACCAAGATACCTGAGTCGGAAAAGACGTTCACCGTTAAAGCTAATACTGGTGCACCAGGTGTTGTTTACTCAATTCCAGCTTATGCGGTTGATGTCGAAACAGGTCAGAGAGTAGGTGGTCGAGCAACCTCCAACAAAGTAGATGGCGCATATGTGCAGAGCCAGAACATTAGTGAGTATATCGTTCAGACCATTGTTGACTTTAAAGAGTTAGTTGCAGATGCTGGTGATACGTCAGACCTAATTTCCGCACCTTTAAATAAATCCTTGGTTGTGGATAGACGTTACTATACGTTCTCAGGAAATACTGCACAAAATGTCAACATTGCTGACCTTTCTATACCGTCAGATGTTATGCTTAAAGGTGTGACAGTTGAGGTTAAAGGTGTGGACGGGTTCTCATCTGAGCAAGCTGAACACAGGTACAAAGAATCGACACAGACACTAGTGGTACATGTTGGGGGTGTCAGTTCTGGAACCATATTCCTTGAGTTTTGGAGTAAATAG

Genome Context

Genome Context

Tertiary structure

PDB ID
325e5a447a5ddd6fdf52afff1a9ad333c4ff87241ac3a12a56713d42b91bccf2
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,4457
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
A major lineage of nontailed dsDNA viruses as unrecognized killers of marine bacteria Kauffman,K.M., Hussain,F.A., Yang,J., Arevalo,P., Brown,J.M., Chang,W.K., VanInsberghe,D., Elsherbini,J., Cutler,M.B., Kelly,L. and Polz,M.F. 2018-01-24 GenBank