Protein
- Genbank accession
- CAM0098221.1 [GenBank]
- Protein name
- long tail fiber protein distal subunit
- RBP type
-
TF
- Protein sequence
-
MAELRSTTAIGGNIVWHGGNLRFDPQGETVLYNGHKIFTEHDTPLPSELGNGGTTSAYTKTESNATFAPIHTAGYVRKDGDAMTGKLTTSGNDIEIQGSSPRLTLLDWTDSKKWFVVGDGGNLTVREDTTSTTRVSIETNVVNLNVDNFNLRGKTAFRSYDSWLRINDTGEFTDGIYFGSSRVRTDGELYVGTTTTNGLYVTPTIFQFQGNRVFHNAYHPNADKWTTARTLTLSGDVSGSVSWDGSANASLSVTVANDSHSHDGRYYTETESDARYVNVAGDTMTGDLVVEDSMIKVGNVSGDNYMRLEQIYTDDYGFTFTHGNASAIRNDQGGLNQAIVLGDSETTGTIFGVSYNKDGSWTRSANLTAAGELYIGLGATARAFHDAYHPNADKWTNARTLTLSGDASGSVVWDGSTNATISVTVANDSHSHDGRYYTETESDSRFAPINGAGYVAKSGDTMTGSIQWGSSSYKGLYFGHLNEAASGFNGAWSFVRQGPASGTLEIGSDNEVNFYETDSFVKRIFMTLNDGTVNAVKFVGALQGNASTATSAAKWTTARTLTTTLTGDVTGSASMSVDGSGNKTVSITTTVANDSHMHTRLDPLNANWNTQTDEFRTGSASNVAGAPTTAFINWIQWGHNGGSKFRHTLYSETGAIDQLHYAYRNNTSTTAADHAQQRIFMDNYHPNADKWTTARTLTTTLTGDVSGSASMSVDGSGNKTVSITTTVANDSHSHDGRYYTETESDARFAQYSEFATLQTYADIAGKYLGKVSTSGVLEVARYIDFHTTNSTADYDIRLDCSASNLLSVTGGDFTCSGNVTAYSDIRLKKNIEKIDDALDKVTQLNGYTFDRADVDCDRQTGVIAQEVQAVLPEAVVEGEDHLSVAYGNLSGLLIESIKELNEKVETLQSEVHELKKPWWKKILGL
- Physico‐chemical
properties -
protein length: 925 AA molecular weight: 100044,92020 Da isoelectric point: 5,03175 aromaticity: 0,09405 hydropathy: -0,44876
Domains
Domains [InterPro]
Legend:
Pfam
SMART
CDD
TIGRFAM
HAMAP
SUPFAM
PRINTS
Gene3D
PANTHER
Other
Taxonomy
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
CAM0098221.1
[NCBI]
Genbank nucleotide accession
OZ196514
[NCBI]
CDS location
range 195154 -> 197931
strand -
strand -
CDS
ATGGCAGAACTAAGATCTACTACCGCAATCGGTGGTAACATAGTATGGCACGGAGGGAACTTACGCTTTGATCCTCAAGGCGAAACAGTTCTGTACAACGGTCATAAGATTTTTACTGAACACGACACACCACTGCCAAGCGAACTAGGCAACGGTGGGACAACTTCGGCATATACAAAAACAGAATCAAACGCGACATTTGCCCCGATTCATACGGCTGGTTATGTGCGCAAAGATGGCGATGCAATGACCGGTAAGTTGACTACTTCTGGTAATGATATTGAAATTCAAGGCAGCTCGCCTCGCTTGACGTTACTGGACTGGACTGATTCGAAGAAATGGTTCGTGGTTGGTGATGGTGGCAATTTAACAGTTCGCGAAGATACAACATCAACTACGCGTGTGTCGATTGAGACTAACGTCGTAAATCTAAACGTTGATAATTTCAATCTACGCGGCAAGACGGCATTTCGCTCGTACGATTCATGGCTACGAATCAACGATACTGGTGAGTTCACTGATGGTATTTACTTTGGTTCGTCTCGCGTTCGCACTGATGGTGAATTGTATGTTGGTACAACTACTACAAATGGATTGTACGTAACGCCTACTATATTTCAGTTTCAAGGCAATCGCGTTTTTCACAATGCGTATCACCCAAATGCTGACAAATGGACGACGGCAAGAACGCTAACACTTTCAGGTGACGTTAGTGGCTCAGTGTCATGGGACGGATCTGCAAATGCTTCATTGTCGGTAACGGTTGCAAACGATAGTCACTCGCATGACGGTCGTTACTACACTGAAACAGAATCTGATGCGCGTTATGTGAACGTTGCAGGTGATACGATGACTGGTGATTTAGTCGTCGAAGATAGTATGATTAAAGTTGGTAATGTGTCAGGTGACAATTACATGAGACTTGAGCAGATCTATACAGACGACTATGGATTTACGTTCACTCACGGTAATGCGTCTGCGATCCGCAACGACCAAGGTGGGTTGAATCAAGCAATCGTACTAGGGGATTCTGAAACGACTGGCACGATCTTCGGCGTTTCATACAACAAAGATGGGTCATGGACGCGTTCTGCAAATCTGACTGCGGCTGGTGAATTGTACATCGGCTTGGGTGCTACAGCACGTGCATTTCACGATGCGTATCACCCAAATGCTGATAAATGGACGAATGCGCGTACGCTGACGCTATCAGGTGATGCTAGTGGTTCAGTAGTATGGGATGGCTCAACTAACGCAACAATTTCTGTAACTGTTGCAAACGATAGTCACTCGCATGACGGTCGTTACTACACTGAAACAGAATCTGATTCTCGATTCGCGCCTATCAACGGCGCGGGATATGTAGCGAAGTCTGGCGATACTATGACTGGTTCGATTCAGTGGGGTTCATCTTCGTATAAAGGTCTGTACTTCGGTCATCTAAACGAAGCCGCGTCTGGGTTTAACGGGGCGTGGTCGTTTGTTAGACAAGGACCTGCGTCTGGAACACTAGAAATCGGCTCTGATAACGAAGTTAATTTCTACGAAACTGATTCGTTTGTTAAACGCATCTTTATGACGTTGAATGACGGTACAGTGAATGCGGTTAAATTTGTCGGTGCACTTCAGGGTAATGCTTCTACTGCGACTAGCGCGGCGAAATGGACAACTGCAAGAACGTTGACAACAACGCTAACTGGTGATGTCACGGGTTCTGCTAGTATGTCAGTCGATGGTAGTGGGAACAAAACAGTATCGATCACGACAACTGTTGCAAACGATAGTCACATGCACACACGACTAGACCCGCTAAACGCGAACTGGAATACTCAAACTGATGAATTCAGAACGGGTTCAGCTTCGAATGTTGCAGGCGCACCAACTACTGCATTTATCAACTGGATTCAGTGGGGTCATAATGGCGGCTCGAAGTTCAGACATACGCTATATTCAGAGACTGGTGCTATTGACCAACTGCATTACGCGTATCGAAACAACACGAGCACAACGGCTGCTGATCATGCGCAACAACGCATTTTCATGGACAACTATCACCCGAATGCTGATAAATGGACAACTGCTCGTACGTTGACAACGACACTTACAGGTGACGTGTCAGGTTCTGCTAGTATGTCAGTTGACGGTAGTGGAAACAAAACAGTATCGATTACGACTACTGTTGCAAACGATAGTCACTCGCATGATGGTCGTTACTACACCGAAACAGAATCTGATGCGCGATTTGCGCAATATAGCGAGTTTGCAACGCTGCAAACATATGCTGACATCGCAGGGAAGTATCTAGGAAAGGTATCAACATCGGGTGTTCTTGAAGTCGCGAGATATATTGATTTTCATACGACGAATAGTACCGCCGACTATGATATTCGATTGGATTGTTCAGCGTCTAACTTGCTGTCAGTGACTGGCGGTGATTTCACGTGTTCTGGTAACGTTACTGCATACTCTGATATTCGCTTGAAGAAGAACATCGAGAAGATCGATGATGCTCTTGATAAAGTGACTCAATTGAACGGGTATACGTTTGATCGTGCGGATGTTGATTGCGACCGACAAACGGGCGTGATTGCTCAAGAGGTGCAAGCTGTTCTACCTGAGGCTGTTGTCGAGGGTGAAGACCATTTAAGCGTCGCGTACGGCAATCTAAGCGGTCTTTTGATTGAATCGATTAAAGAGCTTAACGAGAAAGTTGAGACGCTTCAGAGCGAAGTACACGAGCTTAAAAAACCTTGGTGGAAGAAAATTCTGGGCTTATAA
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
9a8b0c844879e3e41b1ef94107df9f0e64d81a6826e1bfcc7d27979f072abfe3
Literature
No literature entries available.