Protein ID | Ophun1|2508 |
Gene name | |
Location | Contig_226:10566..12177 |
Strand | - |
Gene length (bp) | 1611 |
Transcript length (bp) | 1611 |
Coding sequence length (bp) | 1611 |
Protein length (aa) | 537 |
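
The length fields above can be cross-checked against the location string. The following is a minimal sketch, not part of the database itself; `parse_location` is a hypothetical helper. It assumes the reported gene length is the plain difference `end - start`, which is what the values in this record give (the coordinate convention is not stated here).

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(
        r"(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location.strip()
    )
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match["contig"], int(match["start"]), int(match["end"])


# Values copied from the record above.
contig, start, end = parse_location("Contig_226:10566..12177")

# Assumption: length is reported as end - start (not end - start + 1).
span = end - start

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(1611, 3)

print(contig, span, codons, remainder)
```

With the values in this record, `span` is 1611 and the coding sequence divides into 537 codons with no remainder, which agrees with the reported protein length.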
PFAM Domain ID | Short name | Long name | E-value | Start | End |
---|---|---|---|---|---|
PF00150 | Cellulase | Cellulase (glycosyl hydrolase family 5) | 4.1E-36 | 223 | 500 |
Swissprot ID | Swissprot Description | Start | End | E-value |
---|---|---|---|---|
sp|P23548|GUN_PAEPO | Endoglucanase OS=Paenibacillus polymyxa PE=3 SV=2 | 174 | 519 | 6.0E-39 |
sp|P54583|GUN1_ACIC1 | Endoglucanase E1 OS=Acidothermus cellulolyticus (strain ATCC 43068 / 11B) GN=Acel_0614 PE=1 SV=1 | 185 | 519 | 3.0E-37 |
sp|P50400|GUND_CELFI | Endoglucanase D OS=Cellulomonas fimi GN=cenD PE=3 SV=1 | 184 | 519 | 7.0E-26 |
sp|P10474|GUNB_CALSA | Endoglucanase/exoglucanase B OS=Caldicellulosiruptor saccharolyticus GN=celB PE=3 SV=1 | 184 | 514 | 4.0E-21 |
sp|P19487|GUNA_XANCP | Major extracellular endoglucanase OS=Xanthomonas campestris pv. campestris (strain ATCC 33913 / DSM 3586 / NCPPB 528 / LMG 568 / P 25) GN=engXCA PE=1 SV=2 | 189 | 519 | 2.0E-19 |
GO Term | Description | Terminal node |
---|---|---|
GO:0071704 | organic substance metabolic process | Yes |
GO:0004553 | hydrolase activity, hydrolyzing O-glycosyl compounds | Yes |
GO:0003674 | molecular_function | No |
GO:0008152 | metabolic process | No |
GO:0008150 | biological_process | No |
GO:0003824 | catalytic activity | No |
GO:0016787 | hydrolase activity | No |
GO:0016798 | hydrolase activity, acting on glycosyl bonds | No |
SignalP signal predicted | Location (based on Ymax) | D score (significance: > 0.45) |
---|---|---|
No | 1 - 36 | 0.45 |
Domain # | Start | End | Length |
---|---|---|---|
1 | 122 | 144 | 22 |
Type of sequence | Sequence |
---|---|
Locus | Download GenBank file of locus: the gene with 5 kb flanks (if sufficient flanking sequence is available). For use in cloning design programs. NOTE: features (genes or exons) that are only partially contained within the sequence are completely excluded. |
Protein | >Ophun1|2508 MAPRKGSPPATPPLKDLKQDPLESFLAWQPPLKDALPTGSSPPPSPLSSPPRPPLPPLPPPLPQPPQPPKQQPPG RHWTAPPLRIINYGTWVKVRGYRQLPLFQRPRSKRVRTLQRLRRWQLSPCRMILIILAIFLLAIVGYIWRKEARL SAPWIPDAESKLFSKGPLPPPRQSSNISQFQLPLKTRGRAIVDQTGRRFKLSSVNWYGASDELFVVGGLEVQHRD VIAQTILRLGFNSVRLPYSDELVMKNPVIESRLVSANPDLAGKRALDVLEAAVTALTEAGIAVIVNNHITTATWC CGIDPCDSGWANDHLGLLCRVAQTEEQWIHHWEKLMARFVDNPRVIGADLRNEVRGLWGTMPWSRWASAAERCGN RLLSMRPDWLIFVEGTESANDVSGARDRPIRLDVADRLVYSAHVYAWSGWGSWQGRFAQRDYDSFAETMRRNWAY LVDGDVAPVWVGELGAPNNPTNGDAHYWKNLWRFLKDVDADFGYWAINPRKPKDNGSESYSIVADDWVTPVLDYR LKDMVDLMHAS* |
Coding | >Ophun1|2508 ATGGCCCCCCGCAAGGGCTCCCCGCCAGCGACCCCTCCCCTGAAGGACCTAAAGCAAGACCCCCTGGAAAGCTTC CTGGCCTGGCAGCCCCCGCTGAAAGATGCCCTCCCCACCGGATCATCACCGCCGCCATCCCCGCTATCGTCACCA CCACGACCACCACTACCACCACTACCACCACCACTACCGCAACCACCACAACCTCCGAAACAACAACCCCCAGGG CGGCATTGGACGGCGCCGCCCCTCCGAATCATCAACTACGGCACCTGGGTCAAGGTCAGGGGCTACCGCCAGCTC CCGCTCTTCCAGCGGCCGCGGTCCAAGCGCGTGCGGACGCTGCAGCGGCTGCGAAGGTGGCAGCTCTCCCCCTGC AGAATGATTCTCATCATCCTCGCCATCTTCCTCCTCGCCATCGTCGGCTACATCTGGCGGAAAGAAGCCCGGCTA TCAGCCCCCTGGATCCCCGATGCCGAATCCAAGCTCTTCAGCAAAGGGCCCCTCCCACCACCGCGGCAGAGCAGC AACATCTCGCAGTTCCAACTCCCGCTCAAGACGCGCGGAAGAGCCATCGTCGATCAGACGGGACGACGCTTTAAG CTCTCGTCGGTGAATTGGTACGGCGCGAGCGATGAGCTCTTCGTCGTGGGCGGACTCGAAGTCCAGCATCGCGAC GTCATCGCCCAGACCATCCTCCGCCTCGGCTTCAACAGCGTTCGCCTCCCTTACTCGGACGAGCTGGTGATGAAG AACCCCGTCATCGAGAGCAGGCTCGTCAGCGCCAACCCGGACCTCGCGGGGAAGCGCGCCCTCGACGTCTTGGAA GCTGCCGTGACGGCCCTGACCGAGGCGGGCATCGCCGTCATCGTGAACAATCATATCACGACGGCCACGTGGTGC TGCGGCATCGATCCTTGTGACTCGGGCTGGGCCAACGATCATCTCGGTCTGCTGTGTCGCGTCGCACAGACGGAA GAGCAATGGATTCACCACTGGGAGAAACTCATGGCCCGCTTCGTCGATAACCCTCGCGTCATCGGCGCCGACCTC CGCAACGAGGTCCGCGGCCTCTGGGGAACCATGCCATGGTCCCGCTGGGCCTCGGCCGCCGAACGATGCGGAAAC CGCCTCCTCTCCATGCGCCCCGACTGGCTCATCTTCGTCGAGGGCACCGAATCAGCAAACGACGTATCCGGAGCC CGAGACCGACCCATCCGCCTCGACGTCGCCGACCGCCTCGTCTACTCAGCCCACGTCTACGCCTGGTCCGGCTGG GGCAGTTGGCAGGGCCGCTTCGCCCAGCGCGACTACGACTCCTTCGCCGAAACCATGCGCCGCAACTGGGCCTAC CTCGTCGACGGCGACGTTGCCCCCGTCTGGGTCGGCGAGCTGGGCGCCCCCAATAACCCTACCAACGGCGACGCC CATTACTGGAAGAACCTGTGGCGCTTTCTCAAAGACGTCGACGCCGACTTTGGATACTGGGCCATCAACCCTCGC AAGCCCAAGGATAACGGCTCAGAGTCGTACTCGATCGTCGCTGATGATTGGGTCACGCCCGTGCTGGACTACCGA CTCAAGGACATGGTTGACCTGATGCACGCTTCTTGA |
Transcript | >Ophun1|2508 ATGGCCCCCCGCAAGGGCTCCCCGCCAGCGACCCCTCCCCTGAAGGACCTAAAGCAAGACCCCCTGGAAAGCTTC CTGGCCTGGCAGCCCCCGCTGAAAGATGCCCTCCCCACCGGATCATCACCGCCGCCATCCCCGCTATCGTCACCA CCACGACCACCACTACCACCACTACCACCACCACTACCGCAACCACCACAACCTCCGAAACAACAACCCCCAGGG CGGCATTGGACGGCGCCGCCCCTCCGAATCATCAACTACGGCACCTGGGTCAAGGTCAGGGGCTACCGCCAGCTC CCGCTCTTCCAGCGGCCGCGGTCCAAGCGCGTGCGGACGCTGCAGCGGCTGCGAAGGTGGCAGCTCTCCCCCTGC AGAATGATTCTCATCATCCTCGCCATCTTCCTCCTCGCCATCGTCGGCTACATCTGGCGGAAAGAAGCCCGGCTA TCAGCCCCCTGGATCCCCGATGCCGAATCCAAGCTCTTCAGCAAAGGGCCCCTCCCACCACCGCGGCAGAGCAGC AACATCTCGCAGTTCCAACTCCCGCTCAAGACGCGCGGAAGAGCCATCGTCGATCAGACGGGACGACGCTTTAAG CTCTCGTCGGTGAATTGGTACGGCGCGAGCGATGAGCTCTTCGTCGTGGGCGGACTCGAAGTCCAGCATCGCGAC GTCATCGCCCAGACCATCCTCCGCCTCGGCTTCAACAGCGTTCGCCTCCCTTACTCGGACGAGCTGGTGATGAAG AACCCCGTCATCGAGAGCAGGCTCGTCAGCGCCAACCCGGACCTCGCGGGGAAGCGCGCCCTCGACGTCTTGGAA GCTGCCGTGACGGCCCTGACCGAGGCGGGCATCGCCGTCATCGTGAACAATCATATCACGACGGCCACGTGGTGC TGCGGCATCGATCCTTGTGACTCGGGCTGGGCCAACGATCATCTCGGTCTGCTGTGTCGCGTCGCACAGACGGAA GAGCAATGGATTCACCACTGGGAGAAACTCATGGCCCGCTTCGTCGATAACCCTCGCGTCATCGGCGCCGACCTC CGCAACGAGGTCCGCGGCCTCTGGGGAACCATGCCATGGTCCCGCTGGGCCTCGGCCGCCGAACGATGCGGAAAC CGCCTCCTCTCCATGCGCCCCGACTGGCTCATCTTCGTCGAGGGCACCGAATCAGCAAACGACGTATCCGGAGCC CGAGACCGACCCATCCGCCTCGACGTCGCCGACCGCCTCGTCTACTCAGCCCACGTCTACGCCTGGTCCGGCTGG GGCAGTTGGCAGGGCCGCTTCGCCCAGCGCGACTACGACTCCTTCGCCGAAACCATGCGCCGCAACTGGGCCTAC CTCGTCGACGGCGACGTTGCCCCCGTCTGGGTCGGCGAGCTGGGCGCCCCCAATAACCCTACCAACGGCGACGCC CATTACTGGAAGAACCTGTGGCGCTTTCTCAAAGACGTCGACGCCGACTTTGGATACTGGGCCATCAACCCTCGC AAGCCCAAGGATAACGGCTCAGAGTCGTACTCGATCGTCGCTGATGATTGGGTCACGCCCGTGCTGGACTACCGA CTCAAGGACATGGTTGACCTGATGCACGCTTCTTGA |
Gene | >Ophun1|2508 ATGGCCCCCCGCAAGGGCTCCCCGCCAGCGACCCCTCCCCTGAAGGACCTAAAGCAAGACCCCCTGGAAAGCTTC CTGGCCTGGCAGCCCCCGCTGAAAGATGCCCTCCCCACCGGATCATCACCGCCGCCATCCCCGCTATCGTCACCA CCACGACCACCACTACCACCACTACCACCACCACTACCGCAACCACCACAACCTCCGAAACAACAACCCCCAGGG CGGCATTGGACGGCGCCGCCCCTCCGAATCATCAACTACGGCACCTGGGTCAAGGTCAGGGGCTACCGCCAGCTC CCGCTCTTCCAGCGGCCGCGGTCCAAGCGCGTGCGGACGCTGCAGCGGCTGCGAAGGTGGCAGCTCTCCCCCTGC AGAATGATTCTCATCATCCTCGCCATCTTCCTCCTCGCCATCGTCGGCTACATCTGGCGGAAAGAAGCCCGGCTA TCAGCCCCCTGGATCCCCGATGCCGAATCCAAGCTCTTCAGCAAAGGGCCCCTCCCACCACCGCGGCAGAGCAGC AACATCTCGCAGTTCCAACTCCCGCTCAAGACGCGCGGAAGAGCCATCGTCGATCAGACGGGACGACGCTTTAAG CTCTCGTCGGTGAATTGGTACGGCGCGAGCGATGAGCTCTTCGTCGTGGGCGGACTCGAAGTCCAGCATCGCGAC GTCATCGCCCAGACCATCCTCCGCCTCGGCTTCAACAGCGTTCGCCTCCCTTACTCGGACGAGCTGGTGATGAAG AACCCCGTCATCGAGAGCAGGCTCGTCAGCGCCAACCCGGACCTCGCGGGGAAGCGCGCCCTCGACGTCTTGGAA GCTGCCGTGACGGCCCTGACCGAGGCGGGCATCGCCGTCATCGTGAACAATCATATCACGACGGCCACGTGGTGC TGCGGCATCGATCCTTGTGACTCGGGCTGGGCCAACGATCATCTCGGTCTGCTGTGTCGCGTCGCACAGACGGAA GAGCAATGGATTCACCACTGGGAGAAACTCATGGCCCGCTTCGTCGATAACCCTCGCGTCATCGGCGCCGACCTC CGCAACGAGGTCCGCGGCCTCTGGGGAACCATGCCATGGTCCCGCTGGGCCTCGGCCGCCGAACGATGCGGAAAC CGCCTCCTCTCCATGCGCCCCGACTGGCTCATCTTCGTCGAGGGCACCGAATCAGCAAACGACGTATCCGGAGCC CGAGACCGACCCATCCGCCTCGACGTCGCCGACCGCCTCGTCTACTCAGCCCACGTCTACGCCTGGTCCGGCTGG GGCAGTTGGCAGGGCCGCTTCGCCCAGCGCGACTACGACTCCTTCGCCGAAACCATGCGCCGCAACTGGGCCTAC CTCGTCGACGGCGACGTTGCCCCCGTCTGGGTCGGCGAGCTGGGCGCCCCCAATAACCCTACCAACGGCGACGCC CATTACTGGAAGAACCTGTGGCGCTTTCTCAAAGACGTCGACGCCGACTTTGGATACTGGGCCATCAACCCTCGC AAGCCCAAGGATAACGGCTCAGAGTCGTACTCGATCGTCGCTGATGATTGGGTCACGCCCGTGCTGGACTACCGA CTCAAGGACATGGTTGACCTGATGCACGCTTCTTGA |
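
The sequence cells above are shown wrapped, with a space wherever the original display broke a line, and the locus row describes a window of 5 kb on either side of the gene. The sketch below shows one way to rejoin a wrapped cell and to compute such a window. Both function names and the `contig_length` parameter are hypothetical. It also assumes 1-based inclusive coordinates and that a short flank is simply clipped at the contig boundary, neither of which this record confirms.

```python
def parse_sequence_cell(cell: str) -> tuple[str, str]:
    """Split a '>ID CHUNK CHUNK ...' table cell into (identifier, sequence)."""
    # Drop surrounding whitespace and the trailing table pipe, if present.
    tokens = cell.strip().rstrip("|").split()
    if not tokens or not tokens[0].startswith(">"):
        raise ValueError("cell does not start with a FASTA-style '>' header")
    # The first token is the header; the remaining tokens are sequence chunks.
    return tokens[0][1:], "".join(tokens[1:])


def flank_window(
    start: int, end: int, contig_length: int, flank: int = 5000
) -> tuple[int, int]:
    """Extend [start, end] by `flank` bases per side, clipped to the contig.

    Assumes 1-based inclusive coordinates (an assumption, not stated
    in the record).
    """
    return max(1, start - flank), min(contig_length, end + flank)


# Toy cell in the same layout as the rows above (not the real sequence).
identifier, sequence = parse_sequence_cell(">Ophun1|2508 ATGGCC CCCTGA |")
print(identifier, sequence, len(sequence))

# Hypothetical contig length of 13000 bp: the right flank is clipped.
print(flank_window(10566, 12177, contig_length=13000))
```

Applied to the real cells, the rejoined coding sequence can then be compared against the reported length of 1611 bp.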