Protein ID | Agabi119p4|016200 |
Gene name | |
Location | scaffold_10:367578..369307 |
Strand | - |
Gene length (bp) | 1729 |
Transcript length (bp) | 1254 |
Coding sequence length (bp) | 1254 |
Protein length (aa) | 418 |
PFAM Domain ID | Short name | Long name | E-value | Start | End |
---|---|---|---|---|---|
PF00150 | Cellulase | Cellulase (glycosyl hydrolase family 5) | 4.3E-55 | 119 | 372 |
PF00734 | CBM_1 | Fungal cellulose binding domain | 6.1E-12 | 27 | 54 |
Swissprot ID | Swissprot Description | Start | End | E-value |
---|---|---|---|---|
sp|P07982|GUN2_HYPJE | Endoglucanase EG-II OS=Hypocrea jecorina GN=egl2 PE=1 SV=1 | 23 | 407 | 1.0E-176 |
sp|Q04469|GUN1_CRYFL | Endoglucanase 1 OS=Cryptococcus flavus GN=CMC1 PE=2 SV=1 | 88 | 408 | 1.0E-79 |
sp|Q12624|GUN3_HUMIN | Endoglucanase 3 OS=Humicola insolens GN=CMC3 PE=1 SV=1 | 9 | 401 | 6.0E-48 |
sp|Q7SDR1|GUN3_NEUCR | Endoglucanase gh5-1 OS=Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) GN=gh5-1 PE=1 SV=1 | 15 | 401 | 8.0E-48 |
sp|Q5BDU5|EGLA_EMENI | Endo-beta-1,4-glucanase A OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) GN=eglA PE=1 SV=1 | 84 | 391 | 2.0E-38 |
Swissprot ID | Swissprot Description | Start | End | E-value |
---|---|---|---|---|
sp|P07982|GUN2_HYPJE | Endoglucanase EG-II OS=Hypocrea jecorina GN=egl2 PE=1 SV=1 | 23 | 407 | 1.0E-176 |
sp|Q04469|GUN1_CRYFL | Endoglucanase 1 OS=Cryptococcus flavus GN=CMC1 PE=2 SV=1 | 88 | 408 | 1.0E-79 |
sp|Q12624|GUN3_HUMIN | Endoglucanase 3 OS=Humicola insolens GN=CMC3 PE=1 SV=1 | 9 | 401 | 6.0E-48 |
sp|Q7SDR1|GUN3_NEUCR | Endoglucanase gh5-1 OS=Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) GN=gh5-1 PE=1 SV=1 | 15 | 401 | 8.0E-48 |
sp|Q5BDU5|EGLA_EMENI | Endo-beta-1,4-glucanase A OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) GN=eglA PE=1 SV=1 | 84 | 391 | 2.0E-38 |
sp|Q96WQ8|EGLB_ASPKW | Probable endo-beta-1,4-glucanase B OS=Aspergillus kawachii (strain NBRC 4308) GN=eglB PE=3 SV=1 | 134 | 389 | 2.0E-34 |
sp|P23044|GUN_ROBSP | Endoglucanase 1 OS=Robillarda sp. (strain Y-20) GN=eg 1 PE=1 SV=2 | 120 | 379 | 3.0E-34 |
sp|P58599|GUN_RALSO | Endoglucanase OS=Ralstonia solanacearum (strain GMI1000) GN=egl PE=3 SV=1 | 80 | 391 | 2.0E-33 |
sp|Q1HFS8|EGLB_EMENI | Endo-beta-1,4-glucanase B OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) GN=eglB PE=1 SV=1 | 79 | 391 | 3.0E-33 |
sp|A1DME8|EGLB_NEOFI | Probable endo-beta-1,4-glucanase B OS=Neosartorya fischeri (strain ATCC 1020 / DSM 3700 / FGSC A1164 / NRRL 181) GN=eglB PE=3 SV=1 | 136 | 389 | 9.0E-33 |
sp|P17974|GUN_RALSL | Endoglucanase OS=Ralstonia solanacearum GN=egl PE=1 SV=2 | 83 | 391 | 1.0E-32 |
sp|B8MW97|EGLB_ASPFN | probable endo-beta-1,4-glucanase B OS=Aspergillus flavus (strain ATCC 200026 / FGSC A1120 / NRRL 3357 / JCM 12722 / SRRC 167) GN=eglB PE=3 SV=1 | 134 | 391 | 8.0E-32 |
sp|Q2UPQ4|EGLB_ASPOR | Probable endo-beta-1,4-glucanase B OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40) GN=eglB PE=3 SV=1 | 134 | 391 | 8.0E-32 |
sp|O74706|EGLB_ASPNG | Endo-beta-1,4-glucanase B OS=Aspergillus niger GN=eglB PE=1 SV=1 | 136 | 389 | 7.0E-31 |
sp|A2QPC3|EGLB_ASPNC | Probable endo-beta-1,4-glucanase B OS=Aspergillus niger (strain CBS 513.88 / FGSC A1513) GN=eglB PE=3 SV=1 | 136 | 389 | 7.0E-31 |
sp|I1S3C6|XYND_GIBZE | Endo-1,4-beta-xylanase D OS=Gibberella zeae (strain PH-1 / ATCC MYA-4620 / FGSC 9075 / NRRL 31084) GN=XYLD PE=1 SV=1 | 7 | 93 | 1.0E-11 |
sp|G0RV93|CIP2_HYPJQ | 4-O-methyl-glucuronoyl methylesterase OS=Hypocrea jecorina (strain QM6a) GN=cip2 PE=1 SV=1 | 13 | 69 | 3.0E-10 |
sp|Q12647|GUNB_NEOPA | Endoglucanase B OS=Neocallimastix patriciarum GN=CELB PE=2 SV=1 | 127 | 302 | 4.0E-10 |
sp|P10477|CELE_CLOTH | Cellulase/esterase CelE OS=Clostridium thermocellum (strain ATCC 27405 / DSM 1237 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) GN=celE PE=1 SV=2 | 128 | 302 | 6.0E-10 |
sp|Q9HEZ1|XYNA_PHACH | Endo-1,4-beta-xylanase A OS=Phanerochaete chrysosporium GN=xynA PE=1 SV=1 | 20 | 112 | 6.0E-10 |
sp|P49075|GUX3_AGABI | Exoglucanase 3 OS=Agaricus bisporus GN=cel3 PE=1 SV=1 | 20 | 59 | 1.0E-09 |
sp|Q4WFK4|CBHC_ASPFU | Probable 1,4-beta-D-glucan cellobiohydrolase C OS=Neosartorya fumigata (strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100) GN=cbhC PE=3 SV=1 | 10 | 60 | 1.0E-09 |
sp|B0XWL3|CBHC_ASPFC | Probable 1,4-beta-D-glucan cellobiohydrolase C OS=Neosartorya fumigata (strain CEA10 / CBS 144.89 / FGSC A1163) GN=cbhC PE=3 SV=1 | 10 | 60 | 1.0E-09 |
sp|A1DJQ7|CBHC_NEOFI | Probable 1,4-beta-D-glucan cellobiohydrolase C OS=Neosartorya fischeri (strain ATCC 1020 / DSM 3700 / FGSC A1164 / NRRL 181) GN=cbhC PE=3 SV=1 | 10 | 60 | 2.0E-09 |
sp|B7SIW2|XYNC_PHACH | Endo-1,4-beta-xylanase C OS=Phanerochaete chrysosporium GN=xynC PE=1 SV=1 | 20 | 89 | 2.0E-09 |
sp|G4MPQ7|XYN6_MAGO7 | Endo-1,4-beta-xylanase 6 OS=Magnaporthe oryzae (strain 70-15 / ATCC MYA-4617 / FGSC 8958) GN=XYL6 PE=3 SV=1 | 26 | 104 | 2.0E-09 |
sp|Q8NJ73|XYN6_MAGGR | Endo-1,4-beta-xylanase 6 OS=Magnaporthe grisea GN=XYL6 PE=1 SV=1 | 26 | 104 | 2.0E-09 |
sp|P46239|GUNF_FUSOX | Putative endoglucanase type F OS=Fusarium oxysporum PE=2 SV=1 | 9 | 74 | 9.0E-09 |
sp|A1CCN4|CBHC_ASPCL | Probable 1,4-beta-D-glucan cellobiohydrolase C OS=Aspergillus clavatus (strain ATCC 1007 / CBS 513.65 / DSM 816 / NCTC 3887 / NRRL 1) GN=cbhC PE=3 SV=1 | 19 | 60 | 2.0E-08 |
sp|P46236|GUNB_FUSOX | Putative endoglucanase type B OS=Fusarium oxysporum PE=2 SV=1 | 26 | 71 | 3.0E-08 |
sp|B2ABX7|CEL6A_PODAN | 1,4-beta-D-glucan cellobiohydrolase CEL6A OS=Podospora anserina (strain S / ATCC MYA-4624 / DSM 980 / FGSC 10383) GN=CEL6A PE=1 SV=1 | 25 | 67 | 3.0E-08 |
sp|A1C8U0|MANF_ASPCL | Mannan endo-1,4-beta-mannosidase F OS=Aspergillus clavatus (strain ATCC 1007 / CBS 513.65 / DSM 816 / NCTC 3887 / NRRL 1) GN=manF PE=3 SV=1 | 6 | 116 | 3.0E-08 |
sp|P28623|GUND_CLOC7 | Endoglucanase D OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) GN=engD PE=1 SV=2 | 128 | 302 | 6.0E-08 |
sp|P07987|GUX2_HYPJE | Exoglucanase 2 OS=Hypocrea jecorina GN=cbh2 PE=1 SV=1 | 24 | 92 | 7.0E-08 |
sp|Q5B2E8|CBHC_EMENI | 1,4-beta-D-glucan cellobiohydrolase C OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) GN=cbhC PE=1 SV=2 | 10 | 60 | 7.0E-08 |
sp|Q0CFP1|CBHC_ASPTN | Probable 1,4-beta-D-glucan cellobiohydrolase C OS=Aspergillus terreus (strain NIH 2624 / FGSC A1156) GN=cbhC PE=3 SV=1 | 20 | 59 | 2.0E-07 |
sp|Q5AR04|MANF_EMENI | Probable mannan endo-1,4-beta-mannosidase F OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) GN=manF PE=3 SV=2 | 2 | 60 | 8.0E-07 |
sp|A2QYR9|CBHC_ASPNC | Probable 1,4-beta-D-glucan cellobiohydrolase C OS=Aspergillus niger (strain CBS 513.88 / FGSC A1513) GN=cbhC PE=3 SV=1 | 20 | 60 | 9.0E-07 |
sp|Q8WZJ4|XYNA_TALFU | 1,4-beta-D-glucan cellobiohydrolase xynA OS=Talaromyces funiculosus GN=xynA PE=1 SV=1 | 24 | 58 | 1.0E-06 |
sp|A3DJ77|GUNC_CLOTH | Endoglucanase C OS=Clostridium thermocellum (strain ATCC 27405 / DSM 1237 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) GN=celC PE=3 SV=1 | 136 | 269 | 2.0E-06 |
sp|P23340|GUNC_CLOSF | Endoglucanase C307 OS=Clostridium sp. (strain F1) GN=celC307 PE=1 SV=1 | 136 | 269 | 2.0E-06 |
sp|P0C2S3|GUNC_CLOTM | Endoglucanase C OS=Clostridium thermocellum GN=celC PE=1 SV=1 | 136 | 269 | 2.0E-06 |
sp|Q4WBS1|MANF_ASPFU | Mannan endo-1,4-beta-mannosidase F OS=Neosartorya fumigata (strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100) GN=manF PE=1 SV=2 | 13 | 116 | 2.0E-06 |
sp|B0Y9E7|MANF_ASPFC | Probable mannan endo-1,4-beta-mannosidase F OS=Neosartorya fumigata (strain CEA10 / CBS 144.89 / FGSC A1163) GN=manF PE=3 SV=2 | 13 | 116 | 2.0E-06 |
sp|P28621|GUNB_CLOC7 | Endoglucanase B OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) GN=engB PE=3 SV=1 | 136 | 302 | 3.0E-06 |
sp|A1DBV1|MANF_NEOFI | Probable mannan endo-1,4-beta-mannosidase F OS=Neosartorya fischeri (strain ATCC 1020 / DSM 3700 / FGSC A1164 / NRRL 181) GN=manF PE=3 SV=1 | 9 | 64 | 3.0E-06 |
sp|Q4WBW4|AXE1_ASPFU | Probable acetylxylan esterase A OS=Neosartorya fumigata (strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100) GN=axeA PE=3 SV=1 | 24 | 58 | 4.0E-06 |
sp|P17901|GUNA_CLOCE | Endoglucanase A OS=Clostridium cellulolyticum (strain ATCC 35319 / DSM 5812 / JCM 6584 / H10) GN=celCCA PE=1 SV=1 | 117 | 302 | 6.0E-06 |
sp|A1DBP9|AXE1_NEOFI | Probable acetylxylan esterase A OS=Neosartorya fischeri (strain ATCC 1020 / DSM 3700 / FGSC A1164 / NRRL 181) GN=axeA PE=3 SV=1 | 24 | 58 | 8.0E-06 |
sp|B8NIV9|MANF_ASPFN | Probable mannan endo-1,4-beta-mannosidase F OS=Aspergillus flavus (strain ATCC 200026 / FGSC A1120 / NRRL 3357 / JCM 12722 / SRRC 167) GN=manF PE=3 SV=1 | 6 | 60 | 9.0E-06 |
sp|Q8NJP6|AXE1_TALPU | Acetylxylan esterase A OS=Talaromyces purpurogenus GN=axeA PE=1 SV=1 | 22 | 58 | 9.0E-06 |
sp|Q2U2I3|MANF_ASPOR | Probable mannan endo-1,4-beta-mannosidase F OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40) GN=manF PE=3 SV=2 | 6 | 60 | 1.0E-05 |
GO Term | Description | Terminal node |
---|---|---|
GO:0005975 | carbohydrate metabolic process | Yes |
GO:0004553 | hydrolase activity, hydrolyzing O-glycosyl compounds | Yes |
GO:0071704 | organic substance metabolic process | Yes |
GO:0005576 | extracellular region | Yes |
GO:0030248 | cellulose binding | Yes |
GO:0003824 | catalytic activity | No |
GO:0030246 | carbohydrate binding | No |
GO:0044238 | primary metabolic process | No |
GO:0030247 | polysaccharide binding | No |
GO:0008150 | biological_process | No |
GO:0008152 | metabolic process | No |
GO:0016798 | hydrolase activity, acting on glycosyl bonds | No |
GO:0005488 | binding | No |
GO:0003674 | molecular_function | No |
GO:0005575 | cellular_component | No |
GO:0110165 | cellular anatomical entity | No |
GO:0016787 | hydrolase activity | No |
SignalP signal predicted | Location (based on Ymax) |
D score (significance: > 0.45) |
---|---|---|
Yes | 1 - 25 | 0.45 |
Type of sequence | Sequence |
---|---|
Locus | Download genbank file of locus
The gene with 5 kb flanks (if sufficient flanking sequence is available). For use in cloning design programs. NOTE: features (genes or exons) that are only partially contained within the sequence are completely excluded. |
Protein | >Agabi119p4|016200 MMSRKLLQISGFVLTSALTVRGQAAVWQQCGGTGWTGDTTCVSGSVCTPLNEFYSQCIPGEAPTSTVTSPTPTTT TRPPVATGNGKLKFTGVNIAGFDFGCNGDGSCTASAAWPPLTKYYGHDGEGQMQHFVNDDGFNTFRLPVGWQFLT GDVEGGPIVAANLAKYDDLVQACLNTGAFCIIDIHNYARWDRGIIGQGGPTNDEFAALWSAIASKYASEERIMFG VMNEPHDVPDINAWAQSVQEVVTAIREAGATTQLILLPGNNWTSAETFVSNGSAAALQKVTNPNGSTDNLIFDVH KYLDSDNSGTHPECTTNNIDRAWAPLADWLRENKRQAINTETGGGNTDSCVQFMCQQVAYQAANSDVYLGFVGWG AGNFLTNYVLSETPTNNGNGWTDSLLVEKCLAPTSGAQNGLF* |
Coding | >Agabi119p4|016200 ATGATGTCGAGGAAACTTTTGCAAATTTCTGGTTTTGTTTTGACCTCAGCGTTGACCGTACGTGGTCAGGCTGCA GTTTGGCAACAATGTGGTGGAACTGGTTGGACAGGCGATACGACTTGTGTTTCTGGTTCAGTCTGTACTCCTCTC AATGAATTTTATTCCCAATGTATTCCAGGCGAAGCTCCTACGTCCACTGTAACATCACCGACACCCACCACTACC ACTAGGCCGCCCGTAGCCACCGGTAACGGTAAACTCAAGTTTACCGGTGTCAATATCGCTGGATTCGACTTCGGA TGTAATGGCGATGGTTCATGCACAGCTAGTGCCGCTTGGCCCCCTTTGACTAAATACTATGGTCATGATGGCGAA GGACAAATGCAACACTTCGTCAATGACGATGGGTTTAATACGTTCCGTTTACCCGTCGGTTGGCAATTCTTGACC GGTGACGTCGAGGGTGGACCAATCGTTGCTGCGAACTTGGCTAAGTACGATGACCTTGTGCAGGCTTGTCTGAAT ACAGGTGCATTTTGCATTATTGATATACATAATTACGCGAGGTGGGATAGAGGCATAATTGGACAAGGTGGACCT ACCAATGATGAATTTGCCGCCTTGTGGTCAGCGATCGCCAGCAAATATGCATCAGAAGAAAGAATCATGTTTGGA GTTATGAATGAACCGCATGATGTTCCCGATATCAATGCTTGGGCTCAATCTGTTCAAGAGGTTGTAACTGCCATC CGCGAAGCCGGTGCAACGACCCAGCTTATCCTCCTTCCTGGCAACAACTGGACATCAGCAGAAACCTTTGTCTCT AATGGTTCTGCCGCCGCATTACAAAAAGTGACGAACCCCAACGGATCTACGGACAATCTGATTTTCGATGTTCAC AAATATCTCGACAGTGATAATTCCGGCACTCATCCTGAATGTACGACGAACAACATTGATCGCGCATGGGCACCT CTTGCCGACTGGTTACGCGAAAATAAGCGTCAAGCTATCAACACCGAGACCGGTGGTGGAAATACCGATTCGTGT GTGCAATTCATGTGTCAACAGGTTGCTTATCAGGCCGCCAACTCCGATGTCTACTTGGGCTTCGTTGGCTGGGGT GCAGGAAATTTCCTCACTAACTACGTTCTTAGCGAGACCCCAACCAACAATGGCAATGGTTGGACAGACAGTTTA CTCGTTGAGAAATGTTTAGCACCCACGTCTGGTGCTCAAAATGGTTTATTCTGA |
Transcript | >Agabi119p4|016200 ATGATGTCGAGGAAACTTTTGCAAATTTCTGGTTTTGTTTTGACCTCAGCGTTGACCGTACGTGGTCAGGCTGCA GTTTGGCAACAATGTGGTGGAACTGGTTGGACAGGCGATACGACTTGTGTTTCTGGTTCAGTCTGTACTCCTCTC AATGAATTTTATTCCCAATGTATTCCAGGCGAAGCTCCTACGTCCACTGTAACATCACCGACACCCACCACTACC ACTAGGCCGCCCGTAGCCACCGGTAACGGTAAACTCAAGTTTACCGGTGTCAATATCGCTGGATTCGACTTCGGA TGTAATGGCGATGGTTCATGCACAGCTAGTGCCGCTTGGCCCCCTTTGACTAAATACTATGGTCATGATGGCGAA GGACAAATGCAACACTTCGTCAATGACGATGGGTTTAATACGTTCCGTTTACCCGTCGGTTGGCAATTCTTGACC GGTGACGTCGAGGGTGGACCAATCGTTGCTGCGAACTTGGCTAAGTACGATGACCTTGTGCAGGCTTGTCTGAAT ACAGGTGCATTTTGCATTATTGATATACATAATTACGCGAGGTGGGATAGAGGCATAATTGGACAAGGTGGACCT ACCAATGATGAATTTGCCGCCTTGTGGTCAGCGATCGCCAGCAAATATGCATCAGAAGAAAGAATCATGTTTGGA GTTATGAATGAACCGCATGATGTTCCCGATATCAATGCTTGGGCTCAATCTGTTCAAGAGGTTGTAACTGCCATC CGCGAAGCCGGTGCAACGACCCAGCTTATCCTCCTTCCTGGCAACAACTGGACATCAGCAGAAACCTTTGTCTCT AATGGTTCTGCCGCCGCATTACAAAAAGTGACGAACCCCAACGGATCTACGGACAATCTGATTTTCGATGTTCAC AAATATCTCGACAGTGATAATTCCGGCACTCATCCTGAATGTACGACGAACAACATTGATCGCGCATGGGCACCT CTTGCCGACTGGTTACGCGAAAATAAGCGTCAAGCTATCAACACCGAGACCGGTGGTGGAAATACCGATTCGTGT GTGCAATTCATGTGTCAACAGGTTGCTTATCAGGCCGCCAACTCCGATGTCTACTTGGGCTTCGTTGGCTGGGGT GCAGGAAATTTCCTCACTAACTACGTTCTTAGCGAGACCCCAACCAACAATGGCAATGGTTGGACAGACAGTTTA CTCGTTGAGAAATGTTTAGCACCCACGTCTGGTGCTCAAAATGGTTTATTCTGA |
Gene | >Agabi119p4|016200 ATGATGTCGAGGAAACTTTTGCAAATTTCTGGTTTTGTTTTGACCTCAGCGTTGACCGTACGTGGTCAGGCTGCA GTTTGGCAACAATGTATGTGTGTTTCGAAAAACTTGGTTTGGCAAGAGGCTGATGCACGGTTTTAACTAGGTGGT GGAACTGTGAGTACCCAGACATCGTTCCCCAGAAGACAAACACTTAGACATGCATCTTTTCTAGGGTTGGACAGG CGATACGACTTGTGTTTCTGGTTCAGTCTGTACTCCTCTCAATGAATGTACGCCCCTTGATTTCCTTTGCCAATC ATTTTGGATTTGCTTATCACCTTGTTTCATCAGTTTATTCCCAATGTATTCCAGGCGAAGCTCCTACGTCCACTG TAACATCACCGACACCCACCACTACCACTAGGCCGCCCGTAGCCACCGGTAACGGTAAACTCAAGTTTACCGGTG TCAATATCGCTGGATTCGACTTCGGATGTAATGGCGATGGTTCATGCACAGCTAGTGCCGCTTGGCCCCCTTTGA CTAAATACTATGGTCATGATGGCGAAGGACAAATGCAACACTTCGTCAATGACGATGGGTTTAATACGTTCCGTT TACCCGTCGGTTGGCAATTCTTGACCGGTGACGTCGAGGGTGGACCAATCGTTGCTGCGAACTTGGCTAAGTACG ATGACCTTGTGCAGGCTTGTCTGAATACAGGTGCATTTTGCATTATTGATATACATAATTACGCGAGGTGGGATA GAGGCGTAAGTTCAGATATCTCGTAGCTGCCTTTACTATCATGTTTGCTGGTTATTGAAAACGCGTTACAGATAA TTGGACAAGGTGGACCTACCAATGATGAATTTGCCGCCTTGTGGTCAGCGATCGCCAGCAAATATGCATCAGAAG AAAGAATCATGTTTGGAGTGTACGTACTTTTCTTCCCATTAGTTTATCCAATGCATATAAATCTCTCTTTTTTGA CAGTATGAATGAACCGCATGATGTTCCCGATATCAATGCTTGGGCTCAATCTGTTCAAGAGGTTGTAACTGCCAT CCGCGAAGCCGGGTAAGTATCTCTCCATTAGCTGTTACCAGGTCGAAATTTATATTGAAAAAATGGTGATAGTGC AACGACCCAGCTTATCCTCCTTCCTGGCAACAACTGGACATCAGCAGAAACCTTTGTCTCTAATGGTTCTGCCGC CGCATTACAAAAAGTGACGAACCCCAACGGATCTACGGACAATCTGATTTTCGATGTTCACAAATATCTCGACAG TGATAATTCCGGCACTCATCCGTGAGTTGTAAACTCAAGACGCATTTATGAGCGCTTTGTTCATTGATGATTTTT CCCTAGTGAATGTACGACGAACAACATTGATCGCGCATGGGCACCTCTTGCCGACTGGTTACGCGAAAATAAGCG TCAAGCTATCAACACCGAGACCGGTGGTGGAAATACCGATTCGTGTGTGCAATTCATGTGTCAACAGGTTGCTTA TCAGGCCGCCAACTCCGATGGTAAGTCTCGTTCCTGCAAATGTTGTATTTTCTACTAATACAACTCGATTACAGT CTACTTGGGCTTCGTTGGCTGGGGTGCAGGAAATTTCCTCACTAACTACGTTCTTAGCGAGACCCCAACCAACAA TGGCAATGGTTGGACAGACAGTTTACTCGTTGAGAAATGTTTAGCACCCACGTCTGGTGCTCAAAATGGTTTATT CTGA |