Fungal Genomics

at Utrecht University

General Properties

Protein IDAgabiH97|055520
Gene name
Locationscaffold_3:718681..723711
Strand-
Gene length (bp)5030
Transcript length (bp)4080
Coding sequence length (bp)4080
Protein length (aa) 1360

Your browser does not support drawing a protein figure.

PFAM Domains

PFAM Domain ID Short name Long name E-value Start End
PF00141 peroxidase Peroxidase 1.1E-23 65 214
PF16686 POT1PC ssDNA-binding domain of telomere protection protein 4.7E-07 1164 1286
PF02765 POT1 Telomeric single stranded DNA binding POT1/CDC13 3.1E-14 972 1092

Swissprot hits

[Show all]
Swissprot ID Swissprot Description Start End E-value
sp|D4AUF4|WSCD2_ARTBC WSC domain-containing protein ARB_07870 OS=Arthroderma benhamiae (strain ATCC MYA-4681 / CBS 112371) GN=ARB_07870 PE=1 SV=1 9 334 5.0E-45
sp|Q7XZP5|APX5_ARATH L-ascorbate peroxidase 5, peroxisomal OS=Arabidopsis thaliana GN=APX5 PE=1 SV=2 60 277 1.0E-13
sp|Q8GY91|APX6_ARATH Putative L-ascorbate peroxidase 6 OS=Arabidopsis thaliana GN=APX6 PE=2 SV=1 55 202 2.0E-13
sp|Q42564|APX3_ARATH L-ascorbate peroxidase 3, peroxisomal OS=Arabidopsis thaliana GN=APX3 PE=1 SV=1 74 274 6.0E-12
sp|Q4WPF8|CCPR_ASPFU Cytochrome c peroxidase, mitochondrial OS=Neosartorya fumigata (strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100) GN=ccp1 PE=3 SV=1 67 202 1.0E-10
[Show all]
[Show less]
Swissprot ID Swissprot Description Start End E-value
sp|D4AUF4|WSCD2_ARTBC WSC domain-containing protein ARB_07870 OS=Arthroderma benhamiae (strain ATCC MYA-4681 / CBS 112371) GN=ARB_07870 PE=1 SV=1 9 334 5.0E-45
sp|Q7XZP5|APX5_ARATH L-ascorbate peroxidase 5, peroxisomal OS=Arabidopsis thaliana GN=APX5 PE=1 SV=2 60 277 1.0E-13
sp|Q8GY91|APX6_ARATH Putative L-ascorbate peroxidase 6 OS=Arabidopsis thaliana GN=APX6 PE=2 SV=1 55 202 2.0E-13
sp|Q42564|APX3_ARATH L-ascorbate peroxidase 3, peroxisomal OS=Arabidopsis thaliana GN=APX3 PE=1 SV=1 74 274 6.0E-12
sp|Q4WPF8|CCPR_ASPFU Cytochrome c peroxidase, mitochondrial OS=Neosartorya fumigata (strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100) GN=ccp1 PE=3 SV=1 67 202 1.0E-10
sp|P0C0V3|CCPR_EMENI Cytochrome c peroxidase, mitochondrial OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) GN=ccp1 PE=3 SV=1 67 202 5.0E-09
sp|A4QVH4|CCPR_MAGO7 Cytochrome c peroxidase, mitochondrial OS=Magnaporthe oryzae (strain 70-15 / ATCC MYA-4617 / FGSC 8958) GN=CCP1 PE=3 SV=1 67 278 6.0E-09
sp|Q6C0Z6|CCPR_YARLI Cytochrome c peroxidase, mitochondrial OS=Yarrowia lipolytica (strain CLIB 122 / E 150) GN=CCP1 PE=3 SV=1 67 255 1.0E-08
sp|Q6ZJJ1|APX4_ORYSJ Probable L-ascorbate peroxidase 4 OS=Oryza sativa subsp. japonica GN=APX4 PE=2 SV=1 74 202 2.0E-08
sp|P0CP54|CCPR_CRYNJ Cytochrome c peroxidase, mitochondrial OS=Cryptococcus neoformans var. neoformans serotype D (strain JEC21 / ATCC MYA-565) GN=CCP1 PE=3 SV=1 74 202 2.0E-08
sp|P0CP55|CCPR_CRYNB Cytochrome c peroxidase, mitochondrial OS=Cryptococcus neoformans var. neoformans serotype D (strain B-3501A) GN=CCP1 PE=3 SV=1 74 202 2.0E-08
sp|Q1PER6|APX2_ARATH L-ascorbate peroxidase 2, cytosolic OS=Arabidopsis thaliana GN=APX2 PE=2 SV=3 74 261 2.0E-08
sp|Q7SDV9|CCPR_NEUCR Cytochrome c peroxidase, mitochondrial OS=Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) GN=ccp-1 PE=3 SV=1 60 202 3.0E-08
sp|Q6CAB5|CCPR2_YARLI Putative cytochrome c peroxidase, mitochondrial OS=Yarrowia lipolytica (strain CLIB 122 / E 150) GN=YALI0D04268g PE=3 SV=1 67 202 4.0E-08
sp|Q5B1Z0|CCPR2_EMENI Putative heme-binding peroxidase OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) GN=AN5440 PE=3 SV=1 67 202 7.0E-08
sp|Q0JEQ2|APX3_ORYSJ Probable L-ascorbate peroxidase 3 OS=Oryza sativa subsp. japonica GN=APX3 PE=3 SV=1 74 205 9.0E-08
sp|Q5AEN1|CCPR_CANAL Cytochrome c peroxidase, mitochondrial OS=Candida albicans (strain SC5314 / ATCC MYA-2876) GN=CCP1 PE=3 SV=1 64 202 1.0E-07
sp|Q9LNL0|PER8_ARATH Peroxidase 8 OS=Arabidopsis thaliana GN=PER8 PE=2 SV=1 61 220 3.0E-07
sp|O23474|PER40_ARATH Peroxidase 40 OS=Arabidopsis thaliana GN=PER40 PE=2 SV=2 69 236 4.0E-07
sp|A2YPX3|PER2_ORYSI Peroxidase 2 OS=Oryza sativa subsp. indica GN=PRX112 PE=3 SV=2 69 314 2.0E-06
sp|Q93V93|PER44_ARATH Peroxidase 44 OS=Arabidopsis thaliana GN=PER44 PE=2 SV=1 61 202 4.0E-06
sp|Q9SD46|PER36_ARATH Peroxidase 36 OS=Arabidopsis thaliana GN=PER36 PE=2 SV=2 69 209 4.0E-06
sp|Q0D3N0|PER2_ORYSJ Peroxidase 2 OS=Oryza sativa subsp. japonica GN=PRX112 PE=2 SV=1 69 314 5.0E-06
sp|P48534|APX1_PEA L-ascorbate peroxidase, cytosolic OS=Pisum sativum GN=APX1 PE=1 SV=2 74 206 7.0E-06
sp|Q96522|PER45_ARATH Peroxidase 45 OS=Arabidopsis thaliana GN=PER45 PE=1 SV=1 74 242 8.0E-06
[Show less]

GO

GO Term Description Terminal node
GO:0043047 single-stranded telomeric DNA binding Yes
GO:0006979 response to oxidative stress Yes
GO:0000781 chromosome, telomeric region Yes
GO:0003677 DNA binding Yes
GO:0000723 telomere maintenance Yes
GO:0004601 peroxidase activity Yes
GO:0020037 heme binding Yes
GO:0098687 chromosomal region No
GO:0006139 nucleobase-containing compound metabolic process No
GO:0008150 biological_process No
GO:0006996 organelle organization No
GO:0071704 organic substance metabolic process No
GO:0071840 cellular component organization or biogenesis No
GO:0005575 cellular_component No
GO:1901360 organic cyclic compound metabolic process No
GO:0046906 tetrapyrrole binding No
GO:0008152 metabolic process No
GO:0005488 binding No
GO:0032200 telomere organization No
GO:1901363 heterocyclic compound binding No
GO:0016209 antioxidant activity No
GO:0003676 nucleic acid binding No
GO:0051276 chromosome organization No
GO:0110165 cellular anatomical entity No
GO:0016684 oxidoreductase activity, acting on peroxide as acceptor No
GO:0006259 DNA metabolic process No
GO:0098847 sequence-specific single stranded DNA binding No
GO:0042162 telomeric DNA binding No
GO:0044260 cellular macromolecule metabolic process No
GO:0016043 cellular component organization No
GO:0016491 oxidoreductase activity No
GO:0044237 cellular metabolic process No
GO:0003824 catalytic activity No
GO:0009987 cellular process No
GO:0006725 cellular aromatic compound metabolic process No
GO:0044238 primary metabolic process No
GO:0003674 molecular_function No
GO:0003697 single-stranded DNA binding No
GO:0006950 response to stress No
GO:0043565 sequence-specific DNA binding No
GO:0006807 nitrogen compound metabolic process No
GO:0050896 response to stimulus No
GO:0043170 macromolecule metabolic process No
GO:0090304 nucleic acid metabolic process No
GO:0046483 heterocycle metabolic process No
GO:0034641 cellular nitrogen compound metabolic process No
GO:0097159 organic cyclic compound binding No

SignalP

[Help with interpreting these statistics]
SignalP signal predicted Location
(based on Ymax)
D score
(significance: > 0.45)
Yes 1 - 26 0.45

Transmembrane Domains

(None)

Transcription Factor Class

(None)

Expression data

Analysis 1: Developmental stages of Agaricus bisporus (strain A15). Published in Pelkmans et al, Applied Microbiology and Biotechnology, 2016

Click here for more information

Sequences

Type of sequenceSequence
Locus Download genbank file of locus
The gene with 5 kb flanks (if sufficient flanking sequence is available). For use in cloning design programs. NOTE: features (genes or exons) that are only partially contained within the sequence are completely excluded.
Protein >AgabiH97|055520
MLLARTSRLLADLLAAGIWCTSVSAGYSWPSPQYEALETFLYEGRRYDGSNLASLQHPCKNRSDTGASIGAEWLR
LAYHDVSTRDAEAGTGGLDGSIAYELDREENAGQAMSASLADFESFSNKYVSRSDVIAAGTIFAVASCGGPIIPF
RGGRFDAVTAASSSFSVPEPFQDFQTHLDMFRRQGFSQTEMITLVACGHTIGGVRNTDFPHIVSPKGDSEKPTLE
NFDPSPQYDNLVVQQYLDGSTNNPLVVHPNKTITSDFRIFSSDNNATMKSLSSPDAYLNSCRTVLEKMLNMVPSN
VALTDEIRLLQEKVFNVQLTLEHGELVFKASLRLAQSVGSTPSKGRVVRMFWCNRYGDDKDCSTNANVVFQAAST
QDSPEISPITERLGYYFINHQFVVKVDPSKSISLFWFESDEHDGSTPKIIDNGGSGYQFDQDRILYVPMSSQLDI
TGGDEASKVYQIVAAVRSDVSPSRVYMSGFDNAIPKHGPAITIAEDLTLNTTLTPVEGYNFYSTSIQSAGLQLTV
DLHADIDGNTYTVDFQQTYFLGSSLPYTPPTISNITAVARIPGEPTDTNSIARSAFRGPTKVRGRVRVVASLSAR
AKIQTRQSCFRAFFVSPNVMKRPRDNEDDTTNKRRRKNLAGDIFDDESLSNCLCNVQGLNSAIGYFTGSVSMRWL
PGGKKFIFQLTSRCCVTLSRTHKINVTFEGDWIESMHARWGLLSIMENIQLSLDGVTSVYPQTSKQTAKLVYTDG
VMYRKMREGKETETFFINSWQLQEEQDKQEDLDWFATPTVPLNGSLSEFVRTASAAGLKSVPSPKENQMSEKRDD
DSSLSKENLPPVDEHPTKQDVVSTATSRVLPQPASISEANVVVNVQQTTSENQSRPPLSRKQEKKAKKKLARAEK
AQREALAQLAISSGAANTATATSSSQMQVSNHLPARVDANAQVVKLGELLPVEDNPAGWMARLRRFKGNYKMLKD
ATDDKSRINIIGVVRSVKPPRLSTKSDWTSNVYLQDPSLFPDGEQFTTNDYTVNFFQRRFKEWLPQVNIGDILVL
RDIKVQNHSGNLFGVGYYDRAKWAVYSIKDECFVPPKLEQAPKEQELVDGGIKFTPFLSASELEERYCQQLSAWW
KEKNKDSAESTGHVYSLSESAIPGRTRRQHLLLCDAGPDVEPGGYFDCTFEVVNAYQSKDYPPTLYVTDYTKNAI
VAPCQTKWCPPGLNDYIVKFELWDAAASLAPSLVPGTYWTVRNARLKRGHDGYVEGKIVEKKFSKLEPDSEYAPL
QELLKRKKEWETANKPRQTRFEHKLLKDVGDGDYANCTVELLHASKGYDEAYLEVTDYTAHPKMSSYRYENDWSR
DLEGLIDSS*
Coding >AgabiH97|055520
ATGCTATTGGCGCGCACCAGTCGGCTCCTCGCCGACTTGCTGGCAGCAGGGATCTGGTGCACCTCGGTTTCTGCT
GGTTACTCATGGCCTTCACCACAATACGAGGCATTGGAAACATTTTTATACGAAGGGAGGAGATACGACGGTTCG
AACTTGGCTTCGCTCCAACACCCATGCAAAAACAGGTCTGATACGGGCGCGTCAATTGGTGCTGAATGGCTGAGG
CTGGCATATCATGATGTTTCGACGCGAGATGCTGAAGCTGGGACGGGCGGGTTGGATGGATCAATTGCATACGAG
TTGGATAGAGAAGAAAATGCGGGACAGGCGATGAGCGCTAGTTTGGCCGACTTTGAATCGTTCTCTAATAAATAC
GTATCTCGATCGGATGTCATCGCCGCCGGTACAATATTCGCCGTTGCATCCTGCGGGGGTCCCATTATACCTTTT
CGTGGCGGCCGTTTCGATGCAGTCACCGCAGCTTCAAGTTCTTTTAGTGTTCCAGAACCGTTCCAGGATTTTCAG
ACTCATCTAGACATGTTCCGCCGGCAGGGATTTTCACAGACGGAGATGATCACTTTAGTTGCTTGTGGACATACT
ATCGGTGGAGTGAGAAACACGGATTTCCCTCATATTGTTTCCCCGAAAGGCGATTCAGAGAAACCTACCCTTGAA
AACTTTGATCCTTCCCCGCAGTATGATAATCTAGTGGTGCAGCAGTATCTGGATGGCAGTACAAATAATCCGCTT
GTTGTTCATCCGAACAAGACGATTACTTCGGACTTTCGAATATTCTCAAGCGATAACAATGCGACTATGAAGAGT
TTGTCATCCCCTGATGCATATTTGAATTCATGTCGAACGGTTCTCGAGAAAATGCTCAACATGGTTCCTTCAAAT
GTCGCGCTGACAGATGAAATACGGCTCTTGCAAGAGAAAGTTTTCAATGTTCAGTTGACACTTGAACATGGGGAA
TTAGTTTTCAAGGCATCGCTTCGGCTCGCTCAGTCTGTGGGCTCCACTCCCAGTAAAGGCAGAGTTGTTCGAATG
TTCTGGTGCAACCGCTACGGAGACGACAAGGATTGTTCTACCAACGCCAATGTTGTTTTTCAGGCAGCATCTACT
CAGGACTCACCAGAAATTTCACCAATCACAGAACGACTAGGCTACTATTTCATCAACCATCAATTTGTTGTCAAA
GTCGATCCGTCAAAGTCGATTTCTCTTTTCTGGTTTGAGAGTGACGAACATGATGGTTCAACGCCAAAAATTATC
GATAATGGAGGGTCGGGGTATCAATTTGACCAAGACCGTATACTGTACGTCCCGATGTCTAGTCAGTTGGATATC
ACAGGTGGAGATGAAGCTTCCAAAGTCTATCAAATTGTCGCCGCAGTTCGCAGCGACGTCTCGCCTTCTCGAGTA
TATATGTCTGGTTTTGACAATGCCATACCCAAGCATGGACCCGCGATTACTATTGCCGAGGACCTCACACTAAAT
ACAACCCTCACTCCCGTAGAAGGCTATAATTTCTACTCTACCTCGATTCAATCTGCTGGACTTCAGTTGACGGTC
GATTTGCATGCTGATATTGATGGTAATACTTATACGGTTGATTTTCAACAGACGTATTTTTTGGGTTCAAGTCTT
CCTTATACACCGCCGACTATTTCTAATATCACCGCTGTTGCTCGGATTCCTGGAGAACCTACTGATACTAACTCT
ATTGCGAGAAGCGCGTTTAGGGGCCCAACTAAGGTAAGGGGGCGCGTCCGAGTCGTCGCGTCATTAAGCGCACGT
GCCAAGATCCAAACGCGTCAATCTTGCTTTAGAGCGTTCTTCGTCTCTCCCAATGTCATGAAGCGACCACGCGAT
AACGAAGACGATACTACCAACAAGAGGCGGCGCAAGAATCTCGCGGGTGATATCTTCGATGACGAATCTCTCAGC
AATTGCCTCTGCAATGTTCAAGGGTTAAACAGTGCCATAGGATATTTCACTGGCTCAGTGTCTATGAGATGGTTA
CCTGGTGGCAAGAAATTCATATTCCAGTTGACGTCTCGATGCTGCGTTACACTTAGTCGAACTCACAAGATCAAC
GTTACTTTCGAGGGGGACTGGATTGAATCTATGCATGCTCGATGGGGTCTTTTGTCTATCATGGAGAACATTCAA
CTATCGCTAGATGGAGTAACTTCCGTCTATCCGCAGACTTCGAAGCAAACAGCGAAGCTGGTGTATACAGACGGT
GTCATGTATCGAAAGATGCGAGAAGGGAAAGAAACGGAGACATTTTTCATCAATTCTTGGCAATTGCAAGAGGAA
CAAGACAAACAAGAGGATTTAGATTGGTTTGCTACGCCTACGGTACCTCTTAACGGATCTTTATCGGAGTTTGTT
CGTACGGCTTCTGCTGCTGGCCTCAAGAGTGTCCCCAGTCCAAAGGAAAACCAAATGTCTGAAAAACGTGATGAT
GACTCGAGTTTGTCGAAGGAGAACCTGCCTCCCGTCGACGAACATCCGACCAAACAAGATGTTGTCAGTACCGCA
ACCTCTAGGGTTCTCCCCCAGCCCGCGTCTATCTCAGAGGCCAACGTCGTCGTAAATGTTCAACAAACGACAAGT
GAGAACCAATCGCGTCCTCCTTTGTCTCGTAAGCAAGAAAAAAAGGCGAAGAAGAAACTCGCCAGGGCGGAAAAA
GCGCAACGTGAAGCATTGGCGCAACTGGCTATATCATCTGGAGCTGCCAATACCGCGACAGCTACATCTTCATCT
CAAATGCAAGTGAGCAATCACCTTCCGGCACGAGTTGATGCAAATGCTCAAGTCGTTAAATTGGGCGAATTACTA
CCTGTTGAAGATAATCCGGCAGGTTGGATGGCGCGTTTGCGGCGTTTCAAGGGCAATTATAAAATGCTCAAAGAC
GCGACCGACGACAAAAGCCGTATAAATATTATTGGAGTAGTGAGGAGTGTTAAGCCCCCTCGTCTGTCAACCAAA
AGTGACTGGACTTCAAACGTTTACTTGCAAGACCCTAGTCTATTTCCAGATGGGGAGCAATTTACTACGAATGAC
TATACCGTTAATTTCTTCCAGAGAAGGTTCAAGGAATGGCTGCCACAAGTGAACATAGGAGACATCTTAGTTTTG
CGTGACATCAAGGTCCAAAATCACAGTGGTAACCTCTTTGGTGTTGGATATTACGACAGAGCCAAATGGGCTGTC
TATTCCATCAAAGACGAGTGCTTTGTCCCACCGAAACTAGAACAGGCTCCGAAAGAACAGGAGTTAGTAGATGGC
GGTATCAAATTCACTCCATTTCTAAGCGCAAGTGAGCTGGAGGAGCGCTACTGCCAACAGTTGTCGGCTTGGTGG
AAAGAGAAGAACAAAGACTCAGCAGAATCGACCGGCCATGTGTATAGCTTGAGTGAATCCGCCATCCCCGGACGG
ACGCGGCGCCAGCATCTTCTACTCTGCGATGCAGGACCGGATGTGGAACCTGGCGGATATTTCGACTGTACATTT
GAGGTTGTCAATGCTTATCAGAGTAAGGATTACCCCCCTACTCTATATGTGACAGACTATACGAAAAATGCCATC
GTTGCGCCTTGCCAGACGAAGTGGTGTCCACCTGGACTAAACGATTACATCGTGAAGTTTGAACTTTGGGATGCG
GCAGCAAGCTTGGCCCCATCGCTTGTCCCTGGAACATATTGGACGGTTAGGAATGCTCGACTCAAGCGCGGCCAT
GATGGATATGTTGAGGGGAAGATAGTTGAAAAGAAATTCTCTAAGCTTGAACCTGACTCCGAGTATGCTCCACTA
CAAGAATTGCTGAAACGCAAGAAGGAATGGGAAACCGCAAACAAACCAAGGCAAACGCGTTTTGAACACAAATTG
CTCAAAGACGTCGGGGATGGAGATTACGCCAACTGCACCGTCGAGTTGTTGCATGCGTCCAAAGGCTATGATGAA
GCTTATCTTGAGGTGACAGACTATACAGCCCATCCAAAGATGTCGTCCTATCGATATGAGAACGACTGGTCCCGG
GACTTGGAAGGCCTTATTGATTCTTCCTGA
Transcript >AgabiH97|055520
ATGCTATTGGCGCGCACCAGTCGGCTCCTCGCCGACTTGCTGGCAGCAGGGATCTGGTGCACCTCGGTTTCTGCT
GGTTACTCATGGCCTTCACCACAATACGAGGCATTGGAAACATTTTTATACGAAGGGAGGAGATACGACGGTTCG
AACTTGGCTTCGCTCCAACACCCATGCAAAAACAGGTCTGATACGGGCGCGTCAATTGGTGCTGAATGGCTGAGG
CTGGCATATCATGATGTTTCGACGCGAGATGCTGAAGCTGGGACGGGCGGGTTGGATGGATCAATTGCATACGAG
TTGGATAGAGAAGAAAATGCGGGACAGGCGATGAGCGCTAGTTTGGCCGACTTTGAATCGTTCTCTAATAAATAC
GTATCTCGATCGGATGTCATCGCCGCCGGTACAATATTCGCCGTTGCATCCTGCGGGGGTCCCATTATACCTTTT
CGTGGCGGCCGTTTCGATGCAGTCACCGCAGCTTCAAGTTCTTTTAGTGTTCCAGAACCGTTCCAGGATTTTCAG
ACTCATCTAGACATGTTCCGCCGGCAGGGATTTTCACAGACGGAGATGATCACTTTAGTTGCTTGTGGACATACT
ATCGGTGGAGTGAGAAACACGGATTTCCCTCATATTGTTTCCCCGAAAGGCGATTCAGAGAAACCTACCCTTGAA
AACTTTGATCCTTCCCCGCAGTATGATAATCTAGTGGTGCAGCAGTATCTGGATGGCAGTACAAATAATCCGCTT
GTTGTTCATCCGAACAAGACGATTACTTCGGACTTTCGAATATTCTCAAGCGATAACAATGCGACTATGAAGAGT
TTGTCATCCCCTGATGCATATTTGAATTCATGTCGAACGGTTCTCGAGAAAATGCTCAACATGGTTCCTTCAAAT
GTCGCGCTGACAGATGAAATACGGCTCTTGCAAGAGAAAGTTTTCAATGTTCAGTTGACACTTGAACATGGGGAA
TTAGTTTTCAAGGCATCGCTTCGGCTCGCTCAGTCTGTGGGCTCCACTCCCAGTAAAGGCAGAGTTGTTCGAATG
TTCTGGTGCAACCGCTACGGAGACGACAAGGATTGTTCTACCAACGCCAATGTTGTTTTTCAGGCAGCATCTACT
CAGGACTCACCAGAAATTTCACCAATCACAGAACGACTAGGCTACTATTTCATCAACCATCAATTTGTTGTCAAA
GTCGATCCGTCAAAGTCGATTTCTCTTTTCTGGTTTGAGAGTGACGAACATGATGGTTCAACGCCAAAAATTATC
GATAATGGAGGGTCGGGGTATCAATTTGACCAAGACCGTATACTGTACGTCCCGATGTCTAGTCAGTTGGATATC
ACAGGTGGAGATGAAGCTTCCAAAGTCTATCAAATTGTCGCCGCAGTTCGCAGCGACGTCTCGCCTTCTCGAGTA
TATATGTCTGGTTTTGACAATGCCATACCCAAGCATGGACCCGCGATTACTATTGCCGAGGACCTCACACTAAAT
ACAACCCTCACTCCCGTAGAAGGCTATAATTTCTACTCTACCTCGATTCAATCTGCTGGACTTCAGTTGACGGTC
GATTTGCATGCTGATATTGATGGTAATACTTATACGGTTGATTTTCAACAGACGTATTTTTTGGGTTCAAGTCTT
CCTTATACACCGCCGACTATTTCTAATATCACCGCTGTTGCTCGGATTCCTGGAGAACCTACTGATACTAACTCT
ATTGCGAGAAGCGCGTTTAGGGGCCCAACTAAGGTAAGGGGGCGCGTCCGAGTCGTCGCGTCATTAAGCGCACGT
GCCAAGATCCAAACGCGTCAATCTTGCTTTAGAGCGTTCTTCGTCTCTCCCAATGTCATGAAGCGACCACGCGAT
AACGAAGACGATACTACCAACAAGAGGCGGCGCAAGAATCTCGCGGGTGATATCTTCGATGACGAATCTCTCAGC
AATTGCCTCTGCAATGTTCAAGGGTTAAACAGTGCCATAGGATATTTCACTGGCTCAGTGTCTATGAGATGGTTA
CCTGGTGGCAAGAAATTCATATTCCAGTTGACGTCTCGATGCTGCGTTACACTTAGTCGAACTCACAAGATCAAC
GTTACTTTCGAGGGGGACTGGATTGAATCTATGCATGCTCGATGGGGTCTTTTGTCTATCATGGAGAACATTCAA
CTATCGCTAGATGGAGTAACTTCCGTCTATCCGCAGACTTCGAAGCAAACAGCGAAGCTGGTGTATACAGACGGT
GTCATGTATCGAAAGATGCGAGAAGGGAAAGAAACGGAGACATTTTTCATCAATTCTTGGCAATTGCAAGAGGAA
CAAGACAAACAAGAGGATTTAGATTGGTTTGCTACGCCTACGGTACCTCTTAACGGATCTTTATCGGAGTTTGTT
CGTACGGCTTCTGCTGCTGGCCTCAAGAGTGTCCCCAGTCCAAAGGAAAACCAAATGTCTGAAAAACGTGATGAT
GACTCGAGTTTGTCGAAGGAGAACCTGCCTCCCGTCGACGAACATCCGACCAAACAAGATGTTGTCAGTACCGCA
ACCTCTAGGGTTCTCCCCCAGCCCGCGTCTATCTCAGAGGCCAACGTCGTCGTAAATGTTCAACAAACGACAAGT
GAGAACCAATCGCGTCCTCCTTTGTCTCGTAAGCAAGAAAAAAAGGCGAAGAAGAAACTCGCCAGGGCGGAAAAA
GCGCAACGTGAAGCATTGGCGCAACTGGCTATATCATCTGGAGCTGCCAATACCGCGACAGCTACATCTTCATCT
CAAATGCAAGTGAGCAATCACCTTCCGGCACGAGTTGATGCAAATGCTCAAGTCGTTAAATTGGGCGAATTACTA
CCTGTTGAAGATAATCCGGCAGGTTGGATGGCGCGTTTGCGGCGTTTCAAGGGCAATTATAAAATGCTCAAAGAC
GCGACCGACGACAAAAGCCGTATAAATATTATTGGAGTAGTGAGGAGTGTTAAGCCCCCTCGTCTGTCAACCAAA
AGTGACTGGACTTCAAACGTTTACTTGCAAGACCCTAGTCTATTTCCAGATGGGGAGCAATTTACTACGAATGAC
TATACCGTTAATTTCTTCCAGAGAAGGTTCAAGGAATGGCTGCCACAAGTGAACATAGGAGACATCTTAGTTTTG
CGTGACATCAAGGTCCAAAATCACAGTGGTAACCTCTTTGGTGTTGGATATTACGACAGAGCCAAATGGGCTGTC
TATTCCATCAAAGACGAGTGCTTTGTCCCACCGAAACTAGAACAGGCTCCGAAAGAACAGGAGTTAGTAGATGGC
GGTATCAAATTCACTCCATTTCTAAGCGCAAGTGAGCTGGAGGAGCGCTACTGCCAACAGTTGTCGGCTTGGTGG
AAAGAGAAGAACAAAGACTCAGCAGAATCGACCGGCCATGTGTATAGCTTGAGTGAATCCGCCATCCCCGGACGG
ACGCGGCGCCAGCATCTTCTACTCTGCGATGCAGGACCGGATGTGGAACCTGGCGGATATTTCGACTGTACATTT
GAGGTTGTCAATGCTTATCAGAGTAAGGATTACCCCCCTACTCTATATGTGACAGACTATACGAAAAATGCCATC
GTTGCGCCTTGCCAGACGAAGTGGTGTCCACCTGGACTAAACGATTACATCGTGAAGTTTGAACTTTGGGATGCG
GCAGCAAGCTTGGCCCCATCGCTTGTCCCTGGAACATATTGGACGGTTAGGAATGCTCGACTCAAGCGCGGCCAT
GATGGATATGTTGAGGGGAAGATAGTTGAAAAGAAATTCTCTAAGCTTGAACCTGACTCCGAGTATGCTCCACTA
CAAGAATTGCTGAAACGCAAGAAGGAATGGGAAACCGCAAACAAACCAAGGCAAACGCGTTTTGAACACAAATTG
CTCAAAGACGTCGGGGATGGAGATTACGCCAACTGCACCGTCGAGTTGTTGCATGCGTCCAAAGGCTATGATGAA
GCTTATCTTGAGGTGACAGACTATACAGCCCATCCAAAGATGTCGTCCTATCGATATGAGAACGACTGGTCCCGG
GACTTGGAAGGCCTTATTGATTCTTCCTGA
Gene >AgabiH97|055520
ATGCTATTGGCGCGCACCAGTCGGCTCCTCGCCGACTTGCTGGCAGCAGGGATCTGGTGCACCTCGGTTTCTGCT
GGTTACTCATGGCCTTCACCACAATACGAGGCATTGGAAACATTTTTATACGAAGGGAGGAGATACGACGGTTCG
AACTTGGCTTCGCTCCAACACCCATGCAAAAACAGGTCTGATACGGGCGCGTCAATTGGTGCTGAATGGCTGAGG
CTGGCATATCATGATGTTTCGACGCGAGATGCTGAAGCTGGGACGGGCGGGTTGGATGGATCAATTGCATACGAG
TTGGATAGAGAAGAAGTGAGTCAAATGGTCCAAGTAGCGTTGAGAATCATGGACTCTGAGCTTCATGTCCGGGTC
TAGAATGCGGGACAGGCGATGAGCGCTAGTTTGGCCGACTTTGAATCGTTCTCTAATAAATACGTATCTCGTGGG
TCTTCGAATGTTTACCTGTTCCCCTATCGCAGACTAACCTCATATATCCTTTTCAGGATCGGATGTCATCGCCGC
CGGTACAATATTCGCCGTTGCATCCTGCGGGGGTCCCATTATACCTTTTCGTGGCGGCCGTTTCGATGCAGTCAC
CGCAGCTTCAAGTTCTTTTAGTGTTCCAGAACCGTTCCAGGATTTTCAGACTCATCTAGACATGTTCCGCCGGCA
GGGATTTTCACAGACGGAGATGATCACTTTAGTTGCTTGTGGACATACTATCGGTGGAGTGAGAAACACGGATTT
CCCTCATATTGTTTCCCCGAAAGGCGATTCAGAGAAACCTACCCTTGAAAACTTTGATCCTTCCCCGCAGTATGA
TAATCTAGTGTAAGTCCCGACTACGTGACGCATTGCTGGACGTATGTTGATGAGGATATGATCAGGGTGCAGCAG
TATCTGGATGGCAGTACAAATAATCCGCTTGTTGTTCATCCGAACAAGACGATTACTTCGGACTTTCGAATATTC
TCAAGCGATAACAATGCGACTATGAAGAGGCGAGCTGCTTTCAAGTCTCAACTTTCGAGTAGCTGGCTGACATCT
ACTTCAGTTTGTCATCCCCTGATGCATATTTGAATTCATGTCGAACGGTTCTCGAGAAAATGCTCAACATGGTTC
CTTCAAATGTCGCGCTGACAGATGAAATACGGCTCTTGCAAGAGAAAGTTTTCAATGTTCAGTTGACACTTGAAC
ATGGGGAATTAGTTTTCAAGGCATCGCTTCGGGTACGGGTTCCTGTAATCTTGAGCCTAGACTTGTTTTTACTCA
ACATATTTGTATAGCTCGCTCAGTCTGTGGGCTCCACTCCCAGTAAAGGCAGAGTTGTTCGAATGTTCTGGTGCA
ACCGCTACGGAGACGACAAGGATTGTTCTACCAACGCCAATGTTGTTTTTCAGGCAGCATCTACTCAGGACTCAC
CAGAAATTTCACCAATCACAGAACGACTAGGCTACTATTTCATCAACCATCAATTTGTTGTCAAAGTCGATCCGT
CAAAGTCGATTTCTCTTTTCTGGTTTGAGAGTGACGAACATGATGGTTCAACGCCAAAAATTATCGATAATGGAG
GGTCGGGGTATCAATTTGACCAAGACCGTATACTGTACGTCCCGATGTCTAGTCAGTTGGATATCACAGGTGGAG
ATGAAGCTTCCAAAGTCTATCAAATTGTCGCCGCAGTGAGTTGATTTGCAAAAACCGTGACATTTTCGTTCATGA
CCCTTTATTTCATTTAGGTTCGCAGCGACGTCTCGCCTTCTCGAGTATATATGTCTGGTTTTGACAATGCCATAC
CCAAGCATGGACCCGCGATTACTATTGCCGAGGACCTCACACTAAATACAACCCTCACTCCCGTAGAAGGCTATA
ATTTCTACTCTACCTCGATTCAATCTGCTGGACTTCAGTTGACGGTCGATTTGCATGCTGATATTGATGGTAATA
CTTATACGGTTGATTTTCAACAGACGTATTTTTTGGGTTCAAGTCTTCCTTATACACCGCCGACTATTTCTAATA
TCACCGCTGTTGCTCGGATTCCTGGAGAACCTACTGATACTAACTCTGTACGATCTTTGCATCCTTCGTTTCTAA
TATTTAGTTATCTAATACTCGTAACCTACATTGGACTTTTTGTATAACATGCTTTACAACTCGGACTTTCAGTAA
CAACAATCTAGTAAATTCTTGATACATAGATTGCGAGAAGCGCGTTTAGGGGCCCAACTAAGGTAAGGGGGCGCG
TCCGAGTCGTCGCGTCATTAAGCGCACGTGCCAAGATCCAAACGCGTCAATCTTGCTTTAGAGCGTTCTTCGTCT
CTCCCAATGTCATGAAGCGACCACGCGATAACGAAGACGATACTACCAACAAGAGGCGGCGCAAGAATCTCGCGG
GTGATATCTTCGATGACGAATCTCTCAGCAATTGCCTCTGCAATGTTCAAGGGTTAAACAGTGCCATAGGATATT
TCACTGGCTCAGTGTCTATGAGATGGTTACCTGGTGGCAAGAAATTCATATTCCAGTTGACGTCTCGATGCTGCG
TTACACTTAGTCGAACTCACAAGATCAACGTTACTTTCGAGGGGGACTGGATTGAATCTATGCATGCTCGATGGG
GTCTTTTGTCTATCATGGAGAACATTCAACTATCGCTAGATGGAGTAACTTCCGTCTATCCGCAGACTTCGAAGC
AAACAGCGAAGCTGGTGTATACAGACGGTGTCATGTATCGAAAGATGCGAGAAGGGAAAGAAACGGAGACATTTT
TCATCAATTCTTGGCAATGTAAGTCCACCCACTCTCGTTGAAGGAACATCGGATGACTACATACACCTCAAACGT
ATAGTGCAAGAGGAACAAGACAAACAAGAGGATTTAGATTGGTTTGCTACGCCTACGGTACCTCTTAACGGATCT
TTATCGGAGTTTGTTCGTACGGCTTCTGCTGCTGGCCTCAAGAGTGTCCCCAGTCCAAAGGAAAACCAAATGTCT
GAAAAACGTGATGATGACTCGAGTTTGTCGAAGGAGAACCTGCCTCCCGTCGACGAACATCCGACCAAACAAGAT
GTTGTCAGTACCGCAACCTCTAGGGTTCTCCCCCAGCCCGCGTCTATCTCAGAGGCCAACGTCGTCGTAAATGTT
CAACAAACGACAAGTGAGAACCAATCGCGTCCTCCTTTGTCTCGTAAGCAAGAAAAAAAGGCGAAGAAGAAACTC
GCCAGGGCGGAAAAAGCGCAACGTGAAGCATTGGCGCAACTGGCTATATCATCTGGAGCTGCCAATACCGCGACA
GCTACATCTTCATCTCAAATGCAAGTGAGCAATCACCTTCCGGCACGAGTTGATGCAAATGCTCAAGTCGTTAAA
TTGGGCGTAAGTACGGTCTCCCTTCACGCGTATATTGACTATTAAATGGCCGAAAAGGAATTACTACCTGTTGAA
GATAATCCGGCAGGTTGGATGGCGCGTTTGCGGCGTTTCAAGGTACGTCCATCTTTTGAGGGTGCTGCTTGTTGA
TACATAACAGTGATCTGATTGCAGGGCAATTATAAAATGCTCAAAGACGCGACCGACGACAAAAGCCGTATAAAT
ATTATTGGAGTAGTGAGGAGTGTTAAGCCCCCTCGTCTGTCAACCAAAAGTGGTCAGTAAATTTCTTTCTCGCAA
TCGCTCATTCCAATGTCCCTTCAGACTGGACTTCAAACGTTTACTTGCAAGACCCTAGTCTATTTCCAGATGGGG
AGCAATTTACTACGAATGACTATACCGTTAATTTCTTCCAGAGAAGGTTCAAGGAATGGCTGCCACAAGTGAACA
TAGGAGACATCTTAGTTTTGCGTGACATCAAGGCAAGGTCACCGTCTTTGACTGACAATTTATCCACTTACTTGT
ACCAAAGGTCCAAAATCACAGTGGTAACCTCTTTGGTGTTGGATATTACGACAGAGCCAAATGGGCTGTCTATTC
CATCAAAGACGAGTGCTTTGTCCCACCGAAACTAGAACAGGCTCCGAAAGAACAGGAGTTAGTAGATGGCGGTAT
CAAATTCACTCCATTTCTAAGCGCAAGTGAGCTGGAGGAGCGCTACTGCCAACAGTTGTCGGCTTGGTGGAAAGA
GAAGAACAAAGACTCAGCAGAATCGACCGGCCATGTGTATAGCTTGAGTGAATCCGCCATCCCCGGACGGACGCG
GCGCCAGCATCTTCTACTCTGCGATGCAGGACCGGATGTGGAACCTGGCGGATATTTCGACTGTACATTTGAGGT
GAGGTAGTCGTTTCGAATTTTCTTTCACTCAAAATCTTGACGATCTACAGGTTGTCAATGCTTATCAGAGTAAGG
ATTACCCCCCTACTCTATATGTGACAGACTATACGAAAAATGCCATCGTTGCGCCTTGCCAGACGAAGTGGTGTC
CACCTGGACTAAACGATTACATCGTGAAGTTTGAACTTTGGGATGCGGCAGCAAGCTTGGCCCCATCGCTTGTCC
CTGGAACATATTGGACGGTTAGGAATGCTCGACTCAAGCGCGGCCATGATGGATATGTTGAGGGGAAGATAGTTG
AAAAGAAATTCTCTAAGCTTGAACCTGACTCCGAGTATGCTCCACTACAAGAATTGCTGAAGTGAGCTCATGGCT
TTCCTGTTATACCCGACTAATACGGTTTCTGACCCAGACGCAAGAAGGAATGGGAAACCGCAAACAAACCAAGGC
AAACGCGTTTTGAACACAAATTGCTCAAAGACGTCGGGGATGGAGATTACGCCAACTGCACCGTCGAGGTAAGCG
ATCCCCATGTTGACATATTTTGGACTAAACGTTGGGCAAGTTGTTGCATGCGTCCAAAGGCTATGATGAAGCTTA
TCTTGAGGTGACAGACTATACAGCCCATCCAAAGATGTCGTCCTATCGATATGAGAACGACTGGTCCCGGGACTT
GGAAGGCCTTATTGTCAGTGTCCACCTACGTGATGCCCAGGTGCAACATGCAAAAACTTTACGTCCAGGATTCTT
CCTGA

© 2022 - Robin Ohm - Utrecht University - The Netherlands

Built with Python Django and Wagtail