Fungal Genomics

at Utrecht University

General Properties

Protein IDAgabiH97|048820
Gene name
Locationscaffold_2:2562488..2567822
Strand+
Gene length (bp)5334
Transcript length (bp)5334
Coding sequence length (bp)5334
Protein length (aa) 1778

Your browser does not support drawing a protein figure.

PFAM Domains

PFAM Domain ID Short name Long name E-value Start End
PF20209 DUF6570 Domain of unknown function (DUF6570) 9.5E-40 49 191
PF14214 Helitron_like_N Helitron helicase-like domain at N-terminus 4.6E-32 317 540
PF05970 PIF1 PIF1-like helicase 6.4E-27 1217 1549
PF13604 AAA_30 AAA domain 7.2E-14 1217 1429
PF13245 AAA_19 AAA domain 3.2E-07 1242 1370

Swissprot hits

[Show all]
Swissprot ID Swissprot Description Start End E-value
sp|Q5AXT5|PIF1_EMENI ATP-dependent DNA helicase PIF1 OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) GN=pif1 PE=3 SV=2 1242 1675 8.0E-28
sp|Q196V4|VF030_IIV3 Uncharacterized protein 106R OS=Invertebrate iridescent virus 3 GN=IIV3-106R PE=3 SV=1 1242 1670 1.0E-24
sp|Q0R4F1|PIF1_XENLA ATP-dependent DNA helicase PIF1 OS=Xenopus laevis GN=pif1 PE=2 SV=1 1242 1669 9.0E-23
sp|Q80SX8|PIF1_MOUSE ATP-dependent DNA helicase PIF1 OS=Mus musculus GN=Pif1 PE=1 SV=2 1217 1669 1.0E-22
sp|Q9QSK3|VF030_IIV6 Uncharacterized protein 030L OS=Invertebrate iridescent virus 6 GN=IIV6-030L PE=3 SV=1 1211 1664 3.0E-22
[Show all]
[Show less]
Swissprot ID Swissprot Description Start End E-value
sp|Q5AXT5|PIF1_EMENI ATP-dependent DNA helicase PIF1 OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) GN=pif1 PE=3 SV=2 1242 1675 8.0E-28
sp|Q196V4|VF030_IIV3 Uncharacterized protein 106R OS=Invertebrate iridescent virus 3 GN=IIV3-106R PE=3 SV=1 1242 1670 1.0E-24
sp|Q0R4F1|PIF1_XENLA ATP-dependent DNA helicase PIF1 OS=Xenopus laevis GN=pif1 PE=2 SV=1 1242 1669 9.0E-23
sp|Q80SX8|PIF1_MOUSE ATP-dependent DNA helicase PIF1 OS=Mus musculus GN=Pif1 PE=1 SV=2 1217 1669 1.0E-22
sp|Q9QSK3|VF030_IIV6 Uncharacterized protein 030L OS=Invertebrate iridescent virus 6 GN=IIV6-030L PE=3 SV=1 1211 1664 3.0E-22
sp|Q1HG60|PIF1_RAT ATP-dependent DNA helicase PIF1 OS=Rattus norvegicus GN=Pif1 PE=2 SV=1 1217 1669 4.0E-21
sp|Q9UUA2|PIF1_SCHPO ATP-dependent DNA helicase pfh1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=pfh1 PE=1 SV=1 1212 1665 6.0E-21
sp|P07271|PIF1_YEAST ATP-dependent DNA helicase PIF1 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=PIF1 PE=1 SV=2 1242 1551 7.0E-21
sp|A6ZM04|PIF1_YEAS7 ATP-dependent DNA helicase PIF1 OS=Saccharomyces cerevisiae (strain YJM789) GN=PIF1 PE=3 SV=1 1242 1551 7.0E-21
sp|Q9H611|PIF1_HUMAN ATP-dependent DNA helicase PIF1 OS=Homo sapiens GN=PIF1 PE=1 SV=2 1217 1676 2.0E-20
sp|Q57YG0|PIF7_TRYB2 ATP-dependent DNA helicase PIF7 OS=Trypanosoma brucei brucei (strain 927/4 GUTat10.1) GN=PIF7 PE=3 SV=1 1217 1674 3.0E-20
sp|Q7ZV90|PIF1_DANRE ATP-dependent DNA helicase PIF1 OS=Danio rerio GN=pif1 PE=2 SV=1 1242 1686 8.0E-20
sp|A5WFR0|PIF1_PSYWF ATP-dependent DNA helicase pif1 OS=Psychrobacter sp. (strain PRwf-1) GN=pif1 PE=3 SV=1 1220 1676 2.0E-19
sp|P38766|RRM3_YEAST ATP-dependent DNA helicase RRM3 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=RRM3 PE=1 SV=1 1216 1430 2.0E-16
sp|Q580X6|PIF5_TRYB2 ATP-dependent DNA helicase PIF5 OS=Trypanosoma brucei brucei (strain 927/4 GUTat10.1) GN=PIF5 PE=1 SV=1 1216 1440 2.0E-15
sp|Q59RQ0|PIF1_CANAL ATP-dependent DNA helicase PIF1 OS=Candida albicans (strain SC5314 / ATCC MYA-2876) GN=PIF1 PE=3 SV=1 1242 1429 7.0E-15
sp|Q38CE9|PIF6_TRYB2 ATP-dependent DNA helicase PIF6 OS=Trypanosoma brucei brucei (strain 927/4 GUTat10.1) GN=PIF6 PE=3 SV=1 1220 1673 3.0E-14
sp|Q383A1|PIF3_TRYB2 ATP-dependent DNA helicase PIF3 OS=Trypanosoma brucei brucei (strain 927/4 GUTat10.1) GN=PIF3 PE=3 SV=1 1242 1437 1.0E-10
sp|Q384Y1|PIF1_TRYB2 ATP-dependent DNA helicase PIF1 OS=Trypanosoma brucei brucei (strain 927/4 GUTat10.1) GN=PIF1 PE=1 SV=1 1211 1369 3.0E-09
sp|Q384Y0|PIF2_TRYB2 ATP-dependent DNA helicase PIF2 OS=Trypanosoma brucei brucei (strain 927/4 GUTat10.1) GN=PIF2 PE=1 SV=1 1193 1428 5.0E-09
sp|Q6MHJ5|PIF1_BDEBA ATP-dependent DNA helicase pif1 OS=Bdellovibrio bacteriovorus (strain ATCC 15356 / DSM 50701 / NCIB 9529 / HD100) GN=pif1 PE=3 SV=1 1212 1437 8.0E-08
sp|A6ZM04|PIF1_YEAS7 ATP-dependent DNA helicase PIF1 OS=Saccharomyces cerevisiae (strain YJM789) GN=PIF1 PE=3 SV=1 1612 1677 9.0E-08
sp|P07271|PIF1_YEAST ATP-dependent DNA helicase PIF1 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=PIF1 PE=1 SV=2 1612 1677 9.0E-08
sp|Q381V6|PIF4_TRYB2 ATP-dependent DNA helicase PIF4 OS=Trypanosoma brucei brucei (strain 927/4 GUTat10.1) GN=PIF4 PE=3 SV=1 1242 1366 5.0E-07
sp|Q59RQ0|PIF1_CANAL ATP-dependent DNA helicase PIF1 OS=Candida albicans (strain SC5314 / ATCC MYA-2876) GN=PIF1 PE=3 SV=1 1610 1674 2.0E-06
[Show less]

GO

GO Term Description Terminal node
GO:0003678 DNA helicase activity Yes
GO:0006281 DNA repair Yes
GO:0000723 telomere maintenance Yes
GO:0071840 cellular component organization or biogenesis No
GO:0008152 metabolic process No
GO:0006139 nucleobase-containing compound metabolic process No
GO:0004386 helicase activity No
GO:0140640 catalytic activity, acting on a nucleic acid No
GO:0008150 biological_process No
GO:0006996 organelle organization No
GO:0071704 organic substance metabolic process No
GO:0090304 nucleic acid metabolic process No
GO:0006974 cellular response to DNA damage stimulus No
GO:1901360 organic cyclic compound metabolic process No
GO:0140657 ATP-dependent activity No
GO:0032200 telomere organization No
GO:0051276 chromosome organization No
GO:0008094 ATP-dependent activity, acting on DNA No
GO:0006259 DNA metabolic process No
GO:0003674 molecular_function No
GO:0044260 cellular macromolecule metabolic process No
GO:0016043 cellular component organization No
GO:0051716 cellular response to stimulus No
GO:0044237 cellular metabolic process No
GO:0003824 catalytic activity No
GO:0009987 cellular process No
GO:0006725 cellular aromatic compound metabolic process No
GO:0044238 primary metabolic process No
GO:0046483 heterocycle metabolic process No
GO:0140097 catalytic activity, acting on DNA No
GO:0033554 cellular response to stress No
GO:0006950 response to stress No
GO:0006807 nitrogen compound metabolic process No
GO:0050896 response to stimulus No
GO:0043170 macromolecule metabolic process No
GO:0034641 cellular nitrogen compound metabolic process No

SignalP

[Help with interpreting these statistics]
SignalP signal predicted Location
(based on Ymax)
D score
(significance: > 0.45)
No 1 - 20 0.45

Transmembrane Domains

(None)

Transcription Factor Class

(None)

Expression data

Analysis 1: Developmental stages of Agaricus bisporus (strain A15). Published in Pelkmans et al, Applied Microbiology and Biotechnology, 2016

Click here for more information

Sequences

Type of sequenceSequence
Locus Download genbank file of locus
The gene with 5 kb flanks (if sufficient flanking sequence is available). For use in cloning design programs. NOTE: features (genes or exons) that are only partially contained within the sequence are completely excluded.
Protein >AgabiH97|048820
MTLLHGPIITPLSSPLTSCFLDVAGIKFDSHGQPTDIRLCVSCFGSASRGKTPPLSLSNNTFLGAVPPQLKDLTF
IEEVMIARCRAKVWIVRLQENHEQVDVPISQRGIKGHVIIYPQDTSSIALHLPPSMEEISSFICVLFIGSTVPTR
QWLATKARPLLVRKNKIQDALFWLKEHNSLYANVVIDHDVLRRMGNEAFVPPIHIEHLQNSPASDLLTSRYDLDP
AEHNNSGDVYPAVVVTDVDIVAPAHELRVAALRHIKGGGSYFQIPHSSQASREFYNPSLLPLTYPTLFPFGIGGF
EDEERQVSLSFERHVKHAMSLSDRRFQEHYSFPFTVFNIIQRRKALLNTSLKVKRSDFDRVAKTFATVSVETLGK
VTQKVISEGTFSLKTDEERKVMSLMRQVNSVTSGVYGSAAERMNMRNELRALVIEKGLPSFYLTINPADVYNPIV
RLLSGEKLDIDSMFISTSSTYWAQASLVAKNPFLAAQFFNIIMITYIRNILGYQAVKGVNIKNGILGVVKAYYGC
VEAQGRGTLHCHMVIWIEGALNSQQIYEKICVSKNLQFRDRLLAFMDASISNHLPPDPDPSLQIPSSCYHPCSSL
PLDRNNFSNDEEYRVALDKDLHHLVSRSQIHKHSFTCYKYWPGPPAPRICRFGLGDGEVCNASWFNEDTGELELK
RSDLYVNNYNRLSTLAIRSNSDIKWILSGPSARSIFYYVSEYITKHDLKPHIAYAALEVGSRKLGNFDSQEDDIL
FRGKRLLQRCAYAMISHQELSAQQVVSYLCGFEDHFTSHAYRNFYWLSFENLVNKEWPSPECVGTHASSNDEDTH
NNATIADEVNDNMSREEDDSEAEIGFSVYEGQLQANSPQVVNYQLRSVLLEHLSLWEFVATADKVRLRKESNQAP
IIGSVHSSSSINDVQQTGSTFRFLPCHPQYDTHGITLRSSLHRLVPVPRGPPIPRRDQKSSVQRHARLMLIFFKP
WRHASDLRSSESSWYQNFSVFLNSVSRRVKDIIDNMQVLHECKDSRDDHFEKLRSSYRHSTSRSTGDGRRMEDDF
TGSDNYEQIIMEHLENIDKCHSERIDNTLRKSIACLEALESCGIIDVVDVPTIDTTPLGVTLSIPTESNNSTSFA
VRADDQQLEDIWKEAYDERKSIWKKHASQPEEERHGSEQPTVGSVSQASDSTTLFNTPVPLRMTFAQDINQESRN
LSEEKTNINRLVDNFTLNHEQACAFRIIAEHCNTHCGEQLRMYIAGEGGTGKSRILSTIQEFFQQRHESRRFRMA
SYTGVAAQNIDGMTLHTALHLGTNQRHTHDTHSNRDLVAMWEGVDYLFIDEVSMLGCSFLADINQALIRAKGKTS
NFGGINVIFAGDFCQLPPVGQIRLFSSSNTGPDRPKDHDGQMLWQSVDTVVILKQVMRQSGRDNDQFVSLLDRLR
LGTCSESNYELLMSRLPTLVQPDWDTWSQAPILVNDNVTKDLLNEQAAITYARKTGKRLQWFYSIDTWQDLPLRD
LHLQEYLWSLPSSKTNHRLGRLPLVEGMPVMITLNYDVSAGIVNGSMGSLLSVRYIINSNGNHEAISCVVKCPNV
TGHQLPSLDVGDAAVLQDTVNVNFVHPFSSLKCSMKRRQLPLVPAFALTVHKAQGQTLPRAIVDLSTCIGTESPY
VMLSRVQSLSGLMVIQPFDRNKISVRQSEDLRKELLRLRVMEIMSRIKLEYSRQAYDELTDLMCHFPPAIDFPGV
QDNLFKTLTYNDYTTLCARLPVLDSHAPLLEHSVTTLKRMLRLQRRQRLEGE*
Coding >AgabiH97|048820
ATGACTTTACTTCATGGTCCTATTATCACGCCACTATCTTCTCCTCTGACTTCATGTTTTCTTGATGTTGCAGGT
ATCAAATTTGATAGTCATGGACAACCTACTGATATTCGTTTGTGTGTTTCATGTTTTGGATCAGCTAGTCGTGGT
AAAACACCTCCTTTGTCTTTGTCAAATAACACTTTTTTGGGTGCCGTTCCTCCTCAGCTAAAAGATCTTACTTTT
ATAGAGGAAGTTATGATTGCTCGCTGCCGAGCTAAAGTGTGGATTGTCAGGCTTCAGGAAAATCATGAGCAGGTC
GATGTTCCGATTTCCCAGAGAGGTATCAAAGGCCATGTGATCATTTATCCTCAAGATACGTCGTCAATCGCTTTG
CACTTGCCACCTTCTATGGAAGAGATTTCGTCATTCATATGTGTTCTTTTTATTGGATCAACTGTTCCTACAAGA
CAATGGCTTGCAACAAAGGCTAGACCACTGTTAGTTCGAAAAAATAAAATTCAGGATGCATTGTTTTGGTTGAAG
GAACACAATTCTTTATATGCAAATGTTGTTATTGATCATGATGTATTGCGTCGCATGGGAAATGAGGCGTTCGTA
CCACCAATTCATATAGAACACTTGCAGAACAGTCCTGCTTCTGACCTTTTAACTTCACGCTATGACTTGGACCCT
GCTGAACATAATAATTCAGGCGATGTATATCCTGCCGTTGTTGTTACTGATGTTGATATTGTGGCACCAGCACAT
GAATTACGGGTGGCTGCATTACGACACATTAAAGGTGGTGGATCGTATTTCCAGATTCCTCACAGTTCACAGGCG
TCACGCGAGTTTTACAATCCATCTTTGCTTCCACTTACTTATCCCACGTTGTTTCCCTTTGGTATAGGTGGATTT
GAAGATGAAGAGCGACAAGTTTCTCTTAGCTTTGAACGCCATGTTAAACATGCTATGTCTTTGTCAGATCGCCGC
TTTCAGGAGCACTATTCTTTTCCTTTTACAGTTTTTAACATCATTCAACGGCGCAAGGCTTTGTTAAACACTAGT
CTGAAAGTTAAGCGATCAGACTTTGATCGTGTCGCGAAGACATTTGCGACTGTGTCAGTTGAAACATTAGGAAAA
GTGACCCAGAAAGTAATAAGTGAAGGAACTTTTTCACTGAAAACAGATGAGGAAAGAAAAGTTATGTCATTGATG
CGACAAGTTAACTCGGTGACATCTGGTGTCTACGGATCTGCTGCAGAGCGAATGAATATGCGGAATGAACTGCGA
GCTCTTGTGATTGAAAAAGGTTTACCATCCTTTTATCTTACTATAAATCCTGCTGATGTCTATAATCCTATTGTG
CGTCTTCTTTCAGGTGAGAAGTTAGATATAGATAGCATGTTCATATCAACATCGTCTACATATTGGGCCCAGGCA
TCGTTAGTTGCTAAGAATCCCTTTTTAGCTGCACAATTTTTCAATATCATTATGATTACATACATACGAAATATA
TTGGGATATCAGGCCGTCAAGGGTGTCAATATCAAAAATGGAATCTTGGGAGTAGTAAAAGCATATTACGGATGT
GTCGAAGCACAGGGGAGAGGCACATTGCATTGCCATATGGTCATTTGGATTGAAGGAGCATTGAATTCTCAGCAA
ATATATGAAAAGATATGTGTGTCTAAAAATTTGCAATTTCGCGATCGCTTGTTGGCGTTTATGGATGCTAGTATT
TCAAATCATCTGCCTCCAGATCCTGATCCATCTCTTCAAATTCCATCATCTTGCTATCATCCTTGTTCTTCATTG
CCACTGGATCGAAACAATTTTTCAAATGATGAGGAATATCGAGTGGCTCTTGACAAGGATTTGCACCACCTTGTT
AGTCGGTCTCAAATTCATAAACATTCGTTCACATGTTACAAATATTGGCCAGGTCCTCCAGCCCCACGCATATGT
CGTTTTGGATTGGGTGACGGTGAAGTGTGCAATGCAAGCTGGTTTAATGAAGACACAGGTGAACTTGAGTTGAAA
CGGTCTGACTTATACGTTAACAATTACAACAGGTTATCAACTTTAGCAATTCGTTCTAATTCTGACATTAAGTGG
ATTTTATCAGGTCCCAGTGCGAGATCAATATTTTACTACGTGTCGGAATATATCACGAAACATGATTTGAAACCT
CATATAGCCTATGCAGCTTTGGAAGTGGGCTCACGGAAACTTGGGAATTTTGACTCTCAAGAAGATGATATATTA
TTCCGAGGTAAACGGTTATTACAACGTTGTGCTTACGCCATGATATCGCATCAAGAATTGTCTGCTCAACAAGTA
GTGTCTTATTTATGTGGATTTGAAGACCATTTCACTAGTCATGCATACAGGAATTTTTATTGGCTGTCTTTTGAG
AATTTAGTAAATAAAGAATGGCCTTCTCCCGAGTGCGTCGGTACCCATGCTTCTTCAAATGATGAGGATACTCAT
AATAATGCCACTATTGCTGATGAGGTTAATGATAATATGTCACGCGAAGAAGATGATAGTGAAGCAGAGATCGGT
TTTAGTGTTTATGAAGGACAATTACAGGCCAATAGTCCTCAAGTAGTTAATTATCAATTACGAAGTGTACTTCTG
GAGCACCTGTCCCTTTGGGAATTTGTAGCTACTGCTGATAAGGTTCGTTTACGGAAAGAGTCCAATCAAGCACCA
ATCATTGGTAGTGTTCATTCTTCGTCTTCCATTAATGATGTTCAACAGACAGGTTCTACATTTCGTTTTTTACCT
TGCCATCCTCAATATGATACCCATGGCATCACTCTCAGATCATCATTACATCGCTTAGTTCCTGTTCCTCGGGGT
CCACCGATTCCTAGAAGAGACCAGAAATCCAGTGTGCAGCGGCATGCACGGTTGATGCTTATCTTTTTTAAGCCA
TGGCGTCATGCTTCTGATTTAAGAAGTTCAGAATCGTCATGGTATCAAAATTTTTCTGTGTTTTTAAATTCAGTA
AGCCGCAGAGTCAAAGATATCATTGACAATATGCAGGTTTTGCATGAATGTAAAGATAGTCGGGACGACCACTTT
GAAAAATTACGGTCATCGTATCGCCACAGCACTTCTCGCTCTACTGGTGATGGTAGACGCATGGAGGACGACTTT
ACTGGATCAGACAATTATGAGCAGATCATAATGGAACATTTGGAAAATATTGATAAGTGTCATAGTGAACGGATA
GATAACACATTACGCAAGTCCATAGCCTGCCTTGAAGCGTTGGAATCATGTGGAATCATTGATGTTGTTGACGTC
CCGACCATAGACACAACACCATTGGGCGTTACATTGTCAATACCAACAGAATCAAACAACTCAACAAGTTTTGCT
GTGCGTGCCGATGACCAACAGTTAGAGGATATATGGAAGGAAGCATACGATGAGAGAAAAAGTATATGGAAAAAA
CATGCTAGTCAACCAGAAGAAGAAAGACATGGTTCTGAACAGCCTACTGTAGGCTCAGTTTCTCAAGCTTCAGAT
TCGACAACTTTGTTTAATACGCCTGTACCATTAAGAATGACATTTGCCCAGGATATAAACCAGGAAAGTAGAAAT
TTATCTGAAGAAAAGACTAATATCAATCGACTTGTAGATAACTTCACCCTCAACCATGAACAAGCTTGTGCGTTC
AGAATCATAGCAGAACATTGTAATACTCATTGCGGAGAACAATTGCGAATGTATATTGCTGGCGAAGGAGGTACG
GGAAAATCACGTATCTTAAGTACAATCCAAGAATTTTTCCAACAAAGACATGAATCCCGTCGGTTTCGCATGGCC
TCTTATACTGGTGTCGCTGCACAAAATATAGATGGAATGACGTTGCATACTGCTTTGCATTTAGGAACCAATCAA
CGGCACACCCATGATACCCATTCTAATAGGGATTTAGTCGCCATGTGGGAAGGCGTTGATTACCTTTTCATTGAT
GAAGTGTCTATGCTTGGTTGTTCTTTTTTGGCTGACATAAATCAAGCCTTAATACGGGCTAAAGGCAAAACCTCA
AACTTTGGAGGAATCAATGTTATCTTTGCAGGTGATTTTTGCCAACTACCACCAGTAGGCCAGATTCGCCTCTTT
AGCAGTAGTAACACAGGACCTGATAGGCCAAAAGATCATGACGGTCAAATGTTGTGGCAATCGGTGGATACTGTA
GTTATACTCAAACAAGTCATGAGACAATCAGGTAGGGATAATGATCAATTTGTAAGCTTGCTTGACAGGCTTCGG
CTGGGAACTTGCTCTGAAAGCAATTATGAATTACTTATGTCGCGATTGCCAACATTGGTTCAGCCAGATTGGGAT
ACGTGGTCACAAGCACCTATTCTTGTCAATGATAATGTCACAAAGGATCTATTGAATGAGCAGGCTGCCATTACA
TATGCCCGTAAAACTGGCAAGCGCTTACAATGGTTTTATTCTATTGACACATGGCAGGATTTACCTTTGCGGGAT
TTGCATTTACAGGAATACTTATGGTCACTGCCTTCGAGCAAAACCAACCATAGACTGGGGCGCTTACCTTTGGTT
GAAGGAATGCCAGTAATGATCACATTGAATTATGACGTTTCTGCAGGAATCGTAAACGGTTCGATGGGTTCATTA
CTGTCTGTCCGATACATTATCAATTCAAATGGAAATCATGAGGCGATTTCATGCGTTGTAAAATGCCCTAACGTC
ACAGGCCACCAGCTTCCTTCATTGGACGTTGGTGATGCAGCTGTTTTACAAGACACTGTCAATGTAAATTTTGTT
CATCCTTTTTCATCTCTCAAATGTTCAATGAAGCGTCGCCAACTTCCTCTTGTACCTGCTTTCGCTCTCACAGTA
CACAAAGCCCAAGGACAAACACTTCCACGTGCAATAGTAGATCTTTCAACATGTATCGGTACTGAGTCACCGTAT
GTTATGCTTTCAAGAGTACAGTCGCTAAGTGGTCTTATGGTTATTCAACCATTTGATAGAAACAAAATAAGTGTT
CGCCAAAGTGAAGATCTGCGTAAAGAATTACTGAGATTACGAGTCATGGAAATCATGTCACGCATTAAGTTGGAG
TATTCTAGACAAGCCTATGACGAATTGACCGATTTGATGTGCCACTTTCCGCCAGCAATTGACTTTCCTGGCGTT
CAAGATAATTTATTCAAAACTCTCACATACAATGATTACACTACTCTCTGCGCACGGTTACCCGTACTTGACTCT
CATGCTCCACTATTGGAGCACTCTGTTACGACCTTAAAGCGAATGCTCAGGTTGCAGCGAAGGCAAAGACTTGAG
GGCGAGTAA
Transcript >AgabiH97|048820
ATGACTTTACTTCATGGTCCTATTATCACGCCACTATCTTCTCCTCTGACTTCATGTTTTCTTGATGTTGCAGGT
ATCAAATTTGATAGTCATGGACAACCTACTGATATTCGTTTGTGTGTTTCATGTTTTGGATCAGCTAGTCGTGGT
AAAACACCTCCTTTGTCTTTGTCAAATAACACTTTTTTGGGTGCCGTTCCTCCTCAGCTAAAAGATCTTACTTTT
ATAGAGGAAGTTATGATTGCTCGCTGCCGAGCTAAAGTGTGGATTGTCAGGCTTCAGGAAAATCATGAGCAGGTC
GATGTTCCGATTTCCCAGAGAGGTATCAAAGGCCATGTGATCATTTATCCTCAAGATACGTCGTCAATCGCTTTG
CACTTGCCACCTTCTATGGAAGAGATTTCGTCATTCATATGTGTTCTTTTTATTGGATCAACTGTTCCTACAAGA
CAATGGCTTGCAACAAAGGCTAGACCACTGTTAGTTCGAAAAAATAAAATTCAGGATGCATTGTTTTGGTTGAAG
GAACACAATTCTTTATATGCAAATGTTGTTATTGATCATGATGTATTGCGTCGCATGGGAAATGAGGCGTTCGTA
CCACCAATTCATATAGAACACTTGCAGAACAGTCCTGCTTCTGACCTTTTAACTTCACGCTATGACTTGGACCCT
GCTGAACATAATAATTCAGGCGATGTATATCCTGCCGTTGTTGTTACTGATGTTGATATTGTGGCACCAGCACAT
GAATTACGGGTGGCTGCATTACGACACATTAAAGGTGGTGGATCGTATTTCCAGATTCCTCACAGTTCACAGGCG
TCACGCGAGTTTTACAATCCATCTTTGCTTCCACTTACTTATCCCACGTTGTTTCCCTTTGGTATAGGTGGATTT
GAAGATGAAGAGCGACAAGTTTCTCTTAGCTTTGAACGCCATGTTAAACATGCTATGTCTTTGTCAGATCGCCGC
TTTCAGGAGCACTATTCTTTTCCTTTTACAGTTTTTAACATCATTCAACGGCGCAAGGCTTTGTTAAACACTAGT
CTGAAAGTTAAGCGATCAGACTTTGATCGTGTCGCGAAGACATTTGCGACTGTGTCAGTTGAAACATTAGGAAAA
GTGACCCAGAAAGTAATAAGTGAAGGAACTTTTTCACTGAAAACAGATGAGGAAAGAAAAGTTATGTCATTGATG
CGACAAGTTAACTCGGTGACATCTGGTGTCTACGGATCTGCTGCAGAGCGAATGAATATGCGGAATGAACTGCGA
GCTCTTGTGATTGAAAAAGGTTTACCATCCTTTTATCTTACTATAAATCCTGCTGATGTCTATAATCCTATTGTG
CGTCTTCTTTCAGGTGAGAAGTTAGATATAGATAGCATGTTCATATCAACATCGTCTACATATTGGGCCCAGGCA
TCGTTAGTTGCTAAGAATCCCTTTTTAGCTGCACAATTTTTCAATATCATTATGATTACATACATACGAAATATA
TTGGGATATCAGGCCGTCAAGGGTGTCAATATCAAAAATGGAATCTTGGGAGTAGTAAAAGCATATTACGGATGT
GTCGAAGCACAGGGGAGAGGCACATTGCATTGCCATATGGTCATTTGGATTGAAGGAGCATTGAATTCTCAGCAA
ATATATGAAAAGATATGTGTGTCTAAAAATTTGCAATTTCGCGATCGCTTGTTGGCGTTTATGGATGCTAGTATT
TCAAATCATCTGCCTCCAGATCCTGATCCATCTCTTCAAATTCCATCATCTTGCTATCATCCTTGTTCTTCATTG
CCACTGGATCGAAACAATTTTTCAAATGATGAGGAATATCGAGTGGCTCTTGACAAGGATTTGCACCACCTTGTT
AGTCGGTCTCAAATTCATAAACATTCGTTCACATGTTACAAATATTGGCCAGGTCCTCCAGCCCCACGCATATGT
CGTTTTGGATTGGGTGACGGTGAAGTGTGCAATGCAAGCTGGTTTAATGAAGACACAGGTGAACTTGAGTTGAAA
CGGTCTGACTTATACGTTAACAATTACAACAGGTTATCAACTTTAGCAATTCGTTCTAATTCTGACATTAAGTGG
ATTTTATCAGGTCCCAGTGCGAGATCAATATTTTACTACGTGTCGGAATATATCACGAAACATGATTTGAAACCT
CATATAGCCTATGCAGCTTTGGAAGTGGGCTCACGGAAACTTGGGAATTTTGACTCTCAAGAAGATGATATATTA
TTCCGAGGTAAACGGTTATTACAACGTTGTGCTTACGCCATGATATCGCATCAAGAATTGTCTGCTCAACAAGTA
GTGTCTTATTTATGTGGATTTGAAGACCATTTCACTAGTCATGCATACAGGAATTTTTATTGGCTGTCTTTTGAG
AATTTAGTAAATAAAGAATGGCCTTCTCCCGAGTGCGTCGGTACCCATGCTTCTTCAAATGATGAGGATACTCAT
AATAATGCCACTATTGCTGATGAGGTTAATGATAATATGTCACGCGAAGAAGATGATAGTGAAGCAGAGATCGGT
TTTAGTGTTTATGAAGGACAATTACAGGCCAATAGTCCTCAAGTAGTTAATTATCAATTACGAAGTGTACTTCTG
GAGCACCTGTCCCTTTGGGAATTTGTAGCTACTGCTGATAAGGTTCGTTTACGGAAAGAGTCCAATCAAGCACCA
ATCATTGGTAGTGTTCATTCTTCGTCTTCCATTAATGATGTTCAACAGACAGGTTCTACATTTCGTTTTTTACCT
TGCCATCCTCAATATGATACCCATGGCATCACTCTCAGATCATCATTACATCGCTTAGTTCCTGTTCCTCGGGGT
CCACCGATTCCTAGAAGAGACCAGAAATCCAGTGTGCAGCGGCATGCACGGTTGATGCTTATCTTTTTTAAGCCA
TGGCGTCATGCTTCTGATTTAAGAAGTTCAGAATCGTCATGGTATCAAAATTTTTCTGTGTTTTTAAATTCAGTA
AGCCGCAGAGTCAAAGATATCATTGACAATATGCAGGTTTTGCATGAATGTAAAGATAGTCGGGACGACCACTTT
GAAAAATTACGGTCATCGTATCGCCACAGCACTTCTCGCTCTACTGGTGATGGTAGACGCATGGAGGACGACTTT
ACTGGATCAGACAATTATGAGCAGATCATAATGGAACATTTGGAAAATATTGATAAGTGTCATAGTGAACGGATA
GATAACACATTACGCAAGTCCATAGCCTGCCTTGAAGCGTTGGAATCATGTGGAATCATTGATGTTGTTGACGTC
CCGACCATAGACACAACACCATTGGGCGTTACATTGTCAATACCAACAGAATCAAACAACTCAACAAGTTTTGCT
GTGCGTGCCGATGACCAACAGTTAGAGGATATATGGAAGGAAGCATACGATGAGAGAAAAAGTATATGGAAAAAA
CATGCTAGTCAACCAGAAGAAGAAAGACATGGTTCTGAACAGCCTACTGTAGGCTCAGTTTCTCAAGCTTCAGAT
TCGACAACTTTGTTTAATACGCCTGTACCATTAAGAATGACATTTGCCCAGGATATAAACCAGGAAAGTAGAAAT
TTATCTGAAGAAAAGACTAATATCAATCGACTTGTAGATAACTTCACCCTCAACCATGAACAAGCTTGTGCGTTC
AGAATCATAGCAGAACATTGTAATACTCATTGCGGAGAACAATTGCGAATGTATATTGCTGGCGAAGGAGGTACG
GGAAAATCACGTATCTTAAGTACAATCCAAGAATTTTTCCAACAAAGACATGAATCCCGTCGGTTTCGCATGGCC
TCTTATACTGGTGTCGCTGCACAAAATATAGATGGAATGACGTTGCATACTGCTTTGCATTTAGGAACCAATCAA
CGGCACACCCATGATACCCATTCTAATAGGGATTTAGTCGCCATGTGGGAAGGCGTTGATTACCTTTTCATTGAT
GAAGTGTCTATGCTTGGTTGTTCTTTTTTGGCTGACATAAATCAAGCCTTAATACGGGCTAAAGGCAAAACCTCA
AACTTTGGAGGAATCAATGTTATCTTTGCAGGTGATTTTTGCCAACTACCACCAGTAGGCCAGATTCGCCTCTTT
AGCAGTAGTAACACAGGACCTGATAGGCCAAAAGATCATGACGGTCAAATGTTGTGGCAATCGGTGGATACTGTA
GTTATACTCAAACAAGTCATGAGACAATCAGGTAGGGATAATGATCAATTTGTAAGCTTGCTTGACAGGCTTCGG
CTGGGAACTTGCTCTGAAAGCAATTATGAATTACTTATGTCGCGATTGCCAACATTGGTTCAGCCAGATTGGGAT
ACGTGGTCACAAGCACCTATTCTTGTCAATGATAATGTCACAAAGGATCTATTGAATGAGCAGGCTGCCATTACA
TATGCCCGTAAAACTGGCAAGCGCTTACAATGGTTTTATTCTATTGACACATGGCAGGATTTACCTTTGCGGGAT
TTGCATTTACAGGAATACTTATGGTCACTGCCTTCGAGCAAAACCAACCATAGACTGGGGCGCTTACCTTTGGTT
GAAGGAATGCCAGTAATGATCACATTGAATTATGACGTTTCTGCAGGAATCGTAAACGGTTCGATGGGTTCATTA
CTGTCTGTCCGATACATTATCAATTCAAATGGAAATCATGAGGCGATTTCATGCGTTGTAAAATGCCCTAACGTC
ACAGGCCACCAGCTTCCTTCATTGGACGTTGGTGATGCAGCTGTTTTACAAGACACTGTCAATGTAAATTTTGTT
CATCCTTTTTCATCTCTCAAATGTTCAATGAAGCGTCGCCAACTTCCTCTTGTACCTGCTTTCGCTCTCACAGTA
CACAAAGCCCAAGGACAAACACTTCCACGTGCAATAGTAGATCTTTCAACATGTATCGGTACTGAGTCACCGTAT
GTTATGCTTTCAAGAGTACAGTCGCTAAGTGGTCTTATGGTTATTCAACCATTTGATAGAAACAAAATAAGTGTT
CGCCAAAGTGAAGATCTGCGTAAAGAATTACTGAGATTACGAGTCATGGAAATCATGTCACGCATTAAGTTGGAG
TATTCTAGACAAGCCTATGACGAATTGACCGATTTGATGTGCCACTTTCCGCCAGCAATTGACTTTCCTGGCGTT
CAAGATAATTTATTCAAAACTCTCACATACAATGATTACACTACTCTCTGCGCACGGTTACCCGTACTTGACTCT
CATGCTCCACTATTGGAGCACTCTGTTACGACCTTAAAGCGAATGCTCAGGTTGCAGCGAAGGCAAAGACTTGAG
GGCGAGTAA
Gene >AgabiH97|048820
ATGACTTTACTTCATGGTCCTATTATCACGCCACTATCTTCTCCTCTGACTTCATGTTTTCTTGATGTTGCAGGT
ATCAAATTTGATAGTCATGGACAACCTACTGATATTCGTTTGTGTGTTTCATGTTTTGGATCAGCTAGTCGTGGT
AAAACACCTCCTTTGTCTTTGTCAAATAACACTTTTTTGGGTGCCGTTCCTCCTCAGCTAAAAGATCTTACTTTT
ATAGAGGAAGTTATGATTGCTCGCTGCCGAGCTAAAGTGTGGATTGTCAGGCTTCAGGAAAATCATGAGCAGGTC
GATGTTCCGATTTCCCAGAGAGGTATCAAAGGCCATGTGATCATTTATCCTCAAGATACGTCGTCAATCGCTTTG
CACTTGCCACCTTCTATGGAAGAGATTTCGTCATTCATATGTGTTCTTTTTATTGGATCAACTGTTCCTACAAGA
CAATGGCTTGCAACAAAGGCTAGACCACTGTTAGTTCGAAAAAATAAAATTCAGGATGCATTGTTTTGGTTGAAG
GAACACAATTCTTTATATGCAAATGTTGTTATTGATCATGATGTATTGCGTCGCATGGGAAATGAGGCGTTCGTA
CCACCAATTCATATAGAACACTTGCAGAACAGTCCTGCTTCTGACCTTTTAACTTCACGCTATGACTTGGACCCT
GCTGAACATAATAATTCAGGCGATGTATATCCTGCCGTTGTTGTTACTGATGTTGATATTGTGGCACCAGCACAT
GAATTACGGGTGGCTGCATTACGACACATTAAAGGTGGTGGATCGTATTTCCAGATTCCTCACAGTTCACAGGCG
TCACGCGAGTTTTACAATCCATCTTTGCTTCCACTTACTTATCCCACGTTGTTTCCCTTTGGTATAGGTGGATTT
GAAGATGAAGAGCGACAAGTTTCTCTTAGCTTTGAACGCCATGTTAAACATGCTATGTCTTTGTCAGATCGCCGC
TTTCAGGAGCACTATTCTTTTCCTTTTACAGTTTTTAACATCATTCAACGGCGCAAGGCTTTGTTAAACACTAGT
CTGAAAGTTAAGCGATCAGACTTTGATCGTGTCGCGAAGACATTTGCGACTGTGTCAGTTGAAACATTAGGAAAA
GTGACCCAGAAAGTAATAAGTGAAGGAACTTTTTCACTGAAAACAGATGAGGAAAGAAAAGTTATGTCATTGATG
CGACAAGTTAACTCGGTGACATCTGGTGTCTACGGATCTGCTGCAGAGCGAATGAATATGCGGAATGAACTGCGA
GCTCTTGTGATTGAAAAAGGTTTACCATCCTTTTATCTTACTATAAATCCTGCTGATGTCTATAATCCTATTGTG
CGTCTTCTTTCAGGTGAGAAGTTAGATATAGATAGCATGTTCATATCAACATCGTCTACATATTGGGCCCAGGCA
TCGTTAGTTGCTAAGAATCCCTTTTTAGCTGCACAATTTTTCAATATCATTATGATTACATACATACGAAATATA
TTGGGATATCAGGCCGTCAAGGGTGTCAATATCAAAAATGGAATCTTGGGAGTAGTAAAAGCATATTACGGATGT
GTCGAAGCACAGGGGAGAGGCACATTGCATTGCCATATGGTCATTTGGATTGAAGGAGCATTGAATTCTCAGCAA
ATATATGAAAAGATATGTGTGTCTAAAAATTTGCAATTTCGCGATCGCTTGTTGGCGTTTATGGATGCTAGTATT
TCAAATCATCTGCCTCCAGATCCTGATCCATCTCTTCAAATTCCATCATCTTGCTATCATCCTTGTTCTTCATTG
CCACTGGATCGAAACAATTTTTCAAATGATGAGGAATATCGAGTGGCTCTTGACAAGGATTTGCACCACCTTGTT
AGTCGGTCTCAAATTCATAAACATTCGTTCACATGTTACAAATATTGGCCAGGTCCTCCAGCCCCACGCATATGT
CGTTTTGGATTGGGTGACGGTGAAGTGTGCAATGCAAGCTGGTTTAATGAAGACACAGGTGAACTTGAGTTGAAA
CGGTCTGACTTATACGTTAACAATTACAACAGGTTATCAACTTTAGCAATTCGTTCTAATTCTGACATTAAGTGG
ATTTTATCAGGTCCCAGTGCGAGATCAATATTTTACTACGTGTCGGAATATATCACGAAACATGATTTGAAACCT
CATATAGCCTATGCAGCTTTGGAAGTGGGCTCACGGAAACTTGGGAATTTTGACTCTCAAGAAGATGATATATTA
TTCCGAGGTAAACGGTTATTACAACGTTGTGCTTACGCCATGATATCGCATCAAGAATTGTCTGCTCAACAAGTA
GTGTCTTATTTATGTGGATTTGAAGACCATTTCACTAGTCATGCATACAGGAATTTTTATTGGCTGTCTTTTGAG
AATTTAGTAAATAAAGAATGGCCTTCTCCCGAGTGCGTCGGTACCCATGCTTCTTCAAATGATGAGGATACTCAT
AATAATGCCACTATTGCTGATGAGGTTAATGATAATATGTCACGCGAAGAAGATGATAGTGAAGCAGAGATCGGT
TTTAGTGTTTATGAAGGACAATTACAGGCCAATAGTCCTCAAGTAGTTAATTATCAATTACGAAGTGTACTTCTG
GAGCACCTGTCCCTTTGGGAATTTGTAGCTACTGCTGATAAGGTTCGTTTACGGAAAGAGTCCAATCAAGCACCA
ATCATTGGTAGTGTTCATTCTTCGTCTTCCATTAATGATGTTCAACAGACAGGTTCTACATTTCGTTTTTTACCT
TGCCATCCTCAATATGATACCCATGGCATCACTCTCAGATCATCATTACATCGCTTAGTTCCTGTTCCTCGGGGT
CCACCGATTCCTAGAAGAGACCAGAAATCCAGTGTGCAGCGGCATGCACGGTTGATGCTTATCTTTTTTAAGCCA
TGGCGTCATGCTTCTGATTTAAGAAGTTCAGAATCGTCATGGTATCAAAATTTTTCTGTGTTTTTAAATTCAGTA
AGCCGCAGAGTCAAAGATATCATTGACAATATGCAGGTTTTGCATGAATGTAAAGATAGTCGGGACGACCACTTT
GAAAAATTACGGTCATCGTATCGCCACAGCACTTCTCGCTCTACTGGTGATGGTAGACGCATGGAGGACGACTTT
ACTGGATCAGACAATTATGAGCAGATCATAATGGAACATTTGGAAAATATTGATAAGTGTCATAGTGAACGGATA
GATAACACATTACGCAAGTCCATAGCCTGCCTTGAAGCGTTGGAATCATGTGGAATCATTGATGTTGTTGACGTC
CCGACCATAGACACAACACCATTGGGCGTTACATTGTCAATACCAACAGAATCAAACAACTCAACAAGTTTTGCT
GTGCGTGCCGATGACCAACAGTTAGAGGATATATGGAAGGAAGCATACGATGAGAGAAAAAGTATATGGAAAAAA
CATGCTAGTCAACCAGAAGAAGAAAGACATGGTTCTGAACAGCCTACTGTAGGCTCAGTTTCTCAAGCTTCAGAT
TCGACAACTTTGTTTAATACGCCTGTACCATTAAGAATGACATTTGCCCAGGATATAAACCAGGAAAGTAGAAAT
TTATCTGAAGAAAAGACTAATATCAATCGACTTGTAGATAACTTCACCCTCAACCATGAACAAGCTTGTGCGTTC
AGAATCATAGCAGAACATTGTAATACTCATTGCGGAGAACAATTGCGAATGTATATTGCTGGCGAAGGAGGTACG
GGAAAATCACGTATCTTAAGTACAATCCAAGAATTTTTCCAACAAAGACATGAATCCCGTCGGTTTCGCATGGCC
TCTTATACTGGTGTCGCTGCACAAAATATAGATGGAATGACGTTGCATACTGCTTTGCATTTAGGAACCAATCAA
CGGCACACCCATGATACCCATTCTAATAGGGATTTAGTCGCCATGTGGGAAGGCGTTGATTACCTTTTCATTGAT
GAAGTGTCTATGCTTGGTTGTTCTTTTTTGGCTGACATAAATCAAGCCTTAATACGGGCTAAAGGCAAAACCTCA
AACTTTGGAGGAATCAATGTTATCTTTGCAGGTGATTTTTGCCAACTACCACCAGTAGGCCAGATTCGCCTCTTT
AGCAGTAGTAACACAGGACCTGATAGGCCAAAAGATCATGACGGTCAAATGTTGTGGCAATCGGTGGATACTGTA
GTTATACTCAAACAAGTCATGAGACAATCAGGTAGGGATAATGATCAATTTGTAAGCTTGCTTGACAGGCTTCGG
CTGGGAACTTGCTCTGAAAGCAATTATGAATTACTTATGTCGCGATTGCCAACATTGGTTCAGCCAGATTGGGAT
ACGTGGTCACAAGCACCTATTCTTGTCAATGATAATGTCACAAAGGATCTATTGAATGAGCAGGCTGCCATTACA
TATGCCCGTAAAACTGGCAAGCGCTTACAATGGTTTTATTCTATTGACACATGGCAGGATTTACCTTTGCGGGAT
TTGCATTTACAGGAATACTTATGGTCACTGCCTTCGAGCAAAACCAACCATAGACTGGGGCGCTTACCTTTGGTT
GAAGGAATGCCAGTAATGATCACATTGAATTATGACGTTTCTGCAGGAATCGTAAACGGTTCGATGGGTTCATTA
CTGTCTGTCCGATACATTATCAATTCAAATGGAAATCATGAGGCGATTTCATGCGTTGTAAAATGCCCTAACGTC
ACAGGCCACCAGCTTCCTTCATTGGACGTTGGTGATGCAGCTGTTTTACAAGACACTGTCAATGTAAATTTTGTT
CATCCTTTTTCATCTCTCAAATGTTCAATGAAGCGTCGCCAACTTCCTCTTGTACCTGCTTTCGCTCTCACAGTA
CACAAAGCCCAAGGACAAACACTTCCACGTGCAATAGTAGATCTTTCAACATGTATCGGTACTGAGTCACCGTAT
GTTATGCTTTCAAGAGTACAGTCGCTAAGTGGTCTTATGGTTATTCAACCATTTGATAGAAACAAAATAAGTGTT
CGCCAAAGTGAAGATCTGCGTAAAGAATTACTGAGATTACGAGTCATGGAAATCATGTCACGCATTAAGTTGGAG
TATTCTAGACAAGCCTATGACGAATTGACCGATTTGATGTGCCACTTTCCGCCAGCAATTGACTTTCCTGGCGTT
CAAGATAATTTATTCAAAACTCTCACATACAATGATTACACTACTCTCTGCGCACGGTTACCCGTACTTGACTCT
CATGCTCCACTATTGGAGCACTCTGTTACGACCTTAAAGCGAATGCTCAGGTTGCAGCGAAGGCAAAGACTTGAG
GGCGAGTAA

© 2022 - Robin Ohm - Utrecht University - The Netherlands

Built with Python Django and Wagtail