Fungal Genomics

at Utrecht University

General Properties

Protein IDAgabi119p4|758450
Gene name
Locationscaffold_11:1218923..1222755
Strand-
Gene length (bp)3832
Transcript length (bp)3714
Coding sequence length (bp)3714
Protein length (aa) 1238

Your browser does not support drawing a protein figure.

PFAM Domains

PFAM Domain ID Short name Long name E-value Start End
PF17917 RT_RNaseH RNase H-like domain found in reverse transcriptase 2.3E-38 508 610
PF17919 RT_RNaseH_2 RNase H-like domain found in reverse transcriptase 1.7E-37 480 576
PF00078 RVT_1 Reverse transcriptase (RNA-dependent DNA polymerase) 1.4E-18 255 414
PF17921 Integrase_H2C2 Integrase zinc binding domain 3.1E-19 704 760
PF00665 rve Integrase core domain 1.2E-11 778 874
PF08284 RVP_2 Retroviral aspartyl protease 3.2E-09 33 132
PF00385 Chromo Chromo (CHRromatin Organisation MOdifier) domain 1.0E-07 1074 1114
PF13975 gag-asp_proteas gag-polyprotein putative aspartyl protease 4.5E-07 26 117
PF13650 Asp_protease_2 Aspartyl protease 5.3E-06 25 115

Swissprot hits

[Show all]
Swissprot ID Swissprot Description Start End E-value
sp|P0CT39|TF26_SCHPO Transposon Tf2-6 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-6 PE=3 SV=1 30 1050 6.0E-142
sp|P0CT38|TF25_SCHPO Transposon Tf2-5 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-5 PE=3 SV=1 30 1050 6.0E-142
sp|P0CT36|TF23_SCHPO Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-3 PE=1 SV=1 30 1050 6.0E-142
sp|P0CT41|TF212_SCHPO Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-12 PE=3 SV=1 30 1050 6.0E-142
sp|P0CT40|TF29_SCHPO Transposon Tf2-9 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-9 PE=3 SV=1 30 1050 7.0E-142
[Show all]
[Show less]
Swissprot ID Swissprot Description Start End E-value
sp|P0CT39|TF26_SCHPO Transposon Tf2-6 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-6 PE=3 SV=1 30 1050 6.0E-142
sp|P0CT38|TF25_SCHPO Transposon Tf2-5 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-5 PE=3 SV=1 30 1050 6.0E-142
sp|P0CT36|TF23_SCHPO Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-3 PE=1 SV=1 30 1050 6.0E-142
sp|P0CT41|TF212_SCHPO Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-12 PE=3 SV=1 30 1050 6.0E-142
sp|P0CT40|TF29_SCHPO Transposon Tf2-9 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-9 PE=3 SV=1 30 1050 7.0E-142
sp|P0CT37|TF24_SCHPO Transposon Tf2-4 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-4 PE=3 SV=1 30 1050 7.0E-142
sp|P0CT35|TF22_SCHPO Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-2 PE=3 SV=1 30 1050 7.0E-142
sp|P0CT34|TF21_SCHPO Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-1 PE=3 SV=1 30 1050 7.0E-142
sp|P0CT43|TF28_SCHPO Transposon Tf2-8 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-8 PE=3 SV=1 30 1050 2.0E-141
sp|P0CT42|TF27_SCHPO Transposon Tf2-7 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-7 PE=3 SV=1 30 1050 2.0E-141
sp|Q9UR07|TF211_SCHPO Transposon Tf2-11 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-11 PE=3 SV=1 30 1050 4.0E-141
sp|Q7LHG5|YI31B_YEAST Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY3B-I PE=3 SV=2 218 1058 8.0E-133
sp|Q99315|YG31B_YEAST Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY3B-G PE=1 SV=3 218 1060 2.0E-132
sp|P20825|POL2_DROME Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaster GN=pol PE=3 SV=1 191 934 3.0E-85
sp|P04323|POL3_DROME Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogaster GN=pol PE=3 SV=1 191 631 2.0E-77
sp|Q8I7P9|POL5_DROME Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogaster GN=pol PE=3 SV=1 92 631 7.0E-67
sp|P10401|POLY_DROME Retrovirus-related Pol polyprotein from transposon gypsy OS=Drosophila melanogaster GN=pol PE=3 SV=1 191 630 3.0E-59
sp|P10394|POL4_DROME Retrovirus-related Pol polyprotein from transposon 412 OS=Drosophila melanogaster GN=POL PE=3 SV=1 213 631 1.0E-54
sp|P03555|POL_CAMVC Enzymatic polyprotein OS=Cauliflower mosaic virus (strain CM-1841) GN=ORF V PE=3 SV=1 17 630 1.0E-42
sp|Q00962|POL_CAMVN Enzymatic polyprotein OS=Cauliflower mosaic virus (strain NY8153) GN=ORF V PE=3 SV=1 17 630 4.0E-42
sp|P03554|POL_CAMVS Enzymatic polyprotein OS=Cauliflower mosaic virus (strain Strasbourg) GN=ORF V PE=3 SV=1 17 630 1.0E-41
sp|Q02964|POL_CAMVE Enzymatic polyprotein OS=Cauliflower mosaic virus (strain BBC) GN=ORF V PE=3 SV=1 17 630 2.0E-41
sp|P03556|POL_CAMVD Enzymatic polyprotein OS=Cauliflower mosaic virus (strain D/H) GN=ORF V PE=3 SV=1 17 630 6.0E-40
sp|P09523|POL_FMVD Enzymatic polyprotein OS=Figwort mosaic virus (strain DxS) GN=ORF V PE=3 SV=1 213 630 1.0E-39
sp|P05400|POL_CERV Enzymatic polyprotein OS=Carnation etched ring virus GN=ORF V PE=3 SV=1 211 628 4.0E-39
sp|A6NKG5|RTL1_HUMAN Retrotransposon-like protein 1 OS=Homo sapiens GN=RTL1 PE=3 SV=3 30 625 2.0E-34
sp|P19199|POL_COYMV Polyprotein P3 OS=Commelina yellow mottle virus PE=3 SV=2 220 632 7.0E-32
sp|Q91DM0|POLG_PVCV1 Genome polyprotein OS=Petunia vein clearing virus (isolate Shepherd) PE=3 SV=1 185 630 8.0E-32
sp|Q6XKE6|POLG_PVCV2 Genome polyprotein OS=Petunia vein clearing virus (isolate Hohn) PE=3 SV=1 185 630 1.0E-31
sp|Q89703|POL_CSVMV Putative enzymatic polyprotein OS=Cassava vein mosaic virus GN=ORF 3 PE=3 SV=1 222 636 7.0E-31
sp|Q7M732|RTL1_MOUSE Retrotransposon-like protein 1 OS=Mus musculus GN=Rtl1 PE=2 SV=1 30 625 5.0E-30
sp|P10394|POL4_DROME Retrovirus-related Pol polyprotein from transposon 412 OS=Drosophila melanogaster GN=POL PE=3 SV=1 651 1024 2.0E-29
sp|Q52QI2|RTL1_BOVIN Retrotransposon-like protein 1 OS=Bos taurus GN=RTL1 PE=2 SV=2 30 624 8.0E-29
sp|Q09575|YRD6_CAEEL Uncharacterized protein K02A2.6 OS=Caenorhabditis elegans GN=K02A2.6 PE=3 SV=1 704 1028 3.0E-27
sp|P23074|POL_SFV1 Pro-Pol polyprotein OS=Simian foamy virus type 1 GN=pol PE=1 SV=3 702 964 1.0E-24
sp|Q87040|POL_SFVCP Pro-Pol polyprotein OS=Simian foamy virus (isolate chimpanzee) GN=pol PE=3 SV=1 696 957 3.0E-24
sp|P27401|POL_SFV3L Pro-Pol polyprotein OS=Simian foamy virus type 3 (strain LK3) GN=pol PE=3 SV=2 702 957 5.0E-24
sp|O92815|POL_WDSV Gag-Pol polyprotein OS=Walleye dermal sarcoma virus GN=gag-pol PE=1 SV=2 246 584 2.0E-23
sp|P27502|POL_RTBVP Polyprotein P3 OS=Rice tungro bacilliform virus (isolate Philippines) PE=1 SV=1 33 630 2.0E-23
sp|P14350|POL_FOAMV Pro-Pol polyprotein OS=Human spumaretrovirus GN=pol PE=1 SV=2 696 957 8.0E-23
sp|P21414|POL_GALV Pol polyprotein OS=Gibbon ape leukemia virus GN=pol PE=3 SV=1 208 664 1.0E-22
sp|Q7TD08|POL_CYLCV Enzymatic polyprotein OS=Cestrum yellow leaf curling virus GN=ORF V PE=3 SV=1 241 630 1.0E-22
sp|A1Z651|POL_XMRV6 Gag-Pol polyprotein OS=Xenotropic MuLV-related virus (isolate VP62) GN=gag-pol PE=1 SV=1 211 560 2.0E-22
sp|Q2F7J3|POL_XMRV3 Gag-Pol polyprotein OS=Xenotropic MuLV-related virus (isolate VP35) GN=gag-pol PE=1 SV=1 211 560 2.0E-22
sp|Q2F7J0|POL_XMRV4 Gag-Pol polyprotein OS=Xenotropic MuLV-related virus (isolate VP42) GN=gag-pol PE=3 SV=1 211 560 3.0E-22
sp|P10272|POL_BAEVM Pol polyprotein OS=Baboon endogenous virus (strain M7) GN=pol PE=3 SV=1 213 543 3.0E-22
sp|P03355|POL_MLVMS Gag-Pol polyprotein OS=Moloney murine leukemia virus (isolate Shinnick) GN=gag-pol PE=1 SV=4 211 560 5.0E-22
sp|P26808|POL_MLVFP Pol polyprotein OS=Friend murine leukemia virus (isolate PVC-211) GN=pol PE=3 SV=1 209 560 6.0E-22
sp|P26809|POL_MLVFF Pol polyprotein OS=Friend murine leukemia virus (isolate FB29) GN=pol PE=3 SV=1 211 560 7.0E-22
sp|P03356|POL_MLVAV Pol polyprotein OS=AKV murine leukemia virus GN=pol PE=3 SV=2 211 611 8.0E-22
sp|P11227|POL_MLVRD Pol polyprotein OS=Radiation murine leukemia virus GN=pol PE=3 SV=1 211 560 1.0E-21
sp|P31792|POL_FENV1 Pol polyprotein (Fragment) OS=Feline endogenous virus ECE1 GN=pol PE=3 SV=1 213 543 1.0E-21
sp|P26810|POL_MLVF5 Pol polyprotein OS=Friend murine leukemia virus (isolate 57) GN=pol PE=3 SV=1 211 560 2.0E-21
sp|O93209|POL_FFV Pro-Pol polyprotein OS=Feline foamy virus GN=pol PE=3 SV=1 702 955 2.0E-21
sp|P14350|POL_FOAMV Pro-Pol polyprotein OS=Human spumaretrovirus GN=pol PE=1 SV=2 29 564 4.0E-20
sp|Q9TTC1|POL_KORV Pro-Pol polyprotein OS=Koala retrovirus GN=pro-pol PE=3 SV=1 211 565 7.0E-20
sp|O93209|POL_FFV Pro-Pol polyprotein OS=Feline foamy virus GN=pol PE=3 SV=1 199 564 7.0E-19
sp|A4FUB7|GIN1_BOVIN Gypsy retrotransposon integrase-like protein 1 OS=Bos taurus GN=GIN1 PE=2 SV=1 711 983 8.0E-19
sp|P15629|POL_SOCMV Enzymatic polyprotein OS=Soybean chlorotic mottle virus GN=ORF V PE=3 SV=2 201 525 1.0E-18
sp|A1Z651|POL_XMRV6 Gag-Pol polyprotein OS=Xenotropic MuLV-related virus (isolate VP62) GN=gag-pol PE=1 SV=1 799 1019 2.0E-18
sp|Q2F7J0|POL_XMRV4 Gag-Pol polyprotein OS=Xenotropic MuLV-related virus (isolate VP42) GN=gag-pol PE=3 SV=1 799 1019 2.0E-18
sp|Q2F7J3|POL_XMRV3 Gag-Pol polyprotein OS=Xenotropic MuLV-related virus (isolate VP35) GN=gag-pol PE=1 SV=1 799 1019 3.0E-18
sp|P31795|POL_MLVRK Pol polyprotein (Fragment) OS=Radiation murine leukemia virus (strain Kaplan) GN=pol PE=3 SV=1 775 1019 3.0E-18
sp|P31843|RRPO_OENBE RNA-directed DNA polymerase homolog OS=Oenothera berteroana PE=4 SV=1 261 357 3.0E-18
sp|Q09575|YRD6_CAEEL Uncharacterized protein K02A2.6 OS=Caenorhabditis elegans GN=K02A2.6 PE=3 SV=1 248 459 4.0E-18
sp|P11227|POL_MLVRD Pol polyprotein OS=Radiation murine leukemia virus GN=pol PE=3 SV=1 799 1019 5.0E-18
sp|P03356|POL_MLVAV Pol polyprotein OS=AKV murine leukemia virus GN=pol PE=3 SV=2 775 1019 6.0E-18
sp|Q87040|POL_SFVCP Pro-Pol polyprotein OS=Simian foamy virus (isolate chimpanzee) GN=pol PE=3 SV=1 29 564 8.0E-18
sp|P26808|POL_MLVFP Pol polyprotein OS=Friend murine leukemia virus (isolate PVC-211) GN=pol PE=3 SV=1 775 1019 1.0E-17
sp|P26810|POL_MLVF5 Pol polyprotein OS=Friend murine leukemia virus (isolate 57) GN=pol PE=3 SV=1 775 1019 1.0E-17
sp|P03355|POL_MLVMS Gag-Pol polyprotein OS=Moloney murine leukemia virus (isolate Shinnick) GN=gag-pol PE=1 SV=4 799 1019 2.0E-17
sp|P26809|POL_MLVFF Pol polyprotein OS=Friend murine leukemia virus (isolate FB29) GN=pol PE=3 SV=1 775 1019 2.0E-17
sp|P08361|POL_MLVCB Gag-Pol polyprotein (Fragment) OS=Cas-Br-E murine leukemia virus GN=gag-pol PE=3 SV=1 784 1019 2.0E-17
sp|Q9NXP7|GIN1_HUMAN Gypsy retrotransposon integrase-like protein 1 OS=Homo sapiens GN=GIN1 PE=2 SV=3 688 979 2.0E-16
sp|Q8K259|GIN1_MOUSE Gypsy retrotransposon integrase-like protein 1 OS=Mus musculus GN=Gin1 PE=2 SV=2 688 970 2.0E-16
sp|P23074|POL_SFV1 Pro-Pol polyprotein OS=Simian foamy virus type 1 GN=pol PE=1 SV=3 19 564 5.0E-16
sp|P27401|POL_SFV3L Pro-Pol polyprotein OS=Simian foamy virus type 3 (strain LK3) GN=pol PE=3 SV=2 19 564 5.0E-16
sp|P92523|M860_ARATH Uncharacterized mitochondrial protein AtMg00860 OS=Arabidopsis thaliana GN=AtMg00860 PE=4 SV=1 387 529 5.0E-16
sp|Q5RBK0|GIN1_PONAB Gypsy retrotransposon integrase-like protein 1 OS=Pongo abelii GN=GIN1 PE=2 SV=1 688 979 5.0E-16
sp|Q4R6I1|GIN1_MACFA Gypsy retrotransposon integrase-like protein 1 OS=Macaca fascicularis GN=GIN1 PE=2 SV=1 688 979 1.0E-15
sp|P10272|POL_BAEVM Pol polyprotein OS=Baboon endogenous virus (strain M7) GN=pol PE=3 SV=1 675 1019 3.0E-15
sp|Q9TTC1|POL_KORV Pro-Pol polyprotein OS=Koala retrovirus GN=pro-pol PE=3 SV=1 707 1019 5.0E-15
sp|Q5DTZ0|NYNRI_MOUSE Protein NYNRIN OS=Mus musculus GN=Nynrin PE=2 SV=2 704 1046 1.0E-14
sp|Q66H30|GIN1_RAT Gypsy retrotransposon integrase-like protein 1 OS=Rattus norvegicus GN=GIN1 PE=2 SV=1 688 934 2.0E-14
sp|P31792|POL_FENV1 Pol polyprotein (Fragment) OS=Feline endogenous virus ECE1 GN=pol PE=3 SV=1 733 1019 3.0E-13
sp|P21414|POL_GALV Pol polyprotein OS=Gibbon ape leukemia virus GN=pol PE=3 SV=1 707 934 4.0E-13
sp|P03360|POL_AVIRE Pol polyprotein (Fragment) OS=Avian reticuloendotheliosis virus GN=pol PE=3 SV=1 720 934 9.0E-12
sp|Q9P2P1|NYNRI_HUMAN Protein NYNRIN OS=Homo sapiens GN=NYNRIN PE=2 SV=3 418 543 3.0E-11
sp|Q9P2P1|NYNRI_HUMAN Protein NYNRIN OS=Homo sapiens GN=NYNRIN PE=2 SV=3 705 926 2.0E-10
sp|Q5DTZ0|NYNRI_MOUSE Protein NYNRIN OS=Mus musculus GN=Nynrin PE=2 SV=2 418 543 4.0E-09
sp|Q86TG7|PEG10_HUMAN Retrotransposon-derived protein PEG10 OS=Homo sapiens GN=PEG10 PE=1 SV=2 8 307 4.0E-09
sp|P10401|POLY_DROME Retrovirus-related Pol polyprotein from transposon gypsy OS=Drosophila melanogaster GN=pol PE=3 SV=1 710 926 6.0E-09
sp|Q8I7P9|POL5_DROME Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogaster GN=pol PE=3 SV=1 721 1006 7.0E-09
sp|P03359|POL_WMSV Pol polyprotein (Fragment) OS=Woolly monkey sarcoma virus GN=pol PE=3 SV=1 776 934 2.0E-08
sp|P07572|POL_MPMV Pol polyprotein OS=Mason-Pfizer monkey virus GN=pol PE=3 SV=1 222 500 6.0E-08
sp|P04025|POL_SRV1 Pol polyprotein OS=Simian retrovirus SRV-1 GN=pol PE=3 SV=1 222 467 6.0E-08
sp|P31623|POL_JSRV Pol polyprotein OS=Sheep pulmonary adenomatosis virus GN=pol PE=3 SV=1 222 533 8.0E-08
sp|P03365|POL_MMTVB Pol polyprotein OS=Mouse mammary tumor virus (strain BR6) GN=pol PE=3 SV=2 222 470 1.0E-07
sp|P11283|POL_MMTVC Gag-Pro-Pol polyprotein OS=Mouse mammary tumor virus (strain C3H) GN=gag-pro-pol PE=1 SV=2 222 470 1.0E-07
sp|O92815|POL_WDSV Gag-Pol polyprotein OS=Walleye dermal sarcoma virus GN=gag-pol PE=1 SV=2 773 919 2.0E-07
sp|P51517|POL_SRV2 Pol polyprotein OS=Simian retrovirus SRV-2 GN=pol PE=3 SV=1 222 470 9.0E-07
sp|Q4U0X6|POL_HTL3P Gag-Pro-Pol polyprotein OS=Human T-cell leukemia virus 3 (strain Pyl43) GN=gag-pro-pol PE=3 SV=4 210 533 1.0E-06
sp|P04323|POL3_DROME Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogaster GN=pol PE=3 SV=1 705 931 2.0E-06
[Show less]

GO

GO Term Description Terminal node
GO:0015074 DNA integration Yes
GO:0044260 cellular macromolecule metabolic process No
GO:0006139 nucleobase-containing compound metabolic process No
GO:0009987 cellular process No
GO:0034641 cellular nitrogen compound metabolic process No
GO:0008152 metabolic process No
GO:0008150 biological_process No
GO:0044238 primary metabolic process No
GO:0071704 organic substance metabolic process No
GO:0046483 heterocycle metabolic process No
GO:0043170 macromolecule metabolic process No
GO:1901360 organic cyclic compound metabolic process No
GO:0090304 nucleic acid metabolic process No
GO:0006725 cellular aromatic compound metabolic process No
GO:0044237 cellular metabolic process No
GO:0006807 nitrogen compound metabolic process No
GO:0006259 DNA metabolic process No

SignalP

[Help with interpreting these statistics]
SignalP signal predicted Location
(based on Ymax)
D score
(significance: > 0.45)
No 1 - 66 0.45

Transmembrane Domains

(None)

Transcription Factor Class

(None)

Expression data

No expression data available for this genome

Sequences

Type of sequenceSequence
Locus Download genbank file of locus
The gene with 5 kb flanks (if sufficient flanking sequence is available). For use in cloning design programs. NOTE: features (genes or exons) that are only partially contained within the sequence are completely excluded.
Protein >Agabi119p4|758450
MNNRYVVPQKRLGVQELEVKPLLTTTNGKKLKLSAMVDSGCTHTCIDEGLVKKKKIPTKNLERPITCRNSDGTIA
GKKDITKFVKMDLNINGHNEQLDAVVTPLQSSDLFLGHDWLTNHNPEIDWKQGIIKFNRCPTSCSFPHTDISFEP
RIRRLQSNEDTEEKEPDPTNPEDLPAYMKPFAHLFNKKNFDKLPERTEWDHEINFTENAPTEISSKVYSMTPLER
EELDKFLDENLTTNRIRPSKSPYAAPCFFVPKKDGSLRLCQDYRKLNDITIKDKTPLPLISEVLDQLKDARVFNK
LDIIWGYNNVRIREGDEWKAAFLTNRGLFEPTVMFFGMSNSPATFSRMMATIFREMIQDGSLANYMDDFIIPAKD
DEELEARTIRFLKIAEKHNLSFKRTKCEFNVSSTTVLGTVIGNGKATMEEEKVKAIRDWAVPTTVKQVESFLGFA
NFYRRFIKNFSTIAQPLNELKSKKGEKWYWNDEQQQAFEQIKQAIASEPVLALPKDKGQFRVEVDASNYGTGAVL
SQEQENKWHPVAFMSKTLSEAERNYEIYDKELLAIIKALKLWRHYLLDAKEQFEIWTDHENLKYFREPQKLNARQ
ARWYLMLQEYDFLLRHIPGKTNTKADILSRLIKPDTSNDNRGVEMFKEKMFIRRLEESTPIYDVTLLHNRRFEIS
ADETVLEKIRKCERRETRVLEEMKKQPEKEIRDFILHDHHNSPDAGHPGTYRMLESVKRTFWWPTIKTDIRRYVR
GCDMCQKNKTIRRPDHIPLNPLPIPDKPWEEISIDMIGPLPKSKEKDAIIVIVDRFSKMIHLVPTTTSLMSMDLA
EIYKEEVWRHHGIPKRIISDRGPQFASKFMESLCKALGIERNLSTAYHPQTDGQTERMNQEIETYLRAFINYRQD
DWTRWLPMAEFHYNDKTHAATGQTPFFLNYGLHPWKGNITVETTNPTATSLIEDLESVREEAKSAMEANNEMMRE
RGNNKHHKEPFAEGDKVWLETTNIHSNRPTRKLDHKRYGPFEVLKQIGDRSYKLKLPVTWAIHDVFHTSLLTKVR
DPEFDSQKKPTPPPPDIINEEEEYEVEEIRGHRRKGRGIQFLVHWKGYGNEDDTWLPRSALTNSADILKDIVYWH
LKKIIMSNQQTTTTQTADFGGCLQRLLTIHTDITDLLQFQSNVQLPLFINPPSHIDISDEEKSQLEAQLETEKRL
LKLIENLLRMAKANNKEAKEALSPGSTTWTTNNDGWE*
Coding >Agabi119p4|758450
ATGAACAATCGATATGTTGTCCCTCAAAAACGTCTAGGAGTGCAAGAACTGGAAGTAAAACCCCTCCTCACTACA
ACAAACGGAAAGAAACTCAAACTCTCAGCCATGGTCGATTCGGGATGCACACACACATGTATCGACGAAGGATTA
GTAAAGAAGAAGAAGATCCCGACGAAGAATTTGGAACGGCCGATCACATGCAGGAATTCAGATGGAACGATAGCA
GGAAAGAAGGATATCACCAAATTCGTGAAAATGGATCTCAACATCAACGGTCATAACGAACAACTGGACGCGGTC
GTCACTCCCTTACAATCATCCGACCTCTTTCTAGGCCATGATTGGTTAACGAACCACAACCCCGAGATTGATTGG
AAACAAGGTATAATCAAATTCAACCGGTGCCCCACATCATGTTCCTTTCCCCATACCGACATCTCTTTCGAACCA
CGTATACGACGATTACAGTCTAACGAAGACACTGAGGAGAAAGAACCGGATCCAACGAACCCAGAGGACTTACCA
GCATACATGAAACCCTTCGCCCATCTTTTCAACAAAAAGAATTTCGACAAACTTCCTGAACGGACAGAATGGGAT
CACGAAATCAACTTCACGGAAAACGCACCTACAGAAATATCATCAAAGGTTTATAGCATGACGCCATTGGAGAGA
GAAGAACTGGACAAATTTTTGGATGAAAACCTGACCACCAATCGAATCCGACCATCAAAATCACCCTATGCAGCA
CCATGTTTTTTCGTTCCCAAGAAAGACGGTTCACTACGGTTATGTCAGGATTATAGGAAACTGAACGACATCACG
ATCAAGGACAAAACACCATTACCCCTCATCAGCGAAGTATTAGATCAACTCAAAGACGCCAGGGTTTTCAACAAA
TTGGATATTATCTGGGGATATAACAACGTTCGGATACGAGAAGGAGATGAATGGAAAGCAGCGTTCTTGACGAAT
CGAGGATTATTTGAACCAACGGTCATGTTCTTCGGAATGAGTAATTCCCCTGCCACCTTCTCACGTATGATGGCG
ACAATCTTTCGAGAAATGATACAAGACGGATCCCTAGCCAACTACATGGACGATTTTATCATTCCAGCAAAAGAC
GATGAAGAATTAGAAGCACGAACTATACGTTTCCTCAAAATCGCGGAGAAACACAATCTATCATTCAAACGGACG
AAATGCGAGTTCAATGTCTCATCAACAACGGTGTTGGGAACAGTTATTGGGAATGGAAAGGCAACAATGGAAGAA
GAAAAGGTCAAGGCAATACGAGATTGGGCAGTCCCTACCACAGTCAAACAAGTCGAGAGCTTTTTAGGCTTTGCG
AATTTCTATCGACGTTTTATCAAGAATTTCAGCACCATCGCACAACCACTGAACGAGTTGAAGTCAAAAAAGGGA
GAGAAATGGTATTGGAACGATGAACAACAACAAGCTTTCGAACAAATCAAACAAGCTATTGCCAGTGAACCAGTC
CTAGCACTACCAAAAGACAAAGGACAATTCAGAGTCGAAGTAGACGCATCGAACTATGGAACAGGAGCAGTACTA
TCACAGGAACAGGAAAACAAATGGCACCCGGTCGCTTTCATGTCGAAAACATTATCAGAAGCCGAAAGAAACTAC
GAAATCTACGACAAGGAACTACTAGCCATCATAAAAGCTTTGAAATTATGGCGACACTACCTATTGGATGCAAAG
GAGCAGTTTGAGATATGGACAGATCACGAGAACCTCAAGTATTTCCGAGAACCTCAAAAGCTCAACGCTCGACAA
GCGAGATGGTACCTCATGCTACAAGAATACGACTTCCTTCTACGACACATTCCTGGGAAGACTAACACCAAAGCA
GACATCCTGTCAAGACTAATTAAACCTGACACATCTAACGACAACCGAGGAGTAGAAATGTTCAAAGAGAAGATG
TTTATCCGAAGGCTTGAAGAATCTACCCCCATCTATGACGTCACCTTACTCCACAATCGAAGATTCGAAATTTCA
GCCGATGAAACCGTACTTGAGAAGATTAGGAAATGTGAAAGACGAGAAACCAGAGTATTAGAAGAGATGAAGAAG
CAACCAGAGAAAGAAATCAGAGATTTCATCCTTCACGATCATCATAATTCCCCCGACGCCGGACATCCTGGAACA
TACCGAATGCTAGAATCAGTTAAACGAACCTTTTGGTGGCCTACGATCAAAACGGATATCAGAAGATATGTCAGA
GGATGCGACATGTGCCAGAAGAACAAAACGATTCGACGACCCGATCACATTCCGCTTAATCCATTACCCATCCCC
GACAAACCTTGGGAAGAAATATCTATAGACATGATCGGACCACTACCAAAGTCAAAAGAGAAGGATGCTATTATT
GTTATCGTTGACAGATTTTCCAAAATGATCCACCTCGTTCCCACTACCACGTCACTCATGTCCATGGATCTTGCG
GAAATCTATAAGGAAGAAGTCTGGCGACATCACGGAATTCCGAAACGGATTATTAGCGACAGAGGACCACAATTC
GCATCAAAATTTATGGAATCACTATGCAAAGCGCTAGGCATTGAACGAAACCTTTCTACGGCCTATCACCCACAG
ACAGACGGTCAAACTGAGCGGATGAATCAGGAAATCGAAACCTATCTTCGAGCATTCATCAATTATCGACAAGAC
GATTGGACGAGATGGCTCCCCATGGCAGAATTCCATTACAATGACAAAACCCACGCTGCCACCGGACAAACCCCA
TTCTTCCTAAATTACGGACTTCACCCATGGAAGGGTAATATCACGGTTGAAACGACGAACCCCACCGCCACCTCC
TTGATCGAGGACTTGGAAAGTGTGCGAGAGGAAGCCAAATCTGCGATGGAAGCAAACAACGAGATGATGAGAGAA
AGAGGAAACAACAAGCACCACAAGGAACCCTTTGCCGAAGGAGATAAAGTTTGGTTAGAAACGACAAACATTCAT
TCCAATCGTCCGACTCGGAAACTAGACCACAAACGATATGGACCTTTTGAAGTCTTGAAACAAATCGGTGATCGA
TCTTACAAACTGAAGTTACCGGTTACCTGGGCGATACACGACGTCTTCCATACTTCACTCCTAACAAAAGTCCGA
GACCCCGAGTTTGACAGTCAGAAGAAACCCACCCCACCTCCACCTGACATTATCAACGAAGAAGAGGAATATGAA
GTTGAAGAAATCCGAGGACACCGACGAAAAGGCCGAGGAATACAATTTTTAGTTCACTGGAAAGGTTATGGAAAC
GAAGACGACACATGGCTACCACGCTCTGCTCTAACGAACTCCGCGGACATCCTAAAAGACATCGTCTATTGGCAT
CTGAAGAAGATCATCATGAGCAACCAACAAACCACAACCACCCAAACCGCCGATTTCGGAGGTTGTCTCCAACGA
CTCCTGACCATTCACACCGACATCACCGACCTTTTACAATTCCAATCCAACGTACAACTTCCTCTTTTCATAAAT
CCTCCATCGCATATCGACATATCAGACGAAGAAAAGAGTCAATTAGAGGCGCAACTCGAGACGGAGAAGAGATTA
TTGAAGCTGATAGAGAATCTGTTGAGAATGGCGAAAGCGAACAATAAAGAGGCAAAAGAAGCATTATCACCTGGG
TCTACAACGTGGACAACAAACAATGATGGATGGGAGTAA
Transcript >Agabi119p4|758450
ATGAACAATCGATATGTTGTCCCTCAAAAACGTCTAGGAGTGCAAGAACTGGAAGTAAAACCCCTCCTCACTACA
ACAAACGGAAAGAAACTCAAACTCTCAGCCATGGTCGATTCGGGATGCACACACACATGTATCGACGAAGGATTA
GTAAAGAAGAAGAAGATCCCGACGAAGAATTTGGAACGGCCGATCACATGCAGGAATTCAGATGGAACGATAGCA
GGAAAGAAGGATATCACCAAATTCGTGAAAATGGATCTCAACATCAACGGTCATAACGAACAACTGGACGCGGTC
GTCACTCCCTTACAATCATCCGACCTCTTTCTAGGCCATGATTGGTTAACGAACCACAACCCCGAGATTGATTGG
AAACAAGGTATAATCAAATTCAACCGGTGCCCCACATCATGTTCCTTTCCCCATACCGACATCTCTTTCGAACCA
CGTATACGACGATTACAGTCTAACGAAGACACTGAGGAGAAAGAACCGGATCCAACGAACCCAGAGGACTTACCA
GCATACATGAAACCCTTCGCCCATCTTTTCAACAAAAAGAATTTCGACAAACTTCCTGAACGGACAGAATGGGAT
CACGAAATCAACTTCACGGAAAACGCACCTACAGAAATATCATCAAAGGTTTATAGCATGACGCCATTGGAGAGA
GAAGAACTGGACAAATTTTTGGATGAAAACCTGACCACCAATCGAATCCGACCATCAAAATCACCCTATGCAGCA
CCATGTTTTTTCGTTCCCAAGAAAGACGGTTCACTACGGTTATGTCAGGATTATAGGAAACTGAACGACATCACG
ATCAAGGACAAAACACCATTACCCCTCATCAGCGAAGTATTAGATCAACTCAAAGACGCCAGGGTTTTCAACAAA
TTGGATATTATCTGGGGATATAACAACGTTCGGATACGAGAAGGAGATGAATGGAAAGCAGCGTTCTTGACGAAT
CGAGGATTATTTGAACCAACGGTCATGTTCTTCGGAATGAGTAATTCCCCTGCCACCTTCTCACGTATGATGGCG
ACAATCTTTCGAGAAATGATACAAGACGGATCCCTAGCCAACTACATGGACGATTTTATCATTCCAGCAAAAGAC
GATGAAGAATTAGAAGCACGAACTATACGTTTCCTCAAAATCGCGGAGAAACACAATCTATCATTCAAACGGACG
AAATGCGAGTTCAATGTCTCATCAACAACGGTGTTGGGAACAGTTATTGGGAATGGAAAGGCAACAATGGAAGAA
GAAAAGGTCAAGGCAATACGAGATTGGGCAGTCCCTACCACAGTCAAACAAGTCGAGAGCTTTTTAGGCTTTGCG
AATTTCTATCGACGTTTTATCAAGAATTTCAGCACCATCGCACAACCACTGAACGAGTTGAAGTCAAAAAAGGGA
GAGAAATGGTATTGGAACGATGAACAACAACAAGCTTTCGAACAAATCAAACAAGCTATTGCCAGTGAACCAGTC
CTAGCACTACCAAAAGACAAAGGACAATTCAGAGTCGAAGTAGACGCATCGAACTATGGAACAGGAGCAGTACTA
TCACAGGAACAGGAAAACAAATGGCACCCGGTCGCTTTCATGTCGAAAACATTATCAGAAGCCGAAAGAAACTAC
GAAATCTACGACAAGGAACTACTAGCCATCATAAAAGCTTTGAAATTATGGCGACACTACCTATTGGATGCAAAG
GAGCAGTTTGAGATATGGACAGATCACGAGAACCTCAAGTATTTCCGAGAACCTCAAAAGCTCAACGCTCGACAA
GCGAGATGGTACCTCATGCTACAAGAATACGACTTCCTTCTACGACACATTCCTGGGAAGACTAACACCAAAGCA
GACATCCTGTCAAGACTAATTAAACCTGACACATCTAACGACAACCGAGGAGTAGAAATGTTCAAAGAGAAGATG
TTTATCCGAAGGCTTGAAGAATCTACCCCCATCTATGACGTCACCTTACTCCACAATCGAAGATTCGAAATTTCA
GCCGATGAAACCGTACTTGAGAAGATTAGGAAATGTGAAAGACGAGAAACCAGAGTATTAGAAGAGATGAAGAAG
CAACCAGAGAAAGAAATCAGAGATTTCATCCTTCACGATCATCATAATTCCCCCGACGCCGGACATCCTGGAACA
TACCGAATGCTAGAATCAGTTAAACGAACCTTTTGGTGGCCTACGATCAAAACGGATATCAGAAGATATGTCAGA
GGATGCGACATGTGCCAGAAGAACAAAACGATTCGACGACCCGATCACATTCCGCTTAATCCATTACCCATCCCC
GACAAACCTTGGGAAGAAATATCTATAGACATGATCGGACCACTACCAAAGTCAAAAGAGAAGGATGCTATTATT
GTTATCGTTGACAGATTTTCCAAAATGATCCACCTCGTTCCCACTACCACGTCACTCATGTCCATGGATCTTGCG
GAAATCTATAAGGAAGAAGTCTGGCGACATCACGGAATTCCGAAACGGATTATTAGCGACAGAGGACCACAATTC
GCATCAAAATTTATGGAATCACTATGCAAAGCGCTAGGCATTGAACGAAACCTTTCTACGGCCTATCACCCACAG
ACAGACGGTCAAACTGAGCGGATGAATCAGGAAATCGAAACCTATCTTCGAGCATTCATCAATTATCGACAAGAC
GATTGGACGAGATGGCTCCCCATGGCAGAATTCCATTACAATGACAAAACCCACGCTGCCACCGGACAAACCCCA
TTCTTCCTAAATTACGGACTTCACCCATGGAAGGGTAATATCACGGTTGAAACGACGAACCCCACCGCCACCTCC
TTGATCGAGGACTTGGAAAGTGTGCGAGAGGAAGCCAAATCTGCGATGGAAGCAAACAACGAGATGATGAGAGAA
AGAGGAAACAACAAGCACCACAAGGAACCCTTTGCCGAAGGAGATAAAGTTTGGTTAGAAACGACAAACATTCAT
TCCAATCGTCCGACTCGGAAACTAGACCACAAACGATATGGACCTTTTGAAGTCTTGAAACAAATCGGTGATCGA
TCTTACAAACTGAAGTTACCGGTTACCTGGGCGATACACGACGTCTTCCATACTTCACTCCTAACAAAAGTCCGA
GACCCCGAGTTTGACAGTCAGAAGAAACCCACCCCACCTCCACCTGACATTATCAACGAAGAAGAGGAATATGAA
GTTGAAGAAATCCGAGGACACCGACGAAAAGGCCGAGGAATACAATTTTTAGTTCACTGGAAAGGTTATGGAAAC
GAAGACGACACATGGCTACCACGCTCTGCTCTAACGAACTCCGCGGACATCCTAAAAGACATCGTCTATTGGCAT
CTGAAGAAGATCATCATGAGCAACCAACAAACCACAACCACCCAAACCGCCGATTTCGGAGGTTGTCTCCAACGA
CTCCTGACCATTCACACCGACATCACCGACCTTTTACAATTCCAATCCAACGTACAACTTCCTCTTTTCATAAAT
CCTCCATCGCATATCGACATATCAGACGAAGAAAAGAGTCAATTAGAGGCGCAACTCGAGACGGAGAAGAGATTA
TTGAAGCTGATAGAGAATCTGTTGAGAATGGCGAAAGCGAACAATAAAGAGGCAAAAGAAGCATTATCACCTGGG
TCTACAACGTGGACAACAAACAATGATGGATGGGAGTAA
Gene >Agabi119p4|758450
ATGAACAATCGATATGTTGTCCCTCAAAAACGTCTAGGAGTGCAAGAACTGGAAGTAAAACCCCTCCTCACTACA
ACAAACGGAAAGAAACTCAAACTCTCAGCCATGGTCGATTCGGGATGCACACACACATGTATCGACGAAGGATTA
GTAAAGAAGAAGAAGATCCCGACGAAGAATTTGGAACGGCCGATCACATGCAGGAATTCAGATGGAACGATAGCA
GGAAAGAAGGATATCACCAAATTCGTGAAAATGGATCTCAACATCAACGGTCATAACGAACAACTGGACGCGGTC
GTCACTCCCTTACAATCATCCGACCTCTTTCTAGGCCATGATTGGTTAACGAACCACAACCCCGAGATTGATTGG
AAACAAGGTATAATCAAATTCAACCGGTGCCCCACATCATGTTCCTTTCCCCATACCGACATCTCTTTCGAACCA
CGTATACGACGATTACAGTCTAACGAAGACACTGAGGAGAAAGAACCGGATCCAACGAACCCAGAGGACTTACCA
GCATACATGAAACCCTTCGCCCATCTTTTCAACAAAAAGAATTTCGACAAACTTCCTGAACGGACAGAATGGGAT
CACGAAATCAACTTCACGGAAAACGCACCTACAGAAATATCATCAAAGGTTTATAGCATGACGCCATTGGAGAGA
GAAGAACTGGACAAATTTTTGGATGAAAACCTGACCACCAATCGAATCCGACCATCAAAATCACCCTATGCAGCA
CCATGTTTTTTCGTTCCCAAGAAAGACGGTTCACTACGGTTATGTCAGGATTATAGGAAACTGAACGACATCACG
ATCAAGGACAAAACACCATTACCCCTCATCAGCGAAGTATTAGATCAACTCAAAGACGCCAGGGTTTTCAACAAA
TTGGATATTATCTGGGGATATAACAACGTTCGGATACGAGAAGGAGATGAATGGAAAGCAGCGTTCTTGACGAAT
CGAGGATTATTTGAACCAACGGTCATGTTCTTCGGAATGAGTAATTCCCCTGCCACCTTCTCACGTATGATGGCG
ACAATCTTTCGAGAAATGATACAAGACGGATCCCTAGCCAACTACATGGACGATTTTATCATTCCAGCAAAAGAC
GATGAAGAATTAGAAGCACGAACTATACGTTTCCTCAAAATCGCGGAGAAACACAATCTATCATTCAAACGGACG
AAATGCGAGTTCAATGTCTCATCAACAACGGTGTTGGGAACAGTTATTGGGAATGGAAAGGCAACAATGGAAGAA
GAAAAGGTCAAGGCAATACGAGATTGGGCAGTCCCTACCACAGTCAAACAAGTCGAGAGCTTTTTAGGCTTTGCG
AATTTCTATCGACGTTTTATCAAGAATTTCAGCACCATCGCACAACCACTGAACGAGTTGAAGTCAAAAAAGGGA
GAGAAATGGTATTGGAACGATGAACAACAACAAGCTTTCGAACAAATCAAACAAGCTATTGCCAGTGAACCAGTC
CTAGCACTACCAAAAGACAAAGGACAATTCAGAGTCGAAGTAGACGCATCGAACTATGGAACAGGAGCAGTACTA
TCACAGGAACAGGAAAACAAATGGCACCCGGTCGCTTTCATGTCGAAAACATTATCAGAAGCCGAAAGAAACTAC
GAAATCTACGACAAGGAACTACTAGCCATCATAAAAGCTTTGAAATTATGGCGACACTACCTATTGGATGCAAAG
GAGCAGTTTGAGATATGGACAGATCACGAGAACCTCAAGTATTTCCGAGAACCTCAAAAGCTCAACGCTCGACAA
GCGAGATGGTACCTCATGCTACAAGAATACGACTTCCTTCTACGACACATTCCTGGGAAGACTAACACCAAAGCA
GACATCCTGTCAAGACTAATTAAACCTGACACATCTAACGACAACCGAGGAGTAGAAATGTTCAAAGAGAAGATG
TTTATCCGAAGGCTTGAAGAATCTACCCCCATCTATGACGTCACCTTACTCCACAATCGAAGATTCGAAATTTCA
GCCGATGAAACCGTACTTGAGAAGATTAGGAAATGTGAAAGACGAGAAACCAGAGTATTAGAAGAGATGAAGAAG
CAACCAGAGAAAGTATGAGAGAACAAAGGAATCATTTACCGACAAGGAAGGATCTATGTTCCGGATAACCAGGAA
ATCAGAGATTTCATCCTTCACGATCATCATAATTCCCCCGACGCCGGACATCCTGGAACATACCGAATGCTAGAA
TCAGTTAAACGAACCTTTTGGTGGCCTACGATCAAAACGGATATCAGAAGATATGTCAGAGGATGCGACATGTGC
CAGAAGAACAAAACGATTCGACGACCCGATCACATTCCGCTTAATCCATTACCCATCCCCGACAAACCTTGGGAA
GAAATATCTATAGACATGATCGGACCACTACCAAAGTCAAAAGAGAAGGATGCTATTATTGTTATCGTTGACAGA
TTTTCCAAAATGATCCACCTCGTTCCCACTACCACGTCACTCATGTCCATGGATCTTGCGGAAATCTATAAGGAA
GAAGTCTGGCGACATCACGGAATTCCGAAACGGATTATTAGCGACAGAGGACCACAATTCGCATCAAAATTTATG
GAATCACTATGCAAAGCGCTAGGCATTGAACGAAACCTTTCTACGGCCTATCACCCACAGACAGACGGTCAAACT
GAGCGGATGAATCAGGAAATCGAAACCTATCTTCGAGCATTCATCAATTATCGACAAGACGATTGGACGAGATGG
CTCCCCATGGCAGAATTCCATTACAATGACAAAACCCACGCTGCCACCGGACAAACCCCATTCTTCCTAAATTAC
GGACTTCACCCATGGAAGGGTAATATCACGGTTGAAACGACGAACCCCACCGCCACCTCCTTGATCGAGGACTTG
GAAAGTGTGCGAGAGGAAGCCAAATCTGCGATGGAAGCAAACAACGAGATGATGAGAGAAAGAGGAAACAACAAG
CACCACAAGGAACCCTTTGCCGAAGGAGATAAAGTTTGGTTAGAAACGACAAACATTCATTCCAATCGTCCGACT
CGGAAACTAGACCACAAACGATATGGACCTTTTGAAGTCTTGAAACAAATCGGTGATCGATCTTACAAACTGAAG
TTACCGGTTACCTGGGCGATACACGACGTCTTCCATACTTCACTCCTAACAAAAGTCCGAGACCCCGAGTTTGAC
AGTCAGAAGAAACCCACCCCACCTCCACCTGACATTATCAACGAAGAAGAGGAATATGAAGTTGAAGAAATCCGA
GGACACCGACGAAAAGGCCGAGGAATACAATTTTTAGTTCACTGGAAAGGTTATGGAAACGAAGACGACACATGG
CTACCACGCTCTGCTCTAACGAACTCCGCGGACATCCTAAAAGAGTATCATGGAAGAAATCCTTTCCTATAAAAG
GATATCTGAAACTCTCAACTGGTATAGCATCGTCTATTGGCATCTGAAGAAGATCATCATGAGCAACCAACAAAC
CACAACCACCCAAACCGCCGATTTCGGAGGTTGTCTCCAACGACTCCTGACCATTCACACCGACATCACCGACCT
TTTACAATTCCAATCCAACGTACAACTTCCTCTTTTCATAAATCCTCCATCGCATATCGACATATCAGACGAAGA
AAAGAGTCAATTAGAGGCGCAACTCGAGACGGAGAAGAGATTATTGAAGCTGATAGAGAATCTGTTGAGAATGGC
GAAAGCGAACAATAAAGAGGCAAAAGAAGCATTATCACCTGGGTCTACAACGTGGACAACAAACAATGATGGATG
GGAGTAA

© 2022 - Robin Ohm - Utrecht University - The Netherlands

Built with Python Django and Wagtail