Fungal Genomics

at Utrecht University

General Properties

Protein IDAgabi119p4|694900
Gene name
Locationscaffold_08:455044..456682
Strand+
Gene length (bp)1638
Transcript length (bp)1638
Coding sequence length (bp)1638
Protein length (aa) 546

Your browser does not support drawing a protein figure.

PFAM Domains

PFAM Domain ID Short name Long name E-value Start End
PF17921 Integrase_H2C2 Integrase zinc binding domain 1.5E-19 119 175
PF00665 rve Integrase core domain 4.0E-12 193 289
PF00385 Chromo Chromo (CHRromatin Organisation MOdifier) domain 7.9E-10 489 538

Swissprot hits

[Show all]
Swissprot ID Swissprot Description Start End E-value
sp|P0CT37|TF24_SCHPO Transposon Tf2-4 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-4 PE=3 SV=1 1 465 4.0E-54
sp|P0CT40|TF29_SCHPO Transposon Tf2-9 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-9 PE=3 SV=1 1 465 4.0E-54
sp|P0CT41|TF212_SCHPO Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-12 PE=3 SV=1 1 465 4.0E-54
sp|P0CT36|TF23_SCHPO Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-3 PE=1 SV=1 1 465 4.0E-54
sp|P0CT38|TF25_SCHPO Transposon Tf2-5 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-5 PE=3 SV=1 1 465 4.0E-54
[Show all]
[Show less]
Swissprot ID Swissprot Description Start End E-value
sp|P0CT37|TF24_SCHPO Transposon Tf2-4 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-4 PE=3 SV=1 1 465 4.0E-54
sp|P0CT40|TF29_SCHPO Transposon Tf2-9 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-9 PE=3 SV=1 1 465 4.0E-54
sp|P0CT41|TF212_SCHPO Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-12 PE=3 SV=1 1 465 4.0E-54
sp|P0CT36|TF23_SCHPO Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-3 PE=1 SV=1 1 465 4.0E-54
sp|P0CT38|TF25_SCHPO Transposon Tf2-5 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-5 PE=3 SV=1 1 465 4.0E-54
sp|P0CT39|TF26_SCHPO Transposon Tf2-6 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-6 PE=3 SV=1 1 465 4.0E-54
sp|P0CT43|TF28_SCHPO Transposon Tf2-8 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-8 PE=3 SV=1 1 465 4.0E-54
sp|P0CT34|TF21_SCHPO Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-1 PE=3 SV=1 1 465 4.0E-54
sp|P0CT35|TF22_SCHPO Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-2 PE=3 SV=1 1 465 4.0E-54
sp|P0CT42|TF27_SCHPO Transposon Tf2-7 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-7 PE=3 SV=1 1 465 4.0E-54
sp|Q9UR07|TF211_SCHPO Transposon Tf2-11 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-11 PE=3 SV=1 1 465 5.0E-54
sp|Q7LHG5|YI31B_YEAST Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY3B-I PE=3 SV=2 2 473 4.0E-53
sp|Q99315|YG31B_YEAST Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY3B-G PE=1 SV=3 2 463 1.0E-52
sp|Q09575|YRD6_CAEEL Uncharacterized protein K02A2.6 OS=Caenorhabditis elegans GN=K02A2.6 PE=3 SV=1 105 443 5.0E-30
sp|P10394|POL4_DROME Retrovirus-related Pol polyprotein from transposon 412 OS=Drosophila melanogaster GN=POL PE=3 SV=1 46 441 1.0E-29
sp|P23074|POL_SFV1 Pro-Pol polyprotein OS=Simian foamy virus type 1 GN=pol PE=1 SV=3 102 372 1.0E-27
sp|Q87040|POL_SFVCP Pro-Pol polyprotein OS=Simian foamy virus (isolate chimpanzee) GN=pol PE=3 SV=1 110 370 3.0E-26
sp|P27401|POL_SFV3L Pro-Pol polyprotein OS=Simian foamy virus type 3 (strain LK3) GN=pol PE=3 SV=2 111 372 1.0E-25
sp|O93209|POL_FFV Pro-Pol polyprotein OS=Feline foamy virus GN=pol PE=3 SV=1 76 370 6.0E-25
sp|P14350|POL_FOAMV Pro-Pol polyprotein OS=Human spumaretrovirus GN=pol PE=1 SV=2 110 370 9.0E-25
sp|A4FUB7|GIN1_BOVIN Gypsy retrotransposon integrase-like protein 1 OS=Bos taurus GN=GIN1 PE=2 SV=1 126 398 6.0E-21
sp|P08361|POL_MLVCB Gag-Pol polyprotein (Fragment) OS=Cas-Br-E murine leukemia virus GN=gag-pol PE=3 SV=1 199 434 6.0E-20
sp|P03356|POL_MLVAV Pol polyprotein OS=AKV murine leukemia virus GN=pol PE=3 SV=2 214 434 3.0E-19
sp|P31795|POL_MLVRK Pol polyprotein (Fragment) OS=Radiation murine leukemia virus (strain Kaplan) GN=pol PE=3 SV=1 190 434 3.0E-19
sp|P11227|POL_MLVRD Pol polyprotein OS=Radiation murine leukemia virus GN=pol PE=3 SV=1 214 434 3.0E-19
sp|Q9NXP7|GIN1_HUMAN Gypsy retrotransposon integrase-like protein 1 OS=Homo sapiens GN=GIN1 PE=2 SV=3 100 394 6.0E-19
sp|P26808|POL_MLVFP Pol polyprotein OS=Friend murine leukemia virus (isolate PVC-211) GN=pol PE=3 SV=1 214 434 1.0E-18
sp|Q5RBK0|GIN1_PONAB Gypsy retrotransposon integrase-like protein 1 OS=Pongo abelii GN=GIN1 PE=2 SV=1 100 394 1.0E-18
sp|P26810|POL_MLVF5 Pol polyprotein OS=Friend murine leukemia virus (isolate 57) GN=pol PE=3 SV=1 214 434 1.0E-18
sp|P26809|POL_MLVFF Pol polyprotein OS=Friend murine leukemia virus (isolate FB29) GN=pol PE=3 SV=1 214 434 2.0E-18
sp|P03355|POL_MLVMS Gag-Pol polyprotein OS=Moloney murine leukemia virus (isolate Shinnick) GN=gag-pol PE=1 SV=4 214 434 2.0E-18
sp|Q4R6I1|GIN1_MACFA Gypsy retrotransposon integrase-like protein 1 OS=Macaca fascicularis GN=GIN1 PE=2 SV=1 100 394 2.0E-18
sp|Q8K259|GIN1_MOUSE Gypsy retrotransposon integrase-like protein 1 OS=Mus musculus GN=Gin1 PE=2 SV=2 100 394 1.0E-17
sp|Q2F7J0|POL_XMRV4 Gag-Pol polyprotein OS=Xenotropic MuLV-related virus (isolate VP42) GN=gag-pol PE=3 SV=1 190 434 2.0E-17
sp|Q9TTC1|POL_KORV Pro-Pol polyprotein OS=Koala retrovirus GN=pro-pol PE=3 SV=1 81 318 2.0E-17
sp|A1Z651|POL_XMRV6 Gag-Pol polyprotein OS=Xenotropic MuLV-related virus (isolate VP62) GN=gag-pol PE=1 SV=1 190 434 2.0E-17
sp|Q2F7J3|POL_XMRV3 Gag-Pol polyprotein OS=Xenotropic MuLV-related virus (isolate VP35) GN=gag-pol PE=1 SV=1 190 434 3.0E-17
sp|P10272|POL_BAEVM Pol polyprotein OS=Baboon endogenous virus (strain M7) GN=pol PE=3 SV=1 83 434 1.0E-16
sp|Q66H30|GIN1_RAT Gypsy retrotransposon integrase-like protein 1 OS=Rattus norvegicus GN=GIN1 PE=2 SV=1 100 349 2.0E-16
sp|Q5DTZ0|NYNRI_MOUSE Protein NYNRIN OS=Mus musculus GN=Nynrin PE=2 SV=2 109 451 5.0E-16
sp|P31792|POL_FENV1 Pol polyprotein (Fragment) OS=Feline endogenous virus ECE1 GN=pol PE=3 SV=1 83 409 3.0E-15
sp|P21414|POL_GALV Pol polyprotein OS=Gibbon ape leukemia virus GN=pol PE=3 SV=1 122 349 1.0E-14
sp|P03360|POL_AVIRE Pol polyprotein (Fragment) OS=Avian reticuloendotheliosis virus GN=pol PE=3 SV=1 111 349 2.0E-13
sp|Q9P2P1|NYNRI_HUMAN Protein NYNRIN OS=Homo sapiens GN=NYNRIN PE=2 SV=3 109 341 1.0E-11
sp|P10401|POLY_DROME Retrovirus-related Pol polyprotein from transposon gypsy OS=Drosophila melanogaster GN=pol PE=3 SV=1 136 340 7.0E-10
sp|P03359|POL_WMSV Pol polyprotein (Fragment) OS=Woolly monkey sarcoma virus GN=pol PE=3 SV=1 191 349 8.0E-10
sp|Q8I7P9|POL5_DROME Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogaster GN=pol PE=3 SV=1 136 421 4.0E-09
sp|O92815|POL_WDSV Gag-Pol polyprotein OS=Walleye dermal sarcoma virus GN=gag-pol PE=1 SV=2 107 336 7.0E-09
sp|P04323|POL3_DROME Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogaster GN=pol PE=3 SV=1 2 346 1.0E-08
sp|P20825|POL2_DROME Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaster GN=pol PE=3 SV=1 2 349 2.0E-08
[Show less]

GO

GO Term Description Terminal node
GO:0015074 DNA integration Yes
GO:0044260 cellular macromolecule metabolic process No
GO:0006139 nucleobase-containing compound metabolic process No
GO:0009987 cellular process No
GO:0034641 cellular nitrogen compound metabolic process No
GO:0008152 metabolic process No
GO:0008150 biological_process No
GO:0044238 primary metabolic process No
GO:0071704 organic substance metabolic process No
GO:0046483 heterocycle metabolic process No
GO:0043170 macromolecule metabolic process No
GO:1901360 organic cyclic compound metabolic process No
GO:0090304 nucleic acid metabolic process No
GO:0006725 cellular aromatic compound metabolic process No
GO:0044237 cellular metabolic process No
GO:0006807 nitrogen compound metabolic process No
GO:0006259 DNA metabolic process No

SignalP

[Help with interpreting these statistics]
SignalP signal predicted Location
(based on Ymax)
D score
(significance: > 0.45)
No 1 - 19 0.45

Transmembrane Domains

(None)

Transcription Factor Class

(None)

Expression data

No expression data available for this genome

Sequences

Type of sequenceSequence
Locus Download genbank file of locus
The gene with 5 kb flanks (if sufficient flanking sequence is available). For use in cloning design programs. NOTE: features (genes or exons) that are only partially contained within the sequence are completely excluded.
Protein >Agabi119p4|694900
MLQEYDFLLRHIPGKTNTKADILSRLIKPDTSNDNRGVEMFKEKMFIRRLEESTPIYDVTLLHNRRFEILADETV
LEKIRKCERRETRVLEEMKKQPEKVWENKGIIYRQGRIYVPDNQEIRDFILHDHHNSPDAGHPGTYRMLESVKRT
FWWPTIKTDIRRYVRGCDMCQKNKTIRRPDHIPLNPLPIPDKPWEEISIDMIGPLPKSKEKDAIIVIVDRFSKMI
HLVPTTTSLTSMDLAEIYKEEVWRHHGIPKRIISDRGPQFASKFMESLCKALGIERNLSTAYHPQTDGQTERMNQ
EIETYLRAFINYRQDDWTRWLPMAEFHYNDKTHAATGQTPFFLNYGLHPWKGNITVETTNPTATSLIEELENVQK
EAKAAMEANNEMMRERGNNKHHKEPFAEGDKVWLETTNIHSNRPTRKLDHKRYGPFEILKQIGDRSYKLKLPDTW
AIHDVFHTSLLTKVRDPEFDSQKQPTPPPPDIINEEEEYEVEEIRGHRRKGRGIQFLVHWKGYGNEDDSWIPRSS
LENAEEALSEYRAKLPNGQL*
Coding >Agabi119p4|694900
ATGCTACAGGAATACGACTTCCTTCTACGACACATCCCTGGGAAAACTAACACCAAAGCAGACATCCTGTCAAGA
CTAATTAAACCCGACACATCTAACGACAACCGAGGAGTAGAAATGTTCAAAGAGAAAATGTTTATCCGAAGGCTT
GAAGAATCCACCCCCATCTATGATGTCACCTTACTCCACAATCGAAGATTCGAGATTTTAGCCGATGAAACCGTA
CTTGAGAAGATTAGGAAATGTGAAAGACGAGAAACCAGAGTATTAGAAGAGATGAAGAAGCAACCAGAGAAAGTA
TGGGAGAACAAAGGAATCATTTACCGACAAGGAAGAATCTATGTTCCGGATAACCAGGAAATCAGAGATTTCATC
CTTCACGATCATCATAATTCCCCCGACGCCGGACATCCTGGAACATACCGAATGCTAGAATCAGTTAAACGAACC
TTTTGGTGGCCTACGATCAAAACGGATATCAGAAGATATGTCAGAGGATGCGACATGTGCCAGAAGAACAAAACG
ATCCGACGACCCGATCACATTCCGCTTAACCCATTACCCATCCCCGACAAACCTTGGGAGGAAATATCTATAGAC
ATGATTGGACCACTACCGAAGTCAAAAGAGAAGGATGCTATTATTGTTATCGTTGACAGATTTTCCAAAATGATC
CACCTCGTTCCCACTACCACGTCACTCACATCCATGGATCTTGCGGAAATCTATAAGGAAGAAGTCTGGCGACAT
CACGGAATTCCGAAACGGATTATTAGCGACAGAGGACCACAATTCGCATCGAAATTTATGGAATCACTATGCAAA
GCGCTAGGCATTGAACGAAACCTTTCTACGGCCTACCACCCACAAACAGACGGTCAAACAGAACGGATGAATCAG
GAAATCGAGACCTACCTTCGAGCATTCATCAATTATCGACAAGACGATTGGACGAGATGGCTTCCCATGGCAGAA
TTCCATTACAACGACAAAACCCACGCTGCCACCGGACAAACCCCATTCTTCTTAAACTACGGACTTCACCCATGG
AAGGGTAATATCACGGTTGAAACGACAAACCCCACCGCCACCTCCCTGATCGAAGAATTAGAGAATGTGCAGAAA
GAAGCTAAAGCTGCGATGGAAGCAAACAACGAGATGATGAGAGAAAGAGGAAACAACAAGCACCACAAGGAACCC
TTTGCCGAAGGAGATAAAGTTTGGTTGGAAACGACGAACATTCATTCCAATCGTCCGACTCGGAAACTAGACCAC
AAACGATATGGACCTTTCGAGATCTTGAAACAAATCGGCGATCGATCTTACAAACTGAAGTTACCTGATACCTGG
GCGATACACGACGTCTTTCATACATCGCTCCTAACAAAAGTCCGAGACCCAGAGTTTGACAGTCAGAAGCAACCC
ACTCCACCTCCACCTGACATTATCAACGAAGAAGAGGAATATGAAGTTGAAGAAATCCGAGGACACCGACGAAAA
GGCCGAGGAATACAATTTCTAGTTCACTGGAAAGGTTATGGAAACGAAGACGACTCTTGGATACCACGCTCATCC
CTAGAAAATGCAGAAGAAGCACTCTCCGAATATAGAGCAAAACTCCCGAATGGACAGTTATAA
Transcript >Agabi119p4|694900
ATGCTACAGGAATACGACTTCCTTCTACGACACATCCCTGGGAAAACTAACACCAAAGCAGACATCCTGTCAAGA
CTAATTAAACCCGACACATCTAACGACAACCGAGGAGTAGAAATGTTCAAAGAGAAAATGTTTATCCGAAGGCTT
GAAGAATCCACCCCCATCTATGATGTCACCTTACTCCACAATCGAAGATTCGAGATTTTAGCCGATGAAACCGTA
CTTGAGAAGATTAGGAAATGTGAAAGACGAGAAACCAGAGTATTAGAAGAGATGAAGAAGCAACCAGAGAAAGTA
TGGGAGAACAAAGGAATCATTTACCGACAAGGAAGAATCTATGTTCCGGATAACCAGGAAATCAGAGATTTCATC
CTTCACGATCATCATAATTCCCCCGACGCCGGACATCCTGGAACATACCGAATGCTAGAATCAGTTAAACGAACC
TTTTGGTGGCCTACGATCAAAACGGATATCAGAAGATATGTCAGAGGATGCGACATGTGCCAGAAGAACAAAACG
ATCCGACGACCCGATCACATTCCGCTTAACCCATTACCCATCCCCGACAAACCTTGGGAGGAAATATCTATAGAC
ATGATTGGACCACTACCGAAGTCAAAAGAGAAGGATGCTATTATTGTTATCGTTGACAGATTTTCCAAAATGATC
CACCTCGTTCCCACTACCACGTCACTCACATCCATGGATCTTGCGGAAATCTATAAGGAAGAAGTCTGGCGACAT
CACGGAATTCCGAAACGGATTATTAGCGACAGAGGACCACAATTCGCATCGAAATTTATGGAATCACTATGCAAA
GCGCTAGGCATTGAACGAAACCTTTCTACGGCCTACCACCCACAAACAGACGGTCAAACAGAACGGATGAATCAG
GAAATCGAGACCTACCTTCGAGCATTCATCAATTATCGACAAGACGATTGGACGAGATGGCTTCCCATGGCAGAA
TTCCATTACAACGACAAAACCCACGCTGCCACCGGACAAACCCCATTCTTCTTAAACTACGGACTTCACCCATGG
AAGGGTAATATCACGGTTGAAACGACAAACCCCACCGCCACCTCCCTGATCGAAGAATTAGAGAATGTGCAGAAA
GAAGCTAAAGCTGCGATGGAAGCAAACAACGAGATGATGAGAGAAAGAGGAAACAACAAGCACCACAAGGAACCC
TTTGCCGAAGGAGATAAAGTTTGGTTGGAAACGACGAACATTCATTCCAATCGTCCGACTCGGAAACTAGACCAC
AAACGATATGGACCTTTCGAGATCTTGAAACAAATCGGCGATCGATCTTACAAACTGAAGTTACCTGATACCTGG
GCGATACACGACGTCTTTCATACATCGCTCCTAACAAAAGTCCGAGACCCAGAGTTTGACAGTCAGAAGCAACCC
ACTCCACCTCCACCTGACATTATCAACGAAGAAGAGGAATATGAAGTTGAAGAAATCCGAGGACACCGACGAAAA
GGCCGAGGAATACAATTTCTAGTTCACTGGAAAGGTTATGGAAACGAAGACGACTCTTGGATACCACGCTCATCC
CTAGAAAATGCAGAAGAAGCACTCTCCGAATATAGAGCAAAACTCCCGAATGGACAGTTATAA
Gene >Agabi119p4|694900
ATGCTACAGGAATACGACTTCCTTCTACGACACATCCCTGGGAAAACTAACACCAAAGCAGACATCCTGTCAAGA
CTAATTAAACCCGACACATCTAACGACAACCGAGGAGTAGAAATGTTCAAAGAGAAAATGTTTATCCGAAGGCTT
GAAGAATCCACCCCCATCTATGATGTCACCTTACTCCACAATCGAAGATTCGAGATTTTAGCCGATGAAACCGTA
CTTGAGAAGATTAGGAAATGTGAAAGACGAGAAACCAGAGTATTAGAAGAGATGAAGAAGCAACCAGAGAAAGTA
TGGGAGAACAAAGGAATCATTTACCGACAAGGAAGAATCTATGTTCCGGATAACCAGGAAATCAGAGATTTCATC
CTTCACGATCATCATAATTCCCCCGACGCCGGACATCCTGGAACATACCGAATGCTAGAATCAGTTAAACGAACC
TTTTGGTGGCCTACGATCAAAACGGATATCAGAAGATATGTCAGAGGATGCGACATGTGCCAGAAGAACAAAACG
ATCCGACGACCCGATCACATTCCGCTTAACCCATTACCCATCCCCGACAAACCTTGGGAGGAAATATCTATAGAC
ATGATTGGACCACTACCGAAGTCAAAAGAGAAGGATGCTATTATTGTTATCGTTGACAGATTTTCCAAAATGATC
CACCTCGTTCCCACTACCACGTCACTCACATCCATGGATCTTGCGGAAATCTATAAGGAAGAAGTCTGGCGACAT
CACGGAATTCCGAAACGGATTATTAGCGACAGAGGACCACAATTCGCATCGAAATTTATGGAATCACTATGCAAA
GCGCTAGGCATTGAACGAAACCTTTCTACGGCCTACCACCCACAAACAGACGGTCAAACAGAACGGATGAATCAG
GAAATCGAGACCTACCTTCGAGCATTCATCAATTATCGACAAGACGATTGGACGAGATGGCTTCCCATGGCAGAA
TTCCATTACAACGACAAAACCCACGCTGCCACCGGACAAACCCCATTCTTCTTAAACTACGGACTTCACCCATGG
AAGGGTAATATCACGGTTGAAACGACAAACCCCACCGCCACCTCCCTGATCGAAGAATTAGAGAATGTGCAGAAA
GAAGCTAAAGCTGCGATGGAAGCAAACAACGAGATGATGAGAGAAAGAGGAAACAACAAGCACCACAAGGAACCC
TTTGCCGAAGGAGATAAAGTTTGGTTGGAAACGACGAACATTCATTCCAATCGTCCGACTCGGAAACTAGACCAC
AAACGATATGGACCTTTCGAGATCTTGAAACAAATCGGCGATCGATCTTACAAACTGAAGTTACCTGATACCTGG
GCGATACACGACGTCTTTCATACATCGCTCCTAACAAAAGTCCGAGACCCAGAGTTTGACAGTCAGAAGCAACCC
ACTCCACCTCCACCTGACATTATCAACGAAGAAGAGGAATATGAAGTTGAAGAAATCCGAGGACACCGACGAAAA
GGCCGAGGAATACAATTTCTAGTTCACTGGAAAGGTTATGGAAACGAAGACGACTCTTGGATACCACGCTCATCC
CTAGAAAATGCAGAAGAAGCACTCTCCGAATATAGAGCAAAACTCCCGAATGGACAGTTATAA

© 2022 - Robin Ohm - Utrecht University - The Netherlands

Built with Python Django and Wagtail