Fungal Genomics

at Utrecht University

General Properties

Protein IDHirsu2|1906
Gene name
LocationContig_1436:4383..5519
Strand+
Gene length (bp)1136
Transcript length (bp)912
Coding sequence length (bp)912
Protein length (aa) 304

Your browser does not support drawing a protein figure.

PFAM Domains

PFAM Domain ID Short name Long name E-value Start End
PF02265 S1-P1_nuclease S1/P1 Nuclease 2.5E-69 21 287

Swissprot hits

[Show all]
Swissprot ID Swissprot Description Start End E-value
sp|P24021|NUS1_ASPOR Nuclease S1 OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40) GN=nucS PE=1 SV=2 4 289 5.0E-89
sp|P24504|NUP3_PENSQ Nuclease PA3 OS=Penicillium sp. PE=1 SV=1 21 286 2.0E-67
sp|P24289|NUP1_PENCI Nuclease P1 OS=Penicillium citrinum PE=1 SV=1 21 286 6.0E-67
sp|F4JJL0|ENDO4_ARATH Endonuclease 4 OS=Arabidopsis thaliana GN=ENDO4 PE=3 SV=1 18 288 3.0E-39
sp|Q8LDW6|ENDO3_ARATH Endonuclease 3 OS=Arabidopsis thaliana GN=ENDO3 PE=2 SV=1 18 297 1.0E-36
[Show all]
[Show less]
Swissprot ID Swissprot Description Start End E-value
sp|P24021|NUS1_ASPOR Nuclease S1 OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40) GN=nucS PE=1 SV=2 4 289 5.0E-89
sp|P24504|NUP3_PENSQ Nuclease PA3 OS=Penicillium sp. PE=1 SV=1 21 286 2.0E-67
sp|P24289|NUP1_PENCI Nuclease P1 OS=Penicillium citrinum PE=1 SV=1 21 286 6.0E-67
sp|F4JJL0|ENDO4_ARATH Endonuclease 4 OS=Arabidopsis thaliana GN=ENDO4 PE=3 SV=1 18 288 3.0E-39
sp|Q8LDW6|ENDO3_ARATH Endonuclease 3 OS=Arabidopsis thaliana GN=ENDO3 PE=2 SV=1 18 297 1.0E-36
sp|F4JJL3|ENDO5_ARATH Endonuclease 5 OS=Arabidopsis thaliana GN=ENDO5 PE=2 SV=1 17 288 6.0E-32
sp|Q9C9G4|ENDO2_ARATH Endonuclease 2 OS=Arabidopsis thaliana GN=ENDO2 PE=1 SV=1 12 286 3.0E-30
sp|Q9SXA6|ENDO1_ARATH Endonuclease 1 OS=Arabidopsis thaliana GN=ENDO1 PE=1 SV=1 20 289 5.0E-27
[Show less]

GO

GO Term Description Terminal node
GO:0006308 DNA catabolic process Yes
GO:0003676 nucleic acid binding Yes
GO:0004519 endonuclease activity Yes
GO:0009987 cellular process No
GO:0016788 hydrolase activity, acting on ester bonds No
GO:1901363 heterocyclic compound binding No
GO:0009057 macromolecule catabolic process No
GO:0019439 aromatic compound catabolic process No
GO:0034655 nucleobase-containing compound catabolic process No
GO:0009056 catabolic process No
GO:0003824 catalytic activity No
GO:0016787 hydrolase activity No
GO:0090304 nucleic acid metabolic process No
GO:0006725 cellular aromatic compound metabolic process No
GO:1901361 organic cyclic compound catabolic process No
GO:0006139 nucleobase-containing compound metabolic process No
GO:0004518 nuclease activity No
GO:1901360 organic cyclic compound metabolic process No
GO:0034641 cellular nitrogen compound metabolic process No
GO:0044237 cellular metabolic process No
GO:0043170 macromolecule metabolic process No
GO:0097159 organic cyclic compound binding No
GO:0006259 DNA metabolic process No
GO:0046700 heterocycle catabolic process No
GO:0071704 organic substance metabolic process No
GO:0006807 nitrogen compound metabolic process No
GO:0005488 binding No
GO:0044238 primary metabolic process No
GO:0008152 metabolic process No
GO:0044248 cellular catabolic process No
GO:0044270 cellular nitrogen compound catabolic process No
GO:0008150 biological_process No
GO:0044260 cellular macromolecule metabolic process No
GO:0046483 heterocycle metabolic process No
GO:0044265 cellular macromolecule catabolic process No
GO:0003674 molecular_function No
GO:1901575 organic substance catabolic process No

SignalP

[Help with interpreting these statistics]
SignalP signal predicted Location
(based on Ymax)
D score
(significance: > 0.45)
No 1 - 21 0.5

Transmembrane Domains

Domain # Start End Length
1 5 27 22

Transcription Factor Class

(None)

Expression data

No expression data available for this genome

Sequences

Type of sequenceSequence
Locus Download genbank file of locus
The gene with 5 kb flanks (if sufficient flanking sequence is available). For use in cloning design programs. NOTE: features (genes or exons) that are only partially contained within the sequence are completely excluded.
Protein >Hirsu2|1906
MKTPLGAAALGLIFAPAAVAWGSLGHITTAYLASHYIADSTESFFKELLRNDDDDYLASVASWADSVRYTRWGRF
TKTFHFIDAHDDPPYSCNVDFERDCKETGCVINALANYTDQLLDDSLPAWRRAQAAKFVIHFVGDLHQPMHNENV
EKGGNGIYVLWEGREFNLHHVWDSSIAEKWIGGLRGGVYPLAEKWANQLAVEISDGKFADDKEDWLKDLDLDDAI
ETAMAWSREANAIVCTHVFPDGVDAVDGQELAGRYFEEAGPVIEKQVARAGFRMAAWLDGIVAECEARRAARQMS
VEL*
Coding >Hirsu2|1906
ATGAAGACGCCGCTCGGCGCCGCGGCCTTGGGCCTCATCTTCGCCCCCGCCGCCGTCGCCTGGGGAAGTCTCGGT
CACATCACGACGGCCTACCTGGCCAGCCACTACATCGCCGACTCGACCGAGTCCTTCTTCAAGGAGCTCCTGCGC
AACGACGACGACGACTACCTGGCCAGCGTCGCCTCGTGGGCCGACTCGGTCCGGTATACGCGATGGGGCCGCTTC
ACCAAGACGTTCCACTTCATCGACGCCCACGACGACCCGCCCTACTCGTGCAACGTCGACTTCGAGCGCGACTGC
AAGGAGACGGGCTGCGTCATCAACGCCCTGGCCAACTACACGGACCAGCTGCTGGACGATTCGCTGCCCGCCTGG
CGACGGGCCCAGGCCGCCAAGTTCGTCATCCACTTCGTCGGCGACCTGCACCAGCCGATGCACAACGAGAACGTC
GAAAAGGGCGGCAACGGCATCTACGTGCTCTGGGAGGGCAGGGAGTTCAACCTGCACCACGTCTGGGACAGCTCC
ATCGCCGAGAAGTGGATCGGCGGCTTGCGAGGAGGCGTCTATCCGCTGGCCGAGAAGTGGGCGAACCAGCTGGCC
GTCGAAATCTCGGACGGCAAGTTCGCCGACGACAAGGAGGACTGGCTCAAGGACCTTGATCTGGACGATGCCATC
GAGACGGCCATGGCGTGGTCGCGCGAGGCCAACGCCATTGTGTGCACCCACGTCTTCCCCGATGGCGTCGATGCC
GTGGATGGCCAGGAGCTAGCCGGACGCTACTTCGAAGAGGCTGGCCCGGTCATCGAGAAACAGGTCGCGCGCGCC
GGCTTCCGGATGGCGGCCTGGCTGGACGGCATCGTTGCCGAGTGTGAGGCGCGCAGGGCGGCACGACAGATGTCG
GTGGAACTGTAG
Transcript >Hirsu2|1906
ATGAAGACGCCGCTCGGCGCCGCGGCCTTGGGCCTCATCTTCGCCCCCGCCGCCGTCGCCTGGGGAAGTCTCGGT
CACATCACGACGGCCTACCTGGCCAGCCACTACATCGCCGACTCGACCGAGTCCTTCTTCAAGGAGCTCCTGCGC
AACGACGACGACGACTACCTGGCCAGCGTCGCCTCGTGGGCCGACTCGGTCCGGTATACGCGATGGGGCCGCTTC
ACCAAGACGTTCCACTTCATCGACGCCCACGACGACCCGCCCTACTCGTGCAACGTCGACTTCGAGCGCGACTGC
AAGGAGACGGGCTGCGTCATCAACGCCCTGGCCAACTACACGGACCAGCTGCTGGACGATTCGCTGCCCGCCTGG
CGACGGGCCCAGGCCGCCAAGTTCGTCATCCACTTCGTCGGCGACCTGCACCAGCCGATGCACAACGAGAACGTC
GAAAAGGGCGGCAACGGCATCTACGTGCTCTGGGAGGGCAGGGAGTTCAACCTGCACCACGTCTGGGACAGCTCC
ATCGCCGAGAAGTGGATCGGCGGCTTGCGAGGAGGCGTCTATCCGCTGGCCGAGAAGTGGGCGAACCAGCTGGCC
GTCGAAATCTCGGACGGCAAGTTCGCCGACGACAAGGAGGACTGGCTCAAGGACCTTGATCTGGACGATGCCATC
GAGACGGCCATGGCGTGGTCGCGCGAGGCCAACGCCATTGTGTGCACCCACGTCTTCCCCGATGGCGTCGATGCC
GTGGATGGCCAGGAGCTAGCCGGACGCTACTTCGAAGAGGCTGGCCCGGTCATCGAGAAACAGGTCGCGCGCGCC
GGCTTCCGGATGGCGGCCTGGCTGGACGGCATCGTTGCCGAGTGTGAGGCGCGCAGGGCGGCACGACAGATGTCG
GTGGAACTGTAG
Gene >Hirsu2|1906
ATGAAGACGCCGCTCGGCGCCGCGGCCTTGGGCCTCATCTTCGCCCCCGCCGCCGTCGCCTGGGGAAGTATGTTG
GTCCCGCTTCCGTCCCCGCCGGCCCCGTCTCCGGCCGTCTCACCTCCGCCGCCGGGGGGGGAAAGCTCCTTGCCT
CCCACCCCCTCCCCGGCCAACAAGGCCGCACGGACTGACGGAAGCGTCAGGTCTCGGTCACATCACGACGGCCTA
CCTGGCCAGCCACTACATCGCCGACTCGACCGAGTCCTTCTTCAAGGAGCTCCTGCGCAACGACGACGACGACTA
CCTGGCCAGCGTCGCCTCGTGGGCCGACTCGGTCCGGTATACGCGATGGGGCCGCTTCACCAAGACGTTCCACTT
CATCGACGCCCACGACGACCCGCCCTACTCGTGCAACGTCGACTTCGAGCGCGACTGCAAGGAGACGGGCTGCGT
CATCAACGCCCTGGCCAACTACACGGACCAGCTGCTGGACGATTCGCTGCCCGCCTGGCGACGGGCCCAGGCCGC
CAAGTTCGTCATCCACTTCGTCGGCGACCTGCACCAGCCGATGCACAACGAGAACGTCGAAAAGGGCGGCAACGG
CATCTACGTGCTCTGGGAGGGCAGGGAGTTCAACCTGCACCACGTCTGGGACAGCTCCATCGCCGAGAAGTGGAT
CGGCGGCTTGCGAGGAGGCGTCTATCCGCTGGCCGAGAAGTGGGCGAACCAGCTGGCCGTCGAAATCTCGGACGG
CAAGTTCGCCGACGACAAGGAGGACTGGCTCAAGGACCTTGATCTGGACGATGCCATCGAGACGGCCATGGCGTG
GTCGCGCGAGGCCAACGCCATTGTGTGCACCCACGGTAAGTCCCAAGTCCAAATGAGACGATTCCTCGACTCCAG
ACGAGCGCCGGGCCTTGGCTCATCCGGTAACCCTGTCTCTGTCGCTTCCAGTCTTCCCCGATGGCGTCGATGCCG
TGGATGGCCAGGAGCTAGCCGGACGCTACTTCGAAGAGGCTGGCCCGGTCATCGAGAAACAGGTCGCGCGCGCCG
GCTTCCGGATGGCGGCCTGGCTGGACGGCATCGTTGCCGAGTGTGAGGCGCGCAGGGCGGCACGACAGATGTCGG
TGGAACTGTAG

© 2020 - Robin Ohm - Utrecht University - The Netherlands

Built with Python Django and Wagtail