Fungal Genomics

at Utrecht University

General Properties

Protein ID Hirsu2|6983
Gene name
Location Contig_383:13304..16758
Strand +
Gene length (bp) 3454
Transcript length (bp) 3324
Coding sequence length (bp) 3324
Protein length (aa) 1108
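The lengths in this table are mutually consistent, which a short check makes explicit. The sketch below is illustrative only and not part of the database. The variable names are assumptions. Two readings are inferred from the figures rather than stated by the source: that the protein length counts the terminal stop position (the listed protein sequence ends in `*`), and that the location span is not counted inclusively at both ends.

```python
# Values copied from the General Properties table.
location_start = 13304
location_end = 16758
gene_length_bp = 3454
transcript_length_bp = 3324
cds_length_bp = 3324
protein_length_aa = 1108


def codon_count(cds_bp: int) -> int:
    """Return the number of codons in a coding sequence of the given length."""
    if cds_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return cds_bp // 3


# Codons in the CDS, including the stop codon.
codons = codon_count(cds_length_bp)

# Bases present in the gene but absent from the transcript (introns).
intronic_bp = gene_length_bp - transcript_length_bp

# Bases present in the transcript but absent from the CDS (annotated UTRs).
utr_bp = transcript_length_bp - cds_length_bp

# Span implied by the location coordinates (assumed not inclusive at both ends).
location_span_bp = location_end - location_start

print(codons, intronic_bp, utr_bp, location_span_bp)
```

Under these assumptions the CDS holds 1108 codons (1107 residues plus the stop), the gene carries 130 bp of intronic sequence, and no untranslated region is annotated.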


PFAM Domains

PFAM Domain ID Short name Long name E-value Start End
PF03221 HTH_Tnp_Tc5 Tc5 transposase DNA-binding domain 4.4E-08 795 858
PF05920 Homeobox_KN Homeobox KN domain 1.5E-15 210 249
PF00046 Homeodomain Homeodomain 7.4E-06 198 251
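The Start and End columns show that two of the three PFAM hits cover the same region of the protein. A minimal sketch of that comparison follows. The `Domain` type and the `contains` helper are hypothetical names introduced for illustration, and the rows are copied from the table above.

```python
from typing import NamedTuple


class Domain(NamedTuple):
    pfam_id: str
    name: str
    evalue: float
    start: int
    end: int


# Rows copied from the PFAM Domains table.
domains = [
    Domain("PF03221", "HTH_Tnp_Tc5", 4.4e-08, 795, 858),
    Domain("PF05920", "Homeobox_KN", 1.5e-15, 210, 249),
    Domain("PF00046", "Homeodomain", 7.4e-06, 198, 251),
]


def contains(outer: Domain, inner: Domain) -> bool:
    """True if the inner hit lies entirely within the outer hit."""
    return outer.start <= inner.start and inner.end <= outer.end


# Pairs (inner, outer) where one hit is nested inside another.
nested = [
    (inner.name, outer.name)
    for outer in domains
    for inner in domains
    if outer is not inner and contains(outer, inner)
]

# Domains ordered along the protein, N-terminus first.
by_position = sorted(domains, key=lambda d: d.start)

print(nested)
print([d.name for d in by_position])
```

The Homeobox_KN hit (210 to 249) falls inside the Homeodomain hit (198 to 251), so both describe a single homeobox region, while the HTH_Tnp_Tc5 hit (795 to 858) is a separate C-terminal region.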

Swissprot hits

Swissprot ID Swissprot Description Start End E-value
sp|Q86IH1|HBX4_DICDI Homeobox protein 4 OS=Dictyostelium discoideum GN=hbx4 PE=3 SV=1 177 249 1.0E-14
sp|Q8SRR1|HD12_ENCCU Homeobox protein HD-12 OS=Encephalitozoon cuniculi (strain GB-M1) GN=HD-12 PE=3 SV=1 199 249 2.0E-10
sp|Q8SR09|HD2_ENCCU Homeobox protein HD-2 OS=Encephalitozoon cuniculi (strain GB-M1) GN=HD-2 PE=3 SV=1 193 249 2.0E-09
sp|Q8MIE6|TF2LX_HYLLA Homeobox protein TGIF2LX OS=Hylobates lar GN=TGIF2LX PE=2 SV=1 176 249 4.0E-09
sp|Q874N1|MTAL2_KLUDE Mating-type protein ALPHA2 OS=Kluyveromyces delphensis GN=MATALPHA2 PE=3 SV=1 189 255 9.0E-09
sp|A8WL06|UNC62_CAEBR Homeobox protein unc-62 OS=Caenorhabditis briggsae GN=unc-62 PE=3 SV=2 194 254 1.0E-08
sp|Q9N5D6|UNC62_CAEEL Homeobox protein unc-62 OS=Caenorhabditis elegans GN=unc-62 PE=1 SV=1 194 254 1.0E-08
sp|Q9SW80|BLH2_ARATH BEL1-like homeodomain protein 2 OS=Arabidopsis thaliana GN=BLH2 PE=1 SV=3 203 249 1.0E-08
sp|Q54VB4|HBX9_DICDI Homeobox protein 9 OS=Dictyostelium discoideum GN=hbx9 PE=3 SV=1 202 249 1.0E-08
sp|Q870I4|MTL32_CANGA Mating-type-like protein ALPHA2, silenced copy at MTL3 OS=Candida glabrata (strain ATCC 2001 / CBS 138 / JCM 3761 / NBRC 0622 / NRRL Y-65) GN=MTL3alpha2 PE=3 SV=1 196 251 2.0E-08
sp|Q9HDS5|MTAL2_KLULA Mating-type protein ALPHA2 OS=Kluyveromyces lactis (strain ATCC 8585 / CBS 2359 / DSM 70799 / NBRC 1267 / NRRL Y-1140 / WM37) GN=HMLALPHA2 PE=3 SV=1 194 252 2.0E-08
sp|Q90655|AKR_CHICK Homeobox protein AKR OS=Gallus gallus PE=2 SV=1 201 249 2.0E-08
sp|Q5U4X3|MEI3A_XENLA Homeobox protein meis3-A OS=Xenopus laevis GN=meis3-a PE=2 SV=1 199 254 3.0E-08
sp|Q1PFD1|BLH11_ARATH BEL1-like homeodomain protein 11 OS=Arabidopsis thaliana GN=BLH11 PE=2 SV=1 203 249 3.0E-08
sp|Q6DIF3|MEIS3_XENTR Homeobox protein meis3 OS=Xenopus tropicalis GN=meis3 PE=2 SV=2 199 254 3.0E-08
sp|Q7ZY13|MEI3B_XENLA Homeobox protein meis3-B OS=Xenopus laevis GN=meis3-b PE=2 SV=2 199 254 3.0E-08
sp|O14770|MEIS2_HUMAN Homeobox protein Meis2 OS=Homo sapiens GN=MEIS2 PE=1 SV=2 199 254 3.0E-08
sp|P97367|MEIS2_MOUSE Homeobox protein Meis2 OS=Mus musculus GN=Meis2 PE=1 SV=2 199 254 3.0E-08
sp|A6NDR6|ME3L1_HUMAN Putative homeobox protein Meis3-like 1 OS=Homo sapiens GN=MEIS3P1 PE=5 SV=2 199 254 3.0E-08
sp|A8K0S8|ME3L2_HUMAN Putative homeobox protein Meis3-like 2 OS=Homo sapiens GN=MEIS3P2 PE=2 SV=1 199 263 3.0E-08
sp|Q99687|MEIS3_HUMAN Homeobox protein Meis3 OS=Homo sapiens GN=MEIS3 PE=2 SV=3 199 254 4.0E-08
sp|Q8MIB7|TF2LX_PANTR Homeobox protein TGIF2LX OS=Pan troglodytes GN=TGIF2LX PE=2 SV=2 202 249 4.0E-08
sp|Q8MIE9|TF2LX_GORGO Homeobox protein TGIF2LX OS=Gorilla gorilla gorilla GN=TGIF2LX PE=2 SV=1 202 249 4.0E-08
sp|Q86Z42|MTAL2_CANGA Mating-type-like protein ALPHA2 OS=Candida glabrata (strain ATCC 2001 / CBS 138 / JCM 3761 / NBRC 0622 / NRRL Y-65) GN=MTL1ALPHA2 PE=2 SV=1 196 258 4.0E-08
sp|A1YGI6|TF2LX_PANPA Homeobox protein TGIF2LX OS=Pan paniscus GN=TGIF2LX PE=3 SV=1 202 249 4.0E-08
sp|Q60954|MEIS1_MOUSE Homeobox protein Meis1 OS=Mus musculus GN=Meis1 PE=1 SV=1 199 254 4.0E-08
sp|P97368|MEIS3_MOUSE Homeobox protein Meis3 OS=Mus musculus GN=Meis3 PE=2 SV=2 199 254 4.0E-08
sp|O00470|MEIS1_HUMAN Homeobox protein Meis1 OS=Homo sapiens GN=MEIS1 PE=1 SV=1 199 254 4.0E-08
sp|Q8IUE1|TF2LX_HUMAN Homeobox protein TGIF2LX OS=Homo sapiens GN=TGIF2LX PE=1 SV=1 202 249 4.0E-08
sp|P79937|MEIS1_XENLA Homeobox protein Meis1 OS=Xenopus laevis GN=meis1 PE=1 SV=1 199 254 5.0E-08
sp|Q8C0Y1|TGIF2_MOUSE Homeobox protein TGIF2 OS=Mus musculus GN=Tgif2 PE=2 SV=1 199 249 5.0E-08
sp|Q94KL5|BLH4_ARATH BEL1-like homeodomain protein 4 OS=Arabidopsis thaliana GN=BLH4 PE=1 SV=2 203 249 6.0E-08
sp|Q9GZN2|TGIF2_HUMAN Homeobox protein TGIF2 OS=Homo sapiens GN=TGIF2 PE=1 SV=1 199 249 6.0E-08
sp|P70284|TGIF1_MOUSE Homeobox protein TGIF1 OS=Mus musculus GN=Tgif1 PE=1 SV=2 201 249 7.0E-08
sp|Q9LZM8|BLH9_ARATH BEL1-like homeodomain protein 9 OS=Arabidopsis thaliana GN=BLH9 PE=1 SV=1 167 249 7.0E-08
sp|Q8S897|BLH5_ARATH BEL1-like homeodomain protein 5 OS=Arabidopsis thaliana GN=BLH5 PE=1 SV=2 202 249 7.0E-08
sp|Q9SIW1|BLH7_ARATH BEL1-like homeodomain protein 7 OS=Arabidopsis thaliana GN=BLH7 PE=2 SV=1 203 379 7.0E-08
sp|O46339|HTH_DROME Homeobox protein homothorax OS=Drosophila melanogaster GN=hth PE=1 SV=1 199 254 8.0E-08
sp|O65685|BLH6_ARATH BEL1-like homeodomain protein 6 OS=Arabidopsis thaliana GN=BLH6 PE=1 SV=1 203 249 9.0E-08
sp|Q93348|IRX_CAEEL Putative iroquois-class homeodomain protein irx-1 OS=Caenorhabditis elegans GN=irx-1 PE=3 SV=3 200 251 1.0E-07
sp|P41817|CUP9_YEAST Homeobox protein CUP9 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=CUP9 PE=1 SV=1 201 249 1.0E-07
sp|Q8MID6|TF2LX_MACMU Homeobox protein TGIF2LX OS=Macaca mulatta GN=TGIF2LX PE=2 SV=1 202 249 1.0E-07
sp|Q9FWS9|BLH3_ARATH BEL1-like homeodomain protein 3 OS=Arabidopsis thaliana GN=BLH3 PE=1 SV=1 188 249 1.0E-07
sp|Q8MID1|TF2LX_MIOTA Homeobox protein TGIF2LX OS=Miopithecus talapoin GN=TGIF2LX PE=2 SV=1 202 249 1.0E-07
sp|Q38897|BEL1_ARATH Homeobox protein BEL1 homolog OS=Arabidopsis thaliana GN=BEL1 PE=1 SV=2 203 249 1.0E-07
sp|Q8MID8|TF2LX_MACFA Homeobox protein TGIF2LX OS=Macaca fascicularis GN=TGIF2LX PE=2 SV=1 202 249 1.0E-07
sp|P53147|TOS8_YEAST Homeobox protein TOS8 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TOS8 PE=3 SV=1 197 249 1.0E-07
sp|Q9FXG8|BLH10_ARATH BEL1-like homeodomain protein 10 OS=Arabidopsis thaliana GN=BLH10 PE=1 SV=1 203 249 1.0E-07
sp|Q9SJ56|BLH1_ARATH BEL1-like homeodomain protein 1 OS=Arabidopsis thaliana GN=BLH1 PE=1 SV=1 203 249 1.0E-07
sp|Q8IUE0|TF2LY_HUMAN Homeobox protein TGIF2LY OS=Homo sapiens GN=TGIF2LY PE=1 SV=1 202 249 1.0E-07
sp|Q5IS58|TGIF1_PANTR Homeobox protein TGIF1 OS=Pan troglodytes GN=TGIF1 PE=2 SV=1 201 249 2.0E-07
sp|Q15583|TGIF1_HUMAN Homeobox protein TGIF1 OS=Homo sapiens GN=TGIF1 PE=1 SV=3 201 249 2.0E-07
sp|P0CY12|MATA2_YEASX Putative mating-type protein A2 OS=Saccharomyces cerevisiae GN=MATA2 PE=3 SV=1 193 251 2.0E-07
sp|P0CY13|HMRA2_YEAST Silenced mating-type protein A2 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=HMRA2 PE=3 SV=1 193 251 2.0E-07
sp|Q8MIB8|TF2LX_PONPY Homeobox protein TGIF2LX OS=Pongo pygmaeus GN=TGIF2LX PE=2 SV=1 202 249 2.0E-07
sp|P0CY08|MTAL2_YEAST Mating-type protein ALPHA2 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=MATALPHA2 PE=1 SV=1 193 251 2.0E-07
sp|P0CY09|HMAL2_YEAST Silenced mating-type protein ALPHA2 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=HMLALPHA2 PE=3 SV=1 193 251 2.0E-07
sp|Q8MIC2|TF2LX_PAPHA Homeobox protein TGIF2LX OS=Papio hamadryas GN=TGIF2LX PE=2 SV=1 202 249 2.0E-07
sp|Q6NVN3|IRX3_XENTR Iroquois-class homeodomain protein irx-3 OS=Xenopus tropicalis GN=irx3 PE=2 SV=1 200 268 2.0E-07
sp|Q9SJJ3|BLH8_ARATH BEL1-like homeodomain protein 8 OS=Arabidopsis thaliana GN=BLH8 PE=1 SV=1 203 249 3.0E-07
sp|O42261|IRX3_XENLA Iroquois-class homeodomain protein irx-3 OS=Xenopus laevis GN=irx3 PE=2 SV=2 200 268 3.0E-07
sp|P78415|IRX3_HUMAN Iroquois-class homeodomain protein IRX-3 OS=Homo sapiens GN=IRX3 PE=2 SV=3 200 268 4.0E-07
sp|P54269|CAUP_DROME Homeobox protein caupolican OS=Drosophila melanogaster GN=caup PE=2 SV=2 200 251 4.0E-07
sp|Q7PMT1|EXD_ANOGA Homeobox protein extradenticle OS=Anopheles gambiae GN=exd PE=3 SV=2 180 251 6.0E-07
sp|P48731|ATH1_ARATH Homeobox protein ATH1 OS=Arabidopsis thaliana GN=ATH1 PE=1 SV=1 202 249 6.0E-07
sp|Q9BYU1|PBX4_HUMAN Pre-B-cell leukemia transcription factor 4 OS=Homo sapiens GN=PBX4 PE=1 SV=2 199 251 7.0E-07
sp|Q29CT2|EXD_DROPS Homeobox protein extradenticle OS=Drosophila pseudoobscura pseudoobscura GN=exd PE=3 SV=1 180 251 7.0E-07
sp|P40427|EXD_DROME Homeobox protein extradenticle OS=Drosophila melanogaster GN=exd PE=1 SV=1 180 251 8.0E-07
sp|P40425|PBX2_HUMAN Pre-B-cell leukemia transcription factor 2 OS=Homo sapiens GN=PBX2 PE=1 SV=2 180 251 1.0E-06
sp|P81067|IRX3_MOUSE Iroquois-class homeodomain protein IRX-3 OS=Mus musculus GN=Irx3 PE=1 SV=2 200 264 1.0E-06
sp|Q9QY61|IRX4_MOUSE Iroquois-class homeodomain protein IRX-4 OS=Mus musculus GN=Irx4 PE=2 SV=1 200 256 1.0E-06
sp|O35984|PBX2_MOUSE Pre-B-cell leukemia transcription factor 2 OS=Mus musculus GN=Pbx2 PE=1 SV=1 180 251 1.0E-06
sp|Q24248|ARA_DROME Homeobox protein araucan OS=Drosophila melanogaster GN=ara PE=1 SV=2 200 251 1.0E-06
sp|P41778|PBX1_MOUSE Pre-B-cell leukemia transcription factor 1 OS=Mus musculus GN=Pbx1 PE=1 SV=2 180 251 1.0E-06
sp|P40424|PBX1_HUMAN Pre-B-cell leukemia transcription factor 1 OS=Homo sapiens GN=PBX1 PE=1 SV=1 180 251 1.0E-06
sp|P81068|IRX1_MOUSE Iroquois-class homeodomain protein IRX-1 OS=Mus musculus GN=Irx1 PE=1 SV=4 200 251 1.0E-06
sp|P78414|IRX1_HUMAN Iroquois-class homeodomain protein IRX-1 OS=Homo sapiens GN=IRX1 PE=2 SV=3 200 251 1.0E-06
sp|P78413|IRX4_HUMAN Iroquois-class homeodomain protein IRX-4 OS=Homo sapiens GN=IRX4 PE=1 SV=2 200 257 1.0E-06
sp|O70477|PKNX1_MOUSE Homeobox protein PKNOX1 OS=Mus musculus GN=Pknox1 PE=1 SV=3 201 289 1.0E-06
sp|Q9YGS0|IRX4_CHICK Iroquois-class homeodomain protein IRX-4 OS=Gallus gallus GN=IRX4 PE=2 SV=1 200 257 1.0E-06
sp|B7ZRT8|IRX4B_XENLA Iroquois-class homeodomain protein irx-4-B OS=Xenopus laevis GN=irx4-b PE=2 SV=1 200 257 2.0E-06
sp|Q9ER75|IRX6_MOUSE Iroquois-class homeodomain protein IRX-6 OS=Mus musculus GN=Irx6 PE=2 SV=2 200 251 2.0E-06
sp|Q2TAQ8|IRX1B_XENLA Iroquois-class homeodomain protein irx-1-B OS=Xenopus laevis GN=irx1-b PE=2 SV=1 200 251 2.0E-06
sp|Q90XW6|IRX4A_XENLA Iroquois-class homeodomain protein irx-4-A OS=Xenopus laevis GN=irx4-a PE=2 SV=1 200 257 2.0E-06
sp|Q2HJ84|PKNX1_BOVIN Homeobox protein PKNOX1 OS=Bos taurus GN=PKNOX1 PE=2 SV=1 201 249 2.0E-06
sp|Q688D0|IRX4_XENTR Iroquois-class homeodomain protein irx-4 OS=Xenopus tropicalis GN=irx4 PE=2 SV=1 200 257 2.0E-06
sp|Q6F2E3|IRX1_XENTR Iroquois-class homeodomain protein irx-1 OS=Xenopus tropicalis GN=irx1 PE=2 SV=1 200 251 2.0E-06
sp|Q5R6L1|PKNX2_PONAB Homeobox protein PKNOX2 OS=Pongo abelii GN=PKNOX2 PE=2 SV=1 199 249 2.0E-06
sp|P55347|PKNX1_HUMAN Homeobox protein PKNOX1 OS=Homo sapiens GN=PKNOX1 PE=1 SV=3 201 249 2.0E-06
sp|P78412|IRX6_HUMAN Iroquois-class homeodomain protein IRX-6 OS=Homo sapiens GN=IRX6 PE=2 SV=3 200 253 2.0E-06
sp|P41779|HM20_CAEEL Homeobox protein ceh-20 OS=Caenorhabditis elegans GN=ceh-20 PE=1 SV=1 180 251 2.0E-06
sp|Q9YGK8|IRX1A_XENLA Iroquois-class homeodomain protein irx-1-A OS=Xenopus laevis GN=irx1-a PE=2 SV=1 200 251 3.0E-06
sp|B3DM47|PBX2_XENTR Pre-B-cell leukemia transcription factor 2 OS=Xenopus tropicalis GN=pbx2 PE=2 SV=1 180 251 3.0E-06
sp|Q96KN3|PKNX2_HUMAN Homeobox protein PKNOX2 OS=Homo sapiens GN=PKNOX2 PE=2 SV=2 199 249 3.0E-06
sp|Q8BG99|PKNX2_MOUSE Homeobox protein PKNOX2 OS=Mus musculus GN=Pknox2 PE=2 SV=1 199 249 3.0E-06
sp|Q99NE9|PBX4_MOUSE Pre-B-cell leukemia transcription factor 4 OS=Mus musculus GN=Pbx4 PE=2 SV=2 199 251 3.0E-06
sp|O35317|PBX3_MOUSE Pre-B-cell leukemia transcription factor 3 OS=Mus musculus GN=Pbx3 PE=2 SV=1 180 251 5.0E-06
sp|P40426|PBX3_HUMAN Pre-B-cell leukemia transcription factor 3 OS=Homo sapiens GN=PBX3 PE=1 SV=1 180 251 6.0E-06
sp|P81066|IRX2_MOUSE Iroquois-class homeodomain protein IRX-2 OS=Mus musculus GN=Irx2 PE=2 SV=2 200 253 7.0E-06
sp|P56659|KNOX1_MAIZE Homeobox protein knotted-1-like 1 (Fragment) OS=Zea mays GN=KNOX1 PE=2 SV=1 202 249 7.0E-06
sp|Q8QGC4|PBX1_XENLA Pre-B-cell leukemia transcription factor 1 OS=Xenopus laevis GN=pbx1 PE=1 SV=1 180 251 7.0E-06
sp|Q9BZI1|IRX2_HUMAN Iroquois-class homeodomain protein IRX-2 OS=Homo sapiens GN=IRX2 PE=1 SV=2 200 253 8.0E-06
sp|P48001|KNAT4_ARATH Homeobox protein knotted-1-like 4 OS=Arabidopsis thaliana GN=KNAT4 PE=1 SV=3 202 259 8.0E-06
sp|B4F6V6|PBX1_XENTR Pre-B-cell leukemia transcription factor 1 OS=Xenopus tropicalis GN=pbx1 PE=2 SV=1 180 251 8.0E-06
sp|Q90XW5|IRX5_XENLA Iroquois-class homeodomain protein irx-5 OS=Xenopus laevis GN=irx5 PE=2 SV=1 200 251 9.0E-06
sp|P78411|IRX5_HUMAN Iroquois-class homeodomain protein IRX-5 OS=Homo sapiens GN=IRX5 PE=1 SV=3 200 253 9.0E-06
sp|Q9JKQ4|IRX5_MOUSE Iroquois-class homeodomain protein IRX-5 OS=Mus musculus GN=Irx5 PE=2 SV=1 200 249 1.0E-05
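Each row above combines a UniProt-style header (`sp|accession|entry name`, description, then `OS=`, optional `GN=`, `PE=` and `SV=` fields) with the Start, End and E-value columns. The following sketch shows one way to split such a row into fields. The pattern and the `parse_hit` function are assumptions made for illustration, not the site's own parser, and the pattern is written only for the row layout shown on this page.

```python
import re
from typing import Optional

# Hypothetical pattern for one Swissprot hit row as laid out on this page.
HIT_PATTERN = re.compile(
    r"^sp\|(?P<accession>[^|]+)\|(?P<entry>\S+)\s+"
    r"(?P<description>.+?)\s+OS=(?P<organism>.+?)"
    r"(?:\s+GN=(?P<gene>\S+))?"          # GN= is absent on some rows
    r"\s+PE=(?P<pe>\d+)\s+SV=(?P<sv>\d+)"
    r"\s+(?P<start>\d+)\s+(?P<end>\d+)\s+(?P<evalue>\S+)$"
)


def parse_hit(row: str) -> Optional[dict]:
    """Split a hit row into named fields, or return None if it does not match."""
    match = HIT_PATTERN.match(row.strip())
    if match is None:
        return None
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    fields["evalue"] = float(fields["evalue"])
    return fields


# Two rows copied from the table: one with a GN= field, one without.
best_hit = parse_hit(
    "sp|Q86IH1|HBX4_DICDI Homeobox protein 4 OS=Dictyostelium discoideum "
    "GN=hbx4 PE=3 SV=1 177 249 1.0E-14"
)
no_gene_hit = parse_hit(
    "sp|Q90655|AKR_CHICK Homeobox protein AKR OS=Gallus gallus "
    "PE=2 SV=1 201 249 2.0E-08"
)

print(best_hit["accession"], best_hit["organism"], best_hit["evalue"])
print(no_gene_hit["accession"], no_gene_hit["gene"])
```

Keeping `GN=` optional matters here because at least one row (AKR_CHICK) has no gene name field.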

GO

GO Term Description Terminal node
GO:0003677 DNA binding Yes
GO:0006355 regulation of transcription, DNA-templated Yes
GO:0010468 regulation of gene expression No
GO:0060255 regulation of macromolecule metabolic process No
GO:0003674 molecular_function No
GO:0010556 regulation of macromolecule biosynthetic process No
GO:0009889 regulation of biosynthetic process No
GO:0051252 regulation of RNA metabolic process No
GO:0008150 biological_process No
GO:0005488 binding No
GO:0050794 regulation of cellular process No
GO:0019222 regulation of metabolic process No
GO:0003676 nucleic acid binding No
GO:0031326 regulation of cellular biosynthetic process No
GO:0097159 organic cyclic compound binding No
GO:0065007 biological regulation No
GO:0031323 regulation of cellular metabolic process No
GO:2000112 regulation of cellular macromolecule biosynthetic process No
GO:0080090 regulation of primary metabolic process No
GO:0019219 regulation of nucleobase-containing compound metabolic process No
GO:1903506 regulation of nucleic acid-templated transcription No
GO:0050789 regulation of biological process No
GO:0051171 regulation of nitrogen compound metabolic process No
GO:2001141 regulation of RNA biosynthetic process No
GO:1901363 heterocyclic compound binding No

SignalP

SignalP signal predicted Location (based on Ymax) D score (significance: > 0.45)
No 1 - 34 0.45
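The table reports a D score of 0.45 against a significance criterion of "> 0.45" and a prediction of "No", which is consistent with a strict comparison. A minimal sketch of that rule follows. The function name and default cutoff are assumptions taken from the table heading, and the displayed score may be rounded, so the underlying value could differ slightly.

```python
def signal_peptide_predicted(d_score: float, cutoff: float = 0.45) -> bool:
    """Apply the strict '> cutoff' rule shown in the table heading."""
    return d_score > cutoff


# D score as displayed for this protein.
prediction = signal_peptide_predicted(0.45)
print("Yes" if prediction else "No")
```

A score exactly at the cutoff does not pass a strict comparison, which matches the "No" shown for this protein.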

Transmembrane Domains

(None)

Transcription Factor Class

Transcription Factor Class (based on PFAM domains)
Homeodomain
Centromere protein B, DNA-binding region

Expression data

No expression data available for this genome

Sequences

Type of sequence Sequence
Locus Download genbank file of locus
The gene with 5 kb flanks (if sufficient flanking sequence is available). For use in cloning design programs. NOTE: features (genes or exons) that are only partially contained within the sequence are completely excluded.
Protein >Hirsu2|6983
MTSIDELDELLDWNQGASVLDDDAHLDQLNLLWSTPNQHDFSFPDLDNVAPDDSLFVADGDGIGSIDLPMDATPA
LDLLEHTDSPCLHCSLGGYSCKKIREGSHKGYCTSCVALGAECSFAGAALDPAQLSDLPPLSANPWDVVDEFTAA
VFHPNLLSETLQTDANVASSGPSASNPADDAPVILAPRPIPPPKVGARFSRESVRILKNWLSSHNRHPYPSDEEK
EALQRQTGLNKTQISNWLANARRRGKVQPPRSTSPHVGNWPGSIDIPKRRGTPALEAMNPLQRWEHSPPEHEAAS
VTAIARAVTASSAEASAFSSGLNSPYSFNYTDDGSSRSLCNQSSTSSLGTSLSSNGSRSSAYSHGSRKSWGSFGS
APFSSRGHRRRRRRGAPQNPGQAATNLTAPLKTFQCTFCTDTFRTKHDWQRHEKSLHLSLERWVCAPVGPRAVNP
ETGQLSCVYCGLADPDDNHLESHNHSACQERAPEERTFYRKDHLNQHLRLVHNGKFVDWSMKQWKEAAAQIRSRC
GFCGIIMDSWTNRVDHLAEHFKTGCSMADWKGDWGFDMPVLDMVENSIPPYLIHHERVSPLPYSATAKSPPESPR
NAYELIKMELAYFGANYWEQHHRAPTDEEMQLEGCRIIFASELLSLQGIATQKSWFRDLIMSSRKMAQKARFGPL
RGAAESRLATLKINGKDNLFEECPMEIQLHKFVKAKQLLGLTAMDDELQEEACKIVGRVEEVCTHPSESVANLLL
RLITGSTDWLADFRRRAHLPRSEDVQDRAIRSTDPKSIDSTIHSYSRLERELTDFLRLRRSEGVEPTDEDLQRQA
RIIIFEFDDGWNQTAADNIHWLEAFKQRHPPEKSSQSASPGRNDFSDLLRPRLNNRAQPMTAFLSNANCYSRLAK
ELRRWARSTMSPNNPKSHVPTDEELQYQARWILYDDDDPWNQTAADNAEWLEQFKRDVGINKEPGDVPERPSGHD
WALEQIATGGLLEADEGGLASVFCSRRLERGLAELVERSVRDGGRFPSDEAIRAEARDITKSSVTAADDVVLLEK
FKAWMRKKLPQALPAADMNTYTYTDAPSLLTTSMEFAISDEDLGNMLQDMEFSLDTT*
Coding >Hirsu2|6983
ATGACGTCCATCGACGAGCTGGACGAGTTGCTCGATTGGAACCAGGGCGCGTCCGTCCTCGACGACGACGCACAC
CTCGATCAGCTGAACCTCCTCTGGTCCACGCCCAACCAGCACGACTTTAGCTTTCCGGACCTCGACAATGTCGCC
CCGGACGACTCCCTCTTTGTCGCTGACGGCGACGGCATCGGCTCCATCGACCTGCCCATGGACGCCACCCCGGCC
CTCGACCTCCTGGAACACACCGACTCGCCCTGCCTTCACTGCAGCCTCGGCGGCTACTCTTGCAAGAAAATCCGC
GAGGGCAGCCACAAGGGCTACTGTACCAGCTGTGTCGCCCTCGGCGCCGAGTGCAGTTTCGCTGGCGCAGCTCTC
GACCCGGCCCAGCTCTCTGACCTGCCTCCCTTATCCGCCAACCCCTGGGACGTCGTAGACGAATTTACTGCCGCC
GTCTTCCACCCGAACCTCCTTTCTGAGACGCTTCAGACCGATGCCAATGTTGCTTCGTCCGGTCCGTCGGCTAGC
AACCCGGCCGATGACGCTCCCGTCATCCTGGCACCCAGGCCCATCCCGCCCCCCAAGGTCGGCGCCCGCTTCTCT
CGCGAGTCGGTCCGCATCCTCAAGAACTGGCTATCCTCCCATAACCGGCACCCCTACCCCAGTGACGAGGAAAAG
GAGGCCCTCCAGCGCCAGACCGGCCTGAACAAGACGCAGATCAGCAATTGGCTCGCCAATGCCCGTCGTCGGGGC
AAGGTCCAGCCGCCTCGTTCCACCTCGCCCCATGTCGGCAACTGGCCTGGCTCGATAGACATCCCCAAGAGACGT
GGCACTCCCGCCCTCGAGGCCATGAACCCGCTGCAGCGATGGGAGCACTCCCCGCCCGAGCACGAAGCCGCCTCC
GTCACCGCCATTGCCAGGGCCGTCACCGCCTCCTCCGCCGAGGCCTCGGCCTTCTCTTCGGGACTCAACAGCCCG
TACAGTTTCAACTACACCGACGACGGCTCCAGCAGATCGCTCTGCAACCAGTCTTCCACCAGCAGTCTGGGCACC
TCGCTCTCCAGCAACGGTTCCCGCAGCTCTGCCTACTCGCACGGCTCCCGCAAGTCCTGGGGCTCCTTTGGATCC
GCCCCCTTCAGCTCCAGGGGGCACCGCCGCCGCCGTCGGAGAGGCGCGCCCCAGAACCCCGGCCAAGCCGCCACC
AACCTGACCGCGCCCCTCAAGACGTTCCAGTGCACCTTCTGCACTGACACCTTCAGGACCAAGCACGACTGGCAG
CGCCACGAGAAGTCTCTGCACCTGTCTCTCGAGAGATGGGTCTGCGCCCCCGTGGGGCCTCGAGCCGTCAACCCG
GAGACCGGCCAGCTGTCGTGCGTCTACTGTGGCCTCGCCGATCCGGACGACAACCATCTGGAGAGCCACAACCAC
TCGGCCTGCCAGGAGCGGGCCCCGGAGGAGAGGACCTTCTATCGTAAGGATCATCTCAACCAGCATCTCCGACTC
GTCCACAACGGCAAGTTCGTCGACTGGTCCATGAAGCAGTGGAAAGAGGCCGCAGCACAGATCCGGTCCCGCTGC
GGCTTCTGCGGCATCATCATGGACAGCTGGACCAACCGAGTGGATCACCTGGCCGAGCATTTCAAGACCGGCTGC
TCCATGGCCGACTGGAAGGGCGACTGGGGCTTCGACATGCCTGTCCTCGACATGGTCGAGAATTCCATCCCCCCG
TATCTCATCCACCACGAACGCGTCTCCCCCCTTCCGTACAGCGCCACCGCCAAGTCGCCGCCCGAGTCCCCCCGA
AACGCCTACGAGCTGATCAAGATGGAGCTCGCCTACTTCGGCGCCAACTACTGGGAGCAACACCACCGGGCCCCG
ACAGACGAGGAGATGCAGCTGGAAGGCTGCCGTATCATCTTCGCCTCCGAGCTGCTGTCGCTGCAGGGCATCGCG
ACCCAAAAGTCCTGGTTTCGTGACCTGATCATGAGCTCCAGGAAGATGGCACAAAAGGCCCGATTCGGCCCGTTG
CGAGGCGCCGCCGAAAGCAGGCTCGCGACGCTCAAGATCAACGGCAAGGACAACTTGTTTGAGGAGTGCCCCATG
GAGATTCAGTTGCACAAGTTTGTCAAGGCCAAGCAGCTCCTGGGCCTGACAGCCATGGACGACGAGCTGCAGGAA
GAGGCGTGCAAGATCGTTGGCCGGGTCGAGGAGGTCTGCACCCACCCTTCCGAGTCCGTCGCCAACCTGTTGCTG
CGGCTCATCACGGGGTCGACGGACTGGCTGGCCGACTTCCGCCGACGTGCCCATCTGCCGCGCTCCGAAGACGTC
CAAGACAGAGCCATTCGGTCCACCGACCCCAAGTCGATCGACTCCACCATCCACAGCTACTCGCGCCTCGAGCGT
GAGCTGACCGACTTCTTGAGGCTGCGGAGGTCCGAGGGCGTCGAGCCGACGGACGAGGACCTCCAGAGGCAAGCC
CGTATCATCATCTTCGAGTTCGACGACGGCTGGAACCAGACCGCCGCCGACAACATCCACTGGCTCGAGGCCTTC
AAGCAGCGCCACCCCCCCGAGAAGAGTTCCCAGAGCGCCTCGCCCGGCCGCAACGACTTTTCAGACCTGCTGAGG
CCCCGCCTGAACAACCGAGCGCAGCCCATGACGGCGTTTCTGTCCAACGCCAACTGCTACAGCAGGCTAGCCAAG
GAGCTGCGGCGCTGGGCCAGGTCGACCATGTCGCCTAACAACCCCAAGTCGCATGTGCCAACGGACGAGGAGCTG
CAGTATCAGGCCAGGTGGATCCTGTACGACGATGACGACCCGTGGAACCAGACAGCGGCAGACAATGCCGAATGG
CTGGAACAGTTCAAGCGCGACGTCGGCATAAACAAGGAGCCCGGGGACGTGCCCGAGCGCCCATCCGGTCACGAC
TGGGCACTTGAGCAGATCGCAACCGGAGGTCTCCTGGAGGCGGACGAGGGCGGCTTGGCAAGCGTCTTCTGCTCG
CGCCGGCTGGAGAGGGGCCTGGCCGAGCTGGTGGAGAGGAGCGTCCGTGACGGCGGCCGGTTCCCTTCAGACGAG
GCGATACGAGCCGAGGCTCGGGATATCACGAAGAGTTCCGTGACCGCGGCGGACGACGTGGTCTTGCTCGAAAAG
TTCAAGGCGTGGATGCGCAAGAAGTTACCGCAGGCCTTGCCGGCCGCGGACATGAACACTTACACGTACACCGAT
GCGCCCTCCCTGCTGACCACGAGTATGGAATTCGCCATTTCGGATGAGGACTTGGGGAACATGCTACAGGACATG
GAATTCAGCCTCGATACCACGTAA
Transcript >Hirsu2|6983
ATGACGTCCATCGACGAGCTGGACGAGTTGCTCGATTGGAACCAGGGCGCGTCCGTCCTCGACGACGACGCACAC
CTCGATCAGCTGAACCTCCTCTGGTCCACGCCCAACCAGCACGACTTTAGCTTTCCGGACCTCGACAATGTCGCC
CCGGACGACTCCCTCTTTGTCGCTGACGGCGACGGCATCGGCTCCATCGACCTGCCCATGGACGCCACCCCGGCC
CTCGACCTCCTGGAACACACCGACTCGCCCTGCCTTCACTGCAGCCTCGGCGGCTACTCTTGCAAGAAAATCCGC
GAGGGCAGCCACAAGGGCTACTGTACCAGCTGTGTCGCCCTCGGCGCCGAGTGCAGTTTCGCTGGCGCAGCTCTC
GACCCGGCCCAGCTCTCTGACCTGCCTCCCTTATCCGCCAACCCCTGGGACGTCGTAGACGAATTTACTGCCGCC
GTCTTCCACCCGAACCTCCTTTCTGAGACGCTTCAGACCGATGCCAATGTTGCTTCGTCCGGTCCGTCGGCTAGC
AACCCGGCCGATGACGCTCCCGTCATCCTGGCACCCAGGCCCATCCCGCCCCCCAAGGTCGGCGCCCGCTTCTCT
CGCGAGTCGGTCCGCATCCTCAAGAACTGGCTATCCTCCCATAACCGGCACCCCTACCCCAGTGACGAGGAAAAG
GAGGCCCTCCAGCGCCAGACCGGCCTGAACAAGACGCAGATCAGCAATTGGCTCGCCAATGCCCGTCGTCGGGGC
AAGGTCCAGCCGCCTCGTTCCACCTCGCCCCATGTCGGCAACTGGCCTGGCTCGATAGACATCCCCAAGAGACGT
GGCACTCCCGCCCTCGAGGCCATGAACCCGCTGCAGCGATGGGAGCACTCCCCGCCCGAGCACGAAGCCGCCTCC
GTCACCGCCATTGCCAGGGCCGTCACCGCCTCCTCCGCCGAGGCCTCGGCCTTCTCTTCGGGACTCAACAGCCCG
TACAGTTTCAACTACACCGACGACGGCTCCAGCAGATCGCTCTGCAACCAGTCTTCCACCAGCAGTCTGGGCACC
TCGCTCTCCAGCAACGGTTCCCGCAGCTCTGCCTACTCGCACGGCTCCCGCAAGTCCTGGGGCTCCTTTGGATCC
GCCCCCTTCAGCTCCAGGGGGCACCGCCGCCGCCGTCGGAGAGGCGCGCCCCAGAACCCCGGCCAAGCCGCCACC
AACCTGACCGCGCCCCTCAAGACGTTCCAGTGCACCTTCTGCACTGACACCTTCAGGACCAAGCACGACTGGCAG
CGCCACGAGAAGTCTCTGCACCTGTCTCTCGAGAGATGGGTCTGCGCCCCCGTGGGGCCTCGAGCCGTCAACCCG
GAGACCGGCCAGCTGTCGTGCGTCTACTGTGGCCTCGCCGATCCGGACGACAACCATCTGGAGAGCCACAACCAC
TCGGCCTGCCAGGAGCGGGCCCCGGAGGAGAGGACCTTCTATCGTAAGGATCATCTCAACCAGCATCTCCGACTC
GTCCACAACGGCAAGTTCGTCGACTGGTCCATGAAGCAGTGGAAAGAGGCCGCAGCACAGATCCGGTCCCGCTGC
GGCTTCTGCGGCATCATCATGGACAGCTGGACCAACCGAGTGGATCACCTGGCCGAGCATTTCAAGACCGGCTGC
TCCATGGCCGACTGGAAGGGCGACTGGGGCTTCGACATGCCTGTCCTCGACATGGTCGAGAATTCCATCCCCCCG
TATCTCATCCACCACGAACGCGTCTCCCCCCTTCCGTACAGCGCCACCGCCAAGTCGCCGCCCGAGTCCCCCCGA
AACGCCTACGAGCTGATCAAGATGGAGCTCGCCTACTTCGGCGCCAACTACTGGGAGCAACACCACCGGGCCCCG
ACAGACGAGGAGATGCAGCTGGAAGGCTGCCGTATCATCTTCGCCTCCGAGCTGCTGTCGCTGCAGGGCATCGCG
ACCCAAAAGTCCTGGTTTCGTGACCTGATCATGAGCTCCAGGAAGATGGCACAAAAGGCCCGATTCGGCCCGTTG
CGAGGCGCCGCCGAAAGCAGGCTCGCGACGCTCAAGATCAACGGCAAGGACAACTTGTTTGAGGAGTGCCCCATG
GAGATTCAGTTGCACAAGTTTGTCAAGGCCAAGCAGCTCCTGGGCCTGACAGCCATGGACGACGAGCTGCAGGAA
GAGGCGTGCAAGATCGTTGGCCGGGTCGAGGAGGTCTGCACCCACCCTTCCGAGTCCGTCGCCAACCTGTTGCTG
CGGCTCATCACGGGGTCGACGGACTGGCTGGCCGACTTCCGCCGACGTGCCCATCTGCCGCGCTCCGAAGACGTC
CAAGACAGAGCCATTCGGTCCACCGACCCCAAGTCGATCGACTCCACCATCCACAGCTACTCGCGCCTCGAGCGT
GAGCTGACCGACTTCTTGAGGCTGCGGAGGTCCGAGGGCGTCGAGCCGACGGACGAGGACCTCCAGAGGCAAGCC
CGTATCATCATCTTCGAGTTCGACGACGGCTGGAACCAGACCGCCGCCGACAACATCCACTGGCTCGAGGCCTTC
AAGCAGCGCCACCCCCCCGAGAAGAGTTCCCAGAGCGCCTCGCCCGGCCGCAACGACTTTTCAGACCTGCTGAGG
CCCCGCCTGAACAACCGAGCGCAGCCCATGACGGCGTTTCTGTCCAACGCCAACTGCTACAGCAGGCTAGCCAAG
GAGCTGCGGCGCTGGGCCAGGTCGACCATGTCGCCTAACAACCCCAAGTCGCATGTGCCAACGGACGAGGAGCTG
CAGTATCAGGCCAGGTGGATCCTGTACGACGATGACGACCCGTGGAACCAGACAGCGGCAGACAATGCCGAATGG
CTGGAACAGTTCAAGCGCGACGTCGGCATAAACAAGGAGCCCGGGGACGTGCCCGAGCGCCCATCCGGTCACGAC
TGGGCACTTGAGCAGATCGCAACCGGAGGTCTCCTGGAGGCGGACGAGGGCGGCTTGGCAAGCGTCTTCTGCTCG
CGCCGGCTGGAGAGGGGCCTGGCCGAGCTGGTGGAGAGGAGCGTCCGTGACGGCGGCCGGTTCCCTTCAGACGAG
GCGATACGAGCCGAGGCTCGGGATATCACGAAGAGTTCCGTGACCGCGGCGGACGACGTGGTCTTGCTCGAAAAG
TTCAAGGCGTGGATGCGCAAGAAGTTACCGCAGGCCTTGCCGGCCGCGGACATGAACACTTACACGTACACCGAT
GCGCCCTCCCTGCTGACCACGAGTATGGAATTCGCCATTTCGGATGAGGACTTGGGGAACATGCTACAGGACATG
GAATTCAGCCTCGATACCACGTAA
Gene >Hirsu2|6983
ATGACGTCCATCGACGAGCTGGACGAGTTGCTCGATTGGAACCAGGGCGCGTCCGTCCTCGACGACGACGCACAC
CTCGATCAGCTGAACCTCCTCTGGTCCACGCCCAACCAGCACGACTTTAGCTTTCCGGACCTCGACAATGTCGCC
CCGGACGACTCCCTCTTTGTCGCTGACGGCGACGGCATCGGCTCCATCGACCTGCCCATGGACGCCACCCCGGCC
CTCGACCTCCTGGAACACACCGACTCGCCCTGCCTTCACTGCAGCCTCGGCGGCTACTCTTGCAAGAAAATCCGC
GAGGGCAGCCACAAGGGCTACTGTACCAGCTGTGTCGCCCTCGGCGCCGAGTGCAGTTTCGCTGGCGCAGCTCTC
GACCCGGCCCAGCTCTCTGACCTGCCTCCCTTATCCGCCAACCCCTGGGACGTCGTAGACGAATTTACTGCCGCC
GTCTTCCACCCGAACCTCCTTTCTGAGACGCTTCAGACCGATGCCAATGTTGCTTCGTCCGGTCCGTCGGCTAGC
AACCCGGCCGATGACGCTCCCGTCATCCTGGCACCCAGGCCCATCCCGCCCCCCAAGGTCGGCGCCCGCTTCTCT
CGCGAGTCGGTCCGCATCCTCAAGAACTGGCTATCCTCCCATAACCGGCACCCCTACCCCAGTGACGAGGAAAAG
GAGGCCCTCCAGCGCCAGACCGGCCTGAACAAGACGCAGATCAGCAATTGGCTCGCCAATGCCCGTCGTCGGGGC
AAGGTCCAGCCGCCTCGTTCCACCTCGCCCCATGTCGGCAACTGGCCTGGCTCGATAGACATCCCCAAGAGACGT
GGCACTCCCGCCCTCGAGGCCATGAACCCGCTGCAGCGATGGGAGCACTCCCCGCCCGAGCACGAAGCCGCCTCC
GTCACCGCCATTGCCAGGGCCGTCACCGCCTCCTCCGCCGAGGCCTCGGCCTTCTCTTCGGGACTCAACAGCCCG
TACAGTTTCAACTACACCGACGACGGCTCCAGCAGATCGCTCTGCAACCAGTCTTCCACCAGCAGTCTGGGCACC
TCGCTCTCCAGCAACGGTTCCCGCAGCTCTGCCTACTCGCACGGCTCCCGCAAGTCCTGGGGCTCCTTTGGATCC
GCCCCCTTCAGCTCCAGGGGGCACCGCCGCCGCCGTCGGAGAGGCGCGCCCCAGAACCCCGGCCAAGCCGCCACC
AACCTGACCGCGCCCCTCAAGACGTTCCAGTGCACCTTCTGCACTGACACCTTCAGGACCAAGCACGACTGGCAG
CGCCACGAGAAGTCTCTGCACCTGTCTCTCGAGAGATGGGTCTGCGCCCCCGTGGGGCCTCGAGCCGTCAACCCG
GAGACCGGCCAGCTGTCGTGCGTCTACTGTGGCCTCGCCGATCCGGACGACAACCATCTGGAGAGCCACAACCAC
TCGGCCTGCCAGGAGCGGGCCCCGGAGGAGAGGACCTTCTATCGTAAGGATCATCTCAACCAGCATCTCCGACTC
GTCCACAACGGCAAGTTCGTCGACTGGTCCATGAAGCAGTGGAAAGAGGCCGCAGCACAGATCCGGTCCCGCTGC
GGCTTCTGCGGCATCATCATGGACAGCTGGACCAACCGAGTGGATCACCTGGCCGAGCATTTCAAGACCGGCTGC
TCCATGGCCGACTGGAAGGGCGACTGGGGCTTCGACATGCCTGTCCTCGACATGGTCGAGAATTCCATCCCCCCG
TGTACGTGATCCGCACATTGGCTTTCGCAGAATCCGCCATCGGCCTCGCCTCTGACCCGTCGAAACAGATCTCAT
CCACCACGAACGCGTCTCCCCCCTTCCGTACAGCGCCACCGCCAAGTCGCCGCCCGAGTCCCCCCGAAACGCCTA
CGAGCTGATCAAGATGGAGCTCGCCTACTTCGGCGCCAACTACTGGGAGCAACACCACCGGGCCCCGACAGACGA
GGAGATGCAGCTGGAAGGCTGCCGTATCATCTTCGCCTCCGAGCTGCTGTCGCTGCAGGGCATCGCGACCCAAAA
GTCCTGGTTTCGTGACCTGATCATGAGCTCCAGGAAGATGGCACAAAAGGCCCGATTCGGCCCGTTGCGAGGCGC
CGCCGAAAGCAGGCTCGCGACGCTCAAGATCAACGGCAAGGACAACTTGTTTGAGGAGTGCCCCATGGAGATTCA
GTTGCACAAGTTTGTCAAGGCCAAGCAGCTCCTGGGCCTGACAGCCATGGACGACGAGCTGCAGGAAGAGGCGTG
CAAGATCGTTGGCCGGGTCGAGGAGGTCTGCACCCACCCTTCCGAGTCCGTCGCCAACCTGTTGCTGCGGCTCAT
CACGGGGTCGACGGACTGGCTGGCCGACTTCCGCCGACGTGCCCATCTGCCGCGCTCCGAAGACGTCCAAGACAG
AGCCATTCGGTCCACCGACCCCAAGTCGATCGACTCCACCATCCACAGCTACTCGCGCCTCGAGCGTGAGCTGAC
CGACTTCTTGAGGCTGCGGAGGTCCGAGGGCGTCGAGCCGACGGACGAGGACCTCCAGAGGCAAGCCCGTATCAT
CATCTTCGAGTTCGACGACGGCTGGAACCAGACCGCCGCCGACAACATCCACTGGCTCGAGGCCTTCAAGCAGCG
CCACCCCCCCGAGAAGAGTTCCCAGAGCGCCTCGCCCGGCCGCAACGACTTTTCAGACCTGCTGAGGCCCCGCCT
GAACAACCGAGCGCAGCCCATGACGGCGTTTCTGTCCAACGCCAACTGCTACAGCAGGCTAGCCAAGGAGCTGCG
GCGCTGGGCCAGGTCGACCATGTCGCCTAACAACCCCAAGTCGCATGTGCCAACGGACGAGGAGCTGCAGTATCA
GGCCAGGTGGATCCTGTACGACGAGTGGGTGCCCGCAACAAACCGGCTTCCCCGACCGTGGATGGGCTGACGCAT
ATCTATCTACAGTGACGACCCGTGGAACCAGACAGCGGCAGACAATGCCGAATGGCTGGAACAGTTCAAGCGCGA
CGTCGGCATAAACAAGGAGCCCGGGGACGTGCCCGAGCGCCCATCCGGTCACGACTGGGCACTTGAGCAGATCGC
AACCGGAGGTCTCCTGGAGGCGGACGAGGGCGGCTTGGCAAGCGTCTTCTGCTCGCGCCGGCTGGAGAGGGGCCT
GGCCGAGCTGGTGGAGAGGAGCGTCCGTGACGGCGGCCGGTTCCCTTCAGACGAGGCGATACGAGCCGAGGCTCG
GGATATCACGAAGAGTTCCGTGACCGCGGCGGACGACGTGGTCTTGCTCGAAAAGTTCAAGGCGTGGATGCGCAA
GAAGTTACCGCAGGCCTTGCCGGCCGCGGACATGAACACTTACACGTACACCGATGCGCCCTCCCTGCTGACCAC
GAGTATGGAATTCGCCATTTCGGATGAGGACTTGGGGAACATGCTACAGGACATGGAATTCAGCCTCGATACCAC
GTAA

© 2022 - Robin Ohm - Utrecht University - The Netherlands
