Fungal Genomics

at Utrecht University

General Properties

Protein ID Hirsu2|2459
Gene name (None)
Location Contig_159:5233..8615
Strand -
Gene length (bp) 3382
Transcript length (bp) 3315
Coding sequence length (bp) 3315
Protein length (aa) 1105
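
The lengths in this table can be cross-checked against each other. The following is a minimal sketch and not part of the portal: the helper name `parse_location` is hypothetical, and reading the location as `contig:start..end` is an assumption based on how the field is printed.

```python
import re

# Values copied from the General Properties table.
location = "Contig_159:5233..8615"
gene_length_bp = 3382
cds_length_bp = 3315
protein_length_aa = 1105


def parse_location(text):
    """Split 'contig:start..end' into (contig, start, end). Hypothetical helper."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)


contig, start, end = parse_location(location)

# Difference between the two coordinates (not an inclusive count).
coordinate_span = end - start

# A coding sequence should be a whole number of codons.
codon_count, remainder = divmod(cds_length_bp, 3)

# Bases present in the gene but absent from the coding sequence.
non_coding_bp = gene_length_bp - cds_length_bp

print(contig, coordinate_span, codon_count, remainder, non_coding_bp)
```

Under these assumptions, `end - start` equals the listed gene length, the coding sequence divides evenly into codons, and the codon count equals the listed protein length, which suggests that the terminal stop (`*`) is included in that figure. The gene is 67 bp longer than the coding sequence.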

PFAM Domains

PFAM Domain ID Short name Long name E-value Start End
PF20428 Sey1_3HB Sey1 three-helix bundle domain 6.0E-144 606 1045
PF01585 G-patch G-patch domain 4.1E-13 228 270
PF05879 RHD3_GTPase Root hair defective 3 GTP-binding protein (RHD3) GTPase domain 4.6E-13 492 559
PF12457 TIP_N Tuftelin interacting protein N terminal 1.7E-10 39 115
PF12656 G-patch_2 G-patch domain 2.0E-06 229 272
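
The domain rows are listed by E-value rather than by position. As an illustrative sketch (the tuple layout and the `overlaps` helper are assumptions, not portal code), the rows can be ordered along the protein and checked for overlapping predictions:

```python
from itertools import combinations

# (Pfam ID, short name, start, end) copied from the table above.
domains = [
    ("PF20428", "Sey1_3HB", 606, 1045),
    ("PF01585", "G-patch", 228, 270),
    ("PF05879", "RHD3_GTPase", 492, 559),
    ("PF12457", "TIP_N", 39, 115),
    ("PF12656", "G-patch_2", 229, 272),
]


def overlaps(a, b):
    """True if two closed intervals [start, end] share at least one residue."""
    return a[2] <= b[3] and b[2] <= a[3]


# Order the domains from N-terminus to C-terminus.
ordered = sorted(domains, key=lambda d: d[2])

# Collect every pair of domain names whose coordinates overlap.
overlapping = [
    (a[1], b[1]) for a, b in combinations(ordered, 2) if overlaps(a, b)
]

for pfam_id, name, start, end in ordered:
    print(f"{start:>5}-{end:<5} {pfam_id} {name}")
print("overlapping pairs:", overlapping)
```

With the values above, the only overlapping pair is G-patch and G-patch_2, two Pfam models that cover nearly the same residues.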

Swissprot hits

Swissprot ID Swissprot Description Start End E-value
sp|Q8TGG5|SEY1_ASPFU Protein sey1 OS=Neosartorya fumigata (strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100) GN=sey1 PE=3 SV=1 492 1101 0.0E+00
sp|B2B1M4|SEY1_PODAN Protein SEY1 OS=Podospora anserina (strain S / ATCC MYA-4624 / DSM 980 / FGSC 10383) GN=SEY1 PE=3 SV=1 490 1101 0.0E+00
sp|A1CPP3|SEY1_ASPCL Protein sey1 OS=Aspergillus clavatus (strain ATCC 1007 / CBS 513.65 / DSM 816 / NCTC 3887 / NRRL 1) GN=sey1 PE=3 SV=2 522 1104 0.0E+00
sp|A2QR20|SEY1_ASPNC Protein sey1 OS=Aspergillus niger (strain CBS 513.88 / FGSC A1513) GN=sey1 PE=3 SV=1 492 1102 0.0E+00
sp|B8MK20|SEY1_TALSN Protein sey1 OS=Talaromyces stipitatus (strain ATCC 10500 / CBS 375.48 / QM 6759 / NRRL 1006) GN=sey1 PE=3 SV=1 492 1102 0.0E+00
sp|Q1DL22|SEY1_COCIM Protein SEY1 OS=Coccidioides immitis (strain RS) GN=SEY1 PE=3 SV=1 495 1076 0.0E+00
sp|B6QIM3|SEY1_TALMQ Protein sey1 OS=Talaromyces marneffei (strain ATCC 18224 / CBS 334.59 / QM 7333) GN=sey1 PE=3 SV=2 492 1102 0.0E+00
sp|A1D2K1|SEY1_NEOFI Protein sey1 OS=Neosartorya fischeri (strain ATCC 1020 / DSM 3700 / FGSC A1164 / NRRL 181) GN=sey1 PE=3 SV=1 492 1101 0.0E+00
sp|B0XQZ0|SEY1_ASPFC Protein sey1 OS=Neosartorya fumigata (strain CEA10 / CBS 144.89 / FGSC A1163) GN=sey1 PE=3 SV=1 492 1101 0.0E+00
sp|Q5BEE5|SEY1_EMENI Protein sey1 OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) GN=sey1 PE=3 SV=1 492 1104 0.0E+00
sp|A6R1D5|SEY1_AJECN Protein SEY1 OS=Ajellomyces capsulatus (strain NAm1 / WU24) GN=SEY1 PE=3 SV=1 492 1098 0.0E+00
sp|C0NJ57|SEY1_AJECG Protein SEY1 OS=Ajellomyces capsulatus (strain G186AR / H82 / ATCC MYA-2454 / RMSCC 2432) GN=SEY1 PE=3 SV=1 492 1098 0.0E+00
sp|Q0D0W7|SEY1_ASPTN Protein sey1 OS=Aspergillus terreus (strain NIH 2624 / FGSC A1156) GN=sey1 PE=3 SV=1 492 1101 0.0E+00
sp|C4JQN4|SEY1_UNCRE Protein SEY1 OS=Uncinocarpus reesii (strain UAMH 1704) GN=SEY1 PE=3 SV=1 492 1067 0.0E+00
sp|B8NJL4|SEY1_ASPFN Protein sey1 OS=Aspergillus flavus (strain ATCC 200026 / FGSC A1120 / NRRL 3357 / JCM 12722 / SRRC 167) GN=sey1 PE=3 SV=1 492 1101 0.0E+00
sp|Q9P5X6|SEY1_NEUCR Protein sey1 OS=Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) GN=sey1 PE=3 SV=1 492 1102 0.0E+00
sp|Q2GUT7|SEY1_CHAGB Protein SEY1 OS=Chaetomium globosum (strain ATCC 6205 / CBS 148.51 / DSM 1962 / NBRC 6347 / NRRL 1970) GN=SEY1 PE=3 SV=2 492 1104 0.0E+00
sp|Q2UNK6|SEY1_ASPOR Protein sey1 OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40) GN=sey1 PE=3 SV=2 492 1101 0.0E+00
sp|Q525S7|SEY1_MAGO7 Protein SEY1 OS=Magnaporthe oryzae (strain 70-15 / ATCC MYA-4617 / FGSC 8958) GN=SEY1 PE=3 SV=1 472 1088 0.0E+00
sp|A7ERA6|SEY1_SCLS1 Protein sey1 OS=Sclerotinia sclerotiorum (strain ATCC 18683 / 1980 / Ss-1) GN=sey1 PE=3 SV=1 492 1104 0.0E+00
sp|A6S544|SEY1_BOTFB Protein sey1 OS=Botryotinia fuckeliana (strain B05.10) GN=sey1 PE=3 SV=1 472 1094 0.0E+00
sp|C5FJT2|SEY1_ARTOC Protein SEY1 OS=Arthroderma otae (strain ATCC MYA-4605 / CBS 113480) GN=SEY1 PE=3 SV=1 492 1092 0.0E+00
sp|B2W244|SEY1_PYRTR Protein sey1 OS=Pyrenophora tritici-repentis (strain Pt-1C-BFP) GN=sey1 PE=3 SV=2 492 1102 0.0E+00
sp|C1GWM2|SEY1_PARBA Protein SEY1 OS=Paracoccidioides lutzii (strain ATCC MYA-826 / Pb01) GN=SEY1 PE=3 SV=2 492 1089 0.0E+00
sp|C5K3E1|SEY1_AJEDS Protein SEY1 OS=Ajellomyces dermatitidis (strain SLH14081) GN=SEY1 PE=3 SV=1 492 1081 0.0E+00
sp|C5GMK3|SEY1_AJEDR Protein SEY1 OS=Ajellomyces dermatitidis (strain ER-3 / ATCC MYA-2586) GN=SEY1 PE=3 SV=1 492 1081 0.0E+00
sp|C1G4S9|SEY1_PARBD Protein SEY1 OS=Paracoccidioides brasiliensis (strain Pb18) GN=SEY1 PE=3 SV=1 492 1089 0.0E+00
sp|C0S6S4|SEY1_PARBP Protein SEY1 OS=Paracoccidioides brasiliensis (strain Pb03) GN=SEY1 PE=3 SV=1 492 1089 0.0E+00
sp|B6GX67|SEY1_PENRW Protein sey1 OS=Penicillium rubens (strain ATCC 28089 / DSM 1075 / NRRL 1951 / Wisconsin 54-1255) GN=sey1 PE=3 SV=1 522 1102 0.0E+00
sp|Q0V302|SEY1_PHANO Protein SEY1 OS=Phaeosphaeria nodorum (strain SN15 / ATCC MYA-4574 / FGSC 10173) GN=SEY1 PE=3 SV=2 492 1104 0.0E+00
sp|Q4PEQ0|SEY1_USTMA Protein SEY1 OS=Ustilago maydis (strain 521 / FGSC 9021) GN=SEY1 PE=3 SV=2 492 1055 1.0E-107
sp|A8N5E5|SEY1_COPC7 Protein SEY1 OS=Coprinopsis cinerea (strain Okayama-7 / 130 / ATCC MYA-4618 / FGSC 9003) GN=SEY1 PE=3 SV=1 493 1068 1.0E-103
sp|P0CQ47|SEY1_CRYNB Protein SEY1 OS=Cryptococcus neoformans var. neoformans serotype D (strain B-3501A) GN=SEY1 PE=3 SV=1 493 1077 2.0E-97
sp|P0CQ46|SEY1_CRYNJ Protein SEY1 OS=Cryptococcus neoformans var. neoformans serotype D (strain JEC21 / ATCC MYA-565) GN=SEY1 PE=3 SV=1 493 1077 2.0E-97
sp|B0D0N9|SEY1_LACBS Protein SEY1 OS=Laccaria bicolor (strain S238N-H82 / ATCC MYA-4686) GN=SEY1 PE=3 SV=1 495 1077 9.0E-96
sp|A8QAN4|SEY1_MALGO Protein SEY1 OS=Malassezia globosa (strain ATCC MYA-4612 / CBS 7966) GN=SEY1 PE=3 SV=1 522 1058 4.0E-88
sp|Q6BK59|SEY1_DEBHA Protein SEY1 OS=Debaryomyces hansenii (strain ATCC 36239 / CBS 767 / JCM 1990 / NBRC 0083 / IGC 2968) GN=SEY1 PE=3 SV=2 492 1089 2.0E-69
sp|C4Y2Z9|SEY1_CLAL4 Protein SEY1 OS=Clavispora lusitaniae (strain ATCC 42720) GN=SEY1 PE=3 SV=1 513 1086 7.0E-69
sp|B6K0N7|SEY1_SCHJY Protein sey1 OS=Schizosaccharomyces japonicus (strain yFS275 / FY16936) GN=sey1 PE=3 SV=1 525 1081 7.0E-65
sp|Q6C3B0|SEY1_YARLI Protein SEY1 OS=Yarrowia lipolytica (strain CLIB 122 / E 150) GN=SEY1 PE=3 SV=1 490 1086 1.0E-64
sp|A5DB26|SEY1_PICGU Protein SEY1 OS=Meyerozyma guilliermondii (strain ATCC 6260 / CBS 566 / DSM 6381 / JCM 1539 / NBRC 10279 / NRRL Y-324) GN=SEY1 PE=3 SV=3 492 1058 2.0E-64
sp|Q9UTE0|SEY1_SCHPO Protein sey1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=sey1 PE=1 SV=1 525 1050 9.0E-64
sp|C5DEL5|SEY1_LACTC Protein SEY1 OS=Lachancea thermotolerans (strain ATCC 56472 / CBS 6340 / NRRL Y-8284) GN=SEY1 PE=3 SV=1 492 1055 2.0E-61
sp|C4YS65|SEY1_CANAW Protein SEY1 OS=Candida albicans (strain WO-1) GN=SEY1 PE=3 SV=1 525 1066 4.0E-61
sp|A3LWM9|SEY1_PICST Protein SEY1 OS=Scheffersomyces stipitis (strain ATCC 58785 / CBS 6054 / NBRC 10063 / NRRL Y-11545) GN=SEY1 PE=3 SV=2 487 1074 4.0E-61
sp|Q9C0L9|SEY1_CANAL Protein SEY1 OS=Candida albicans (strain SC5314 / ATCC MYA-2876) GN=SEY1 PE=1 SV=2 525 1066 5.0E-61
sp|B9WJF2|SEY1_CANDC Protein SEY1 OS=Candida dubliniensis (strain CD36 / ATCC MYA-646 / CBS 7987 / NCPF 3949 / NRRL Y-17841) GN=SEY1 PE=3 SV=1 525 1066 2.0E-58
sp|C4R432|SEY1_PICPG Protein SEY1 OS=Komagataella pastoris (strain GS115 / ATCC 20864) GN=SEY1 PE=3 SV=1 490 1060 1.0E-57
sp|Q74ZD5|SEY1_ASHGO Protein SEY1 OS=Ashbya gossypii (strain ATCC 10895 / CBS 109.51 / FGSC 9923 / NRRL Y-1056) GN=SEY1 PE=3 SV=1 490 1072 9.0E-57
sp|Q6FLC5|SEY1_CANGA Protein SEY1 OS=Candida glabrata (strain ATCC 2001 / CBS 138 / JCM 3761 / NBRC 0622 / NRRL Y-65) GN=SEY1 PE=3 SV=1 490 1058 8.0E-53
sp|C5MCD3|SEY1_CANTT Protein SEY1 OS=Candida tropicalis (strain ATCC MYA-3404 / T1) GN=SEY1 PE=3 SV=2 517 1060 7.0E-52
sp|B5VS52|SEY1_YEAS6 Protein SEY1 OS=Saccharomyces cerevisiae (strain AWRI1631) GN=SEY1 PE=3 SV=1 490 1068 2.0E-50
sp|Q99287|SEY1_YEAST Protein SEY1 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=SEY1 PE=1 SV=1 490 1068 2.0E-50
sp|B3LJJ8|SEY1_YEAS1 Protein SEY1 OS=Saccharomyces cerevisiae (strain RM11-1a) GN=SEY1 PE=3 SV=1 490 1068 5.0E-50
sp|Q6CJ97|SEY1_KLULA Protein SEY1 OS=Kluyveromyces lactis (strain ATCC 8585 / CBS 2359 / DSM 70799 / NBRC 1267 / NRRL Y-1140 / WM37) GN=SEY1 PE=3 SV=1 492 1071 6.0E-50
sp|A6ZP10|SEY1_YEAS7 Protein SEY1 OS=Saccharomyces cerevisiae (strain YJM789) GN=SEY1 PE=3 SV=1 490 1068 2.0E-49
sp|A7TJY3|SEY1_VANPO Protein SEY1 OS=Vanderwaltozyma polyspora (strain ATCC 22028 / DSM 70294) GN=SEY1 PE=3 SV=1 490 1077 1.0E-47
sp|A5E240|SEY1_LODEL Protein SEY1 OS=Lodderomyces elongisporus (strain ATCC 11503 / CBS 2605 / JCM 1781 / NBRC 1676 / NRRL YB-4239) GN=SEY1 PE=3 SV=1 509 1060 3.0E-47
sp|C5DTA7|SEY1_ZYGRC Protein SEY1 OS=Zygosaccharomyces rouxii (strain ATCC 2623 / CBS 732 / NBRC 1130 / NCYC 568 / NRRL Y-229) GN=SEY1 PE=3 SV=1 526 1068 8.0E-46
sp|C4M6U3|SEY12_ENTHI Protein SEY1 homolog 2 OS=Entamoeba histolytica GN=EHI_054180 PE=3 SV=1 526 1055 5.0E-37
sp|Q54W90|SEY1_DICDI Protein SEY1 homolog OS=Dictyostelium discoideum GN=DDB_0206311 PE=3 SV=1 493 1056 2.0E-32
sp|B0EKR0|SEY12_ENTDS Protein SEY1 homolog 2 OS=Entamoeba dispar (strain ATCC PRA-260 / SAW760) GN=EDI_026070 PE=3 SV=1 526 1055 1.0E-30
sp|A2FJ32|SEY13_TRIVA Protein SEY1 homolog 3 OS=Trichomonas vaginalis GN=TVAG_204380 PE=3 SV=1 492 1083 3.0E-20
sp|Q9FKE9|RHD32_ARATH Protein ROOT HAIR DEFECTIVE 3 homolog 2 OS=Arabidopsis thaliana GN=At5g45160 PE=2 SV=1 506 1030 2.0E-19
sp|Q2QMH2|RHD31_ORYSJ Protein ROOT HAIR DEFECTIVE 3 homolog 1 OS=Oryza sativa subsp. japonica GN=Os12g0604600 PE=2 SV=1 491 1031 5.0E-19
sp|A2EI35|SEY11_TRIVA Protein SEY1 homolog 1 OS=Trichomonas vaginalis GN=TVAG_273580 PE=3 SV=1 496 1073 1.0E-17
sp|A7AT07|SEY1_BABBO Protein SEY1 homolog OS=Babesia bovis GN=BBOV_II001090 PE=3 SV=1 490 1032 6.0E-17
sp|Q2R224|RHD32_ORYSJ Protein ROOT HAIR DEFECTIVE 3 homolog 2 OS=Oryza sativa subsp. japonica GN=Os11g0582300 PE=2 SV=1 534 1027 1.0E-16
sp|A2EK80|SEY12_TRIVA Protein SEY1 homolog 2 OS=Trichomonas vaginalis GN=TVAG_100140 PE=3 SV=1 495 1033 2.0E-16
sp|Q0JLS6|RHD3_ORYSJ Protein ROOT HAIR DEFECTIVE 3 OS=Oryza sativa subsp. japonica GN=RHD3 PE=2 SV=1 534 1031 2.0E-16
sp|B0E843|SEY11_ENTDS Protein SEY1 homolog 1 OS=Entamoeba dispar (strain ATCC PRA-260 / SAW760) GN=EDI_213660 PE=3 SV=1 518 1010 4.0E-15
sp|Q4U9I8|SEY1_THEAN Protein SEY1 homolog OS=Theileria annulata GN=TA08650 PE=3 SV=1 487 1034 1.0E-14
sp|Q9UTK6|YKR3_SCHPO G-patch domain-containing protein C1486.03 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=SPAC1486.03c PE=3 SV=1 203 442 2.0E-14
sp|Q9SSN0|RHD31_ARATH Protein ROOT HAIR DEFECTIVE 3 homolog 1 OS=Arabidopsis thaliana GN=At1g72960 PE=2 SV=2 534 1029 3.0E-14
sp|P93042|RHD3_ARATH Protein ROOT HAIR DEFECTIVE 3 OS=Arabidopsis thaliana GN=RHD3 PE=1 SV=1 506 1033 4.0E-14
sp|C4LTM1|SEY11_ENTHI Protein SEY1 homolog 1 OS=Entamoeba histolytica GN=EHI_012280 PE=3 SV=1 522 660 1.0E-12
sp|Q4XZY3|SEY1_PLACH Protein SEY1 homolog OS=Plasmodium chabaudi GN=PC000321.02.0 PE=3 SV=1 530 670 2.0E-10
sp|Q4N280|SEY1_THEPA Protein SEY1 homolog OS=Theileria parva GN=TP04_0477 PE=3 SV=2 488 578 1.0E-09
sp|B3LAJ9|SEY1_PLAKH Protein SEY1 homolog OS=Plasmodium knowlesi (strain H) GN=PKH_133110 PE=3 SV=1 529 578 3.0E-09
sp|A5K168|SEY1_PLAVS Protein SEY1 homolog OS=Plasmodium vivax (strain Salvador I) GN=PVX_085645 PE=3 SV=1 529 578 4.0E-09
sp|Q8ILT5|SEY1_PLAF7 Protein SEY1 homolog OS=Plasmodium falciparum (isolate 3D7) GN=PF14_0159 PE=3 SV=2 529 689 7.0E-08
sp|A4HK17|SEY1_LEIBR Protein SEY1 homolog OS=Leishmania braziliensis GN=LbrM32_V2.0440 PE=3 SV=2 526 652 3.0E-07
sp|A4I7K1|SEY1_LEIIN Protein SEY1 homolog OS=Leishmania infantum GN=LinJ32.0740 PE=3 SV=2 532 677 1.0E-06
sp|Q17784|TFP11_CAEEL Septin and tuftelin-interacting protein 1 homolog OS=Caenorhabditis elegans GN=stip-1 PE=1 SV=2 49 551 1.0E-06
sp|Q4DHA1|SEY1_TRYCC Protein SEY1 homolog OS=Trypanosoma cruzi (strain CL Brener) GN=Tc00.1047053511285.50 PE=3 SV=1 532 581 2.0E-06
sp|Q66J74|TFP11_XENLA Tuftelin-interacting protein 11 OS=Xenopus laevis GN=tfip11 PE=2 SV=1 226 507 2.0E-06
sp|Q9Y103|TFP11_DROME Septin-interacting protein 1 OS=Drosophila melanogaster GN=sip1 PE=1 SV=1 226 507 3.0E-06
sp|Q4Q5P8|SEY1_LEIMA Protein SEY1 homolog OS=Leishmania major GN=LmjF32.0370 PE=3 SV=2 532 677 4.0E-06
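
Each hit row packs a UniProt-style header (`sp|accession|entry`, description, `OS=`, `GN=`, `PE=`, `SV=`) together with the Start, End and E-value columns. A minimal parsing sketch follows. The pattern and the `parse_hit` helper are hypothetical, and they assume every field is present, which holds for the rows listed here but is not guaranteed for UniProt headers in general.

```python
import re

# Hypothetical pattern for one row of the table above.
HIT_PATTERN = re.compile(
    r"^sp\|(?P<accession>[^|]+)\|(?P<entry>\S+) "
    r"(?P<description>.+?) OS=(?P<organism>.+?) GN=(?P<gene>\S+) "
    r"PE=(?P<pe>\d+) SV=(?P<sv>\d+) "
    r"(?P<start>\d+) (?P<end>\d+) (?P<evalue>\S+)$"
)


def parse_hit(line):
    """Turn one table row into a dict with typed numeric fields."""
    match = HIT_PATTERN.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised hit row: {line!r}")
    hit = match.groupdict()
    for key in ("pe", "sv", "start", "end"):
        hit[key] = int(hit[key])
    hit["evalue"] = float(hit["evalue"])
    return hit


# Three rows copied verbatim from the table.
rows = [
    "sp|Q4PEQ0|SEY1_USTMA Protein SEY1 OS=Ustilago maydis (strain 521 / FGSC 9021) GN=SEY1 PE=3 SV=2 492 1055 1.0E-107",
    "sp|Q17784|TFP11_CAEEL Septin and tuftelin-interacting protein 1 homolog OS=Caenorhabditis elegans GN=stip-1 PE=1 SV=2 49 551 1.0E-06",
    "sp|Q99287|SEY1_YEAST Protein SEY1 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=SEY1 PE=1 SV=1 490 1068 2.0E-50",
]

hits = [parse_hit(row) for row in rows]
for hit in hits:
    print(hit["accession"], hit["organism"], hit["start"], hit["end"], hit["evalue"])
```

Converting the E-value to a float makes it possible to filter or sort hits numerically. Rows shown as `0.0E+00` parse to `0.0`.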

GO

GO Term Description Terminal node
GO:0003676 nucleic acid binding Yes
GO:0097159 organic cyclic compound binding No
GO:0005488 binding No
GO:1901363 heterocyclic compound binding No
GO:0003674 molecular_function No

Deeploc

Localizations Nucleus
Signals Nuclear localization signal
Compartment Probability
Cytoplasm 0.2058
Nucleus 0.682
Extracellular 0.3263
Cell membrane 0.0605
Mitochondrion 0.1939
Plastid 0.0271
Endoplasmic reticulum 0.1833
Lysosome/Vacuole 0.0225
Golgi apparatus 0.0968
Peroxisome 0.0063
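
The ten per-compartment values can be ranked to see how the reported localization relates to the scores. This is an illustrative sketch only: the dictionary simply restates the values above, and treating the highest score as the call is an assumption, not a description of how Deeploc applies its own thresholds.

```python
# Per-compartment values copied from the Deeploc result above.
scores = {
    "Cytoplasm": 0.2058,
    "Nucleus": 0.682,
    "Extracellular": 0.3263,
    "Cell membrane": 0.0605,
    "Mitochondrion": 0.1939,
    "Plastid": 0.0271,
    "Endoplasmic reticulum": 0.1833,
    "Lysosome/Vacuole": 0.0225,
    "Golgi apparatus": 0.0968,
    "Peroxisome": 0.0063,
}

# Rank compartments from highest to lowest score.
ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
top_compartment, top_score = ranked[0]

# The values add up to more than 1, so they are read here as independent
# per-compartment scores rather than a single probability distribution.
total = sum(scores.values())

print(top_compartment, top_score)
print("runner-up:", ranked[1][0], ranked[1][1])
print("sum of scores:", round(total, 4))
```

The top-ranked compartment is the nucleus, matching the reported localization, with Extracellular as the runner-up.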

SignalP

(None)

Transmembrane Domains

Domain # Start End Length
1 982 1004 22
2 1011 1033 22
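
A short sketch relating the two predicted transmembrane segments to the Pfam coordinates listed earlier. The data structures and variable names are hypothetical. It only checks the arithmetic of the table and finds which Pfam domain contains each segment.

```python
# (domain number, start, end, length) copied from the table above.
tm_segments = [(1, 982, 1004, 22), (2, 1011, 1033, 22)]

# Pfam domain coordinates copied from the PFAM Domains table.
pfam_domains = {
    "Sey1_3HB": (606, 1045),
    "G-patch": (228, 270),
    "RHD3_GTPase": (492, 559),
    "TIP_N": (39, 115),
    "G-patch_2": (229, 272),
}

# In this table the Length column equals End - Start.
length_matches = all(end - start == length for _, start, end, length in tm_segments)

# Residues strictly between the end of segment 1 and the start of segment 2.
loop_residues = tm_segments[1][1] - tm_segments[0][2] - 1

# For each segment, list the Pfam domains that fully contain it.
containing = {
    number: [
        name
        for name, (d_start, d_end) in pfam_domains.items()
        if d_start <= start and end <= d_end
    ]
    for number, start, end, _ in tm_segments
}

print(length_matches, loop_residues, containing)
```

Both segments fall inside the coordinates of the Sey1_3HB Pfam hit and are separated by six residues.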

Transcription Factor Class

(None)

CAZymes

(None)

Secondary Metabolism

(None)

Expression data

No expression data available for this genome

Orthologs

Orthofinder run ID 4
Orthogroup 2092
Species Protein ID
Ophiocordyceps australis 1348a (Ghana) OphauG2|7853
Ophiocordyceps australis map64 (Brazil) OphauB2|6511
Ophiocordyceps camponoti-floridani Ophcf2|06604
Ophiocordyceps camponoti-rufipedis Ophun1|7374
Ophiocordyceps kimflemingae Ophio5|5805
Ophiocordyceps subramaniannii Hirsu2|2459 (this protein)

Sequences

Type of sequence Sequence
Protein >Hirsu2|2459
MENSAQPSFDPSRLTKASAADYSSSASDDDDEYLMPGVNVDDDDFGDHNPRKRRRIGGKSKEKAALGIFGSDSDD
DRPGQRWKSKTLRRKGVDFVSTAVQEGQGQDQGEGEGEDTSDGKAGLGNGAASSDDDEDDDDAGLGLGFGNRSKA
PAWAGTADGMQHQGAPTLKSPVAAVETTFDGKSVLGRGFVPSSATEPVLKEPEADSTPPARNKPQVSAFGPKGKV
NAKSFGARMMAKMGYVNGMGLGKEGQGRNVIIEANLRPQGIGLGAVKEKTERERQEEKRQARLRGDDVVDSDEEE
KRRRKAKKKSLGIANGSASSTPRRQKTKYMTAEEVRAAAPGLHIPDAFTPILDMTGPGSKLLTSTSGVMTPNSGV
FESSEVVEARKLVRRAHADLLAFTEEWKSLEERKSWLNLELREREQEVEDVRSDFERLQAFSTLVTEQLMQASEW
DQVIACLVKAAELGSINPETADVAVAAIHPFLKAGDWDPLEEPGRFASDLKRLRGLFATPGPGSSNGDQELEGGV
FLPEYHRRIPADGLAVYAEGIWDQIVNNKDLDLPTQQELLAQFRCDEISREVLVAFDLVVAPLEEQQTEATKLGR
ALVLVDLGLTGSRAREACAGAFEAQASRYHKPTYARKKLELEAKVDARLKALYHGQLSAAHKAGVAAFSEAVANK
VKAGQKAGGAYEFAEIVAAEKKKTLDVFEAEAEALAIHGVAWTDFKPQYQLFEAELDEVSARLRKEEMRRLATRV
ERWVKSRLGDSVGLEFNKLGSGRGGSGAPETGDKPPAEKDLWDRIWAVFTGVVAEAEGRFAERAKSFDASDDETE
VGMWRLRRKSWVGLREKIDEEVMEGNILLKLRENFEDKFRYDEAGVPRIWLPTDDIEGIYTKARESTLTLIPLLS
RFRLSETYAPPDLPGFVGPQPRGVEVGDEEDLAPIGGVDEEEGKSLEEEMTVLGEGKRQDLVVRFKKTADGVYVE
AKRGAIGGVAQVPWYFYALLLALGWNEFFMVLRNPFLFILLIMMMGGTYVAYTLNLLGPMMQMGNAAVNQGLDIA
KHQLREFIANSETARQALAVPASNGTSGNDIDMDRLDSRGKRTAGTRDEEDDEI*
Coding >Hirsu2|2459
ATGGAAAACTCCGCGCAGCCATCGTTCGACCCGTCGCGTCTCACGAAAGCCTCTGCAGCAGATTACTCGTCCTCG
GCATCCGACGATGACGACGAATACCTCATGCCCGGCGTCAACGTCGACGACGATGATTTCGGCGACCATAACCCC
CGAAAGCGACGTCGCATCGGAGGCAAAAGCAAGGAAAAGGCGGCCCTGGGCATCTTCGGCTCCGACAGCGACGAT
GACCGTCCTGGCCAAAGATGGAAGAGCAAGACTCTGAGGCGCAAAGGGGTGGATTTCGTGTCGACGGCCGTACAA
GAAGGCCAAGGCCAAGACCAGGGCGAAGGCGAAGGCGAAGATACCAGTGACGGAAAGGCCGGTCTTGGCAACGGA
GCGGCTTCTTCGGATGACGATGAAGATGATGACGATGCCGGCCTGGGCCTGGGCTTTGGCAACCGATCGAAGGCA
CCAGCTTGGGCCGGCACGGCGGATGGGATGCAGCATCAAGGCGCGCCTACTCTCAAGTCACCTGTCGCCGCAGTC
GAGACAACGTTTGATGGAAAAAGTGTCCTCGGCCGCGGCTTTGTGCCTTCATCGGCCACGGAGCCAGTGTTGAAA
GAGCCAGAGGCCGACAGTACGCCGCCTGCCCGCAACAAGCCTCAGGTCAGCGCGTTCGGCCCCAAGGGCAAGGTC
AACGCCAAGTCGTTTGGTGCCCGGATGATGGCAAAGATGGGCTACGTCAACGGCATGGGCCTCGGCAAGGAAGGC
CAGGGCCGAAATGTCATCATCGAAGCCAACCTGCGGCCCCAAGGAATCGGTTTGGGTGCGGTCAAGGAAAAGACG
GAGCGGGAACGACAGGAAGAAAAGAGACAGGCGCGGTTGCGTGGCGATGACGTCGTGGATTCCGACGAGGAGGAG
AAACGGCGGCGGAAAGCCAAGAAGAAGTCGTTGGGCATCGCCAACGGCAGCGCCAGCAGCACGCCGCGGAGGCAG
AAGACGAAATACATGACGGCAGAAGAGGTCCGGGCCGCGGCACCGGGACTGCACATCCCGGACGCCTTCACGCCC
ATCTTGGACATGACGGGCCCGGGTAGCAAGCTTCTCACCTCGACAAGTGGCGTGATGACGCCCAACTCTGGCGTT
TTCGAGTCGAGCGAGGTCGTCGAAGCCAGGAAGTTGGTGAGACGGGCCCACGCAGACCTGTTGGCCTTCACGGAG
GAGTGGAAGAGTCTGGAGGAGCGAAAGTCGTGGCTCAACCTGGAGCTGCGAGAGAGGGAGCAGGAGGTGGAGGAC
GTCCGGTCCGACTTTGAGCGGCTGCAAGCATTCTCGACGCTGGTCACAGAGCAGCTGATGCAGGCCTCGGAATGG
GACCAGGTCATTGCCTGCCTCGTCAAGGCGGCCGAGCTCGGCTCCATCAATCCGGAGACGGCCGACGTCGCCGTC
GCAGCCATCCACCCATTTTTGAAGGCGGGCGACTGGGACCCGCTGGAAGAACCCGGCCGGTTTGCTTCCGATCTG
AAGCGGCTGCGCGGTCTGTTTGCCACGCCGGGCCCGGGCAGCAGCAACGGCGACCAGGAGCTCGAGGGGGGCGTC
TTCCTGCCCGAGTATCATCGGCGGATCCCCGCCGACGGGCTCGCCGTCTACGCCGAGGGCATCTGGGACCAGATC
GTCAACAACAAGGATCTCGACCTGCCCACCCAGCAGGAGCTCCTGGCCCAGTTCCGATGCGACGAGATCTCGCGC
GAGGTCCTCGTCGCCTTCGACCTCGTCGTCGCGCCCCTCGAGGAGCAGCAGACCGAGGCGACGAAGCTGGGCAGG
GCCCTGGTCCTCGTCGACCTCGGCCTCACCGGCAGCCGCGCGCGCGAGGCCTGCGCCGGCGCCTTCGAGGCCCAG
GCCAGCCGCTACCACAAGCCGACCTACGCGCGCAAGAAGCTGGAGCTCGAGGCCAAGGTCGACGCCAGGCTCAAG
GCCCTCTACCACGGGCAGCTGTCCGCCGCCCACAAGGCCGGCGTCGCCGCCTTCAGCGAGGCCGTCGCCAACAAG
GTAAAGGCCGGCCAGAAGGCAGGCGGCGCCTACGAGTTCGCCGAGATCGTGGCCGCCGAGAAGAAGAAGACGCTC
GACGTGTTCGAGGCCGAGGCCGAGGCCCTGGCCATCCACGGCGTCGCCTGGACCGACTTCAAGCCGCAGTACCAG
CTGTTCGAGGCCGAGCTGGACGAGGTCAGCGCGAGGCTGCGCAAGGAGGAGATGCGACGGCTCGCGACGCGCGTC
GAGCGCTGGGTCAAGTCGCGACTCGGCGACTCCGTCGGCCTCGAGTTCAACAAGCTGGGCTCCGGCCGGGGCGGC
TCCGGCGCTCCCGAGACGGGCGACAAGCCGCCGGCGGAGAAGGACCTCTGGGACAGGATCTGGGCCGTCTTCACC
GGCGTCGTCGCCGAGGCCGAGGGCCGGTTCGCCGAGCGCGCGAAGAGCTTCGACGCCAGCGACGACGAGACCGAG
GTCGGCATGTGGCGTCTGCGCCGCAAGAGCTGGGTCGGCCTCCGCGAGAAGATTGACGAGGAGGTCATGGAGGGC
AACATCCTCCTGAAGCTTCGCGAGAACTTTGAGGATAAGTTCCGATACGACGAGGCGGGCGTGCCGCGGATATGG
CTCCCGACCGACGACATTGAGGGCATCTACACCAAGGCCAGGGAGTCGACGCTGACGCTCATCCCGCTGCTCTCG
AGATTTCGCCTCTCCGAGACGTACGCGCCGCCCGACCTGCCCGGCTTCGTCGGGCCCCAGCCCCGCGGCGTCGAG
GTCGGCGACGAGGAGGACCTGGCCCCCATCGGCGGCGTCGACGAGGAAGAGGGCAAGAGCCTGGAGGAGGAGATG
ACGGTCCTCGGCGAGGGCAAGCGGCAAGACCTGGTGGTGCGGTTCAAGAAGACGGCCGACGGCGTCTACGTCGAG
GCCAAGCGCGGCGCCATCGGCGGCGTCGCGCAAGTTCCGTGGTACTTTTACGCGCTGCTCCTCGCGCTGGGGTGG
AACGAATTCTTCATGGTCCTGCGAAATCCCTTCTTGTTCATCCTGCTCATCATGATGATGGGCGGGACGTACGTG
GCTTACACGCTCAACCTGCTGGGCCCCATGATGCAGATGGGCAATGCGGCCGTCAACCAGGGGCTCGACATCGCC
AAGCACCAGCTCCGCGAGTTCATCGCCAACTCGGAGACGGCGAGACAGGCGCTGGCCGTGCCGGCGTCCAACGGG
ACGAGCGGAAACGACATTGACATGGATCGACTCGACAGCCGGGGGAAGAGGACGGCCGGGACCCGAGACGAGGAG
GATGACGAGATCTGA
Transcript >Hirsu2|2459
ATGGAAAACTCCGCGCAGCCATCGTTCGACCCGTCGCGTCTCACGAAAGCCTCTGCAGCAGATTACTCGTCCTCG
GCATCCGACGATGACGACGAATACCTCATGCCCGGCGTCAACGTCGACGACGATGATTTCGGCGACCATAACCCC
CGAAAGCGACGTCGCATCGGAGGCAAAAGCAAGGAAAAGGCGGCCCTGGGCATCTTCGGCTCCGACAGCGACGAT
GACCGTCCTGGCCAAAGATGGAAGAGCAAGACTCTGAGGCGCAAAGGGGTGGATTTCGTGTCGACGGCCGTACAA
GAAGGCCAAGGCCAAGACCAGGGCGAAGGCGAAGGCGAAGATACCAGTGACGGAAAGGCCGGTCTTGGCAACGGA
GCGGCTTCTTCGGATGACGATGAAGATGATGACGATGCCGGCCTGGGCCTGGGCTTTGGCAACCGATCGAAGGCA
CCAGCTTGGGCCGGCACGGCGGATGGGATGCAGCATCAAGGCGCGCCTACTCTCAAGTCACCTGTCGCCGCAGTC
GAGACAACGTTTGATGGAAAAAGTGTCCTCGGCCGCGGCTTTGTGCCTTCATCGGCCACGGAGCCAGTGTTGAAA
GAGCCAGAGGCCGACAGTACGCCGCCTGCCCGCAACAAGCCTCAGGTCAGCGCGTTCGGCCCCAAGGGCAAGGTC
AACGCCAAGTCGTTTGGTGCCCGGATGATGGCAAAGATGGGCTACGTCAACGGCATGGGCCTCGGCAAGGAAGGC
CAGGGCCGAAATGTCATCATCGAAGCCAACCTGCGGCCCCAAGGAATCGGTTTGGGTGCGGTCAAGGAAAAGACG
GAGCGGGAACGACAGGAAGAAAAGAGACAGGCGCGGTTGCGTGGCGATGACGTCGTGGATTCCGACGAGGAGGAG
AAACGGCGGCGGAAAGCCAAGAAGAAGTCGTTGGGCATCGCCAACGGCAGCGCCAGCAGCACGCCGCGGAGGCAG
AAGACGAAATACATGACGGCAGAAGAGGTCCGGGCCGCGGCACCGGGACTGCACATCCCGGACGCCTTCACGCCC
ATCTTGGACATGACGGGCCCGGGTAGCAAGCTTCTCACCTCGACAAGTGGCGTGATGACGCCCAACTCTGGCGTT
TTCGAGTCGAGCGAGGTCGTCGAAGCCAGGAAGTTGGTGAGACGGGCCCACGCAGACCTGTTGGCCTTCACGGAG
GAGTGGAAGAGTCTGGAGGAGCGAAAGTCGTGGCTCAACCTGGAGCTGCGAGAGAGGGAGCAGGAGGTGGAGGAC
GTCCGGTCCGACTTTGAGCGGCTGCAAGCATTCTCGACGCTGGTCACAGAGCAGCTGATGCAGGCCTCGGAATGG
GACCAGGTCATTGCCTGCCTCGTCAAGGCGGCCGAGCTCGGCTCCATCAATCCGGAGACGGCCGACGTCGCCGTC
GCAGCCATCCACCCATTTTTGAAGGCGGGCGACTGGGACCCGCTGGAAGAACCCGGCCGGTTTGCTTCCGATCTG
AAGCGGCTGCGCGGTCTGTTTGCCACGCCGGGCCCGGGCAGCAGCAACGGCGACCAGGAGCTCGAGGGGGGCGTC
TTCCTGCCCGAGTATCATCGGCGGATCCCCGCCGACGGGCTCGCCGTCTACGCCGAGGGCATCTGGGACCAGATC
GTCAACAACAAGGATCTCGACCTGCCCACCCAGCAGGAGCTCCTGGCCCAGTTCCGATGCGACGAGATCTCGCGC
GAGGTCCTCGTCGCCTTCGACCTCGTCGTCGCGCCCCTCGAGGAGCAGCAGACCGAGGCGACGAAGCTGGGCAGG
GCCCTGGTCCTCGTCGACCTCGGCCTCACCGGCAGCCGCGCGCGCGAGGCCTGCGCCGGCGCCTTCGAGGCCCAG
GCCAGCCGCTACCACAAGCCGACCTACGCGCGCAAGAAGCTGGAGCTCGAGGCCAAGGTCGACGCCAGGCTCAAG
GCCCTCTACCACGGGCAGCTGTCCGCCGCCCACAAGGCCGGCGTCGCCGCCTTCAGCGAGGCCGTCGCCAACAAG
GTAAAGGCCGGCCAGAAGGCAGGCGGCGCCTACGAGTTCGCCGAGATCGTGGCCGCCGAGAAGAAGAAGACGCTC
GACGTGTTCGAGGCCGAGGCCGAGGCCCTGGCCATCCACGGCGTCGCCTGGACCGACTTCAAGCCGCAGTACCAG
CTGTTCGAGGCCGAGCTGGACGAGGTCAGCGCGAGGCTGCGCAAGGAGGAGATGCGACGGCTCGCGACGCGCGTC
GAGCGCTGGGTCAAGTCGCGACTCGGCGACTCCGTCGGCCTCGAGTTCAACAAGCTGGGCTCCGGCCGGGGCGGC
TCCGGCGCTCCCGAGACGGGCGACAAGCCGCCGGCGGAGAAGGACCTCTGGGACAGGATCTGGGCCGTCTTCACC
GGCGTCGTCGCCGAGGCCGAGGGCCGGTTCGCCGAGCGCGCGAAGAGCTTCGACGCCAGCGACGACGAGACCGAG
GTCGGCATGTGGCGTCTGCGCCGCAAGAGCTGGGTCGGCCTCCGCGAGAAGATTGACGAGGAGGTCATGGAGGGC
AACATCCTCCTGAAGCTTCGCGAGAACTTTGAGGATAAGTTCCGATACGACGAGGCGGGCGTGCCGCGGATATGG
CTCCCGACCGACGACATTGAGGGCATCTACACCAAGGCCAGGGAGTCGACGCTGACGCTCATCCCGCTGCTCTCG
AGATTTCGCCTCTCCGAGACGTACGCGCCGCCCGACCTGCCCGGCTTCGTCGGGCCCCAGCCCCGCGGCGTCGAG
GTCGGCGACGAGGAGGACCTGGCCCCCATCGGCGGCGTCGACGAGGAAGAGGGCAAGAGCCTGGAGGAGGAGATG
ACGGTCCTCGGCGAGGGCAAGCGGCAAGACCTGGTGGTGCGGTTCAAGAAGACGGCCGACGGCGTCTACGTCGAG
GCCAAGCGCGGCGCCATCGGCGGCGTCGCGCAAGTTCCGTGGTACTTTTACGCGCTGCTCCTCGCGCTGGGGTGG
AACGAATTCTTCATGGTCCTGCGAAATCCCTTCTTGTTCATCCTGCTCATCATGATGATGGGCGGGACGTACGTG
GCTTACACGCTCAACCTGCTGGGCCCCATGATGCAGATGGGCAATGCGGCCGTCAACCAGGGGCTCGACATCGCC
AAGCACCAGCTCCGCGAGTTCATCGCCAACTCGGAGACGGCGAGACAGGCGCTGGCCGTGCCGGCGTCCAACGGG
ACGAGCGGAAACGACATTGACATGGATCGACTCGACAGCCGGGGGAAGAGGACGGCCGGGACCCGAGACGAGGAG
GATGACGAGATCTGA
Gene >Hirsu2|2459
ATGGAAAACTCCGCGCAGCCATCGTTCGACCCGTCGCGTCTCACGAAAGCCTCTGCAGCAGATTACTCGTCCTCG
GCATCCGACGATGACGACGAATACCTCATGCCCGGCGTCAACGTCGACGACGATGATTTCGGCGACCATAACCCC
CGAAAGCGACGTCGCATCGGAGGCAAAAGCAAGGAAAAGGCGGCCCTGGGCATCTTCGGCTCCGACAGCGACGAT
GACCGTCCTGGCCAAAGATGGAAGAGCAAGACTCTGAGGCGCAAAGGGGTGGATTTCGTGTCGACGGCCGTACAA
GAAGGCCAAGGCCAAGACCAGGGCGAAGGCGAAGGCGAAGATACCAGTGACGGAAAGGCCGGTCTTGGCAACGGA
GCGGCTTCTTCGGATGACGATGAAGATGATGACGATGCCGGCCTGGGCCTGGGCTTTGGCAACCGATCGAAGGCA
CCAGCTTGGGCCGGCACGGCGGATGGGATGCAGCATCAAGGCGCGCCTACTCTCAAGTCACCTGTCGCCGCAGTC
GAGACAACGTTTGATGGAAAAAGTGTCCTCGGCCGCGGCTTTGTGCCTTCATCGGCCACGGAGCCAGTGTTGAAA
GAGCCAGAGGCCGACAGTACGCCGCCTGCCCGCAACAAGCCTCAGGTCAGCGCGTTCGGCCCCAAGGGCAAGGTC
AACGCCAAGTCGTTTGGTGCCCGGATGATGGCAAAGATGGGCTACGTCAACGGCATGGGCCTCGGCAAGGAAGGC
CAGGGCCGAAATGTCATCATCGAAGCCAACCTGCGGCCCCAAGGAATCGGTTTGGGTGCGGTCAAGGAAAAGACG
GAGCGGGAACGACAGGAAGAAAAGAGACAGGCGCGGTTGCGTGGCGATGACGTCGTGGATTCCGACGAGGAGGAG
AAACGGCGGCGGAAAGCCAAGAAGAAGTCGTTGGGCATCGCCAACGGCAGCGCCAGCAGCACGCCGCGGAGGCAG
AAGACGAAATACATGACGGCAGAAGAGGTCCGGGCCGCGGCACCGGGACTGCACATCCCGGACGCCTTCACGCCC
ATCTTGGACATGACGGGCCCGGGTAGCAAGCTTCTCACCTCGACAAGTGGCGTGATGACGCCCAACTCTGGCGTT
TTCGAGTCGAGCGAGGTCGTCGAAGCCAGGAAGTTGGTGAGACGGGCCCACGCAGACCTGTTGGCCTTCACGGAG
GAGTGGAAGAGTCTGGAGGAGCGAAAGTCGTGGCTCAACCTGGAGCTGCGAGAGAGGGAGCAGGAGGTGGAGGAC
GTCCGGTCCGACTTTGAGCGGCTGCAAGCATTCTCGACGCTGGTCACAGAGCAGCTGATGCAGGCCTCGGAATGG
GACCAGGTCATTGCCTGCCTCGTCAAGGCGGCCGAGCTCGGCTCCATCAATCCGGAGACGGCCGACGTCGCCGTC
GCAGCCATCCACCCATTTTTGAAGGCGGGCGACTGGGACCCGCTGGAAGAACCCGGCCGGTTTGCTTCCGATCTG
AAGCGGCTGCGCGGTCTGTTTGCCACGCCGGGCCCGGGCAGCAGCAACGGCGACCAGGAGCTCGAGGGGGGCGTC
TTCCTGCCCGAGTATCATCGGCGGATCCCCGCCGACGGGCTCGCCGTCTACGCCGAGGGCATCTGGGACCAGATC
GTCAACAACAAGGATCTCGACCTGCCCACCCAGCAGGAGCTCCTGGCCCAGTTCCGATGCGACGAGATCTCGCGC
GAGGTCCTCGTCGCCTTCGACCTCGTCGTCGCGCCCCTCGAGGAGCAGCAGACCGAGGCGACGAAGCTGGGCAGG
GCCCTGGTCCTCGTCGACCTCGGCCTCACCGGCAGCCGCGCGCGCGAGGCCTGCGCCGGCGCCTTCGAGGCCCAG
GCCAGCCGCTACCACAAGCCGACCTACGCGCGCAAGAAGCTGGAGCTCGAGGCCAAGGTCGACGCCAGGCTCAAG
GCCCTCTACCACGGGCAGCTGTCCGCCGCCCACAAGGCCGGCGTCGCCGCCTTCAGCGAGGCCGTCGCCAACAAG
GTAAAGGCCGGCCAGAAGGCAGGCGGCGCCTACGAGTTCGCCGAGATCGTGGCCGCCGAGAAGAAGAAGACGCTC
GACGTGTTCGAGGCCGAGGCCGAGGCCCTGGCCATCCACGGCGTCGCCTGGACCGACTTCAAGCCGCAGTACCAG
CTGTTCGAGGCCGAGCTGGACGAGGTCAGCGCGAGGCTGCGCAAGGAGGAGATGCGACGGCTCGCGACGCGCGTC
GAGCGCTGGGTCAAGTCGCGACTCGGCGACTCCGTCGGCCTCGAGTTCAACAAGCTGGGCTCCGGCCGGGGCGGC
TCCGGCGCTCCCGAGACGGGCGACAAGCCGCCGGCGGAGAAGGACCTCTGGGACAGGATCTGGGCCGTCTTCACC
GGCGTCGTCGCCGAGGCCGAGGGCCGGTTCGCCGAGCGCGCGAAGAGCTTCGACGCCAGCGACGACGAGACCGAG
GTCGGCATGTGGCGTCTGCGCCGCAAGAGCTGGGTCGGCCTCCGCGAGAAGATTGACGAGGAGGTCATGGAGGGC
AACATCCTCCTGAAGCTTCGCGAGAACTTTGAGGATAAGTTCCGATACGACGAGGCGGGCGTGCCGCGGATATGG
CTCCCGACCGACGACATTGAGGGCATCTACACCAAGGCCAGGGAGTCGACGCTGACGCTCATCCCGCTGCTCTCG
AGATTTCGCCTCTCCGAGACGTACGCGCCGCCCGACCTGCCCGGCTTCGTCGGGCCCCAGCCCCGCGGCGTCGAG
GTCGGCGACGAGGAGGACCTGGCCCCCATCGGCGGCGTCGACGAGGAAGAGGGCAAGAGCCTGGAGGAGGAGATG
ACGGTCCTCGGCGAGGGCAAGCGGCAAGACCTGGTGGTGCGGTTCAAGAAGACGGCCGACGGCGTCTACGTCGAG
GCCAAGCGCGGCGCCATCGGCGGCGTCGCGCAAGTTCCGTGGTACTTTTACGCGCTGCTCCTCGCGCTGGGGTGG
AACGAATTCTTCATGGGTAAGGATGAGCGCCCCCCCCCTCCCCTGGCCCTCTCGACGCACTGCTGACGCACCATC
GCCACCAGTCCTGCGAAATCCCTTCTTGTTCATCCTGCTCATCATGATGATGGGCGGGACGTACGTGGCTTACAC
GCTCAACCTGCTGGGCCCCATGATGCAGATGGGCAATGCGGCCGTCAACCAGGGGCTCGACATCGCCAAGCACCA
GCTCCGCGAGTTCATCGCCAACTCGGAGACGGCGAGACAGGCGCTGGCCGTGCCGGCGTCCAACGGGACGAGCGG
AAACGACATTGACATGGATCGACTCGACAGCCGGGGGAAGAGGACGGCCGGGACCCGAGACGAGGAGGATGACGA
GATCTGA
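
The sequences above are related in two ways that can be checked directly: the coding sequence translates to the protein, and the gene sequence equals the coding sequence with extra bases inserted. The sketch below is illustrative. The function names are hypothetical, the standard genetic code is assumed, and `find_single_insertion` assumes the two sequences differ by exactly one contiguous insertion. The inputs are short excerpts copied from the sequences above, not the full records.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon ('*' marks a stop codon)."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def find_single_insertion(gene, cds):
    """Return (start, end) so that gene[:start] + gene[end:] == cds."""
    if len(gene) < len(cds):
        raise ValueError("gene must be at least as long as the coding sequence")
    prefix = 0
    while prefix < len(cds) and gene[prefix] == cds[prefix]:
        prefix += 1
    suffix = 0
    while (suffix < len(cds) - prefix
           and gene[len(gene) - 1 - suffix] == cds[len(cds) - 1 - suffix]):
        suffix += 1
    if prefix + suffix != len(cds):
        raise ValueError("sequences differ by more than one insertion")
    return prefix, len(gene) - suffix


# First 36 and last 15 bases of the coding sequence.
cds_start = "ATGGAAAACTCCGCGCAGCCATCGTTCGACCCGTCG"
cds_end = "GATGACGAGATCTGA"
print(translate(cds_start), translate(cds_end))

# Excerpts around the point where the gene and coding sequences diverge.
gene_excerpt = ("AACGAATTCTTCATGGGTAAGGATGAGCGCCCCCCCCCTCCCCTGGCCCTCTCGACGCAC"
                "TGCTGACGCACCATCGCCACCAGTCCTGCGAAATCCC")
cds_excerpt = "AACGAATTCTTCATGGTCCTGCGAAATCCC"

start, end = find_single_insertion(gene_excerpt, cds_excerpt)
insertion = gene_excerpt[start:end]
print(insertion[:2], insertion[-2:])
```

The translated excerpts agree with the first and last residues of the protein sequence, including the terminal `*`. The inserted segment begins with GT and ends with AG, which is consistent with a single intron accounting for the difference between the gene and coding sequence lengths.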

© 2023 - Robin Ohm - Utrecht University - The Netherlands
