Fungal Genomics at Utrecht University

General Properties

Protein ID: Hirsu2|11115
Gene name:
Location: Contig_965:1129..6043
Strand: -
Gene length (bp): 4914
Transcript length (bp): 4485
Coding sequence length (bp): 4485
Protein length (aa): 1495
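
The lengths listed above can be cross-checked with simple arithmetic. The following is a minimal illustrative sketch, not part of the database itself; the variable names are assumptions, and the numbers are copied from this record.

```python
# Values copied from the General Properties of this record.
gene_length_bp = 4914
transcript_length_bp = 4485
cds_length_bp = 4485
protein_length = 1495  # as reported; the protein sequence on this page ends with "*"

# A coding sequence should consist of a whole number of codons.
codons, remainder = divmod(cds_length_bp, 3)

# Bases present in the gene but absent from the transcript (presumably introns).
non_transcribed_bp = gene_length_bp - transcript_length_bp

# Transcript and coding sequence have the same length, so no UTRs are annotated.
utr_bp = transcript_length_bp - cds_length_bp

print(codons, remainder, non_transcribed_bp, utr_bp)
```

The codon count equals the reported protein length, which suggests that the reported length includes the terminal stop symbol shown in the protein sequence.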

PFAM Domains

PFAM Domain ID Short name Long name E-value Start End
PF00082 Peptidase_S8 Subtilase family 2.3E-36 155 567
PF01565 FAD_binding_4 FAD binding domain 5.7E-21 995 1142
PF06280 fn3_5 Fn3-like domain 3.4E-15 608 721
PF08031 BBE Berberine and berberine like 4.5E-09 1422 1452
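
The table lists the domains in order of E-value rather than position. As a rough sketch (the `Domain` type and variable names are hypothetical), the rows can be sorted by start coordinate to read off the domain architecture from N-terminus to C-terminus:

```python
from typing import NamedTuple


class Domain(NamedTuple):
    pfam_id: str
    short_name: str
    start: int
    end: int


# Rows copied from the PFAM Domains table of this record (E-values omitted).
domains = [
    Domain("PF00082", "Peptidase_S8", 155, 567),
    Domain("PF01565", "FAD_binding_4", 995, 1142),
    Domain("PF06280", "fn3_5", 608, 721),
    Domain("PF08031", "BBE", 1422, 1452),
]

# Order the domains from N-terminus to C-terminus.
ordered = sorted(domains, key=lambda d: d.start)
architecture = [d.short_name for d in ordered]

# Report any pair of consecutive domains whose coordinates overlap.
overlaps = [
    (a.short_name, b.short_name)
    for a, b in zip(ordered, ordered[1:])
    if b.start <= a.end
]

print(" - ".join(architecture))
print("overlaps:", overlaps)
```

Sorted this way, the protease-related domains sit in the N-terminal half of the protein and the FAD-binding and BBE domains in the C-terminal half.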

Swissprot hits

Swissprot ID Swissprot Description Start End E-value
sp|D4B1Z7|A2478_ARTBC Uncharacterized FAD-linked oxidoreductase ARB_02478 OS=Arthroderma benhamiae (strain ATCC MYA-4681 / CBS 112371) GN=ARB_02478 PE=1 SV=1 935 1460 1.0E-49
sp|D4AS41|A7056_ARTBC Uncharacterized FAD-linked oxidoreductase ARB_07056 OS=Arthroderma benhamiae (strain ATCC MYA-4681 / CBS 112371) GN=ARB_07056 PE=1 SV=2 883 1467 6.0E-40
sp|P29141|SUBV_BACSU Minor extracellular protease vpr OS=Bacillus subtilis (strain 168) GN=vpr PE=1 SV=1 152 637 1.0E-19
sp|P29599|SUBB_BACLE Subtilisin BL OS=Bacillus lentus PE=1 SV=1 143 301 1.0E-17
sp|P08159|HDNO_ARTOX 6-hydroxy-D-nicotine oxidase OS=Arthrobacter oxydans PE=1 SV=2 995 1258 4.0E-17
sp|P29600|SUBS_BACLE Subtilisin Savinase OS=Bacillus lentus PE=1 SV=1 143 301 2.0E-15
sp|P41362|ELYA_BACCS Alkaline protease OS=Bacillus clausii PE=1 SV=1 149 301 2.0E-15
sp|P27693|ELYA_BACAO Alkaline protease OS=Bacillus alcalophilus PE=1 SV=1 149 301 2.0E-15
sp|Q99405|PRTM_BACSK M-protease OS=Bacillus clausii (strain KSM-K16) GN=aprE PE=1 SV=2 149 301 2.0E-15
sp|P16271|P1P_LACLC PI-type proteinase OS=Lactococcus lactis subsp. cremoris GN=prtP PE=3 SV=1 155 734 6.0E-15
sp|P15293|P2P_LACLC PII-type proteinase OS=Lactococcus lactis subsp. cremoris GN=prt PE=3 SV=1 155 734 6.0E-15
sp|P00781|SUBD_BACLI Subtilisin DY OS=Bacillus licheniformis GN=apr PE=1 SV=1 142 299 2.0E-14
sp|P04189|SUBT_BACSU Subtilisin E OS=Bacillus subtilis (strain 168) GN=aprE PE=1 SV=3 135 343 3.0E-14
sp|P07518|SUBT_BACPU Subtilisin OS=Bacillus pumilus GN=apr PE=1 SV=1 135 343 3.0E-14
sp|P35835|SUBN_BACNA Subtilisin NAT OS=Bacillus subtilis subsp. natto GN=aprN PE=1 SV=1 135 343 4.0E-14
sp|Q02470|P2P_LACPA PII-type proteinase OS=Lactobacillus paracasei GN=prtP PE=1 SV=1 155 734 7.0E-14
sp|Q45670|THES_BACSJ Thermophilic serine proteinase OS=Bacillus sp. (strain AK1) PE=1 SV=1 157 302 7.0E-14
sp|Q9FZC6|FOX3_ARATH Flavin-dependent oxidoreductase FOX3 OS=Arabidopsis thaliana GN=FOX3 PE=3 SV=1 931 1212 2.0E-13
sp|P41363|ELYA_BACHD Thermostable alkaline protease OS=Bacillus halodurans (strain ATCC BAA-125 / DSM 18197 / FERM 7344 / JCM 9153 / C-125) GN=BH0855 PE=1 SV=2 141 343 3.0E-13
sp|P15292|P3P_LACLS PIII-type proteinase OS=Lactococcus lactis subsp. cremoris (strain SK11) GN=prtP PE=1 SV=2 155 734 3.0E-13
sp|P29142|SUBT_GEOSE Subtilisin J OS=Geobacillus stearothermophilus GN=aprJ PE=3 SV=1 135 343 1.0E-12
sp|P00783|SUBT_BACSA Subtilisin amylosacchariticus OS=Bacillus subtilis subsp. amylosacchariticus GN=apr PE=1 SV=2 135 343 1.0E-12
sp|P29139|ISP_PAEPO Intracellular serine protease OS=Paenibacillus polymyxa GN=isp PE=1 SV=1 155 297 2.0E-12
sp|P00782|SUBT_BACAM Subtilisin BPN' OS=Bacillus amyloliquefaciens GN=apr PE=1 SV=1 133 320 2.0E-12
sp|P20724|ELYA_BACYA Alkaline elastase YaB OS=Bacillus sp. (strain YaB) GN=ale PE=1 SV=1 151 343 2.0E-12
sp|Q33DQ2|THCAI_CANSA Inactive tetrahydrocannabinolic acid synthase OS=Cannabis sativa PE=3 SV=1 1069 1451 1.0E-11
sp|Q9FZC4|FOX1_ARATH Flavin-dependent oxidoreductase FOX1 OS=Arabidopsis thaliana GN=FOX1 PE=2 SV=1 931 1228 2.0E-11
sp|Q9FZC7|FOX4_ARATH Flavin-dependent oxidoreductase FOX4 OS=Arabidopsis thaliana GN=FOX4 PE=2 SV=1 990 1200 3.0E-11
sp|Q9FZC5|FOX2_ARATH Flavin-dependent oxidoreductase FOX2 OS=Arabidopsis thaliana GN=FOX2 PE=2 SV=1 936 1200 5.0E-11
sp|P00780|SUBT_BACLI Subtilisin Carlsberg OS=Bacillus licheniformis GN=apr PE=1 SV=1 142 265 6.0E-11
sp|P11018|ISP1_BACSU Major intracellular serine protease OS=Bacillus subtilis (strain 168) GN=isp PE=1 SV=2 129 297 8.0E-11
sp|P16396|SUBE_BACSU Minor extracellular protease Epr OS=Bacillus subtilis (strain 168) GN=epr PE=2 SV=1 143 361 1.0E-10
sp|A6P6W0|CASL1_CANSA Cannabidiolic acid synthase-like 1 OS=Cannabis sativa GN=CBDAS2 PE=2 SV=1 1069 1451 3.0E-10
sp|Q9FZC8|FOX5_ARATH Flavin-dependent oxidoreductase FOX5 OS=Arabidopsis thaliana GN=FOX5 PE=3 SV=1 990 1200 3.0E-10
sp|A6P6V9|CBDAS_CANSA Cannabidiolic acid synthase OS=Cannabis sativa GN=CBDAS PE=1 SV=1 1069 1451 4.0E-10
sp|A6P6W1|CASL2_CANSA Cannabidiolic acid synthase-like 2 OS=Cannabis sativa GN=CBDAS3 PE=2 SV=1 1069 1451 5.0E-10
sp|P04072|THET_THEVU Thermitase OS=Thermoactinomyces vulgaris PE=1 SV=1 155 265 5.0E-10
sp|Q8GTB6|THCAS_CANSA Tetrahydrocannabinolic acid synthase OS=Cannabis sativa PE=1 SV=1 995 1176 6.0E-10
sp|I3R794|HLY_HALMT Halolysin OS=Haloferax mediterranei (strain ATCC 33500 / DSM 1411 / JCM 8866 / NBRC 14739 / NCIMB 2177 / R-4) GN=hly PE=1 SV=1 159 301 4.0E-09
sp|Q14703|MBTP1_HUMAN Membrane-bound transcription factor site-1 protease OS=Homo sapiens GN=MBTPS1 PE=1 SV=1 143 297 4.0E-09
sp|Q9Z2A8|MBTP1_CRIGR Membrane-bound transcription factor site-1 protease OS=Cricetulus griseus GN=MBTPS1 PE=1 SV=2 143 297 5.0E-09
sp|Q9WTZ3|MBTP1_RAT Membrane-bound transcription factor site-1 protease OS=Rattus norvegicus GN=Mbtps1 PE=1 SV=2 143 297 5.0E-09
sp|Q9WTZ2|MBTP1_MOUSE Membrane-bound transcription factor site-1 protease OS=Mus musculus GN=Mbtps1 PE=1 SV=1 143 297 5.0E-09
sp|Q796Y5|YGAK_BACSU Uncharacterized FAD-linked oxidoreductase YgaK OS=Bacillus subtilis (strain 168) GN=ygaK PE=3 SV=4 994 1178 1.0E-08
sp|O06997|YVDP_BACSU Uncharacterized FAD-linked oxidoreductase YvdP OS=Bacillus subtilis (strain 168) GN=yvdP PE=1 SV=1 994 1451 7.0E-08
sp|P43485|MCRA_STRLA Mitomycin radical oxidase OS=Streptomyces lavendulae GN=mcrA PE=1 SV=2 984 1215 9.0E-08
sp|Q0WUG6|SBT61_ARATH Subtilisin-like protease SBT6.1 OS=Arabidopsis thaliana GN=SBT6.1 PE=1 SV=1 141 297 2.0E-07
sp|Q07596|NISP_LACLL Nisin leader peptide-processing serine protease NisP OS=Lactococcus lactis subsp. lactis GN=nisP PE=1 SV=1 156 295 2.0E-07
sp|D7UQ40|SOL5_ALTSO Bifunctional solanapyrone synthase OS=Alternaria solani GN=sol5 PE=1 SV=1 1081 1178 3.0E-07
sp|P58502|TKSU_THEKO Tk-subtilisin OS=Thermococcus kodakarensis (strain ATCC BAA-918 / JCM 12380 / KOD1) GN=TK1675 PE=1 SV=1 159 234 9.0E-07
sp|P30986|RETO_ESCCA Reticuline oxidase OS=Eschscholzia californica GN=BBE1 PE=1 SV=1 1003 1451 2.0E-06
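
The hits fall into two groups: proteases aligning to the N-terminal part of the protein and FAD-linked oxidoreductases aligning to the C-terminal part. The sketch below is one possible way to make that explicit. It assumes each row ends with the fields start, end and E-value, as laid out in the table; `parse_hit` and `overlapping_domains` are hypothetical helper names, and the domain coordinates are copied from the PFAM Domains table of this record.

```python
import re

# A row ends with "start end e-value"; everything between the ID and those
# fields is treated as the description.
ROW_RE = re.compile(
    r"^(?P<id>sp\|\S+)\s+(?P<desc>.*?)\s+"
    r"(?P<start>\d+)\s+(?P<end>\d+)\s+(?P<evalue>\d+(?:\.\d+)?E[-+]?\d+)$"
)

# Domain coordinates copied from the PFAM Domains table of this record.
PFAM_DOMAINS = {
    "Peptidase_S8": (155, 567),
    "fn3_5": (608, 721),
    "FAD_binding_4": (995, 1142),
    "BBE": (1422, 1452),
}


def parse_hit(row: str) -> dict:
    """Split one hit row into ID, start, end and E-value."""
    m = ROW_RE.match(row.strip())
    if m is None:
        raise ValueError(f"unrecognised row: {row!r}")
    return {
        "id": m["id"],
        "start": int(m["start"]),
        "end": int(m["end"]),
        "evalue": float(m["evalue"]),
    }


def overlapping_domains(start: int, end: int) -> list:
    """Names of the Pfam domains whose range intersects [start, end]."""
    return [
        name
        for name, (d_start, d_end) in PFAM_DOMAINS.items()
        if start <= d_end and end >= d_start
    ]


# Two rows copied from the table above.
rows = [
    "sp|P29141|SUBV_BACSU Minor extracellular protease vpr OS=Bacillus subtilis (strain 168) GN=vpr PE=1 SV=1 152 637 1.0E-19",
    "sp|P08159|HDNO_ARTOX 6-hydroxy-D-nicotine oxidase OS=Arthrobacter oxydans PE=1 SV=2 995 1258 4.0E-17",
]

hits = [parse_hit(r) for r in rows]
for hit in hits:
    print(hit["id"], overlapping_domains(hit["start"], hit["end"]))
```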

GO

GO Term Description Terminal node
GO:0006508 proteolysis Yes
GO:0050660 flavin adenine dinucleotide binding Yes
GO:0005618 cell wall Yes
GO:0055114 oxidation-reduction process Yes
GO:0016020 membrane Yes
GO:0016491 oxidoreductase activity Yes
GO:0004252 serine-type endopeptidase activity Yes
GO:0071704 organic substance metabolic process No
GO:0043170 macromolecule metabolic process No
GO:0004175 endopeptidase activity No
GO:0006807 nitrogen compound metabolic process No
GO:0044238 primary metabolic process No
GO:1901265 nucleoside phosphate binding No
GO:0005488 binding No
GO:0008152 metabolic process No
GO:0008150 biological_process No
GO:0036094 small molecule binding No
GO:0003674 molecular_function No
GO:0008236 serine-type peptidase activity No
GO:0005575 cellular_component No
GO:0008233 peptidase activity No
GO:1901363 heterocyclic compound binding No
GO:0000166 nucleotide binding No
GO:0043167 ion binding No
GO:0043168 anion binding No
GO:0003824 catalytic activity No
GO:0016787 hydrolase activity No
GO:0019538 protein metabolic process No
GO:0044464 cell part No
GO:0070011 peptidase activity, acting on L-amino acid peptides No
GO:0030312 external encapsulating structure No
GO:0050662 coenzyme binding No
GO:0140096 catalytic activity, acting on a protein No
GO:0017171 serine hydrolase activity No
GO:0048037 cofactor binding No
GO:1901564 organonitrogen compound metabolic process No
GO:0097159 organic cyclic compound binding No

SignalP

SignalP signal predicted: Yes
Location (based on Ymax): 1 - 20
D score (significance: > 0.45): 0.45
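
To illustrate what the predicted location means for the sequence, the sketch below slices the predicted signal peptide off the start of the protein. It assumes the location is a 1-based, inclusive residue range; the variable names are hypothetical, and the sequence fragment is the first line of the protein sequence given in the Sequences section of this record.

```python
# First line of the protein sequence of Hirsu2|11115.
protein_start = (
    "MKQLLVALAALAAASPVAAGTLPRTVAGAYVVDLDDEVDVNGFAEALQASVQARIRRRMDYELYKGVAIELDGPP"
)

# Predicted signal peptide location (assumed 1-based and inclusive).
sp_start, sp_end = 1, 20

# Convert to Python's 0-based, end-exclusive slicing.
signal_peptide = protein_start[sp_start - 1:sp_end]
mature_start = protein_start[sp_end:]

print("signal peptide:", signal_peptide)
print("mature protein begins:", mature_start[:10])
```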

Transmembrane Domains

(None)

Transcription Factor Class

(None)

Expression data

No expression data available for this genome

Sequences

Type of sequence Sequence
Locus: available as a GenBank file download. It contains the gene with 5 kb flanks (if sufficient flanking sequence is available), for use in cloning design programs. NOTE: features (genes or exons) that are only partially contained within the sequence are completely excluded.
Protein >Hirsu2|11115
MKQLLVALAALAAASPVAAGTLPRTVAGAYVVDLDDEVDVNGFAEALQASVQARIRRRMDYELYKGVAIELDGPP
RANQVAAMAGVRKMWPVEEMSMGADVAGSPRGQGQDGRGRRVKRSLGRGRVPANDSTAEASRAPHAMMQIDRLHG
RGITGKGSRIAIIDTGVDYEHPALGGCFGRGCVVAFGADLVGGEPTPRDCHGHGTVVAGLIAARRNEFGVAGAAP
GAELGMYRVSCDGSFATDVLVDALYRAQRDGATIVNLSAGTAGRWESGVLAEAASRLAAHGIVCVRSAGNNGQAA
VFSLDGLGPGVVYAGSMNSAVVPDLQLQASYAVDGGGDDVPFPFWDSDPLTDFTGAPMELWPGSRGPDAPDDADC
GPLPDDTPDLADRIVLVRLCQPFEARTAARLVARGARRVLAYHKPADGPPVAAYFPEPPEGIVAFSMVEADVARL
LLDAAGAGRKVLVTVPHPRDAPLRYRESPDEAPGAVTAFSSWGPGWDLALRPSLTAIGTYVLSTSWRDQNPAGFE
QSSGTSCAAPQVAAIVALIAEARGSLDPAVMESLLVTHAQPQLYHDGDNFLPHLAPVAQQGGGLVRAYDAAYATT
VVDPPGLNFNDTEHFAPSREFSLQNLGHGDVTYTLSHVPAVTVYLFTANGSTAAYPGTLEPVEAPASLDMRHGIL
TIPSKQSATVRVTAAPPERLDARRLALWSGWIAVNGTDGSSLSIPYQGFSGSIREHQVLRPDAASLAYQNGSVGE
DSAIKLPPQGSADDSELILHIDATLGSPLVRADVERSASDDDAAESRKAIGQLHGFPFRHMPPGTFDDPGRPDPD
LYNLTHFKWNGLLDTGSYAPEGSYKLVVRALRIFGDPEKKDDWDMSETPRFHIAYAAGSTTPAPEPGDKSAPGRP
AGSEAGPCKSTDFPGTDKLVLNSETGYSAQVPAMAAAVCHGERADQAACGRVEAAFHANDPVSNAWSQYNGDACL
PRSKLLETDAQAGCKRNGYPRFVANATTREHVQDAIRYARENAVRLSIKNTGHDYLGRFSGPNSLSIWTHHMDSI
VYNKAGFTPAGCDAPVRDTVKIGAGVQTGALYRHLQRLDEARTFVGGRAETVGVAGFAMGGGHSILTGQYGLGAD
QVVEMELVLADGRALVVSGCAHRDLFWALRGGGGSTFGVVTSLIIKTFPMPRIVAAEFDVAAAAGPDVDARARLV
DASTVVWTQLPKLVDQGVNGFIHASASVDDPSAKTPSARIRCSLAMVGPDASEQSLQRAIDGLLDAVSADKDGKR
TRPVRVTYRHEPQNYPTVLEYISSLEMPDPAGQNMFFGSKLLSGEKLVGGAHYLRKSTDKALTSPTSSGIGAIIV
GGPGLSDAARRKEASAHPGWEHAVASMFTSASFNAKDRPSRRAAVDQVDGTLQPLRELDPESGSHASEAFVFEPQ
WRKAYWGANYDRLRQIKKDVDPADVFCCRRCVGNEAYKQNRLRLCTKTATEKADDQAGRDEEDGQGPGQ*
Coding >Hirsu2|11115
ATGAAGCAACTGCTGGTCGCTCTCGCGGCGCTCGCTGCCGCGTCTCCCGTCGCGGCCGGGACTCTGCCGCGGACC
GTGGCCGGCGCGTACGTCGTCGATCTCGACGATGAAGTTGACGTCAACGGCTTCGCCGAAGCGTTGCAGGCGAGC
GTCCAGGCGCGCATCCGCCGTCGGATGGACTACGAGCTGTACAAGGGCGTCGCCATCGAGCTCGACGGCCCGCCC
CGGGCCAACCAGGTGGCGGCCATGGCCGGGGTCCGGAAGATGTGGCCGGTCGAGGAGATGAGCATGGGGGCCGAC
GTCGCCGGCTCCCCGCGCGGGCAGGGGCAAGACGGACGCGGCCGGCGAGTGAAGCGGTCCCTGGGCCGAGGACGG
GTCCCGGCCAACGACAGCACGGCAGAGGCATCGCGGGCGCCGCACGCCATGATGCAGATCGACAGGCTTCACGGC
AGGGGGATCACGGGCAAGGGGTCCAGGATCGCCATCATCGACACGGGGGTCGACTACGAGCACCCGGCGCTCGGC
GGCTGCTTCGGCCGCGGCTGCGTCGTCGCCTTCGGGGCCGACCTCGTCGGGGGCGAGCCGACGCCGAGGGACTGC
CACGGCCACGGGACCGTCGTCGCGGGCCTGATCGCGGCGCGGCGCAACGAGTTCGGCGTCGCCGGCGCGGCGCCG
GGCGCCGAGCTCGGCATGTACCGCGTCTCGTGCGACGGCAGCTTCGCCACCGACGTCCTGGTCGACGCCCTGTAC
CGGGCCCAGCGCGACGGGGCCACCATCGTCAACTTGTCGGCCGGCACGGCCGGCCGGTGGGAAAGCGGCGTCTTG
GCCGAGGCGGCGTCGCGGCTCGCGGCCCACGGCATCGTCTGCGTCCGCAGCGCCGGCAACAACGGGCAGGCGGCC
GTCTTCTCGCTCGACGGCCTCGGGCCGGGCGTCGTCTACGCCGGCTCCATGAACAGCGCCGTCGTGCCGGATTTG
CAGCTGCAGGCCAGCTACGCCGTCGACGGCGGCGGCGACGACGTCCCCTTCCCCTTCTGGGACTCGGACCCCCTG
ACCGACTTCACCGGCGCCCCGATGGAGCTCTGGCCCGGCAGCCGGGGGCCCGACGCGCCCGACGACGCCGACTGC
GGCCCGCTCCCGGACGACACGCCCGACCTGGCCGACAGGATCGTCCTCGTCCGGCTCTGCCAGCCGTTCGAGGCC
CGGACCGCCGCCCGGCTGGTCGCCCGGGGCGCCCGGCGCGTGCTCGCGTACCACAAGCCCGCCGACGGGCCTCCC
GTCGCCGCCTACTTCCCGGAGCCGCCCGAGGGGATCGTGGCCTTCTCCATGGTCGAGGCCGACGTCGCCAGGCTA
CTGCTCGACGCCGCCGGCGCCGGCCGCAAGGTCCTCGTCACCGTGCCCCATCCGCGGGACGCGCCGCTCCGCTAT
CGGGAGTCCCCGGACGAGGCACCGGGGGCCGTCACGGCCTTCAGCTCCTGGGGCCCGGGCTGGGACCTGGCGCTC
AGGCCGTCGCTGACCGCCATCGGCACCTACGTTCTCTCGACCTCGTGGAGGGACCAGAACCCGGCCGGGTTCGAG
CAAAGCTCGGGGACTTCCTGTGCGGCGCCCCAGGTGGCCGCCATCGTGGCCCTGATCGCCGAGGCTCGCGGCTCC
CTTGACCCGGCCGTGATGGAGAGCCTCCTCGTCACGCACGCGCAGCCGCAGCTCTACCACGACGGAGACAACTTT
CTCCCCCATCTGGCCCCGGTCGCCCAGCAGGGGGGCGGTCTGGTGCGCGCCTACGATGCTGCGTATGCGACGACA
GTCGTCGATCCGCCCGGCCTCAACTTCAACGACACGGAGCACTTCGCCCCCTCGCGAGAGTTCTCGCTGCAGAAC
CTCGGCCACGGCGACGTGACTTACACTCTCTCCCACGTCCCGGCCGTCACGGTGTATCTCTTCACGGCCAACGGC
TCGACGGCCGCTTATCCGGGCACTCTCGAACCCGTCGAGGCCCCGGCCTCTCTCGACATGAGGCACGGCATCCTC
ACCATACCCTCGAAGCAGTCGGCCACCGTCCGCGTAACCGCGGCCCCTCCAGAGCGCCTTGACGCCCGGCGTCTT
GCCCTATGGTCTGGCTGGATCGCGGTCAACGGAACGGATGGCTCGTCGCTGTCTATCCCCTATCAGGGCTTCTCG
GGATCCATCCGGGAGCATCAAGTTCTGCGCCCCGACGCGGCCTCGCTCGCATACCAGAACGGGTCGGTGGGCGAG
GATTCTGCGATCAAGCTTCCGCCGCAGGGCTCTGCCGACGACTCCGAGCTCATCCTGCACATCGACGCAACTCTC
GGCTCTCCTCTCGTGCGCGCCGACGTTGAGCGCTCGGCATCCGACGACGACGCCGCCGAAAGTCGCAAAGCCATC
GGCCAGCTGCACGGCTTCCCGTTCCGCCACATGCCCCCCGGCACATTCGACGACCCCGGTCGGCCTGATCCCGAT
CTGTACAATCTCACCCACTTCAAGTGGAACGGCCTGCTCGATACGGGCAGCTATGCCCCCGAGGGATCCTACAAG
CTGGTCGTGCGAGCCCTGAGGATCTTCGGCGACCCCGAAAAGAAGGACGACTGGGACATGAGCGAGACGCCGCGG
TTTCACATCGCGTATGCCGCCGGGTCGACGACGCCCGCGCCGGAGCCAGGCGACAAGTCGGCTCCCGGGCGGCCC
GCCGGGAGCGAGGCCGGCCCGTGCAAGTCGACCGACTTCCCCGGCACGGACAAGCTTGTGCTGAACAGCGAGACG
GGATACAGTGCCCAAGTGCCGGCGATGGCCGCGGCTGTCTGCCACGGGGAGCGGGCTGACCAGGCGGCGTGCGGA
AGAGTCGAGGCTGCGTTTCATGCCAACGACCCGGTCTCGAACGCGTGGAGCCAGTACAACGGTGACGCGTGTCTG
CCTCGCAGCAAGCTCCTCGAGACAGACGCGCAGGCGGGGTGTAAGCGGAACGGATACCCCAGATTCGTCGCCAAC
GCGACCACACGGGAGCACGTCCAGGATGCCATCAGATACGCCAGGGAGAATGCGGTTCGTCTGAGCATCAAGAAC
ACCGGACACGACTATCTCGGCCGCTTCAGCGGGCCCAACTCCCTGTCCATCTGGACGCACCACATGGACTCGATT
GTCTACAACAAGGCCGGCTTCACTCCCGCCGGCTGCGATGCCCCGGTCAGAGACACGGTCAAGATCGGCGCCGGC
GTCCAGACGGGCGCGCTTTACAGGCATCTCCAGCGGCTCGACGAGGCCCGGACCTTTGTCGGCGGCCGCGCCGAG
ACGGTCGGCGTCGCCGGATTCGCCATGGGCGGCGGCCACTCGATCCTGACGGGCCAGTACGGGCTCGGCGCCGAC
CAAGTCGTCGAGATGGAGCTCGTCCTCGCCGACGGCCGGGCCCTCGTCGTCAGCGGGTGCGCCCATCGCGACCTC
TTCTGGGCCCTGCGCGGTGGCGGCGGCTCGACCTTTGGTGTCGTCACCTCCCTCATCATCAAGACCTTCCCGATG
CCCAGGATTGTCGCGGCCGAGTTCGACGTCGCCGCCGCCGCCGGCCCCGACGTCGACGCCAGGGCTCGCCTCGTC
GACGCCAGCACCGTCGTCTGGACACAGCTGCCCAAGCTGGTCGACCAGGGCGTCAACGGATTCATCCACGCCTCG
GCGAGCGTCGACGACCCGTCCGCCAAGACACCCTCCGCCCGCATTCGCTGTTCGCTCGCAATGGTAGGACCAGAC
GCCAGCGAACAAAGTCTGCAGAGGGCGATCGACGGCCTCCTCGACGCCGTGTCGGCAGACAAGGACGGCAAGCGG
ACGCGGCCGGTGCGTGTCACGTACCGCCACGAGCCGCAGAACTATCCCACTGTCCTGGAGTATATCAGCAGCCTC
GAAATGCCCGACCCGGCTGGACAAAACATGTTCTTTGGCTCCAAGCTGCTCAGCGGCGAGAAACTCGTCGGCGGC
GCGCACTATCTGCGGAAGAGCACCGACAAGGCTCTCACCTCACCTACCTCTTCCGGAATCGGCGCCATCATCGTT
GGCGGACCGGGCCTGAGCGATGCCGCGCGACGAAAGGAGGCGTCCGCTCACCCGGGCTGGGAACACGCCGTGGCG
TCGATGTTCACCTCGGCAAGCTTCAACGCCAAGGACAGGCCGTCTCGGAGGGCGGCTGTAGACCAGGTCGACGGC
ACCCTCCAGCCGTTGAGGGAGCTGGACCCGGAGTCGGGAAGCCACGCCAGCGAGGCCTTTGTCTTCGAGCCCCAG
TGGCGCAAGGCCTACTGGGGCGCCAACTACGACAGGCTTAGGCAGATCAAGAAGGACGTCGACCCGGCCGACGTC
TTCTGTTGCCGCCGCTGCGTCGGCAACGAGGCCTACAAGCAGAACCGTCTCAGGCTCTGCACGAAGACCGCCACG
GAGAAGGCTGATGACCAGGCTGGCCGGGACGAGGAGGACGGTCAGGGCCCGGGCCAGTGA
Transcript >Hirsu2|11115
ATGAAGCAACTGCTGGTCGCTCTCGCGGCGCTCGCTGCCGCGTCTCCCGTCGCGGCCGGGACTCTGCCGCGGACC
GTGGCCGGCGCGTACGTCGTCGATCTCGACGATGAAGTTGACGTCAACGGCTTCGCCGAAGCGTTGCAGGCGAGC
GTCCAGGCGCGCATCCGCCGTCGGATGGACTACGAGCTGTACAAGGGCGTCGCCATCGAGCTCGACGGCCCGCCC
CGGGCCAACCAGGTGGCGGCCATGGCCGGGGTCCGGAAGATGTGGCCGGTCGAGGAGATGAGCATGGGGGCCGAC
GTCGCCGGCTCCCCGCGCGGGCAGGGGCAAGACGGACGCGGCCGGCGAGTGAAGCGGTCCCTGGGCCGAGGACGG
GTCCCGGCCAACGACAGCACGGCAGAGGCATCGCGGGCGCCGCACGCCATGATGCAGATCGACAGGCTTCACGGC
AGGGGGATCACGGGCAAGGGGTCCAGGATCGCCATCATCGACACGGGGGTCGACTACGAGCACCCGGCGCTCGGC
GGCTGCTTCGGCCGCGGCTGCGTCGTCGCCTTCGGGGCCGACCTCGTCGGGGGCGAGCCGACGCCGAGGGACTGC
CACGGCCACGGGACCGTCGTCGCGGGCCTGATCGCGGCGCGGCGCAACGAGTTCGGCGTCGCCGGCGCGGCGCCG
GGCGCCGAGCTCGGCATGTACCGCGTCTCGTGCGACGGCAGCTTCGCCACCGACGTCCTGGTCGACGCCCTGTAC
CGGGCCCAGCGCGACGGGGCCACCATCGTCAACTTGTCGGCCGGCACGGCCGGCCGGTGGGAAAGCGGCGTCTTG
GCCGAGGCGGCGTCGCGGCTCGCGGCCCACGGCATCGTCTGCGTCCGCAGCGCCGGCAACAACGGGCAGGCGGCC
GTCTTCTCGCTCGACGGCCTCGGGCCGGGCGTCGTCTACGCCGGCTCCATGAACAGCGCCGTCGTGCCGGATTTG
CAGCTGCAGGCCAGCTACGCCGTCGACGGCGGCGGCGACGACGTCCCCTTCCCCTTCTGGGACTCGGACCCCCTG
ACCGACTTCACCGGCGCCCCGATGGAGCTCTGGCCCGGCAGCCGGGGGCCCGACGCGCCCGACGACGCCGACTGC
GGCCCGCTCCCGGACGACACGCCCGACCTGGCCGACAGGATCGTCCTCGTCCGGCTCTGCCAGCCGTTCGAGGCC
CGGACCGCCGCCCGGCTGGTCGCCCGGGGCGCCCGGCGCGTGCTCGCGTACCACAAGCCCGCCGACGGGCCTCCC
GTCGCCGCCTACTTCCCGGAGCCGCCCGAGGGGATCGTGGCCTTCTCCATGGTCGAGGCCGACGTCGCCAGGCTA
CTGCTCGACGCCGCCGGCGCCGGCCGCAAGGTCCTCGTCACCGTGCCCCATCCGCGGGACGCGCCGCTCCGCTAT
CGGGAGTCCCCGGACGAGGCACCGGGGGCCGTCACGGCCTTCAGCTCCTGGGGCCCGGGCTGGGACCTGGCGCTC
AGGCCGTCGCTGACCGCCATCGGCACCTACGTTCTCTCGACCTCGTGGAGGGACCAGAACCCGGCCGGGTTCGAG
CAAAGCTCGGGGACTTCCTGTGCGGCGCCCCAGGTGGCCGCCATCGTGGCCCTGATCGCCGAGGCTCGCGGCTCC
CTTGACCCGGCCGTGATGGAGAGCCTCCTCGTCACGCACGCGCAGCCGCAGCTCTACCACGACGGAGACAACTTT
CTCCCCCATCTGGCCCCGGTCGCCCAGCAGGGGGGCGGTCTGGTGCGCGCCTACGATGCTGCGTATGCGACGACA
GTCGTCGATCCGCCCGGCCTCAACTTCAACGACACGGAGCACTTCGCCCCCTCGCGAGAGTTCTCGCTGCAGAAC
CTCGGCCACGGCGACGTGACTTACACTCTCTCCCACGTCCCGGCCGTCACGGTGTATCTCTTCACGGCCAACGGC
TCGACGGCCGCTTATCCGGGCACTCTCGAACCCGTCGAGGCCCCGGCCTCTCTCGACATGAGGCACGGCATCCTC
ACCATACCCTCGAAGCAGTCGGCCACCGTCCGCGTAACCGCGGCCCCTCCAGAGCGCCTTGACGCCCGGCGTCTT
GCCCTATGGTCTGGCTGGATCGCGGTCAACGGAACGGATGGCTCGTCGCTGTCTATCCCCTATCAGGGCTTCTCG
GGATCCATCCGGGAGCATCAAGTTCTGCGCCCCGACGCGGCCTCGCTCGCATACCAGAACGGGTCGGTGGGCGAG
GATTCTGCGATCAAGCTTCCGCCGCAGGGCTCTGCCGACGACTCCGAGCTCATCCTGCACATCGACGCAACTCTC
GGCTCTCCTCTCGTGCGCGCCGACGTTGAGCGCTCGGCATCCGACGACGACGCCGCCGAAAGTCGCAAAGCCATC
GGCCAGCTGCACGGCTTCCCGTTCCGCCACATGCCCCCCGGCACATTCGACGACCCCGGTCGGCCTGATCCCGAT
CTGTACAATCTCACCCACTTCAAGTGGAACGGCCTGCTCGATACGGGCAGCTATGCCCCCGAGGGATCCTACAAG
CTGGTCGTGCGAGCCCTGAGGATCTTCGGCGACCCCGAAAAGAAGGACGACTGGGACATGAGCGAGACGCCGCGG
TTTCACATCGCGTATGCCGCCGGGTCGACGACGCCCGCGCCGGAGCCAGGCGACAAGTCGGCTCCCGGGCGGCCC
GCCGGGAGCGAGGCCGGCCCGTGCAAGTCGACCGACTTCCCCGGCACGGACAAGCTTGTGCTGAACAGCGAGACG
GGATACAGTGCCCAAGTGCCGGCGATGGCCGCGGCTGTCTGCCACGGGGAGCGGGCTGACCAGGCGGCGTGCGGA
AGAGTCGAGGCTGCGTTTCATGCCAACGACCCGGTCTCGAACGCGTGGAGCCAGTACAACGGTGACGCGTGTCTG
CCTCGCAGCAAGCTCCTCGAGACAGACGCGCAGGCGGGGTGTAAGCGGAACGGATACCCCAGATTCGTCGCCAAC
GCGACCACACGGGAGCACGTCCAGGATGCCATCAGATACGCCAGGGAGAATGCGGTTCGTCTGAGCATCAAGAAC
ACCGGACACGACTATCTCGGCCGCTTCAGCGGGCCCAACTCCCTGTCCATCTGGACGCACCACATGGACTCGATT
GTCTACAACAAGGCCGGCTTCACTCCCGCCGGCTGCGATGCCCCGGTCAGAGACACGGTCAAGATCGGCGCCGGC
GTCCAGACGGGCGCGCTTTACAGGCATCTCCAGCGGCTCGACGAGGCCCGGACCTTTGTCGGCGGCCGCGCCGAG
ACGGTCGGCGTCGCCGGATTCGCCATGGGCGGCGGCCACTCGATCCTGACGGGCCAGTACGGGCTCGGCGCCGAC
CAAGTCGTCGAGATGGAGCTCGTCCTCGCCGACGGCCGGGCCCTCGTCGTCAGCGGGTGCGCCCATCGCGACCTC
TTCTGGGCCCTGCGCGGTGGCGGCGGCTCGACCTTTGGTGTCGTCACCTCCCTCATCATCAAGACCTTCCCGATG
CCCAGGATTGTCGCGGCCGAGTTCGACGTCGCCGCCGCCGCCGGCCCCGACGTCGACGCCAGGGCTCGCCTCGTC
GACGCCAGCACCGTCGTCTGGACACAGCTGCCCAAGCTGGTCGACCAGGGCGTCAACGGATTCATCCACGCCTCG
GCGAGCGTCGACGACCCGTCCGCCAAGACACCCTCCGCCCGCATTCGCTGTTCGCTCGCAATGGTAGGACCAGAC
GCCAGCGAACAAAGTCTGCAGAGGGCGATCGACGGCCTCCTCGACGCCGTGTCGGCAGACAAGGACGGCAAGCGG
ACGCGGCCGGTGCGTGTCACGTACCGCCACGAGCCGCAGAACTATCCCACTGTCCTGGAGTATATCAGCAGCCTC
GAAATGCCCGACCCGGCTGGACAAAACATGTTCTTTGGCTCCAAGCTGCTCAGCGGCGAGAAACTCGTCGGCGGC
GCGCACTATCTGCGGAAGAGCACCGACAAGGCTCTCACCTCACCTACCTCTTCCGGAATCGGCGCCATCATCGTT
GGCGGACCGGGCCTGAGCGATGCCGCGCGACGAAAGGAGGCGTCCGCTCACCCGGGCTGGGAACACGCCGTGGCG
TCGATGTTCACCTCGGCAAGCTTCAACGCCAAGGACAGGCCGTCTCGGAGGGCGGCTGTAGACCAGGTCGACGGC
ACCCTCCAGCCGTTGAGGGAGCTGGACCCGGAGTCGGGAAGCCACGCCAGCGAGGCCTTTGTCTTCGAGCCCCAG
TGGCGCAAGGCCTACTGGGGCGCCAACTACGACAGGCTTAGGCAGATCAAGAAGGACGTCGACCCGGCCGACGTC
TTCTGTTGCCGCCGCTGCGTCGGCAACGAGGCCTACAAGCAGAACCGTCTCAGGCTCTGCACGAAGACCGCCACG
GAGAAGGCTGATGACCAGGCTGGCCGGGACGAGGAGGACGGTCAGGGCCCGGGCCAGTGA
Gene >Hirsu2|11115
ATGAAGCAACTGCTGGTCGCTCTCGCGGCGCTCGCTGCCGCGTCTCCCGTCGCGGCCGGGACTCTGCCGCGGACC
GTGGCCGGCGCGTACGTCGTCGATCTCGACGATGAAGTTGTAAGTCTGGGAAAAGATCGAAAAAGCAGGCGCAGC
GTCGCTGACCAGCCCGGATCTCGCCAGGACGTCAACGGCTTCGCCGAAGCGTTGCAGGCGAGCGTCCAGGCGCGC
ATCCGCCGTCGGATGGACTACGAGCTGTACAAGGGCGTCGCCATCGAGCTCGACGGCCCGCCCCGGGCCAACCAG
GTGGCGGCCATGGCCGGGGTCCGGAAGATGTGGCCGGTCGAGGAGATGAGCATGGGGGCCGACGTCGCCGGCTCC
CCGCGCGGGCAGGGGCAAGACGGACGCGGCCGGCGAGTGAAGCGGTCCCTGGGCCGAGGACGGGTCCCGGCCAAC
GACAGCACGGCAGAGGCATCGCGGGCGCCGCACGCCATGATGCAGATCGACAGGCTTCACGGCAGGGGGATCACG
GGCAAGGGGTCCAGGATCGCCATCATCGACACGGGGGTCAGTCGGCCGTCTCGTTCGCCTTGTTCCCGGGGCAGA
GACTGACCGTGTCCAGGTCGACTACGAGCACCCGGCGCTCGGCGGCTGCTTCGGCCGCGGCTGCGTCGTCGCCTT
CGGGGCCGACCTCGTCGGGGGCGAGCCGACGCCGAGGGACTGCCACGGCCACGGGACCGTCGTCGCGGGCCTGAT
CGCGGCGCGGCGCAACGAGTTCGGCGTCGCCGGCGCGGCGCCGGGCGCCGAGCTCGGCATGTACCGCGTCTCGTG
CGACGGCAGCTTCGCCACCGACGTCCTGGTCGACGCCCTGTACCGGGCCCAGCGCGACGGGGCCACCATCGTCAA
CTTGTCGGCCGGCACGGCCGGCCGGTGGGAAAGCGGCGTCTTGGCCGAGGCGGCGTCGCGGCTCGCGGCCCACGG
CATCGTCTGCGTCCGCAGCGCCGGCAACAACGGGCAGGCGGCCGTCTTCTCGCTCGACGGCCTCGGGCCGGGCGT
CGTCTACGCCGGCTCCATGAACAGCGCCGTCGTGCCGGATTTGCAGCTGCAGGCCAGCTACGCCGTCGACGGCGG
CGGCGACGACGTCCCCTTCCCCTTCTGGGACTCGGACCCCCTGACCGACTTCACCGGCGCCCCGATGGAGCTCTG
GCCCGGCAGCCGGGGGCCCGACGCGCCCGACGACGCCGACTGCGGCCCGCTCCCGGACGACACGCCCGACCTGGC
CGACAGGATCGTCCTCGTCCGGCTCTGCCAGCCGTTCGAGGCCCGGACCGCCGCCCGGCTGGTCGCCCGGGGCGC
CCGGCGCGTGCTCGCGTACCACAAGCCCGCCGACGGGCCTCCCGTCGCCGCCTACTTCCCGGAGCCGCCCGAGGG
GATCGTGGCCTTCTCCATGGTCGAGGCCGACGTCGCCAGGCTACTGCTCGACGCCGCCGGCGCCGGCCGCAAGGT
CCTCGTCACCGTGCCCCATCCGCGGGACGCGCCGCTCCGCTATCGGGAGTCCCCGGACGAGGCACCGGGGGCCGT
CACGGCCTTCAGCTCCTGGGGCCCGGGCTGGGACCTGGCGCTCAGGCCGTCGCTGACCGCCATCGGCACCTACGT
TCTCTCGACCTCGTGGAGGGACCAGAACCCGGCCGGGTTCGAGCAAAGCTCGGGGACTTCCTGTGCGGCGCCCCA
GGTGGCCGCCATCGTGGCCCTGATCGCCGAGGCTCGCGGCTCCCTTGACCCGGCCGTGATGGAGAGCCTCCTCGT
CACGCACGCGCAGCCGCAGCTCTACCACGACGGAGACAACTTTCTCCCCCATCTGGCCCCGGTCGCCCAGCAGGG
GGGCGGTCTGGTGCGCGCCTACGATGCTGCGTATGCGACGACAGTCGTCGATCCGCCCGGCCTCAACTTCAACGA
CACGGAGCACTTCGCCCCCTCGCGAGAGTTCTCGCTGCAGAACCTCGGCCACGGCGACGTGACTTACACTCTCTC
CCACGTCCCGGCCGTCACGGTGTATCTCTTCACGGCCAACGGCTCGACGGCCGCTTATCCGGGCACTCTCGAACC
CGTCGAGGCCCCGGCCTCTCTCGACATGAGGCACGGCATCCTCACCATACCCTCGAAGCAGTCGGCCACCGTCCG
CGTAACCGCGGCCCCTCCAGAGCGCCTTGACGCCCGGCGTCTTGCCCTATGGTCTGGCTGGATCGCGGTCAACGG
AACGGATGGCTCGTCGCTGTCTATCCCCTATCAGGGCTTCTCGGGATCCATCCGGGAGCATCAAGTTCTGCGCCC
CGACGCGGCCTCGCTCGCATACCAGAACGGGTCGGTGGGCGAGGATTCTGCGATCAAGCTTCCGCCGCAGGGCTC
TGCCGACGACTCCGAGCTCATCCTGCACATCGACGCAACTCTCGGCTCTCCTCTCGTGCGCGCCGACGTTGAGCG
CTCGGCATCCGACGACGACGCCGCCGAAAGTCGCAAAGCCATCGGCCAGCTGCACGGCTTCCCGTTCCGCCACAT
GCCCCCCGGCACATTCGACGACCCCGGTCGGCCTGATCCCGATCTGTACAATCTCACCCACTTCAAGTGGAACGG
CCTGCTCGATACGGGCAGCTATGCCCCCGAGGGATCCTACAAGCTGGTCGTGCGAGCCCTGAGGATCTTCGGCGA
CCCCGAAAAGAAGGACGACTGGGACATGAGCGAGACGCCGCGGTTTCACATCGCGTATGCCGCCGGGTCGACGAC
GCCCGCGCCGGAGCCAGGCGACAAGTCGGCTCCCGGGCGGCCCGCCGGGAGCGAGGCCGGCCCGTGCAAGTCGAC
CGACTTCCCCGGCACGGACAAGCTTGTGCTGAACAGCGAGACGGGATACAGTGCCCAAGTGCCGGCGATGGCCGC
GGCTGTCTGCCACGGGGAGCGGGCTGACCAGGCGGCGTGCGGAAGAGTCGAGGCTGCGTTTCATGCGTGAGTGCT
CTCTCGTGTCCCCGGCCGCGAGGGCTCCCTCTTGACTCCCCTGCCGCTTCCAGCAACGACCCGGTCTCGAACGCG
TGGAGCCAGTACAACGGTGACGCGTGTCTGCCTCGCAGCAAGCTCCTCGAGACAGACGCGCAGGCGGGGTGTAAG
CGGAACGGATACCCCAGATTCGTCGCCAACGCGACCACACGGGAGCACGTCCAGGATGCCATCAGATACGGTGAG
CACAAACCCCCTCACGTGTGGCCTCGCACTGCTCCTGACCGAACCGCACGCGCCTGCAGCCAGGGAGAATGCGGT
TCGTCTGAGCATCAAGAACACCGGACACGACTATCTCGGCCGCTTCAGCGGGCCCAACTCCCTGTCCATCTGGAC
GCACCACATGGACTCGATTGTCTACAACAAGGCCGGCTTCACTCCCGCCGGCTGCGATGCCCCGGTCAGAGACAC
GGTCAAGATCGGCGCCGGCGTCCAGACGGGCGCGCTTTACAGGCATCTCCAGCGGCTCGACGAGGCCCGGACCTT
TGTCGGCGGCCGCGCCGAGACGGTCGGCGTCGCCGGATTCGCCATGGGCGGCGGCCACTCGATCCTGACGGGCCA
GTACGGGCTCGGCGCCGACCAAGTCGTCGAGATGGAGCTCGTCCTCGCCGACGGCCGGGCCCTCGTCGTCAGCGG
GTGCGCCCATCGCGACCTCTTCTGGGCCCTGCGCGGTGTAAGCATTCCTCCCACCGAGAGGCTCGGATCGTCTGA
CAAGCGGCCTCGTGATAGGGCGGCGGCTCGACCTTTGGTGTCGTCACCTCCCTCATCATCAAGACCTTCCCGATG
CCCAGGATTGTCGCGGCCGAGTTCGACGTCGCCGCCGCCGCCGGCCCCGACGTCGACGCCAGGGCTCGCCTCGTC
GACGCCAGCACCGTCGTCTGGACACAGCTGCCCAAGCTGGTCGACCAGGGCGTCAACGGATTCATCCACGCCTCG
GCGAGCGTCGACGACCCGTCCGCCAAGACACCCTCCGCCCGCATTCGCTGTTCGCTCGCAATGGTAGGACCAGAC
GCCAGCGAACAAAGTCTGCAGAGGGCGATCGACGGCCTCCTCGACGCCGTGTCGGCAGACAAGGACGGCAAGCGG
ACGCGGCCGGTGCGTGTCACGTACCGCCACGAGCCGCAGAACTATCCCACTGTCCTGGAGTATATCAGCAGCCTC
GAAATGCCCGACCCGGCTGGACAAAACATGTTCTTTGGCTCCAAGCTGCTCAGCGGCGAGAAACTCGTCGGCGGC
GCGCACTATCTGCGGAAGAGCACCGACAAGGCTCTCACCTCACCTACCTCTTCCGGAATCGGCGCCATCATCGTT
GGCGGACCGGGCCTGAGCGATGCCGCGCGACGAAAGGAGGCGTCCGCTCACCCGGGCTGGGAACACGCCGTGGCG
TCGATGTGTACGCCCCCCCCCCAACCTCGACAGTGCCCCTTGAAAAGAGCATCTGCTGATACTAGCCCTCCCAGT
CACCTCGGCAAGCTTCAACGCCAAGGACAGGCCGTCTCGGAGGGCGGCTGTAGACCAGGTCGACGGCACCCTCCA
GCCGTTGAGGGAGCTGGACCCGGAGTCGGGAAGCCACGCCAGCGAGGTGAGGAGAGCCGCGTCCCTTTCGACGTG
AAGCCAAAGCAGGCTGACGAGAACGTCAATAAGGCCTTTGTCTTCGAGCCCCAGTGGCGCAAGGCCTACTGGGGC
GCCAACTACGACAGGCTTAGGCAGATCAAGAAGGACGTCGACCCGGCCGACGTCTTCTGTTGCCGCCGCTGCGTC
GGCAACGAGGCCTACAAGCAGAACCGTCTCAGGCTCTGCACGAAGACCGCCACGGAGAAGGCTGATGACCAGGCT
GGCCGGGACGAGGAGGACGGTCAGGGCCCGGGCCAGTGA
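
The coding and protein sequences above can be checked against each other by translation. The following is a minimal sketch that assumes the standard genetic code; `translate` is a hypothetical helper, not a function of the database. For brevity it translates only the first and last lines of the coding sequence. Each full line holds 75 bases, so every line starts in frame.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; trailing bases that do not fill a codon are ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the coding sequence of Hirsu2|11115.
cds_first_line = (
    "ATGAAGCAACTGCTGGTCGCTCTCGCGGCGCTCGCTGCCGCGTCTCCCGTCGCGGCCGGGACTCTGCCGCGGACC"
)
cds_last_line = "GAGAAGGCTGATGACCAGGCTGGCCGGGACGAGGAGGACGGTCAGGGCCCGGGCCAGTGA"

print(translate(cds_first_line))
print(translate(cds_last_line))
```

The two translations correspond to the first 25 and the last 20 characters of the protein sequence listed above, including the terminal "*" for the stop codon.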

© 2022 - Robin Ohm - Utrecht University - The Netherlands
