Protein ID | Hirsu2|131 |
Gene name | |
Location | Contig_102:32326..37630 |
Strand | + |
Gene length (bp) | 5304 |
Transcript length (bp) | 5055 |
Coding sequence length (bp) | 5055 |
Protein length (aa) | 1685 |
PFAM Domain ID | Short name | Long name | E-value | Start | End |
---|---|---|---|---|---|
PF00637 | Clathrin | Region in Clathrin and VPS | 1.4E-22 | 546 | 685 |
PF00637 | Clathrin | Region in Clathrin and VPS | 3.1E-23 | 696 | 833 |
PF00637 | Clathrin | Region in Clathrin and VPS | 5.0E-30 | 844 | 972 |
PF00637 | Clathrin | Region in Clathrin and VPS | 4.7E-27 | 988 | 1126 |
PF00637 | Clathrin | Region in Clathrin and VPS | 4.5E-31 | 1137 | 1275 |
PF00637 | Clathrin | Region in Clathrin and VPS | 1.7E-31 | 1282 | 1426 |
PF00637 | Clathrin | Region in Clathrin and VPS | 4.9E-31 | 1434 | 1573 |
PF01394 | Clathrin_propel | Clathrin propeller repeat | 5.1E-06 | 153 | 191 |
PF13838 | Clathrin_H_link | Clathrin-H-link | 5.0E-31 | 359 | 424 |
Swissprot ID | Swissprot Description | Start | End | E-value |
---|---|---|---|---|
sp|Q00610|CLH1_HUMAN | Clathrin heavy chain 1 OS=Homo sapiens GN=CLTC PE=1 SV=5 | 4 | 1684 | 0.0E+00 |
sp|Q0WNJ6|CLAH1_ARATH | Clathrin heavy chain 1 OS=Arabidopsis thaliana GN=CHC1 PE=1 SV=1 | 5 | 1639 | 0.0E+00 |
sp|Q0WLB5|CLAH2_ARATH | Clathrin heavy chain 2 OS=Arabidopsis thaliana GN=CHC2 PE=1 SV=1 | 5 | 1639 | 0.0E+00 |
sp|Q2QYW2|CLH2_ORYSJ | Clathrin heavy chain 2 OS=Oryza sativa subsp. japonica GN=Os12g0104800 PE=3 SV=1 | 5 | 1636 | 0.0E+00 |
sp|Q2RBN7|CLH1_ORYSJ | Clathrin heavy chain 1 OS=Oryza sativa subsp. japonica GN=Os11g0104900 PE=3 SV=1 | 5 | 1636 | 0.0E+00 |
Swissprot ID | Swissprot Description | Start | End | E-value |
---|---|---|---|---|
sp|Q00610|CLH1_HUMAN | Clathrin heavy chain 1 OS=Homo sapiens GN=CLTC PE=1 SV=5 | 4 | 1684 | 0.0E+00 |
sp|Q0WNJ6|CLAH1_ARATH | Clathrin heavy chain 1 OS=Arabidopsis thaliana GN=CHC1 PE=1 SV=1 | 5 | 1639 | 0.0E+00 |
sp|Q0WLB5|CLAH2_ARATH | Clathrin heavy chain 2 OS=Arabidopsis thaliana GN=CHC2 PE=1 SV=1 | 5 | 1639 | 0.0E+00 |
sp|Q2QYW2|CLH2_ORYSJ | Clathrin heavy chain 2 OS=Oryza sativa subsp. japonica GN=Os12g0104800 PE=3 SV=1 | 5 | 1636 | 0.0E+00 |
sp|Q2RBN7|CLH1_ORYSJ | Clathrin heavy chain 1 OS=Oryza sativa subsp. japonica GN=Os11g0104900 PE=3 SV=1 | 5 | 1636 | 0.0E+00 |
sp|P25870|CLH_DICDI | Clathrin heavy chain OS=Dictyostelium discoideum GN=chcA PE=1 SV=1 | 1 | 1660 | 0.0E+00 |
sp|P34574|CLH_CAEEL | Probable clathrin heavy chain 1 OS=Caenorhabditis elegans GN=chc-1 PE=3 SV=1 | 4 | 1660 | 0.0E+00 |
sp|P53675|CLH2_HUMAN | Clathrin heavy chain 2 OS=Homo sapiens GN=CLTCL1 PE=1 SV=2 | 4 | 1636 | 0.0E+00 |
sp|P22137|CLH_YEAST | Clathrin heavy chain OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=CHC1 PE=1 SV=1 | 1 | 1658 | 0.0E+00 |
sp|P29742|CLH_DROME | Clathrin heavy chain OS=Drosophila melanogaster GN=Chc PE=1 SV=1 | 3 | 1671 | 0.0E+00 |
sp|P11442|CLH1_RAT | Clathrin heavy chain 1 OS=Rattus norvegicus GN=Cltc PE=1 SV=3 | 4 | 1684 | 0.0E+00 |
sp|P49951|CLH1_BOVIN | Clathrin heavy chain 1 OS=Bos taurus GN=CLTC PE=1 SV=1 | 4 | 1684 | 0.0E+00 |
sp|Q68FD5|CLH1_MOUSE | Clathrin heavy chain 1 OS=Mus musculus GN=Cltc PE=1 SV=3 | 4 | 1684 | 0.0E+00 |
sp|Q10161|CLH_SCHPO | Probable clathrin heavy chain OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=chc1 PE=1 SV=1 | 4 | 1659 | 0.0E+00 |
sp|Q5XIR8|CLHC1_RAT | Clathrin heavy chain linker domain-containing protein 1 OS=Rattus norvegicus GN=Clhc1 PE=2 SV=1 | 367 | 521 | 4.0E-09 |
sp|Q5M6W3|CLHC1_MOUSE | Clathrin heavy chain linker domain-containing protein 1 OS=Mus musculus GN=Clhc1 PE=2 SV=1 | 367 | 521 | 1.0E-07 |
sp|Q8NHS4|CLHC1_HUMAN | Clathrin heavy chain linker domain-containing protein 1 OS=Homo sapiens GN=CLHC1 PE=1 SV=3 | 367 | 521 | 4.0E-07 |
sp|Q4R6I5|CLHC1_MACFA | Clathrin heavy chain linker domain-containing protein 1 OS=Macaca fascicularis GN=CLHC1 PE=2 SV=1 | 367 | 510 | 5.0E-07 |
GO Term | Description | Terminal node |
---|---|---|
GO:0016192 | vesicle-mediated transport | Yes |
GO:0006886 | intracellular protein transport | Yes |
GO:0008150 | biological_process | No |
GO:0051179 | localization | No |
GO:0045184 | establishment of protein localization | No |
GO:0051234 | establishment of localization | No |
GO:0046907 | intracellular transport | No |
GO:0071702 | organic substance transport | No |
GO:0071705 | nitrogen compound transport | No |
GO:0070727 | cellular macromolecule localization | No |
GO:0033036 | macromolecule localization | No |
GO:0015031 | protein transport | No |
GO:0009987 | cellular process | No |
GO:0051641 | cellular localization | No |
GO:0051649 | establishment of localization in cell | No |
GO:0006810 | transport | No |
GO:0008104 | protein localization | No |
Localizations | Signals | Cytoplasm | Nucleus | Extracellular | Cell membrane | Mitochondrion | Plastid | Endoplasmic reticulum | Lysosome vacuole | Golgi apparatus | Peroxisome |
---|---|---|---|---|---|---|---|---|---|---|---|
Cytoplasm | Nuclear localization signal | 0.5768 | 0.4832 | 0.045 | 0.1233 | 0.0571 | 0.0024 | 0.2993 | 0.3262 | 0.1859 | 0.0004 |
Orthofinder run ID | 4 |
Orthogroup | 1635 |
Change Orthofinder run |
Type of sequence | Sequence |
---|---|
Locus | Download genbank file of locus
Download genbank file of locus (reverse complement)
The gene with 5 kb flanks (if sufficient flanking sequence is available). For use in cloning design programs. NOTE: features (genes or exons) that are only partially contained within the sequence are completely excluded. |
Protein | >Hirsu2|131 MAPLPIKFQELVQLASVGVDTQSIGFNSCTLESDSYVCIREKKSEAAQPEVVIVELKNGNNVTRRPIKADSAIMH WKRQVIALKAQSRTLQIFDVEQKKKLKSCTMNEDVQFWKWISESTLGLVTTSSVYHWDVYDAAQDAPSKMFERNA NLNGCQIINYRANVDGKWMVVVGISSQQGRVVGAMQLYSKDRGISQAIEGHAAAFGTLRLDGAPQDTRLFSFAVR GSNGAKLHIVEVDHPESNPVFPKKAVDIFFPPEATNDFPVALQISQKYGVIFMVTKYGFIHLYDLESGTLIFMNR ISSETIFTSCADDDSSGLVGINRKGQVLFVTIDDSTVIPYLLENPANTEIAIKLASRAGLPGADSLYAKQFDQLF NSGNYMEAAKIAANSPRGFLRTAETIDKFKRLPAQPGQMAFTLQYFGMLLDKGTLNHQETIELASPVLQQNRKHL LEKWLKEGKLDCSEQLGDMVRPYDVNMALTIYLKANVPQKVVAGFAETGQFDKILPYASQTGYQPDYIQLLQHII RTNPEKGGEFAISLASSDQGPLVDFERVCDIFQSQGMIQQATSFLLDALKENKPEHARLQTRLLEMNLMHAPQVA DAILGNDMVTHFDKGRIAQLCEQAGLLQRALELYEDPEAIKRVVVTIPGSPNFNPEWFTNFFGKLSVEQSLDCLD AMMKHNIRQNLQSVVNIATKYSELLGPVRLIDLFEKYKTAEGLFYYLGSIVNLSEDPDVHFKYIEAAARSNQFNE VERICRDSNSYNPEKVKNFLKEAKLPEQLPLIIVCDRFNFVHDLILYLYQSQQFQAIEAYVQRVNPSRTPAVVGG LLDVDCDENIIKQLLGTVNASQINIDELVSEVESRNRLKLLLPFLEATLQSGNQQQAVYNALAKIYIDSNNNPEK FLKENDQYDTLTVGKYCEKRDPNLAYIAYSKGQNDLELVNITNENSMYRAQARYLLERSDAELWRFVLSESNIHR RSVVDQVISTAVPESTDPAKVSVAVSALLDNDMPLELIELLEKIVLEPSPFSDNQNLQNLLMFTAAKADKGRVMD YIHKLDGYNADDIATSCIEVGLFEEAFEIYKKADNKTAAVDVLVEHVVSIDRSQAYAEEVDIPEVWSKVAKAQLD GVRVSDGIESYIKAGDPKNYEEVIEIATHAGKNEELVKYLRMARKTLREPAIDTALAFCYARLDQLPELEDFLRA TNVANIEESGDKAYEEGLFEASKIFYSSISNWAKLATTLVHLGDYQAAVECARKANNIKVWKQVHEACVEKREFR LAQICGLNLIIDAEQLQTLVKDYERNGYFDELISLLEQGLGLERAHMGMFTELGIALSRYHPERLMEHIKLFWSR MNMPKMIRACEEAHLWPELVFCYCHYDEFDNAALAVIERPENSWEHQQFKEIVVKVANLEIYYRAIKFYVEQHPS LLTDLLAALTARIDVNRVVKMFQKNDSLPLIKPFLLNVQSQNKRTVNEAVNDLLIEEEDYKTLRDSVQNHDNYDA VGLAARLEKHDLIFFRQIAASIYRKNKRWEKSIALSKQDKLYKDAIETAATSGKTEIVEDLLRYFVDIGHRECYT GMLYACNELIRPDLILELSWRHGLTDFTMPYMINMLAQQTREIAQLKADNEARKAKEKEQEKTEDSTPILGGSRL MITAGPAGGMGGGMTPAPYGQTNGFAPQPTGYGY* |
Coding | >Hirsu2|131 ATGGCGCCTCTGCCCATCAAGTTCCAGGAGCTGGTGCAGCTCGCAAGCGTCGGCGTGGACACGCAATCCATCGGC TTCAACTCTTGCACCCTCGAGTCCGACTCCTACGTCTGCATCCGCGAAAAGAAGAGCGAAGCCGCCCAGCCGGAA GTCGTCATCGTCGAGCTCAAAAATGGCAACAACGTCACGCGCCGGCCAATCAAGGCCGACAGTGCCATCATGCAC TGGAAGCGCCAGGTCATTGCCCTCAAGGCCCAGTCGCGCACCCTCCAAATCTTCGACGTCGAGCAGAAGAAGAAG CTCAAGTCGTGTACCATGAACGAGGATGTCCAGTTCTGGAAGTGGATAAGCGAGTCCACTCTGGGCCTCGTCACC ACCAGCAGCGTCTATCACTGGGATGTCTACGACGCCGCCCAGGACGCGCCCTCCAAGATGTTCGAGCGCAACGCC AACCTCAACGGCTGCCAAATCATTAATTACCGCGCCAACGTCGACGGCAAGTGGATGGTCGTTGTCGGCATCTCC TCCCAGCAGGGCCGCGTCGTAGGCGCCATGCAGCTGTACTCCAAAGACCGGGGCATTAGTCAGGCCATCGAGGGT CATGCGGCTGCCTTTGGCACCCTCAGACTCGACGGCGCCCCCCAGGACACCAGGCTCTTCAGCTTCGCCGTCCGC GGCAGCAACGGCGCCAAGCTGCACATCGTCGAGGTCGACCACCCCGAGTCCAACCCCGTCTTCCCCAAGAAGGCT GTCGACATCTTCTTCCCCCCGGAGGCCACCAACGACTTCCCCGTCGCCCTGCAGATTTCCCAGAAGTATGGCGTC ATCTTCATGGTCACCAAGTATGGCTTCATCCACCTCTACGACCTCGAGTCCGGCACCCTCATCTTCATGAACCGG ATCTCGAGCGAAACCATCTTCACAAGCTGCGCCGACGACGACTCCTCGGGCCTTGTGGGTATCAACCGCAAGGGC CAGGTCCTCTTCGTCACCATCGACGACTCCACCGTCATCCCCTACCTCCTCGAGAACCCTGCCAACACCGAGATT GCCATCAAGCTGGCCTCGAGAGCTGGACTCCCTGGTGCGGACAGCCTCTATGCGAAGCAATTCGATCAGCTGTTC AACTCTGGCAACTACATGGAGGCGGCCAAGATTGCCGCCAACTCGCCCCGCGGCTTCCTCCGCACTGCCGAGACC ATCGACAAGTTCAAGCGGCTCCCGGCCCAGCCCGGCCAGATGGCCTTCACCCTGCAGTACTTTGGCATGCTGCTC GACAAGGGCACGCTGAACCACCAGGAGACTATTGAGCTCGCCAGTCCGGTGCTCCAGCAGAACCGGAAGCACCTA CTGGAGAAGTGGCTCAAGGAAGGCAAGCTGGACTGCTCCGAGCAGCTCGGCGACATGGTGCGGCCCTACGACGTC AACATGGCCCTGACCATCTACCTCAAGGCCAACGTGCCCCAGAAGGTCGTCGCCGGCTTCGCCGAGACGGGCCAG TTCGACAAGATTCTGCCCTACGCCTCCCAGACCGGCTACCAGCCCGACTACATCCAGCTCCTGCAGCACATCATC CGCACCAACCCCGAAAAGGGCGGCGAATTCGCCATCTCCCTCGCTAGCAGCGACCAGGGCCCGTTGGTCGACTTC GAGCGCGTGTGCGACATCTTCCAGTCCCAGGGCATGATCCAGCAGGCCACCAGCTTCCTGCTCGACGCCCTCAAG GAGAACAAGCCGGAGCACGCGCGCCTGCAGACCCGTCTCCTCGAGATGAACCTGATGCACGCCCCCCAGGTGGCC GATGCCATCCTCGGCAACGACATGGTCACCCACTTCGACAAGGGCCGGATCGCCCAGCTGTGCGAGCAGGCCGGC CTTTTGCAGAGGGCGCTCGAGCTGTACGAGGACCCCGAGGCCATCAAGCGCGTCGTCGTCACCATACCCGGCAGC CCCAATTTCAACCCCGAGTGGTTCACCAACTTCTTCGGCAAGCTGTCCGTGGAGCAGTCCCTCGACTGCCTCGAC GCCATGATGAAGCACAACATCCGCCAAAACTTGCAGTCCGTCGTCAACATTGCGACCAAGTACTCCGAGCTCCTC GGGCCCGTCCGTCTCATTGACCTCTTTGAAAAGTACAAGACGGCCGAGGGTCTCTTCTACTACCTCGGCAGCATC GTCAACCTCTCCGAGGACCCCGACGTGCACTTCAAGTACATCGAGGCGGCCGCCAGGTCGAACCAGTTCAACGAG GTGGAGCGCATCTGCCGGGACAGCAACAGCTACAACCCGGAAAAGGTCAAGAATTTCCTCAAGGAGGCCAAGCTT CCCGAGCAGCTGCCTCTCATCATCGTCTGCGACCGTTTCAACTTCGTTCACGACTTGATCCTCTACCTGTACCAG AGCCAGCAGTTCCAGGCCATCGAGGCCTACGTCCAGCGCGTCAACCCCTCCAGGACGCCCGCCGTCGTCGGCGGC CTCCTCGACGTCGACTGCGACGAGAACATCATCAAGCAGCTTCTGGGCACTGTCAATGCCTCGCAGATCAACATC GACGAGCTGGTATCCGAAGTTGAGTCGCGCAACCGCCTCAAGCTGCTCCTGCCTTTCCTCGAGGCCACGCTGCAG TCCGGTAATCAGCAGCAGGCCGTCTATAATGCGCTCGCCAAGATCTACATCGACTCGAACAACAACCCTGAGAAG TTCCTCAAGGAGAACGACCAGTATGACACCCTGACGGTCGGCAAGTATTGCGAGAAGCGTGACCCCAACCTGGCC TACATCGCCTACTCCAAGGGCCAGAACGACCTGGAGCTCGTCAACATCACCAACGAGAACTCCATGTATCGAGCG CAGGCCCGATACCTGCTGGAGCGGTCCGACGCCGAGCTTTGGCGCTTCGTCCTCAGCGAGAGCAATATCCACCGA CGCTCTGTCGTGGACCAGGTCATCTCCACCGCCGTTCCCGAGTCCACCGATCCGGCCAAGGTCTCCGTCGCCGTC TCGGCTCTGCTCGACAACGACATGCCCCTGGAGCTTATTGAGCTGCTGGAGAAGATCGTGCTGGAGCCGTCGCCT TTCAGCGACAACCAGAACTTGCAGAACCTGCTCATGTTCACGGCCGCCAAGGCCGACAAGGGCCGCGTGATGGAC TACATCCACAAGCTCGACGGCTACAACGCCGACGACATCGCGACGTCGTGCATCGAGGTTGGCCTCTTCGAGGAA GCCTTCGAAATTTACAAGAAGGCCGACAACAAGACGGCGGCCGTCGACGTCCTCGTCGAGCACGTCGTCAGCATT GACCGTTCACAAGCGTACGCCGAAGAGGTGGACATTCCCGAGGTCTGGAGCAAGGTTGCCAAGGCACAGCTCGAC GGCGTCCGAGTCTCGGATGGTATCGAGTCGTACATCAAGGCCGGCGACCCCAAGAACTACGAAGAGGTGATTGAG ATCGCCACGCACGCGGGTAAGAATGAGGAGCTCGTCAAGTATCTGCGCATGGCTCGCAAGACGCTGCGCGAGCCT GCCATCGACACGGCACTGGCCTTCTGCTACGCCCGCCTGGATCAGCTTCCCGAGCTCGAAGACTTCCTGCGGGCG ACCAATGTTGCCAACATCGAGGAGTCCGGAGACAAGGCTTACGAGGAGGGTCTCTTTGAGGCGTCCAAGATTTTC TACAGCAGCATCTCCAACTGGGCCAAGCTCGCCACCACTCTCGTGCACCTGGGCGACTACCAGGCCGCCGTCGAG TGCGCGCGCAAGGCGAACAACATCAAGGTCTGGAAGCAGGTGCACGAGGCGTGCGTCGAGAAGAGGGAATTCCGA CTGGCCCAGATCTGCGGCCTGAACTTGATCATCGACGCAGAGCAGCTGCAGACCCTGGTCAAGGACTACGAGCGC AACGGCTACTTTGACGAGCTCATCAGCCTCCTGGAGCAAGGCCTCGGCCTCGAGCGCGCCCACATGGGCATGTTC ACCGAGCTGGGCATCGCCCTGTCTAGGTACCACCCGGAGCGCCTGATGGAGCACATCAAGCTGTTCTGGTCGAGG ATGAACATGCCCAAGATGATTCGCGCTTGCGAGGAAGCCCACCTGTGGCCGGAGCTCGTCTTCTGCTACTGCCAC TACGACGAGTTCGACAACGCCGCCTTGGCCGTCATTGAGAGGCCCGAGAACTCGTGGGAGCACCAGCAGTTCAAG GAGATTGTGGTCAAGGTCGCCAACCTCGAAATCTACTACCGCGCCATCAAGTTCTACGTGGAGCAACACCCGTCG CTGCTCACCGACCTCCTCGCGGCCCTGACGGCCCGCATCGACGTCAACCGCGTCGTCAAGATGTTCCAGAAGAAC GACAGCCTGCCGCTCATCAAGCCCTTCCTCCTCAATGTGCAGTCGCAGAACAAGCGCACGGTCAACGAGGCGGTC AACGACCTGCTCATCGAGGAAGAAGACTACAAGACGCTGCGCGACTCCGTGCAGAACCACGACAACTACGACGCG GTCGGGCTCGCCGCCCGCCTGGAGAAGCACGACCTCATCTTCTTCCGCCAGATCGCCGCCAGCATCTACCGCAAG AACAAGCGGTGGGAGAAGTCGATTGCGCTGTCCAAGCAAGACAAGCTTTACAAGGATGCGATCGAGACGGCCGCC ACATCTGGTAAGACCGAGATTGTGGAGGATCTCCTTCGATATTTCGTCGACATCGGACACCGTGAATGCTATACG GGCATGCTGTACGCTTGCAACGAGCTCATCCGACCCGACCTCATCCTCGAGCTGTCGTGGCGTCACGGCCTCACG GACTTCACGATGCCCTACATGATCAACATGCTGGCGCAGCAGACGCGCGAAATTGCCCAGCTGAAGGCGGACAAC GAGGCCCGCAAAGCCAAGGAGAAGGAACAGGAGAAGACGGAGGACAGCACGCCCATCCTCGGCGGCTCTCGGCTG ATGATCACGGCGGGACCGGCGGGCGGTATGGGCGGAGGCATGACGCCTGCCCCATACGGCCAGACCAACGGCTTC GCACCGCAACCCACGGGCTACGGTTACTAG |
Transcript | >Hirsu2|131 ATGGCGCCTCTGCCCATCAAGTTCCAGGAGCTGGTGCAGCTCGCAAGCGTCGGCGTGGACACGCAATCCATCGGC TTCAACTCTTGCACCCTCGAGTCCGACTCCTACGTCTGCATCCGCGAAAAGAAGAGCGAAGCCGCCCAGCCGGAA GTCGTCATCGTCGAGCTCAAAAATGGCAACAACGTCACGCGCCGGCCAATCAAGGCCGACAGTGCCATCATGCAC TGGAAGCGCCAGGTCATTGCCCTCAAGGCCCAGTCGCGCACCCTCCAAATCTTCGACGTCGAGCAGAAGAAGAAG CTCAAGTCGTGTACCATGAACGAGGATGTCCAGTTCTGGAAGTGGATAAGCGAGTCCACTCTGGGCCTCGTCACC ACCAGCAGCGTCTATCACTGGGATGTCTACGACGCCGCCCAGGACGCGCCCTCCAAGATGTTCGAGCGCAACGCC AACCTCAACGGCTGCCAAATCATTAATTACCGCGCCAACGTCGACGGCAAGTGGATGGTCGTTGTCGGCATCTCC TCCCAGCAGGGCCGCGTCGTAGGCGCCATGCAGCTGTACTCCAAAGACCGGGGCATTAGTCAGGCCATCGAGGGT CATGCGGCTGCCTTTGGCACCCTCAGACTCGACGGCGCCCCCCAGGACACCAGGCTCTTCAGCTTCGCCGTCCGC GGCAGCAACGGCGCCAAGCTGCACATCGTCGAGGTCGACCACCCCGAGTCCAACCCCGTCTTCCCCAAGAAGGCT GTCGACATCTTCTTCCCCCCGGAGGCCACCAACGACTTCCCCGTCGCCCTGCAGATTTCCCAGAAGTATGGCGTC ATCTTCATGGTCACCAAGTATGGCTTCATCCACCTCTACGACCTCGAGTCCGGCACCCTCATCTTCATGAACCGG ATCTCGAGCGAAACCATCTTCACAAGCTGCGCCGACGACGACTCCTCGGGCCTTGTGGGTATCAACCGCAAGGGC CAGGTCCTCTTCGTCACCATCGACGACTCCACCGTCATCCCCTACCTCCTCGAGAACCCTGCCAACACCGAGATT GCCATCAAGCTGGCCTCGAGAGCTGGACTCCCTGGTGCGGACAGCCTCTATGCGAAGCAATTCGATCAGCTGTTC AACTCTGGCAACTACATGGAGGCGGCCAAGATTGCCGCCAACTCGCCCCGCGGCTTCCTCCGCACTGCCGAGACC ATCGACAAGTTCAAGCGGCTCCCGGCCCAGCCCGGCCAGATGGCCTTCACCCTGCAGTACTTTGGCATGCTGCTC GACAAGGGCACGCTGAACCACCAGGAGACTATTGAGCTCGCCAGTCCGGTGCTCCAGCAGAACCGGAAGCACCTA CTGGAGAAGTGGCTCAAGGAAGGCAAGCTGGACTGCTCCGAGCAGCTCGGCGACATGGTGCGGCCCTACGACGTC AACATGGCCCTGACCATCTACCTCAAGGCCAACGTGCCCCAGAAGGTCGTCGCCGGCTTCGCCGAGACGGGCCAG TTCGACAAGATTCTGCCCTACGCCTCCCAGACCGGCTACCAGCCCGACTACATCCAGCTCCTGCAGCACATCATC CGCACCAACCCCGAAAAGGGCGGCGAATTCGCCATCTCCCTCGCTAGCAGCGACCAGGGCCCGTTGGTCGACTTC GAGCGCGTGTGCGACATCTTCCAGTCCCAGGGCATGATCCAGCAGGCCACCAGCTTCCTGCTCGACGCCCTCAAG GAGAACAAGCCGGAGCACGCGCGCCTGCAGACCCGTCTCCTCGAGATGAACCTGATGCACGCCCCCCAGGTGGCC GATGCCATCCTCGGCAACGACATGGTCACCCACTTCGACAAGGGCCGGATCGCCCAGCTGTGCGAGCAGGCCGGC CTTTTGCAGAGGGCGCTCGAGCTGTACGAGGACCCCGAGGCCATCAAGCGCGTCGTCGTCACCATACCCGGCAGC CCCAATTTCAACCCCGAGTGGTTCACCAACTTCTTCGGCAAGCTGTCCGTGGAGCAGTCCCTCGACTGCCTCGAC GCCATGATGAAGCACAACATCCGCCAAAACTTGCAGTCCGTCGTCAACATTGCGACCAAGTACTCCGAGCTCCTC GGGCCCGTCCGTCTCATTGACCTCTTTGAAAAGTACAAGACGGCCGAGGGTCTCTTCTACTACCTCGGCAGCATC GTCAACCTCTCCGAGGACCCCGACGTGCACTTCAAGTACATCGAGGCGGCCGCCAGGTCGAACCAGTTCAACGAG GTGGAGCGCATCTGCCGGGACAGCAACAGCTACAACCCGGAAAAGGTCAAGAATTTCCTCAAGGAGGCCAAGCTT CCCGAGCAGCTGCCTCTCATCATCGTCTGCGACCGTTTCAACTTCGTTCACGACTTGATCCTCTACCTGTACCAG AGCCAGCAGTTCCAGGCCATCGAGGCCTACGTCCAGCGCGTCAACCCCTCCAGGACGCCCGCCGTCGTCGGCGGC CTCCTCGACGTCGACTGCGACGAGAACATCATCAAGCAGCTTCTGGGCACTGTCAATGCCTCGCAGATCAACATC GACGAGCTGGTATCCGAAGTTGAGTCGCGCAACCGCCTCAAGCTGCTCCTGCCTTTCCTCGAGGCCACGCTGCAG TCCGGTAATCAGCAGCAGGCCGTCTATAATGCGCTCGCCAAGATCTACATCGACTCGAACAACAACCCTGAGAAG TTCCTCAAGGAGAACGACCAGTATGACACCCTGACGGTCGGCAAGTATTGCGAGAAGCGTGACCCCAACCTGGCC TACATCGCCTACTCCAAGGGCCAGAACGACCTGGAGCTCGTCAACATCACCAACGAGAACTCCATGTATCGAGCG CAGGCCCGATACCTGCTGGAGCGGTCCGACGCCGAGCTTTGGCGCTTCGTCCTCAGCGAGAGCAATATCCACCGA CGCTCTGTCGTGGACCAGGTCATCTCCACCGCCGTTCCCGAGTCCACCGATCCGGCCAAGGTCTCCGTCGCCGTC TCGGCTCTGCTCGACAACGACATGCCCCTGGAGCTTATTGAGCTGCTGGAGAAGATCGTGCTGGAGCCGTCGCCT TTCAGCGACAACCAGAACTTGCAGAACCTGCTCATGTTCACGGCCGCCAAGGCCGACAAGGGCCGCGTGATGGAC TACATCCACAAGCTCGACGGCTACAACGCCGACGACATCGCGACGTCGTGCATCGAGGTTGGCCTCTTCGAGGAA GCCTTCGAAATTTACAAGAAGGCCGACAACAAGACGGCGGCCGTCGACGTCCTCGTCGAGCACGTCGTCAGCATT GACCGTTCACAAGCGTACGCCGAAGAGGTGGACATTCCCGAGGTCTGGAGCAAGGTTGCCAAGGCACAGCTCGAC GGCGTCCGAGTCTCGGATGGTATCGAGTCGTACATCAAGGCCGGCGACCCCAAGAACTACGAAGAGGTGATTGAG ATCGCCACGCACGCGGGTAAGAATGAGGAGCTCGTCAAGTATCTGCGCATGGCTCGCAAGACGCTGCGCGAGCCT GCCATCGACACGGCACTGGCCTTCTGCTACGCCCGCCTGGATCAGCTTCCCGAGCTCGAAGACTTCCTGCGGGCG ACCAATGTTGCCAACATCGAGGAGTCCGGAGACAAGGCTTACGAGGAGGGTCTCTTTGAGGCGTCCAAGATTTTC TACAGCAGCATCTCCAACTGGGCCAAGCTCGCCACCACTCTCGTGCACCTGGGCGACTACCAGGCCGCCGTCGAG TGCGCGCGCAAGGCGAACAACATCAAGGTCTGGAAGCAGGTGCACGAGGCGTGCGTCGAGAAGAGGGAATTCCGA CTGGCCCAGATCTGCGGCCTGAACTTGATCATCGACGCAGAGCAGCTGCAGACCCTGGTCAAGGACTACGAGCGC AACGGCTACTTTGACGAGCTCATCAGCCTCCTGGAGCAAGGCCTCGGCCTCGAGCGCGCCCACATGGGCATGTTC ACCGAGCTGGGCATCGCCCTGTCTAGGTACCACCCGGAGCGCCTGATGGAGCACATCAAGCTGTTCTGGTCGAGG ATGAACATGCCCAAGATGATTCGCGCTTGCGAGGAAGCCCACCTGTGGCCGGAGCTCGTCTTCTGCTACTGCCAC TACGACGAGTTCGACAACGCCGCCTTGGCCGTCATTGAGAGGCCCGAGAACTCGTGGGAGCACCAGCAGTTCAAG GAGATTGTGGTCAAGGTCGCCAACCTCGAAATCTACTACCGCGCCATCAAGTTCTACGTGGAGCAACACCCGTCG CTGCTCACCGACCTCCTCGCGGCCCTGACGGCCCGCATCGACGTCAACCGCGTCGTCAAGATGTTCCAGAAGAAC GACAGCCTGCCGCTCATCAAGCCCTTCCTCCTCAATGTGCAGTCGCAGAACAAGCGCACGGTCAACGAGGCGGTC AACGACCTGCTCATCGAGGAAGAAGACTACAAGACGCTGCGCGACTCCGTGCAGAACCACGACAACTACGACGCG GTCGGGCTCGCCGCCCGCCTGGAGAAGCACGACCTCATCTTCTTCCGCCAGATCGCCGCCAGCATCTACCGCAAG AACAAGCGGTGGGAGAAGTCGATTGCGCTGTCCAAGCAAGACAAGCTTTACAAGGATGCGATCGAGACGGCCGCC ACATCTGGTAAGACCGAGATTGTGGAGGATCTCCTTCGATATTTCGTCGACATCGGACACCGTGAATGCTATACG GGCATGCTGTACGCTTGCAACGAGCTCATCCGACCCGACCTCATCCTCGAGCTGTCGTGGCGTCACGGCCTCACG GACTTCACGATGCCCTACATGATCAACATGCTGGCGCAGCAGACGCGCGAAATTGCCCAGCTGAAGGCGGACAAC GAGGCCCGCAAAGCCAAGGAGAAGGAACAGGAGAAGACGGAGGACAGCACGCCCATCCTCGGCGGCTCTCGGCTG ATGATCACGGCGGGACCGGCGGGCGGTATGGGCGGAGGCATGACGCCTGCCCCATACGGCCAGACCAACGGCTTC GCACCGCAACCCACGGGCTACGGTTACTAG |
Gene | >Hirsu2|131 ATGGCGCCTCTGCCCATCAAGTTCCAGGAGCTGGTGCAGCTCGCAAGCGTCGGCGTGGACACGCAATCCATCGGC TTCAACTCTTGCGTAAGTGACGCTCTCTCTCCCTCTCCCTCTCCCTCTCCCTCTCCGCAACAGCTGCGGTCGCGC GGCAAAAGCGACGACGCCGACCGACGAGAGTCGAGATTGCTGACGAGCCGCCGACCTCGCAGACCCTCGAGTCCG ACTCCTACGTCTGCATCCGCGAAAAGAAGAGCGAAGCCGCCCAGCCGGAAGTCGTCATCGTCGAGCTCAAAAATG GCAACAACGTCACGCGCCGGCCAATCAAGGCCGACAGTGCCATCATGCACTGGAAGCGCCAGGTCATTGCCCTCA AGGCCCAGTCGCGCACCCTCCAAATCTTCGACGTCGAGCAGAAGAAGAAGCTCAAGTCGTGTACCATGAACGAGG ATGTCCAGTTCTGGAAGTGGATAAGCGAGTCCACTCTGGGCCTCGTCACCACCAGCAGCGTCTATCACTGGGATG TCTACGACGCCGCCCAGGACGCGCCCTCCAAGATGTTCGAGCGCAACGCCAACCTCAACGTATGTTTCCCTCCTC TCGCCCGGACCCGTCCACTCCAGCTAACCTCGACGGTTCTCTGCAGGGCTGCCAAATCATTAATTACCGCGCCAA CGTCGACGGCAAGTGGATGGTCGTTGTCGGCATCTCCTCCCAGCAGGGCCGCGTCGTAGGCGCCATGCAGCTGTA CTCCAAAGACCGGGGCATTAGTCAGGCCATCGAGGGTCATGCGGCTGCCTTTGGCACCCTCAGACTCGACGGCGC CCCCCAGGACACCAGGCTCTTCAGCTTCGCCGTCCGCGGCAGCAACGGCGCCAAGCTGCACATCGTCGAGGTCGA CCACCCCGAGTCCAACCCCGTCTTCCCCAAGAAGGCTGTCGACATCTTCTTCCCCCCGGAGGCCACCAACGACTT CCCCGTCGCCCTGCAGATTTCCCAGAAGTATGGCGTCATCTTCATGGTCACCAAGTATGGCTTCATCCACCTCTA CGACCTCGAGTCCGGCACCCTCATCTTCATGAACCGGATCTCGAGCGAAACCATCTTCACAAGCTGCGCCGACGA CGACTCCTCGGGCCTTGTGGGTATCAACCGCAAGGGCCAGGTCCTCTTCGTCACCATCGACGACTCCACCGTCAT CCCCTACCTCCTCGAGAACCCTGCCAACACCGAGATTGCCATCAAGCTGGCCTCGAGAGCTGGACTCCCTGGTGC GGACAGCCTCTATGCGAAGCAATTCGATCAGCTGTTCAACTCTGGCAACTACATGGAGGCGGCCAAGATTGCCGC CAACTCGCCCCGCGGCTTCCTCCGCACTGCCGAGACCATCGACAAGTTCAAGCGGCTCCCGGCCCAGCCCGGCCA GATGGCCTTCACCCTGCAGTACTTTGGCATGCTGCTCGACAAGGGCACGCTGAACCACCAGGAGACTATTGAGCT CGCCAGTCCGGTGCTCCAGCAGAACCGGAAGCACCTACTGGAGAAGTGGCTCAAGGAAGGCAAGCTGGACTGCTC CGAGCAGCTCGGCGACATGGTGCGGCCCTACGACGTCAACATGGCCCTGACCATCTACCTCAAGGCCAACGTGCC CCAGAAGGTCGTCGCCGGCTTCGCCGAGACGGGCCAGTTCGACAAGATTCTGCCCTACGCCTCCCAGACCGGCTA CCAGCCCGACTACATCCAGCTCCTGCAGCACATCATCCGCACCAACCCCGAAAAGGGCGGCGAATTCGCCATCTC CCTCGCTAGCAGCGACCAGGGCCCGTTGGTCGACTTCGAGCGCGTGTGCGACATCTTCCAGTCCCAGGGCATGAT CCAGCAGGCCACCAGCTTCCTGCTCGACGCCCTCAAGGAGAACAAGCCGGAGCACGCGCGCCTGCAGACCCGTCT CCTCGAGATGAACCTGATGCACGCCCCCCAGGTGGCCGATGCCATCCTCGGCAACGACATGGTCACCCACTTCGA CAAGGGCCGGATCGCCCAGCTGTGCGAGCAGGCCGGCCTTTTGCAGAGGGCGCTCGAGCTGTACGAGGACCCCGA GGCCATCAAGCGCGTCGTCGTCACCATACCCGGCAGCCCCAATTTCAACCCCGAGTGGTTCACCAACTTCTTCGG CAAGCTGTCCGTGGAGCAGTCCCTCGACTGCCTCGACGCCATGATGAAGCACAACATCCGCCAAAACTTGCAGTC CGTCGTCAACATTGCGACCAAGTACTCCGAGCTCCTCGGGCCCGTCCGTCTCATTGACCTCTTTGAAAAGTACAA GACGGCCGAGGGTCTCTTCTACTACCTCGGCAGCATCGTCAACCTCTCCGAGGACCCCGACGTGCACTTCAAGTA CATCGAGGCGGCCGCCAGGTCGAACCAGTTCAACGAGGTGGAGCGCATCTGCCGGGACAGCAACAGCTACAACCC GGAAAAGGTCAAGAATTTCCTCAAGGAGGCCAAGCTTCCCGAGCAGCTGCCTCTCATCATCGTCTGCGACCGTTT CAACTTCGTTCACGACTTGATCCTCTACCTGTACCAGAGCCAGCAGTTCCAGGCCATCGAGGCCTACGTCCAGCG CGTCAACCCCTCCAGGACGCCCGCCGTCGTCGGCGGCCTCCTCGACGTCGACTGCGACGAGAACATCATCAAGCA GCTTCTGGGCACTGTCAATGCCTCGCAGATCAACATCGACGAGCTGGTATCCGAAGTTGAGTCGCGCAACCGCCT CAAGCTGCTCCTGCCTTTCCTCGAGGCCACGCTGCAGTCCGGTAATCAGCAGCAGGCCGTCTATAATGCGCTCGC CAAGATCTACATCGACTCGAACAACAACCCTGAGAAGTTCCTCAAGGAGAACGACCAGTATGACACCCTGACGGT CGGCAAGTATTGCGAGAAGCGTGACCCCAACCTGGCCTACATCGCCTACTCCAAGGGCCAGAACGACCTGGAGCT CGTCAACATCACCAACGAGAACTCCATGTATCGAGCGCAGGCCCGATACCTGCTGGAGCGGTCCGACGCCGAGCT TTGGCGCTTCGTCCTCAGCGAGAGCAATATCCACCGACGCTCTGTCGTGGACCAGGTCATCTCCACCGCCGTTCC CGAGTCCACCGATCCGGCCAAGGTCTCCGTCGCCGTCTCGGCTCTGCTCGACAACGACATGCCCCTGGAGCTTAT TGAGCTGCTGGAGAAGATCGTGCTGGAGCCGTCGCCTTTCAGCGACAACCAGAACTTGCAGAACCTGCTCATGTT CACGGCCGCCAAGGCCGACAAGGGCCGCGTGATGGACTACATCCACAAGCTCGACGGCTACAACGCCGACGACAT CGCGACGTCGTGCATCGAGGTTGGCCTCTTCGAGGAAGCCTTCGAAATTTACAAGAAGGCCGACAACAAGACGGC GGCCGTCGACGTCCTCGTCGAGCACGTCGTCAGCATTGACCGTTCACAAGCGTACGCCGAAGAGGTGGACATTCC CGAGGTCTGGAGCAAGGTTGCCAAGGCACAGCTCGACGGCGTCCGAGTCTCGGATGGTATCGAGTCGTACATCAA GGCCGGCGACCCCAAGAACTACGAAGAGGTGATTGAGATCGCCACGCACGCGGGTAAGAATGAGGAGCTCGTCAA GTATCTGCGCATGGCTCGCAAGACGCTGCGCGAGCCTGCCATCGACACGGCACTGGCCTTCTGCTACGCCCGCCT GGATCAGCTTCCCGAGCTCGAAGACTTCCTGCGGGCGACCAATGTTGCCAACATCGAGGAGTCCGGAGACAAGGC TTACGAGGAGGGTCTCTTTGAGGCGTCCAAGATTTTCTACAGCAGCATCTCCAACTGGGCCAAGCTCGCCACCAC TCTCGTGCACCTGGGCGACTACCAGGCCGCCGTCGAGTGCGCGCGCAAGGCGAACAACATCAAGGTCTGGAAGCA GGTGCACGAGGCGTGCGTCGAGAAGAGGGAATTCCGACTGGCCCAGATCTGCGGCCTGAACTTGATCATCGACGC AGAGCAGCTGCAGACCCTGGTCAAGGACTACGAGCGCAACGGCTACTTTGACGAGCTCATCAGCCTCCTGGAGCA AGGCCTCGGCCTCGAGCGCGCCCACATGGGCATGTTCACCGAGCTGGGCATCGCCCTGTCTAGGTACCACCCGGA GCGCCTGATGGAGCACATCAAGCTGTTCTGGTCGAGGATGAACATGCCCAAGATGATTCGCGCTTGCGAGGAAGC CCACCTGTGGCCGGAGCTCGTCTTCTGCTACTGCCACTACGACGAGTTCGACAACGCCGCCTTGGCCGTCATTGA GAGGCCCGAGAACTCGTGGGAGCACCAGCAGTTCAAGGAGATTGTGGTCAAGGTCGCCAACCTCGAAATCTACTA CCGCGCCATCAAGTTCTACGTGGAGCAACACCCGTCGCTGCTCACCGACCTCCTCGCGGCCCTGACGGCCCGCAT CGACGTCAACCGCGTCGTCAAGATGTTCCAGAAGAACGACAGCCTGCCGCTCATCAAGCCCTTCCTCCTCAATGT GCAGTCGCAGAACAAGCGCACGGTCAACGAGGCGGTCAACGACCTGCTCATCGAGGAAGAAGACTACAAGACGCT GCGCGACTCCGTGCAGAACCACGACAACTACGACGCGGTCGGGCTCGCCGCCCGCCTGGAGAAGCACGACCTCAT CTTCTTCCGCCAGATCGCCGCCAGCATCTACCGCAAGAACAAGCGGTGGGAGAAGTCGATTGCGCTGTCCAAGCA AGACAAGCTTTACAAGGATGCGATCGAGACGGCCGCCACATCTGGTAAGACCGAGATTGTGGAGGATCTCCTTCG ATATGTAAGTCGCTCCCTAGGCCGGCCCCTCGTCGACCGCCAAGGTTTGCTAATACTCTCGCTCAGTTCGTCGAC ATCGGACACCGTGAATGCTATACGGGCATGCTGTACGCTTGCAACGAGCTCATCCGACCCGACCTCATCCTCGAG CTGTCGTGGCGTCACGGCCTCACGGACTTCACGATGCCCTACATGATCAACATGCTGGCGCAGCAGACGCGCGAA ATTGCCCAGCTGAAGGCGGACAACGAGGCCCGCAAAGCCAAGGAGAAGGAACAGGAGAAGACGGAGGACAGCACG CCCATCCTCGGCGGCTCTCGGCTGATGATCACGGCGGGACCGGCGGGCGGTATGGGCGGAGGCATGACGCCTGCC CCATACGGCCAGACCAACGGCTTCGCACCGCAACCCACGGGCTACGGTTACTAG |