Protein ID | Hirsu2|3439 |
Gene name | |
Location | Contig_19:49806..52147 |
Strand | + |
Gene length (bp) | 2341 |
Transcript length (bp) | 2235 |
Coding sequence length (bp) | 2235 |
Protein length (aa) | 745 |
PFAM Domain ID | Short name | Long name | E-value | Start | End |
---|---|---|---|---|---|
PF02065 | Melibiase | Melibiase | 4.2E-149 | 304 | 653 |
PF16875 | Glyco_hydro_36N | Glycosyl hydrolase family 36 N-terminal domain | 8.4E-78 | 55 | 297 |
PF16874 | Glyco_hydro_36C | Glycosyl hydrolase family 36 C-terminal domain | 1.4E-22 | 666 | 741 |
Swissprot ID | Swissprot Description | Start | End | E-value |
---|---|---|---|---|
sp|Q92457|AGAL2_HYPJE | Alpha-galactosidase 2 OS=Hypocrea jecorina GN=agl2 PE=1 SV=1 | 5 | 744 | 0.0E+00 |
sp|B8NWY6|AGALC_ASPFN | Probable alpha-galactosidase C OS=Aspergillus flavus (strain ATCC 200026 / FGSC A1120 / NRRL 3357 / JCM 12722 / SRRC 167) GN=aglC PE=3 SV=2 | 19 | 744 | 0.0E+00 |
sp|Q2TW69|AGALC_ASPOR | Probable alpha-galactosidase C OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40) GN=aglC PE=3 SV=1 | 19 | 744 | 0.0E+00 |
sp|Q0CVH2|AGALC_ASPTN | Probable alpha-galactosidase C OS=Aspergillus terreus (strain NIH 2624 / FGSC A1156) GN=aglC PE=3 SV=1 | 1 | 743 | 0.0E+00 |
sp|Q9UUZ4|AGALC_ASPNG | Alpha-galactosidase C OS=Aspergillus niger GN=aglC PE=1 SV=1 | 7 | 744 | 0.0E+00 |
Swissprot ID | Swissprot Description | Start | End | E-value |
---|---|---|---|---|
sp|Q92457|AGAL2_HYPJE | Alpha-galactosidase 2 OS=Hypocrea jecorina GN=agl2 PE=1 SV=1 | 5 | 744 | 0.0E+00 |
sp|B8NWY6|AGALC_ASPFN | Probable alpha-galactosidase C OS=Aspergillus flavus (strain ATCC 200026 / FGSC A1120 / NRRL 3357 / JCM 12722 / SRRC 167) GN=aglC PE=3 SV=2 | 19 | 744 | 0.0E+00 |
sp|Q2TW69|AGALC_ASPOR | Probable alpha-galactosidase C OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40) GN=aglC PE=3 SV=1 | 19 | 744 | 0.0E+00 |
sp|Q0CVH2|AGALC_ASPTN | Probable alpha-galactosidase C OS=Aspergillus terreus (strain NIH 2624 / FGSC A1156) GN=aglC PE=3 SV=1 | 1 | 743 | 0.0E+00 |
sp|Q9UUZ4|AGALC_ASPNG | Alpha-galactosidase C OS=Aspergillus niger GN=aglC PE=1 SV=1 | 7 | 744 | 0.0E+00 |
sp|Q5AU92|AGALC_EMENI | Alpha-galactosidase C OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) GN=aglC PE=1 SV=1 | 16 | 744 | 0.0E+00 |
sp|Q0CEF5|AGALG_ASPTN | Probable alpha-galactosidase G OS=Aspergillus terreus (strain NIH 2624 / FGSC A1156) GN=aglG PE=3 SV=1 | 25 | 744 | 0.0E+00 |
sp|Q5ARP5|AGALG_EMENI | Probable alpha-galactosidase G OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) GN=aglG PE=2 SV=1 | 16 | 744 | 0.0E+00 |
sp|P43467|AGAL1_PEDPE | Alpha-galactosidase 1 OS=Pediococcus pentosaceus GN=agaR PE=3 SV=1 | 28 | 726 | 5.0E-150 |
sp|P27756|AGAL_STRMU | Alpha-galactosidase OS=Streptococcus mutans serotype c (strain ATCC 700610 / UA159) GN=aga PE=3 SV=3 | 48 | 744 | 9.0E-129 |
sp|P43469|AGAL2_PEDPE | Alpha-galactosidase 2 OS=Pediococcus pentosaceus GN=agaS PE=3 SV=1 | 89 | 741 | 6.0E-125 |
sp|P16551|RAFA_ECOLX | Alpha-galactosidase OS=Escherichia coli GN=rafA PE=1 SV=1 | 190 | 710 | 6.0E-88 |
Localizations | Signals | Cytoplasm | Nucleus | Extracellular | Cell membrane | Mitochondrion | Plastid | Endoplasmic reticulum | Lysosome vacuole | Golgi apparatus | Peroxisome |
---|---|---|---|---|---|---|---|---|---|---|---|
Extracellular | Signal peptide | 0.1261 | 0.0657 | 0.9094 | 0.0885 | 0.0994 | 0.1366 | 0.2435 | 0.3208 | 0.2726 | 0.0201 |
SignalP signal predicted | Location | Score |
---|---|---|
Yes | 1 - 19 | 0.999698 |
CAZyme category | E-value | Start | End |
---|---|---|---|
GH36 | 2.3E-238 | 35 | 727 |
Orthofinder run ID | 4 |
Orthogroup | 2424 |
Change Orthofinder run |
Type of sequence | Sequence |
---|---|
Locus | Download genbank file of locus
Download genbank file of locus (reverse complement)
The gene with 5 kb flanks (if sufficient flanking sequence is available). For use in cloning design programs. NOTE: features (genes or exons) that are only partially contained within the sequence are completely excluded. |
Protein | >Hirsu2|3439 MRSALVATLGLSLARLAHAEAASPAQPIAVDGPSFALNGDNVSYRFHVDNATGDLLSDHFGAPVDGDIIEAEVGP INGWVNVVGRVRRELPDLGRGDFRTPAIQIRQSEGYQISDFQYQSHEILQGKPPLNGLPSTFGADNDVSTLLVHL YDKYSMVGADLSYSIFPKYDAVVRSITVTNKGSKNITVEKLASLSVDMPLGDYEMLELRGDWARESMRVRRKVDF GTQGFASTAGYSSHFHNPFFSLMAPAATESHGEVWGFSLVYTGSFAAEIEKGSQGLTRAMIGLNPSQLSWPLGPG EALVSPEAVAVFSDTGVGGMSRKFHSLYRKHLMRSKFATQTRPVLLNSWEGLHFDYDAKKIQKLAEESASLGVKL FVLDDGWFGTQHPRDDDKAGLGDWEVNPSKFPQGLGTLVNGVTTLKSGNSSSGANMKFGLWFEPEMVNPNSSLYE KHPDWALHAGGYPRTETRHQLVLNVALREVQDFIVDSLTNILNSSRIEYVKWDNNRGIHETPAATTDHEYMLGLY RVFKTLTERFPDVLWEGCASGGGRLDPGVLQYFPQVWTSDDTDGLERVYIQFGSSLAYPPSAMGAHISAVPNGQT GRTTPIEFRAHVAMMGGSFGLELNPEEMPAEDRAKLPGLIELAEKVNPVVVRGDMWRLSLPDESNWPAALFVSED GGRAVLFYFQLRATINNSWPALRLQGLDAKARYKVDGGQIVSGATLMNKGLSYRFEGDFASRLVFLERQ* |
Coding | >Hirsu2|3439 ATGAGGAGCGCCCTTGTCGCGACACTTGGCCTGAGCCTGGCCCGGCTCGCTCACGCAGAAGCGGCGAGCCCGGCC CAGCCGATTGCGGTGGACGGGCCCTCGTTCGCGCTCAATGGAGACAACGTCTCCTACCGCTTCCATGTCGACAAC GCCACTGGTGACCTCTTATCCGACCATTTTGGTGCGCCCGTCGACGGCGACATCATCGAGGCCGAGGTCGGCCCC ATCAACGGCTGGGTCAATGTTGTCGGCAGGGTGCGACGGGAGCTGCCGGACCTGGGCCGCGGCGACTTCCGGACC CCGGCCATCCAGATCCGCCAGTCCGAGGGCTACCAGATCAGCGACTTCCAGTATCAGTCTCACGAGATTCTGCAG GGCAAGCCTCCGCTCAATGGCCTCCCCTCGACCTTCGGCGCCGACAACGACGTCTCTACTCTGCTCGTCCACCTG TACGACAAGTACAGCATGGTCGGCGCGGATCTGTCGTACTCCATCTTCCCCAAGTACGATGCCGTCGTGCGCAGC ATCACCGTCACCAACAAGGGCAGCAAAAACATCACCGTGGAAAAGCTGGCCAGCCTGAGCGTCGACATGCCGCTG GGTGACTATGAGATGCTCGAGCTCAGGGGGGACTGGGCGCGGGAGAGCATGCGAGTCCGCCGCAAGGTCGACTTC GGCACCCAAGGCTTCGCAAGCACCGCCGGCTACTCTTCTCACTTCCACAACCCCTTCTTCTCGCTCATGGCACCG GCGGCAACCGAGTCGCACGGCGAGGTCTGGGGCTTCTCCCTCGTCTACACGGGATCCTTTGCCGCCGAGATCGAA AAGGGCTCGCAGGGACTGACCCGCGCCATGATCGGCCTCAACCCGTCCCAGCTCTCCTGGCCGCTCGGCCCCGGC GAGGCCCTCGTGTCCCCAGAGGCCGTGGCCGTCTTCTCCGACACGGGCGTCGGAGGCATGTCGCGCAAGTTCCAC AGCCTCTACCGGAAGCACCTGATGAGGAGCAAGTTTGCGACGCAAACGCGCCCCGTCCTGCTCAACAGCTGGGAG GGGCTCCACTTCGACTATGACGCGAAGAAGATCCAGAAGCTGGCCGAGGAGTCTGCGAGTCTCGGCGTCAAGCTT TTCGTCCTCGACGACGGCTGGTTTGGGACCCAGCATCCGCGCGACGACGACAAAGCCGGGCTGGGAGACTGGGAA GTGAACCCCAGCAAGTTTCCCCAAGGCCTCGGCACGCTCGTCAACGGCGTCACGACGCTCAAGAGCGGCAACTCG TCCTCGGGAGCGAACATGAAGTTCGGACTGTGGTTCGAGCCGGAAATGGTCAACCCCAACTCGAGCCTGTACGAG AAGCACCCGGACTGGGCTCTGCACGCCGGCGGATACCCTCGGACCGAGACGCGCCACCAGCTGGTGCTCAACGTG GCGCTGCGCGAGGTGCAGGACTTCATCGTCGACTCCCTCACCAACATCCTCAACAGCTCGCGCATCGAGTACGTC AAGTGGGACAACAACCGGGGCATCCACGAGACGCCGGCGGCGACGACGGACCACGAGTACATGCTCGGCCTGTAC CGCGTCTTCAAGACGCTGACCGAGCGCTTCCCCGACGTCCTCTGGGAGGGCTGCGCCTCGGGCGGCGGGCGCCTG GACCCGGGCGTCCTGCAGTACTTCCCGCAGGTCTGGACCTCGGACGACACGGATGGGCTGGAGCGCGTCTATATC CAGTTCGGCAGCTCGCTAGCCTATCCACCGTCGGCCATGGGGGCCCACATTTCGGCGGTTCCGAACGGACAGACG GGACGGACGACGCCCATCGAGTTTCGGGCCCACGTGGCAATGATGGGCGGATCCTTCGGCCTCGAGCTGAACCCG GAGGAGATGCCGGCCGAGGACAGGGCCAAGCTGCCCGGCCTGATCGAGCTGGCCGAGAAGGTCAACCCGGTCGTC GTCAGGGGCGACATGTGGCGCCTCAGCCTGCCGGACGAGTCCAACTGGCCGGCGGCCCTGTTCGTGTCCGAAGAC GGCGGCCGGGCAGTCCTCTTCTACTTCCAGCTGCGGGCGACCATCAACAACTCGTGGCCGGCGCTGCGGCTGCAG GGGTTGGATGCGAAGGCCCGGTACAAGGTCGACGGCGGGCAGATAGTGTCGGGGGCGACGCTCATGAACAAGGGC CTCTCGTACAGGTTCGAGGGCGATTTCGCGAGCAGGCTCGTCTTCCTCGAGAGGCAGTAG |
Transcript | >Hirsu2|3439 ATGAGGAGCGCCCTTGTCGCGACACTTGGCCTGAGCCTGGCCCGGCTCGCTCACGCAGAAGCGGCGAGCCCGGCC CAGCCGATTGCGGTGGACGGGCCCTCGTTCGCGCTCAATGGAGACAACGTCTCCTACCGCTTCCATGTCGACAAC GCCACTGGTGACCTCTTATCCGACCATTTTGGTGCGCCCGTCGACGGCGACATCATCGAGGCCGAGGTCGGCCCC ATCAACGGCTGGGTCAATGTTGTCGGCAGGGTGCGACGGGAGCTGCCGGACCTGGGCCGCGGCGACTTCCGGACC CCGGCCATCCAGATCCGCCAGTCCGAGGGCTACCAGATCAGCGACTTCCAGTATCAGTCTCACGAGATTCTGCAG GGCAAGCCTCCGCTCAATGGCCTCCCCTCGACCTTCGGCGCCGACAACGACGTCTCTACTCTGCTCGTCCACCTG TACGACAAGTACAGCATGGTCGGCGCGGATCTGTCGTACTCCATCTTCCCCAAGTACGATGCCGTCGTGCGCAGC ATCACCGTCACCAACAAGGGCAGCAAAAACATCACCGTGGAAAAGCTGGCCAGCCTGAGCGTCGACATGCCGCTG GGTGACTATGAGATGCTCGAGCTCAGGGGGGACTGGGCGCGGGAGAGCATGCGAGTCCGCCGCAAGGTCGACTTC GGCACCCAAGGCTTCGCAAGCACCGCCGGCTACTCTTCTCACTTCCACAACCCCTTCTTCTCGCTCATGGCACCG GCGGCAACCGAGTCGCACGGCGAGGTCTGGGGCTTCTCCCTCGTCTACACGGGATCCTTTGCCGCCGAGATCGAA AAGGGCTCGCAGGGACTGACCCGCGCCATGATCGGCCTCAACCCGTCCCAGCTCTCCTGGCCGCTCGGCCCCGGC GAGGCCCTCGTGTCCCCAGAGGCCGTGGCCGTCTTCTCCGACACGGGCGTCGGAGGCATGTCGCGCAAGTTCCAC AGCCTCTACCGGAAGCACCTGATGAGGAGCAAGTTTGCGACGCAAACGCGCCCCGTCCTGCTCAACAGCTGGGAG GGGCTCCACTTCGACTATGACGCGAAGAAGATCCAGAAGCTGGCCGAGGAGTCTGCGAGTCTCGGCGTCAAGCTT TTCGTCCTCGACGACGGCTGGTTTGGGACCCAGCATCCGCGCGACGACGACAAAGCCGGGCTGGGAGACTGGGAA GTGAACCCCAGCAAGTTTCCCCAAGGCCTCGGCACGCTCGTCAACGGCGTCACGACGCTCAAGAGCGGCAACTCG TCCTCGGGAGCGAACATGAAGTTCGGACTGTGGTTCGAGCCGGAAATGGTCAACCCCAACTCGAGCCTGTACGAG AAGCACCCGGACTGGGCTCTGCACGCCGGCGGATACCCTCGGACCGAGACGCGCCACCAGCTGGTGCTCAACGTG GCGCTGCGCGAGGTGCAGGACTTCATCGTCGACTCCCTCACCAACATCCTCAACAGCTCGCGCATCGAGTACGTC AAGTGGGACAACAACCGGGGCATCCACGAGACGCCGGCGGCGACGACGGACCACGAGTACATGCTCGGCCTGTAC CGCGTCTTCAAGACGCTGACCGAGCGCTTCCCCGACGTCCTCTGGGAGGGCTGCGCCTCGGGCGGCGGGCGCCTG GACCCGGGCGTCCTGCAGTACTTCCCGCAGGTCTGGACCTCGGACGACACGGATGGGCTGGAGCGCGTCTATATC CAGTTCGGCAGCTCGCTAGCCTATCCACCGTCGGCCATGGGGGCCCACATTTCGGCGGTTCCGAACGGACAGACG GGACGGACGACGCCCATCGAGTTTCGGGCCCACGTGGCAATGATGGGCGGATCCTTCGGCCTCGAGCTGAACCCG GAGGAGATGCCGGCCGAGGACAGGGCCAAGCTGCCCGGCCTGATCGAGCTGGCCGAGAAGGTCAACCCGGTCGTC GTCAGGGGCGACATGTGGCGCCTCAGCCTGCCGGACGAGTCCAACTGGCCGGCGGCCCTGTTCGTGTCCGAAGAC GGCGGCCGGGCAGTCCTCTTCTACTTCCAGCTGCGGGCGACCATCAACAACTCGTGGCCGGCGCTGCGGCTGCAG GGGTTGGATGCGAAGGCCCGGTACAAGGTCGACGGCGGGCAGATAGTGTCGGGGGCGACGCTCATGAACAAGGGC CTCTCGTACAGGTTCGAGGGCGATTTCGCGAGCAGGCTCGTCTTCCTCGAGAGGCAGTAG |
Gene | >Hirsu2|3439 ATGAGGAGCGCCCTTGTCGCGACACTTGGCCTGAGCCTGGCCCGGCTCGCTCACGCAGAAGCGGCGAGCCCGGCC CAGCGTAGGTCTTACCGATCGACCTGCCCGTCTCCAATCTTGTCTGACCTCGACATGTAGCGATTGCGGTGGACG GGCCCTCGTTCGCGCTCAATGGAGACAACGTCTCCTACCGCTTCCATGTCGACAACGCCACTGGTGACCTCTTAT CCGACCATTTTGGTGCGCCCGTCGACGGCGACATCATCGAGGCCGAGGTCGGCCCCATCAACGGCTGGGTCAATG TTGTCGGCAGGGTGCGACGGGAGCTGCCGGACCTGGGCCGCGGCGACTTCCGGACCCCGGCCATCCAGATCCGCC AGTCCGAGGGCTACCAGATCAGCGACTTCCAGTATCAGTCTCACGAGATTCTGCAGGGCAAGCCTCCGCTCAATG GCCTCCCCTCGACCTTCGGCGCCGACAACGACGTCTCTACTCTGCTCGTCCACCTGTACGACAAGTACAGCATGG TCGGCGCGGATCTGTCGTACTCCATCTTCCCCAAGTACGATGCCGTCGTGCGCAGCATCACCGTCACCAACAAGG GCAGCAAAAACATCACCGTGGAAAAGCTGGCCAGCCTGAGCGTCGACATGCCGCTGGGTGACTATGAGATGCTCG AGCTCAGGGGGGACTGGGCGCGGGAGAGCATGCGAGTCCGCCGCAAGGTCGACTTCGGCACCCAAGGGTGAGTGC AACGAGCGTCCTGTGCCGATCGACGACTGACGTCCGCCTCAGCTTCGCAAGCACCGCCGGCTACTCTTCTCACTT CCACAACCCCTTCTTCTCGCTCATGGCACCGGCGGCAACCGAGTCGCACGGCGAGGTCTGGGGCTTCTCCCTCGT CTACACGGGATCCTTTGCCGCCGAGATCGAAAAGGGCTCGCAGGGACTGACCCGCGCCATGATCGGCCTCAACCC GTCCCAGCTCTCCTGGCCGCTCGGCCCCGGCGAGGCCCTCGTGTCCCCAGAGGCCGTGGCCGTCTTCTCCGACAC GGGCGTCGGAGGCATGTCGCGCAAGTTCCACAGCCTCTACCGGAAGCACCTGATGAGGAGCAAGTTTGCGACGCA AACGCGCCCCGTCCTGCTCAACAGCTGGGAGGGGCTCCACTTCGACTATGACGCGAAGAAGATCCAGAAGCTGGC CGAGGAGTCTGCGAGTCTCGGCGTCAAGCTTTTCGTCCTCGACGACGGCTGGTTTGGGACCCAGCATCCGCGCGA CGACGACAAAGCCGGGCTGGGAGACTGGGAAGTGAACCCCAGCAAGTTTCCCCAAGGCCTCGGCACGCTCGTCAA CGGCGTCACGACGCTCAAGAGCGGCAACTCGTCCTCGGGAGCGAACATGAAGTTCGGACTGTGGTTCGAGCCGGA AATGGTCAACCCCAACTCGAGCCTGTACGAGAAGCACCCGGACTGGGCTCTGCACGCCGGCGGATACCCTCGGAC CGAGACGCGCCACCAGCTGGTGCTCAACGTGGCGCTGCGCGAGGTGCAGGACTTCATCGTCGACTCCCTCACCAA CATCCTCAACAGCTCGCGCATCGAGTACGTCAAGTGGGACAACAACCGGGGCATCCACGAGACGCCGGCGGCGAC GACGGACCACGAGTACATGCTCGGCCTGTACCGCGTCTTCAAGACGCTGACCGAGCGCTTCCCCGACGTCCTCTG GGAGGGCTGCGCCTCGGGCGGCGGGCGCCTGGACCCGGGCGTCCTGCAGTACTTCCCGCAGGTCTGGACCTCGGA CGACACGGATGGGCTGGAGCGCGTCTATATCCAGTTCGGCAGCTCGCTAGCCTATCCACCGTCGGCCATGGGGGC CCACATTTCGGCGGTTCCGAACGGACAGACGGGACGGACGACGCCCATCGAGTTTCGGGCCCACGTGGCAATGAT GGGCGGATCCTTCGGCCTCGAGCTGAACCCGGAGGAGATGCCGGCCGAGGACAGGGCCAAGCTGCCCGGCCTGAT CGAGCTGGCCGAGAAGGTCAACCCGGTCGTCGTCAGGGGCGACATGTGGCGCCTCAGCCTGCCGGACGAGTCCAA CTGGCCGGCGGCCCTGTTCGTGTCCGAAGACGGCGGCCGGGCAGTCCTCTTCTACTTCCAGCTGCGGGCGACCAT CAACAACTCGTGGCCGGCGCTGCGGCTGCAGGGGTTGGATGCGAAGGCCCGGTACAAGGTCGACGGCGGGCAGAT AGTGTCGGGGGCGACGCTCATGAACAAGGGCCTCTCGTACAGGTTCGAGGGCGATTTCGCGAGCAGGCTCGTCTT CCTCGAGAGGCAGTAG |