Protein ID | Hirsu2|4541 |
Gene name | |
Location | Contig_230:19540..21334 |
Strand | + |
Gene length (bp) | 1794 |
Transcript length (bp) | 741 |
Coding sequence length (bp) | 741 |
Protein length (aa) | 247 |
PFAM Domain ID | Short name | Long name | E-value | Start | End |
---|---|---|---|---|---|
PF00076 | RRM_1 | RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain) | 3.9E-17 | 68 | 138 |
Swissprot ID | Swissprot Description | Start | End | E-value |
---|---|---|---|---|
sp|Q09330|MLO3_SCHPO | mRNA export protein mlo3 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=mlo3 PE=1 SV=1 | 1 | 225 | 4.0E-33 |
sp|Q12159|YRA1_YEAST | RNA annealing protein YRA1 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=YRA1 PE=1 SV=2 | 1 | 218 | 3.0E-22 |
sp|Q8L773|THO4A_ARATH | THO complex subunit 4A OS=Arabidopsis thaliana GN=ALY1 PE=1 SV=1 | 1 | 147 | 2.0E-10 |
sp|Q58EA2|THO4A_XENLA | THO complex subunit 4-A OS=Xenopus laevis GN=alyref-a PE=2 SV=1 | 34 | 150 | 2.0E-08 |
sp|Q28FB9|THOC4_XENTR | THO complex subunit 4 OS=Xenopus tropicalis GN=alyref PE=2 SV=1 | 67 | 150 | 1.0E-07 |
Swissprot ID | Swissprot Description | Start | End | E-value |
---|---|---|---|---|
sp|Q09330|MLO3_SCHPO | mRNA export protein mlo3 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=mlo3 PE=1 SV=1 | 1 | 225 | 4.0E-33 |
sp|Q12159|YRA1_YEAST | RNA annealing protein YRA1 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=YRA1 PE=1 SV=2 | 1 | 218 | 3.0E-22 |
sp|Q8L773|THO4A_ARATH | THO complex subunit 4A OS=Arabidopsis thaliana GN=ALY1 PE=1 SV=1 | 1 | 147 | 2.0E-10 |
sp|Q58EA2|THO4A_XENLA | THO complex subunit 4-A OS=Xenopus laevis GN=alyref-a PE=2 SV=1 | 34 | 150 | 2.0E-08 |
sp|Q28FB9|THOC4_XENTR | THO complex subunit 4 OS=Xenopus tropicalis GN=alyref PE=2 SV=1 | 67 | 150 | 1.0E-07 |
sp|B5FXN8|THOC4_TAEGU | THO complex subunit 4 OS=Taeniopygia guttata GN=ALYREF PE=2 SV=1 | 60 | 144 | 3.0E-07 |
sp|Q6GLW1|THO4B_XENLA | THO complex subunit 4-B OS=Xenopus laevis GN=alyref-b PE=2 SV=1 | 34 | 150 | 3.0E-07 |
sp|Q94EH8|THO4C_ARATH | THO complex subunit 4C OS=Arabidopsis thaliana GN=ALY3 PE=1 SV=1 | 66 | 142 | 5.0E-07 |
sp|Q86V81|THOC4_HUMAN | THO complex subunit 4 OS=Homo sapiens GN=ALYREF PE=1 SV=3 | 67 | 150 | 5.0E-07 |
sp|O08583|THOC4_MOUSE | THO complex subunit 4 OS=Mus musculus GN=Alyref PE=1 SV=3 | 67 | 144 | 6.0E-07 |
sp|Q8L719|THO4B_ARATH | THO complex subunit 4B OS=Arabidopsis thaliana GN=ALY2 PE=1 SV=1 | 60 | 142 | 8.0E-07 |
sp|Q3T0I4|THOC4_BOVIN | THO complex subunit 4 OS=Bos taurus GN=ALYREF PE=2 SV=1 | 67 | 144 | 1.0E-06 |
sp|Q6NQ72|THO4D_ARATH | THO complex subunit 4D OS=Arabidopsis thaliana GN=ALY4 PE=1 SV=1 | 66 | 142 | 2.0E-06 |
GO Term | Description | Terminal node |
---|---|---|
GO:0003723 | RNA binding | Yes |
GO:0097159 | organic cyclic compound binding | No |
GO:0005488 | binding | No |
GO:1901363 | heterocyclic compound binding | No |
GO:0003676 | nucleic acid binding | No |
GO:0003674 | molecular_function | No |
Localizations | Signals | Cytoplasm | Nucleus | Extracellular | Cell membrane | Mitochondrion | Plastid | Endoplasmic reticulum | Lysosome vacuole | Golgi apparatus | Peroxisome |
---|---|---|---|---|---|---|---|---|---|---|---|
Cytoplasm|Nucleus | Nuclear localization signal | 0.535 | 0.7416 | 0.0096 | 0.018 | 0.0786 | 0.0118 | 0.1062 | 0.0045 | 0.0294 | 0.0034 |
Orthofinder run ID | 4 |
Orthogroup | 844 |
Change Orthofinder run |
Type of sequence | Sequence |
---|---|
Locus | Download genbank file of locus
Download genbank file of locus (reverse complement)
The gene with 5 kb flanks (if sufficient flanking sequence is available). For use in cloning design programs. NOTE: features (genes or exons) that are only partially contained within the sequence are completely excluded. |
Protein | >Hirsu2|4541 MSGKLDKPLDEIVSAQRRSAAGRRRTPRRPAGRPVTSAPVGGVHKATRASAAKPAPAKSASINGESKVIVSNLPK DVSEQQIKEYFVQSVGPIKRVDLVYGPNSVSRGIANVTFHKSDGASKAFQKLNGLLVDNRPIKIEIVVSAAQADK VIPPIKTLAERTSQPKAQPKSAASSKQSTAIAKGVAGKAAANKKRRGKSVRPMKKTAEELDSEMADYFVSTGGNE NAAGGATTATNGDAAMEDEIM* |
Coding | >Hirsu2|4541 ATGTCTGGAAAGCTTGACAAGCCTCTCGACGAGATCGTCTCGGCTCAGCGCCGCTCGGCTGCTGGACGTCGTCGC ACGCCGCGACGACCTGCCGGCCGACCTGTCACCTCCGCCCCCGTCGGCGGTGTCCACAAGGCCACTCGCGCTAGC GCCGCGAAGCCTGCTCCGGCCAAGAGTGCCTCCATCAACGGCGAAAGCAAAGTCATCGTCAGCAACCTGCCCAAG GACGTGTCGGAGCAGCAAATCAAGGAATATTTCGTCCAGTCGGTCGGGCCCATCAAGAGAGTCGACCTTGTCTAC GGCCCGAACTCGGTCAGCCGAGGCATCGCGAATGTGACGTTCCACAAGTCGGACGGGGCCAGCAAGGCCTTCCAG AAACTCAACGGCCTGCTCGTCGACAACCGACCCATCAAGATCGAAATTGTCGTCAGTGCCGCCCAGGCGGACAAG GTGATCCCACCGATCAAGACACTGGCGGAGCGTACCAGTCAACCCAAGGCCCAGCCCAAGTCTGCGGCCAGCAGC AAGCAGAGCACCGCCATCGCCAAGGGCGTGGCCGGCAAGGCAGCGGCCAACAAGAAGCGCCGGGGCAAGAGCGTG CGACCGATGAAGAAGACGGCGGAGGAGCTGGACTCGGAGATGGCAGACTACTTCGTCAGCACGGGCGGCAATGAA AACGCGGCGGGCGGTGCGACGACGGCAACCAACGGCGACGCTGCCATGGAGGACGAGATCATGTGA |
Transcript | >Hirsu2|4541 ATGTCTGGAAAGCTTGACAAGCCTCTCGACGAGATCGTCTCGGCTCAGCGCCGCTCGGCTGCTGGACGTCGTCGC ACGCCGCGACGACCTGCCGGCCGACCTGTCACCTCCGCCCCCGTCGGCGGTGTCCACAAGGCCACTCGCGCTAGC GCCGCGAAGCCTGCTCCGGCCAAGAGTGCCTCCATCAACGGCGAAAGCAAAGTCATCGTCAGCAACCTGCCCAAG GACGTGTCGGAGCAGCAAATCAAGGAATATTTCGTCCAGTCGGTCGGGCCCATCAAGAGAGTCGACCTTGTCTAC GGCCCGAACTCGGTCAGCCGAGGCATCGCGAATGTGACGTTCCACAAGTCGGACGGGGCCAGCAAGGCCTTCCAG AAACTCAACGGCCTGCTCGTCGACAACCGACCCATCAAGATCGAAATTGTCGTCAGTGCCGCCCAGGCGGACAAG GTGATCCCACCGATCAAGACACTGGCGGAGCGTACCAGTCAACCCAAGGCCCAGCCCAAGTCTGCGGCCAGCAGC AAGCAGAGCACCGCCATCGCCAAGGGCGTGGCCGGCAAGGCAGCGGCCAACAAGAAGCGCCGGGGCAAGAGCGTG CGACCGATGAAGAAGACGGCGGAGGAGCTGGACTCGGAGATGGCAGACTACTTCGTCAGCACGGGCGGCAATGAA AACGCGGCGGGCGGTGCGACGACGGCAACCAACGGCGACGCTGCCATGGAGGACGAGATCATGTGA |
Gene | >Hirsu2|4541 ATGTCTGGAAAGCTTGACAAGCCTCTCGACGAGATCGTCTCGGCTCAGCGCCGCTCGGCTGCTGGACGTCGTCGC ACGCCGCGACGACCTGCCGGCCGACCTGTCACCTCCGCCCCCGTCGGCGGTGTCCACAAGGCCACTCGCGCTAGC GCCGCGAAGCCTGCTCCGGCCAAGAGTGCCTCCATCAACGGCGAAAGCAAAGTCATCGTCAGCAACCTGGTAAAT ACAACCTCGCCATTTACGTTCCGCATCTGAAGAGCGCATAGTGCTAACACTGCTCAACAGCCCAAGGACGTGTCG GAGCAGCAAATCAAGGTATGTTTCCGTTGAGGCGCCGTGTCCTCCGGACTTCATTCGACCTGTTATTCTGGCCAC ACCACACCTCTTTTTTTGTCGACGCGCGGTCTCCCCTTGGAAGCCACGACTCTTGCTATCGAACCGCCCGTCGTC TCCTTCCTAGCATTCATACATTGCATTGCGTCACGACGCGTTGCGCTGGAAGTTGAGAGCAGTCTTGGTTTACCG ATATGCCCTCCACATGCCTGGTCAGCATGCTGCGACGGGGTACGGTGCCTTTGAATAGACTGTGGCGTTGTATCG TACCAGCACACGCGTCTCGCGGCCGTCCTTGGACGATCCGATTTCGGCTGGCAAAGTTGGTCCGAGCGAGCATAG CGGTGAGCACATGTCACAAGACGCTGGGTCGCCGAGCGAGGGACTCGGCCGGTCGGTATGGATTCACGAGCGTCA AGGAGCCCAGGACGTGGGTGACGTCTTCCGACGACACGAGGTTGGCATAGTTTGGCTGGTCTTTGCGCTCATGAG CTGTCGGGGCTGCCGAGTGTGCCTTCTGGGGCAGGCCACTCAGGGAAGCAGTTTGACGTCGGTTCCTGCCGCACA TTTTGGGCTTGCGATAACACGGCGTGGCGGCCGTGTTTGCTCACCACCAACAGCGTCTGTCATGACTCACTGCTC TGCTCGCTCGGTTCCTCGATTCCTCTTGCTTCACGGCGGCGGCGGCGGCGGCGGCGATACGTGCCCGGAACGACG TCGGACGATTGTCGAGAAACGGACTTGGGCGGTGCTTGTCCCACTCCGCGCTTTTGATTCACTAACTCCTGTGCC CGATAGGAATATTTCGTCCAGTCGGTCGGGCCCATCAAGAGAGTCGACCTTGTCTACGGCCCGAACTCGGTCAGC CGAGGCATCGCGAATGTGACGTTCCACAAGTCGGACGGGGCCAGCAAGGCCTTCCAGAAACTCAACGGCCTGCTC GTCGACAACCGACCCATCAAGGTATGGCACACAGCCCGGCTGCGAGTCGGCAGCGCGGCCAGGTGGCTGACGCGA GGCAGATCGAAATTGTCGTCAGTGCCGCCCAGGCGGACAAGGTGATCCCACCGATCAAGACACTGGCGGAGCGTA CCAGGTAGGTCTCGACGGCTTTCGCGTCAGACGAGCAGGGCTGACAGGGGCGCAGTCAACCCAAGGCCCAGCCCA AGTCTGCGGCCAGCAGCAAGCAGAGCACCGCCATCGCCAAGGGCGTGGCCGGCAAGGCAGCGGCCAACAAGAAGC GCCGGGGCAAGAGCGTGCGACCGATGAAGAAGACGGCGGAGGAGCTGGACTCGGAGATGGCAGACTACTTCGTCA GCACGGGCGGCAATGAAAACGCGGCGGGCGGTGCGACGACGGCAACCAACGGCGACGCTGCCATGGAGGACGAGA TCATGGTAGGTGATTCAATTTCAGCAGTACACGAGTCAAGGAGCTGACGCTGATGTGTCGCTGTAGTGA |