Protein ID | OphauB2|3271 |
Gene name | |
Location | Contig_21:103383..105417 |
Strand | - |
Gene length (bp) | 2034 |
Transcript length (bp) | 1971 |
Coding sequence length (bp) | 1971 |
Protein length (aa) | 657 |
PFAM Domain ID | Short name | Long name | E-value | Start | End |
---|---|---|---|---|---|
PF03343 | SART-1 | SART-1 family | 4.8E-160 | 71 | 620 |
PF19252 | HIND | HIND motif | 2.2E-07 | 6 | 24 |
Swissprot ID | Swissprot Description | Start | End | E-value |
---|---|---|---|---|
sp|O94538|SNU66_SCHPO | U4/U6.U5 tri-snRNP-associated protein snu66 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=snu66 PE=1 SV=1 | 344 | 629 | 2.0E-40 |
sp|O43290|SNUT1_HUMAN | U4/U6.U5 tri-snRNP-associated protein 1 OS=Homo sapiens GN=SART1 PE=1 SV=1 | 3 | 285 | 3.0E-19 |
sp|Q9Z315|SNUT1_MOUSE | U4/U6.U5 tri-snRNP-associated protein 1 OS=Mus musculus GN=Sart1 PE=1 SV=1 | 5 | 285 | 7.0E-17 |
sp|Q5XIW8|SNUT1_RAT | U4/U6.U5 tri-snRNP-associated protein 1 OS=Rattus norvegicus GN=Sart1 PE=1 SV=1 | 5 | 285 | 8.0E-17 |
sp|O43290|SNUT1_HUMAN | U4/U6.U5 tri-snRNP-associated protein 1 OS=Homo sapiens GN=SART1 PE=1 SV=1 | 391 | 620 | 2.0E-11 |
Swissprot ID | Swissprot Description | Start | End | E-value |
---|---|---|---|---|
sp|O94538|SNU66_SCHPO | U4/U6.U5 tri-snRNP-associated protein snu66 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=snu66 PE=1 SV=1 | 344 | 629 | 2.0E-40 |
sp|O43290|SNUT1_HUMAN | U4/U6.U5 tri-snRNP-associated protein 1 OS=Homo sapiens GN=SART1 PE=1 SV=1 | 3 | 285 | 3.0E-19 |
sp|Q9Z315|SNUT1_MOUSE | U4/U6.U5 tri-snRNP-associated protein 1 OS=Mus musculus GN=Sart1 PE=1 SV=1 | 5 | 285 | 7.0E-17 |
sp|Q5XIW8|SNUT1_RAT | U4/U6.U5 tri-snRNP-associated protein 1 OS=Rattus norvegicus GN=Sart1 PE=1 SV=1 | 5 | 285 | 8.0E-17 |
sp|O43290|SNUT1_HUMAN | U4/U6.U5 tri-snRNP-associated protein 1 OS=Homo sapiens GN=SART1 PE=1 SV=1 | 391 | 620 | 2.0E-11 |
sp|Q9Z315|SNUT1_MOUSE | U4/U6.U5 tri-snRNP-associated protein 1 OS=Mus musculus GN=Sart1 PE=1 SV=1 | 571 | 620 | 6.0E-11 |
sp|Q5XIW8|SNUT1_RAT | U4/U6.U5 tri-snRNP-associated protein 1 OS=Rattus norvegicus GN=Sart1 PE=1 SV=1 | 571 | 620 | 6.0E-11 |
sp|Q9LFE0|DOT2_ARATH | SART-1 family protein DOT2 OS=Arabidopsis thaliana GN=DOT2 PE=1 SV=1 | 575 | 620 | 8.0E-07 |
GO Term | Description | Terminal node |
---|---|---|
GO:0046540 | U4/U6 x U5 tri-snRNP complex | Yes |
GO:0000398 | mRNA splicing, via spliceosome | Yes |
GO:0034641 | cellular nitrogen compound metabolic process | No |
GO:0090304 | nucleic acid metabolic process | No |
GO:0006397 | mRNA processing | No |
GO:1901360 | organic cyclic compound metabolic process | No |
GO:0006807 | nitrogen compound metabolic process | No |
GO:0097525 | spliceosomal snRNP complex | No |
GO:0000375 | RNA splicing, via transesterification reactions | No |
GO:0030532 | small nuclear ribonucleoprotein complex | No |
GO:0140513 | nuclear protein-containing complex | No |
GO:0006396 | RNA processing | No |
GO:0008150 | biological_process | No |
GO:0006725 | cellular aromatic compound metabolic process | No |
GO:0005575 | cellular_component | No |
GO:0043170 | macromolecule metabolic process | No |
GO:0000377 | RNA splicing, via transesterification reactions with bulged adenosine as nucleophile | No |
GO:0071704 | organic substance metabolic process | No |
GO:1990904 | ribonucleoprotein complex | No |
GO:0044237 | cellular metabolic process | No |
GO:0008152 | metabolic process | No |
GO:0009987 | cellular process | No |
GO:0008380 | RNA splicing | No |
GO:0044238 | primary metabolic process | No |
GO:0032991 | protein-containing complex | No |
GO:0120114 | Sm-like protein family complex | No |
GO:0097526 | spliceosomal tri-snRNP complex | No |
GO:0046483 | heterocycle metabolic process | No |
GO:0016071 | mRNA metabolic process | No |
GO:0016070 | RNA metabolic process | No |
GO:0006139 | nucleobase-containing compound metabolic process | No |
SignalP signal predicted | Location (based on Ymax) |
D score (significance: > 0.45) |
---|---|---|
No | 1 - 11 | 0.45 |
Transcription Factor Class (based on PFAM domains) |
---|
SART1 |
Type of sequence | Sequence |
---|---|
Locus | Download genbank file of locus
The gene with 5 kb flanks (if sufficient flanking sequence is available). For use in cloning design programs. NOTE: features (genes or exons) that are only partially contained within the sequence are completely excluded. |
Protein | >OphauB2|3271 MDAATIHETNRIRLSLGMKPLPVPGAQQAESSHSSDSDGEQASTLETRQAKAYDNYNKHIETEKLKKRRDEKSSA ARKAREKAQRFALIQGKGLADTQDGQDDAKSWLMGLKKRQKKVAEARKLEEELAAAEAAAAQNIQYTSKDLAGLK VAHDTSAFLEGGEQILTLKDATIDENEEQGDELENINLREEEKLQNRLDLKKMRPGYNPNDDNQDEQRGILSQYD EEINGKKTTRFTLDSDGAIAEMSDVMGQSAPKTNKLQNINLDDIVGNMPISSDYLTPSEIKVKKPKKKKKNTRRK QVDDDEDSLFPVQPVNKATADAMDIDSKDDMASRKRKAEADDLDDDDLQASLMIQRRNALKKRRKIKPQDIAKQL KEQVDEPDHDTGAEDGGLIIGDTSEFVAGLSKHIDQDEEIAAKKRTMEREAATRGSPDDEDAWMEDADGYEVGGA HKPEPPEPGANVPEDGFDDEKAVGQGMGAALSLLRERGLIEESQGNDRHSNFRQQQEFLLRKKELEEELEAKARQ QRERDRANGKLDRMSRADREDYARQQNAWRDQQLSRRMADLISAHYRPSVALRYTDEHGRHLGQKEAFKHLSHQF HGKGSGKGKTEKKLKQIEDEKRREAQSLFDASQGGGMNAATTQQLKKRKEAGVRLG* |
Coding | >OphauB2|3271 ATGGACGCCGCAACGATTCACGAGACGAACCGCATCCGTCTGTCGCTGGGCATGAAACCCCTCCCTGTCCCGGGA GCACAGCAGGCCGAATCCAGCCATTCTTCAGACAGTGATGGAGAGCAAGCCAGTACGCTCGAGACGCGCCAGGCC AAGGCCTACGACAACTACAACAAGCACATAGAGACGGAGAAGCTCAAAAAGCGCCGCGATGAAAAGTCGTCTGCC GCTCGCAAGGCGCGCGAAAAGGCTCAGCGCTTTGCCCTCATCCAAGGCAAGGGCCTCGCAGACACCCAAGACGGT CAAGACGATGCAAAATCGTGGCTCATGGGCCTCAAGAAGCGGCAAAAAAAGGTTGCAGAAGCCAGGAAACTCGAG GAGGAACTGGCCGCTGCCGAAGCTGCTGCCGCCCAAAACATCCAATACACTTCAAAAGACCTGGCCGGCCTCAAG GTTGCACACGATACATCAGCCTTTTTGGAGGGTGGCGAGCAGATTCTCACTCTCAAGGATGCCACGATTGATGAA AACGAAGAACAAGGCGATGAACTAGAAAATATTAATCTTCGAGAAGAAGAAAAACTTCAGAATAGACTTGATCTC AAAAAGATGCGGCCCGGTTACAATCCCAACGACGACAACCAAGATGAGCAGCGTGGTATATTGTCCCAGTACGAT GAAGAAATCAATGGCAAAAAGACTACAAGATTTACCCTCGACTCGGATGGAGCCATTGCCGAAATGTCAGATGTG ATGGGGCAGTCGGCTCCAAAAACAAACAAGCTACAAAACATCAATCTAGACGACATTGTTGGAAACATGCCAATT TCGTCAGATTATTTGACCCCTTCGGAAATCAAGGTCAAGAAGCCAAAGAAGAAGAAGAAGAATACAAGACGGAAG CAAGTCGACGATGATGAAGACTCTCTATTTCCCGTTCAACCTGTCAACAAGGCTACAGCGGATGCCATGGACATT GATTCCAAAGATGACATGGCCAGTCGGAAAAGAAAAGCAGAGGCAGATGACCTGGATGATGATGACCTTCAAGCA TCGCTCATGATTCAGCGCCGGAATGCTCTCAAGAAGCGCAGGAAAATCAAGCCGCAGGACATTGCAAAACAACTC AAGGAACAAGTGGATGAGCCAGACCATGACACGGGTGCCGAGGATGGCGGGCTAATCATTGGCGACACGTCGGAG TTTGTCGCCGGGTTGAGCAAGCACATTGATCAGGATGAAGAAATCGCAGCGAAAAAGCGCACAATGGAAAGAGAA GCGGCAACAAGGGGCTCGCCTGACGATGAGGACGCCTGGATGGAAGACGCCGACGGGTACGAGGTTGGCGGCGCG CACAAACCGGAGCCGCCTGAGCCTGGCGCCAATGTCCCAGAAGACGGTTTTGATGACGAAAAAGCTGTCGGCCAG GGCATGGGCGCCGCTTTGTCCCTGCTGCGCGAGCGAGGCCTCATTGAAGAGTCGCAGGGCAACGACCGACACTCA AACTTTCGCCAACAGCAAGAATTCCTGCTGCGCAAAAAGGAGCTCGAGGAGGAACTCGAAGCCAAGGCGCGCCAG CAGCGAGAACGCGACCGCGCCAATGGCAAGCTGGATCGCATGTCGCGCGCCGACCGCGAAGACTATGCCCGCCAG CAAAATGCCTGGCGCGACCAGCAGCTGTCGCGCCGCATGGCCGACCTTATTTCGGCGCACTATAGGCCCAGTGTC GCCCTCCGCTACACCGACGAGCATGGCCGACACTTGGGCCAAAAGGAGGCGTTTAAGCACCTGAGCCACCAGTTC CACGGCAAGGGCAGCGGCAAGGGCAAGACGGAGAAGAAGCTCAAGCAGATTGAGGATGAGAAGCGCCGCGAGGCA CAGAGTTTGTTTGATGCGAGCCAGGGTGGGGGCATGAATGCCGCTACGACTCAGCAGCTCAAGAAGCGCAAAGAA GCGGGTGTTCGGTTGGGGTGA |
Transcript | >OphauB2|3271 ATGGACGCCGCAACGATTCACGAGACGAACCGCATCCGTCTGTCGCTGGGCATGAAACCCCTCCCTGTCCCGGGA GCACAGCAGGCCGAATCCAGCCATTCTTCAGACAGTGATGGAGAGCAAGCCAGTACGCTCGAGACGCGCCAGGCC AAGGCCTACGACAACTACAACAAGCACATAGAGACGGAGAAGCTCAAAAAGCGCCGCGATGAAAAGTCGTCTGCC GCTCGCAAGGCGCGCGAAAAGGCTCAGCGCTTTGCCCTCATCCAAGGCAAGGGCCTCGCAGACACCCAAGACGGT CAAGACGATGCAAAATCGTGGCTCATGGGCCTCAAGAAGCGGCAAAAAAAGGTTGCAGAAGCCAGGAAACTCGAG GAGGAACTGGCCGCTGCCGAAGCTGCTGCCGCCCAAAACATCCAATACACTTCAAAAGACCTGGCCGGCCTCAAG GTTGCACACGATACATCAGCCTTTTTGGAGGGTGGCGAGCAGATTCTCACTCTCAAGGATGCCACGATTGATGAA AACGAAGAACAAGGCGATGAACTAGAAAATATTAATCTTCGAGAAGAAGAAAAACTTCAGAATAGACTTGATCTC AAAAAGATGCGGCCCGGTTACAATCCCAACGACGACAACCAAGATGAGCAGCGTGGTATATTGTCCCAGTACGAT GAAGAAATCAATGGCAAAAAGACTACAAGATTTACCCTCGACTCGGATGGAGCCATTGCCGAAATGTCAGATGTG ATGGGGCAGTCGGCTCCAAAAACAAACAAGCTACAAAACATCAATCTAGACGACATTGTTGGAAACATGCCAATT TCGTCAGATTATTTGACCCCTTCGGAAATCAAGGTCAAGAAGCCAAAGAAGAAGAAGAAGAATACAAGACGGAAG CAAGTCGACGATGATGAAGACTCTCTATTTCCCGTTCAACCTGTCAACAAGGCTACAGCGGATGCCATGGACATT GATTCCAAAGATGACATGGCCAGTCGGAAAAGAAAAGCAGAGGCAGATGACCTGGATGATGATGACCTTCAAGCA TCGCTCATGATTCAGCGCCGGAATGCTCTCAAGAAGCGCAGGAAAATCAAGCCGCAGGACATTGCAAAACAACTC AAGGAACAAGTGGATGAGCCAGACCATGACACGGGTGCCGAGGATGGCGGGCTAATCATTGGCGACACGTCGGAG TTTGTCGCCGGGTTGAGCAAGCACATTGATCAGGATGAAGAAATCGCAGCGAAAAAGCGCACAATGGAAAGAGAA GCGGCAACAAGGGGCTCGCCTGACGATGAGGACGCCTGGATGGAAGACGCCGACGGGTACGAGGTTGGCGGCGCG CACAAACCGGAGCCGCCTGAGCCTGGCGCCAATGTCCCAGAAGACGGTTTTGATGACGAAAAAGCTGTCGGCCAG GGCATGGGCGCCGCTTTGTCCCTGCTGCGCGAGCGAGGCCTCATTGAAGAGTCGCAGGGCAACGACCGACACTCA AACTTTCGCCAACAGCAAGAATTCCTGCTGCGCAAAAAGGAGCTCGAGGAGGAACTCGAAGCCAAGGCGCGCCAG CAGCGAGAACGCGACCGCGCCAATGGCAAGCTGGATCGCATGTCGCGCGCCGACCGCGAAGACTATGCCCGCCAG CAAAATGCCTGGCGCGACCAGCAGCTGTCGCGCCGCATGGCCGACCTTATTTCGGCGCACTATAGGCCCAGTGTC GCCCTCCGCTACACCGACGAGCATGGCCGACACTTGGGCCAAAAGGAGGCGTTTAAGCACCTGAGCCACCAGTTC CACGGCAAGGGCAGCGGCAAGGGCAAGACGGAGAAGAAGCTCAAGCAGATTGAGGATGAGAAGCGCCGCGAGGCA CAGAGTTTGTTTGATGCGAGCCAGGGTGGGGGCATGAATGCCGCTACGACTCAGCAGCTCAAGAAGCGCAAAGAA GCGGGTGTTCGGTTGGGGTGA |
Gene | >OphauB2|3271 ATGGACGCCGCAACGATTCACGAGACGAACCGCATCCGTCTGTCGCTGGGCATGAAACCCCTCCCTGTCCCGGGA GCACAGCAGGCCGAATCCAGCCATTCTTCAGACAGTGATGGAGAGCAAGCCAGTACGCTCGAGACGCGCCAGGCC AAGGCCTACGACAACTACAACAAGCACATAGAGACGGAGAAGCTCAAAAAGCGCCGCGATGAAAAGTCGTCTGCC GCTCGCAAGGCGCGCGAAAAGGCTCAGCGCTTTGCCCTCATCCAAGGCAAGGGCCTCGCAGACACCCAAGACGGT CAAGACGATGCAAAATCGTGGCTCATGGGCCTCAAGAAGCGGCAAAAAAAGGTTGCAGAAGCCAGGAAACTCGAG GAGGAACTGGCCGCTGCCGAAGCTGCTGCCGCCCAAAACATCCAATACACTTCAAAAGACCTGGCCGGCCTCAAG GTTGCACACGATACATCAGCCTTTTTGGAGGGTGGCGAGCAGATTCTCACTCTCAAGGATGCCACGATTGATGAA AACGAAGAACAAGGCGATGAACTAGAAAATATTAATCTTCGAGAAGAAGAAAAACTTCAGAATAGACTTGATCTC AAAAAGATGCGGCCCGGTTACAATCCCAACGACGACAACCAAGATGAGCAGCGTGGTATATTGTCCCAGTACGAT GAAGAAATCAATGGCAAAAAGACTACAAGATTTACCCTCGACTCGGATGGAGCCATTGCCGAAATGTCAGATGTG ATGGGGCAGTCGGCTCCAAAAACAAACAAGCTACAAAACATCAATCTAGACGACATTGTTGGCGAGTGCCTCTCA CAACGTTTTCTTTTGAATGAGTTGTAACAAACCTAATACCATGGTCTAGGAAACATGCCAATTTCGTCAGATTAT TTGACCCCTTCGGAAATCAAGGTCAAGAAGCCAAAGAAGAAGAAGAAGAATACAAGACGGAAGCAAGTCGACGAT GATGAAGACTCTCTATTTCCCGTTCAACCTGTCAACAAGGCTACAGCGGATGCCATGGACATTGATTCCAAAGAT GACATGGCCAGTCGGAAAAGAAAAGCAGAGGCAGATGACCTGGATGATGATGACCTTCAAGCATCGCTCATGATT CAGCGCCGGAATGCTCTCAAGAAGCGCAGGAAAATCAAGCCGCAGGACATTGCAAAACAACTCAAGGAACAAGTG GATGAGCCAGACCATGACACGGGTGCCGAGGATGGCGGGCTAATCATTGGCGACACGTCGGAGTTTGTCGCCGGG TTGAGCAAGCACATTGATCAGGATGAAGAAATCGCAGCGAAAAAGCGCACAATGGAAAGAGAAGCGGCAACAAGG GGCTCGCCTGACGATGAGGACGCCTGGATGGAAGACGCCGACGGGTACGAGGTTGGCGGCGCGCACAAACCGGAG CCGCCTGAGCCTGGCGCCAATGTCCCAGAAGACGGTTTTGATGACGAAAAAGCTGTCGGCCAGGGCATGGGCGCC GCTTTGTCCCTGCTGCGCGAGCGAGGCCTCATTGAAGAGTCGCAGGGCAACGACCGACACTCAAACTTTCGCCAA CAGCAAGAATTCCTGCTGCGCAAAAAGGAGCTCGAGGAGGAACTCGAAGCCAAGGCGCGCCAGCAGCGAGAACGC GACCGCGCCAATGGCAAGCTGGATCGCATGTCGCGCGCCGACCGCGAAGACTATGCCCGCCAGCAAAATGCCTGG CGCGACCAGCAGCTGTCGCGCCGCATGGCCGACCTTATTTCGGCGCACTATAGGCCCAGTGTCGCCCTCCGCTAC ACCGACGAGCATGGCCGACACTTGGGCCAAAAGGAGGCGTTTAAGCACCTGAGCCACCAGTTCCACGGCAAGGGC AGCGGCAAGGGCAAGACGGAGAAGAAGCTCAAGCAGATTGAGGATGAGAAGCGCCGCGAGGCACAGAGTTTGTTT GATGCGAGCCAGGGTGGGGGCATGAATGCCGCTACGACTCAGCAGCTCAAGAAGCGCAAAGAAGCGGGTGTTCGG TTGGGGTGA |