Fungal Genomics

at Utrecht University

General Properties

Protein IDHirsu2|5133
Gene name
LocationContig_257:14575..16947
Strand-
Gene length (bp)2372
Transcript length (bp)2193
Coding sequence length (bp)2193
Protein length (aa) 731

Your browser does not support drawing a protein figure.

PFAM Domains

PFAM Domain ID Short name Long name E-value Start End
PF11526 CFIA_Pcf11 Subunit of cleavage factor IA Pcf11 1.1E-04 584 633

Swissprot hits

[Show all]
Swissprot ID Swissprot Description Start End E-value
sp|Q10237|YD14_SCHPO Uncharacterized protein C4G9.04c OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=SPAC4G9.04c PE=3 SV=1 252 690 2.0E-41
sp|Q10237|YD14_SCHPO Uncharacterized protein C4G9.04c OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=SPAC4G9.04c PE=3 SV=1 8 152 2.0E-35
sp|P39081|PCF11_YEAST Protein PCF11 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=PCF11 PE=1 SV=2 1 170 3.0E-24
sp|O94913|PCF11_HUMAN Pre-mRNA cleavage complex 2 protein Pcf11 OS=Homo sapiens GN=PCF11 PE=1 SV=3 525 689 2.0E-19
sp|O94913|PCF11_HUMAN Pre-mRNA cleavage complex 2 protein Pcf11 OS=Homo sapiens GN=PCF11 PE=1 SV=3 7 117 2.0E-18
[Show all]
[Show less]
Swissprot ID Swissprot Description Start End E-value
sp|Q10237|YD14_SCHPO Uncharacterized protein C4G9.04c OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=SPAC4G9.04c PE=3 SV=1 252 690 2.0E-41
sp|Q10237|YD14_SCHPO Uncharacterized protein C4G9.04c OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=SPAC4G9.04c PE=3 SV=1 8 152 2.0E-35
sp|P39081|PCF11_YEAST Protein PCF11 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=PCF11 PE=1 SV=2 1 170 3.0E-24
sp|O94913|PCF11_HUMAN Pre-mRNA cleavage complex 2 protein Pcf11 OS=Homo sapiens GN=PCF11 PE=1 SV=3 525 689 2.0E-19
sp|O94913|PCF11_HUMAN Pre-mRNA cleavage complex 2 protein Pcf11 OS=Homo sapiens GN=PCF11 PE=1 SV=3 7 117 2.0E-18
sp|Q0WPF2|PCFS4_ARATH Polyadenylation and cleavage factor homolog 4 OS=Arabidopsis thaliana GN=PCFS4 PE=1 SV=1 6 117 3.0E-15
sp|Q0WPF2|PCFS4_ARATH Polyadenylation and cleavage factor homolog 4 OS=Arabidopsis thaliana GN=PCFS4 PE=1 SV=1 525 678 3.0E-14
sp|Q09345|PCF11_CAEEL Polyadenylation and cleavage factor homolog 11 OS=Caenorhabditis elegans GN=pcf-11 PE=3 SV=1 5 112 3.0E-13
sp|Q9C710|PCFS1_ARATH Polyadenylation and cleavage factor homolog 1 OS=Arabidopsis thaliana GN=PCFS1 PE=1 SV=1 493 689 2.0E-12
sp|Q9FIX8|PCFS5_ARATH Polyadenylation and cleavage factor homolog 5 OS=Arabidopsis thaliana GN=PCFS5 PE=1 SV=1 493 689 2.0E-11
sp|Q09345|PCF11_CAEEL Polyadenylation and cleavage factor homolog 11 OS=Caenorhabditis elegans GN=pcf-11 PE=3 SV=1 529 690 4.0E-09
[Show less]

GO

GO Term Description Terminal node
GO:0006379 mRNA cleavage Yes
GO:0005849 mRNA cleavage factor complex Yes
GO:0006378 mRNA polyadenylation Yes
GO:0006369 termination of RNA polymerase II transcription Yes
GO:0006807 nitrogen compound metabolic process No
GO:0006397 mRNA processing No
GO:1901360 organic cyclic compound metabolic process No
GO:0034641 cellular nitrogen compound metabolic process No
GO:0043631 RNA polyadenylation No
GO:0044237 cellular metabolic process No
GO:0043170 macromolecule metabolic process No
GO:0016070 RNA metabolic process No
GO:0032774 RNA biosynthetic process No
GO:0071704 organic substance metabolic process No
GO:0044428 nuclear part No
GO:0005575 cellular_component No
GO:0044238 primary metabolic process No
GO:0031124 mRNA 3'-end processing No
GO:0008152 metabolic process No
GO:0008150 biological_process No
GO:0046483 heterocycle metabolic process No
GO:0090305 nucleic acid phosphodiester bond hydrolysis No
GO:0044422 organelle part No
GO:0044446 intracellular organelle part No
GO:0009059 macromolecule biosynthetic process No
GO:0019438 aromatic compound biosynthetic process No
GO:0009987 cellular process No
GO:0031123 RNA 3'-end processing No
GO:0016071 mRNA metabolic process No
GO:0044451 nucleoplasm part No
GO:0006396 RNA processing No
GO:0018130 heterocycle biosynthetic process No
GO:0032991 protein-containing complex No
GO:1901362 organic cyclic compound biosynthetic process No
GO:0044424 intracellular part No
GO:0006139 nucleobase-containing compound metabolic process No
GO:0044249 cellular biosynthetic process No
GO:0090501 RNA phosphodiester bond hydrolysis No
GO:0044271 cellular nitrogen compound biosynthetic process No
GO:0090304 nucleic acid metabolic process No
GO:0006353 DNA-templated transcription, termination No
GO:0006725 cellular aromatic compound metabolic process No
GO:0009058 biosynthetic process No
GO:0034654 nucleobase-containing compound biosynthetic process No
GO:0044464 cell part No
GO:1901576 organic substance biosynthetic process No

SignalP

[Help with interpreting these statistics]
SignalP signal predicted Location
(based on Ymax)
D score
(significance: > 0.45)
No 1 - 65 0.45

Transmembrane Domains

(None)

Transcription Factor Class

(None)

Expression data

No expression data available for this genome

Sequences

Type of sequenceSequence
Locus Download genbank file of locus
The gene with 5 kb flanks (if sufficient flanking sequence is available). For use in cloning design programs. NOTE: features (genes or exons) that are only partially contained within the sequence are completely excluded.
Protein >Hirsu2|5133
MASDAEEVAEDYRHALEDLSSNMRFEISNLTVIARENTEHALAIAEVLQQHILKAPPNKKLPALYVLDSIVKNVG
TPYTLYFGRNLFKTFMEAYAVVDHNIRRKMEEMLRTWKDPVPGSMDTRPVFSHELVRPIENALMKARAASMPQQG
PITGRPRSAMLPHRNTPTPPGMRAPSASTGGFPGQPPYQQQPNGGRPAEPAAGSFPYPGQQQQASRPSAENGSSA
GYQPGHLSQYPHHAPLQPGIAAPAHIQPALSGPYGSIGLAAPAGISVETLSNDIQNLIVAMKAEFSQNPHDASVQ
NRLRALLDLQGVVQRTSLPPDQLELIKNKVTELAAVTMRPSSAQNSAQMPALVPPPALAAPPHASSSVTPSPASA
TASTPTPTPTAIPNSTPTPASSGQASVTLDSLLGPGALAALMARQSTTAASQGSTPNAPYSHAIRSPSTAAAERA
KPPAQNAISLLDQLRQAGMLPPAARGSAASPAAAGLPPPPSAPLMLPPNLASLLSTSAVAAAAAAAAASKSQMAA
PYHTGFDAASLKQQFRPEAVAALYSELGPPCTQCGRRFRTDEEGKRKKTTHMDWHFRVHQRSTEAEKRGRHRSWY
VDQQDWLRSVEVVDSDHVAAPEESSKEASQAEKEPSYMPVPDASSGINNVCPICQDKFENKWLDTAQEWVWLDAV
LVNNRAYHASCHAEATMDREGTPVTSRRTPEPILGKRKAETSMASPKIRVMKTWA*
Coding >Hirsu2|5133
ATGGCGTCGGACGCGGAAGAGGTGGCCGAGGATTACCGGCACGCTCTGGAGGACCTGTCGTCCAACATGCGCTTC
GAAATCAGCAACCTGACCGTCATCGCTCGAGAGAACACGGAACACGCCTTGGCCATCGCCGAGGTGCTTCAGCAG
CACATCCTCAAGGCTCCTCCCAACAAAAAGTTACCGGCCCTCTACGTCCTGGACTCGATTGTGAAGAATGTCGGA
ACGCCCTACACGCTGTATTTCGGTCGCAACCTCTTCAAAACGTTCATGGAGGCCTATGCCGTCGTCGACCACAAC
ATCCGCCGCAAGATGGAAGAAATGCTCAGGACCTGGAAGGACCCGGTCCCTGGATCCATGGACACCCGCCCCGTC
TTCTCCCACGAACTGGTGCGGCCGATCGAGAACGCGCTGATGAAGGCGAGAGCGGCCTCGATGCCCCAGCAAGGC
CCCATTACCGGGCGGCCGCGGTCCGCCATGCTTCCACACCGAAACACGCCGACGCCTCCCGGAATGCGCGCCCCG
TCGGCATCGACGGGAGGCTTCCCCGGACAGCCGCCGTACCAGCAGCAGCCCAACGGCGGACGACCGGCCGAGCCG
GCGGCCGGCAGCTTTCCCTACCCCGGCCAGCAGCAGCAAGCGAGCCGGCCGAGCGCCGAGAACGGCTCGTCTGCA
GGATACCAACCAGGTCATCTCTCGCAATACCCTCACCACGCACCCCTGCAGCCCGGCATTGCCGCCCCGGCACAC
ATTCAGCCCGCCCTCTCGGGACCATACGGGTCGATAGGGCTGGCAGCCCCGGCGGGGATAAGCGTAGAAACGTTG
AGCAATGACATTCAGAACCTGATTGTTGCGATGAAGGCCGAGTTCTCGCAAAACCCACATGATGCTAGTGTTCAG
AACAGATTAAGAGCACTGCTGGATCTGCAGGGTGTTGTGCAACGCACGAGTCTCCCGCCCGACCAGCTCGAGCTC
ATTAAGAACAAAGTAACTGAACTTGCTGCCGTGACAATGAGGCCCTCGTCGGCCCAGAACTCTGCTCAGATGCCA
GCACTCGTACCGCCACCGGCACTGGCGGCGCCTCCGCACGCTTCTTCTTCCGTAACGCCCAGCCCGGCCTCGGCT
ACTGCTTCGACTCCGACTCCGACTCCGACTGCGATTCCGAACTCGACCCCGACTCCCGCGTCTTCGGGCCAGGCG
TCGGTCACTCTGGACTCGCTGCTGGGCCCGGGGGCCCTGGCGGCACTGATGGCGAGACAGTCGACGACGGCGGCC
TCTCAGGGCTCGACTCCTAACGCCCCATACTCTCATGCTATTCGATCGCCGTCGACGGCCGCCGCCGAGCGAGCC
AAGCCGCCTGCTCAGAACGCCATCAGTCTCCTGGACCAGCTTCGCCAGGCGGGGATGCTCCCGCCGGCGGCCCGG
GGCAGCGCCGCCTCTCCCGCCGCAGCAGGCCTGCCCCCTCCCCCGTCCGCCCCGCTGATGCTTCCTCCCAATCTT
GCCAGTCTGCTGTCAACGTCTGCGGTGGCGGCGGCGGCGGCAGCAGCAGCAGCTTCGAAGTCGCAAATGGCGGCT
CCCTACCACACCGGCTTCGACGCGGCTTCGTTAAAACAACAGTTCCGTCCCGAGGCTGTTGCCGCCCTCTACAGT
GAGCTGGGCCCGCCCTGCACGCAGTGCGGCCGGCGCTTCAGGACAGACGAGGAGGGCAAGCGGAAGAAGACCACG
CACATGGACTGGCACTTCCGCGTCCACCAGCGCAGCACGGAGGCGGAGAAGAGAGGAAGACACCGCAGCTGGTAC
GTGGATCAACAGGACTGGCTGCGGTCGGTCGAGGTGGTCGACTCGGACCATGTAGCCGCGCCCGAGGAGTCGTCC
AAGGAGGCGTCGCAGGCGGAGAAGGAGCCGAGTTACATGCCAGTCCCCGACGCCAGCAGCGGCATCAACAACGTC
TGCCCCATCTGTCAGGACAAATTCGAGAACAAGTGGCTGGACACGGCGCAGGAGTGGGTGTGGCTGGACGCCGTC
CTGGTCAACAACCGGGCCTACCACGCCTCGTGCCACGCCGAGGCGACTATGGACCGCGAGGGCACGCCGGTCACG
TCTAGGCGCACGCCGGAGCCCATCCTCGGCAAGAGGAAGGCGGAGACGAGCATGGCGTCGCCCAAGATCCGCGTC
ATGAAGACGTGGGCTTGA
Transcript >Hirsu2|5133
ATGGCGTCGGACGCGGAAGAGGTGGCCGAGGATTACCGGCACGCTCTGGAGGACCTGTCGTCCAACATGCGCTTC
GAAATCAGCAACCTGACCGTCATCGCTCGAGAGAACACGGAACACGCCTTGGCCATCGCCGAGGTGCTTCAGCAG
CACATCCTCAAGGCTCCTCCCAACAAAAAGTTACCGGCCCTCTACGTCCTGGACTCGATTGTGAAGAATGTCGGA
ACGCCCTACACGCTGTATTTCGGTCGCAACCTCTTCAAAACGTTCATGGAGGCCTATGCCGTCGTCGACCACAAC
ATCCGCCGCAAGATGGAAGAAATGCTCAGGACCTGGAAGGACCCGGTCCCTGGATCCATGGACACCCGCCCCGTC
TTCTCCCACGAACTGGTGCGGCCGATCGAGAACGCGCTGATGAAGGCGAGAGCGGCCTCGATGCCCCAGCAAGGC
CCCATTACCGGGCGGCCGCGGTCCGCCATGCTTCCACACCGAAACACGCCGACGCCTCCCGGAATGCGCGCCCCG
TCGGCATCGACGGGAGGCTTCCCCGGACAGCCGCCGTACCAGCAGCAGCCCAACGGCGGACGACCGGCCGAGCCG
GCGGCCGGCAGCTTTCCCTACCCCGGCCAGCAGCAGCAAGCGAGCCGGCCGAGCGCCGAGAACGGCTCGTCTGCA
GGATACCAACCAGGTCATCTCTCGCAATACCCTCACCACGCACCCCTGCAGCCCGGCATTGCCGCCCCGGCACAC
ATTCAGCCCGCCCTCTCGGGACCATACGGGTCGATAGGGCTGGCAGCCCCGGCGGGGATAAGCGTAGAAACGTTG
AGCAATGACATTCAGAACCTGATTGTTGCGATGAAGGCCGAGTTCTCGCAAAACCCACATGATGCTAGTGTTCAG
AACAGATTAAGAGCACTGCTGGATCTGCAGGGTGTTGTGCAACGCACGAGTCTCCCGCCCGACCAGCTCGAGCTC
ATTAAGAACAAAGTAACTGAACTTGCTGCCGTGACAATGAGGCCCTCGTCGGCCCAGAACTCTGCTCAGATGCCA
GCACTCGTACCGCCACCGGCACTGGCGGCGCCTCCGCACGCTTCTTCTTCCGTAACGCCCAGCCCGGCCTCGGCT
ACTGCTTCGACTCCGACTCCGACTCCGACTGCGATTCCGAACTCGACCCCGACTCCCGCGTCTTCGGGCCAGGCG
TCGGTCACTCTGGACTCGCTGCTGGGCCCGGGGGCCCTGGCGGCACTGATGGCGAGACAGTCGACGACGGCGGCC
TCTCAGGGCTCGACTCCTAACGCCCCATACTCTCATGCTATTCGATCGCCGTCGACGGCCGCCGCCGAGCGAGCC
AAGCCGCCTGCTCAGAACGCCATCAGTCTCCTGGACCAGCTTCGCCAGGCGGGGATGCTCCCGCCGGCGGCCCGG
GGCAGCGCCGCCTCTCCCGCCGCAGCAGGCCTGCCCCCTCCCCCGTCCGCCCCGCTGATGCTTCCTCCCAATCTT
GCCAGTCTGCTGTCAACGTCTGCGGTGGCGGCGGCGGCGGCAGCAGCAGCAGCTTCGAAGTCGCAAATGGCGGCT
CCCTACCACACCGGCTTCGACGCGGCTTCGTTAAAACAACAGTTCCGTCCCGAGGCTGTTGCCGCCCTCTACAGT
GAGCTGGGCCCGCCCTGCACGCAGTGCGGCCGGCGCTTCAGGACAGACGAGGAGGGCAAGCGGAAGAAGACCACG
CACATGGACTGGCACTTCCGCGTCCACCAGCGCAGCACGGAGGCGGAGAAGAGAGGAAGACACCGCAGCTGGTAC
GTGGATCAACAGGACTGGCTGCGGTCGGTCGAGGTGGTCGACTCGGACCATGTAGCCGCGCCCGAGGAGTCGTCC
AAGGAGGCGTCGCAGGCGGAGAAGGAGCCGAGTTACATGCCAGTCCCCGACGCCAGCAGCGGCATCAACAACGTC
TGCCCCATCTGTCAGGACAAATTCGAGAACAAGTGGCTGGACACGGCGCAGGAGTGGGTGTGGCTGGACGCCGTC
CTGGTCAACAACCGGGCCTACCACGCCTCGTGCCACGCCGAGGCGACTATGGACCGCGAGGGCACGCCGGTCACG
TCTAGGCGCACGCCGGAGCCCATCCTCGGCAAGAGGAAGGCGGAGACGAGCATGGCGTCGCCCAAGATCCGCGTC
ATGAAGACGTGGGCTTGA
Gene >Hirsu2|5133
ATGGCGTCGGACGCGGAAGAGGTGGCCGAGGATTACCGGCACGCTCTGGAGGACCTGTCGTCCAACATGCGCTTC
GAAATCAGCAACCTGACCGTCATCGCTCGAGAGAACACGGAACACGCCTTGGCCATCGCCGAGGTGCTTCAGCAG
CACATCCTCAAGGTAAGGCCCAAGACCAAAACTTCTTTTTCAGAGCGGACAGCTAACAAAGGCATCAGGCTCCTC
CCAACAAAAAGTTACCGGCCCTCTACGTCCTGGACTCGATTGTGAAGAATGTCGGAACGCCCTACACGCTGTATT
TCGGTCGCAACCTCTTCAAAACGTTCATGGAGGCCTATGCCGTCGTCGACCACAACATCCGCCGCAAGATGGAAG
AAATGCTCAGGACCTGGAAGGACCCGGTCCCTGGATCCATGGACACCCGCCCCGTCTTCTCCCACGAACTGGTGC
GGCCGATCGAGAACGCGCTGATGAAGGCGAGAGCGGCCTCGATGCCCCAGCAAGGCCCCATTACCGGGCGGCCGC
GGTCCGCCATGCTTCCACACCGAAACACGCCGACGCCTCCCGGAATGCGCGCCCCGTCGGCATCGACGGGAGGCT
TCCCCGGACAGCCGCCGTACCAGCAGCAGCCCAACGGCGGACGACCGGCCGAGCCGGCGGCCGGCAGCTTTCCCT
ACCCCGGCCAGCAGCAGCAAGCGAGCCGGCCGAGCGCCGAGAACGGCTCGTCTGCAGGATACCAACCAGGTCATC
TCTCGCAATACCCTCACCACGCACCCCTGCAGCCCGGCATTGCCGCCCCGGCACACATTCAGCCCGCCCTCTCGG
GACCATACGGGTCGATAGGGCTGGCAGCCCCGGCGGGGATAAGCGTAGAAACGTTGAGCAATGACATTCAGAACC
TGATTGTTGCGATGAAGGCCGAGTTCTCGCAAAACCCACATGATGCTAGTGTTCAGAACAGATTAAGAGCACTGC
TGGATCTGCAGGGTGTTGTGCAACGCACGAGTCTCCCGCCCGACCAGCTCGAGCTCATTAAGAACAAAGTAACTG
AACTTGCTGCCGTGACAATGAGGCCCTCGTCGGCCCAGAACTCTGCTCAGATGCCAGCACTCGTACCGCCACCGG
CACTGGCGGCGCCTCCGCACGCTTCTTCTTCCGTAACGCCCAGCCCGGCCTCGGCTACTGCTTCGACTCCGACTC
CGACTCCGACTGCGATTCCGAACTCGACCCCGACTCCCGCGTCTTCGGGCCAGGCGTCGGTCACTCTGGACTCGC
TGCTGGGCCCGGGGGCCCTGGCGGCACTGATGGCGAGACAGTCGACGACGGCGGCCTCTCAGGGCTCGACTCCTA
ACGCCCCATACTCTCATGCTATTCGATCGCCGTCGACGGCCGCCGCCGAGCGAGCCAAGCCGCCTGCTCAGAACG
CCATCAGTCTCCTGGACCAGCTTCGCCAGGCGGGGATGCTCCCGCCGGCGGCCCGGGGCAGCGCCGCCTCTCCCG
CCGCAGCAGGCCTGCCCCCTCCCCCGTCCGCCCCGCTGATGCTTCCTCCCAATCTTGCCAGTCTGCTGTCAACGT
CTGCGGTGGCGGCGGCGGCGGCAGCAGCAGCAGCTTCGAAGTCGCAAATGGCGGCTCCCTACCACACCGGCTTCG
ACGCGGCTTCGTTAAAACAACAGTACGTGAGCCCGGACCACACTAGGCTGGAAGACTACCCTTTTCTAATGCCAC
CCGACAGGTTCCGTCCCGAGGCTGTTGCCGCCCTCTACAGTGAGCTGGGCCCGCCCTGCACGCAGTGCGGCCGGC
GCTTCAGGACAGACGAGGAGGGCAAGCGGAAGAAGACCACGCACATGGACTGGCACTTCCGCGTCCACCAGCGCA
GCACGGAGGCGGAGAAGAGAGGAAGACACCGCAGCTGGTACGTGGATCAACAGGTCAGTGTTCCCCCTCCCCCCC
CCTGCCGTCCTTCGCTCGCCATCCAGGCTGACGGCATACAGGACTGGCTGCGGTCGGTCGAGGTGGTCGACTCGG
ACCATGTAGCCGCGCCCGAGGAGTCGTCCAAGGAGGCGTCGCAGGCGGAGAAGGAGCCGAGTTACATGCCAGTCC
CCGACGCCAGCAGCGGCATCAACAACGTCTGCCCCATCTGTCAGGACAAATTCGAGAACAAGTGGCTGGACACGG
CGCAGGAGTGGGTGTGGCTGGACGCCGTCCTGGTCAACAACCGGGCCTACCACGCCTCGTGCCACGCCGAGGCGA
CTATGGACCGCGAGGGCACGCCGGTCACGTCTAGGCGCACGCCGGAGCCCATCCTCGGCAAGAGGAAGGCGGAGA
CGAGCATGGCGTCGCCCAAGATCCGCGTCATGAAGACGTGGGCTTGA

© 2022 - Robin Ohm - Utrecht University - The Netherlands

Built with Python Django and Wagtail