The sequences and functional annotations for this organism are available for download.
These links are automatically updated as new version of the functional annotations become
Genome and genes
- Genome assembly
The genome sequence in fasta format.
- Gene features
Locations of the predicted genes (including features like exons, coding sequences, etc) in GFF3 format.
- Genbank file
Genome sequence and predicted genes in GenBank format. (NOTE: experimental)
Protein (amino acid) sequences of the predicted genes in fasta format.
- Coding sequences
Coding sequences (DNA) of the predicted genes in fasta format. The coding sequence spans from start codon to stop codon, excluding any introns.
Transcript sequences (DNA) of the predicted genes in fasta format. The transcript sequences are the coding sequences with UTR (untranslated regions), if those are present.
Genes sequences (DNA) of the predicted genes in fasta format. The gene sequences are the transcript sequences with introns, if those are present.
Promoter sequences. The 2.5 kb region upstream of the start codon. If the gene is close to the edge of the sequence, then the promoter may be shorter.
Download part of the genome
Please cite this paper when using data from this genome:
© 2020 - Robin Ohm - Utrecht University - The Netherlands
Built with Python Django and Wagtail