Protein ID | Agabi119p4|068330 |
Gene name | |
Location | scaffold_04:918464..919884 |
Strand | + |
Gene length (bp) | 1420 |
Transcript length (bp) | 1371 |
Coding sequence length (bp) | 1371 |
Protein length (aa) | 457 |
PFAM Domain ID | Short name | Long name | E-value | Start | End |
---|---|---|---|---|---|
PF00320 | GATA | GATA zinc finger | 5.2E-08 | 232 | 265 |
Swissprot ID | Swissprot Description | Start | End | E-value |
---|---|---|---|---|
sp|Q9UHF7|TRPS1_HUMAN | Zinc finger transcription factor Trps1 OS=Homo sapiens GN=TRPS1 PE=1 SV=2 | 195 | 305 | 3.0E-07 |
sp|Q5AP95|SFU1_CANAL | Suppressor of ferric uptake 1 OS=Candida albicans (strain SC5314 / ATCC MYA-2876) GN=SFU1 PE=1 SV=1 | 206 | 307 | 4.0E-07 |
sp|Q925H1|TRPS1_MOUSE | Zinc finger transcription factor Trps1 OS=Mus musculus GN=Trps1 PE=1 SV=1 | 224 | 313 | 8.0E-07 |
sp|P18494|GLN3_YEAST | Nitrogen regulatory protein GLN3 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=GLN3 PE=1 SV=2 | 231 | 286 | 1.0E-06 |
sp|P52172|SRP_DROME | Box A-binding factor OS=Drosophila melanogaster GN=srp PE=1 SV=2 | 215 | 284 | 2.0E-06 |
Swissprot ID | Swissprot Description | Start | End | E-value |
---|---|---|---|---|
sp|Q9UHF7|TRPS1_HUMAN | Zinc finger transcription factor Trps1 OS=Homo sapiens GN=TRPS1 PE=1 SV=2 | 195 | 305 | 3.0E-07 |
sp|Q5AP95|SFU1_CANAL | Suppressor of ferric uptake 1 OS=Candida albicans (strain SC5314 / ATCC MYA-2876) GN=SFU1 PE=1 SV=1 | 206 | 307 | 4.0E-07 |
sp|Q925H1|TRPS1_MOUSE | Zinc finger transcription factor Trps1 OS=Mus musculus GN=Trps1 PE=1 SV=1 | 224 | 313 | 8.0E-07 |
sp|P18494|GLN3_YEAST | Nitrogen regulatory protein GLN3 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=GLN3 PE=1 SV=2 | 231 | 286 | 1.0E-06 |
sp|P52172|SRP_DROME | Box A-binding factor OS=Drosophila melanogaster GN=srp PE=1 SV=2 | 215 | 284 | 2.0E-06 |
sp|Q90ZS6|TRPS1_XENLA | Zinc finger transcription factor Trps1 OS=Xenopus laevis GN=trps1 PE=1 SV=1 | 218 | 294 | 2.0E-06 |
sp|P70005|GAT6B_XENLA | GATA-binding factor 6-B OS=Xenopus laevis GN=gata6-b PE=2 SV=1 | 232 | 284 | 2.0E-06 |
sp|Q9HEV5|ASD4_NEUCR | GATA type zinc finger protein asd-4 OS=Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) GN=asd-4 PE=1 SV=1 | 232 | 273 | 4.0E-06 |
sp|Q5A201|GZF3_CANAL | Transcriptional regulator GZF3 OS=Candida albicans (strain SC5314 / ATCC MYA-2876) GN=GZF3 PE=2 SV=1 | 232 | 284 | 4.0E-06 |
sp|Q91678|GAT6A_XENLA | GATA-binding factor 6-A OS=Xenopus laevis GN=gata6-a PE=2 SV=1 | 232 | 284 | 7.0E-06 |
sp|Q91677|GATA4_XENLA | Transcription factor GATA-4 OS=Xenopus laevis GN=gata4 PE=2 SV=1 | 232 | 284 | 7.0E-06 |
GO Term | Description | Terminal node |
---|---|---|
GO:0006355 | regulation of transcription, DNA-templated | Yes |
GO:0043565 | sequence-specific DNA binding | Yes |
GO:0097159 | organic cyclic compound binding | No |
GO:0003674 | molecular_function | No |
GO:0031323 | regulation of cellular metabolic process | No |
GO:0003676 | nucleic acid binding | No |
GO:0060255 | regulation of macromolecule metabolic process | No |
GO:0031326 | regulation of cellular biosynthetic process | No |
GO:0019219 | regulation of nucleobase-containing compound metabolic process | No |
GO:0019222 | regulation of metabolic process | No |
GO:0005488 | binding | No |
GO:0009889 | regulation of biosynthetic process | No |
GO:0065007 | biological regulation | No |
GO:1903506 | regulation of nucleic acid-templated transcription | No |
GO:2001141 | regulation of RNA biosynthetic process | No |
GO:0051252 | regulation of RNA metabolic process | No |
GO:0050794 | regulation of cellular process | No |
GO:1901363 | heterocyclic compound binding | No |
GO:0050789 | regulation of biological process | No |
GO:0051171 | regulation of nitrogen compound metabolic process | No |
GO:0003677 | DNA binding | No |
GO:0010556 | regulation of macromolecule biosynthetic process | No |
GO:0008150 | biological_process | No |
GO:0010468 | regulation of gene expression | No |
GO:0080090 | regulation of primary metabolic process | No |
Transcription Factor Class (based on PFAM domains) |
---|
GATA type zinc finger |
Orthofinder run ID | 5 |
Orthogroup | 4744 |
Change Orthofinder run |
Species | Protein ID |
---|---|
Agaricus bisporus var bisporus H39 | AgabiH39|068330 |
Agaricus bisporus var bisporus H97 | AgabiH97|068330 |
Agaricus bisporus var burnettii H119p4 | Agabi119p4|068330 (this protein) |
Type of sequence | Sequence |
---|---|
Locus | Download genbank file of locus
Download genbank file of locus (reverse complement)
The gene with 5 kb flanks (if sufficient flanking sequence is available). For use in cloning design programs. NOTE: features (genes or exons) that are only partially contained within the sequence are completely excluded. |
Protein | >Agabi119p4|068330 MDSATFDPYASYSDSSAPHTPEPLPTDMHYCKTNVDDAVRNIFTHPDDSHPDGQYWSHPTFFNSQRGSLLQELYD EQQPPASDVYPDHFVTHPVHQQQLSSRPHDYQMMRRNTFPTVRYDRDDGLPAQQYPPFIQQSHHYQRNGPLYSEQ LNLTAEPAPIPSETYLPAAYDDASNIKLEDPGTLMVPSHSFYRPQSSGGLMGVPFVPPHSGLHVQHTDDAASKET QYLRRRCFNCHTTEPPSWRRSTLNPGKIVCNKCGLYERTHLRPRPLRFDELRAGHKPRKQSKGTASPKAKLSPIV KKEPREPGLTRRSSVSSSSGSVHSGSGASDWDDNVSIYSGSNPPTSFNSPNVQTFPLSRDSHSPPHDGGIRLPNA PLSDIASLQQSHQPSTPSLAPSTPHSGHSSPGYYSPPATSPNAGVQSPEYYHHDAVTTVSGPWTEAPSGILSSPI PTPVAS* |
Coding | >Agabi119p4|068330 ATGGACTCTGCCACCTTCGACCCCTACGCCTCCTACTCCGACAGCTCCGCTCCCCACACCCCAGAGCCTCTCCCC ACAGACATGCACTACTGCAAGACAAACGTCGACGACGCCGTCCGCAACATCTTCACCCACCCCGACGACTCCCAT CCAGACGGCCAGTATTGGTCCCACCCCACCTTTTTCAATTCGCAGCGTGGTTCCCTTTTGCAGGAACTCTATGAC GAGCAACAGCCTCCTGCTTCGGACGTCTATCCCGATCACTTTGTGACTCATCCTGTCCACCAACAGCAGTTATCG TCGCGTCCGCACGACTACCAGATGATGCGACGCAATACCTTTCCCACTGTTCGCTACGACCGCGACGATGGCCTG CCTGCGCAACAATATCCCCCTTTCATCCAGCAATCCCATCACTATCAGCGCAACGGCCCCCTTTATTCCGAGCAA CTCAACTTGACTGCTGAGCCTGCCCCAATCCCATCCGAAACTTACCTCCCCGCCGCTTACGACGATGCCTCTAAT ATCAAGTTGGAGGATCCTGGCACGTTAATGGTTCCCTCCCATTCCTTCTACCGTCCCCAATCATCTGGTGGCCTC ATGGGTGTCCCTTTCGTGCCTCCCCACAGTGGACTACATGTCCAGCATACTGATGATGCCGCTTCGAAAGAGACT CAATACCTTCGTCGCCGCTGCTTCAACTGCCATACCACAGAGCCCCCCTCGTGGCGTCGCTCTACTCTTAATCCC GGAAAAATCGTCTGCAACAAATGCGGGCTCTATGAGCGCACTCACCTGAGACCGCGCCCTCTTCGCTTTGACGAG CTCCGCGCTGGCCACAAGCCCCGAAAACAATCTAAAGGAACCGCCAGCCCCAAAGCGAAGCTGAGTCCCATCGTG AAGAAAGAGCCTCGTGAACCCGGCCTCACGCGGCGCTCCTCTGTCTCATCCTCTTCTGGCTCTGTCCACTCAGGG AGTGGCGCCAGCGACTGGGATGACAATGTCTCCATCTATTCAGGCTCCAACCCTCCCACCTCCTTCAACTCTCCT AATGTCCAGACTTTCCCTCTTTCTCGCGATTCCCACTCTCCCCCGCATGACGGAGGTATTCGCTTGCCGAATGCC CCATTGTCAGACATTGCCTCCCTCCAACAATCCCACCAACCATCGACGCCCTCTCTTGCTCCCTCAACCCCTCAT TCTGGCCATTCCTCGCCAGGTTATTACTCCCCTCCAGCGACTAGTCCCAACGCTGGTGTCCAGTCGCCGGAATAT TACCATCATGATGCCGTGACAACAGTCTCGGGACCTTGGACAGAAGCCCCTAGTGGAATCCTGAGTAGCCCCATT CCTACTCCTGTAGCGTCGTAG |
Transcript | >Agabi119p4|068330 ATGGACTCTGCCACCTTCGACCCCTACGCCTCCTACTCCGACAGCTCCGCTCCCCACACCCCAGAGCCTCTCCCC ACAGACATGCACTACTGCAAGACAAACGTCGACGACGCCGTCCGCAACATCTTCACCCACCCCGACGACTCCCAT CCAGACGGCCAGTATTGGTCCCACCCCACCTTTTTCAATTCGCAGCGTGGTTCCCTTTTGCAGGAACTCTATGAC GAGCAACAGCCTCCTGCTTCGGACGTCTATCCCGATCACTTTGTGACTCATCCTGTCCACCAACAGCAGTTATCG TCGCGTCCGCACGACTACCAGATGATGCGACGCAATACCTTTCCCACTGTTCGCTACGACCGCGACGATGGCCTG CCTGCGCAACAATATCCCCCTTTCATCCAGCAATCCCATCACTATCAGCGCAACGGCCCCCTTTATTCCGAGCAA CTCAACTTGACTGCTGAGCCTGCCCCAATCCCATCCGAAACTTACCTCCCCGCCGCTTACGACGATGCCTCTAAT ATCAAGTTGGAGGATCCTGGCACGTTAATGGTTCCCTCCCATTCCTTCTACCGTCCCCAATCATCTGGTGGCCTC ATGGGTGTCCCTTTCGTGCCTCCCCACAGTGGACTACATGTCCAGCATACTGATGATGCCGCTTCGAAAGAGACT CAATACCTTCGTCGCCGCTGCTTCAACTGCCATACCACAGAGCCCCCCTCGTGGCGTCGCTCTACTCTTAATCCC GGAAAAATCGTCTGCAACAAATGCGGGCTCTATGAGCGCACTCACCTGAGACCGCGCCCTCTTCGCTTTGACGAG CTCCGCGCTGGCCACAAGCCCCGAAAACAATCTAAAGGAACCGCCAGCCCCAAAGCGAAGCTGAGTCCCATCGTG AAGAAAGAGCCTCGTGAACCCGGCCTCACGCGGCGCTCCTCTGTCTCATCCTCTTCTGGCTCTGTCCACTCAGGG AGTGGCGCCAGCGACTGGGATGACAATGTCTCCATCTATTCAGGCTCCAACCCTCCCACCTCCTTCAACTCTCCT AATGTCCAGACTTTCCCTCTTTCTCGCGATTCCCACTCTCCCCCGCATGACGGAGGTATTCGCTTGCCGAATGCC CCATTGTCAGACATTGCCTCCCTCCAACAATCCCACCAACCATCGACGCCCTCTCTTGCTCCCTCAACCCCTCAT TCTGGCCATTCCTCGCCAGGTTATTACTCCCCTCCAGCGACTAGTCCCAACGCTGGTGTCCAGTCGCCGGAATAT TACCATCATGATGCCGTGACAACAGTCTCGGGACCTTGGACAGAAGCCCCTAGTGGAATCCTGAGTAGCCCCATT CCTACTCCTGTAGCGTCGTAG |
Gene | >Agabi119p4|068330 ATGGACTCTGCCACCTTCGACCCCTACGCCTCCTACTCCGACAGCTCCGCTCCCCACACCCCAGAGCCTCTCCCC ACAGACATGCACTACTGCAAGACAAACGTCGACGACGCCGTCCGCAACATCTTCACCCACCCCGACGACTCCCAT CCAGACGGCCAGTATTGGTCCCACCCCACCTTTTTCAATTCGCAGCGTGGTTCCCTTTTGCAGGAACTCTATGAC GAGCAACAGCCTCCTGCTTCGGACGTCTATCCCGATCACTTTGTGACTCATCCTGTCCACCAACAGCAGTTATCG TCGCGTCCGCACGACTACCAGATGATGCGACGCAATACCTTTCCCACTGTTCGCTACGACCGCGACGATGGCCTG CCTGCGCAACAATATCCCCCTTTCATCCAGCAATCCCATCACTATCAGCGCAACGGCCCCCTTTATTCCGAGCAA CTCAACTTGACTGCTGAGCCTGCCCCAATCCCATCCGAAACTTACCTCCCCGCCGCTTACGACGATGCCTCTAAT ATCAAGTTGGAGGATCCTGGCACGTTAATGGTTCCCTCCCATTCCTTCTACCGTCCCCAATCATCTGGTGGCCTC ATGGGTGTCCCTTTCGTGCCTCCCCACAGTGGACTACATGTCCAGCATACTGATGATGCCGCTTCGAAAGAGACT CAATACCTTCGTCGCCGCTGCTTCAACTGCCATACCACAGAGCCCCCCTCGTGGCGTCGCTCTACTCTTAATCCC GGAAAAATCGTCTGCAACAAATGCGGGCTCTATGAGCGCACTCACCTGAGACCGCGCCCTCTTCGCTTTGACGAG CTCCGCGCTGGCCACAAGCCCCGAAAACAATCTAAAGGAACCGCCAGCCCCAAAGCGAAGCTGAGTCCCATCGTG AAGAAAGAGCCTCGTGAACCCGGCCTCACGCGGCGCTCCTCTGTCTCATCCTCTTCTGGCTCTGTCCACTCAGGG AGTGGCGCCAGCGACTGGGATGACAATGGTGAGTTTACTTTGGCAAATACTGTTTACTCTCATATTGATAAAAGT AGTCTCCATCTATTCAGGCTCCAACCCTCCCACCTCCTTCAACTCTCCTAATGTCCAGACTTTCCCTCTTTCTCG CGATTCCCACTCTCCCCCGCATGACGGAGGTATTCGCTTGCCGAATGCCCCATTGTCAGACATTGCCTCCCTCCA ACAATCCCACCAACCATCGACGCCCTCTCTTGCTCCCTCAACCCCTCATTCTGGCCATTCCTCGCCAGGTTATTA CTCCCCTCCAGCGACTAGTCCCAACGCTGGTGTCCAGTCGCCGGAATATTACCATCATGATGCCGTGACAACAGT CTCGGGACCTTGGACAGAAGCCCCTAGTGGAATCCTGAGTAGCCCCATTCCTACTCCTGTAGCGTCGTAG |