Protein ID | Ani_SJS100_1|g10214.t1 |
Gene name | |
Location | scaffold_109:15609..17936 |
Strand | - |
Gene length (bp) | 2327 |
Transcript length (bp) | 2166 |
Coding sequence length (bp) | 2166 |
Protein length (aa) | 722 |
PFAM Domain ID | Short name | Long name | E-value | Start | End |
---|---|---|---|---|---|
PF16413 | Mlh1_C | DNA mismatch repair protein Mlh1 C-terminus | 7.3E-102 | 454 | 721 |
PF01119 | DNA_mis_repair | DNA mismatch repair protein, C-terminal domain | 7.6E-34 | 237 | 355 |
PF13589 | HATPase_c_3 | Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase | 6.0E-14 | 44 | 141 |
PF02518 | HATPase_c | Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase | 6.6E-08 | 43 | 111 |
GO Term | Description | Terminal node |
---|---|---|
GO:0006298 | mismatch repair | Yes |
GO:0030983 | mismatched DNA binding | Yes |
GO:0005524 | ATP binding | Yes |
GO:0033554 | cellular response to stress | No |
GO:0009987 | cellular process | No |
GO:1901363 | heterocyclic compound binding | No |
GO:0032553 | ribonucleotide binding | No |
GO:1901265 | nucleoside phosphate binding | No |
GO:0003690 | double-stranded DNA binding | No |
GO:0017076 | purine nucleotide binding | No |
GO:0003674 | molecular_function | No |
GO:0006281 | DNA repair | No |
GO:0097367 | carbohydrate derivative binding | No |
GO:0006974 | cellular response to DNA damage stimulus | No |
GO:0044238 | primary metabolic process | No |
GO:0008152 | metabolic process | No |
GO:0032555 | purine ribonucleotide binding | No |
GO:0043170 | macromolecule metabolic process | No |
GO:0071704 | organic substance metabolic process | No |
GO:1901360 | organic cyclic compound metabolic process | No |
GO:0043167 | ion binding | No |
GO:0090304 | nucleic acid metabolic process | No |
GO:0036094 | small molecule binding | No |
GO:0043168 | anion binding | No |
GO:0003676 | nucleic acid binding | No |
GO:0044260 | cellular macromolecule metabolic process | No |
GO:0005488 | binding | No |
GO:0032559 | adenyl ribonucleotide binding | No |
GO:0097159 | organic cyclic compound binding | No |
GO:0050896 | response to stimulus | No |
GO:0006139 | nucleobase-containing compound metabolic process | No |
GO:0046483 | heterocycle metabolic process | No |
GO:0008150 | biological_process | No |
GO:0006807 | nitrogen compound metabolic process | No |
GO:0000166 | nucleotide binding | No |
GO:0006725 | cellular aromatic compound metabolic process | No |
GO:0051716 | cellular response to stimulus | No |
GO:0034641 | cellular nitrogen compound metabolic process | No |
GO:0044237 | cellular metabolic process | No |
GO:0030554 | adenyl nucleotide binding | No |
GO:0006259 | DNA metabolic process | No |
GO:0003677 | DNA binding | No |
GO:0006950 | response to stress | No |
GO:0035639 | purine ribonucleoside triphosphate binding | No |
SignalP signal predicted | Location (based on Ymax) |
D score (significance: > 0.45) |
---|---|---|
No | 1 - 24 | 0.45 |
Type of sequence | Sequence |
---|---|
Locus | Download genbank file of locus
The gene with 5 kb flanks (if sufficient flanking sequence is available). For use in cloning design programs. NOTE: features (genes or exons) that are only partially contained within the sequence are completely excluded. |
Protein | >Ani_SJS100_1|g10214.t1 MEPRGTKRSAEDSEEPQRPKRIRALDPDVVNKIAAGEIIVAPMHALKELIENAVDAGSTSIEILVKDGGLKLLQI TDNGHGIDRDDLPILCERFTTSKLKQFEDLSSIGTYGFRGEALASISHIAHLTVTTKTAGSSCAWRAHYSDGKLV PPKPGQSAAPKATAGRGGTQITVEDLFYNVPTRRRAFRSASEEYAKILDVVGRYSVHCSGVAFSCRKHGDSGVSV STPAAANTIDRIRQIHGSAVANELVEFNVEDEKLGFRSSGFATNANYHVKRTTILLFINHRSVESTAIKRAVEQT YSSFLPKGGHPFVYIDLEIEPQRVDVNVHPTKREVNFLNEDEIIECICNEIRSKLAQVDSSRTFLTQTLLPGVTT MEPANRDPEGTDTVPKTPSTTKKPYEHNLVRTDSKVRKITSMLTPATPHTPTASQADTTVLDEGLQYETTSREPH RISFTSVKNLRASVRNAMHNTLTETIASHTYVGLVDERRRIAAIQSGVKLYLIDYGMFCTEFFYQIGLTDFANFG VIKLSPPPKLIDLLRIAADTERNQSSQESTTTEEANEIFTNAPDLVAETLIDRREMLNEYFSLDISPEGDLLSIP LLLKGYLPSLGKLPRFLLRLGPYVDWANEEECFRTFLRELAAFYTPEQLPPPPKLQNGNETEGEGEGEDEFITQR RAQMARMLEHVVFPALRARMVATTRLLRGVVEVADLKGLYRVFERC* |
Coding | >Ani_SJS100_1|g10214.t1 ATGGAGCCTCGAGGCACGAAGAGGTCTGCAGAGGACAGTGAGGAGCCACAGAGGCCGAAGAGGATTAGAGCCTTA GATCCCGATGTTGTAAACAAAATTGCTGCAGGGGAGATTATCGTGGCTCCTATGCATGCGCTGAAAGAGCTAATC GAGAACGCTGTCGATGCTGGATCGACATCGATCGAGATTTTGGTGAAAGACGGTGGATTGAAGCTCCTGCAGATT ACTGACAATGGTCATGGCATCGACAGAGATGACCTGCCCATTCTGTGCGAGAGATTCACAACATCCAAGCTAAAG CAGTTCGAAGACCTTTCATCCATAGGCACGTACGGTTTCCGAGGTGAAGCTTTGGCAAGCATTAGCCATATTGCC CATCTGACTGTGACCACAAAAACTGCTGGTTCCAGCTGCGCATGGAGAGCACACTATAGCGATGGGAAGCTTGTT CCCCCTAAACCTGGACAGAGTGCTGCGCCCAAAGCAACCGCGGGACGCGGCGGCACGCAAATAACAGTAGAGGAC CTGTTCTACAATGTACCGACAAGACGTCGGGCTTTTCGATCAGCAAGTGAAGAATATGCCAAAATCCTCGATGTA GTTGGTCGATACTCCGTCCATTGCTCAGGCGTGGCCTTCTCCTGCCGCAAGCATGGTGATTCAGGCGTCAGTGTC TCCACTCCAGCAGCTGCAAATACAATAGACCGAATTCGTCAAATTCATGGCAGTGCAGTCGCCAACGAACTTGTC GAATTCAATGTTGAAGATGAAAAGCTGGGGTTTCGCTCATCTGGCTTTGCCACGAACGCAAACTACCATGTCAAA AGAACTACTATACTTCTTTTCATCAACCACCGCTCGGTCGAATCCACAGCCATCAAGCGAGCAGTTGAGCAAACA TACTCCAGCTTCCTCCCCAAAGGAGGCCATCCATTTGTCTACATTGACCTCGAAATTGAACCACAACGTGTCGAC GTCAACGTACATCCCACAAAACGCGAAGTGAATTTCCTCAACGAAGACGAAATCATTGAATGCATCTGCAACGAG ATCCGCTCCAAGCTGGCCCAAGTAGACTCAAGCCGGACCTTCCTAACCCAAACCCTCCTCCCAGGTGTAACAACA ATGGAGCCCGCAAACCGCGACCCTGAAGGCACAGACACCGTACCCAAAACACCATCCACGACCAAAAAGCCATAT GAACACAACCTCGTCCGCACCGACTCCAAAGTCCGCAAAATCACCTCCATGCTCACCCCCGCCACACCACACACG CCCACAGCCTCCCAAGCAGACACGACCGTCCTCGACGAAGGCCTCCAATACGAAACCACCTCCCGCGAACCCCAC CGCATCAGCTTCACATCCGTGAAGAACCTCCGTGCCTCCGTACGCAACGCAATGCACAACACCCTAACTGAAACC ATCGCCTCCCACACCTACGTCGGCCTCGTCGACGAACGTCGTCGGATCGCGGCCATCCAATCCGGCGTAAAACTC TACCTAATCGACTACGGCATGTTCTGCACCGAATTCTTCTACCAGATCGGCCTTACAGACTTTGCCAACTTCGGA GTCATCAAGCTGTCCCCACCACCCAAACTCATCGACCTCCTCCGAATCGCCGCAGACACCGAACGCAACCAATCC TCCCAAGAATCAACAACAACAGAAGAAGCCAACGAAATCTTCACCAACGCCCCCGACCTCGTTGCTGAAACCCTC ATCGACCGCCGCGAAATGCTAAACGAGTACTTCTCCCTTGACATCTCGCCTGAGGGGGACCTTCTCTCCATCCCC CTCCTCCTGAAAGGCTACCTCCCCAGCCTGGGAAAACTGCCCCGATTCCTCCTCAGACTAGGTCCCTATGTCGAT TGGGCAAACGAGGAGGAATGCTTCCGCACGTTTCTGCGAGAGCTTGCGGCTTTTTATACCCCTGAACAGTTGCCA CCGCCTCCGAAACTGCAGAATGGTAATGAAACAGAAGGGGAAGGAGAAGGGGAAGATGAGTTTATTACGCAGAGA CGGGCGCAGATGGCGCGGATGCTGGAACATGTGGTTTTCCCGGCTCTGAGGGCGCGGATGGTTGCTACGACGCGG CTGCTCCGGGGGGTGGTGGAGGTGGCGGATTTGAAGGGGTTGTATAGGGTATTTGAGCGGTGTTGA |
Transcript | >Ani_SJS100_1|g10214.t1 ATGGAGCCTCGAGGCACGAAGAGGTCTGCAGAGGACAGTGAGGAGCCACAGAGGCCGAAGAGGATTAGAGCCTTA GATCCCGATGTTGTAAACAAAATTGCTGCAGGGGAGATTATCGTGGCTCCTATGCATGCGCTGAAAGAGCTAATC GAGAACGCTGTCGATGCTGGATCGACATCGATCGAGATTTTGGTGAAAGACGGTGGATTGAAGCTCCTGCAGATT ACTGACAATGGTCATGGCATCGACAGAGATGACCTGCCCATTCTGTGCGAGAGATTCACAACATCCAAGCTAAAG CAGTTCGAAGACCTTTCATCCATAGGCACGTACGGTTTCCGAGGTGAAGCTTTGGCAAGCATTAGCCATATTGCC CATCTGACTGTGACCACAAAAACTGCTGGTTCCAGCTGCGCATGGAGAGCACACTATAGCGATGGGAAGCTTGTT CCCCCTAAACCTGGACAGAGTGCTGCGCCCAAAGCAACCGCGGGACGCGGCGGCACGCAAATAACAGTAGAGGAC CTGTTCTACAATGTACCGACAAGACGTCGGGCTTTTCGATCAGCAAGTGAAGAATATGCCAAAATCCTCGATGTA GTTGGTCGATACTCCGTCCATTGCTCAGGCGTGGCCTTCTCCTGCCGCAAGCATGGTGATTCAGGCGTCAGTGTC TCCACTCCAGCAGCTGCAAATACAATAGACCGAATTCGTCAAATTCATGGCAGTGCAGTCGCCAACGAACTTGTC GAATTCAATGTTGAAGATGAAAAGCTGGGGTTTCGCTCATCTGGCTTTGCCACGAACGCAAACTACCATGTCAAA AGAACTACTATACTTCTTTTCATCAACCACCGCTCGGTCGAATCCACAGCCATCAAGCGAGCAGTTGAGCAAACA TACTCCAGCTTCCTCCCCAAAGGAGGCCATCCATTTGTCTACATTGACCTCGAAATTGAACCACAACGTGTCGAC GTCAACGTACATCCCACAAAACGCGAAGTGAATTTCCTCAACGAAGACGAAATCATTGAATGCATCTGCAACGAG ATCCGCTCCAAGCTGGCCCAAGTAGACTCAAGCCGGACCTTCCTAACCCAAACCCTCCTCCCAGGTGTAACAACA ATGGAGCCCGCAAACCGCGACCCTGAAGGCACAGACACCGTACCCAAAACACCATCCACGACCAAAAAGCCATAT GAACACAACCTCGTCCGCACCGACTCCAAAGTCCGCAAAATCACCTCCATGCTCACCCCCGCCACACCACACACG CCCACAGCCTCCCAAGCAGACACGACCGTCCTCGACGAAGGCCTCCAATACGAAACCACCTCCCGCGAACCCCAC CGCATCAGCTTCACATCCGTGAAGAACCTCCGTGCCTCCGTACGCAACGCAATGCACAACACCCTAACTGAAACC ATCGCCTCCCACACCTACGTCGGCCTCGTCGACGAACGTCGTCGGATCGCGGCCATCCAATCCGGCGTAAAACTC TACCTAATCGACTACGGCATGTTCTGCACCGAATTCTTCTACCAGATCGGCCTTACAGACTTTGCCAACTTCGGA GTCATCAAGCTGTCCCCACCACCCAAACTCATCGACCTCCTCCGAATCGCCGCAGACACCGAACGCAACCAATCC TCCCAAGAATCAACAACAACAGAAGAAGCCAACGAAATCTTCACCAACGCCCCCGACCTCGTTGCTGAAACCCTC ATCGACCGCCGCGAAATGCTAAACGAGTACTTCTCCCTTGACATCTCGCCTGAGGGGGACCTTCTCTCCATCCCC CTCCTCCTGAAAGGCTACCTCCCCAGCCTGGGAAAACTGCCCCGATTCCTCCTCAGACTAGGTCCCTATGTCGAT TGGGCAAACGAGGAGGAATGCTTCCGCACGTTTCTGCGAGAGCTTGCGGCTTTTTATACCCCTGAACAGTTGCCA CCGCCTCCGAAACTGCAGAATGGTAATGAAACAGAAGGGGAAGGAGAAGGGGAAGATGAGTTTATTACGCAGAGA CGGGCGCAGATGGCGCGGATGCTGGAACATGTGGTTTTCCCGGCTCTGAGGGCGCGGATGGTTGCTACGACGCGG CTGCTCCGGGGGGTGGTGGAGGTGGCGGATTTGAAGGGGTTGTATAGGGTATTTGAGCGGTGTTGA |
Gene | >Ani_SJS100_1|g10214.t1 ATGGAGCCTCGAGGCACGAAGAGGTCTGCAGAGGACAGTGAGGAGCCACAGAGGCCGAAGAGGATTAGAGTCAGT TCCCACTTCTTGTCGTATTTTCGCTTAGGTATTGACACTACATAGGCCTTAGATCCCGATGTTGTAAACAAAATT GCTGCAGGGGAGATTATCGTGGCTCCTATGCATGCGCTGAAAGAGCTAATCGAGAACGCTGTCGATGCTGGATCG ACATCGATCGAGATTTTGGTGAAAGACGGTGGATTGAAGCTCCTGCAGATTACTGACAATGGTCATGGCATCGAC GTGAGTAGAGCGAGCAAGATCGTTCAGCAATAAAGTGGCTGATTGTGAGATACAGAGAGATGACCTGCCCATTCT GTGCGAGAGATTCACAACATCCAAGCTAAAGCAGTTCGAAGACCTTTCATCCATAGGCACGTACGGTTTCCGAGG TGAAGCTTTGGCAAGCATTAGCCATATTGCCCATCTGACTGTGACCACAAAAACTGCTGGTTCCAGCTGCGCATG GAGAGCACACTATAGCGATGGGAAGCTTGTTCCCCCTAAACCTGGACAGAGTGCTGCGCCCAAAGCAACCGCGGG ACGCGGCGGCACGCAAATAACAGTGAGTGAGATGTCAGTTTTATCTTTGAGGCTACAACAGCTCATTTGGCTCCC AGGTAGAGGACCTGTTCTACAATGTACCGACAAGACGTCGGGCTTTTCGATCAGCAAGTGAAGAATATGCCAAAA TCCTCGATGTAGTTGGTCGATACTCCGTCCATTGCTCAGGCGTGGCCTTCTCCTGCCGCAAGCATGGTGATTCAG GCGTCAGTGTCTCCACTCCAGCAGCTGCAAATACAATAGACCGAATTCGTCAAATTCATGGCAGTGCAGTCGCCA ACGAACTTGTCGAATTCAATGTTGAAGATGAAAAGCTGGGGTTTCGCTCATCTGGCTTTGCCACGAACGCAAACT ACCATGTCAAAAGAACTACTATACTTCTTTTCATCAACCACCGCTCGGTCGAATCCACAGCCATCAAGCGAGCAG TTGAGCAAACATACTCCAGCTTCCTCCCCAAAGGAGGCCATCCATTTGTCTACATTGACCTCGAAATTGAACCAC AACGTGTCGACGTCAACGTACATCCCACAAAACGCGAAGTGAATTTCCTCAACGAAGACGAAATCATTGAATGCA TCTGCAACGAGATCCGCTCCAAGCTGGCCCAAGTAGACTCAAGCCGGACCTTCCTAACCCAAACCCTCCTCCCAG GTGTAACAACAATGGAGCCCGCAAACCGCGACCCTGAAGGCACAGACACCGTACCCAAAACACCATCCACGACCA AAAAGCCATATGAACACAACCTCGTCCGCACCGACTCCAAAGTCCGCAAAATCACCTCCATGCTCACCCCCGCCA CACCACACACGCCCACAGCCTCCCAAGCAGACACGACCGTCCTCGACGAAGGCCTCCAATACGAAACCACCTCCC GCGAACCCCACCGCATCAGCTTCACATCCGTGAAGAACCTCCGTGCCTCCGTACGCAACGCAATGCACAACACCC TAACTGAAACCATCGCCTCCCACACCTACGTCGGCCTCGTCGACGAACGTCGTCGGATCGCGGCCATCCAATCCG GCGTAAAACTCTACCTAATCGACTACGGCATGTTCTGCACCGAATTCTTCTACCAGATCGGCCTTACAGACTTTG CCAACTTCGGAGTCATCAAGCTGTCCCCACCACCCAAACTCATCGACCTCCTCCGAATCGCCGCAGACACCGAAC GCAACCAATCCTCCCAAGAATCAACAACAACAGAAGAAGCCAACGAAATCTTCACCAACGCCCCCGACCTCGTTG CTGAAACCCTCATCGACCGCCGCGAAATGCTAAACGAGTACTTCTCCCTTGACATCTCGCCTGAGGGGGACCTTC TCTCCATCCCCCTCCTCCTGAAAGGCTACCTCCCCAGCCTGGGAAAACTGCCCCGATTCCTCCTCAGACTAGGTC CCTATGTCGATTGGGCAAACGAGGAGGAATGCTTCCGCACGTTTCTGCGAGAGCTTGCGGCTTTTTATACCCCTG AACAGTTGCCACCGCCTCCGAAACTGCAGAATGGTAATGAAACAGAAGGGGAAGGAGAAGGGGAAGATGAGTTTA TTACGCAGAGACGGGCGCAGATGGCGCGGATGCTGGAACATGTGGTTTTCCCGGCTCTGAGGGCGCGGATGGTTG CTACGACGCGGCTGCTCCGGGGGGTGGTGGAGGTGGCGGATTTGAAGGGGTTGTATAGGGTATTTGAGCGGTGTT GA |