Protein ID | Ani_SJS100_1|g5167.t1 |
Gene name | |
Location | scaffold_028:117648..120265 |
Strand | + |
Gene length (bp) | 2617 |
Transcript length (bp) | 2526 |
Coding sequence length (bp) | 2526 |
Protein length (aa) | 842 |
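
The reported lengths can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the database record. It assumes the coding sequence ends in a stop codon (the Coding entry in this record ends in TGA and the Protein entry ends in `*`), and that the difference between gene and transcript length is entirely intronic. The function name `summarize_lengths` is hypothetical.

```python
# Values copied from the record above.
GENE_LENGTH_BP = 2617
TRANSCRIPT_LENGTH_BP = 2526
CDS_LENGTH_BP = 2526
REPORTED_PROTEIN_LENGTH = 842


def summarize_lengths(gene_bp, transcript_bp, cds_bp):
    """Derive simple consistency figures from the reported lengths.

    Assumes the CDS includes a terminal stop codon and that
    gene length minus transcript length is intronic sequence.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a whole number of codons")
    codons = cds_bp // 3
    return {
        "codons_including_stop": codons,
        "residues_excluding_stop": codons - 1,
        "intronic_bp": gene_bp - transcript_bp,
        "utr_bp": transcript_bp - cds_bp,
    }


summary = summarize_lengths(GENE_LENGTH_BP, TRANSCRIPT_LENGTH_BP, CDS_LENGTH_BP)
print(summary)
```

Under these assumptions the reported protein length of 842 equals the codon count including the stop codon, so the translated product would have 841 residues. The transcript and coding sequence have equal length, so no untranslated regions are annotated.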
PFAM Domain ID | Short name | Long name | E-value | Start | End |
---|---|---|---|---|---|
PF01119 | DNA_mis_repair | DNA mismatch repair protein, C-terminal domain | 5.1E-11 | 241 | 354 |
PF02518 | HATPase_c | Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase | 6.2E-08 | 25 | 97 |
PF13589 | HATPase_c_3 | Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase | 2.7E-07 | 27 | 68 |
GO Term | Description | Terminal node |
---|---|---|
GO:0006298 | mismatch repair | Yes |
GO:0030983 | mismatched DNA binding | Yes |
GO:0005524 | ATP binding | Yes |
GO:0033554 | cellular response to stress | No |
GO:0009987 | cellular process | No |
GO:1901363 | heterocyclic compound binding | No |
GO:0032553 | ribonucleotide binding | No |
GO:1901265 | nucleoside phosphate binding | No |
GO:0003690 | double-stranded DNA binding | No |
GO:0017076 | purine nucleotide binding | No |
GO:0003674 | molecular_function | No |
GO:0006281 | DNA repair | No |
GO:0097367 | carbohydrate derivative binding | No |
GO:0006974 | cellular response to DNA damage stimulus | No |
GO:0044238 | primary metabolic process | No |
GO:0008152 | metabolic process | No |
GO:0032555 | purine ribonucleotide binding | No |
GO:0043170 | macromolecule metabolic process | No |
GO:0071704 | organic substance metabolic process | No |
GO:1901360 | organic cyclic compound metabolic process | No |
GO:0043167 | ion binding | No |
GO:0090304 | nucleic acid metabolic process | No |
GO:0036094 | small molecule binding | No |
GO:0043168 | anion binding | No |
GO:0003676 | nucleic acid binding | No |
GO:0044260 | cellular macromolecule metabolic process | No |
GO:0005488 | binding | No |
GO:0032559 | adenyl ribonucleotide binding | No |
GO:0097159 | organic cyclic compound binding | No |
GO:0050896 | response to stimulus | No |
GO:0006139 | nucleobase-containing compound metabolic process | No |
GO:0046483 | heterocycle metabolic process | No |
GO:0008150 | biological_process | No |
GO:0006807 | nitrogen compound metabolic process | No |
GO:0000166 | nucleotide binding | No |
GO:0006725 | cellular aromatic compound metabolic process | No |
GO:0051716 | cellular response to stimulus | No |
GO:0034641 | cellular nitrogen compound metabolic process | No |
GO:0044237 | cellular metabolic process | No |
GO:0030554 | adenyl nucleotide binding | No |
GO:0006259 | DNA metabolic process | No |
GO:0003677 | DNA binding | No |
GO:0006950 | response to stress | No |
GO:0035639 | purine ribonucleoside triphosphate binding | No |
SignalP signal predicted | Location (based on Ymax) | D score (significance: > 0.45) |
---|---|---|
No | 1 - 18 | 0.45 |
Type of sequence | Sequence |
---|---|
Locus | GenBank file of the locus: the gene with 5 kb flanks (if sufficient flanking sequence is available), for use in cloning design programs. Note: features (genes or exons) that are only partially contained within the sequence are completely excluded. |
Protein | >Ani_SJS100_1|g5167.t1 MPIAALPLAAVRAIGSASVISDPCSIVKELLDNALDAAATSVLIEISQNTLDIIQVKDNGHGIPSTDHPFVCKRT FTSKIQSVEDLRTIGGKSLGFRGEALASAAEVSGGVAITTRVQHEPVGSSIKYGRNGELLSTQRASHPVGTSVRI TDLFKHIPVRRQTTLKNAAKTVVRIKKLVQAYAIAQPSKRLSLKVLKAKTETNNWMYAPGNNTSLVDAVLKVVGT EAASSCLVKVWPSEIDSDRPFRMLAFLPKPESDLTKVNGSGQYISIDGRPLSTTRGIAQDIAKLYKSYIRLVASS KEAAANITDPLLCLHLKCTNASYDVNIEPAKDDVLLEDREELLSLVEELLCDAYGVEAKPTERPRTIDKGKEPAS YQNSFELLLARKSPDESRAKSSYGSSSTPRTPALEDPDSFNTENRLEKRIAISNPWSISKPSKQMPTRDSSSSRS DTDKIVRHQIVRYPTESRRNSRETLQTCSPGSLLPSPSATSTTAPGYVTEAQSSPSSPGTPQTPLPRTRQASLDH DRHPRNGALDNWLGKTPASLSQSTIGEVPIGNGDERSLQQLAQERFGSPERSSGDPGSIEGGVVTNNQSPESQAS SAEAETEPEGAGSPLLKSTADLELRESHAPDRGRSSATLRQLLSENNHQELGRALDFENRKREAILKRREQLKSQ RNPSSPHLSRYLAAKAALQKHPDADRDEPTKPVLGPHDPRSYLMRYKVQDQNEDGISKTRRINTEKLPLENIPNG CELYSLALTQEADISLISTTCDHLTKTDLYTGCGDQCEAFNPPHKDTETMLDLWRNQLTSLIQRNYRTSERSGLP DVHLNFSRLAQRGHSK* |
Coding | >Ani_SJS100_1|g5167.t1 ATGCCAATCGCAGCGCTCCCCCTAGCAGCGGTGCGGGCAATTGGCTCCGCGTCTGTTATTTCCGATCCTTGCTCC ATTGTCAAGGAGCTCCTGGACAACGCCTTGGATGCAGCAGCCACTTCGGTACTTATTGAGATCTCACAAAATACA CTTGACATAATACAAGTGAAAGACAATGGCCATGGTATCCCCTCTACCGATCATCCTTTTGTCTGCAAACGCACC TTCACTAGCAAAATTCAATCCGTCGAAGATCTTAGAACTATCGGCGGAAAGTCTCTAGGTTTTAGAGGCGAAGCA CTTGCTAGTGCAGCTGAGGTATCTGGCGGCGTGGCTATCACAACTAGAGTACAGCATGAGCCGGTTGGTTCTTCT ATCAAATACGGCAGAAATGGAGAGCTTCTAAGTACCCAGCGAGCGTCACATCCGGTAGGAACCAGTGTTCGCATA ACGGATTTGTTCAAACACATCCCAGTCCGGAGACAAACTACGTTAAAGAATGCGGCCAAGACGGTCGTGAGGATC AAGAAGCTCGTACAAGCATACGCTATTGCCCAACCCTCCAAACGTCTATCCTTGAAGGTCCTCAAAGCAAAGACA GAGACGAATAACTGGATGTATGCCCCAGGCAACAACACTAGCCTTGTAGATGCAGTATTGAAAGTTGTGGGCACC GAAGCTGCCTCTAGCTGTCTGGTCAAGGTCTGGCCATCTGAGATTGACAGCGACAGGCCATTTCGGATGCTCGCG TTCCTTCCAAAGCCAGAATCCGACCTTACGAAAGTGAACGGCTCGGGGCAGTATATCAGTATCGATGGCAGGCCC TTATCGACCACCCGAGGGATCGCGCAAGATATTGCGAAACTATATAAGTCTTATATTCGTCTCGTTGCTTCCAGT AAGGAGGCAGCTGCAAACATTACAGATCCGCTTCTGTGTCTTCATCTCAAGTGTACTAATGCAAGCTACGACGTC AACATTGAGCCCGCTAAAGATGATGTTCTTCTTGAAGACCGGGAAGAGCTTCTTTCTTTGGTAGAAGAGCTTTTG TGTGATGCGTACGGCGTCGAAGCAAAACCAACTGAGAGACCAAGAACTATCGATAAAGGGAAAGAGCCCGCTTCC TATCAGAACTCATTCGAATTGCTTCTTGCCCGTAAAAGTCCGGATGAATCAAGGGCTAAAAGCAGTTATGGCTCT TCGTCGACACCACGGACTCCGGCTTTAGAGGATCCAGACAGCTTCAATACTGAGAATCGTCTGGAGAAGAGAATC GCGATTAGCAACCCCTGGTCAATATCTAAGCCAAGCAAGCAGATGCCGACGCGCGACAGTTCCTCATCGCGCAGT GATACGGACAAGATAGTGAGACATCAAATTGTGCGGTATCCGACTGAAAGCAGAAGAAACTCACGGGAAACGCTA CAAACATGCTCTCCAGGGTCTCTCCTACCGAGTCCCTCTGCTACCTCGACGACTGCTCCTGGCTACGTGACTGAA GCTCAGTCTTCGCCGTCTAGTCCAGGGACGCCTCAAACTCCTCTTCCCAGGACAAGGCAAGCCTCGTTGGACCAT GACAGACATCCCAGAAACGGAGCCCTTGATAACTGGTTGGGGAAAACCCCAGCCTCCTTGTCTCAGTCAACCATA GGCGAGGTGCCCATAGGAAATGGGGACGAGCGCTCATTGCAGCAGTTGGCGCAAGAAAGATTCGGTTCACCAGAA AGGTCATCAGGTGATCCAGGATCGATAGAGGGTGGCGTTGTTACTAATAATCAGTCTCCCGAATCTCAGGCTAGC TCTGCAGAAGCAGAAACGGAGCCTGAAGGAGCTGGCTCTCCCTTGCTTAAGAGTACTGCAGACCTTGAACTTCGA 
GAAAGCCACGCACCCGACCGTGGAAGGTCCTCGGCTACTCTACGTCAGCTTCTTTCTGAGAATAACCACCAGGAG CTGGGACGTGCACTCGACTTTGAGAACCGCAAGAGGGAGGCTATCCTTAAGCGCCGGGAACAACTCAAAAGCCAG CGAAACCCGAGCTCACCCCATCTAAGTCGGTATCTAGCTGCCAAAGCCGCACTCCAGAAGCACCCAGATGCTGAT CGGGACGAGCCGACGAAGCCTGTGCTGGGTCCACATGACCCACGGTCGTATCTAATGCGTTACAAGGTGCAAGAT CAGAACGAAGATGGCATATCAAAGACCAGAAGGATTAACACAGAGAAGCTGCCATTAGAAAACATTCCAAATGGG TGTGAGTTGTACAGTCTGGCTCTGACCCAAGAGGCCGACATCTCGCTCATTTCAACCACATGCGACCATTTAACC AAGACAGATCTTTATACCGGATGCGGAGACCAGTGTGAAGCATTTAACCCGCCACATAAAGATACAGAAACAATG CTTGATCTCTGGAGGAATCAACTGACGAGTTTGATACAGCGCAATTACCGGACTTCAGAACGGTCGGGATTGCCG GATGTGCACTTGAATTTCTCTCGTTTAGCGCAGCGCGGGCATTCCAAGTGA |
Transcript | >Ani_SJS100_1|g5167.t1 ATGCCAATCGCAGCGCTCCCCCTAGCAGCGGTGCGGGCAATTGGCTCCGCGTCTGTTATTTCCGATCCTTGCTCC ATTGTCAAGGAGCTCCTGGACAACGCCTTGGATGCAGCAGCCACTTCGGTACTTATTGAGATCTCACAAAATACA CTTGACATAATACAAGTGAAAGACAATGGCCATGGTATCCCCTCTACCGATCATCCTTTTGTCTGCAAACGCACC TTCACTAGCAAAATTCAATCCGTCGAAGATCTTAGAACTATCGGCGGAAAGTCTCTAGGTTTTAGAGGCGAAGCA CTTGCTAGTGCAGCTGAGGTATCTGGCGGCGTGGCTATCACAACTAGAGTACAGCATGAGCCGGTTGGTTCTTCT ATCAAATACGGCAGAAATGGAGAGCTTCTAAGTACCCAGCGAGCGTCACATCCGGTAGGAACCAGTGTTCGCATA ACGGATTTGTTCAAACACATCCCAGTCCGGAGACAAACTACGTTAAAGAATGCGGCCAAGACGGTCGTGAGGATC AAGAAGCTCGTACAAGCATACGCTATTGCCCAACCCTCCAAACGTCTATCCTTGAAGGTCCTCAAAGCAAAGACA GAGACGAATAACTGGATGTATGCCCCAGGCAACAACACTAGCCTTGTAGATGCAGTATTGAAAGTTGTGGGCACC GAAGCTGCCTCTAGCTGTCTGGTCAAGGTCTGGCCATCTGAGATTGACAGCGACAGGCCATTTCGGATGCTCGCG TTCCTTCCAAAGCCAGAATCCGACCTTACGAAAGTGAACGGCTCGGGGCAGTATATCAGTATCGATGGCAGGCCC TTATCGACCACCCGAGGGATCGCGCAAGATATTGCGAAACTATATAAGTCTTATATTCGTCTCGTTGCTTCCAGT AAGGAGGCAGCTGCAAACATTACAGATCCGCTTCTGTGTCTTCATCTCAAGTGTACTAATGCAAGCTACGACGTC AACATTGAGCCCGCTAAAGATGATGTTCTTCTTGAAGACCGGGAAGAGCTTCTTTCTTTGGTAGAAGAGCTTTTG TGTGATGCGTACGGCGTCGAAGCAAAACCAACTGAGAGACCAAGAACTATCGATAAAGGGAAAGAGCCCGCTTCC TATCAGAACTCATTCGAATTGCTTCTTGCCCGTAAAAGTCCGGATGAATCAAGGGCTAAAAGCAGTTATGGCTCT TCGTCGACACCACGGACTCCGGCTTTAGAGGATCCAGACAGCTTCAATACTGAGAATCGTCTGGAGAAGAGAATC GCGATTAGCAACCCCTGGTCAATATCTAAGCCAAGCAAGCAGATGCCGACGCGCGACAGTTCCTCATCGCGCAGT GATACGGACAAGATAGTGAGACATCAAATTGTGCGGTATCCGACTGAAAGCAGAAGAAACTCACGGGAAACGCTA CAAACATGCTCTCCAGGGTCTCTCCTACCGAGTCCCTCTGCTACCTCGACGACTGCTCCTGGCTACGTGACTGAA GCTCAGTCTTCGCCGTCTAGTCCAGGGACGCCTCAAACTCCTCTTCCCAGGACAAGGCAAGCCTCGTTGGACCAT GACAGACATCCCAGAAACGGAGCCCTTGATAACTGGTTGGGGAAAACCCCAGCCTCCTTGTCTCAGTCAACCATA GGCGAGGTGCCCATAGGAAATGGGGACGAGCGCTCATTGCAGCAGTTGGCGCAAGAAAGATTCGGTTCACCAGAA AGGTCATCAGGTGATCCAGGATCGATAGAGGGTGGCGTTGTTACTAATAATCAGTCTCCCGAATCTCAGGCTAGC TCTGCAGAAGCAGAAACGGAGCCTGAAGGAGCTGGCTCTCCCTTGCTTAAGAGTACTGCAGACCTTGAACTTCGA 
GAAAGCCACGCACCCGACCGTGGAAGGTCCTCGGCTACTCTACGTCAGCTTCTTTCTGAGAATAACCACCAGGAG CTGGGACGTGCACTCGACTTTGAGAACCGCAAGAGGGAGGCTATCCTTAAGCGCCGGGAACAACTCAAAAGCCAG CGAAACCCGAGCTCACCCCATCTAAGTCGGTATCTAGCTGCCAAAGCCGCACTCCAGAAGCACCCAGATGCTGAT CGGGACGAGCCGACGAAGCCTGTGCTGGGTCCACATGACCCACGGTCGTATCTAATGCGTTACAAGGTGCAAGAT CAGAACGAAGATGGCATATCAAAGACCAGAAGGATTAACACAGAGAAGCTGCCATTAGAAAACATTCCAAATGGG TGTGAGTTGTACAGTCTGGCTCTGACCCAAGAGGCCGACATCTCGCTCATTTCAACCACATGCGACCATTTAACC AAGACAGATCTTTATACCGGATGCGGAGACCAGTGTGAAGCATTTAACCCGCCACATAAAGATACAGAAACAATG CTTGATCTCTGGAGGAATCAACTGACGAGTTTGATACAGCGCAATTACCGGACTTCAGAACGGTCGGGATTGCCG GATGTGCACTTGAATTTCTCTCGTTTAGCGCAGCGCGGGCATTCCAAGTGA |
Gene | >Ani_SJS100_1|g5167.t1 ATGCCAATCGCAGCGCTCCCCCTAGCAGCGGTGCGGGCAATTGGCTCCGCGTCTGTTATTTCCGATCCTTGCTCC ATTGTCAAGGAGCTCCTGGACAACGCCTTGGATGCAGCAGCCACTTCGGTACTTATTGAGATCTCACAAAATACA CTTGACATAATACAAGTGAAAGACAATGGCCATGGTATCCCCTCTACCGATCATCCTTTTGTCTGCAAACGCACC TTCACTAGCAAAATTCAATCCGTCGAAGATCTTAGAACTATCGGCGGAAAGTCTCTAGGTTTTAGAGGCGAAGCA CTTGCTAGTGCAGCTGAGGTATCTGGCGGCGTGGCTATCACAACTAGAGTACAGCATGAGCCGGTTGGTTCTTCT ATCAAATACGGCAGAAATGGAGAGCTTCTAAGGTGATTCCTACCGCCAATTGAGATAACCAAAGCTGACCGATGC AGTACCCAGCGAGCGTCACATCCGGTAGGAACCAGTGTTCGCATAACGGATTTGTTCAAACACATCCCAGTCCGG AGACAAACTACGTTAAAGAATGCGGCCAAGACGGTCGTGAGGATCAAGAAGCTCGTACAAGCATACGCTATTGCC CAACCCTCCAAACGTCTATCCTTGAAGGTCCTCAAAGCAAAGACAGAGACGAATAACTGGATGTATGCCCCAGGC AACAACACTAGCCTTGTAGATGCAGTATTGAAAGTTGTGGGCACCGAAGCTGCCTCTAGCTGTCTGGTCAAGGTC TGGCCATCTGAGATTGACAGCGACAGGCCATTTCGGATGCTCGCGTTCCTTCCAAAGCCAGAATCCGGTATGTAG CTGCTCTGTGGGTCCCCGAGTGCGACTGACGGTGTTAGACCTTACGAAAGTGAACGGCTCGGGGCAGTATATCAG TATCGATGGCAGGCCCTTATCGACCACCCGAGGGATCGCGCAAGATATTGCGAAACTATATAAGTCTTATATTCG TCTCGTTGCTTCCAGTAAGGAGGCAGCTGCAAACATTACAGATCCGCTTCTGTGTCTTCATCTCAAGTGTACTAA TGCAAGCTACGACGTCAACATTGAGCCCGCTAAAGATGATGTTCTTCTTGAAGACCGGGAAGAGCTTCTTTCTTT GGTAGAAGAGCTTTTGTGTGATGCGTACGGCGTCGAAGCAAAACCAACTGAGAGACCAAGAACTATCGATAAAGG GAAAGAGCCCGCTTCCTATCAGAACTCATTCGAATTGCTTCTTGCCCGTAAAAGTCCGGATGAATCAAGGGCTAA AAGCAGTTATGGCTCTTCGTCGACACCACGGACTCCGGCTTTAGAGGATCCAGACAGCTTCAATACTGAGAATCG TCTGGAGAAGAGAATCGCGATTAGCAACCCCTGGTCAATATCTAAGCCAAGCAAGCAGATGCCGACGCGCGACAG TTCCTCATCGCGCAGTGATACGGACAAGATAGTGAGACATCAAATTGTGCGGTATCCGACTGAAAGCAGAAGAAA CTCACGGGAAACGCTACAAACATGCTCTCCAGGGTCTCTCCTACCGAGTCCCTCTGCTACCTCGACGACTGCTCC TGGCTACGTGACTGAAGCTCAGTCTTCGCCGTCTAGTCCAGGGACGCCTCAAACTCCTCTTCCCAGGACAAGGCA AGCCTCGTTGGACCATGACAGACATCCCAGAAACGGAGCCCTTGATAACTGGTTGGGGAAAACCCCAGCCTCCTT GTCTCAGTCAACCATAGGCGAGGTGCCCATAGGAAATGGGGACGAGCGCTCATTGCAGCAGTTGGCGCAAGAAAG ATTCGGTTCACCAGAAAGGTCATCAGGTGATCCAGGATCGATAGAGGGTGGCGTTGTTACTAATAATCAGTCTCC 
CGAATCTCAGGCTAGCTCTGCAGAAGCAGAAACGGAGCCTGAAGGAGCTGGCTCTCCCTTGCTTAAGAGTACTGC AGACCTTGAACTTCGAGAAAGCCACGCACCCGACCGTGGAAGGTCCTCGGCTACTCTACGTCAGCTTCTTTCTGA GAATAACCACCAGGAGCTGGGACGTGCACTCGACTTTGAGAACCGCAAGAGGGAGGCTATCCTTAAGCGCCGGGA ACAACTCAAAAGCCAGCGAAACCCGAGCTCACCCCATCTAAGTCGGTATCTAGCTGCCAAAGCCGCACTCCAGAA GCACCCAGATGCTGATCGGGACGAGCCGACGAAGCCTGTGCTGGGTCCACATGACCCACGGTCGTATCTAATGCG TTACAAGGTGCAAGATCAGAACGAAGATGGCATATCAAAGACCAGAAGGATTAACACAGAGAAGCTGCCATTAGA AAACATTCCAAATGGGTGTGAGTTGTACAGTCTGGCTCTGACCCAAGAGGCCGACATCTCGCTCATTTCAACCAC ATGCGACCATTTAACCAAGACAGATCTTTATACCGGATGCGGAGACCAGTGTGAAGCATTTAACCCGCCACATAA AGATACAGAAACAATGCTTGATCTCTGGAGGAATCAACTGACGAGTTTGATACAGCGCAATTACCGGACTTCAGA ACGGTCGGGATTGCCGGATGTGCACTTGAATTTCTCTCGTTTAGCGCAGCGCGGGCATTCCAAGTGA |