Gene search


Sequence information


Select Gene Cds Cds_length GC_content Pep Pep_length
Cmo20g00674 ATGTGGAGGGGTTGCTCTGGCGGAGCTGGGGTTTTGCTCAAGCTCTCAAGGTGCTTGCTGAGGATGTCTGTGAGCTGCAAGTTTTCTACCTCCAATGGAAAATTTCCAGCTGGGTTTAAGCTGAAGAAGGCGAAGGAGTATCTGTGTTTGTTAAATTCCACCGGAAAATGGAAGAAGAGGCTTCTCTGTCTTTGGATTCTTGCTCTACTTATACTCTTGAGCTGTTGGTTTCCCTTGCGTTCGTATAATGTAAATAGTGGCACAAAACAGAAGACCTCGTACTTGTTCGACGAAGAAACTCGAACTTTTCTGCGGCATTTCAATGTTAGCAAGAATCAGCTTCAAGCTTCCGAATCAGATCGGATATCCGGGTGCACCAATGATTTTGGACCTGACACATCAGAGTTGGATGGAATTGGCTGTGCTCAGAGGTTACTGTACTGGGAGCAAGATTTGCACAAGCAGTATGTATGGGTTGAAGAAAGTGCAGATTCCAATGTTGGGCGGTGCCCAGTTCGAACTGAGAAGATCCCTGAAAATTCTGACCAATTATTCTCTGAGGACATAACTGTTCCATTTGCTACAGACTTATCCATCTCTTTGCTTTCAAATGGAAATCAGCTTTGTGGAAAGATAATTGAACAAGCAGGACTGCTAAGATGCCTTATAAGAAAGCACTGGAAGATTTTCTGTAGTTTATTGATTGGATGTTTTTGGGTTCTTCTTGGACTGGTAGTGTTCCAGAAAGTATCTGGTTTCCATCTAAAATTGTGGAACAAGAAACATCCAAAGCCAAATCAGCCATTGGTTCATCAACAGTGGGTCTTACTAAGGAGAAAACAGCATCAACAAGTCAAAGATTCTCCTAAAGGCGCTGGTAAATGGCGGAAGGTGCTCTTAAGAATATTTATCGTAGTTGGGATCGTCGGGTCCAGTTGGTTATTTTGGTACCTGAACCAGACAGCCATTTTGAGAAGAGAAGAAACACTTGCAAACATGTGTGATGAGCGAGCCCGGATGTTGCAGGACCAATTTAACGTTAGCATGAACCATGTGCATGCATTGGCTGTTCTTACCTCTACTTTCCACCATGGAAAACAGCCTTCTGCCATTGATCAGAAAACATTTGGTGAATATACAGAAAGAACCGCCTTTGAAAGACCGCTGACTAGCGGTGTAGCTTATGCTCTGAAAGTTAATCACTCAGAAAGAGAGAAATTTGAGATGATGCATGGATGGACAATAAAGAAAATGGAGACTGAGGACCAAACTCTTGTTCAAGATTGTGATCCAGAAAATTTGGACCCTGCACCAATTCAAGATGAATATGCACCTGTCATATTTTCTCAAGAAACAGTCGCGCATATAGTTTCTATCGACATGATGTCGGGAAAAGAGGACCGTGAGAACATCTTGCGAGCTAGGGCGTCTGGAAAGGGAGTGCTTACATCTCCTTTTAAACTTTTGAAATCTAATCACCTTGGAGTTGTTCTCACATTTGCTGTCTATAACACTGACCTCCCACCAGACGCTACTCCCGAGCAACGTATTGAAGCTACTGTTGGCTATCTAGGTGCATCATACGATATCCCCTCACTGGTGGAAAAGCTTCTGCATCAGCTTGCTAGCAAGCAGACAATTGTTGTGAATGTTTATGATACAACCAATGATTCTGCTCCAATCAACATGTACGGTTCTGATTTTACCGATACCGGGCTACTGCATATTAGTAAACTTGATTTTGGGGATCCATTAAGAAGGCATGAGATGCATTGTAGATTCAAGCATAAACCTCCACCTCCTTGGACGGCTATCAATTCATCAGTTGGCGTCCTAATCATTACCTTGTTGGTTGGCCATATATTTCATGCAGCAATAAGTCGGATTGCAAAAGTGGAGAATGACTATCACGAGATGATGGGGCTTAAAAGTCGCGCCGAAGCTGCTGATGTGGCAAAATCACAGTTTCTGGCAACAGTTTCCCATGAGATCAGAACACCAATGAATGGTGTTTTAGGCATGCTCAAGTTACTGATGGACACTAACCTTGATTCAAAGCAACTGGATTTTGCCCAGACTGCCCACGAGAGTGGAAAAGATCTGATATCACTCATAAATAAGGTTCTTGATCAAGCTAAGATAGAATCAGGAAGGCTTGAACTTGAATCTGTGCCCTTCGATCTGCGTGCTATACTTGATAATGTGATCTCACCTTTTTCTCTCAAATCTAATGACAAGGGAATTGAGTTGGCTGTTTATGTCTCTGATTTGGTGCCTGAAGTTGTCATTGGTGACCCTGGACGCTTTCGTCAAATAATTACTCATCTTGTTGGAAATTCACTCAAGTTTACGCATAAAAATGGACATATATTAGTCTCGGTGCATCTGGCTGACGAGGTGAGAGGTCCAGTTGATTTTATGGACATTGTTCTTAAACAAGGCACATATGTAGTTGGAGATACATCAAACAATAGTTGTACTACATTCAGTGGGTTACCTGTGGTGGATAGATGGAAAAGCTGGGAGAAATTCAAAAAGTTCAGCAGGACGGATGCGGTGGAGGAACCCAAAACAATTAGAATACTCGTGACCGTAGAGGATACAGGTGTAGGAATTCCCCACGATGCACAAAACCGAGTATTCATGCCTTTTATGCAGGCAGATAGTTCCACATCTCGGACGTATGGTGGCACGGGAATAGGATTGAGTATTAGCAAACATTTGGTGGATCTCATGGATGGGGAGATTGGGTTTGTGAGTGAGCCTGGTATTGGTAGCACATTTTCTTTTACGGTTTCTTTCCAGAAAGGGGAAATTAGTTCTTTGGATACCAGAGAGCCACACTGTGATCTGGGTGTTGTCGATTTCCAGGGGCTAAGAGCATTAATAATTGATAATAGCTGCGTTCGAGCAGAGGTCACGAGATATCATCTTCAGAGGCTGGGGATCTCTGTTGATATTACTCTGAGTGCAGAGTCTGCTTATCAATATCTATCCAATGCCTCCAACACAAGAGCATCGACACAACTGGCCATGATTCTTATAGACAAAGACATTTGGGACAAGAAAGAGGGTCTCAAGTTTCATCTTTTGTTTAAAGAGCATGTGAATGGTCCAAAACTCTTTCTCTTGGCATCCCCCAAAAGCTCCAATGAGCAATATGAACTTAAATCTTCTGGTCATGTAAATAATGTGTTAAGTAAGCCTCTTCAGTTAAACGTGTTGGTCAGTTGCTTCCGCGACGTCTTTGGGATCGAGAAGAGGAATCAAGTAATTATTAAACGACCATCGACTCTTGGAAATCTGCTGAAAGGAAAGCATATCCTGGTTGTGGATGACAATGCTGTAAACAGAAAGGTGGCCGAAGGCGCCCTAAAGAAATATGGAGCTGTTGTAACTTGTGTAGAACGTGGAACGGATGCTGTGGCTCTCCTCAACCCTCCACACAACTTTGATGCTTGCTTCATGGACCTCCAGATGCCTGAGATGGATGGGTATGAAGCTACAAGACAGGTCCGTGCTGTAGAAAGTGGCGTGAACGCGAAAATTTCATCTGGGGAAGTGTCGAATGAGGATAATAAGGTCCATTGGCACACACCAATCTTTGCTATGACGGCAGATTTAATTCAGGCTATGAATGAAGAGTGTGTGAAGTGTGGGATGGATGGTTATGTAGCTAAGCCATTCGAAGAGGAGCAGCTTTATTCGACTGTAGCACGTTTTCTTGAGACTGCTTGA 3738 43.1 MWRGCSGGAGVLLKLSRCLLRMSVSCKFSTSNGKFPAGFKLKKAKEYLCLLNSTGKWKKRLLCLWILALLILLSCWFPLRSYNVNSGTKQKTSYLFDEETRTFLRHFNVSKNQLQASESDRISGCTNDFGPDTSELDGIGCAQRLLYWEQDLHKQYVWVEESADSNVGRCPVRTEKIPENSDQLFSEDITVPFATDLSISLLSNGNQLCGKIIEQAGLLRCLIRKHWKIFCSLLIGCFWVLLGLVVFQKVSGFHLKLWNKKHPKPNQPLVHQQWVLLRRKQHQQVKDSPKGAGKWRKVLLRIFIVVGIVGSSWLFWYLNQTAILRREETLANMCDERARMLQDQFNVSMNHVHALAVLTSTFHHGKQPSAIDQKTFGEYTERTAFERPLTSGVAYALKVNHSEREKFEMMHGWTIKKMETEDQTLVQDCDPENLDPAPIQDEYAPVIFSQETVAHIVSIDMMSGKEDRENILRARASGKGVLTSPFKLLKSNHLGVVLTFAVYNTDLPPDATPEQRIEATVGYLGASYDIPSLVEKLLHQLASKQTIVVNVYDTTNDSAPINMYGSDFTDTGLLHISKLDFGDPLRRHEMHCRFKHKPPPPWTAINSSVGVLIITLLVGHIFHAAISRIAKVENDYHEMMGLKSRAEAADVAKSQFLATVSHEIRTPMNGVLGMLKLLMDTNLDSKQLDFAQTAHESGKDLISLINKVLDQAKIESGRLELESVPFDLRAILDNVISPFSLKSNDKGIELAVYVSDLVPEVVIGDPGRFRQIITHLVGNSLKFTHKNGHILVSVHLADEVRGPVDFMDIVLKQGTYVVGDTSNNSCTTFSGLPVVDRWKSWEKFKKFSRTDAVEEPKTIRILVTVEDTGVGIPHDAQNRVFMPFMQADSSTSRTYGGTGIGLSISKHLVDLMDGEIGFVSEPGIGSTFSFTVSFQKGEISSLDTREPHCDLGVVDFQGLRALIIDNSCVRAEVTRYHLQRLGISVDITLSAESAYQYLSNASNTRASTQLAMILIDKDIWDKKEGLKFHLLFKEHVNGPKLFLLASPKSSNEQYELKSSGHVNNVLSKPLQLNVLVSCFRDVFGIEKRNQVIIKRPSTLGNLLKGKHILVVDDNAVNRKVAEGALKKYGAVVTCVERGTDAVALLNPPHNFDACFMDLQMPEMDGYEATRQVRAVESGVNAKISSGEVSNEDNKVHWHTPIFAMTADLIQAMNEECVKCGMDGYVAKPFEEEQLYSTVARFLETA 1245
       

Gff information


Chromosome Start End Strand Old_gene Gene Num
20 3286073 3294441 - CmoCh20G006740.1 Cmo20g00674 404991
       

Annotation information


Select Seq ID Length Analysis Description Start End IPR GO
Cmo20g00674 1245 ProSiteProfiles Response regulatory domain profile. 960 1083 IPR001789 GO:0000160
Cmo20g00674 1245 SMART HKATPase_4 764 936 IPR003594 -
Cmo20g00674 1245 SUPERFAMILY Homodimeric domain of signal transducing histidine kinase 638 718 IPR036097 GO:0000155|GO:0007165
Cmo20g00674 1245 SMART HisKA_10 652 717 IPR003661 GO:0000155|GO:0007165
Cmo20g00674 1245 Gene3D - 1101 1245 - -
Cmo20g00674 1245 ProSiteProfiles CHASE domain profile. 367 591 IPR006189 -
Cmo20g00674 1245 Gene3D - 716 938 IPR036890 -
Cmo20g00674 1245 ProSiteProfiles Response regulatory domain profile. 1107 1242 IPR001789 GO:0000160
Cmo20g00674 1245 Pfam Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase 764 934 IPR003594 -
Cmo20g00674 1245 Gene3D - 955 1092 - -
Cmo20g00674 1245 PANTHER TWO-COMPONENT HISTIDINE KINASE 280 1243 - -
Cmo20g00674 1245 ProSiteProfiles Histidine kinase domain profile. 659 936 IPR005467 -
Cmo20g00674 1245 PANTHER HISTIDINE KINASE 2 280 1243 - -
Cmo20g00674 1245 Pfam His Kinase A (phospho-acceptor) domain 652 717 IPR003661 GO:0000155|GO:0007165
Cmo20g00674 1245 SUPERFAMILY ATPase domain of HSP90 chaperone/DNA topoisomerase II/histidine kinase 706 932 IPR036890 -
Cmo20g00674 1245 Pfam CHASE domain 370 564 IPR006189 -
Cmo20g00674 1245 Gene3D - 318 370 - -
Cmo20g00674 1245 SMART CHASE_2 367 566 IPR006189 -
Cmo20g00674 1245 CDD REC_hyHK_CKI1_RcsC-like 1108 1238 - -
Cmo20g00674 1245 SUPERFAMILY CheY-like 1103 1243 IPR011006 -
Cmo20g00674 1245 SMART REC_2 1106 1238 IPR001789 GO:0000160
Cmo20g00674 1245 SUPERFAMILY CheY-like 955 1084 IPR011006 -
Cmo20g00674 1245 Pfam Response regulator receiver domain 1108 1238 IPR001789 GO:0000160
Cmo20g00674 1245 PRINTS Bacterial sensor protein C-terminal signature 861 875 IPR004358 GO:0016310|GO:0016772
Cmo20g00674 1245 PRINTS Bacterial sensor protein C-terminal signature 879 889 IPR004358 GO:0016310|GO:0016772
Cmo20g00674 1245 PRINTS Bacterial sensor protein C-terminal signature 896 914 IPR004358 GO:0016310|GO:0016772
Cmo20g00674 1245 PRINTS Bacterial sensor protein C-terminal signature 920 933 IPR004358 GO:0016310|GO:0016772
Cmo20g00674 1245 Gene3D - 621 715 - -
Cmo20g00674 1245 Gene3D CHASE domain 371 598 IPR042240 -
Cmo20g00674 1245 CDD HATPase_EvgS-ArcB-TorS-like 769 932 - -
Cmo20g00674 1245 CDD HisKA 652 713 IPR003661 GO:0000155|GO:0007165
       

Duplication type information


Select Gene1 Location1 Gene2 Location2 E-value Duplicated-type
Cmo06g00111 Cmo-Chr6:629770 Cmo20g00674 Cmo-Chr20:3286073 1.24E-15 dispersed
Cmo15g00925 Cmo-Chr15:4707929 Cmo20g00674 Cmo-Chr20:3286073 0 dispersed
Cmo16g01290 Cmo-Chr16:9191549 Cmo20g00674 Cmo-Chr20:3286073 1.71E-15 dispersed
Cmo17g01385 Cmo-Chr17:10735030 Cmo20g00674 Cmo-Chr20:3286073 0 dispersed
Cmo11g01837 Cmo-Chr11:12825491 Cmo20g00674 Cmo-Chr20:3286073 0 wgd
Cmo02g00226 Cmo-Chr2:1044451 Cmo20g00674 Cmo-Chr20:3286073 0 wgd
       

Transcription factors information


Select Gene Hmm_acc Hmm_name Score E-value Regulatory Factors Family
11753 PF03924 CHASE 5.70E-35 CL0165 Cmo TR
       

Pathway information


Select Query KO Definition Second KO KEGG Genes ID GHOSTX Score
Cmo20g00674 K14489 - csv:101221294 2187.92