Gene search


Sequence information


Select Gene Cds Cds_length GC_content Pep Pep_length
Cmo02g00226 ATGTCTTTGAACTGCAAGTTTTCTATCTTTAATGGTAAGTTTCCAGCTGGGTTTAAGCTGAAGAAGGCTAAGGAACATCTATGTTTGTTAAATTCCACCGGAAAATGGAAGAAGAAGCTGTTCTGTCTTTGGATTTTTACTGTAGTTTTAGCTTTGGGCAGCTGGTTTTCCTTGCGTTCGTGTAATGTAAATAGTGGAACAAAACAGCAAATCTCGAACTTGTTCGATGAAGAAACTCAAACTTTGATGCGGCATTTCAATGTAAGCAAGAATCAGATTCAAGCTTTGACTGCCTTGTTATCTGAATCAGATCGGATGACATCCATGGGGTGCATCAATGATTTTGGATCGGACAAATCACAGTTGGATGGAATTGCCTGTGCCCTGAGGTCACTGTACTGGGAGCAAGATTTGCACAAGCAGCATGTATGGGTTGAGGGAAGTGCAGATTCCAATGTTGGGCAGTGCCCAATTCGAACTGAGAAGGTCCCTGAAAATTCTGAACAATTATTCTCTGAGGACATAACTGTTCCATATGCTACAGACTTATCTGTCTCTTTACTTTCTTCTGGAAATCAGCTCTGTGGAAAGATAATTGAACAAACAGGAGTTCTAAGATACTTTATAAGAAAGCACTGGAAGGGTTTCTATAGTTTGTTGATTGGATGTTTTTCAGTCTTTCTTGGAGTGATATCATTCCAGAAAACATCTGGTTTCCATCTAAACTTGTGGAACAAAAAACATCCAAAGCCAAATCAGCCATTGGTTCATCCACAATGGGTCTTACAACGGAGAAAAAAGCATCAACAAGTCAGAGATTCTTCTAAAGGAGCAGGGAAATGGCGGAAGGTGCTCCTAAGAATATTTATTGTATTCGGAATAATCGGGTCCATATGGCTATTTTGGTACCTGAACAAGACAGCCATTTTGAGAAGAGAAGAAACACTTGCAAACATGTGTGATGAGCGAGCTCGGATGTTGCAGGACCAGTTTAATGTTAGCATGAACCATGTTCATGCATTGGCCGTTCTTACCTCGACGTTCCACCACGGAAAACAACCTTCAGCTATTAATCAGAAAACATTTGGTGAGTATACAGAAAGAACATCCTTTGAAAGACCACTAACTAGTGGTGTAGCTTATGCTTTGAAAGTTAATCACTCGGAAAGAGAGCAATTTGAGAGGATGCATGGATGGACAATAAAGAAAATGGAGACAGAGGACCAAACTCTTGTTCAAGATTGTAACCCAGAAAATTTGGAGCCTGCACCAATTCAAGACGAATATGCACCTGTCATATTTTCTCAAGAAACGGTCGCTCATATAGTTTCTATCGACATGATGTCTGGAAAAGAAGATCGTGAGAACATATTGCGAGCTAGAGCTTCTGGAAAGGGAGTGCTTACATCTCCTTTTAAGCTTTTGAAATCTAATCACCTTGGAGTCGTCCTCACGTTTGCCGTCTATGACACTGACCTCCCACCAGATGCTACTCCTGAGCAACGTATCGAAGCTACTGTTGGTTATCTAGGTGCATCTTATGATATCCCCTCACTGGTGGAAAAGCTTCTGCATCAGCTTGCTAGTAAGCAGACAATTGTTGTGAACGTTTATGATACGACCAATCATTCCGCTCCCATTAATATGTATGGTTCGGATTTTTCCGATACTGGGCTACTGCATATTAGTAAACTTGATTTTGGGGATCCATTAAGAAGGCACGAGATGCATTGTAGATTCAAGCATAAACCTCCACCTCCTTGGACAGCCATAAATTCATCGGTTGGCGTTCTTATCATTACCTTGCTGGTTGGGCATATTTTTCATGCAGCCATAAGTCGGATCGCAAAAGTGGAGAATGACTATCACGAGACGATGTGGCTTAAAAGTCTTGCCGAAGCTGCTGATGTCGCAAAATCACAGTTTCTGGCAACAATTTCCCATGAGATCAGAACACCAATGAATGGTGTTTTAGGCATGCTGAAATTACTGATGGACACTAACCTTGATCCGAATCAACTGGATTTTGCCCGGACTGCTCACGAGAGTGGAAAAGATCTGATATCTCTTATAAATAAGGTTCTTGATCAAGCTAAGATAGAATCAGGAAGGCTTGAACTTGAATCTGTACCTTTTGATCTGCGTGCTATACTTGATAATGTTCTCTCACCTTTCTCTCTCAAATCCAATGATATGGGAATTGAGTTGGCTGTTTATGTCTCTGATTTGGTGCCTGAAGTTGTCATCGGTGACCCTGGACGTTTTCGTCAAATAATTACTCATCTCGTTGGAAACTCGGTCAAGTTTACTCATAAAAAGGGACACATATTAGTCTCAGTGCATCTGGCTGATGAAGTGAGAGGCCCAGTTGATTTTATGGACATTGTGCTTAGACAAGGCACGTACTTAGCTGGAGATGCATCAAAGAATGGTTCTACTACATTCAGTGGGTTACCTGTGGTGAATAGATGGAAAAGCTGGGAGGATTTCAAAAAGTTCAGCAATACAAATGCGGTGGAGGAACCCAAAACTATTAAAATACTCGTGACCGTTGAGGATACAGGTGTAGGAATTCCCCGAAAAGCACAAAGCCGAATTTTCACTTCTTTTATGCAGGCTGATAGTTCCACTTCACGGATATATGGTGGCACTGGAATAGGATTGAGTATTAGCAAACGTTTGGTCGATCTCATGGATGGGGAGATCGGGTTTGTGAGTGAGCCTGGTATAGGTAGCACGTTTTCATTTACGGTTTCTTTTCAGAAAGGGGAAACAAATTTTCTGGATACGAAACGGTCACAGTTTGATCTCGGTGTTATGGAGTTCCAGGGGCTAAGAGCTTTAATAATAGATAATAGCTGCATTCGAGCTGAGGTCACGAGATATCATCTGCAGAGGCTCGGGATTTCTGTTGATATAGCTCTGAGTGCAGAATCTGCTTATCAGTATTTATCTAATACTTCCAACACAAGAGTATCGACACAATTAGCCATGATTCTTATAGACAAAGATATTTGGGACAAGAAAGTGGGTCTGAAGTTCCATCTTCTGTTTAAAGAGCACGTTGATAGAAGTGTGACAAATCTCCAGATGAATACTCCAAAGCTTTTTCTTTTGGCGAACCCCATTAGCTCCAATGAGCTGAATGAACTTAAATCTTCAGGTCATGTAAATAATGTATTGATTAAACCTCTTCAGCTAAACATGTTGGTTAGTTGCTTCCACGAAGCGTTTGGGATTGAGAAGCGGAATCAAGTAGTAATTAAAAGACCTTCTACTCTTAAGAACCTGCTGAAAGACAAGCATATTTTGGTTGTGGATGACAATGCGGTTAACAGAAGGGTGGCAGAAGGTGCCTTGAAGAAATATGGAGCTGTTGTAACTTGTGTAGAATGTGGAAAGGATGCGGTGGCTCTCCTCGACCCTCCACACGACTTTGATGCTTGTTTCATGGACCTCCAGATGCCTGAGATGAATGGGTTTGAAGCTACAAGACAGGTCCGTGCTGTAGAAAGAGGGGTAAACGAGAAACTTACATCAGGAGAAGTGTCAACTGAGGACAATAAGGTTTATTGGCACACACCAATATTTGCTATGACGGCAGATTTAATTCAGTCTCTGTATGAACAATGCCTGAATACCGACGCAGCGACAATAACTGTTTCTTACTTCTCTGACCAAAGGGACACGTCTCGTGTGGTAGGACGATGGCTAGCTGGTGGATGGAGTTATCTGGTCACATCCTTCTCATCTTTTACTTTGTGGAATGACAACCACGATTCGGTAGTCTTAGCAATGGCGTACAGGAAAAGCGAGCAAACGATGCATAATCCATAG 3837 41.67 MSLNCKFSIFNGKFPAGFKLKKAKEHLCLLNSTGKWKKKLFCLWIFTVVLALGSWFSLRSCNVNSGTKQQISNLFDEETQTLMRHFNVSKNQIQALTALLSESDRMTSMGCINDFGSDKSQLDGIACALRSLYWEQDLHKQHVWVEGSADSNVGQCPIRTEKVPENSEQLFSEDITVPYATDLSVSLLSSGNQLCGKIIEQTGVLRYFIRKHWKGFYSLLIGCFSVFLGVISFQKTSGFHLNLWNKKHPKPNQPLVHPQWVLQRRKKHQQVRDSSKGAGKWRKVLLRIFIVFGIIGSIWLFWYLNKTAILRREETLANMCDERARMLQDQFNVSMNHVHALAVLTSTFHHGKQPSAINQKTFGEYTERTSFERPLTSGVAYALKVNHSEREQFERMHGWTIKKMETEDQTLVQDCNPENLEPAPIQDEYAPVIFSQETVAHIVSIDMMSGKEDRENILRARASGKGVLTSPFKLLKSNHLGVVLTFAVYDTDLPPDATPEQRIEATVGYLGASYDIPSLVEKLLHQLASKQTIVVNVYDTTNHSAPINMYGSDFSDTGLLHISKLDFGDPLRRHEMHCRFKHKPPPPWTAINSSVGVLIITLLVGHIFHAAISRIAKVENDYHETMWLKSLAEAADVAKSQFLATISHEIRTPMNGVLGMLKLLMDTNLDPNQLDFARTAHESGKDLISLINKVLDQAKIESGRLELESVPFDLRAILDNVLSPFSLKSNDMGIELAVYVSDLVPEVVIGDPGRFRQIITHLVGNSVKFTHKKGHILVSVHLADEVRGPVDFMDIVLRQGTYLAGDASKNGSTTFSGLPVVNRWKSWEDFKKFSNTNAVEEPKTIKILVTVEDTGVGIPRKAQSRIFTSFMQADSSTSRIYGGTGIGLSISKRLVDLMDGEIGFVSEPGIGSTFSFTVSFQKGETNFLDTKRSQFDLGVMEFQGLRALIIDNSCIRAEVTRYHLQRLGISVDIALSAESAYQYLSNTSNTRVSTQLAMILIDKDIWDKKVGLKFHLLFKEHVDRSVTNLQMNTPKLFLLANPISSNELNELKSSGHVNNVLIKPLQLNMLVSCFHEAFGIEKRNQVVIKRPSTLKNLLKDKHILVVDDNAVNRRVAEGALKKYGAVVTCVECGKDAVALLDPPHDFDACFMDLQMPEMNGFEATRQVRAVERGVNEKLTSGEVSTEDNKVYWHTPIFAMTADLIQSLYEQCLNTDAATITVSYFSDQRDTSRVVGRWLAGGWSYLVTSFSSFTLWNDNHDSVVLAMAYRKSEQTMHNP 1278
       

Gff information


Chromosome Start End Strand Old_gene Gene Num
2 1044451 1055006 - CmoCh02G002260.1 Cmo02g00226 375823
       

Annotation information


Select Seq ID Length Analysis Description Start End IPR GO
Cmo02g00226 1278 ProSiteProfiles Response regulatory domain profile. 1102 1237 IPR001789 GO:0000160
Cmo02g00226 1278 Pfam CHASE domain 356 550 IPR006189 -
Cmo02g00226 1278 ProSiteProfiles Response regulatory domain profile. 946 1078 IPR001789 GO:0000160
Cmo02g00226 1278 CDD HisKA 638 699 IPR003661 GO:0000155|GO:0007165
Cmo02g00226 1278 CDD HATPase_EvgS-ArcB-TorS-like 755 918 - -
Cmo02g00226 1278 Pfam Response regulator receiver domain 1103 1174 IPR001789 GO:0000160
Cmo02g00226 1278 PRINTS Bacterial sensor protein C-terminal signature 847 861 IPR004358 GO:0016310|GO:0016772
Cmo02g00226 1278 PRINTS Bacterial sensor protein C-terminal signature 906 919 IPR004358 GO:0016310|GO:0016772
Cmo02g00226 1278 PRINTS Bacterial sensor protein C-terminal signature 882 900 IPR004358 GO:0016310|GO:0016772
Cmo02g00226 1278 PRINTS Bacterial sensor protein C-terminal signature 865 875 IPR004358 GO:0016310|GO:0016772
Cmo02g00226 1278 SUPERFAMILY ATPase domain of HSP90 chaperone/DNA topoisomerase II/histidine kinase 692 918 IPR036890 -
Cmo02g00226 1278 PANTHER HISTIDINE KINASE 2 31 103 - -
Cmo02g00226 1278 Gene3D - 304 356 - -
Cmo02g00226 1278 Gene3D - 63 105 - -
Cmo02g00226 1278 Gene3D - 607 701 - -
Cmo02g00226 1278 PANTHER HISTIDINE KINASE 2 267 1213 - -
Cmo02g00226 1278 PANTHER TWO-COMPONENT HISTIDINE KINASE 31 103 - -
Cmo02g00226 1278 PANTHER TWO-COMPONENT HISTIDINE KINASE 267 1213 - -
Cmo02g00226 1278 SUPERFAMILY CheY-like 941 1078 IPR011006 -
Cmo02g00226 1278 SMART HisKA_10 638 703 IPR003661 GO:0000155|GO:0007165
Cmo02g00226 1278 SMART REC_2 1101 1233 IPR001789 GO:0000160
Cmo02g00226 1278 SMART HKATPase_4 750 922 IPR003594 -
Cmo02g00226 1278 Pfam His Kinase A (phospho-acceptor) domain 638 703 IPR003661 GO:0000155|GO:0007165
Cmo02g00226 1278 Gene3D - 1084 1221 - -
Cmo02g00226 1278 CDD REC_hyHK_CKI1_RcsC-like 1103 1213 - -
Cmo02g00226 1278 SMART CHASE_2 353 552 IPR006189 -
Cmo02g00226 1278 ProSiteProfiles CHASE domain profile. 353 577 IPR006189 -
Cmo02g00226 1278 Gene3D - 702 925 IPR036890 -
Cmo02g00226 1278 Gene3D - 940 1083 - -
Cmo02g00226 1278 SUPERFAMILY Homodimeric domain of signal transducing histidine kinase 629 704 IPR036097 GO:0000155|GO:0007165
Cmo02g00226 1278 SUPERFAMILY CheY-like 1099 1216 IPR011006 -
Cmo02g00226 1278 Gene3D CHASE domain 357 584 IPR042240 -
Cmo02g00226 1278 ProSiteProfiles Histidine kinase domain profile. 645 922 IPR005467 -
Cmo02g00226 1278 Pfam Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase 750 920 IPR003594 -
       

Duplication type information


Select Gene1 Location1 Gene2 Location2 E-value Duplicated-type
Cmo02g00226 Cmo-Chr2:1044451 Cmo11g01837 Cmo-Chr11:12825491 0 dispersed
Cmo02g00226 Cmo-Chr2:1044451 Cmo20g00674 Cmo-Chr20:3286073 0 wgd
       

Transcription factors information


Select Gene Hmm_acc Hmm_name Score E-value Regulatory Factors Family
9204 PF03924 CHASE 1.00E-35 CL0165 Cmo TR
       

Pathway information


Select Query KO Definition Second KO KEGG Genes ID GHOSTX Score
Cmo02g00226 K14489 - csv:101221294 2085.07