Gene search


Sequence information


Gene Cds Cds_length GC_content Pep Pep_length
Cco02g1037 ATGTGGAGGGGTTGCTCTGGCGGAGCTGGGGTTTTTCTCAAGCTCACAAGGTGCCTACTGAGGATGTCTTTAAGCTCCAAGTTCTCTACCACTAACGGGAAATTTCCAGCTGGGTTTAAGCTGAAGAAGGCTAAGGAGCATCTGTGTTTGTTAAATTCAACCGGAAAATGGAAGAAGAAGCTTCTCTGTCTTTGGATTTTTGCTGTCGTTATAGTTTCGAGCAGTTGGTTTTCCTTGCGTTCCTATAATGTGAATAGTGGAACAAAACAAAAGCCCTCAAAGTTGTTCGACGGAGAAACTCGAACTTTGCTGCAGCATTTCAATGTAAGCAAGAATCAGCTTCAAGCTTTAGCTTCCTTGTTATCCGACTCAGATCGGATGTCATCCACTGGGTGCGCCAATGATTTTGGATCTGACACATCACAGTTGAATGGAATTGCCTGTGCCCTAAGGTCACTGTACTGGGAGCAAGATTTGTACAAGCAGTATGTAGGGGCTGAAGAAAGTGAAGATACCAATATTGGGGAGTGTCCAATTCGAACTGAGAAGATCCCTGGAAATTCTGGTCAATTATTCTCTGATGACATAACTGTTCCATTTGCTACAGACTTATCCGCCTCGTTACTCTCTACTGGAAATCAACTTTGTGGAAAGATAATTGAACAAGCAGGAGTGCTAAGATGCCTTTTAAGAAAGCACTGGAAGAATTTCTGTAGTTTGATGATTGGATGTTTTTGGGTCCTCCTTGGAGTGATAGCGTTCAAGAAAATATCTGGTTTCTATCTAAAATTGTGGAACAAGAAACATCCAAAGCCAAATCAGCCATTAGTTCATCAACAATGGGTTTTACTGCGGAGAAAGCAACATCAACAGGTCAAAGAGTCTCCTAAAGGAGCTGGGAAATGGCGGAAGGTGCTCCTAAGAATATTTATTGTAGTTGGAATCGTTGGGTCCATTTGGTTATTTTGGTACCTGAACAAGACAGCTATTTTGAGAAGAGAAGAAACACTTGCAAACATGTGTGATGAGCGAGCCCGGATGTTGCAGGACCAATTTAATGTTAGCATGAACCATGTTCATGCATTGGCCGTTCTTACCTCTACCTTCCACCATGGAAAACAGCCTTCTGCCATTGATCAGAAAACATTTGGTGAATACACAGAAAGAACAGCCTTTGAAAGGCCACTGACCAGTGGTGTAGCTTATGCTTTGAAAGTTAATCACTCAGAAAGAGAGCATTTTGAGATGATGCATGGATGGACAATAAAGAAAATGGAGACAGAGGACCAAACTCTTGTTCAAGATTGTAATCCAGAAAATTTGGAACCTGCACCAATTCGAGATGAATATGCACCTGTCATATTTTCTCAAGAAACGGTAGCTCATATAGTTTCTATCGACATGATGTCTGGAAAAGAAGATCGTGAGAACATCTTGCGAGCTAGAGCTTCTGGAAAGGGTGTGCTTACATCTCCTTTTAAACTTTTGAAATCTAATCACCTTGGAGTTGTCCTCACATTTGCTGTCTATAATACTGACCTCCCACCAGATGCTACTCCTGAGCAACGTATTGAAGCTACTGTTGGTTATCTAGGTGCCTCATATGATATCCCCTCACTGGTGGAAAAGCTTCTGGATCAGCTTGCTAGCAAGCAGACAATTGTTGTGAATGTTTATGATACAACCAATGATTCTGCTCCCATCAACATGTATGGTTCTGATTTAACCGATACTGGGCTACTGCATATTAGTAAACTTGATTTTGGGGATCCTTTAAGAAGGCACGAGATGCACTGTAGATTCAAGCATAAACCTCCACCTCCTTGGACCGCTATCAATTCATCTGTAGGCGTCCTAATCATTACCTTGCTGGTTGGCCATATTTTTGATGCAGCTATAAATCGGATTGCAAAAGTGGAGAATGACTATCTCGAGACGAGGGGACTTAAAAGTCGCGCCGAAGCTGCTGATGTAGCAAAATCACAGTTT
CTGGCAACAGTTTCCCATGAGATCAGAACACCAATGAATGGTGTTTTAGGCATGCTGAAACTACTGATGGACACTAACCTTGATCCAAATCAACTGGATTTTGCCTGGACTGCTCACGAGAGTGGAAAAGATCTGATATCACTTATAAATAAGGTTCTTGATCAAGCTAAGATAGAATCAGGAAGGCTTGAACTTGAATCTGTACCTTTCGATCTGCGTGCTATAGTTGATAATGTGCTCTCACCTTTTTCACTCAAATCTAATGACAAGGGAATTGAGTTGGCTGTTTATGTCTCGGATTTGGTGCCTGAAATTGTCATTGGTGACCCTGGACGCTTTCGTCAAATAATTACTCATCTTGTTGGAAATTCACTCAAGTTTACTGATAAAGAGGGACACATACTAGTTTCAGTGCATCTGGCTGATGAAGTCAGAGGCCCAGTTGATTTCATGGACATTGTGCTTAAACAAGGCTCATATGTAAATGGAGATACATCAAGCAATAGCTATACTACATTCAGTGGGTTACCTGTGGTGGATAGATGGAAAAGCTGGGAAGACTTCAAAAAGTTCAGCAGGATGAATGTGGTGGAGGAACCTAAAACAATTAGAATACTCGTGACTGTTGAGGATACAGGTGTAGGAATTCCCCAAGACGCACAAAGCCGAATATTCACTCCTTTTATGCAGGCTGATAGTTCCACTTCTCGGACGTATAGTGGTACTGGAATAGGATTGAGTATTAGCAAACGTTTGGTGGATCTCATGGATGGGGAGATTGGGTTTGTGAGTGAGCCTCGTGTAGGTAGCACATTTTCATTTACGGTTTCTTTCCAGAAAGGGGAAATAAGTTTTTGGAATACCAGGCGGCCACAATATGATGTGGGTATTGGAGAGTTCCAGGGGCTAAAAGCATTAATAATCGATAATAGCTGCATTCGAGCAGAGGTCACGAGATATCATCTGCAGAGGCTGGGGATTTCTGTTGATATAACTGTGGGTACAGAGTCTGCTTATCAATATCTATCCAATACCTCCAACACAAGCTCATCGACACAATTGGCCATGATTCTTATAGACAGAGACATTTGGGAGAAGGAAGTGGGTCCTAAGTTCGATCTTTTGTTTAAAGAGCATGTGGATAAAAGTGTGACAAAGTTCTTTCTCTTGGGAACCCCCAAAAGCTCCAATGAGCAATATGAACTTGAATCTTCTGGTCATGTAAATAATGTGTTAAGTAAACCTCTTCAGTTAGACGTGTTGGTCAGTTGCTTCCGTGAAGCCTTCAGGATTGAGAAGAGGAATCAAGTAATTATTAAGAAACCATCCACTCTTAGGAGTCTGCTGAAAGAAAAGCATATCTTGGTTGTGGATGACAATGCGGTAAACAGAAGGGTGGCAGAAGGTGCCCTAAAGAAATATGGAGCCATTGTAACTTGTGTAAAATGTGGAAAGGATGCTGTGGCTCTCCTCAATCCTCCACACAACTTTGATGCTTGCTTCATGGACCTACAGATGCCTGAGATGGATGGGTATGAAGCTACAAGACAGGTCCGTGCTGTAGAAAGTGAGGTGAATGCGAAAATTGCATCTGGAGAAGTGTCAACTGAGGATAATAAAGTCCATTGGCACACGCCAATATTTGCTATGACAGCGGATTTAGTTCAGGCTATGAATGAGGAGTGCTTGAAGTGTGGGATGGATGGTTATGTAGCTAAGCCATTTGAAGAGGAGCAACTTTATTCGGCTGTAGCACGATTTTTCGAGACTGCTTGA 3765 42.12 
MWRGCSGGAGVFLKLTRCLLRMSLSSKFSTTNGKFPAGFKLKKAKEHLCLLNSTGKWKKKLLCLWIFAVVIVSSSWFSLRSYNVNSGTKQKPSKLFDGETRTLLQHFNVSKNQLQALASLLSDSDRMSSTGCANDFGSDTSQLNGIACALRSLYWEQDLYKQYVGAEESEDTNIGECPIRTEKIPGNSGQLFSDDITVPFATDLSASLLSTGNQLCGKIIEQAGVLRCLLRKHWKNFCSLMIGCFWVLLGVIAFKKISGFYLKLWNKKHPKPNQPLVHQQWVLLRRKQHQQVKESPKGAGKWRKVLLRIFIVVGIVGSIWLFWYLNKTAILRREETLANMCDERARMLQDQFNVSMNHVHALAVLTSTFHHGKQPSAIDQKTFGEYTERTAFERPLTSGVAYALKVNHSEREHFEMMHGWTIKKMETEDQTLVQDCNPENLEPAPIRDEYAPVIFSQETVAHIVSIDMMSGKEDRENILRARASGKGVLTSPFKLLKSNHLGVVLTFAVYNTDLPPDATPEQRIEATVGYLGASYDIPSLVEKLLDQLASKQTIVVNVYDTTNDSAPINMYGSDLTDTGLLHISKLDFGDPLRRHEMHCRFKHKPPPPWTAINSSVGVLIITLLVGHIFDAAINRIAKVENDYLETRGLKSRAEAADVAKSQFLATVSHEIRTPMNGVLGMLKLLMDTNLDPNQLDFAWTAHESGKDLISLINKVLDQAKIESGRLELESVPFDLRAIVDNVLSPFSLKSNDKGIELAVYVSDLVPEIVIGDPGRFRQIITHLVGNSLKFTDKEGHILVSVHLADEVRGPVDFMDIVLKQGSYVNGDTSSNSYTTFSGLPVVDRWKSWEDFKKFSRMNVVEEPKTIRILVTVEDTGVGIPQDAQSRIFTPFMQADSSTSRTYSGTGIGLSISKRLVDLMDGEIGFVSEPRVGSTFSFTVSFQKGEISFWNTRRPQYDVGIGEFQGLKALIIDNSCIRAEVTRYHLQRLGISVDITVGTESAYQYLSNTSNTSSSTQLAMILIDRDIWEKEVGPKFDLLFKEHVDKSVTKFFLLGTPKSSNEQYELESSGHVNNVLSKPLQLDVLVSCFREAFRIEKRNQVIIKKPSTLRSLLKEKHILVVDDNAVNRRVAEGALKKYGAIVTCVKCGKDAVALLNPPHNFDACFMDLQMPEMDGYEATRQVRAVESEVNAKIASGEVSTEDNKVHWHTPIFAMTADLVQAMNEECLKCGMDGYVAKPFEEEQLYSAVARFFETA 1254
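
The lengths in the table above are mutually consistent: a 1254-residue peptide plus one stop codon needs 3 × 1255 = 3765 nucleotides, which matches Cds_length. The sketch below shows one way to run that check, along with a simple GC-content calculation. The function names are hypothetical, and the GC formula (G plus C as a share of all bases, rounded to two decimals) is an assumption about how GC_content was derived, not something this record confirms.

```python
def expected_cds_length(pep_length: int, has_stop: bool = True) -> int:
    """Nucleotides needed to encode pep_length residues, plus an optional stop codon."""
    return 3 * (pep_length + (1 if has_stop else 0))


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence, rounded to 2 decimals.

    Assumed formula; the database may compute GC_content differently.
    """
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_bases = sum(base in "GC" for base in seq)
    return round(100.0 * gc_bases / len(seq), 2)


# Values copied from the Cco02g1037 row above.
cds_length, pep_length = 3765, 1254
lengths_agree = expected_cds_length(pep_length) == cds_length
```

Passing the full Cds string from the table to `gc_content` would give a value to compare against the reported GC_content column.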
       

Gff information


Chromosome Start End Strand Old_gene Gene Num
2 13603027 13611199 + CcPI632755_02g010370.1 Cco02g1037 171794
       

Annotation information


Seq ID Length Analysis Description Start End IPR GO
Cco02g1037 1254 ProSiteProfiles CHASE domain profile. 374 598 IPR006189 -
Cco02g1037 1254 Pfam Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase 866 941 IPR003594 -
Cco02g1037 1254 Pfam CHASE domain 377 571 IPR006189 -
Cco02g1037 1254 FunFam Histidine kinase 4 631 722 - -
Cco02g1037 1254 ProSiteProfiles Histidine kinase domain profile. 666 943 IPR005467 -
Cco02g1037 1254 Gene3D - 86 126 - -
Cco02g1037 1254 Gene3D - 325 377 - -
Cco02g1037 1254 SUPERFAMILY ATPase domain of HSP90 chaperone/DNA topoisomerase II/histidine kinase 713 939 IPR036890 -
Cco02g1037 1254 PRINTS Bacterial sensor protein C-terminal signature 868 882 IPR004358 GO:0016310(InterPro)|GO:0016772(InterPro)
Cco02g1037 1254 PRINTS Bacterial sensor protein C-terminal signature 886 896 IPR004358 GO:0016310(InterPro)|GO:0016772(InterPro)
Cco02g1037 1254 PRINTS Bacterial sensor protein C-terminal signature 903 921 IPR004358 GO:0016310(InterPro)|GO:0016772(InterPro)
Cco02g1037 1254 PRINTS Bacterial sensor protein C-terminal signature 927 940 IPR004358 GO:0016310(InterPro)|GO:0016772(InterPro)
Cco02g1037 1254 SUPERFAMILY CheY-like 962 1093 IPR011006 -
Cco02g1037 1254 Gene3D CHASE domain 378 605 IPR042240 -
Cco02g1037 1254 SMART HKATPase_4 771 943 IPR003594 -
Cco02g1037 1254 CDD REC_hyHK_CKI1_RcsC-like 1117 1247 - -
Cco02g1037 1254 Pfam His Kinase A (phospho-acceptor) domain 659 724 IPR003661 GO:0000155(InterPro)|GO:0007165(InterPro)
Cco02g1037 1254 Pfam Response regulator receiver domain 1117 1247 IPR001789 GO:0000160(InterPro)
Cco02g1037 1254 SMART HisKA_10 659 724 IPR003661 GO:0000155(InterPro)|GO:0007165(InterPro)
Cco02g1037 1254 FunFam Histidine kinase 4 378 605 - -
Cco02g1037 1254 CDD HATPase_EvgS-ArcB-TorS-like 776 939 - -
Cco02g1037 1254 SUPERFAMILY Homodimeric domain of signal transducing histidine kinase 647 725 IPR036097 GO:0000155(InterPro)|GO:0007165(InterPro)
Cco02g1037 1254 CDD HisKA 659 720 IPR003661 GO:0000155(InterPro)|GO:0007165(InterPro)
Cco02g1037 1254 ProSiteProfiles Response regulatory domain profile. 1116 1251 IPR001789 GO:0000160(InterPro)
Cco02g1037 1254 Gene3D - 723 945 IPR036890 -
Cco02g1037 1254 SMART CHASE_2 374 573 IPR006189 -
Cco02g1037 1254 Gene3D - 1109 1254 - -
Cco02g1037 1254 SMART REC_2 1115 1247 IPR001789 GO:0000160(InterPro)
Cco02g1037 1254 ProSiteProfiles Response regulatory domain profile. 967 1092 IPR001789 GO:0000160(InterPro)
Cco02g1037 1254 SUPERFAMILY CheY-like 1113 1251 IPR011006 -
Cco02g1037 1254 Gene3D - 628 722 - -
Cco02g1037 1254 PANTHER TWO-COMPONENT HISTIDINE KINASE 301 1250 IPR050956 -
       

Duplication type information


Gene1 Location1 Gene2 Location2 E-value Duplicated-type
Cco02g1037 Cco-Chr2:13603027 Cco06g0219 Cco-Chr6:2329359 5.00E-295 dispersed
Cco10g0065 Cco-Chr10:647061 Cco02g1037 Cco-Chr2:13603027 1.90E-39 transposed
Cco11g1145 Cco-Chr11:22293642 Cco02g1037 Cco-Chr2:13603027 1.40E-26 transposed
Cco02g0290 Cco-Chr2:2629916 Cco02g1037 Cco-Chr2:13603027 0.00E+00 wgd
       

Transcription factors information


Select Gene Hmm_acc Hmm_name Score E-value Regulatory Factors Family
58751 PF03924 CHASE 3.10E-35 CL0165 Cco TR
       

Pathway information


Query KO Definition Second KO KEGG Genes ID GHOSTX Score
Cco02g1037 K14489 - csv:101221294 2180.6