What this is
- Plants must adapt to changing environmental conditions to transition from vegetative growth to reproductive development.
- This review discusses how and photoperiodic responses regulate flowering time in various crops.
- Understanding these mechanisms can enhance crop yield and resilience, particularly in the face of climate change.
Essence
- and light signaling pathways are crucial for regulating flowering time in plants. This understanding can lead to improved agricultural practices and crop resilience.
Key takeaways
- allows plants to sense day length, influencing flowering time. Plants are classified as long-day, short-day, or day-neutral based on their light exposure requirements.
- interact with light signals to regulate flowering. Key genes involved in this process include CONSTANS and FLOWERING LOCUS T, which are modulated by environmental cues.
- Epigenetic factors, such as DNA methylation and chromatin modifications, play a significant role in regulating flowering time and can be influenced by environmental conditions.
Caveats
- The review focuses on model organisms like Arabidopsis, which may not fully represent the complexities in other crops. More research is needed to generalize findings across species.
- While the study discusses genetic and epigenetic mechanisms, the interplay of these factors in real-world environments remains underexplored.
Definitions
- Photoperiodism: Physiological reaction of plants to the length of day and night, influencing flowering time.
- Circadian rhythms: Biological processes that follow a roughly 24-hour cycle, responding to light and darkness in the environment.
- Epigenetics: Heritable changes in gene expression that do not involve alterations to the DNA sequence.
AI simplified
Introduction
Earth’s orbit and rotation around the sun lead to cyclical changes in environmental factors, such as daily and seasonal changes in light, temperature, water, and nutrients, directly affecting plant growth and development, especially the transition from vegetative growth to reproductive development1. The sessile plants have evolved multiple strategies to anticipate these changes, synchronizing their life activities accordingly for survival and reproduction. Photoperiodism is a physiological reaction of plants to sense external environmental cues and to anticipate seasonal growth2–4. For many annual plants, the flowering time depends on the length of daily exposure to light. Based on whether their day length requirement is greater than the critical day length for reproductive transition, they are classified into long-day, short-day, and day-neutral plants5.
The hourglass and the circadian rhythm models are two main hypotheses underlying mechanistic understanding of the photoperiodic response in plants6–8. The hourglass hypothesis assumes that the photoperiodic response of plants to short- and long-day conditions depends mainly on the length measurement of the dark period and the accumulation or activity change of specific chemical components7. This model can explain some photoperiodic events, but is contradictory to others when growing long-day plants under short-day conditions, a short amount of nighttime light can promote their flowering9. Thus, the hourglass model cannot fully explain all photoperiodic flowering phenomena. An alternative hypothesis can better explain them; the circadian rhythm hypothesis emphasizes the interplay of circadian rhythms and light signals in the organism10,11 and proposes key components in regulation of the light-controlled enzyme and substrate by the circadian clock. When the external light cycle synchronizes with the internal rhythm, the enzyme is activated by light and interacts with the substrate, inducing the expression of floral identity genes such as CONSTANS (CO) and FLOWERING LOCUS T (FT), leading to flowering transition2–4,8.
The circadian clock regulates various transcriptional and post-transcriptional events at specific times of day, dating back to redox homeostatic mechanisms after the Great Oxidation Event at ~2.5 billion years ago12. This mechanism, known as circadian gating, adjusts an organism’s sensitivity to environmental stimuli throughout the day. Circadian rhythm model of photoperiodic flowering is a good example. In Arabidopsis, precise daily control of CO expression by the circadian clock is essential for accurately measuring day length. During short winter days, CO peaks at night, leading to protein degradation. Conversely, in early summer’s long days, CO peaks during daylight, stabilizing its protein and activating FT transcription for earlier flowering13,14. In addition to light, daily timing signals are provided by changes in temperature, which allow the clock to be entrained to run within a physiologically critical temperature range for a relatively constant amount of time. In plants, temperature responses and light signals are linked, and this emphasizes how complexly light, temperature, and clock genes interact to enable plants to synchronize their physiological activities with daily and seasonal cycles15.
Notably, key flowering time controlling genes, such as Flowering Wageningen (FWA)16,17 and Flowering Locus C (FLC)18–21 in Arabidopsis and CONSTANS-Like2 (GhCOL2) Gossypium hirsutum (Upland cotton)22, are epigenes or epialleles, and their expression is regulated by DNA methylation, chromatin modification, and/or RNA-mediated mechanisms. Moreover, circadian clock gene expression is epigenetically regulated to promote growth vigor and defense in plant hybrids and allopolyploids23–27. In this review, we outline current progress in understanding the role of circadian regulation in the control of day-length-dependent flowering time in the model plant Arabidopsis thaliana and crops. Understanding circadian and epigenetic regulation of flowering time is crucial for improving crop yield and growth resilience, which may help develop gene-editing and epigenetic engineering technologies to optimize flowering times and yield potential and stability to meet the growing demand for food, feed, fuel, and biomaterials.
Transcriptional architecture of the circadian clock in Arabidopsis
In addition to the negative interactions, multiple transcriptional activators are found to complement the plant circadian regulation. LIGHT-REGULATED WD1 (LWD1) and LWD2 are part of a larger family of proteins known as WD-repeat proteins, which are involved in transmitting light signals to the circadian clock to promote expression of CCA1, TOC1, PRR5, and PRR9 in the morning42–44. The midday-phased components REVEILLEs (RVE4, RVE6, and RVE8) are considered to be homologs of CCA1 and LHY due to their shared structural and functional similarities45. Interestingly, RVEs antagonize the action of CCA1 and LHY and bind to the same evening element (EE) during the midday period. This competition results in the activation of clock genes such as PRR9, PRR5, TOC1, GI, ELF4, and LUX. In turn, PRR5, PRR7, and PRR9 repress RVE8 expression, indicating that RVEs play an important role in maintaining the balance and robustness of the circadian regulation45,46. Two other activating components NIGHT LIGHT INDUCIBLE AND CLOCK-REGULATED1 (LNK1) and LNK2 are transcriptional coactivators of RVE8 to regulate expression of PRR5 and TOC1, both of which repress LNK1 and LNK2 expression in return, forming another loop47.

Feedforward and feedback loops of the circadian clock in Arabidopsis thaliana. The circadian clock inis regulated by a complex network of transcriptional feedback and feedforward loops that involve multiple clock genes. CCA1 and LHY are key transcriptional repressors that negatively regulate the expression of,, as well as their own transcription, and in turn, TOC1 and PRRs inhibit the expression ofand, creating a central feedback loop that helps regulate the clock and its output pathways. The evening complex (EC) formed by ELF3, ELF4 LUX, and GI, respectively, also regulatesandexpression. RVEs are transcriptional activators that directly bind to the promoters of evening-phased genes repressed by CCA1 and LHY and promote their expression, thereby rest the internal oscillator. LNK1 and LNK2 serve as transcriptional coactivators via interaction with RVE to regulate target genes. LWD1 and LWD2 are both involved in perceiving light input to activate the central clock genes, including,,and. White and gray shades indicate morning-phased (sun) and evening-phased (moon) events. Arrows and blunt ends show positive and negative regulation, respectively, while dashed boxes indicate proteins in a complex. For simplicity, protein and gene symbols are interchangeable in this and other figures. A. thaliana TOC1 PRRs CCA1 LHY CCA1 LHY CCA1 TOC1 PRR9, PRR5
Circadian-dependentaccumulation regulates photoperiodic flowering in Arabidopsis CO
In Arabidopsis, the transcription of CDFs is regulated by various clock components. Specifically, CDFs are induced by the morning clock genes CCA1/LHY and repressed by the evening clock genes PRR5, PRR7, and PRR932. This leads to a typical diurnal expression pattern of CDFs with a peak at dawn and this regulation is essential for plants to optimize their reproductive success by flowering at the appropriate time of year53–56. Loss-of-function mutations in CCA1 and LHY lead to early flowering 29, while constitutive expression of CCA1 leads to circadian arrhythmicity and late flowering 28. Additionally, the evening clock genes PRR9, PRR7, and PRR5 function antagonistically with CCA1/LHY and coordinately and positively regulate CO dependent photoperiodic flowering process53. CCA1/LHY also represses the transcription of CO by adjusting the rhythmic expression of the clock component GI and the F-box protein FKF1, and this leads to the appropriate timing of CO expression and photoperiodic flowering35.
The photoperiodic flowering pathway is regulated by the interaction of various proteins. GI together with the blue-light receptors ZTL, LKP2, and FKF1 regulates the degradation of the flowering repressor CDFs to control the clock-mediated photoperiodic flowering pathway 57. Under long-day conditions, the timing of GI and FKF1 expression is synchronized, peaking in the late afternoon, and the blue-light absorption enables FKF1 to form a sufficient protein complex with GI to target CDF proteins for degradation, which alleviates the transcriptional repression of the CO gene and promotes flowering50. In contrast, diurnal expression of FKF1 and GI proteins is out of phase during short days, leading to low levels of FKF1-GI complex and CO expression under light, and consequently, the flowering is delayed. A computational model predicts that FKF1 may control FT expression directly in addition to its role in CO transcriptional activation58. ZTL and LKP2 also play important roles in regulation of flowering time and the circadian clock by ubiquitin-mediated degradation of clock components. Introducing both ztl and lkp2 mutations into the fkf1 mutant further enhances the late-flowering phenotype of the fkf1 mutant and reduces the CO expression level, due to the increased abundance of CDF2 protein59. In contrast to FKF1, overexpression of ZTL or LKP2 leads to an unexpected late-flowering phenotype accompanied with low levels of CO expression, similar to the ztl fkf1 lkp2 triple mutant and the gi mutant60. One explanation for this late-flowering phenotype of these overexpressing lines is that a high level of the ZTL/LKP2 sequester FKF1 in the cytosol by forming the ZTL/LKP2-FKF1 complex. This reduces the amount of FKF1-GI-CDF1 complex in the nucleus, causing repression of CO expression61. Another possibility is that a high level of ZTL/LKP2 destabilizes core clock components TOC1 and PRR5, which interact with and stabilize CO protein to promote flowering in response to the day length40,41,55.

Photoperiodic flowering pathway in Arabidopsis. andare key regulators in the photoperiodic flowering pathway. CO is tightly regulated by the circadian clock and external light signals, and it activatesexpression to initiate flowering. The interaction between FKBP12 and CO protein prevents CO degradation and modulates its nuclear localization. In the early morning, CDF transcription factors that are regulated by the circadian clock accumulate at high levels and directly repressexpression, thereby delaying flowering. GI is another key player in regulating circadian outputs. Under long-day conditions, high levels of GI work together with the blue-light receptor FKF1 to facilitate the degradation of the flowering repressor CDFs, promoting the initiation of flowering. Conversely, the core clock component CCA1 promotes the expression ofby directly binding to their promoters, which enhances the inhibitory effect of CDFs onexpression, leading to delayed flowering. Additionally, PRRs can directly or indirectly regulateexpression, affecting flowering. The light-sensitive proteins CRYs, UVR8, PhyB, and PhyA are capable of sensing blue, UV-B, red, and far-red light, respectively, and modulating the activity of CO protein to regulateexpression and flowering. The line graphs at the bottom panel indicate mRNA oscillation of,, and, respectively. Arrows, lines, and shades are the same as in Fig.. CO FT FT CO CDFs CO CDF FT CO CO/FKBP12 FT 1
Light controls CO protein stability in Arabidopsis
Transcript levels of CO, CO/FKBP12, and FT, respectively, are diurnally oscillated (Fig. 2, bottom panel). At the post-transcriptional level, the stability of CO protein is compromised by the action of post-translational modification. The degradation of CO protein is mainly controlled by two RING-finger E3 ubiquitin ligases, HIGH EXPRESSION OF OSMOTICALLY RESPONSIVE GENE1 (HOS1)62 and CONSTITUTIVE PHOTOMORPHOGENIC 1 (COP1)63, which act in a light-dependent manner. HOS1 directly targets to CO protein to promote its degradation in the early morning, and mutation in HOS1 causes extreme early flowering in both long- and short-day conditions62. However, COP1 promotes CO degradation during night, leading to suppression of the floral integrator FT and delayed flowering64, and SUPPRESSOR OF PHYA (SPA1) can enhance the E3 ubiquitin ligase activity of COP165. In contrast, the small immunophilin FK506 BINDING PROTEIN 12 KD (FKBP12) physically interacts with CO to prevent its degradation by COP1, and this interaction can also modulate its phosphorylation and nuclear localization, enabling it to trigger FT expression and flowering66.
Photoreceptors also play critical roles in regulating CO stability and act antagonistically to generate daily rhythms in CO abundance14. The blue-light photoreceptors cryptochrome1 (CRY1) and CRY2 physically interact with SPA1 to suppress the activity of COP1/SPA1 ubiquitin ligase and stabilize CO protein in a blue-light-dependent manner67. The red/far-red responsive phytochromes (PhyA–PhyE in Arabidopsis) also control flowering time68. Among the five phytochromes, PhyA and PhyB play the most predominant function in CO protein stability8,69. Under the far-red light, PhyA stabilizes CO protein and promotes flowering by inhibiting the COP1-SPA1 complex, while red light-activated PhyB interacts with HOS1 to promote CO protein degradation14,62. The light-dependent control of CO protein stability is a complex and tightly regulated process that controls photoperiodic flowering. The knowledge gained from Arabidopsis research has provided fundamental insights into the mechanisms controlling photoperiodic flowering in other plant species including agricultural crops.
Photoperiodic flowering mechanism in rice
The function of core circadian regulators is conserved among Arabidopsis and crops. For example, the function of CCA1, LHY, and TOC1 is conserved in rice70,71, maize72, and duckweed73. Overexpressing maize ZmCCA1b and rice OsPRR homologs, respectively, can complement mutant phenotypes in Arabidopsis71,72. In rice, OsCCA1 upregulates expression of strigolactone receptor and signaling and responsive genes to repress tiller-bud outgrowth70. Down-regulating and overexpressing OsCCA1 increases and reduces tiller numbers, respectively, while altering OsPRR1 expression leads to the opposite effects. Moreover, both exogenous and endogenous sugars negatively regulate expression of OsCCA1, which in turn mediates expression of strigolactone receptor and signaling pathway genes to control tillering, flowering, and panicle development.
Unlike CO in Arabidopsis, the Hd1 gene in rice is regulated by the circadian component OsGI and displays bifunctional responses to day length: promoting flowering by activating the expression of Hd3a under short-day conditions, but repressing Hd3a expression to delay flowering under long-day conditions77,79. This photoperiod-dependent conversion of Hd1 function from activation to repression is modulated by phytochromes and circadian clock. Photoperiodic Sensitivity5 (SE5) encodes a putative heme oxygenase involved in the biosynthesis of the phytochrome chromophore. In the se5 mutant, Hd3a expression is strongly induced by Hd1 under long-day conditions, resulting in early flowering phenotype similar to phytochrome-deficient mutants, such as OsphyB and OsphyC80,81. Furthermore, overexpression of Hd1 can repress Hd3a expression and delay flowering under short-day conditions, and this effect requires functional phytochromes82. Therefore, phytochrome signals are essential for the conversion of Hd1 from an activator to a repressor of Hd3a transcription.
The rice-specific regulator Ghd7 encoding a CCT (CO, CO-LIKE, and TIMING OF CAB1) domain transcriptional regulator was identified as a major target of phytochrome signals in flowering control and acts as a repressor of Ehd178,83,84. Ehd1 is an important B-type response regulator that activates expression of Hd3a and RFT1, which are responsible for the transition from vegetative to reproductive growth in rice78,84. Ghd7 acting as a repressor of flowering is acutely induced by red light in morning through the PhyA homodimer or the PhyBPhyC heterodimer, and the repressor activity of Ghd7 is modulated post-transcriptionally by PhyB. Specifically, when PhyB is activated by red light, it undergoes a conformational change that allows it to interact with Ghd7 protein, and this interaction leads to the ubiquitination and subsequent degradation of Ghd7 through the 26 S proteasome pathway, which relieves its repressor activity and promotes flowering85. Days to heading8 (DTH8) encodes a putative HAP3 subunit of NF-YB transcription factor that is capable of binding to Hd3a promoter to enhance H3K27 trimethylation and repress Hd3a expression86,87. Under long-day conditions, both Ghd7 and DTH8 can interact with Hd1 to form a strong repressor complex that inhibits Ehd1 expression, and this effect of the Hd1-Ghd7-DTH8 complex on Ehd1 expression can also partly explain the photoperiod-dependent conversion of Hd1 from activation to repression86,88,89.
In sorghum, SbPRR37 activates expression of several downstream genes that repress flowering in long days90. SbPRR37 expression is dependent on light and regulated by the circadian clock. In short days, SbPRR37 is not expressed during the evening phase, allowing sorghum to flower. Similarly, clock components such as OsPRR37/DTH7 and the EC components, also regulate rice flowering by enhancing photoperiod sensitivity. OsPRR37 is predicted to act downstream of the OsPhyB to switch the genetic effects of Hd1 on Hd3a expression and delay flowering in long-day conditions through the formation of a transcriptional repressor complex91–93.
The EC is a crucial flowering repressor33. OsELF3-1 and OsELF3-2, two rice ELF3 paralogs, physically interact with OsELF4a in the nucleus, whilst OsELF3-1 shows a stronger interaction with OsLUX compared to OsELF3-294. Recent studies have shown that functional OsELF3 proteins accumulate during the evening in short-day conditions, forming the EC to induce flowering by inhibiting the expression of floral repressors OsPRR37 and Ghd793. However, in long-day conditions, OsELF3 protein levels are low and insufficient to suppress floral repressors, resulting in delayed flowering95. Taken together, these data indicate that rice possesses several complex genetic pathways to affect flowering time and photoperiod sensitivity.

Circadian and photoperiodic regulation of flowering pathways in rice. Rice has evolved two distinct photoperiodic flowering pathways to adapt varying short (SD) and long (LD) day lengths. The evolutionarily conserved-pathway is activated under SD conditions and the uniquely evolved--/pathway is activated under LD conditions, and they are regulated by both the circadian clock and light signaling. Hd1 plays a crucial role in regulating rice flowering time by interacting with the NF-YB transcription factor DTH8 in response to day length changes. Under SD conditions, Hd1 promotes the expression ofandto activate floral meristem identity genes. However, under LD conditions, Hd1 suppresses the expression ofandto prevent premature flowering, contributing to the photoperiodic adaptability of rice. Hd1 also interacts with the flowering repressor Ghd7 to inhibitexpression, thereby delaying flowering under LD conditions. OsGI triggers the initiation of the-pathway by activatingexpression and can enhanceandexpression by inhibiting Ghd7, leading to flowering. The clock genealso regulates photoperiodic flowering by activatingexpression under SD conditions and suppressing it under LD conditions, allowing rice plants to fine-tune flowering timing for optimal reproductive success under changing environmental conditions. Sugar suppresses expression of, which in turn controls strigolactone pathway genes to regulate tiller buds and flowering.are floral identity genes. Arrows, lines, and shades are the same as in Fig.. Hd1 Hd3a Ghd7 Ehd1 Hd3a RFT1 Hd3a Ehd1 Hd3a Ehd1 Ehd1 Hd1 Hd3a Hd1 Hd3a Ehd1 OsPRR37 Hd3a OsCCA1 MADS14/15 1
Photoperiodic flowering mechanism in soybean
The flowering repressor gene E1 encodes a legume-specific transcription factor that induces GmFT1a and GmFT4a expression but represses GmFT2a and GmFT5a expression. This gene is directly repressed by four soybean CCA1/LHY genes (GmLHY1a, GmLHY1b, GmLHY2a and GmLHY2b) at the transcriptional levels98,99,105. Indeed, the soybean quadruple knockout mutant lhy1a lhy1b lhy2a lhy2b generated by CRISPR-Cas9 displayed late flowering and high E1 transcript levels in long days105. Meanwhile, GmPRR3a and GmPRR3b, orthologs of Arabidopsis PRR3, induce E1 expression to delay photoperiodic flowering through downregulation of GmLHY genes105,106. This pathway provides an additional layer of regulation for soybean flowering. The E2 locus in soybean encodes a homolog (GmGIa) of the Arabidopsis GI, which acts upstream of CO and FT in the photoperiodic flowering pathway. Like the role of GI in Arabidopsis, GmGIa plays a crucial role in integrating light and circadian signals in response to photoperiodic changes, ultimately leading to the transition from vegetative growth to reproductive development101. GmGIb and GmGIc are two homologs of GmGIa and can interact with two Arabidopsis FKF1 orthologs (GmFKF1 and GmFKF2) and one Arabidopsis CDF1 ortholog (GmCDF1) proteins, but GmGIa cannot interact with these three proteins, suggesting that GmGIa may play a unique role in regulating soybean flowering107. GmELF3, an ortholog of clock component ELF3, physically interacts with GmLUX2 to directly repress E1 expression and promotes soybean flowering108.
In addition to these genes, two maturity loci, E3 and E4, encode the phytochrome A proteins GmPhyA3 and GmPhyA2, respectively. These proteins induce expression of E1 and which, in turn, suppresses expression of GmFT2a and GmFT5a, resulting in delayed flowering under both natural day length and artificially long-day conditions109,110. The PhyA-regulated E1-GmFT pathway is a key determinant for soybean adaption to different latitude environments. The blue-light photoreceptor protein cryptochromes also play an essential role in regulating circadian rhythm and flowering time. Soybean contains two cryptochrome genes, GmCRY1a and GmCRY2a. However, only GmCRY1a has been found to strongly promote floral initiation in soybean, and the circadian rhythm of GmCRY1a protein has different phase characteristics in different photoperiods, which suggests that GmCRY1a plays a more predominant role in regulating flowering time in response to changes in day length111. The role of GmCRY2a in soybean flowering remains unclear and to be further investigated to elucidate its function.

Genetic control of photoperiodic flowering in soybean. Soybean has multiple-like genes, and GmFT2a and GmFT5a promote flower formation by enhancing the expression of flower meristem identity genes, including,, and. Conversely, GmFT1a and GmFT4a inhibit flower formation by repressing these genes. The central regulator E1 inhibits soybean flowering by downregulating expression ofandand upregulating expression ofand. Under LD conditions,expression is strongly induced by the red/far-red light receptors GmPhyA2 and GmPhyA3, leading to delayed flowering. These photoreceptors can also transmit light signals to circadian clock factor homologs such as GmGIa, GmPRR3a/b, GmELF3, and GmLUX. These factors mediate expression of, leading to distinct expression rhythms under SD and LD conditions, affecting flowering time in soybean plants.and/may form-regulons and play an important role in photoperiodic flowering in soybean. These models are based on genetic and gene expression results and may be refined by biochemical studies. Arrows, lines, and shades are the same as in Fig., while dashed arrows indicate predicted interactions. FT GmSOC1 GmAP1 GmLFY GmFT2a GmFT5a GmFT1a GmFT4a E1 GmLHYs GmCOLs GmFT2a 5a CO FT 1
Epistasis and epigenetic regulation of photoperiodic flowering in Arabidopsis and crops
Epigenetic regulation of circadian and flowering-time pathway genes plays a broader role in determining photoperiodic flowering in plants. Epigenetics refers to heritable changes in gene expression that do not involve alterations to the DNA sequence. Epigenetic mechanisms, involving DNA methylation, chromatin modifications, long-noncoding RNAs, and small interferring RNAs, have been shown to regulate expression of key photoperiodic flowering pathway genes and flowering time.
One of the best-studied epigenetic flowering events is known as vernalization, which refers to the induction of flowering after prolonged exposure to cold temperatures in winter, mediated by changes in the chromatin status of FLC112. COLD INDUCED LONG ANTISENSE INTRAGENIC RNA (COOLAIR)113 and COLD ASSISTED INTRONIC NONCODING RNA (COLDAIR)114 originated from the first intron and 3’ end of FLC, respectively. They have been proposed to facilitate FLC silencing by removing H3K36me3 from chromatin during vernalization, particularly under low-temperature conditions113,114. However, recent research suggests that the COOLAIR is not required for vernalization. Support for this notion comes from the normal vernalization response observed in the CRT/DRE-binding factors (CBFs) cbfs mutants with reduced levels of COOLAIR induction; moreover, both cbfs and FLCΔCOOLAIR mutants show a normal vernalization response despite their inability to activate COOLAIR expression during cold115. This work also highlights the importance of CBFs in initiating COOLAIR expression during the early stage of vernalization. CBFs bind to CRT/DREs at the 3’-end of FLC and increase progressively during vernalization, promoting COOLAIR expression. As vernalization progresses, FLC chromatin shifts to an inactive state, prompting CBF proteins to detach from the CRT/DREs in the COOLAIR promoter, thereby diminishing COOLAIR levels.
Epialleles or epigenes can also result from cross-fertilization such as imprinting117,118 and polyploidy 25,26. In Arabidopsis intraspecific hybrids119,120 and allotetraploids121, expression waveforms of circadian clock genes such as CCA1 and LHY are altered by epigenetic and chromatin modifications to increase photosynthesis, chlorophyll biosynthesis, and starch metabolism121. The more starch accumulates during the day, the more it can be degraded at night to promote growth vigor, in a widespread phenomenon known as hybrid vigor or heterosis24,27. Stress-responsive gene expression is gated by the circadian clock122. Altered circadian gene expression in the hybrids also regulates expression of abiotic and biotic stress-responsive genes as a trade-off to balance the energy used for growth and defense119, as well as mediates ethylene biosynthesis that in turn regulates growth vigor123. Mechanistically, CCA1 expression changes are related to parent-of-origin effect on DNA methylation120, suggesting an epigenetic cause. Further analysis showed that circadian clock genes mediate diurnal regulation of histone H3K4 methylation reader and eraser genes, which in turn regulate rhythmic histone modification dynamics for the clock and its output genes124. This reciprocal regulatory module between chromatin modifiers and circadian clock oscillators orchestrates diurnal gene expression that governs plant growth and development.
All flowering plants are polyploids or of polyploid origin125,126, which result in genetic and epigenetic changes25,26. Duplication of flowering pathway genes in polyploids may increase the range of flowering time variation or induce other changes including epigenetic modifications and epistasis or trans-acting effects25,127,128. Natural variation of flowering time in Arabidopsis suecica allotetraploids is largely controlled by two epistatically acting loci, namely FRIGIDA (FRI) and FLC129,130. FRI upregulates FLC expression that represses flowering131. In Arabidopsis allotetraploids that are formed by pollinating A. arenosa with A. thaliana132,133, there are two sets of FLC and FRI genes and they flower late128. Inhibition of early flowering is caused by upregulation of A. thaliana FLC (AtFLC) that is trans-activated by A. arenosa FRI (AaFRI) (Fig. 5b). Two duplicate FLCs (AaFLC1 and AaFLC2) originating from A. arenosa are expressed in some allotetraploids but silenced in other lines. The expression variation of FLCs in the allotetraploids is associated with deletions in the promoter regions and first introns of A. arenosa FLCs. The strong AtFLC and AaFLC loci are maintained in natural Arabidopsis allotetraploids, leading to extremely late flowering128. Furthermore, FLC expression correlates positively with histone H3K4me2 and H3K9ac marks and negatively with the H3K9me2 mark. This combination of epistasis between loci and chromatin regulation within a locus may provide a complexity of flowering time variation in polyploid plants in response to environmental cues and adaptive niches.
Cotton allotetraploids were formed ~1–1.5 million years ago (Mya)134 (Fig. 5c), followed by natural diversification and crop domestication135. Polyploidization between an A-genome African species (Gossypium arboreum-like) and a D-genome American species (G. raimondii-like) in the New World created a new allotetraploid or amphidiploid (AD-genome) cotton clade134, which has diversified into five polyploid lineages, Gossypium hirsutum (Gh) (AD)1, G. barbadense (Gb) (AD)2, G. tomentosum (AD)3, G. mustelinum (AD)4, and G. darwinii (AD)5. Gossypium ekmanianum and G. stephensii are recently characterized and closely related to G. hirsutum136. Gh and Gb were separately domesticated from perennial shrubs to become annualized Upland and Pima cottons135.
Interspecific hybridization induces genome shock, including nonadditive gene expression and epigenetic changes in polyploid plants and crops133,137. The epigenetic changes produce epigenes or epialleles, which are selected during evolution and domestication. In allotetraploid cotton, over 500 epigenes have been identified and maintained over 600,000 years of genomic diversification, some of which are predicted to originate during polyploidization over 1–1.5 million years ago22. Many of these epigenes are related to domestication traits including flowering time, stress response, seed development, and seed dormancy.
Cotton, being originated in tropical and subtropical areas, flowers in short-day conditions138. Loss of photoperiod sensitivity is a major “domestication syndrome” trait139 of Upland or American cotton (G. hirsutum L.) and Pima or Egyptian cotton (G. barbadense L.) that accounts for >95% and ~5% of annual cotton crop worldwide, respectively140. This process is associated with cotton CO-liked (COL) genes, which are Arabidopsis CO homolog141. Among 23 COLs identified in cotton142, eight G. hirsutum COLs (GhCOLs) are in the same subgroup. Among them, only GhCOL2 exhibited similar expression rhythms with GhFT, indicating that GhCOL2 is a major regulator of GhFT.
Allotetraploid cotton has two COL2 homoeologs, GhCOL2A and GhCOL2D (Fig. 5c). COL2D is heavily methylated and silenced in G. raimondii that is photoperiod sensitive, while COL2A is hypomethylated and highly expressed in cultivated G. arboreum, which is photoperiod insensitive138. In allotetraploid cotton, the COL2A homoeolog was hypermethylated and repressed in cultivated Upland and Pima cottons, while the COL2D homoeolog was highly expressed. This suggests that COL2A in the allotetraploid cottons is likely silenced after polyploid formation since it is expressed in the extant G. arboreum species. The high-level expression of COL2D is likely associated with positive selection and loss of DNA methylation of COL2D during domestication of Upland and Pima cottons142, which could lead to the loss of photoperiod sensitivity for global cotton cultivation. Indeed, methylation levels of COL2D are lower and their expression levels are higher in cultivated and photoperiod-insensitive G. hirsutum and G. barbadense than in their photoperiod-sensitive wild relatives. Removal of DNA methylation in the wild G. hirsutum (TX2095) seedlings using 5’-aza-2’-deoxycytidine (5-aza-dC), a chemical inhibitor for DNA methylation143, activates COL2D expression. Moreover, using virus induced gene silencing (VIGS)144, GhCOL2 is down-regulated, which is consistent with the repression of GhFT. Down-regulating COL2 and GhFT expression has delayed flowering time for 9 days, compared to the control plants and wild cottons that do not flower in the LD conditions138. This example has demonstrated important roles of epigenes and epialleles in photoperiodic flowering during natural selection and crop domestication. These epigenes and epialleles can be candidate targets for gene-editing and biological breeding to improve crop production and resilience.

Epigenetic and epistatic regulation of flowering time in Arabidopsis and polyploid plants. FWA and FLC proteins inhibit flowering pathway genes by binding to their promoters or proteins. In the wild-type,locus was methylated to suppress its activity. In themutant or epiallele induced by,is demethylated to activate its expression, which represses floral pathway genes such asandto inhibit flowering. Vernalization can induce long-noncoding RNAs like COOLAIR or COLDAIR in the first intron oflocus and H3K27me3 through polycomb-group2 (PRC2) protein complex, which coincides with repression ofto promote flowering.Inallotetraploids that were formed fromand,is silenced, whileis expressed.-activesexpression, consistent with a wide range of flowering phenotypes in resynthesized and natural.In allotetraploid Upland cotton (, AADD),() has two homoeologsand.promoter is heavily methylated and silenced, whileis a reversible epiallele, which is methylated and repressed in wild relatives ofbut demethylated and expressed in cultivated cotton to promoteexpression and photoperiodic flowering. This may contribute to worldwide cultivation of Upland cotton. Photos are not in scale. a b c FWA fwa DDM1 FWA FT FLC FLC FLC Arabidopsis suecica A. thaliana A. arenosa AtFRI AaFRI AaFRI trans AtFLC A. suecica Gossypium hirsutum CONSTANS-LIKE2 COL2 GhCOL2A GhCOL2D GhCOL2A GhCOLD2 G. hirsutum GhFT 2 2 5 5
Concluding remarks
Growth, development, and reproduction during appropriate times of the year is essential for plants to adapt to seasonal changes. The photoperiodic response to flowering is a key mechanism for plants to optimize utilization of natural resources, adapt crop varieties in regional environments, and significantly impact crop yield stability and potential. Insights gained from studying the interplay between light signaling, the circadian clock, and photoperiodism in the model plant Arabidopsis have facilitated the discovery of complex molecular circuitry networks underlying photoperiodic flowering pathways in both short- and long-day crops, many of which are polyploids and have genomic redundancy that is subject to epigenetic modifications. Indeed, some key regulators, including circadian clock genes, are epigenetically regulated, presumably resulting from natural selection, crop domestication, and/or modern breeding.
Genome sequencing and genome-wide association studies in crop plants have supported that allele variants of several key circadian genes have been selected by breeding and under agricultural culture of adapting crop flowering time to certain seasons. While most current studies are focused on light and circadian control of photoperiodic flowering, temperature changes can drastically affect plant flowering and have been grossly under-investigated. Further investigations are needed to maintain agricultural and ecological sustainability in response to changes in extreme weather and climate such as drought, heat, and flood.
There are good examples of the genes in the circadian clock and photoperiodic flowering pathways that are associated with agronomic traits in crops. Rice Hd177, an ortholog of the Arabidopsis CO, controls expression of two florigen genes Hd3a and RFT174–76 during the floral transition. ZmCCT9 in maize is diurnally regulated and negatively regulates the expression of the florigen ZCN8 through a CACTA-like TE insertion in ZmCCT promoter region145, resulting in reduction of photoperiod sensitivity, thus accelerating maize spread to long-day environments146. The circadian clock of cultivated tomatoes has slowed during domestication. EMPFINDLICHER IM DUNKELROTEN LICHT1 (EID1) is an F-box protein that targets phytochrome A for degradation in Arabidopsis147. The EID1 allele in cultivated tomatoes is under selection sweep and enhances plant performance under long-day photoperiods presumably resulting from moving away from the equator148. Fruit yield in hybrid tomato is related to SINGLE FLOWER TRUSS (SFT)149, a homolog of FT in Arabidopsis. Moreover, epigenetic alteration of circadian clock genes including CCA1 and LHY in plant hybrids are related to growth vigor in Arabidopsis119–121 and maize72,150. In cotton, GhCOL2 is an epiallele, which may be selected during domestication to increase its expression and thereby reduce photoperiod sensitivity, helping spread worldwide cotton cultivation22. These examples will help us design strategies to optimize circadian input and output signals and improve flowering time and crop yield using genome-editing tools. Likewise, discovery and utilization of new epigenes and epialleles in combination of targeted-gene editing will improve flowering time and crop yield and resilience.
Supplementary information
Peer review file