Prediction of mammalian tissue-specific CLOCK–BMAL1 binding to E-box DNA motifs

May 12, 2023Scientific reports

Predicting where the CLOCK-BMAL1 proteins bind to DNA in different mammal tissues

AI simplified

Abstract

, local DNA shape, and flanking sequences are sufficient predictive features for BMAL1-DNA binding.

  • BMAL1 forms a complex with CLOCK or NPAS2 to regulate circadian clock genes.
  • The dimer binds to specific E-box gene regulatory elements on DNA.
  • Identifying transcription factor binding sites presents significant challenges due to multiple binding motifs.
  • Three machine learning models incorporating DNA sequence, shape, and histone modifications were developed to predict BMAL1 binding.
  • The models revealed insights into the mechanisms of BMAL1-DNA binding and its tissue specificity.

AI simplified

Key numbers

398
Common E-boxes
E-boxes bound by BMAL1 across liver, kidney, and heart tissues.
0.99
Mean AUROC score
Performance of the sequence plus shape plus histone modification model.
50%
Feature importance
Contribution of the second nucleotide upstream of the E-box to binding prediction in the liver.

Full Text

What this is

  • This research investigates the binding of the BMAL1 transcription factor to E-box DNA motifs across different tissues in mice.
  • Using machine learning models, the study identifies key genomic and epigenomic features that predict BMAL1 binding.
  • The findings reveal that DNA shape, , and local sequence context significantly influence BMAL1's binding specificity.

Essence

  • BMAL1 binding to E-box motifs is predicted by DNA shape, , and flanking sequences. The study demonstrates significant tissue-specific differences in binding patterns.

Key takeaways

  • BMAL1 binds most frequently to the CACGTG across tissues, with only 398 E-boxes common among liver, kidney, and heart. This indicates a strong tissue specificity in BMAL1 binding.
  • Incorporating DNA shape and histone modification features into predictive models significantly improves accuracy, with mean AUROC scores of 0.99 for liver, kidney, and heart using the sequence plus shape plus histone modification model.
  • The second nucleotide upstream of the E-box is a strong predictor of BMAL1 binding in the liver, contributing over 50% to the model's predictive power.

Caveats

  • The study only considers E-boxes in accessible chromatin, potentially overlooking binding events in inaccessible regions, which may limit the understanding of BMAL1's binding dynamics.
  • Models trained on one tissue performed poorly when predicting binding in another, highlighting the complexity and tissue specificity of BMAL1 binding.

Definitions

  • E-box motif: A specific DNA sequence (CANNTG) that transcription factors like BMAL1 bind to for gene regulation.
  • histone modifications: Chemical modifications to histone proteins that influence gene expression by altering chromatin structure.

AI simplified

what lands in your inbox each week:

  • 📚7 fresh studies
  • 📝plain-language summaries
  • direct links to original studies
  • 🏅top journal indicators
  • 📅weekly delivery
  • 🧘‍♂️always free