Login / Signup

Boosting tissue-specific prediction of active cis-regulatory regions through deep learning and Bayesian optimization techniques.

Luca CappellettiAlessandro PetriniJessica GliozzoElena CasiraghiMax SchubachMartin KircherGiorgio Valentini
Published in: BMC bioinformatics (2022)
Results show that (1) automatic model selection by Bayesian optimization improves the quality of the learner; (2) data rebalancing considerably impacts the prediction performance of the models; test set rebalancing may provide over-optimistic results, and should therefore be cautiously applied; (3) despite working on sequence data, convolutional models obtain performance close to those of feed forward models working on epigenomic information, which suggests that also sequence data carries informative content for CRR-activity prediction. We therefore suggest combining both models/data types in future works.
Keyphrases
  • electronic health record
  • deep learning
  • big data
  • machine learning
  • artificial intelligence
  • data analysis
  • healthcare