The global transition to personalized medicine has been accelerated by the advancements in high-throughput technologies in biology and increasing computational power and storage capacities. Accompanying the diversity and growing volume of the complex biological datasets, called “–omics” data (e.g., genomics and proteomics), revolutionized our understanding and our way of interpreting disease and health states, although they generate new challenges.
To reveal the complex and hidden patterns in omics data, machine learning (ML) showed promising results and brought new opportunities for transforming scientific discovery today. Popular packages such as scikit-learn or XGBoost enable predictive data analysis. However, the researchers still require programming skills to write their own ML pipelines which are not always easy to follow by non-specialists due to lacking domain knowledge and a graphical interface.
Furthermore, several parameters can be changed to tune the algorithms, which might show differences from version to version, resulting in reproducibility issues. To reproduce published results, the same software environment needs to be set up and configured with the matching package versions and algorithm parameters.
Additionally, omics sciences and ML require special domain knowledge since metrics can be deceiving and algorithms might need extra preselection or preprocessing steps.
Thus, transparent and open-source software is highly valuable for open and reproducible science. To address all the issues and to enable researchers to access state-of-the-art ML algorithms without requiring any prior bioinformatics and programming knowledge, we introduce OmicLearn (OmicLearn.org), a ready-to-use, open-source, web-based, ML platform specifically developed for omics datasets.
In this talk, you will see how to build a machine learning platform from open-source tools and how to apply state-of-the-art algorithms to omics datasets in a standardized format.
What do you need to know to enjoy this talk
Python level
Medium knowledge: You use frameworks and third-party libraries.
About the topic
You used or did it just a few times.