Issue | BIO Web Conf., Volume 109, 2024, Conference on Water, Agriculture, Environment and Energy (WA2EN2023)
---|---
Article Number | 01037
Number of page(s) | 10
DOI | https://doi.org/10.1051/bioconf/202410901037
Published online | 20 May 2024
Machine Learning-Powered Prediction of molecule Solubility: Paving the Way for environmental, and energy applications
1 Mohammed V University in Rabat, ENSIAS, SSLAB, Rabat, Morocco
2 Natural Resources and Sustainable Development laboratory, Faculty of Sciences, Ibn Tofail University, B.P 242, Kenitra, Morocco
* Corresponding author: yassine.mouniane@uit.ac.ma
Predicting aqueous solubility is pivotal for selecting materials in the pharmaceutical, environmental, and renewable energy fields. For instance, it plays a vital role in drug development and in the design of chemical and synthetic routes. In cheminformatics, accurate prediction of molecular solubility is indispensable for drug discovery and development. Traditional methods often rely on labor-intensive experimental assays, which are costly and time-consuming. To address these limitations, this study uses machine learning techniques to predict molecular solubility with high accuracy. Using the PyCaret library, a versatile low-code machine learning tool, we develop and evaluate a diverse set of linear regression models. Model performance is assessed with six metrics: R², RMSLE, MAE, MSE, MAPE, and RMSE. Through model comparison and evaluation on these metrics, we identify the best-performing model for predicting molecular solubility. Our findings demonstrate the efficacy of machine learning in cheminformatics and offer insights into the complex relationship between molecular features and solubility. This study contributes to computational chemistry by bridging the gap between theory and practice. By clarifying the predictive capabilities of machine learning models, we pave the way for more efficient and cost-effective drug discovery processes.
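As an illustration of the six metrics named in the abstract, the following minimal sketch computes them from paired measured and predicted values using their standard textbook definitions. It is not the authors' code and does not use PyCaret. The function name `regression_metrics` and the sample values are hypothetical, and the RMSLE and MAPE terms assume strictly positive measured values, which may not hold for solubility expressed on a logarithmic scale.

```python
import math


def regression_metrics(y_true, y_pred):
    """Return MAE, MSE, RMSE, R2, RMSLE, and MAPE for paired values.

    Assumptions (not taken from the article): inputs are equal-length,
    non-empty sequences; y_true is not constant (needed for R2); and all
    values are positive (needed for RMSLE and MAPE).
    """
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")

    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]

    mae = sum(abs(e) for e in errors) / n          # mean absolute error
    mse = sum(e * e for e in errors) / n           # mean squared error
    rmse = math.sqrt(mse)                          # root mean squared error

    # R2 = 1 - (residual sum of squares / total sum of squares)
    mean_true = sum(y_true) / n
    ss_res = sum(e * e for e in errors)
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    r2 = 1.0 - ss_res / ss_tot

    # RMSLE compares log(1 + value) of measured and predicted values
    log_sq = [(math.log1p(t) - math.log1p(p)) ** 2
              for t, p in zip(y_true, y_pred)]
    rmsle = math.sqrt(sum(log_sq) / n)

    # MAPE is returned as a fraction, not a percentage
    mape = sum(abs(e / t) for e, t in zip(errors, y_true)) / n

    return {"MAE": mae, "MSE": mse, "RMSE": rmse,
            "R2": r2, "RMSLE": rmsle, "MAPE": mape}


# Illustrative values only; they are not data from the study.
measured = [1.0, 2.0, 3.0, 4.0]
predicted = [1.5, 2.0, 2.5, 4.0]
metrics = regression_metrics(measured, predicted)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

Comparing candidate models then amounts to computing this dictionary for each model on the same held-out data and ranking by the metric of interest, for example highest R² or lowest RMSE.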
© The Authors, published by EDP Sciences, 2024
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.