Bandgap prediction of non-metallic crystals through machine learning approach.
Sadhana BarmanHarkishan DuaUtpal SarkarPublished in: Journal of physics. Condensed matter : an Institute of Physics journal (2024)
The determination of bandgap is the heart of electronic structure of any material and is a crucial factor for thermoelectric performance of it. Due to large amount to data (features) that are related to bandgap are now a days available, it is possible to make use of machine learning approach to predict the bandgap of the material. The study commences by selecting the feature through Pearson correlation study between bandgap and various thermoelectric parameters in non-metallic crystals. Among the forty two parameters available in the dataset, the seebeck coefficient and its corresponding temperatures show high correlation with the bandgap. With these three selected features we have used different machine learning models like multilinear regression, polynomial regression, random forest regression, support vector regression to predict the bandgap. Amongst the different machine learning models considered, random forest regression outperforms the other models to predict the bandgap with R2 value of 97.55 % between actual bandgap and predicted bandgap.