Machine learning approaches for sex estimation using cranial measurements.
Diana H TonevaSilviya Y NikolovaGennady AgreDora K ZlatarevaVassil HadjidekovNikolai E LazarovPublished in: International journal of legal medicine (2020)
The aim of the present study is to apply support vector machines (SVM) and artificial neural network (ANN) as sex classifiers and to generate useful classification models for sex estimation based on cranial measurements. Besides, the performance of the generated sub-symbolic machine learning models is compared with models developed through logistic regression (LR). The study was carried out on computed tomography images of 393 Bulgarian adults (169 males and 224 females). The three-dimensional coordinates of 47 landmarks were acquired and used for calculation of the cranial measurements. A total of 64 measurements (linear distances, angles, triangle areas and heights) and 22 indices were calculated. Two datasets were assembled including the linear measurements only and all measurements and index, respectively. An additional third dataset comprising all possible interlandmark distances between the landmarks was constructed. Two machine learning algorithms-SVM and ANN and a traditional statistical analysis LR-were applied to generate models for sex estimation. In addition, two advanced attribute selection techniques (Weka BestFirst and Weka GeneticSearch) were used. The classification accuracy of the models was evaluated by means of 10 × 10-fold cross-validation procedure. All three methods achieved accuracy results higher than 95%. The best accuracy (96.1 ± 0.5%) was obtained by SVM and it was statistically significantly higher than the best results achieved by ANN and LR. SVM and ANN reached higher accuracy by training on the full datasets than the selection datasets, except for the sample described by the interlandmark distances, where the reduction of attributes by the GeneticSearch algorithm improved the accuracy.