The image is being processed, do not close the tab or the browser until it is finished, thanks.

Legal age estimation using orthopantomographies and artificial intelligence

THE PROBLEM

Legal age estimation is the great challenge par excellence of the human identification process in the fields of forensic anthropology and forensic medicine. This is due to the associated social implications and the problems it aims to solve: contributing to the identification of presumed undocumented migrant minors without family references who often seek asylum in the country of destination (13600 in the EU in 2020), girls and adolescents forced into marriage (12 million girls under 18 married), or human trafficking and sexual exploitation (affecting more than 40 million people, 75% of whom are between 13 and 25 years old). For this reason, the development of accurate, objective, efficient and robust methods for legal age estimation is essential. This would make it possible to provide the appropriate information to the courts of justice in charge of these cases. Different protocols and best practice guidelines have been proposed in the EU to solve this problem, the most detailed one being the protocol of the German Group for Forensic Age Estimation of the University of Münster (AGFAD), which proposes the application of medical evidence in case of doubts about the age of the minor. Firstly, the use of an x-ray of the carpal bones of the non-dominant hand is recommended, followed by a panoramic radiograph, also called orthopantomography (OPT), which is the focus of the present study, and a CT scan of the clavicle in case there is still uncertainty about the estimated age.

METHODS EMPLOYED

The analysis of teeth, and more specifically of the third molar, to estimate legal age, is a widely studied and validated methodology in the field of forensic anthropology, where different techniques are used: Demirjian [1], Cameriere [2], Mincer et al. [3] and Uys et al. [4]. Although these methods are accurate (using the third molar, forensic anthropologists obtain accuracies ranging from 80 to 90%), they are based on a visual analysis of the teeth and are therefore subjective and dependent on the experience of the observer analyzing the images. As an alternative, Artificial Intelligence (AI) allows the development of methods with fixed error metrics, thus being more objective. In addition, advances in recent years have allowed classification methods (in this case age estimation) using images and convolutional neural networks (CNNs) to greatly increase their accuracy. This, added to the fact that, once trained, an AI would be able to estimate the age of an individual in less than a second, allows the development of accurate, objective and efficient methods. These methods can estimate age in an automatic manner, or better yet, serve as a support system for the expert, reducing errors and estimation time.


With respect to robustness, it depends on the data used to train the model. If said data is sufficient (in quantity and variety), the method should be robust enough to estimate the age of any individual. In our case, we have used 10,739 OPTs of individuals between 14 and 26 years old and from different countries (Chile, Bosnia, Lebanon, Uganda, Turkey, Botswana, Dominican Republic, Italy, Kosovo, Malaysia, Japan and Korea). Moreover, to validate that the model is robust, we have used it to estimate age in four samples from geographical origins not used in training (Russia, Egypt, Ethiopia and Australia). Using this data, we performed several experiments to test which is the best approach to solve the problem, proving, among other things, that using the whole OPT is better than using only the left mandibular third molar. The results of our best model are shown numerically in Table 1 and graphically in Figure 1.

MetricTrainValTestRussiaEthiopiaEgyptAustralia
MAE (years)0.251.071.121.211.451.361.51
Accuracy
(< o ≥ 18)97.33%89.79%88.38%92.04%81.97%85.03%86.08%
Table 1: Results of our best model, which solves a regression problem using the full OPT as input to a convolutional neural network (CNN). The table shows the mean absolute error (MAE) when estimating age in years and the percentage of individuals correctly classified as a minor or an adult (accuracy). The Train, Val and Test columns represent the model training, validation and test errors, respectively. The other four columns show the error on data from populations never seen by the model.
Figure 1: Regression results. It can be seen that the error is higher for older individuals, increasing from the age of 22 onwards.

The obtained results, with a MAE of 1.12 years and an accuracy of 88.38% in test, aresimilar to that provided by several studies using AI for estimating legal age from orthopantomographies. However, our method has a great advantage over existing methods: it has been trained with data from multiple samples, which improves generalization. Methods trained with a single sample are limited to it and have problems to generalize, i.e., to work outside the sample with which they have been trained.


The results obtained by the model are also similar to those obtained by human experts. Using Cameriere’s method [2] , different accuracies have been obtained: 91.4% by De Luca et al.[5] (between 13 and 22 years), 88% by Melo et al. [6] (between 10 and 26 years), 90.7% by Albernaz-Neves et al. [7] (between 12 and 24 years) and 80.3% by Angelakopoulos et al. [8], using 10181 OPTs from 4 continents (being the most similar sample to ours) in an age range of 13 to 24 years. De Micco et al. [9] analyzed several age estimation papers using this method and the accuracy varies between 74 and 95%. Using Demirjian’s method [1], Melo et al. [6] obtained an accuracy of 93% in a sample of individuals between the ages of 10 and 26 years. Also regarding the comparison with human experts, we have been able to verify not only that they and the model obtain similar results, but also a more specific behavior. In this sense, for individuals where the AI makes large errors, a human expert does as well. Some examples of this are shown in Table 2. 

OriginSexEstimation using Uys et al. 2018Estimation using Mincer et al. 1993Our AI estimationReal age
BosniaF15 < 16.21 < 18.5216 ± 1.6415.3124.39
BotswanaF16.84 < 22.27 < 25.9920.9 ± 2.0121.5715.11
KosovoF16.17 < 18.81 < 24.0219.1 ± 2.1821.2514.92
ItalyF16.17 < 18.81 < 24.0219.1 ± 2.1820.9016.12
LebanonM15.17 < 16.26 < 19.6117.3 ± 2.4717.2521.31
Dominican
Republic
M17.26 < 21.97 < 24.9020.5 ± 1.9720.8116.15
Table 2: Comparison of ages estimated by the AI and by a human expert when large errors are made.

Regarding how the model makes these estimates, i.e., what it is observing to estimate the age of an individual, we have obtained activation maps using the Integrated Gradients algorithm [10]. These activation maps are shown in Figure 2, where it can be seen that the estimation is done by observing, mainly, several of the third molars and, occasionally, also the second molars, mainly the apices still in the process of root closure. This is in perfect agreement with expert knowledge regarding the estimation of legal age from the teeth.

19.5 years estimated as 19.6 years
18.4 years estimated as 18.9 years
15.1 years estimated as 16.2 years
14.5 years estimated as 14.7 years
Figure 2: Activation maps that show which regions of the image are more important for the AI when estimating age.

Although the results are good and the image regions observed by the network are in agreement with expert knowledge, we consider that there is still an important problem to be solved: the ethical problem of estimating that a minor is an adult. In this sense, we should penalize this more than the opposite, i.e., estimating that a minor is an adult is much worse than doing the opposite. Our best model, which we have outlined in the previous paragraphs, has the confusion matrix shown in Table 3. As can be seen, the number of minors classified as adults is greater than the number of adults classified as minors. We seek just the opposite. For this we have two alternatives: to estimate a prediction interval from the predicted age and use the lower point of the interval as the estimate or to bias what our model learns (penalize the loss function), so that the AI learns by itself that overestimating age is much worse than underestimating it. The results of both alternatives are compared with the original model in Tables 3, 4, 5 and 6 (in green the number of cases where the estimation is right (true positives and negatives) and in red the number of cases where the model is wrong (false positives and negatives)).

Real\Est<18≥18
<18703150
≥181001199
Table 3: Confusion matrix of the original model.
Real\Est<18≥18
<1880548
≥182731026
Table 4: Confusion matrix of the model estimating the infimum of a prediction interval.
Real\Est<18≥18
<1882726
≥18514785
Table 5: Confusion matrix when penalizing overestimates by 10 times.
Real\Est<18≥18
<1884112
≥18596703
Table 6: Confusion matrix when penalizing overestimates by 40 times.

When using the initially proposed model and estimating the minimum of the obtained age interval, out of a total of 2,152 cases only 48 subjects are classified as adults when they are, in fact, minors (Table 4). This number drops to only 12 cases (Table 6) when we use a biased model to penalize overestimates more than underestimates. As a trade-off, approaches that reduce the number of minors classified as adults have a lower overall accuracy.

AI-BASED AGE ESTIMATION

The result of our research is a system capable of estimating age in an automatic, robust, objective and accurate manner, with an MAE of 1.12 years. Moreover, having obtained prediction intervals would allow it to be used in the age estimation protocols established at the European level. The improvement of explainability and interpretability beyond the activation maps that have been shown is, together with the development of techniques using other anatomical regions, one of our main interests in the future, as it would help in decision making by courts of law.

You can test the algorithm completely free of charge with our tool

REFERENCES

[1] A. Demirjian, H. Goldstein, y J. M. Tanner, «A New System of Dental Age Assessment», Hum. Biol., vol. 45, n.o 2, pp. 211-227, 1973.

[2] R. Cameriere, L. Ferrante, y M. Cingolani, «Age estimation in children by measurement of open apices in teeth», Int. J. Legal Med., vol. 120, n.o 1, pp. 49-52, ene. 2006, doi: 10.1007/s00414-005-0047-9.

[3] H. H. Mincer, E. F. Harris, y H. E. Berryman, «The A.B.F.O. study of third molar development and its use as an estimator of chronological age», J. Forensic Sci., vol. 38, n.o 2, pp. 379-390, mar. 1993.

[4] A. Uys, H. Bernitz, S. Pretorius, y M. Steyn, «Estimating age and the probability of being at least 18 years of age using third molars: a comparison between Black and White individuals living in South Africa», Int. J. Legal Med., vol. 132, n.o 5, pp. 1437-1446, sep. 2018, doi: 10.1007/s00414-018-1877-6.

[5] D. L. S et al., «Accuracy of Cameriere’s cut-off value for third molar in assessing 18 years of age», Forensic Sci. Int., vol. 235, feb. 2014, doi: 10.1016/j.forsciint.2013.10.036.

[6] M. Melo, F. Ata-Ali, J. Ata-Ali, J. M. Martinez Gonzalez, y T. Cobo, «Demirjian and Cameriere methods for age estimation in a Spanish sample of 1386 living subjects», Sci. Rep., vol. 12, n.o 1, p. 2838, feb. 2022, doi: 10.1038/s41598-022-06917-x.

[7] J. Albernaz Neves et al., «Validation of the Third Molar Maturation Index (I3M) to assess the legal adult age in the Portuguese population», Sci. Rep., vol. 10, n.o 1, p. 18466, oct. 2020, doi: 10.1038/s41598-020-75324-x.

[8] N. Angelakopoulos et al., «Comparison of the third molar maturity index (I3M) between left and right lower third molars to assess the age of majority: a multi-ethnic study sample», Int. J. Legal Med., vol. 135, n.o 6, pp. 2423-2436, nov. 2021, doi: 10.1007/s00414-021-02656-2.

[9] F. De Micco, F. Martino, L. A. Velandia Palacio, M. Cingolani, y C. P. Campobasso, «Third molar maturity index and legal age in different ethnic populations: Accuracy of Cameriere’s method», Med. Sci. Law, vol. 61, n.o 1_suppl, pp. 105-112, ene. 2021, doi: 10.1177/0025802419900686.

[10] M. Sundararajan, A. Taly, y Q. Yan, «Axiomatic attribution for deep networks», en International conference on machine learning, PMLR, 2017, pp. 3319-3328. Accedido: 18 de diciembre de 2023. [En línea]. Disponible en: http://proceedings.mlr.press/v70/sundararajan17a.html

AUTHOR

Javier Venema

Computer scientist and Msc. in Data Science and Computer Engineering from the University of Granada. He is currently working as a researcher at Panacea Cooperative Research while developing a PhD thesis focused on biological profile estimation using artificial intelligence.