Introduction

The most important part of orthodontic treatment is a proper diagnosis and the establishment of a treatment plan [1]. A proper diagnosis defines the problems of the patient so that a problem list can be identified. Once the diagnosis is made, clinicians should establish treatment goals to address the identified problems. There are many instances in which orthodontic therapy alone can be used to camouflage skeletal discrepancies with dental compensations. Other times it is necessary for the clinician to include orthognathic surgery as a part of the treatment plan. The pivotal part of treatment planning is the decision about whether orthognathic surgery is needed. Various factors such as desired profile changes, size of the upper airway, crowding, incisor position, and long-term stability must be taken into consideration [2]. Previous studies have identified several cephalometric measurements that can be used to help distinguish between surgical and non-surgical treatment with specificity as high as 90% [2,3,4,5,6]. The importance of this decision must be seriously considered in order to protect patients from unnecessary risks that may lead to complications such as infection, postoperative malocclusion, hemorrhage, bad splits, inferior alveolar nerve injury, and irreversible treatment such as extractions [7].

Expert clinicians have been shaped by their education and clinical experience to develop their treatment philosophies. It is very difficult for inexperienced clinicians to develop this process in a short amount of time. Treatment planning is a complex process in which diagnostic data are organized and combined with background knowledge and clinical experience in a way that simply cannot be standardized into a formula [8]. An inexperienced orthodontist would benefit greatly from an artificial intelligence (AI) system that could be used to bridge this gap in experience. Moreover, AI systems may act as a complementary method that aids in decision-making, much like a second opinion. AI systems are not new to the field of dentistry [9]. Over the last two decades, AI models have been generated to help with endodontic diagnosis [10], radiographic diagnosis [11], and the determination of orthodontic treatment needs [12]. More recently in orthodontics, a variety of methods have been studied in the construction of AI systems that can support diagnosis, treatment planning, and planned tooth movement [13,14,15].

Among the methods of constructing an AI system, supervised machine learning allows computers to mimic the expert thought process and rationale in decision making. Supervised learning methods use a training dataset, usually collected retrospectively from electronic archives, that contains a set of dependent and independent variables for each case [16]. In the context of the present project, the dependent variable was the diagnostic decision assigned to each case by the practicing orthodontist, and the independent variables were demographic data and the measurements obtained from diagnostic records. The two main categories of supervised learning techniques are discriminative and generative models. Discriminative models learn a mapping between input values and corresponding output values for all cases in the training set by optimizing linear or nonlinear discriminant functions [17]. Among the most popular algorithms in this category are logistic regression [18], support vector machines [19], and neural networks [20]. Generative models, on the other hand, estimate the underlying probability distribution for each class and render a classification based on Bayes’ rule [17]:

$$P(A|B)=\frac{P(B|A)\times P(A)}{P(B)}$$

The current project required a binary decision, leading to two classes: surgery vs. non-surgery.
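Instantiated for these two classes, with $\mathbf{x}$ denoting the vector of input features, Bayes' rule yields the posterior probability of the surgery class; a case is assigned to surgery whenever this posterior exceeds that of non-surgery (i.e., exceeds 0.5):

$$P(\text{surgery}\mid\mathbf{x})=\frac{P(\mathbf{x}\mid\text{surgery})\,P(\text{surgery})}{P(\mathbf{x}\mid\text{surgery})\,P(\text{surgery})+P(\mathbf{x}\mid\text{non-surgery})\,P(\text{non-surgery})}$$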

To date, only one other study has used machine learning to develop and evaluate a model for incorporating this technology into the treatment planning of orthognathic surgery cases [21]. However, that study included only a limited number of cephalometric values and additional objective indexes. Our goal was to increase the number of cephalometric values in the input dataset in order to expand the search for predictive relationships between the independent and dependent variables. We also took into consideration the patient’s subjective desire to seek surgical treatment for esthetic reasons. Our aim was to develop a new machine learning model for the surgery/non-surgery decision in class III patients and to evaluate the validity and reliability of this model.

Materials and methods

Ethical statement

This project was submitted for review to the Indiana University Institutional Review Board and approved (March 03, 2021, #10220).

Study design

This was a retrospective study, and the sample consisted of 196 skeletal class III patients who visited the Department of Orthodontics and Orofacial Genetics, Indiana University. Subjects were included if they had a negative ANB value and a Wits appraisal of less than −1 mm. Exclusion criteria were missing teeth (other than third molars), malformed teeth, craniofacial anomalies such as cleft palate, and a documented anterior functional shift.

A full set of orthodontic records was collected for each patient. Treatment plans were decided by 1 orthodontic resident and 2 faculty orthodontic specialists. All 3 clinicians were blinded to one another’s decisions when the initial treatment decision was made. Complete agreement was reached in 167 of the 196 cases (85%) during this blinded initial decision process. The remaining 29 cases (15%) were re-evaluated a second time as a group, and a final treatment decision was made by complete agreement of all examiners.

A flow chart representing the group allocation, training, and testing processes is shown in Fig. 1. All cases were allocated randomly, 136 to the training set and the remaining 60 to the test set. Randomization was stratified by age, gender, and surgery so that both sets were proportionally balanced with respect to these three factors. The test set was not used for model construction and served only to evaluate the validity of the constructed model. To assess the reliability of the constructed model, 50 cases from the training set were used. The input values were obtained from 46 cephalometric measurement values (Table 1) and 7 additional indexes (Table 2). Categorical variables (“Sex at birth”, “Chief complaint”, and “Molar classification”) in the data were first converted into one-hot encoding vectors. With this extension, the number of features increased from 53 to 60. All feature values were normalized to between 0 and 1. A regularization constant that adjusts the tradeoff between regularization and empirical error was set to 0.5. Tracing and measurement of the lateral cephalogram for each patient were performed digitally by one investigator (H.L.) using Dolphin Imaging Version 12.0.09.39 (Patterson Dental Supply Inc., Chatsworth, CA, USA). Of the 196 included patients, 20 were randomly chosen and their cephalometric radiographs were traced again by the same examiner to assess the method error of the tracing.
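The implementation of these preprocessing and allocation steps is not reported here; the following is a minimal sketch of how they could be performed, assuming a pandas/scikit-learn workflow, a hypothetical input file class_iii_cases.csv, and an illustrative label column named "Surgery":

```python
# Minimal preprocessing and allocation sketch (assumed implementation; the file
# name and the "Surgery" label column are hypothetical).
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import MinMaxScaler

df = pd.read_csv("class_iii_cases.csv")            # 53 input variables + treatment decision

categorical = ["Sex at birth", "Chief complaint", "Molar classification"]
X = pd.get_dummies(df.drop(columns=["Surgery"]), columns=categorical)  # 53 -> 60 features
y = df["Surgery"]                                   # 1 = surgery, 0 = non-surgery
feature_names = list(X.columns)

# 136 training / 60 test cases; the study stratified by age, gender, and surgery,
# whereas this sketch stratifies on the surgery label only for brevity.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=60, stratify=y, random_state=0)

# Rescale every feature to [0, 1]; the scaler is fit on the training set only.
scaler = MinMaxScaler()
X_train = scaler.fit_transform(X_train)
X_test = scaler.transform(X_test)
```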

Fig. 1 Flow chart representing the group allocation, training, and testing processes. RF random forest, LR logistic regression

Table 1 Description of the lateral cephalometric data
Table 2 Additional input data

Statistical analyses

Bland–Altman plots, intraclass correlation coefficients (ICCs), and the standard deviation of the repeated measurements were calculated for each cephalometric measurement. Using the test set, the success rate of each classification model was estimated, along with a 95% confidence interval (CI). To predict surgical cases, we trained binary classifiers using two different methods: random forest (RF) and logistic regression (LR).

These two machine learning algorithms were chosen as representative examples of the broader categories of techniques to which they belong. RF is a non-parametric classifier that operates as an ensemble of decision trees, where each decision tree in the ensemble is considered a weak learner [22]. It is inspired by the observation that a large number of poorly correlated weak learners, operated as a committee, can outperform any individual constituent learner. Classification in RF is performed by majority voting. The key component of the RF algorithm is the diversity of the individual models. To create a set of poorly correlated models, RF uses a random subset of features to build each decision tree. The smaller the number of features selected, the lower the correlation among the individual models. However, if too few features are selected, more trees will be needed, which in turn increases the computational cost of training. LR belongs to the broader category of discriminative classifiers. Unlike other discriminative classifiers, LR uses a probabilistic discriminative model and can perform classification and feature selection at the same time when a 1-norm regularizer is used to optimize the discriminant vector. LR optimizes a linear hyperplane to maximize the joint posterior probabilities of the training examples. As the decision surface between the two classes is constrained to be linear, LR generally has very good generalization properties and is less likely to overfit the training data than more complex algorithms such as artificial neural networks (ANNs) or nonlinear support vector machines (SVMs) [23], which can generate highly nonlinear decision boundaries. The confidence of a classification decision can be readily interpreted from the posterior probabilities that LR generates during testing. Hyperparameters of each classifier were tuned on the training set by 10-fold cross-validation to maximize the area under the receiver operating characteristic (ROC) curve (AUC).
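A sketch of how this tuning could be carried out, continuing the preprocessing sketch above and assuming scikit-learn, is shown below; apart from the values reported in the Results, the grid entries are illustrative assumptions:

```python
# Hypothetical hyperparameter tuning: 10-fold cross-validation with AUC as the
# selection criterion.
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV

rf_grid = GridSearchCV(
    RandomForestClassifier(random_state=0),
    param_grid={"n_estimators": [100, 200, 400],       # number of trees in the ensemble
                "max_features": [5, 7, 10],            # features subsampled at each split
                "min_samples_leaf": [3, 5, 10]},       # minimum leaf size
    scoring="roc_auc", cv=10)
rf_grid.fit(X_train, y_train)

# A 1-norm (L1) regularizer lets LR drop uninformative features by zeroing their weights.
lr_grid = GridSearchCV(
    LogisticRegression(penalty="l1", solver="liblinear", max_iter=1000),
    param_grid={"C": [0.1, 0.5, 1.0, 2.0]},            # regularization constant
    scoring="roc_auc", cv=10)
lr_grid.fit(X_train, y_train)

rf_model, lr_model = rf_grid.best_estimator_, lr_grid.best_estimator_
```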

Results

Descriptive statistics

Descriptive statistics including mean, standard deviation, minimum and maximum values for the cephalometric input data are given in Table 3.

Table 3 Descriptive statistics of the variables

Reliability analyses

The ICC was used to evaluate the test–retest reliability of the tracings. Values were interpreted as follows: ICC less than 0.50, poor reliability; between 0.50 and 0.75, moderate reliability; between 0.75 and 0.90, good reliability; and greater than 0.90, excellent reliability [24]. The ICC was greater than 0.83 for all repeated measurements except for two soft tissue measurements, interlabial gap (0.69) and nasolabial angle (0.74), demonstrating good reliability overall. For the initial, blinded treatment decisions, an interexaminer agreement of 85% was achieved.
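The exact ICC variant is not stated; the following sketch assumes the two-way, consistency, single-measurement form (ICC(3,1)) and also computes the Bland–Altman bias and limits of agreement for one cephalometric measurement traced twice:

```python
# Bland-Altman statistics and a two-way consistency ICC (ICC(3,1)) for one
# cephalometric measurement traced twice on the same 20 radiographs.
import numpy as np

def bland_altman(x1, x2):
    diff = np.asarray(x1) - np.asarray(x2)
    bias = diff.mean()
    loa = 1.96 * diff.std(ddof=1)              # limits of agreement: bias +/- 1.96 SD
    return bias, bias - loa, bias + loa

def icc_3_1(x1, x2):
    data = np.column_stack([x1, x2])           # n subjects x k = 2 tracing occasions
    n, k = data.shape
    grand = data.mean()
    ss_rows = k * ((data.mean(axis=1) - grand) ** 2).sum()   # between-subject SS
    ss_cols = n * ((data.mean(axis=0) - grand) ** 2).sum()   # between-occasion SS
    ss_total = ((data - grand) ** 2).sum()
    ms_rows = ss_rows / (n - 1)
    ms_err = (ss_total - ss_rows - ss_cols) / ((n - 1) * (k - 1))
    return (ms_rows - ms_err) / (ms_rows + (k - 1) * ms_err)

# first, second: first and repeated tracings of one variable for the 20 re-traced cases
# print(icc_3_1(first, second), bland_altman(first, second))
```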

Results with RF

The number of trees in the ensemble and the number of features to subsample for training individual trees were treated as tuning parameters. Another parameter that affects the performance of individual trees is the minimum number of samples required for each leaf node, beyond which splitting of the node stops. These three parameters were tuned by grid optimization to maximize the AUC of the ensemble, and the final model was trained with the following values: number of decision trees = 200, number of features to sample = 7, minimum leaf size = 5. An AUC of 0.9395 was obtained on the test set. The 95% CI was computed by bootstrap sampling as [0.7908, 0.9799]. As the lower bound was higher than 0.50, the results were statistically significantly better than a random classifier. The ROC curve is plotted in Fig. 2a. Feature importance scores were computed for the RF classifier. Although the scores and ranks of the features varied between runs, RF consistently found “Molar classification”, “Overjet (mm)”, and “Wits appraisal (mm)” to be the three features with the highest importance scores. RF assigned an absolute importance score of 0.05 or higher to around 80% of the 53 available features. Using a probability threshold of 0.50, the RF model correctly classified cases with 90% accuracy. The sensitivity of this model was 84% and the specificity was 93%. The RF model also showed a strong negative predictive value (NPV) of 93% and a positive predictive value (PPV) of 84% (Fig. 2b).
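The threshold-based metrics and the bootstrap CI reported above could be reproduced along the following lines, assuming the fitted rf_model and the test split from the earlier sketches:

```python
# Test-set metrics at a 0.5 probability threshold and a percentile bootstrap 95% CI for the AUC.
import numpy as np
from sklearn.metrics import confusion_matrix, roc_auc_score

proba = rf_model.predict_proba(X_test)[:, 1]        # predicted probability of surgery
pred = (proba >= 0.5).astype(int)

tn, fp, fn, tp = confusion_matrix(y_test, pred).ravel()
accuracy = (tp + tn) / (tp + tn + fp + fn)
sensitivity = tp / (tp + fn)                        # surgery cases correctly flagged
specificity = tn / (tn + fp)                        # non-surgery cases correctly flagged
ppv, npv = tp / (tp + fp), tn / (tn + fn)

# Resample the test set with replacement and recompute the AUC in each replicate.
rng = np.random.default_rng(0)
y_arr = np.asarray(y_test)
aucs = []
for _ in range(2000):
    idx = rng.integers(0, len(y_arr), len(y_arr))
    if len(np.unique(y_arr[idx])) < 2:              # both classes must be present
        continue
    aucs.append(roc_auc_score(y_arr[idx], proba[idx]))
lower, upper = np.percentile(aucs, [2.5, 97.5])
```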

Fig. 2 a Receiver operating characteristic (ROC) curve obtained by the random forest classifier on the test set, with an area under the curve of 0.9395; the 95% confidence interval is [0.7908, 0.9799]. b Classification of the random forest model with a 0.5 probability threshold. c ROC curve obtained by the logistic regression classifier on the test set, with an area under the curve of 0.937; the 95% confidence interval is [0.8467, 0.9812]. d Classification of the logistic regression model with a 0.5 probability threshold. White and blue dots in panels b and d represent the error bars. NPV negative predictive value, PPV positive predictive value

Results with LR

Using the preprocessing described in the Materials and methods (one-hot encoding of the categorical variables, expanding the feature set from 53 to 60, and normalization of all feature values to between 0 and 1) and a regularization constant of 0.5, LR achieved an AUC of 0.937 on the test set. The 95% CI was computed by bootstrap sampling as [0.8467, 0.9812]. As the lower bound was higher than 0.50, the results were statistically significantly better than a random classifier. The ROC curve is plotted in Fig. 2c. Only 8 of the 60 features had a non-zero weight (Table 4), which suggests that the model found the remaining features not useful for discriminating between surgical and non-surgical cases. Using a probability threshold of 0.50, the LR model correctly classified 78% of the patients. The sensitivity of this model was 89% and the specificity was 73%. This model also showed an NPV of 94% and a PPV of 61% (Fig. 2d).
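Because of the 1-norm penalty, the retained features can be read directly from the fitted coefficient vector; a brief sketch follows, assuming the lr_model from the tuning sketch and the feature_names list from the preprocessing sketch:

```python
# Features retained by the L1-regularized logistic regression (non-zero weights),
# listed in order of decreasing absolute weight on the normalized features.
import pandas as pd

coefs = pd.Series(lr_model.coef_.ravel(), index=feature_names)
nonzero = coefs[coefs != 0]
print(nonzero.reindex(nonzero.abs().sort_values(ascending=False).index))
```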

Table 4 Features selected by the logistic regression classifier with non-zero weights. Weights are optimized on normalized features

Discussion

Machine learning has been applied to classification problems in many areas of dentistry [13, 25]. The decision for surgery or non-surgery can be seen as a classification problem. Both models used in this study have previously been shown to be useful when the primary goal was outcome prediction and important interactions or complex nonlinearities existed in a dataset [26]. As RF is an ensemble of 200 decision trees and each individual tree in turn contains multiple leaf nodes (each node constituting a rule), the results predicted by RF cannot be easily interpreted by the end user. It is often used as a black-box system, which may not present a desirable use case in clinical settings. LR used only a single rule involving eight variables, making it a far more interpretable model than RF. The best measure of each model’s success is its performance over a range of threshold settings rather than at a single operating point. Both the RF and LR models showed high separability when classifying patients for surgical or non-surgical treatment, with AUCs of 0.9395 and 0.937, respectively.

At a probability threshold of 0.50, RF was slightly better overall, correctly classifying 90% of patients for surgical or non-surgical treatment. RF was also slightly better at correctly identifying non-surgical patients, with a specificity of 93%. Similarly high levels of success have been reported for other machine learning models faced with classification for extractions [27] or surgery [21]. LR was slightly better at identifying patients requiring surgery, with a sensitivity of 89%, but the trade-off was a lower PPV. This indicates that the LR model was more likely to identify a patient as needing surgery when surgery was not recommended by the clinicians. In this study, borderline cases were defined as the 29 cases in which complete agreement was not obtained in the initial blinded treatment planning. Of these cases, 22 were assigned to the training set and 7 to the test set. In both models, all cases in which the model failed to identify the need for surgery were borderline cases. Among the misidentified non-surgery cases, only 1 in the LR model was considered a borderline case, and none in the RF model. Among the misidentified surgery cases, 2 in the LR model and 3 in the RF model were considered borderline cases.

In this study, the number of input features was increased compared with studies using similar models, in order to expand the search for predictive relationships between the independent and dependent variables [21, 27]. Many of the selected features are identical to those found in previous studies that evaluated the surgery decision for skeletal class III patients [2, 4, 28]. More importantly, all of these features play an important role in our clinical evaluation and treatment planning process. From a clinician’s perspective, the strongest indicator for orthognathic surgery is a severe anteroposterior (AP) discrepancy between the jaws. This is mostly seen in patients with a very negative ANB and Wits appraisal [28]. These patients also tend to present with a very negative overjet and a severe class III molar relationship.

In the most severe class III cases, patients also present with an increased vertical skeletal pattern, a combination of AP and vertical problems that typically manifests as an increased lower face height [29]. These cases almost always require surgery because the movements necessary to correct the vertical relationship will worsen the AP relationship [30]. However, the advancement of skeletal anchorage systems has allowed for better non-surgical treatment success in patients with mild to moderate anterior skeletal open bites [31].

Some of the more challenging clinical decisions concern cases that could be considered borderline. The most important clinical consideration in these patients is whether the patient will be able to tolerate the dental compensation without critically affecting the esthetic result [32, 33]. The angulation of the lower incisors tends to become more compensated with camouflage treatment [34]. Patients who are more likely to require surgical treatment exhibit more protrusive maxillary incisors, lingually inclined mandibular incisors, and a retrusive upper lip [30]. Generally, surgical treatment results in greater skeletal and profile changes due to the normalization of the skeletal bases [28]. The Holdaway H angle can be used to assess the balance of the lip profile relative to the rest of the face in order to determine an acceptable treatment goal for a surgical versus non-surgical approach [35]. Eslami et al. showed that the Holdaway H angle and the Wits appraisal can be used as critical diagnostic features to correctly classify 81% of patients when determining a treatment decision [4]. In another study, by Stellzig-Eisenhauer et al., 92% of the patients were correctly classified, with the Wits appraisal being the most decisive parameter [5]. The Holdaway H angle alone has been used to successfully classify 87% of patients [2].

Limitations and future directions

This study was designed as a feasibility study to demonstrate the possibility of using machine learning with cephalometric and demographic data and was limited by the sample size available at the time the study was conducted. However, even with the relatively small training sample, the method successfully classified patients in the test sample. Follow-up studies with larger datasets will help to improve the accuracy of the algorithms and allow these models to serve as another tool that orthodontists can use in the treatment planning of surgery cases. Furthermore, a larger patient sample will allow future studies to include the treatment decisions of a greater variety of experienced clinicians, incorporating differences in treatment philosophies to help refine the algorithm and shed more light on the borderline cases. Future studies should also incorporate diagnostic variables associated with the transverse dimension of occlusion, which has previously been shown to improve the success rate of such models [6].

Conclusions

This study shows that logistic regression and random forest machine learning models can be used to generate accurate and reliable algorithms that successfully classify up to 90% of patients in the treatment planning of class III orthognathic surgery. The features selected by each algorithm coincide with the clinical features that we, as clinicians, weigh heavily when determining a treatment plan for these patients. This study further supports the use of overjet, Wits appraisal, lower incisor angulation, and the Holdaway H angle as strong predictors in assessing a patient’s surgical needs.