| Home | E-Submission | Sitemap | Editorial Office |  
top_img
Journal of Korean Neurosurgical Society > Epub ahead of print
Kang, Lee, Kim, and Noh: Machine Learning Model for Recurrent Lumbar Disc Herniation after Lumbar Discectomy

Abstract

Objective

Recurrent lumbar disc herniation (RLDH) is a significant challenge following lumbar discectomy, with recurrence rates of 5-15%. Established risk factors include male gender, diabetes mellitus, smoking, and obesity, but the role of paraspinal muscles in recurrence is unclear. This study was conducted to identify key risk factors for RLDH, including the volume of paraspinal muscles with machine learning.

Methods

We used data from 126 patients who underwent lumbar discectomy between January 2003 and September 2023 and had follow-up outpatient visits for more than 6 months at a single institution. Variables selected for the model, comprising demographic and clinical variables, medical history, LDH operation-related variables, and magnetic resonance imaging (MRI) measurements for RLDH. Based on clinical symptoms and radiologic results, the patients were classified into RLDH and non-RLDH groups, and RLDH was defined at the same surgical level on follow-up MRI.

Results

Totally, 38 patients were included in the RLDH group and 88 in the non-RLDH group. The volume of quadratus lumborum was identified as a risk factor for RLDH (odds ratio, 7.894; p=0.001). Among the five different machine learning algorithms, extreme gradient boosting achieved the best result with an accuracy of 0.794 and area under the curve of 0.811. In terms of Shapley Additive exPlanations value analysis, the weight, volume of quadratus lumborum, psoas major, and vertebra were key features for predicting RLDH.

Conclusion

The prediction model would be of great assistance for surgeons to make surgical decisions or establish observation intervals.

INTRODUCTION

While advancements in lumbar discectomy techniques have enhanced patient outcomes for lumbar disc herniation (LDH), recurrent LDH (RLDH) remains a major complication [15]. Approximately 5-15% of LDH cases recur after lumbar discectomy [9,10,21]. Therefore, both primary and tertiary prevention strategies are critical for minimizing recurrence and improving quality of life.
Various machine learning (ML) prediction models and factor analysis techniques have been used in the early diagnosis of diseases or abnormalities. Several ML algorithms have been specifically explored for the primary prevention of LDH [7]. One such algorithm was demonstrated to be useful for predicting LDH, achieving an F1-score of 0.830 and an accuracy of 0.922 [4]. The cross-sectional areas (CSAs) of paraspinal muscles and blood vessels were used as key features, with the CSA of quadratus lumborum (QL) and abdominal aorta emerging as the most important predictors.
However, in the context of tertiary prevention, risk factors for RLDH are still under investigation and debate. A meta-analysis by Lu et al. [11] showed that age, modic changes, body mass index (BMI), education level, diabetes mellitus (DM), smoking, protrusion type of LDH, and improper physical labor were significantly associated with RLDH [5], but accurate prediction of RLDH after lumbar discectomy and identification of its key contributing factors have yet to be fully established. In addition, the correlation between paraspinal muscles and LDH has been exhaustively studied [1,3,5,6,11,14,20,22,23], but their relationship with RLDH remains limited. Only one study has shown that among the three paraspinal muscles—psoas major (PM), multifidus muscle (MM), and erector spinae (ES)—the preoperative CSA of PM was correlated with RLDH [6].
This study was conducted with two primary objectives for the tertiary prevention of RLDH after lumbar discectomy. First, we aimed to identify the risk factors for RLDH, particularly including the volume of various paraspinal muscles. Second, we developed an ML-based model to accurately predict RLDH risk and uncover key contributing factors.

MATERIALS AND METHODS

This research was approved by the Institutional Review Board (IRB) of Ajou University (AJOUIRB-DB-2024-228).

Patient selection

We retrospectively analyzed data from 455 patients who underwent lumbar discectomy at our institution between January 2003 and September 2023. Of 455 patients, we selected 195 patients who underwent follow-up outpatient visits 6 months or later to determine if there was a recurrence. In total, 126 patients were finally included in the analyses after 69 patients were excluded due to the following reasons : 1) 41 patients had no preoperative magnetic resonance imaging (MRI) within 1 month including those who had primary surgery at local hospitals [11], 2) 12 patients herniated within 6 months after lumbar discectomy [5], which suggests that the pain persisted because the surgery was less effective rather than a recurrence, and 3) 16 patients whose preoperative MRI had artifacts that obstructed the measurement of muscle mass. Patients were included in the RLDH group if they had both clinical symptoms and disc herniation confirmed at the surgical level on follow-up MRI after 6 months or later [5]. Thus, 38 patients were included in the RLDH group, and the remaining 88 were included in the non-RLDH group (Fig. 1).

Clinical data collection

We selected 15 variables that can be easily obtained from the majority of patients prior to lumbar discectomy from electronic medical record data, including demographic and clinical variables, and medical history such as age, sex, height, weight, BMI, level of LDH, smoking, alcohol, DM, hypertension, cerebrovascular accident, and modified frailty scale. We additionally included three LDH operation-related variables such as total operation time, length of stay (LOS), and blood loss.

Radiologic data collection

To investigate the relationship between paraspinal muscles and RLDH, we calculated the volume of muscles using preoperative MRI data. We transformed T2-weighted axial images into the Digital Imaging and Communications in Medicine (DICOM) format. By using the ANOTHEN (NOH THINKING Co., Ltd., Seongnam, Korea) program, we preprocessed individual MRI data by windowing and labeling the location area, and then volume measurements were made. The axial CSA was divided into seven types : PM, MM, QL, ES, vertebra, fat, and spinal cord. In this study, we considered volume measurements for vertebra, fat, and spinal cord in addition to paraspinal muscles.

External validation

To further evaluate the generalizability of the final model, we conducted external validation using an independent dataset obtained from another institution. This dataset was not involved in model training or internal cross-validation (CV). For external validation, we used data from 62 patients who underwent lumbar discectomy at a different tertiary care hospital between 2015 and 2022. Inclusion and exclusion criteria were consistent with those of the internal cohort. Of these, 18 patients developed recurrent LDH and 44 did not.

Statistical and ML analysis

Descriptive statistics for baseline characteristics of demographic variables were calculated by considering their data types. We conducted Mann-Whitney U-test for numerical variables and implemented the chi-squared test and Fisher’s exact test for categorical variables. A significance level of p<0.05 was set as the threshold for statistical significance. In the case of MRI measurements, we conducted pre-test odds ratio analysis to determine the statistical association of paraspinal muscle volumes with LDH. The volume of paraspinal muscles was converted from a continuous variable to a categorical variable based on a cutoff value to proceed with multivariable logistic regression (LR). The optimal cutoff was calculated based on receiver operating characteristic (ROC) curve (Supplementary Table 1). For LR analysis, muscle volumes were dichotomized into categorical variables using optimal cut-off values derived from ROC analysis. In contrast, for ML model training, all muscle volume variables were entered as continuous values to preserve the full granularity of the data.
We considered feature selection for dimension reduction to boost the accuracy of the RLDH prediction model. For feature selection, we used the combination of two methods, Boruta selection [8] and forward selection methods to effectively select significant features. The Boruta selection algorithm is an ensemble-based feature selection algorithm that identifies all relevant variables in a dataset by iteratively comparing the importance of real features to that of randomly shuffled copies. To reduce bias from feature importance calculation, we incorporated the Boruta selection and forward selection methods using a nested 5-fold CV scheme. The initial features as accepted by the Boruta algorithm were selected, and then subsequently, additional features were incorporated into the forward selection process, with each feature’s influence on model development being recorded. The Boruta and forward selection methods identified the following features as significant for ML model training: weight, QL volume, PM volume, vertebral body volume, fat volume, LOS, and operation time. These features were consistently retained across CV folds and were used as input variables in the final ML models (Supplementary Fig. 1).
Using selected features from training sets, five supervised ML algorithms were used for RLDH risk prediction : LR, extreme gradient boosting (XGBoost), light gradient boosting machine (LightGBM), categorical boosting (CatBoost), and multilayer perceptron (MLP). The task of RLDH risk prediction is a binary classification task, so we used the area under ROC curve (AUROC) as the primary model evaluation metric for this study. Grid search was used to optimize the hyperparameters of each model until an optimal model was identified to maximize the area under the curve (AUC) score. The average AUC of the five test results was reported to indicate the final performance of the model. The overall performance for the AUC score of each test data was interpreted based on a 95% confidence interval (CI). Additional evaluation metrics such as accuracy, F1-score, sensitivity, and specificity were employed to assess the robust performance of each model. All statistical analyses were performed with R version 4.3.0 (R Foundation for Statistical Computing, Vienna, Austria), and ML analyses were performed with Python version 3.11.8 (Python Software Foundation, Wilmington, DE, USA).

RESULTS

Patient demographics

A total of 126 patients underwent lumbar discectomy and met the inclusion criteria of our study (64 females [50.8%] and 62 males [49.2%]). The demographic data are presented in Table 1. The majority of patients who underwent successful lumbar discectomy did not undergo outpatient follow-up, which led to difficulties in obtaining a sufficient sample size. The patients’ average age was 47.9±15 years. Between the RLDH and non-RLDH groups, only LOS was significantly different.

Statistical analysis for MRI measurements

The results of the LR analyses are shown in Table 2. In the univariate LR, the volumes of QL and fat were significant. Only QL was significant in multivariate LR with an odds ratio of 7.894, which implies that patients with QL greater than 55 mL were 7.894 times more likely to have RLDH than patients with QL less than 55 mL.

Performance of the RLDH prediction models

The comparative performance of five ML models for RLDH risk prediction is summarized in Table 3. XGBoost demonstrated the best overall performance, with the second highest AUC (0.811; 95% CI, 0.791-0.832), accuracy (0.794; 95% CI, 0.782-0.807), balanced sensitivity (0.658; 95% CI, 0.630-0.6987), and specificity (0.894; 95% CI, 0.883-0.904). CatBoost also performed well, with the highest AUC of 0.813, but its accuracy and F1-score were lower than those of XGBoost. Light-GBM had the lowest AUC, accuracy, and F1-score compared to the other boosting-based models, indicating poorer performance overall. Additionally, the deep learning model MLP and the linear model LR, both showed the poorest overall performance. These results suggest that XGBoost provides the most reliable and balanced predictive performance for RLDH risk among the models tested, particularly in contexts where both sensitivity and specificity are crucial. Fig. 2 illustrates the ROC plots prepared using ML-based predictive models.

Explanatory analysis for the final RLDH prediction model

For RLDH risk prediction, the XGBoost model was selected as the final predictive model due to its superior performance across multiple evaluation metrics. To interpret the predictions extracted by XGBoost and to deeply understand the contribution of individual features to the risk of RLDH, we employed Shapley Additive exPlanations (SHAP) values [12]. SHAP values offer a unified measure of feature importance by quantifying the contribution of each feature to the model’s output; it allows for a transparent post-hoc analysis, providing insights into how each feature influences the prediction of RLDH risk. Fig. 3 compares the feature importance rankings derived from XGBoost’s internal methods with those obtained using SHAP values. Despite both methods showing significant differences in terms of importance, the most common influential variables of both methods were LOS and L5/S1. In terms of SHAP values (Fig. 3B), the Weight variable was the most important risk factor for RLDH prediction, and this variable had the greatest impact on the outcome of XGBoost (Fig. 3A), followed by vertebra and fat, which were also significant for the predictions. Additionally, clinical and operational factors such as LOS, specific spinal levels, and operation time contributed meaningfully to the model’s decisions. Overall, the results emphasize that predictions of XGBoost are driven by a combination of anatomical features, clinical history, and procedural variables, underscoring the complexity and multifactorial nature of the predictive process (Supplementary Fig. 2).
A sample-wise force plot was then prepared to visualize and assess the positive-negative correlation between the features and each prediction value (Fig. 4). In terms of sample-wise analysis, Fig. 4A shows the model’s contribution of each variable toward predicting a value of 1. In this case, variables such as Weight and QL significantly increase the predicted value, thereby playing a crucial role in steering the model toward a prediction of 1. As shown in Fig. 4B, the contributions were analyzed when the model predicted a value of 0. Here, Weight and LOS substantially reduced the predicted value, serving as key determinants in guiding the model toward a prediction of 0. Overall, the Weight variable considerably influenced the model’s predictions, demonstrating its importance across different prediction outcomes.

External validation

Table 4 presents the external validation results performed on an independent dataset obtained from another institution, which was not used for model training. The final model selected was used for RLDH prediction. The performance indicators show the generalizability of the model, maintaining consistent accuracy, precision, recall, and AUROC compared to the internal validation results.

DISCUSSION

Several studies have attempted to create models to predict RLDH. Shan et al. [19] used an ML model to predict RLDH with a higher AUC and accuracy than traditional LR. A more sophisticated rLDH prediction model using deep learning algorithms was developed by Ren et al. [17]. These previous studies used ML algorithms based on patient clinical information and imaging information such as modic changes, radiographic instability, and Pfirrmann criteria. However, we are the first to develop an ML model with preoperative paraspinal structure volumes as variables, which are expected to be correlated with RLDH. Since the volume of paraspinal structures can be easily calculated by AI (artificial intelligence) from a single preoperative MRI image, the accuracy of the model is expected to be improved. Importantly, our model was validated on an independent external dataset, demonstrating stable predictive performance across cohorts. This finding indicates that the model may be applicable beyond the development dataset, enhancing its potential for clinical translation.
The multivariable LR showed that only QL among paraspinal muscles had a significant relationship with RLDH (p=0.01). According to Schmidt et al. [18], flexion exerts more pressure on the intervertebral disc than extension. Thus, QL, which causes the spine to flex laterally [20], would exert a large compressive force, leaving the resected disc vulnerable to RLDH. Moreover, if the QL is overdeveloped and if only one side of the QL is hypertrophied, there are chances of more severe imbalances than in the case of other paraspinal muscles. Since the QL is responsible for the lateral flexion of the spine, it is likely to have a larger difference in moment due to side-to-side imbalance.
Choi et al. [2] analyzed the reason for the lack of effect of ES and MF on RLDH despite their function to stabilize the disc by extending the lower back. Due to the posterior approach to the spine during discectomy, varying degrees of proximal atrophy occur in these two muscles behind the spine. Therefore, a direct correlation between the mass of the preoperative ES and MF and RLDH is not apparent because of varying degrees of muscle damage and atrophy.
LOS was also identified as a significant clinical feature. A longer LOS for discectomy patients typically indicates complications or a slower recovery. It might also reflect a more complex surgical procedure and patient comorbidities. Therefore, it seems plausible that RLDH is more likely to occur in patients with a long LOS.
Beyond statistical performance, the clinical implications of our model deserve emphasis. The ability to stratify patients into high- and low-risk groups for RLDH immediately after surgery can support several aspects of clinical decision-making. First, in surgical planning, patients predicted to be at higher risk might benefit from modified surgical strategies or reinforcement techniques aimed at reducing recurrence. Second, in postoperative management, high-risk patients may require more intensive rehabilitation programs focusing on paraspinal muscle balance and core stabilization. Third, regarding follow-up strategies, risk stratification allows clinicians to individualize follow-up schedules, with high-risk patients undergoing more frequent clinical visits and imaging surveillance. Together, these applications illustrate how the model may directly inform real-world practice and contribute to improving patient outcomes.
In our study, we developed a successful ML prediction model with an AUC of 0.811 and an accuracy of 0.794. Comparison feature importance of XGBoost and SHAP value analysis showed that LOS was the most common influential variable. In SHAP value analysis, Weight and volume of vertebral body were the most important variables. Volumes of QL and PM were significant variables for predicting RLDH.
The identification of LOS and QL as significant variables is consistent with our LR results. Choi et al. [2] showed that the CSA of PM is positively correlated with RLDH. According to a meta-analysis by Lu et al. [11], BMI was significantly correlated with RLDH.
According to Ollila et al. [13], the CSA of L4 vertebral body was positively correlated with LDH. According to Videman et al. [24], increased vertebral height was related to decreased disc height and could be associated with disc degeneration. According to Pouriesa et al. [16], the disproportion of end plate could be a risk factor for LDH. Therefore, it is reasonable to assume that large, unbalanced vertebrae would have a significant relationship with LDH. Although the mechanisms of LDH and RLDH are different, it is thought that large vertebrae are similarly bad for RLDH.

Limitations

First, the sample size of the study was small, which led to insufficient statistical power and caused data imbalance and poor generalization performance in our ML models. This is a problem caused by the loss of follow-up of lumbar discectomy patients. Out of 455 patients, 260 were lost to follow-up. This is likely because the majority of patients with successful lumbar discectomy and resolved symptoms do not return to the hospital. Therefore, if we increase the sample size by combining data from other institutions, we can expect better performance of the ML models. Second, the retrospective design of the study necessitates the addition and updating of data. Moreover, we were unable to analyze variables for which we had no data. Analysis of the values of more risk factors and their use in training the ML models would have improved the model performance.

CONCLUSION

In this study, our objectives were twofold. First, we aimed to identify risk factors for RLDH. Among various clinical features and preoperative volume of paraspinal muscles, LOS and volume of QL had a significant effect on RLDH. Second, we investigated the use of an ML prediction model for RLDH. XGBoost showed the best performance with an AUC of 0.811 and an accuracy of 0.794. Our ML model predicts recurrence using only preoperative MRI, electronic medical records, and operation records. As soon as the surgery is concluded, we can input all the data needed to predict recurrence, allowing the neurosurgeon to make a quick decision regarding surgery and set follow-up intervals.

Notes

Conflicts of interest

No potential conflict of interest relevant to this article was reported.

Informed consent

Informed consent was obtained from all individual participants included in this study.

Author contributions

Conceptualization : SYK, SJL, SK, SHN; Data curation : SYK, SJL, SK, SHN; Formal analysis : SK, SHN; Funding acquisition : SK, SHN; Methodology : SYK, SJL, SK, SHN; Project administration : SYK, SJL, SK, SHN; Visualization : SYK, SJL, SK, SHN; Writing - original draft : SYK, SJL; Writing - review & editing : SK, SHN

Data sharing

None

Preprint

None

Acknowledgements

The work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. RS-2023-00211493) and the IITP (Institute for Information & Communications Technology Planning & Evaluation)-ITRC (Information Technology Research Center) grant funded by the Korea government (MSIT) (No. IITP-2025RS-2024-00437756).

Supplementary materials

The online-only data supplement is available with this article at https://doi.org/10.3340/jkns.2025.0120.
Supplementary Table 1.
Optimal cutoff for MRI measurements
jkns-2025-0120-Supplementary-Table-1.pdf
Supplementary Fig. 1.
Boruta and feedforward analysis for feature selection. AUC : area under the curve, Hosp. : hospitalization, QL : quadratus lumborum.
jkns-2025-0120-Supplementary-Fig-1.pdf
Supplementary Fig. 2.
Overall summary plot of XGBoost using the SHAP value. XGBoost : extreme gradient boosting, SHAP : Shapley Additive exPlanations.
jkns-2025-0120-Supplementary-Fig-2.pdf

Fig. 1.
Patient selection flowchart. MRI : magnetic resonance imaging, RLDH : recurrent lumbar disc herniation.
jkns-2025-0120f1.jpg
Fig. 2.
ROC curve visualization of prediction models. ROC : receiver operating characteristic, XGBoost : extreme gradient boosting, AUC : area under the curve, CI : confidence interval, CatBoost : categorical boosting, LGBM : light gradient boosting machine, MLP : multilayer perceptron.
jkns-2025-0120f2.jpg
Fig. 3.
Comparison between XGBoost internal feature importance (A) and XGBoost SHAP value (B). QL : quadratus lumborum, XGBoost : extreme gradient boosting, SHAP : Shapley Additive exPlanations.
jkns-2025-0120f3.jpg
Fig. 4.
Sample-wise force plots for both predictions of XGBoost using SHAP values. A : Feature significance of predicting 1. B : Feature significance for predicting 0. QL : quadratus lumborum, XGBoost : extreme gradient boosting, SHAP : Shapley Additive exPlanations.
jkns-2025-0120f4.jpg
Table 1.
Patient demographics
Variable Recurrence (n=38) Non-recurrence (n=88) p-value
Age (years) 46.8±13.3 48.4±17.0 0.529
Sex
 Female 16 (42.1) 48 (54.5)
 Male 22 (57.9) 40 (45.5)
Height (cm) 165.9±8.8 165.1±9.7 0.456
Weight (kg) 70.2 (10.5) 70.0 (17.2) 0.365
BMI (kg/m2) 25.5 (3.08) 25.4 (4.32) 0.395
Total operation time (minutes) 74.4 (35.8) 68.8 (38.2) 0.224
Length of stay (days) 8.5 (6.1) 6.4 (3.2) 0.004
Follow-up duration (months) 37.2 (10.8) 41.8 (12.8) 0.089
Blood loss (mL) 41.8 (62.3) 54.7 (156.9) 0.556
Smoking
 Absent 29 (76.3) 80 (90.9) 0.055
 Present 9 (23.7) 8 (9.1)
Alcohol
 Absent 29 (76.3) 74 (84.1) 0.432
 Present 9 (23.7) 14 (15.9)
DM
 Absent 28 (73.7) 71 (80.7) 0.521
 Present 10 (26.3) 17 (19.3)
HTN
 Absent 19 (50.0) 58 (65.9) 0.138
 Present 19 (50.0) 30 (34.1)
CVA
 Absent 35 (92.1) 82 (93.2) 1.000
 Present 3 (7.9) 6 (6.8)
Disc level 0.200
 L2/3 3 (7.89) 5 (5.68)
 L3/4 2 (5.26) 11 (12.50)
 L4/5 19 (50.00) 49 (55.68)
 L5/S1 18 (47.37) 25 (28.41)
Volume (mL)
 Psoas 175.94±52.37 188.77±74.42 0.272
 Multifidus 122.44±29.87 125.66±36.90 0.845
 Quadratus lumborum 46.29±15.83 44.90±17.81 0.665
 Erector spinae 338.26±114.37 349.07±131.66 0.643
 Vertebra 274.98±69.33 293.72±86.64 0.201
 Fat 468.92±216.52 549.01±258.14 0.076
 Spinal cord 16.84±7.65 16.27±8.42 0.711

Values are presented as mean±standard deviation or number (%). BMI : body mass index, DM : diabetes mellitus, HTN : hypertension, CVA : cerbrovascular accident

Table 2.
Odds ratio (OR) analysis for paraspinal muscle measurements
Variable Univariable
Multivariable
OR (95% CI) p-value OR (95% CI) p-value
Psoas
 <196.6 cm3 - - - -
 ≥196.6 cm3 0.679 (0.289-1.520) 0.357 0.404 (0.135-1.081) 0.085
Multifidus
 <155.9 cm3 - - - -
 ≥155.9 cm3 0.589 (0.182-1.625) 0.334 0.465 (0.109-1.727) 0.271
Quadratus lumborum
 <55.1 cm3 - - - -
 ≥55.1 cm3 2.935 (1.257-6.902) 0.013* 7.894 (2.414-29.987) 0.001*
Erector spinae
 <308.9 cm3 - - - -
 ≥308.9 cm3 0.653 (0.302-1.402) 0.275 0.741 (0.177-3.033) 0.676
Vertebra
 <280.17 cm3 - - - -
 ≥280.17 cm3 0.464 (0.208-1.004) 0.054 0.615 (0.156-2.301) 0.473
Fat
 <521.9 cm3 - - - -
 ≥521.9 cm3 0.409 (0.171-0.921) 0.036* 0.562 (0.214-1.427) 0.231
Spinal cord
 <11.88 cm3 - - - -
 ≥11.88 cm3 0.641 (0.282-1.485) 0.291 0.739 (0.302-1.849) 0.510

* Statistically significant.

CI : confidence interval

Table 3.
Performances of machine learning models for RLDH risk prediction
Model AUC (95% CI) Accuracy (95% CI) F1 score (95% CI) Sensitivity (95% CI) Specificity (95% CI)
XGBoost 0.811 (0.791-0.832) 0.794 (0.782-0.807) 0.626 (0.600-0.651) 0.658 (0.630-0.687) 0.894 (0.883-0.904)
CatBoost 0.813 (0.794-0.831) 0.731 (0.716-0.745) 0.504 (0.480-0.529) 0.567 (0.539-0.594) 0.860 (0.847-0.873)
LightGBM 0.767 (0.746-0.788) 0.715 (0.700-0.729) 0.527 (0.501-0.554) 0.617 (0.58 0.647) 0.807 (0.794-0.820)
MLP 0.779 (0.762-0.797) 0.699 (0.665-0.725) 0.456 (0.392-0.466) 0.525 (0.442-0.566) 0.829 (0.793-0.854)
LR 0.709 (0.686-0.732) 0.627 (0.611-0.644) 0.509 (0.490-0.529) 0.750 (0.721-0.779) 0.593 (0.576-0.611)

RLDH : recurrent lumbar disc herniation, CI : confidence interval, XGBoost : extreme gradient boosting, CatBoost : categorical boosting, LightGBM : light gradient boosting machine, MLP : multilayer perceptron, LR : logistic regression

Table 4.
External Validation result of RLDH prediction model
Model Accuracy Precision Recall F1-score AUROC AURPC
Final PCF 0.9237 0.9134 0.9156 0.9341 0.90125 0.9297

RLDH : recurrent lumbar disc herniation, AUROC : area under receiver operating characteristic curve, AURPC : area under the precision-recall curve, PCF : positive classification fraction

References

1. Carvalho V, Santos J, Santos Silva P, Vaz R, Pereira P : Relationship between fatty infiltration of paraspinal muscles and clinical outcome after lumbar discectomy. Brain Spine 2 : 101697, 2022
crossref pmid pmc
2. Choi TY, Chang MY, Lee SH, Cho JG, Lee S : Psoas muscle measurement as a predictor of recurrent lumbar disc herniation: a retrospective blind study. Medicine (Baltimore) 101 : e29778, 2022
crossref pmid pmc
3. Hansen L, de Zee M, Rasmussen J, Andersen TB, Wong C, Simonsen EB : Anatomy and biomechanics of the back muscles in the lumbar spine with reference to biomechanical modeling. Spine (Phila Pa 1976) 31 : 1888-1899, 2006
crossref pmid
4. Huang W, Han Z, Liu J, Yu L, Yu X : Risk factors for recurrent lumbar disc herniation: a systematic review and meta-analysis. Medicine (Baltimore) 95 : e2378, 2016
pmid pmc
5. Kim KT, Park SW, Kim YB : Disc height and segmental motion as risk factors for recurrent lumbar disc herniation. Spine (Phila Pa 1976) 34 : 2674-2678, 2009
crossref pmid
6. Kim WH, Lee SH, Lee DY : Changes in the cross-sectional area of multifidus and psoas in unilateral sciatica caused by lumbar disc herniation. J Korean Neurosurg Soc 50 : 201-204, 2011
crossref pmid pmc
7. Kocaman H, Yıldırım H, Gökşen A, Arman GM : An investigation of machine learning algorithms for prediction of lumbar disc herniation. Med Biol Eng Comput 61 : 2785-2795, 2023
crossref pmid pdf
8. Kursa MB, Rudnicki WR : Feature selection with the Boruta package. J Stat Softw 36 : 1-13, 2010

9. Li X, Hu Z, Cui J, Han Y, Pan J, Yang M, et al : Percutaneous endoscopic lumbar discectomy for recurrent lumbar disc herniation. Int J Surg 27 : 8-16, 2016
crossref pmid
10. Lin A, Wang Y, Zhang H, Zhu K, Zhou D, Guo J, et al : Endoscopic revision strategies and outcomes for recurrent L4/5 disc herniation after percutaneous endoscopic transforaminal discectomy. J Pain Res 17 : 761-770, 2024
crossref pmid pmc pdf
11. Lu HB, Wang LS, Li MQ, Chen X : The association between changes in multifidus muscle morphology and back pain scores following discectomy surgery for lumbar disc herniation: a systematic review and meta-analysis. Eur Spine J 31 : 1784-1794, 2022
crossref pmid pdf
12. Lundberg SM, Lee SI : A unified approach to interpreting model predictions. Available at : https://www.semanticscholar.org/paper/A-Unified-Approach-to-Interpreting-Model-LundbergLee/442e10a3c6640ded9408622005e3c2a8906ce4c2

13. Ollila L, Oura P, Karppinen J, Niinimäki J, Junno JA : Association between vertebral cross-sectional area and lumbar disc displacement - a population-based study. Eur Spine J 33 : 900-905, 2024
crossref pmid pdf
14. Öztürk PE, Aylanç N : Which is the most affected muscle in lumbar back pain - multifidus or erector spinae? Pol J Radiol 85 : e278-e286, 2020
pmid pmc
15. Pan M, Li Q, Li S, Mao H, Meng B, Zhou F, et al : Percutaneous endoscopic lumbar discectomy: indications and complications. Pain Physician 23 : 49-56, 2020
pmid
16. Pouriesa M, Fouladi RF, Mesbahi S : Disproportion of end plates and the lumbar intervertebral disc herniation. Spine J 13 : 402-407, 2013
crossref pmid
17. Ren G, Liu L, Zhang P, Xie Z, Wang P, Zhang W, et al : Machine learning predicts recurrent lumbar disc herniation following percutaneous endoscopic lumbar discectomy. Global Spine J 14 : 146-152, 2024
crossref pmid pmc pdf
18. Schmidt H, Kettler A, Heuer F, Simon U, Claes L, Wilke HJ : Intradiscal pressure, shear strain, and fiber strain in the intervertebral disc under combined loading. Spine (Phila Pa 1976) 32 : 748-755, 2007
crossref pmid
19. Shan ZM, Ren XS, Shi H, Zheng SJ, Zhang C, Zhuang SY, et al : Machine learning prediction model and risk factor analysis of reoperation in recurrent lumbar disc herniation patients after percutaneous endoscopic lumbar discectomy. Global Spine J 14 : 2240-2251, 2024
crossref pmid pmc pdf
20. Sun D, Liu P, Cheng J, Ma Z, Liu J, Qin T : Correlation between intervertebral disc degeneration, paraspinal muscle atrophy, and lumbar facet joints degeneration in patients with lumbar disc herniation. BMC Musculoskelet Disord 18 : 167, 2017
crossref pmid pmc pdf
21. Swartz KR, Trost GR : Recurrent lumbar disc herniation. Neurosurg Focus 15 : E10, 2003
crossref
22. Yaltırık K, Güdü BO, Işık Y, Altunok Ç, Tipi U, Atalay B : Volumetric muscle measurements indicate significant muscle degeneration in single-level disc herniation patients. World Neurosurg 116 : e500-e504, 2018
crossref pmid
23. Yazici A, Yerlikaya T : The relationship between the degeneration and asymmetry of the lumbar multifidus and erector spinae muscles in patients with lumbar disc herniation with and without root compression. J Orthop Surg Res 17 : 541, 2022
crossref pmid pmc pdf
24. Videman T, Battié MC, Gibbons LE, Gill K : Aging changes in lumbar discs and vertebrae and their interaction: a 15-year follow-up study. Spine J 14 : 469-478, 2014
crossref pmid
Editorial Office
1F, 18, Heolleung-ro 569-gil, Gangnam-gu, Seoul, Republic of Korea
TEL: +82-2-525-7552   FAX: +82-2-525-7554   E-mail: office@jkns.or.kr
About |  Browse Articles |  Current Issue |  For Authors and Reviewers
Copyright © Korean Neurosurgical Society.                 Developed in M2PI
Close layer