- Research
- Open access
- Published:
Differentiating high-grade patterns and predominant subtypes for IASLC grading in invasive pulmonary adenocarcinoma using radiomics and clinical-semantic features
Cancer Imaging volume 25, Article number: 42 (2025)
Abstract
Objectives
The International Association for the Study of Lung Cancer (IASLC) grading system for invasive non-mucinous adenocarcinoma (ADC) incorporates high-grade patterns (HGP) and predominant subtypes (PS). Following the system, this study aimed to explore the feasibility of predicting HGP and PS for IASLC grading.
Materials and methods
A total of 529 ADCs from patients who underwent radical surgical resection were randomly divided into training and validation datasets in a 7:3 ratio. A two-step model consisting of two submodels was developed for IASLC grading. One submodel assessed whether the HGP exceeded 20% for ADCs, whereas the other distinguished between lepidic and acinar/papillary PS. The predictions from both submodels determined the final IASLC grades. Two variants of this model using either radiomic or clinical-semantic features were created. Additionally, one-step models that directly assessed IASLC grades using clinical-semantic or radiomic features were developed for comparison. The area under the curve (AUC) was used for model evaluation.
Results
The two-step radiomic model achieved the highest AUC values of 0.95, 0.85, 0.96 for grades 1, 2, 3 among models. The two-step models outperformed the one-step models in predicting grades 2 and 3, with AUCs of 0.89 and 0.96 vs. 0.53 and 0.81 for radiomics, and 0.68 and 0.77 vs. 0.44 and 0.63 for clinical-semantics (p < 0.001). Radiomics models showed better AUCs than clinical-semantic models for grade 3 regardless of model steps.
Conclusions
Predicting HGP and PS using radiomics can achieve accurate IASLC grading in ADCs. Such a two-step radiomics model may provide precise preoperative diagnosis, thereby supporting treatment planning.
Introduction
Lung cancer continues to be the foremost cause of cancer-related deaths globally [1]. Among its various forms, lung adenocarcinoma is the most prevalent histological subtype and has become the primary driver of lung cancer mortality. Although curative intent surgery can substantially improve survival rates or disease-specific outcomes [2,3,4], more than 30% of lung cancer patients still suffer from recurrence or die from their disease after surgery [5, 6].
The histological grading system is crucial for assessing the prognosis of patients with lung adenocarcinoma. The 2015 World Health Organization classification proposed five main histological subtypes that were closely associated with prognosis for grading lung adenocarcinoma patients [7,8,9,10]. However, this grading system may underestimate the impact of minor percentages of micropapillary, secondary subtypes and complex glandular patterns which can also significantly influence patient outcomes [10,11,12]. Considering this limitation, the International Association for the Study of Lung Cancer (IASLC) introduced a novel grading system that considered both predominant subtypes and the proportion of high-grade patterns regarding micropapillary, solid, and complex glandular [13]. Recent studies had evaluated this new grading system [14, 15]. Deng et al. found that patients with stage IB to III high-grade invasive non-mucinous adenocarcinomas exhibited an improved survival rate when treated with adjuvant chemotherapy [14]. The study by Rokutan-Kurata et al. showed a concordance index of 0.77 for assessing recurrence risk in stage I-IIIA lung adenocarcinoma patients using the IASLC grading system [15]. These results indicated that the IASLC system, which used primary subtypes with a threshold of 20% to define high-grade patterns, was superior for guiding postoperative prognosis.
In addition to the benefits of analysing postoperative pathology for prognosis, understanding histological subtypes before surgery is also of importance to guide preoperative treatment planning [16]. However, preoperative biopsy is limited by tumour heterogeneity and sampling constraints, making it challenging to comprehensively assess pathological subtypes. As the most commonly used method for preoperative evaluation of lung nodules, CT imaging can provide detailed morphological information. Building on this, the development of an accurate preoperative imaging-based model using CT scans to predict IASLC grades might serve as a valuable reference for clinical decisions regarding neoadjuvant therapy and lymph node dissection before surgery, potentially improving patient outcomes. For example, in patients with stage IB lung adenocarcinoma, those classified as IASLC grade 3 postoperatively are often advised to undergo chemotherapy or radiotherapy to reduce the risk of recurrence. By identifying patients who are likely to have IASLC grade 3 before surgery, the model can support clinical decision-making regarding neoadjuvant therapy.
To this end, recent studies have focused on using CT-based models for non-invasive IASLC grading [17, 18]. For instance, Yang et al. combined radiomics and semantic features on CT scans to create a nomogram that distinguished IASLC grades 1 and 2 from high-grade patients, achieving an AUC of 0.84 on the validation set [17]. Another study employed radiomic features to develop a model that attained an AUC of 0.90 in differentiating grade 1 from grades 2 and 3 [18]. These models classified patients into IASLC grades directly without predicting predominant subtypes and the proportion of high-grade patterns. While they showed acceptable performance in IASLC grading, incorporating the classification of predominant subtypes and estimating the proportion of high-grade patterns according to the IASLC system might further enhance grading accuracy.
Therefore, our research aimed to explore the effectiveness of predicting high-grade patterns and predominant subtypes in two steps for IASLC grading in invasive non-mucinous pulmonary adenocarcinoma using radiomic or clinical-semantic features. The study was also designed to compare these two-step models with one-step models that directly assessed IASLC grades using clinical-semantic or radiomic features. The ultimate goal was to accurately identify high-grade patients, thereby supporting preoperative treatment planning.
Materials and methods
Study participants
This retrospective study involved 958 consecutive patients who underwent radical surgical resection and presented with lung adenocarcinoma from Tianjin Medical University Cancer Institute and Hospital between April 2022 and December 2023. The study received approval from the medical ethical committee of the hospital (EK2023044). The inclusion criteria were: (1) thin-slice, plain chest CT images taken within one month before surgery; (2) detailed postoperative pathology reports with information on the proportion of pathological subtypes; (3) having received no treatment before surgery. The exclusion criteria were: (1) poor quality CT images; (2) without preoperative CT images within one month; (3) without clear histopathological subtypes; (4) precursor glandular lesions, minimally invasive adenocarcinoma or invasive mucinous adenocarcinoma [13].
Histopathology assessment
Two pathologists were involved in assessing and classifying invasive non-mucinous pulmonary adenocarcinoma, one with 3 years of experience and the other with 10 years of experience. The initial pathological analysis was performed by the junior pathologist, followed by a review and final confirmation by the senior pathologist to ensure diagnostic accuracy. The pathological classification followed the 5th edition of the World Health Organization classification of thoracic tumours [19]. Specifically, the diagnosis of invasive non-mucinous pulmonary adenocarcinoma involved a detailed examination of histologic patterns, categorized into lepidic, acinar, papillary and high-grade patterns including micropapillary, solid, and complex glandular [20]. They documented the proportion of each pathological subtype in increments of 5% and assigned grades based on the IASLC grading system [15]. Grade 1 represented a lepidic predominant pattern with no or less than 20% of high-grade components; Grade 2 denoted an acinar or papillary predominant pattern with no or less than 20% of high-grade components; Grade 3 corresponded to any tumour with 20% or more of high-grade components.
CT examination and clinical-semantic features
Low-dose chest CT scans were performed using GE Discovery CT750 HD, GE LightSpeed 16, and Siemens Somatom Sensation 64 CT devices. The scanning range was from the lung apex to below the diaphragm. The tube voltage was 120 kV, and the tube current was automatically adjusted. The reconstruction thickness was 1.25 mm with a spacing of 0.984 mm in the GE CT system, whereas the reconstruction thickness was 1.5 mm with a spacing of 0.95 mm for the Siemens CT device. Lung nodules on CT scans for each patient were independently evaluated by two experienced radiologists. During the evaluation, the radiologists were blinded to the histopathological results, and any discrepancies were resolved through mutual consultation. After evaluation, fourteen clinical-semantic features were recorded. The specific characteristics were as follows: (1) age, (2) gender, (3) nodule diameters averaged from the long-axis and short-axis diameters, (4) nodule types, (5) nodule location, (6) shape, (7) lobulation, (8) spiculation, (9) cavitation, (10) vacuole, (11) air bronchograms, (12) pleural traction, (13) vascular convergence, and (14) obstructive pneumonia.
Nodule delineation and radiomics feature extraction
The region of interest for each pulmonary lesion was semi-automatically delineated using the Samm Base plugin (https://github.com/bingogome/samm) in 3D Slicer software (Version 5.6.2). The examples of delineations are shown in Fig. 1. Two radiologists evaluated the segmentation results and manually adjusted the delineations. Before radiomics feature extraction, delineated CT data was resampled to 1 × 1 × 1 mm voxel size to ensure consistent spatial resolution and improve the robustness of the extracted features. Pyradiomics (Version 3.1.0) was then applied to extract radiomics features including first-order features, shape features, grey level co-occurrence matrix features, grey level size zone matrix features, grey level run length matrix features, grey level dependence matrix features, and neighbourhood grey tone difference matrix features. As a consequence, a total of 1834 radiomics features were obtained after extraction. To find robust radiomics features for model development, we conducted a consistency evaluation between radiologists using the intraclass correlation coefficient (ICC). A subset comprising one-third of the cases was randomly selected. After two month, intra-observer and inter-observer segmentation reproducibility were performed on the subset by two radiologists separately [17]. Ultimately, 1465 radiomics features with an ICC > 0.75 were retained for the experiment [21].
Model development
In order to explore the feasibility of predicting high-grade patterns and predominant subtypes for IASLC grading, we developed two distinct two-step models, with one based on radiomic features and the other on clinical-semantic features. To further evaluate model effectiveness, we compared these two-step models with two one-step models that directly assessed IASLC grades using either clinical-semantic or radiomic features. The detailed development process of the two-step and one-step models is outlined below. The basic concepts of the four models are shown in Fig. 2.
Development of two-step models
the two-step model for IASLC grading was developed using two submodels. The first submodel assessed whether the proportion of high-grade patterns was ≥ 20%, while the second submodel differentiated between lepidic and acinar/papillary predominant subtypes. The IASLC grading of invasive pulmonary adenocarcinoma was then determined based on the estimations from these two submodels. In this analysis, two types of two-step models were established. One model consisted of two submodels that used clinical-semantic features as input, whereas the other type also comprising two submodels relied on radiomic features derived from nodule delineation.
The submodels using clinical-semantic were built using weighted logistic regression, where the class weight was calculated as the total sample size divided by the product of the sample size of a specific class and the total number of classes. Classes with larger quantities were assigned smaller weights, while those with smaller quantities were assigned larger weights. The introduced class weights can mitigate the impact of the data imbalance on model performance. Variates including age, gender, diameter and semantic CT features were used. Univariate analysis was applied to preliminarily identify potential predictive features. Any input feature with a p-value smaller than 0.1 in the univariate analysis was included for further analysis [22]. Subsequently, we conducted a multivariate analysis using the backward likelihood ratio method to construct the final submodels.
Regarding the submodels using radiomics features, minimum redundancy maximum relevance (mRMR), least absolute shrinkage and selection operator (LASSO), and logistic regression methods were applied. Initially, all radiomics features were standardized using the Z-score normalization method. The mRMR approach was then employed to process the radiomics features, identifying the top 30 most significant features with the highest correlation to the outcomes. Subsequently, the LASSO algorithm, combined with ten-fold cross-validation, was used to determine the optimal subset of predictive features and their corresponding coefficients. These optimal features were used to construct the radiomics submodels via weighted binary logistic regression. For each lung nodule, the radiomics feature score (Rad-score) of a submodel was derived by linearly combining the selected features, weighted by their respective coefficients.
Development of one-step models
for comparisons, we also developed two types of one-step models that used only clinical-semantic or radiomics features for IASLC grading. The model using clinical-semantic features was built using ordinal logistic regression with class weights to alleviate data imbalance. Univariate ordinal logistic regression with a proportional odds ratio was first applied to select potential features for predicting IASLC grades. Then the multivariate ordinal logistic regression with stepwise selection was employed to construct the final one-step clinical-semantic model. In contrast, the one-step model using radiomics features was developed by using the mRMR, LASSO, and multivariate ordinal logistic regression methods with class weights to mitigate data imbalance. Compared to the two-step models, the one-step models directly assessed IASLC grades without predicting the proportion of high-grade patterns or predominant subtypes.
Statistical analysis and evaluation metrics
Statistical analyses were conducted using Python (version 3.11.7). Group comparisons for categorical variables in the training and validation sets were performed using the chi-squared test, while continuous variables were analyzed using either the t-test or the Kruskal-Wallis H test [23,24,25], depending on the data distribution. Model performance was evaluated by comparing the area under the curve (AUC), sensitivity, specificity and F1 scores. The optimal threshold in the receiver operating characteristic curve for classification was determined by selecting the threshold that maximized the sum of sensitivity and specificity. At this optimal threshold, sensitivity was calculated as the ratio of true positives to the sum of true positives and false negatives, while specificity was calculated as the ratio of true negatives to the sum of true negatives and false positives. The DeLong test applied to assess differences in AUC between models [26]. The Cohen’s kappa test was used to assess the classification agreement between the models and the pathologist’s IASLC grading. The clinical utility of the model was evaluated through decision curve analysis, which involved assessing net benefits across a range of threshold probabilities in the validation cohort. A two-tailed p-value of less than 0.05 was considered statistically significant.
Results
Patient characteristics
After data selection, 529 lung nodules remained in the study. The lung nodules were randomly divided into training and validation sets in a 7:3 ratio. The detailed flow chart for data selection is shown in Fig. 3. Patient characteristics in the training and validation sets for IASLC grading are shown in Table 1. Patient characteristics for the prediction of high-grade patterns and predominant subtypes are described in Tables S1 and S2. The inter-reader agreement for semantic feature classifications are shown in Table S3. For IASLC grading, significant differences were observed in diameter, types, spiculation, air bronchograms, pleural traction and obstructive pneumonia among the three groups in the training and validation datasets. There was no significant difference found in cavitation and vascular convergence among the groups in both datasets.
Two-step models
In the development of clinical-semantic submodels in two-step models, univariable analysis showed that age, gender, diameter, types, nodule location, spiculation, cavitation, vacuole, pleural traction, obstructive pneumonia had the potential to be predictive in assessing whether the proportion of high-grade patterns was greater than or equal to 20% in invasive non-mucinous pulmonary adenocarcinoma. In contrast, age, diameter, types, and air bronchograms had potential be predictive for the discrimination between lepidic and acinar/papillary predominant subtypes. After performing multivariate logistic regression analysis, diameter, types, spiculation and vacuole emerged as statistically significant features for high-grade pattern prediction, whereas types was the only significant feature for predominant subtype prediction. The detailed feature selection results univariate and multivariate analysis are shown in Tables S4 and S5.
In the development of radiomics submodels in two-step models, the analysis of the mRMR, LASSO and multivariate binary logistic regression approaches found four features (wavelet-LLL_gldm_LargeDependenceHighGrayLevelEmphasis, lbp-2D_firstorder_Entropy, logarithm_ngtdm_Strength, exponential_firstorder_RobustMeanAbsoluteDeviation) were identified as statistically significant for the prediction of high-grade patterns greater than or equal to 20%. In contrast, a similar set of six features (exponential_firstorder_RobustMeanAbsoluteDeviation, wavelet-LLL_gldm_LargeDependenceHighGrayLevelEmphasis. logarithm_firstorder_Skewness, logarithm_glcm_DifferenceAverage, square_gldm_LargeDependenceLowGrayLevelEmphasis, logarithm_glcm_Idn) were selected to construct the radiomics submodel for the prediction of predominant subtypes. These radiomics features were used to build calculation equations for the prediction scores of high-grade patterns and predominant subtypes, which were presented in the supplementary file. Besides, we also tried to combine both significant clinical-semantic features and radiomic prediction scores to build submodels for both tasks employing using logistic regression with backward selection. However, only radiomics prediction scores were retained.
One-step models
For the one-step model using clinical-semantic features, the univariable ordinal logistic regression analysis showed that age, gender, diameter, types, nodule location, spiculation, cavitation, vacuole, air bronchograms, pleural traction, obstructive pneumonia were independent variables for differentiating IASLC grading. The final model only included types after the stepwise multivariable ordinal logistic regression analysis. Regarding the one-step model using radiomics features, the analysis of the mRMR, LASSO and multivariate ordinal logistic regression methods selected the predictive radiomics features including exponential_firstorder_RobustMeanAbsoluteDeviation, wavelet-LLL_glrlm_HighGrayLevelRunEmphasis, and logarithm_firstorder_Skewness to construct the final model for IASLC grading. The parallel regression assumption was also not violated. The Rad-score in the one-step model using radiomics features was shown in the supplementary.
Performance of submodels in two-step models
The AUC performance of the clinical-semantic and radiomics submodels in the two-step models are shown in Fig. 4. The radiomics submodel had significantly higher AUCs than the clinical-semantic submodel for high grade pattern prediction (0.95 vs. 0.77, p < 0.001; 0.96 vs. 0.77, p < 0.001) in both the training and validation sets. For the prediction of predominant subtypes, the radiomics submodel significantly outperformed the clinical-semantic submodel in the training set and showed no significant difference in performance on the validation set (p = 0.52).
Performance comparison between two-step and one-step models
We calculated the performance of the two-step and one-step models using either radiomics features or clinical-semantic features for IASLC grading, as shown in Table 2. When differentiating IASLC grades, the two-step model using radiomics features yielded the highest AUC values. The AUCs for the models using radiomics features in grade 3 were higher than those for the models using clinical-semantic features regardless of the model steps. In the comparison of one-step and two-step models using radiomics features for IASLC grading, the two-step model was found to outperform the one-step model in predicting grades 2 and 3 (0.88 vs. 0.63, p < 0.001, 0.95 vs. 0.82, p < 0.001; 0.85 vs. 0.53, p < 0.001; 0.96 vs. 0.81, p < 0.001) in both datasets. Similarly, the two-step model had significantly better performance than the one-step model when using clinical-semantic features as input for the classification of grades 2 and 3 (all p < 0.001). We also assessed the classification agreement between the models and the pathologist’s IASLC grading in Table 2. The table showed that compared to the two-step model using clinical-semantic features or the one-step models, the two-step model using radiomics achieved an overall higher classification agreement with the pathologist in IASLC grading. The agreement value of the two-step radiomics model exceeded 0.7 for grade 3, indicating the potential of the model for accurately identifying high-grade cases and assisting in preoperative treatment planning.
For the three IASLC grades, the two-step radiomics model showed better net benefit than the other models, as shown in Fig. 5. In particular, the two-step radiomics model for grade 3 achieved good net benefits across various thresholds, highlighting its potential to identify high-grade patients and providing valuable insights for clinical decisions regarding neoadjuvant therapy and lymph node dissection before surgery. Additionally, the two-step models, whether using radiomics or clinical-semantics features, outperformed the one-step models when using the same input features. This may reflect the superiority of predicting high-grade patterns and predominant subtypes in two steps for IASLC grading.
Discussion
We developed a two-step radiomics model for the differentiation of IASLC grades in invasive non-mucinous pulmonary adenocarcinoma. The model yielded accurate results on the prediction of high grade patterns > 20% and predominant subtypes, which results in high AUC scores of 0.95, 0.85 and 0.96 on the validation set for three category IASLC grading. The results suggested that the proposed model may assist radiologists in the stratification of patients according to different IASLC grades, thereby supporting treatment planning prior to surgery.
Studies have shown that radiomics features on CT images can describe predict the presence of high-grade patterns [8, 27]. For example, Chen et al. utilized cases with nearly pure lung adenocarcinoma to develop a radiomics method [27]. The method can assess micropapillary and solid components using patch-wise image analysis with an AUC of 0.86. Another study built and compared four machine learning algorithms with radiomics features for the prediction of presence of micropapillary or solid patterns [28]. The best performed algorithm achieved an AUC of 0.75 in the internal validation. Compared to their studies, our radiomics submodel yielded a higher AUC of 0.96 on the validation set. This might be explained by the fact that predicting high-grade patterns > 20% was easier than assessing the presence of high-grade patterns. Other researchers also studied if the radiomics method was able to predict predominant subtypes. The research from Yang et al. showed radiomics features can be used to stratify near-pure lepidic, papillary, acinar, micropapillary, and solid subtypes in lung adenocarcinoma [29]. Similarly, Park et al. found that using CT-based radiomics features differentiated the predominant lepidic subtype from acinar or papillary subtypes [30], achieving an AUC of 0.86, which was comparable to the AUC of our radiomics submodel at 0.78. Unlike their studies, we further utilized the prediction results of predominant subtypes along with the information regarding whether high-grade patterns exceeded 20% to achieve IASLC grading.
Accurate IASLC grading of lung adenocarcinoma preoperative is essential to patients since it not only helps predict patient prognosis but also enables the formulation of the treatment approach. To attain this goal, Li proposed an ordinal radiomics method utilizing 682 nodules for IASLC grading. Their method was considered as a one-step model and tended to misclassify the grade 2 to grades 1 or 3 [18]. This is consistent with the results of the one-step radiomics model as illustrated in the model performance comparison. However, though a two-step strategy, our model using radiomics features was able to not maintain the high AUCs of grades 1 and 3, but also improve the performance in grade 2. In addition, our two-step radiomics model outperformed another radiomics method that was designed to directly select grade 3 patients [17], with AUC values of 0.95 vs. 0.82 and 0.96 vs. 0.81 on the training and validation sets, respectively. These findings somehow showed the superiority of our proposed two-step model with radiomics features for IASLC grading. In addition to our two-step radiomics model, deep learning can also have good performance in the classification of IASLC grades. For example, a study employed deep learning models including ResNet, VGG, and InceptionNet to perform IASLC grading [31]. Trained on CT data from 339 patients with invasive adenocarcinoma, their deep learning system achieved AUC values of 0.93, 0.80, and 0.95 for grades 1, 2, and 3 on a test set of 76 patients. However, deep learning often faces challenges in the interpretability of its predictions. In contrast, radiomics can provide a rad_score equation incorporating first-order features, which describe the statistical distribution of pixel intensities and transform-based features, such as wavelet-transformed characteristics, which capture multi-scale image information. By integrating these features, radiomics models can enhance interpretability, making them more accessible for both clinicians and patients and facilitating their clinical application.
We also explored the impact of CT acquisition variability on model performance. Table S7 shows that there was no difference in the AUC values of the two-step radiomics model between the two vendor settings. However, the F1 score of the model for grade 1 in the GE scans was significantly higher than that of the model in the Siemens scans. This might suggest that including more Siemens scans on grade 1 is needed to improve model performance.
The study also had some limitations. First, the data used in this study was obtained from a single centre. Difference in population, parameters of CT scanners and image reconstruction algorithms may affect the performance of the model applied on other cohorts. In the future study, we will collect data from other institutions to further optimize the model and validate it across diverse patient populations. Second, The study retrospectively collected data to develop a model for predicting IASLC grading. Further investigation is warranted to assess the accuracy of the model in prospective patient cohorts and to explore the correlation between grading predictions and patient outcomes. Third, we only considered semantic CT features and patient information including age and gender to develop the clinical-semantic models. Integrating other data, such as smoking history, lymphovascular invasion, and genomic alterations, may provide added value for better reflecting the relationship between invasive pulmonary adenocarcinoma and IASLC grades [32].
Conclusion
In conclusion, through the prediction of predominant subtypes and high-grade patterns, we presented a two-step radiomics model for the differentiation of IASLC grading in invasive pulmonary adenocarcinoma. The model had potential to provide accurate IASLC grading for patients before surgery.
Data availability
No datasets were generated or analysed during the current study.
References
Zheng S, van Ooijen PM, Oudkerk M. Lung cancer screening and nodule detection: the role of artificial intelligence. Artificial intelligence in cardiothoracic imaging. edn.: Springer; 2022. pp. 459–69.
Zhong Y, Cai C, Chen T, Gui H, Chen C, Deng J, Yang M, Yu B, Song Y, Wang T. PET/CT-based deep learning grading signature to optimize surgical decisions for clinical stage I invasive lung adenocarcinoma and biologic basis under its prediction: a multicenter study. Eur J Nucl Med Mol Imaging. 2024;51(2):521–34.
Puri V, Crabtree TD, Bell JM, Broderick SR, Morgensztern D, Colditz GA, Kreisel D, Krupnick AS, Patterson GA, Meyers BF. Treatment outcomes in stage I lung cancer: a comparison of surgery and stereotactic body radiation therapy. J Thorac Oncol. 2015;10(12):1776–84.
Raviv S, Hawkins KA, DeCamp MM Jr, Kalhan R. Lung cancer in chronic obstructive pulmonary disease: enhancing surgical options and outcomes. Am J Respir Crit Care Med. 2011;183(9):1138–46.
Uramoto H, Tanaka F. Recurrence after surgery in patients with NSCLC. Translational Lung cancer Res. 2014;3(4):242.
Bortolotto C, Lancia A, Stelitano C, Montesano M, Merizzoli E, Agustoni F, Stella G, Preda L. Filippi ARJEroat: radiomics features as predictive and prognostic biomarkers in NSCLC. Expert Rev Anticancer Ther. 2020;21(3):257–66.
Travis WD, Brambilla E, Nicholson AG, Yatabe Y, Austin JH, Beasley MB, Chirieac LR, Dacic S, Duhig E, Flieder DB. The 2015 world health organization classification of lung tumors: impact of genetic, clinical and radiologic advances since the 2004 classification. J Thorac Oncol. 2015;10(9):1243–60.
Song SH, Park H, Lee G, Lee HY, Sohn I, Kim HS, Lee SH, Jeong JY, Kim J, Lee KS. Imaging phenotyping using radiomics to predict micropapillary pattern within lung adenocarcinoma. J Thorac Oncol. 2017;12(4):624–32.
Borczuk AC. Prognostic considerations of the new world health organization classification of lung adenocarcinoma. Eur Respiratory Rev. 2016;25(142):364–71.
Marx A, Chan JK, Coindre J-M, Detterbeck F, Girard N, Harris NL, Jaffe ES, Kurrer MO, Marom EM, Moreira AL. The 2015 world health organization classification of tumors of the thymus: continuity and changes. J Thorac Oncol. 2015;10(10):1383–95.
Zhang R, Hu G, Qiu J, Wu H, Fu W, Feng Y, Zhang M, Chen C, Sun J, Zhang Y. Clinical significance of the cribriform pattern in invasive adenocarcinoma of the lung. J Clin Pathol. 2019;72(10):682–8.
Lee G, Lee HY, Jeong JY, Han J, Cha MJ, Lee KS, Kim J, Shim YM. Clinical impact of minimal micropapillary pattern in invasive lung adenocarcinoma: prognostic significance and survival outcomes. Am J Surg Pathol. 2015;39(5):660–6.
Moreira AL, Ocampo PS, Xia Y, Zhong H, Russell PA, Minami Y, Cooper WA, Yoshida A, Bubendorf L, Papotti M. A grading system for invasive pulmonary adenocarcinoma: a proposal from the international association for the study of lung cancer pathology committee. J Thorac Oncol. 2020;15(10):1599–610.
Deng C, Zheng Q, Zhang Y, Jin Y, Shen X, Nie X, Fu F, Ma X, Ma Z, Wen Z. Validation of the novel international association for the study of lung cancer grading system for invasive pulmonary adenocarcinoma and association with common driver mutations. J Thorac Oncol. 2021;16(10):1684–93.
Rokutan-Kurata M, Yoshizawa A, Ueno K, Nakajima N, Terada K, Hamaji M, Sonobe M, Menju T, Date H, Morita S. Validation study of the international association for the study of lung cancer histologic grading system of invasive lung adenocarcinoma. J Thorac Oncol. 2021;16(10):1753–8.
Ohtani-Kim SJ-y, Taki T, Tane K, Miyoshi T, Samejima J, Aokage K, Nagasaki Y, Kojima M, Sakashita S, Watanabe R. Efficacy of preoperative biopsy in predicting the newly proposed histologic grade of resected lung adenocarcinoma. Mod Pathol. 2023;36(9):100209.
Yang Z, Cai Y, Chen Y, Ai Z, Chen F, Wang H, Han Q, Feng Q, Xiang Z. A CT-based radiomics nomogram combined with clinic-radiological characteristics for preoperative prediction of the novel IASLC grading of invasive pulmonary adenocarcinoma. Acad Radiol. 2023;30(9):1946–61.
Li Y, Liu J, Yang X, Wang A, Zang C, Wang L, He C, Lin L, Qing H, Ren J. An ordinal radiomic model to predict the differentiation grade of invasive non-mucinous pulmonary adenocarcinoma based on low-dose computed tomography in lung cancer screening. Eur Radiol. 2023;33(5):3072–82.
Nicholson AG, Tsao MS, Beasley MB, Borczuk AC, Brambilla E, Cooper WA, Dacic S, Jain D, Kerr KM, Lantuejoul S. The 2021 WHO classification of lung tumors: impact of advances since 2015. J Thorac Oncol. 2022;17(3):362–87.
Tan KS, Reiner A, Emoto K, Eguchi T, Takahashi Y, Aly RG, Rekhtman N, Adusumilli PS, Travis WD. Novel insights into the international association for the study of lung cancer grading system for lung adenocarcinoma. Mod Pathol. 2024;37(7):100520.
Yang C, Huang M, Li S, Chen J, Yang Y, Qin N, Huang D, Shu J. Radiomics model of magnetic resonance imaging for predicting pathological grading and lymph node metastases of extrahepatic cholangiocarcinoma. Cancer Lett. 2020;470:1–7.
Song JW, Do K-H, Kim M-Y, Jang SJ, Colby TV, Kim DS. Pathologic and radiologic differences between idiopathic and collagen vascular disease-related usual interstitial pneumonia. Chest. 2009;136(1):23–30.
Plackett RL. Karl pearson and the chi-squared test. Int Stat Review/revue Int De Statistique 1983:59–72.
Kim TK. T test as a parametric statistic. Korean J Anesthesiology. 2015;68(6):540–6.
Ostertagova E, Ostertag O, Kováč J. Methodology and application of the Kruskal-Wallis test. Appl Mech Mater. 2014;611:115–20.
DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics 1988:837–45.
Chen L-W, Yang S-M, Wang H-J, Chen Y-C, Lin M-W, Hsieh M-S, Song H-L, Ko H-J, Chen C-M, Chang Y-C. Prediction of micropapillary and solid pattern in lung adenocarcinoma using radiomic values extracted from near-pure histopathological subtypes. Eur Radiol. 2021;31:5127–38.
He B, Song Y, Wang L, Wang T, She Y, Hou L, Zhang L, Wu C, Babu BA, Bagci U. A machine learning-based prediction of the micropapillary/solid growth pattern in invasive lung adenocarcinoma with radiomics. Translational Lung Cancer Res. 2021;10(2):955.
Yang S-M, Chen L-W, Wang H-J, Chen L-R, Lor K-L, Chen Y-C, Lin M-W, Hsieh M-S, Chen J-S, Chang Y-C. Extraction of radiomic values from lung adenocarcinoma with near-pure subtypes in the international association for the study of lung Cancer/the American thoracic society/the European respiratory society (IASLC/ATS/ERS) classification. Lung Cancer. 2018;119:56–63.
Park S, Lee SM, Noh HN, Hwang HJ, Kim S, Do K-H, Seo JB. Differentiation of predominant subtypes of lung adenocarcinoma using a quantitative radiomics approach on CT. Eur Radiol. 2020;30:4883–92.
Zhou J, Hu B, Feng W, Zhang Z, Fu X, Shao H, Wang H, Jin L, Ai S, Ji Y. An ensemble deep learning model for risk stratification of invasive lung adenocarcinoma using thin-slice CT. NPJ Digit Med. 2023;6(1):119.
Ahn B, Yoon S, Kim D, Chun S-M, Lee G, Kim H-R, Jang SJ, Hwang HS. Clinicopathologic and genomic features of high-grade pattern and their subclasses in lung adenocarcinoma. Lung Cancer. 2022;170:176–84.
Acknowledgements
We thank the high performance computing platform of Tianjin Medical University for support.
Funding
This study was funded by grants from the National Natural Science Foundation of China (Grant No. 82302180, and 82171932), Ministry of Science and Technology of China (Grant No. 2024ZD0520002), Chinese National Key Research and Development Project (Grant No. 2021YFC2500402 and 2021YFC2500400), National Health Commission Capacity Building and Continuing Education Center (Grant No. YXFSC2022JJSJ011),Tianjin Key Medical Discipline (Specialty) Construction Project (Grant No. TJYXZDXK-010 A), Scientific Developing Foundation of Tianjin Education Commission (Grant No. 2024KJ182).
Author information
Authors and Affiliations
Contributions
Conception and design: S.Z., J. L, J. X. Acquisition of data: K.B., J.L. Analysis and interpretation of data: S.Z., J. L., J.X. Statistical analysis: S.Z., J. L. Drafting the article: S.Z., J. L. Critically revising the article: all authors. Reviewed and approved submitted version of manuscript: all authors. Study supervision: Z.Y., D.Y., X. C.
Corresponding authors
Ethics declarations
Ethics approval and consent to participate
The study received approval from the medical ethical committee of the Tianjin Medical University Cancer Institute and Hospital (EK2023044).
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
Abbreviations
IASLC International Association for the Study of Lung Cancer Pathology Committee.
ADC invasive non-mucinous adenocarcinoma.
PS predominant subtypes.
HGP high-grade patterns.
AUC Area Under the Curve.
ICC Intraclass Correlation Coefficient.
mRMR Minimum Redundancy Maximum Relevance.
LASSO Least Absolute Shrinkage and Selection Operator.
OR Odds Ratio.
CI Confidence Interval.
Author details
1Tianjin Medical University Cancer Institute and Hospital, National Clinical Research Center for Cancer, Tianjin’s Clinical Research Center for Cancer, State Key Laboratory of Druggability Evaluation and Systematic Translational Medicine, Tianjin Key Laboratory of Digestive Cancer, Key Laboratory of Cancer Prevention and Therapy, Department of Radiology, Tianjin, China. 2Tianjin Medical University Cancer Institute and Hospital, Key Laboratory of Cancer Prevention and Therapy, National Clinical Research Center of Cancer, Department of Lung Cancer, Lung Cancer Center, Tianjin, China. 3The Second Hospital of Shanxi Medical University, Department of Radiology, Taiyuan, China. 4Zhejiang University, College of Computer Science and Technology, Hangzhou, China. 5Tianjin University of Traditional Chinese Medicine, School of Public Health, Tianjin, China.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Zheng, S., Liu, J., Xie, J. et al. Differentiating high-grade patterns and predominant subtypes for IASLC grading in invasive pulmonary adenocarcinoma using radiomics and clinical-semantic features. Cancer Imaging 25, 42 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s40644-025-00864-2
Received:
Accepted:
Published:
DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s40644-025-00864-2