The Reliability of Ultrasound Diagnosis in Differentiating Malignant from Benign Thyroid Nodules Using TI-RADS Selection Followed by FNA ()
1. Introduction
Thyroid nodular disease is a common healthcare problem, prevalent among 20% - 50% of the population [1]. However, thyroid malignancies are uncommon, only account for approximately 2% - 7% of all thyroid nodules [1] [2] [3]. Thyroid cancer incidence has increased in the last few decades, without a significant change in the disease mortality. Diagnostic imaging has increased the rate of thyroid nodules detection and improved utilization of fine-needle aspiration (FNA) [4]. Ultrasound (US) guided FNA using the Bethesda [5] pathological evaluation system is the gold standard for thyroid nodules evaluation because of its cost-effectiveness and accurate detection rate [6] - [11]. Early assessment of thyroid nodules using the US has become the most valuable noninvasive diagnostic tool [12]. American College of Radiology (ACR) provides an easy-to-apply method for radiologists and physicians for proper management and increasing consistency across ultrasound practices. Thyroid Imaging Reporting and Data System (TI-RADS) [13], using the classification system, offers reliable, feasible, practical, and non-invasiveness techniques to eliminate 26% - 53% of FNA cases [14] [15]. The aim of this study is to determine if using Ultrasound using TI-RADS selection can differentiate malignant from benign thyroid nodules confirmed by FNA. This will eliminate unnecessary invasive procedures for diagnostic purpose only in the management of thyroid nodules, thus avoiding unnecessary patients’ morbidities, reflecting positively over the net cost.
2. Methods and Materials
2.1. Patients
Using the ACR TI-RADS, we randomly selected and retrospectively evaluated nodules in 190 patients visiting King Khalid University Hospital (KKUH) from 2016 to 2020, for thyroid US and FNA. A total of 126 nodules were evaluated in 111 patients; the outcome showed the absence of primary malignancy. Approximately 88 nodules in 79 patients were evaluated. Exclusion criteria were: 1) nodules without FNA recommendation according to the ACR TI-RADS, either because of size and/or grade (69 nodules). 2) Nodules with indefinite cytology, including any BETHESDA that scores more than 2 or 6 (19 nodules). No other exclusion criteria were applied. Table 1 provides a summary of all the evaluated nodules.
2.2. Imaging and TI-RADS Scoring
Two expert radiologists with ultrasonographic imaging experience re-evaluated and reviewed the images using the ACR TI-RADS protocols. To eliminate evaluation bias, both reviewers were blinded to the FNA cytology results before evaluation. The ACR TI-RADS was used to evaluate the ultrasonographic images, using five imaging evaluation criteria. For each feature, the calculated points were summed to determine the FNA recommendation and malignancy risk levels of each nodule, from TI-RADS 1 (lowest risk) to TI-RADS 5 (highest risk) [13]. Figure 1 provides the ACR-TI-RADS assessment categories. To confirm all the selected nodules, we validated the corresponding US-guided FNA procedure images and reviewed its correlated pathological report.
Table 1. Summary of the total evaluated nodules after applying ACR-TI-RADS and reviewing the corresponding cytology results.
Figure 1. Assessment categories for thyroid nodules using ultrasound suggested by the ACR TI-RADS committee (*Originally published in JACR at: https://www.jacr.org/article/S1546-1440(17)30186-2/fulltext?_ga=2.188257452.525486094.1565030521-1858166925.1544129320).
2.3. FNA Technique and Cytology
Well-trained and experienced interventional radiology consultants conducted all FNA procedures using local anesthesia. The US scan and guided biopsy images were reviewed to confirm the biopsied nodules. All the cytological samples were evaluated using the BETHESDA scoring system; with 1 unsatisfactory, 2 benign, 3 typical cells, 4 follicular neoplasms, 5 suspicious for malignancy, and 6 malignancy [5].
3. Statistical Analysis
Study Population
Between 2016 and 2020, 190 patients with 214 nodules visiting King Khalid University Hospital were randomly (non-consecutive) selected and retrospectively reviewed. Following the ACR TI-RADS recommendations for FNA and correlating cytology reports, 88 nodules (41%) in 79 patients were excluded because the nodule size was smaller than the FNA recommended size, was of the improper initial nodule’s measurements, incorrect TI-RADS category, or inconclusive result from cytology. ACR TI-RADS reliability evaluation was conducted on the rest of the 111 patients (95 Females and 16 Males) with 126 thyroid nodules (13 patients 9 females and 4 males with 2 nodules, and 1 female patient with 3 nodules). The male mean age is 49.4 (mean ± SD 49.4 ± 17, maximum 69, and minimum 17); the female mean age is 47 (mean ± SD 47 ± 12.8, maximum 74, and minimum 21). Table 2 summarizes the patient’s demographic characteristics. The excluded nodules were 69 nodules (around 32%) not included in the evaluation criteria. However, 19 nodules (about 9%) were completely excluded owing to inconclusive pathology results. This study received approval from the Research Ethics Committee of Health Affairs in Riyadh, KSA (KSU-IRB 017E).
4. Results
Microsoft Excel was used to collect data. Afterward, we used SPSS version 22.0 for the analysis. The sensitivity, specificity, accuracy, precision, and negative predictive value (with a confidence interval of 95%) were conducted to determine the ACR TI-RADS reliability and differentiate benign from malignant thyroid nodules. A p-value < 0.05 was considered a significant level.
Table 2. Age and gender demographics of patients with FNA recommended by the ACR TI-RADS.
A significantly moderate correlation and very strong association between TI-RADS and FNA results were observed using Spearman rho correlation (rs(195) = 0.450, p < 0.001, φCramer = 0.520, p < 0.001). A Mann-Whitney U-test was used to evaluate the difference between the ages of patients with benign and malignant tumors. The result was significantly different (Mdn = 5241.5, 1724.5, p < 0.05).
No other significant correlation was found between age, gender, site, and malignant or benign lesions.
Of the total 214 nodules, 19 nodules (18 patients) were excluded owing to their inconclusive cytology result (any BETHESDA score but 2 and 6). Thus, 195 nodules in 172 patients were properly evaluated using the ACR TI-RADS with an overall malignancy of 14.4%, 28 nodules. Table 3 shows the percentage of malignancy in each category: 1.2% in TI-RADS 3 (1 out of 85), 14.1% in TI-RADS 4 (12 out of 85), and 60% in TI-RADS 5 (15 out of 25).
Following the ACR TI-RADS for FNA recommended selection, 27 nodules (21.4%) out of the recommended 126 nodules were consistent with malignancy in cytology, with overall mean sensitivities, specificities, accuracies, precisions, and negative predictive values (NPV) of 96.4%, 40.7%, 48.7%, 28.4%, and 98.6% respectively. The nodules were subdivided into the TI-RADS 3, 4, and 5. Table 4 summarizes the results of the ACR TI-RADS for malignancy detection, and Figure 2 plotted the ROC curve.
Figure 2. ROC curve was to evaluate the specificity and selectivity of the TI-RADS system.
Table 3. Total assessed ACR-TI-RADS nodules with conclusive cytology results.
Table 4. Thyroid nodule malignancy detection after the application of ACR-TI-RADS for nodule assessment, following FNA, recommended selection criteria.
TI-RADS 3 included 46 nodules (36.5%): 1 malignant nodule (2.2%) and 45 benign nodules (97.8%), showing sensitivity, specificity, accuracy, and precision with an NPV of 100%, 46.4%, 47.1%, 2.2% and 100%, respectively. TI-RADS 4 included 56 nodules (44.4%): 11 malignant nodules (19.6%) and 45 benign nodules (80.4%), showing sensitivity, specificity, accuracy, precision with an NPV of 91.7%, 38.4%, 45.9%, 19.6%, and 96.6%, respectively. TI-RADS 5 included 24 nodules (19.1%): 15 malignant nodules (62.5%) and 9 benign nodules (37.5%), showing sensitivity, specificity, accuracy, precision with NPV of 100%, 10%, 64%, 62.5%, and 100% respectively.
ROC curve was plotted to evaluate the specificity and selectivity of the TI-RADS system for the malignant thyroid nodules detection, AUC = 0.686 (95% CI, 0.598 - 0.773), p < 0.05.
5. Discussion
While excluding thyroid malignancy using FNA is a gold standard for suspicious thyroid nodules evaluation, it is proven to be user-dependent based on the 20% non-diagnostic results [6] [7] [8] [9]. The procedure exposes patients to multiple sample collection, cytological evaluation, and more stress to the families. Thus, improving FNA diagnostic accuracy was mandatory. In addition, the ACR TI-RADS evaluation criteria are more effective than other classification methods because it improves the diagnostic accuracy, decreases unnecessary sampling, cut costs, and improves overall patient care [14] [15] [16]. Many radiologists are still reluctant to use the ACR TI-RADS evaluation tool because they are either unfamiliar or hesitant with it. Some are simply uncomfortable to change the “routine,” thereby relying on personal experience. Thus, a simple and applicable selection criterion is critical to change the situation dramatically.
In our retrospective study, recent and old ultrasonographic images completed between 2016 and 2020 were selected using a customized system filter that was based on patients’ FNA history collected from their electronic files. Thus, cases might be subjected to selection and evaluation bias owing to the discrepancy in the included lesions versus excluded lesions (126 vs. 69) and evaluating cases of previous ACR TI-RADS classification (19 cases). For better accuracy and enhanced evaluation, we assigned well-trained and senior radiologists with over15 years of experience in diagnostic ultrasound to complete a blind review of the corresponding cytology reports. We reevaluated all the cases without further selection. Yet, cases in the lower ACR TI-RADS subcategories (TR-1 and TR-2) were minimal and could not be properly evaluated. However, the ACR and many recent studies concluded that the malignancy risk in TR-1 and TR-2 subcategories never exceeded 2% in the ACR partial analysis [13] and 0% of studies conducted by Warinthorn Phuttharak et al. [15] and Hoang et al. [17].
Our TI-RADS evaluation results were in agreement with the ACR TI-RADS paper that focused on malignancy risk stratification: Less than 5% in TR-3 (1.2%), from 5% to 20% in TR-4 (14.1%), and more than 20% in TR-5 (60%) [13]. Figure 3 and Figure 4 provide examples of the evaluated nodules with corresponding cytology and histopathology. The classification system eliminated the FNA procedures in many non-recommended cases, which exceeded 32 (32.4 %) out of the initially included lesions. If well implemented, the procedure significantly reduces the costs.
From the application of ACR TI-RADS, we did not recommend FNA in 69 nodules (in 63 patients), with only 1 small TI-RADS 4 malignant lesions (approximately 0.5% of the total evaluated lesions and 1.5% of the excluded lesions). Although this nodule fails to receive an FNA recommendation used in the ACR TI-RADS due to its small size, this nodule is categorized as a TI-RADS 4 nodule, and it measures 1 × 0.8 cm diameter. Hence, follow-up is not dismissed. Nonetheless, 62 patients went through a stressful procedure without proper imaging or medical justification, wasting medical and financial resources. In addition, the procedure is without standardized, clear diagnostic methods, and guidelines subjecting to many incorrect user-dependent judgmental discrepancies and resource misuse.
6. Conclusion
In conclusion, ACR TI-RADS is feasible, reliable, and well structured, easily applicable in thyroid nodules reporting. ACR TI-RADS can eliminate many unnecessary FNAs, providing a decline in costs and complications. We recommend the ACR TI-RADS in our radiology department to eliminate reporting discrepancies and cut costs, thereby standardizing the reports, improving intra-user agreements, and improving overall patients’ health care. Our future research topic is to find ancillary non-invasive modality to enhance the accuracy in differentiating malignant from benign thyroid nodules.
Disclosure
This work was not supported or funded by any drug or machine company.
Acknowledgements
We thank Professor Ammar C. Al Rikabi for his aid in acquiring and reporting cytology and histopathology images.