Publications

2026

McCarthy, Colin J, Vijay Ramalingam, Yiftach Barash, Seetharam Chadalavada, Xiao Wu, Oleksandra Kutsenko, Daniel Raskin, Vera Sorin, and Ammar Sarwar. (2026) 2026. “Decreasing Administrative Effort Related to Non-Approval of Image-GuidED Procedures Using Large Language Models - The DENIED-AI Pilot Study.”. Academic Radiology. https://doi.org/10.1016/j.acra.2026.04.021.

RATIONALE AND OBJECTIVES: To evaluate whether large language models (LLMs) can generate accurate, clinically valid, and usable letters to appeal insurance denials for radiology procedures.

MATERIALS AND METHODS: This pilot study generated insurance appeal letters for a simulated clinical scenario. Four LLMs (Claude 3.5, Nova Pro, Llama-3.1-70B, ChatGPT-4o) were used with zero-shot, few-shot, and retrieval-augmented generation (RAG) techniques. Four board-certified interventional radiologists, blinded to model and technique, scored letters for content (accuracy, personalization, references), grammar and structure (readability, tone, persuasiveness), and usability (estimated editing time, usefulness as a template). References were verified for accuracy, and outputs were carefully examined for hallucinations. Statistical analyses included ANOVA, Chi-square, and Fleiss' Kappa for interrater reliability.

RESULTS: Mean content and grammar scores were 3.9 ± 0.95 and 4.3 ± 0.9 (out of 5), with no significant differences by model or technique (p >.05). Reviewer agreement was poor (Fleiss' Kappa -0.18 for content, -0.085 for grammar). Hallucinations were flagged by reviewers in 16/48 assessments, significantly more often with the online model (ChatGPT-4o: 58% vs offline 25%; p =.03). Of 44 references, 80% from the offline models were fabricated compared with 29% from ChatGPT-4o (p <.001). Estimated editing time was less than 10 min in 71% of responses, and the reviewers felt the letters would be useful as templates in 73% of cases.

CONCLUSION: LLM-generated appeal letters for insurance denials were generally well received, with high usability and adequate quality. However, fabricated references and hallucinations remain prevalent, necessitating careful human review before clinical use.

von Wedel, Dario, Simone Redaelli, Maxime Fosset, Joris Pensier, Denys Shay, Elena Ahrens, Luca J Wachtendorf, et al. (2026) 2026. “The Predicted Body Weight Equation Overestimates Lung Sizes of Female, Critically Ill Patients: An Analysis of Randomized, Controlled Trials and Real-World Clinical Data.”. Intensive Care Medicine. https://doi.org/10.1007/s00134-026-08442-1.

PURPOSE: Low tidal volume (Vt) ventilation is the standard of care among critically ill patients. Guidelines recommend scaling Vt to the predicted body weight (PBW) to avoid ventilator-induced lung injury (VILI). Concerns exist that the PBW overestimates lung volumes of critically ill females. We investigated whether this applies to clinically relevant measures of lung volume, whether PBW-guided mechanical ventilation yields comparable risk of lung stress among male and female patients, and whether this affects mortality.

METHODS: Mechanically ventilated, critically ill patients from ten randomized trials and two real-world retrospective clinical datasets were analyzed. Risk of high driving pressures (≥ 15 cmH2O) at comparable Vt/kg PBW as well as measures of anatomical and functional lung sizes, including computed tomography-measured lung volumes at the same PBW were compared between female and male patients.

RESULTS: Among 30,516 patients (39.4% female), ventilation with comparable tidal volumes standardized to PBW (ml/kg PBW) was associated with 4.2% (95% CI 3.2-5.3; aOR 1.26, 95% CI 1.19-1.33; p < 0.001) higher absolute risk of high driving pressures among females, mediating 8.4% of excess 28-day mortality (p < 0.001). At the same PBW, female patients had lower anatomical and aerated lung volumes (- 343 ml, 95% CI - 449 to - 237, p < 0.001; and - 188, 95% CI - 282 to - 94, p < 0.001, respectively) than males.

CONCLUSIONS: The widely used PBW equation overestimates lung volumes in female critically ill patients, resulting in excess risk of injurious driving pressures among females, mediating higher mortality. Personalized mechanical ventilation by using driving pressure-guided strategies might mitigate these disparities.

Metrouh, Oussama, Julie Bulman, Sarah Schroeppel DeBacker, Muneeb Ahmed, and Jeffrey Weinstein. (2026) 2026. “Does Residency Rank List Placement Predict Clinical Performance in Interventional Radiology Training?”. Academic Radiology. https://doi.org/10.1016/j.acra.2026.03.051.

RATIONALE AND OBJECTIVES: This study aimed to evaluate the correlation between Interventional Radiology (IR) trainees' clinical performance during residency and their final National Residency Matching Program (NRMP) rank order list (ROL) placement during the match application, and to identify application metrics predictive of strong clinical performance.

MATERIALS AND METHODS: A retrospective review of application data for IR residents and fellows graduating from a single academic center between 2020-2025 was conducted. Metrics included United States Medical Licensing Examination (USMLE) scores, number of clinical and research experiences, peer-reviewed publications, abstracts, awards and the final placement on the ROL. A structured survey aligned with Accreditation Council for Graduate Medical Education (ACGME) milestones was designed and distributed to faculty who directly worked with each trainee but were not involved in the program's NRMP rank order list formation to evaluate their clinical performance during IR training. Inter-rater reliability was assessed using intraclass correlation coefficients (ICCs), and a composite clinical score was calculated. Associations between application metrics, NRMP rank, and clinical performance were evaluated using Spearman correlation and univariate linear regression.

RESULTS: Moderate inter-rater reliability was observed for medical knowledge (ICC=0.60, p < 0.001), procedural competence (ICC=0.50, p < 0.001), and patient care (ICC = 0.50, p = 0.004). No significant correlation was found between NRMP rank list placement and clinical performance (Spearman ρ = 0.07, p = 0.82). USMLE Step 2 score was the only significant predictor of clinical performance (β = 0.17, p = 0.01), with the greatest separation observed at a cutoff score of 239.

CONCLUSION: These findings suggest that interview-driven rank placement may not reliably identify high-performing residents, whereas Step 2 scores may provide better predictive value for clinical performance during IR training.

Laimer, Gregor, Edward W Johnston, Christiaan G Overduin, Iwan Paolucci, Muneeb Ahmed, Ronald S Arellano, Marie Beermann, et al. (2026) 2026. “International Multisociety Delphi Consensus for Liver Tumour Thermal Ablation: Procedural and Practice Standards.”. The Lancet. Oncology 27 (5): e259-e270. https://doi.org/10.1016/S1470-2045(26)00114-2.

Thermal ablation offers a safer, less invasive, and more cost-effective curative-intent treatment for selected patients with primary and metastatic liver tumours than surgery; when done with appropriate technique, ablation can deliver similar oncological outcomes. However, effectiveness in routine practice varies because structured training, planning, and procedural governance remain scarce. These international multidisciplinary, multi-society guidelines-formally endorsed by the European Society of Surgical Oncology, the Cardiovascular and Interventional Radiological Society of Europe, and the Society of Interventional Oncology-define key domains contributing to procedural difficulty and practice variation in liver tumour thermal ablation. A Delphi consensus initiative held in Innsbruck, Austria, engaged 72 experts across three iterative rounds of scoring across 135 statements grouped into five domains: credentialing, indications, approach, procedural factors, and safety measures. Consensus was achieved for 94 (70%) of 135 statements. The least invasive route-typically percutaneous-should be prioritised, and margin adequacy was reaffirmed as the principal technical goal. Procedural difficulty was considered context-dependent, shaped by tumour factors, institutional infrastructure, and operator experience. Organ displacement techniques were endorsed to maintain safety and expand treatable indications. Complex ablations should be done by experienced operators (more than 100 previous cases), with programmes underpinned by structured training, multidisciplinary team participation, and routine audit. Future efforts should develop and validate practical tools such as difficulty scoring systems, standardised procedural reporting templates, and comprehensive training curricula to improve consistency, standardisation, and clinical outcomes globally.

Paolucci, Iwan, Christiaan G Overduin, Edward W Johnston, Gregor Laimer, Muneeb Ahmed, Ronald S Arellano, Marie Beerman, et al. (2026) 2026. “International Multisociety Delphi Consensus for Liver Tumour Thermal Ablation: Margin Assessment.”. The Lancet. Oncology 27 (5): e248-e258. https://doi.org/10.1016/S1470-2045(26)00143-9.

This multisociety, multidisciplinary consensus-formally endorsed by the European Society of Surgical Oncology, the Cardiovascular and Interventional Radiological Society of Europe, and the Society of Interventional Oncology-was developed to standardise the assessment of ablation margins in liver tumour thermal ablation. A modified Delphi process, consisting of two online surveys and a hybrid (online and in-person meeting in Innsbruk) consensus meeting of 72 experts from North America, South America, Europe, and Asia. Formal consensus was reached for 150 (75%) of 199 statements. Strong agreement was observed between interventional and surgical oncologists, with only 12 (6%) of 199 statements showing significantly different ratings. Participants agreed that ablation margins should be assessed and documented for every treated tumour. Margins should be assessed quantitatively in three dimensions, with contrast-enhanced CT or MRI, preferably intraprocedurally with ablation confirmation software. Ablation margins should be categorised as A0 (tumour completely covered with sufficient margin), A1 (tumour completely covered but insufficient margin), or A2 (portion of tumour remains unablated). This effort is, to our knowledge, the first international consensus initiative to define best-practice recommendations for margin assessment in liver tumour thermal ablation to standardise practices, aiming to improve and promote uniform outcomes.

Stechele, Matthias, Justin Amadi, Lukas Salvermoser, Lorenzo Sperlich, Jonathan Monin, Daniel Khademi, Moritz Nikolaus Gröper, et al. (2026) 2026. “Hepatic Radiofrequency Ablation Induces Widespread Cellular Activation Throughout the Liver.”. European Radiology Experimental 10 (1). https://doi.org/10.1186/s41747-026-00687-1.

OBJECTIVE: We investigated the extent of cellular, transcriptional, and translational activation throughout the liver following radiofrequency ablation (RFA).

MATERIALS AND METHODS: RFA of the healthy liver was performed in two 8-10-week-old male C57/Bl6 mice, no/sham procedure in one. One and 7 days after, single-cell RNA sequencing (scRNAseq) was performed on distant, untreated liver to examine > 6,000 genes from normalized datasets of > 6,000 cells/sample, enabling identification of ten major cell populations. We defined cell-to-cell interactions by CellphoneDB and identified active pathways via STRING-db analysis with Markov clustering. Twelve distant liver lobe samples were homogenized on day 3 or day 6 after RFA/sham procedure for SomaLogic proteomic analysis (> 1,300 genes), subsequent STRING-db analysis, and assessment of cellular origin (PanglaoDB-2021).

RESULTS: CellphoneDB identified crosstalk among all ten populations with 4,658 and 4,218 receptor/ligand pairs, identified on day 1 and day 7 post-RFA, respectively. On day 1, 360 differentially expressed genes were identified; on day 7, 430. Activated genes distributed into 16 clusters, including 66 chemokines/cytokines, including Ccl2 and Ccl7; 57 immunomodulators, including Il6, Ctla4 and Pdcd1; and 54 growth factors, including Vegf, Hgf, Pdgf, and Fgf. Angiogenesis pathway genes were observed in endothelial cells and hepatocytes. Pdcd1 and Ctla4 were notably increased transiently in T cells. Proteomic analysis included 228/443 genes (51%) identified by scRNAseq; 73/228 proteins (32%) demonstrated 25% elevation over controls. Overall, 427 proteins were elevated, with 9/10 cell populations contributing to increased protein expression (odds ratio 4.9‒7.0).

CONCLUSION: RFA diffusely activates cellular processes remotely from the ablation zone on both transcriptional and translational levels, altering tumorigenic and immunologic pathways simultaneously.

RELEVANCE STATEMENT: This study offers insights into liver tissue biology after RFA and provides a comprehensive picture of the molecular mechanisms put into motion by this procedure. A better understanding of these processes could provide a potential basis to develop specific biomarkers and effective adjuvant therapies following local tumor ablation.

KEY POINTS: RFA activates a multiplicity of hepatic cellular processes remotely from the ablation zone on a transcriptional and translational level. Single-cell RNA sequencing provides insights into widespread cellular origins of activated pro-immunogenic, pro-tumorigenic, and other pathways detected post-ablation. Consideration of the nature of this response may help achieve the clinical goals of adjuvant therapies and predictive biomarkers.

Panta, Om Biju, Hadiseh Kavandi, Yasir Andrabi, Natasha Larocque, Francesca Rigiroli, Alexander Brook, and Olga R Brook. (2026) 2026. “Clinical and Radiological Predictors of Surgical or Endovascular Intervention and Adverse Outcomes in Spontaneous Rectus Sheath Hematoma Patients.”. Abdominal Radiology (New York). https://doi.org/10.1007/s00261-026-05464-7.

PURPOSE: To determine clinical and radiographic parameters predictive of interventions and adverse outcomes in patients with spontaneous rectus sheath hematoma.

METHODS: This retrospective, institutional review board approved study identified 261 patients with spontaneous rectus sheath hematoma from January 2000 to August 2021 through CT report searches. Demographics, clinical presentation, comorbidities, medications, and laboratory values were collected. CT scans were reviewed for hematoma characteristics including size, location, density, and active contrast extravasation. The primary outcome was the invasive intervention; and the secondary outcomes included transfusion requirements, length of hospital stay, acute kidney injury, readmission, and mortality. Categorical variables were compared using Fisher's exact test, while continuous variables were analyzed using Student's t-test or Mann-Whitney U test as appropriate based on normality assessment.

RESULTS: The cohort comprised 261 patients (median age 71 (IQR, 60-78) years, 59% female). Invasive treatments were performed in 41 patients (15.7%): 35 underwent angiography with embolization; 6 had surgical exploration. Invasive treatment was significantly more frequent among patients with trans-umbilical extension (25.7% vs 9.4%; p < 0.001), active contrast extravasation (46.7% vs 33.0%, p < 0.001), and hematoma volume > 443.5 cc (31.8% vs 9.6%, p < 0.001)." These patients had higher red blood cell transfusion requirements (85.4% vs. 47.7%, p < 0.001) and longer hospital stays (7 vs. 4 days, p < 0.001). The overall in-hospital mortality was 6.9%, with 1.9% attributable to rectus sheath hematoma.

CONCLUSION: While most rectus sheath hematomas are managed conservatively, specific CT findings-trans-umbilical extension, larger volume, and active contrast extravasation-predict invasive intervention and guide clinical decision-making.

Rawson, James, V, Olga Brook, Danilo Sirias, and Omar Msto Hussain Nasser. (2026) 2026. “Application of the Theory of Constraints to Radiology.”. Radiographics : A Review Publication of the Radiological Society of North America, Inc 46 (4): e250101. https://doi.org/10.1148/rg.250101.

The central principle of the theory of constraints is that in any complex system there are only a few constraints that limit the performance (ie, "throughput") of the system. These constraints are rate-limiting steps to throughput. Once the constraint or bottleneck is identified, resources are used to improve the utilization rate at the point of the constraint to make the process as productive as possible. Others in the organization also must work to maintain the high utilization rate at the constraint. Strategies such as buffers are used to increase throughput at the constraints. Although it was initially developed in manufacturing, the theory of constraints has been applied to many industries. By using the theory of constraints to evaluate the radiologic workflow, constraints can be identified through targeted process improvement projects for optimization at steps that affect the total throughput of the system. Constraints can be physical, such as equipment or space (eg, imaging units or recovery room beds), or related to personnel. Because radiology is currently in a resource-constrained environment, targeted interventions with highly effective processes could improve productivity or flow. Given the workforce shortages in radiology, determining whether the constraint is equipment- or personnel-related may lead to different improvement projects.

Pantoja-Burbano, Omar A, Anisha Mittal, Molly N Pantelic, Nastaran Hosseini, and Katja N De Paepe. (2026) 2026. “Carney Triad.”. Radiographics : A Review Publication of the Radiological Society of North America, Inc 46 (4): e250235. https://doi.org/10.1148/rg.250235.