Rising Influence of Artificial Intelligence: Trends in Large Language Model Usage in Dermatology Residency Personal Statements

May 2025 | Volume 24 | Issue 5 | 544 | Copyright © May 2025


Published online April 22, 2025

doi:10.36849/JDD.8914

Priya Patel Housley MSa, Valeria Fadhel-Hernandez BSb, Marjorie Montanez-Wiscovich MD PhDc, Dan Neal MSd, Mary Bohannon MHSc, Matthew A. Schiefer PhDe

aUniversity of Florida College of Medicine, Gainesville, FL
bSan Juan Bautista School of Medicine, Caguas, PR
cDept of Dermatology, University of Florida College of Medicine, Gainesville, FL
dDepartment of Vascular Surgery, University of Florida College of Medicine, Gainesville, FL
eMalcom Randall Department of Veterans Affairs Medical Center; Dept of Biomedical Engineering, University of Florida, Gainesville, FL

Abstract
With the public release of ChatGPT before the 2024 dermatology residency application cycle, applicants gained access to an advanced language model capable of generating or enhancing personal statements. This study examines trends in artificial intelligence (AI)-generated content in dermatology residency applications from 2022 to 2024 and its implications for program directors. Deidentified personal statements (n=1500) and control essays were analyzed using Quillbot, Scribbr, and ZeroGPT. A significant increase in AI-generated content was observed in the 2024 cycle compared to 2022 and 2023. Statistical tests, including ANOVA, Kruskal-Wallis, and MANOVA, confirmed year-specific differences in AI detection scores (Wilks' Lambda, P<0.0001). These findings highlight increasing reliance on AI tools, and future research should explore stakeholder perspectives to inform regulatory frameworks for AI usage and disclosures in residency applications.

Citation: Patel Housley P, Fadhel-Hernandez V, Montanez-Wiscovich M, et al. Rising influence of artificial intelligence: trends in large language model usage in dermatology residency personal statements. J Drugs Dermatol. 2025;24(5):544-547. doi:10.36849/JDD.8914

INTRODUCTION

ChatGPT (OpenAI), a widely used large language model (LLM), was publicly released prior to the 2024 dermatology residency application cycle, allowing applicants access to an advanced tool capable of generating or enhancing their personal statements in a human-like manner.1 Personal statements are one way residency programs identify applicants for interviews.2 However, studies in specialties such as anesthesiology and plastic surgery suggest that program directors have difficulties differentiating between AI-generated and human-written essays, which raises questions about the use of LLMs in applications.3,4 This manuscript investigates trends in LLM AI usage in personal statements to dermatology residency and explores ethical considerations of transparency in the application process.

MATERIALS AND METHODS

We deidentified and compiled dermatology personal statements submitted through ERAS during the 2022, 2023, and 2024 application cycles. We used AI to generate positive control essays using the version of ChatGPT available to applicants during their application cycle: ChatGPT-3 for 2022 and ChatGPT-3.5 for 2023 and 2024. Both models were prompted with, "Create a personal statement for someone applying to dermatology residency incorporating a person's life experiences." Personal statements known to be human-written from prior application cycles were used as negative controls.

We analyzed the applicant's personal statements (n=1500), AI-generated positive controls (n=20), and human-written negative controls (n=9) using 3 AI detection software: ZeroGPT, Quillbot, and Scribbr.5,6,7 These tools were selected due to their reported accuracy and availability at no cost. ZeroGPT and Quillbot reported the percentage of text that was AI-generated and Scribbr reported the percent chance that the text was AI-generated.

To attain a broader understanding of the data and differences between application cycles, multiple statistical analyses were conducted in Minitab. To explore if the application cycle influenced AI detection scores across each detection software,