Cancer of the voice box, medically known as laryngeal cancer, remains a significant global health challenge, affecting over a million people worldwide each year. In 2021 alone, approximately 1.1 million new cases were reported, alongside nearly 100,000 deaths attributed directly to this disease. Traditionally, risk factors such as persistent smoking, chronic alcohol abuse, and human papillomavirus (HPV) infection have been linked to the onset and progression of laryngeal malignancies. Despite advancements in medical interventions, survival rates fluctuate dramatically between 35% and 78% depending on how early the disease is detected and treated, as well as its precise anatomical location within the larynx.
Early detection has long been recognized as the fundamental determinant for improved prognosis in laryngeal cancer. However, current diagnostic protocols rely on invasive procedures such as video nasal endoscopy combined with surgical biopsies. These methods, while effective, pose challenges including patient discomfort and logistical delays caused by the need to schedule and access specialized clinical services. These barriers often result in postponed diagnoses, thereby negatively impacting treatment outcomes. But a groundbreaking new study, published in the journal Frontiers in Digital Health, suggests a revolutionary alternative: detecting vocal fold abnormalities through non-invasive voice recordings analyzed with artificial intelligence (AI).
The research team, led by Dr. Phillip Jenkins of Oregon Health & Science University, has demonstrated that subtle changes in vocal acoustic patterns can serve as early biomarkers for vocal fold lesions, encompassing both benign conditions like nodules and polyps, as well as potential precursors of laryngeal cancer. This discovery hinges on the principle that structural and physiological changes in the vocal folds directly influence voice quality, altering measurable parameters such as pitch, tone, and clarity. By leveraging machine learning algorithms, these vocal alterations can be identified and classified without the need for cumbersome clinical instruments.
.adsslot_mziMEeJjUa{width:728px !important;height:90px !important;}
@media(max-width:1199px){ .adsslot_mziMEeJjUa{width:468px !important;height:60px !important;}
}
@media(max-width:767px){ .adsslot_mziMEeJjUa{width:320px !important;height:50px !important;}
}
ADVERTISEMENT
Central to this investigation was the Bridge2AI-Voice project, a critical component of the broader US National Institutes of Health’s Bridge to Artificial Intelligence consortium. This ambitious initiative aims to harness AI technologies to tackle increasingly complex biomedical problems. For the study, the researchers curated and analyzed the first public version of the Bridge2AI-Voice dataset, which comprises over 12,500 voice recordings from 306 participants spanning North America. Among these participants were individuals diagnosed with laryngeal cancer, those with benign vocal fold lesions, and patients suffering from other voice box disorders like spasmodic dysphonia and unilateral vocal fold paralysis.
The analysis focused intensively on acoustic features that have known correlations with vocal fold physiology. These included the fundamental frequency, often perceived as pitch; jitter, which quantifies variations in pitch during sustained phonation; shimmer, representing amplitude fluctuations; and the harmonic-to-noise ratio (HNR), a metric that distinguishes between periodic and aperiodic sound components in voice signals. Each of these parameters reflects intricate aspects of how the vocal folds vibrate and how airflow is modulated during speech.
Among male participants, the team observed pronounced differences in both the harmonic-to-noise ratio and fundamental frequency when comparing healthy individuals, those with benign lesions, and patients with diagnosed laryngeal cancer. This finding is particularly notable because higher HNR values generally indicate clearer, more periodic vibrations of the vocal folds, while reductions often point to pathological changes. Interestingly, the study did not identify similarly significant acoustic markers among female participants, a limitation the researchers attributed to the smaller sample size or potentially differing pathophysiological manifestations of vocal fold disorders in women.
While these results are preliminary, the implications are profound. The ability to monitor HNR and related vocal biomarkers non-invasively opens unprecedented avenues for routine, cost-effective screening of high-risk populations. Imagine a future where patients can simply submit a voice recording via a smartphone app, and AI algorithms instantly assess their risk for vocal fold lesions or early-stage cancer. Such developments could democratize laryngeal cancer diagnostics, particularly in underserved areas with limited access to otolaryngology specialists.
Dr. Jenkins elaborated on the significance of these findings, emphasizing the promise of ethical, large-scale datasets like Bridge2AI-Voice for training robust AI models. “Our study demonstrates that vocal biomarkers can differentiate individuals with vocal fold pathology from healthy controls, at least among men,” he noted. “This paves the way toward integrating voice analysis into routine clinical workflows and remote monitoring platforms.”
Of course, several hurdles remain before these AI tools can be implemented clinically. Foremost among them is the need to expand dataset sizes significantly, especially to include more female participants and diverse demographic groups, ensuring that predictive models are fair and generalizable. Moreover, clinical validation in real-world healthcare settings is crucial to confirm the sensitivity, specificity, and overall reliability of AI-driven voice diagnostics.
Looking ahead, the research team plans to refine their algorithms and incorporate professional voice pathology assessments to annotate larger voice datasets accurately. This labeling process will enhance machine learning training efficiency and improve diagnostic precision. Concurrently, pilot testing within hospital and outpatient clinics will help identify practical challenges and guide integration strategies.
Voice-based health technologies are not entirely novel — pilot programs have explored their utility in detecting conditions ranging from Parkinson’s disease to respiratory infections. Yet applying these tools for early cancer detection, particularly in the voice box, represents an exciting frontier. Given the global burden of laryngeal cancer and its often devastating consequences, such innovations could transform patient care paradigms, facilitating timely treatment and improving survival outcomes.
In a broader context, the success of the Bridge2AI consortium underscores the transformative potential of artificial intelligence in biomedical research. By linking data science experts, clinicians, and engineers in collaborative networks, complex diseases can be understood and confronted more effectively than ever before. The leap from proof-of-principle studies to clinical-grade applications, while challenging, increasingly appears as an attainable goal.
In summary, the recent findings affirm that human voice carries rich diagnostic information beyond mere communication. As research progresses, vocal biomarkers analyzed through AI promise to evolve into powerful, non-invasive tools for detecting benign and malignant vocal fold lesions. Such progress aligns with contemporary moves toward personalized, accessible healthcare, harnessing everyday technology to save lives and reduce suffering caused by laryngeal cancer.
Subject of Research: People
Article Title: Voice as a Biomarker: Exploratory Analysis for Benign and Malignant Vocal Fold Lesions
News Publication Date: 12-Aug-2025
Web References:
https://www.frontiersin.org/journals/digital-health/articles/10.3389/fdgth.2025.1609811/full
References:
DOI: 10.3389/fdgth.2025.1609811
Keywords:
laryngeal cancer, vocal fold lesions, voice biomarker, artificial intelligence, harmonic-to-noise ratio, fundamental frequency, Bridge2AI consortium, early cancer detection, voice analysis, machine learning
Tags: advancements in cancer detection technologyAI voice analysis for cancer detectionchallenges in laryngeal cancer diagnosisearly detection of laryngeal cancerimproving cancer prognosis through early detectioninnovative approaches to cancer screeninglaryngeal cancer risk factorsnon-invasive cancer diagnosis methodsreducing discomfort in cancer diagnosisrole of artificial intelligence in healthcarevoice box cancer statisticsvoice recordings for medical analysis