This paper is published in Volume-12, Issue-3, 2026
Area
Artificial Intelligence / Machine Learning
Author
Syed Naseemtaj, Syed Nafeesa Thehseen
Org/Univ
Kandula Lakshumma Memorial College of Engineering for Women, Andhra Pradesh, India
Pub. Date
02 June, 2026
Paper ID
V12I3-1201
Publisher
Keywords
Lung Cancer Detection, Breath Analysis, Volatile Organic Compounds (VOCS), Artificial Intelligence, Machine Learning, Ensemble Learning, Feature Selection.

Citationsacebook

IEEE
Syed Naseemtaj, Syed Nafeesa Thehseen. AI-Driven Breath Analysis for Early Lung Cancer Detection Using Optimized Ensemble Learning of VOC Biomarkers, International Journal of Advance Research, Ideas and Innovations in Technology, www.IJARIIT.com.

APA
Syed Naseemtaj, Syed Nafeesa Thehseen (2026). AI-Driven Breath Analysis for Early Lung Cancer Detection Using Optimized Ensemble Learning of VOC Biomarkers. International Journal of Advance Research, Ideas and Innovations in Technology, 12(3) www.IJARIIT.com.

MLA
Syed Naseemtaj, Syed Nafeesa Thehseen. "AI-Driven Breath Analysis for Early Lung Cancer Detection Using Optimized Ensemble Learning of VOC Biomarkers." International Journal of Advance Research, Ideas and Innovations in Technology 12.3 (2026). www.IJARIIT.com.

Abstract

Lung cancer is one of the most prevalent and deadly diseases worldwide, primarily due to late-stage diagnosis and the limitations of conventional detection methods. Early and accurate identification is crucial for improving patient survival rates. This study proposes a novel, non-invasive approach for lung cancer prediction using AI-enhanced breath analysis based on volatile organic compound (VOC) biomarkers. The proposed system utilizes sensor-based breath data to capture VOC patterns associated with lung cancer. Advanced Preprocessing and feature selection techniques are applied to identify the most relevant biomarkers, reducing data complexity and improving model efficiency. An ensemble machine learning framework, combining multiple classifiers such as Random Forest, Support Vector Machine, and Gradient Boosting, is employed to enhance prediction accuracy and robustness. Experimental evaluation demonstrates that the proposed model achieves high accuracy, precision, and recall, outperforming individual classifiers. The system also shows strong potential for real-time implementation due to its computational efficiency. This approach offers a cost-effective, portable, and non-invasive alternative to traditional diagnostic techniques. Overall, the integration of VOC biomarker analysis with feature-selected ensemble learning provides a promising solution for early lung cancer detection, paving the way for improved screening methods and better clinical outcomes.