A Comprehensive Survey on Automated Concept Extraction and Prerequisite Dependency Detection in Educational Texts

Aditi Patil; Riya Hankare; Shreya Araganji; Nidhi Chaudhari; Siddharth K. Gaikwad

doi:XX.XXX/IJARIIT-V12I2-1151

This paper is published in Volume-12, Issue-2, 2026

Paper Details
Abstract & PDF

Area

Computer Engineering

Author

Aditi Patil, Riya Hankare, Shreya Araganji, Nidhi Chaudhari, Siddharth K. Gaikwad

Org/Univ

COEP Technological University, Maharashtra, India

Pub. Date

16 March, 2026

Paper ID

V12I2-1151

Publisher

IJARIIT

Edition

Volume-12, Issue-2, 2026

Keywords

Concept Extraction, Prerequisite Learning, Educational Data Mining, Knowledge Graphs, Natural Language Processing, Large Language Models.

Citations

IEEE
Aditi Patil, Riya Hankare, Shreya Araganji, Nidhi Chaudhari, Siddharth K. Gaikwad. A Comprehensive Survey on Automated Concept Extraction and Prerequisite Dependency Detection in Educational Texts, International Journal of Advance Research, Ideas and Innovations in Technology, www.IJARIIT.com.

APA
Aditi Patil, Riya Hankare, Shreya Araganji, Nidhi Chaudhari, Siddharth K. Gaikwad (2026). A Comprehensive Survey on Automated Concept Extraction and Prerequisite Dependency Detection in Educational Texts. International Journal of Advance Research, Ideas and Innovations in Technology, 12(2) www.IJARIIT.com.

MLA
Aditi Patil, Riya Hankare, Shreya Araganji, Nidhi Chaudhari, Siddharth K. Gaikwad. "A Comprehensive Survey on Automated Concept Extraction and Prerequisite Dependency Detection in Educational Texts." International Journal of Advance Research, Ideas and Innovations in Technology 12.2 (2026). www.IJARIIT.com.

Give proper credits, use Citation.

Abstract

Automated extraction of concepts and detection of dependencies are important for analyzing educational texts and supporting applications like curriculum development, learning path suggestions, and intelligent tutoring systems. The aim of these tasks is to automatically extract key domain concepts from educational materials and find dependencies or prerequisites between them. Several approaches are put forward in recent studies, such as weakly supervised methods based on semantic embeddings and clustering, distant supervision techniques that make use of domain-specific glossaries, and supervised transformer-based models. Supervised models frequently exhibit high accuracy, but they rely significantly on large, manually labeled datasets. On the other hand, weakly and distantly supervised approaches drastically lower annotation costs, but they also face issues with seed quality, dictionary coverage, and cross-domain applicability. In order to guide future work in automated educational concept analysis, this study offers a thorough overview of current methods, frequently used datasets, assessment metrics, and recognized limits. It also addresses important issues and unresolved research gaps.

All content is copyright protected.