This paper is published in Volume-11, Issue-3, 2025
Area
Engineering And Technology
Author
Nachiket Parjane, Kartik Patare, Rohan Ingle, Renuka Wakhare
Org/Univ
G.H. Raisoni College of Engineering and Management, Pune, India
Keywords
Whoosh, Query, Indexing
Citations
IEEE
Nachiket Parjane, Kartik Patare, Rohan Ingle, Renuka Wakhare. Deep Search: An Intelligent File Searching through Content Analysis, International Journal of Advance Research, Ideas and Innovations in Technology, www.IJARIIT.com.
APA
Nachiket Parjane, Kartik Patare, Rohan Ingle, Renuka Wakhare (2025). Deep Search: An Intelligent File Searching through Content Analysis. International Journal of Advance Research, Ideas and Innovations in Technology, 11(3) www.IJARIIT.com.
MLA
Nachiket Parjane, Kartik Patare, Rohan Ingle, Renuka Wakhare. "Deep Search: An Intelligent File Searching through Content Analysis." International Journal of Advance Research, Ideas and Innovations in Technology 11.3 (2025). www.IJARIIT.com.
Nachiket Parjane, Kartik Patare, Rohan Ingle, Renuka Wakhare. Deep Search: An Intelligent File Searching through Content Analysis, International Journal of Advance Research, Ideas and Innovations in Technology, www.IJARIIT.com.
APA
Nachiket Parjane, Kartik Patare, Rohan Ingle, Renuka Wakhare (2025). Deep Search: An Intelligent File Searching through Content Analysis. International Journal of Advance Research, Ideas and Innovations in Technology, 11(3) www.IJARIIT.com.
MLA
Nachiket Parjane, Kartik Patare, Rohan Ingle, Renuka Wakhare. "Deep Search: An Intelligent File Searching through Content Analysis." International Journal of Advance Research, Ideas and Innovations in Technology 11.3 (2025). www.IJARIIT.com.
Abstract
Real-time full-text search holds essential value in current digital libraries because it helps users find documents with content rather than names [1]. Users can perform content-based searches that reveal files through the extraction of textual contents within documents and optimize the retrieval process for research databases as well as legal document search and enterprise knowledge management solutions [2]. A full-text search technique-powered document retrieval system, which seeks to create a content-based file searching mechanism, is analyzed within this report. These search systems implement ranking as their main step to evaluate document relevance through the combination of term frequency and document length analysis with inverse document frequency factors [3]. The foundation for improving search accuracy and efficiency depends heavily on knowledge about directory creation as well as score calculation methods. The research also explores performance comparison between Whoosh and Elasticsearch regarding their scaling capabilities and their abilities to index data and respond to search queries and rank results [4]. Whoosh functions best for compact document sets, yet Elasticsearch delivers real-time search functionality for extensive data collections. The final report will present the most effective solution for creating a content-based search system with high performance levels for various application domains.