Query Refinement using Latent Dirichlet Allocation

Authors

  • Dhiraj Poudel Nepal College of Information Technology
  • Ashim Khadka Nepal College of Information Technology
  • Rishav Dahal Nepal College of Information Technology
  • Sandhya Gotame Nepal College of Information Technology
  • Binit Bikram KC Nepal College of Information Technology

DOI:

https://doi.org/10.65091/icicset.v2i1.15

Keywords:

Semantic Query Refinement, Latent Dirichlet Allocation, Topic Modeling, Unsupervised Learning

Abstract

This study introduces a query refinement technique
that leverages Latent Dirichlect Algorithm ( LDA ) and coher
ence techniques. The proposed method falls under unsupervised
learning, firstly preprocessing large text corpus, applying topic
modelling via LDA and finding the optimal number of topics
using coherence scores eventually leading to the topics which
are meaningful and relevant to the text corpus. We made use
of coherence measures to make sure the topics extracted from
the corpus are relevant to each other and understandable. The
results demonstrated that the method was successful in topic
extraction and modelling of large text corpus. However, the
results significantly vary on the quality of text corpus on which
the method is being applied.

Downloads

Published

2025-12-24

How to Cite

[1]
D. Poudel, A. Khadka, R. Dahal, S. Gotame, and B. B. KC, “Query Refinement using Latent Dirichlet Allocation”, ICICSET2025, vol. 2, no. 1, Dec. 2025.