Clustering Techniques for Text Documents in Vernacular Languages: A Review

Mukesh Kumar, Amandeep Verma

Published: Jan 4, 2024

Abstract

Clustering is an unsupervised machine learning technique that groups similar objects into classes without prior knowledge of those classes. These classes of similar objects are known as clusters. Clustering divides a data set into a number of clusters such that intra-cluster similarity is high and inter-cluster similarity is low. This paper presents a literature survey of clustering techniques for text documents in vernacular languages, with particular attention to studies on text documents in Gurmukhi script. The purpose of the review is to explore text clustering techniques and to support researchers in future work.
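
As an illustration of the intra-cluster versus inter-cluster criterion described above, the following minimal sketch computes both quantities for a toy corpus. It is not taken from the paper: the bag-of-words representation, whitespace tokenization, cosine similarity, the sample documents, and the helper names (`bag_of_words`, `cosine`, `mean_similarities`) are all assumptions chosen for brevity. Text in Gurmukhi script would typically need language-specific tokenization and stemming.

```python
import math
from collections import Counter


def bag_of_words(text):
    """Map lowercased, whitespace-separated tokens to their counts (assumed representation)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def _mean(values):
    return sum(values) / len(values) if values else 0.0


def mean_similarities(docs, labels):
    """Return (mean intra-cluster, mean inter-cluster) similarity over all document pairs."""
    vecs = [bag_of_words(d) for d in docs]
    intra, inter = [], []
    for i in range(len(docs)):
        for j in range(i + 1, len(docs)):
            sim = cosine(vecs[i], vecs[j])
            if labels[i] == labels[j]:
                intra.append(sim)
            else:
                inter.append(sim)
    return _mean(intra), _mean(inter)


# Hypothetical toy corpus with a hand-assigned cluster label per document.
docs = [
    "cricket match score team",
    "team wins cricket match",
    "election vote party minister",
    "minister wins election vote",
]
labels = [0, 0, 1, 1]

intra, inter = mean_similarities(docs, labels)
print(f"intra-cluster: {intra:.4f}, inter-cluster: {inter:.4f}")
```

A good clustering in the sense of the abstract is one for which the first number is clearly larger than the second. A clustering algorithm searches for the label assignment that achieves this, rather than being given the labels as in this sketch.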

Issue

Vol. 45 No. 01 (2024)

Section

Articles
