Clustering Techniques for Text Documents in Vernacular Languages: A Review
Main Article Content
Abstract
Clustering is an unsupervised machine learning technique which designates creation of classes with certain number of similar objects without prior knowledge. These classes of similar objects are known as clusters. The process of clustering divides the data sets into number of clusters in such a way that there is high intra-cluster similarity and low inter-cluster similarity. In this paper, a literature survey of clustering techniques for text documents in vernacular languages has been carried out and presented a review of various studies especially for the text documents in Gurmukhi script. The purpose of review of literature is to explore the text clustering techniques and to facilitate the researchers for the future inventions.
Article Details
Issue
Section
Articles