Clustering of cancer text documents in the medical field using machine learning heuristics
Downloads
Published
DOI:
https://doi.org/10.58414/SCIENTIFICTEMPER.2025.16.5.06Keywords:
Machine learning, soft computing paradigm, cancer text documents, redundancy reductionDimensions Badge
Issue
Section
License
Copyright (c) 2025 The Scientific Temper

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
The data clustering over medical text documents plays a major role in extracting relevant information from the documents. However, most of the methods fails in finding the accurate solution on finding the relevant cancer type due to the presence of redundant data items. It is hence necessary to develop a clustering framework that strictly eliminates the redundant data items. In this paper, we present a clustering framework that tends to accurately cluster the cancer text documents to predict what type of cancer is present in a patient. A large database is tested and clustering using the machine learning model. The clustering framework consists of pre-processing the text documents, feature extraction, feature selection and clustering. The clustering using multi-support vector machine enables optimal clustering of text documents. The cancer datasets is used to validate the models over various medline cancer documents dataset. The experimental validation shows improved clustering of documents using the proposed models than other methods.Abstract
How to Cite
Downloads
Similar Articles
- Pravin P. P, J. Arunshankar, Development of digital twin for PMDC motor control loop , The Scientific Temper: Vol. 14 No. 01 (2023): The Scientific Temper
You may also start an advanced similarity search for this article.

