An Improved Weighted KNN Algorithm About Text Classification Based on Spark Framework

Resource Type: Conference
Authors: Yang, Tianming; Du, Shaobo
Source: 2022 IEEE 10th International Conference on Information, Communication and Networks (ICICN), pp. 655-661, Aug. 2022
Subject: Communication, Networking and Broadcast Technologies
Dimensionality reduction
Text categorization
Classification algorithms
Sparks
Parallel algorithms
k-nearest neighbor
text classification
weighted KNN
spark
Hadoop
parallelization

Online Access

Full Text (IEEE)

Abstract

The k-nearest neighbor (KNN) classification algorithm handles classification problems quickly, but when calculating similarity it assigns the same weight to all distances and ignores the stronger influence that small distances have on classification accuracy. Its efficiency also depends on the number of samples and the number of dimensions, and degrades as either grows. This paper therefore proposes an improved weighted KNN classification algorithm based on the Spark framework, which improves running efficiency by cutting the sample data and reducing its dimensionality. Experimental results show that the algorithm achieves better accuracy and a better speedup ratio than the parallel algorithm based on the Hadoop platform, and can process large-scale text data quickly and accurately.
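
The abstract does not give the weighting function, so the following is only a minimal sketch of the general idea of distance-weighted KNN, not the authors' algorithm. It assumes cosine distance over dense term vectors and an inverse-distance weight `1 / (distance + eps)`; the function names, the toy data, and `eps` are all hypothetical. It runs on a single machine and does not attempt the Spark parallelization, sample cutting, or dimensionality reduction described in the paper.

```python
import math
from collections import Counter, defaultdict


def cosine_distance(a, b):
    """Cosine distance (1 - cosine similarity) between two dense vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 1.0  # treat an all-zero vector as having no similarity
    return 1.0 - dot / (norm_a * norm_b)


def k_nearest(train, query, k):
    """Return the k (distance, label) pairs closest to `query`."""
    scored = ((cosine_distance(vec, query), label) for vec, label in train)
    return sorted(scored, key=lambda pair: pair[0])[:k]


def majority_knn_predict(train, query, k=3):
    """Plain KNN: every neighbor gets the same vote, whatever its distance."""
    labels = [label for _, label in k_nearest(train, query, k)]
    return Counter(labels).most_common(1)[0][0]


def weighted_knn_predict(train, query, k=3, eps=1e-9):
    """Weighted KNN: closer neighbors get larger votes.

    The weight 1 / (distance + eps) is an assumed choice for illustration;
    the paper's actual weighting scheme is not stated in the abstract.
    """
    votes = defaultdict(float)
    for dist, label in k_nearest(train, query, k):
        votes[label] += 1.0 / (dist + eps)
    return max(votes, key=votes.get)


# Toy data (hypothetical): one very close "sports" document and two
# moderately close "tech" documents.
TRAIN = [
    ([1.0, 0.05], "sports"),
    ([1.0, 1.0], "tech"),
    ([1.0, 1.2], "tech"),
    ([0.0, 1.0], "sports"),
]
QUERY = [1.0, 0.0]

print("majority:", majority_knn_predict(TRAIN, QUERY, k=3))
print("weighted:", weighted_knn_predict(TRAIN, QUERY, k=3))
```

On this toy data the equal-weight vote returns `tech` (two of the three nearest neighbors), while the weighted vote returns `sports`, because the single very close neighbor outweighs the two more distant ones. This is the effect of small distances that the abstract says plain KNN ignores.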
