An Improved Parallel Algorithm for Text Categorization
- Resource Type
- Conference
- Authors
- Yang, Wenchuan; Fu, Yimin; Zhang, Dong
- Source
- 2016 International Symposium on Computer, Consumer and Control (IS3C) Computer, Consumer and Control (IS3C), 2016 International Symposium on. :451-454 Jul, 2016
- Subject
- Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Engineered Materials, Dielectrics and Plasmas
Engineering Profession
Fields, Waves and Electromagnetics
General Topics for Engineers
Photonics and Electrooptics
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Classification algorithms
Algorithm design and analysis
Text categorization
Clustering algorithms
Support vector machines
Computers
Data models
MapReduce
Rocchio
Text Classification
Filtering
- Language
This paper proposes an approach using MapReduce-based Rocchio relevance feedback algorithm, which improved the traditional Rocchio algorithm in the MapReduce paradigm, to resolve the problem of massive information filtering. Traditional text classification algorithms have vital impact on information filtering.