A Review on Classification of Data Imbalance using BigData

  International Journal of Managing Information Technology (IJMIT) WJCI Indexed

ISSN: 0975-5586 (Online); 0975-5926 (Print)

http://airccse.org/journal/ijmit/ijmit.html

Article:

A Review on Classification of Data Imbalance using BigData

Authors

Ramasubramanian and Hariharan Shanmugasundaram, Shadan Women’s College of Engineering and Technology, India

Abstract

Classification is one among the data mining function that assigns items in a collection to target categories or collection of data to provide more accurate predictions and analysis. Classification using supervised learning method aims to identify the category of the class to which a new data will fall under. With the advancement of technology and increase in the generation of real-time data from various sources like Internet, IoT and Social media it needs more processing and challenging. One such challenge in processing is data imbalance. In the imbalanced dataset, majority classes dominate over minority classes causing the machine learning classifiers to be more biased towards majority classes and also most classification algorithm predicts all the test data with majority classes. In this paper, the author analysis the data imbalance models using big data and classification algorithm.

Keywords

Data imbalance, Big data, IoT, Data analytics & Classification.

Paper URL

https://aircconline.com/ijmit/V13N3/13321ijmit02.pdf

Abstract URL

https://aircconline.com/abstract/ijmit/v13n3/13321ijmit02.html

Volume URL

https://airccse.org/journal/ijmit/vol13.html







Comments

Popular posts from this blog

Engineering Life Cycle Enables Penetration Testing and Cyber Operations

Top 20 Cited Research Articles in Information Management - 2021