An under-Sampled Approach for Handling Skewed Data Distribution using Cluster Disjuncts

Syed Ziaur Rahman

Keywords

classification
class imbalance
cluster disjunct
under sampling
MAJOR_CD

How to Cite

Syed Ziaur Rahman. (2014). An under-Sampled Approach for Handling Skewed Data Distribution using Cluster Disjuncts. Global Journal of Computer Science and Technology, 14(C7), 1–11. Retrieved from https://gjcst.com/index.php/gjcst/article/view/1183

Abstract

Data mining and knowledge discovery uncover hidden and valuable knowledge from data sources. Traditional knowledge discovery algorithms are bottlenecked by the wide range of available data sources. Class imbalance is one of the problems that arises when a data source provides unequal classes, i.e., examples of one class in a training data set vastly outnumber examples of the other class(es). Researchers have rigorously studied several techniques to alleviate the problem of class imbalance, including resampling algorithms and feature selection approaches. In this paper, we present a new hybrid framework, dubbed Majority Under-sampling based on Cluster Disjunct (MAJOR_CD), for learning from skewed training data. This algorithm provides a simpler and faster alternative by using the cluster disjunct concept. We conduct experiments using twelve UCI data sets from various application domains, using five algorithms for comparison on six evaluation metrics. The empirical study suggests that MAJOR_CD is effective in addressing the class imbalance problem.
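
The abstract does not spell out the MAJOR_CD procedure, so the following is only a minimal sketch of the general idea of cluster-based majority under-sampling, not the paper's algorithm. It assumes that the majority class is first grouped with plain k-means and that each cluster then contributes examples roughly in proportion to its size. The function names `kmeans_labels` and `undersample_majority` and the parameters `k`, `n_keep` and `seed` are hypothetical and chosen for illustration.

```python
import random
from collections import defaultdict


def kmeans_labels(points, k, iters=20, seed=0):
    """Plain k-means on tuples of floats; returns one cluster index per point."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # initial centers drawn from the data
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: nearest center by squared Euclidean distance.
        labels = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])))
            for p in points
        ]
        # Update step: move each non-empty cluster's center to its mean.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return labels


def undersample_majority(majority, n_keep, k=3, seed=0):
    """Keep at most n_keep majority examples, spread across k clusters.

    Assumption: proportional sampling per cluster stands in for the
    paper's cluster disjunct handling, which the abstract does not detail.
    """
    labels = kmeans_labels(majority, k, seed=seed)
    clusters = defaultdict(list)
    for p, lab in zip(majority, labels):
        clusters[lab].append(p)
    rng = random.Random(seed)
    kept = []
    for members in clusters.values():
        # Each non-empty cluster keeps at least one example.
        quota = max(1, round(n_keep * len(members) / len(majority)))
        kept.extend(rng.sample(members, min(quota, len(members))))
    return kept[:n_keep]  # trim if rounding overshoots the target


# Toy data: 90 majority points in three groups, 10 minority points.
data_rng = random.Random(1)
majority = [(cx + data_rng.random(), cy + data_rng.random())
            for cx, cy in [(0, 0), (5, 5), (10, 0)] for _ in range(30)]
minority = [(20 + data_rng.random(), 20 + data_rng.random()) for _ in range(10)]
reduced_majority = undersample_majority(majority, n_keep=len(minority))
```

Here `reduced_majority` holds no more majority examples than there are minority examples, so a classifier trained on `reduced_majority + minority` would see a roughly balanced class distribution. Because of rounding, the result may be slightly smaller than `n_keep`.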

This work is licensed under a Creative Commons Attribution 4.0 International License.
