Home Journals AMA_B Hybrid Clustering Algorithm ‘KCu’ for Combining the Features of K-Means and CURE Algorithm for Efficient Outliers Handling

JOURNAL METRICS

CiteScore 2019: 0.50 ℹCiteScore:

CiteScore is the number of citations received by a journal in one year to documents published in the three previous years, divided by the number of documents indexed in Scopus published in those same three years.

SCImago Journal Rank (SJR) 2019: 0.117 ℹSCImago Journal Rank (SJR):

The SJR is a size-independent prestige indicator that ranks journals by their 'average prestige per article'. It is based on the idea that 'all citations are not created equal'. SJR is a measure of scientific influence of journals that accounts for both the number of citations received by a journal and the importance or prestige of the journals where such citations come from It measures the scientific influence of the average article in a journal, it expresses how central to the global scientific discussion an average article of the journal is.

Source Normalized Impact per Paper (SNIP) 2019: 0.415 ℹSource Normalized Impact per Paper(SNIP):

SNIP measures a source’s contextual citation impact by weighting citations based on the total number of citations in a subject field. It helps you make a direct comparison of sources in different subject fields. SNIP takes into account characteristics of the source's subject field, which is the set of documents citing that source.

123.png

Hybrid Clustering Algorithm ‘KCu’ for Combining the Features of K-Means and CURE Algorithm for Efficient Outliers Handling

B. Renuka Devi^*| S. Pallam Setty

Department of CSE, Vignan’s Nirula Institute of Technology & Science for Women, Guntur 522005, Andhra Pradesh, India

Department of CS & SE, College of Engineering, Andhra University, Andhra Pradesh 530003, India

Corresponding Author Email:

dr.b.renukacse@gmail.com

Received:

26 April 2018

| |

Accepted:

2 June 2018

| | Citation

61.02_04.pdf

OPEN ACCESS

Abstract:

In the ongoing situation, the volume of information expands step by step. By the year 2020 the volume of Big Data would reach up to 40zb according to International Data Corporation (IDC). Big Data has turned out to be prevalent for handling, putting away and overseeing huge volumes of information. The grouping of datasets has turned into a testing issue in the field of Big Data examination; however, there are entanglements for applying conventional bunching calculations to huge information because of expanding the volume of information step by step. In this manuscript a new hybrid clustering algorithm, namely KCu to combine the features of both K-Means and CURE clustering algorithms is proposed. The proposed algorithm first applies k-means on data set and then applies CURE on resultant clusters from k-means. We experimented KCu and we show that, when compared to k-means and Cure. Which gives accurate results because of CURE? CURE can handle outliers and it gives non spherical shapes it is the disadvantage of other clustering algorithm.

Keywords:

big data, clustering, partitioning, hierarchical k-means, CURE hybrid algorithm

1. Introduction

2. Related Work

3. Clustering Techniques

4. Hybrid Clustering Method

5. Results

6. Conclusion

References

[1] Ramprasad R, Darshika GP. (2017). A fast and scalable FPGA-based parallel processing architecture for k-means clustering for big data analysis. IEEE.

[2] Liu C, Wang CZ, Hu JX, et al. (2017). Improved K-means algorithm based on hybrid rice optimization algorithm. IEEE 21-23.

[3] Xiong CQ, Hua Z, et al. (2016). An improved k-means text clustering algorithm by optimizing initial cluster centers. IEEE.

[4] Karimov J, Ozbayoglu M. (2015). Clustering quality improvement of k-means using a hybrid evolutionary model. Elsevier.

[5] Han JK, Luo M. (2014). Bootstrapping k-means for big data analysis. IEEE International Conference on Big Data.

[6] Anupama C, Suresh K. (2014). An improved k-means clustering algorithm: A step forward for removal of dependency on K. International conference on reliability. Optimization and Information Technology ICROIT 2014, India.

[7] Wang JT, Su XL. (2011). An improved k-means clustering algorithm. IEEE International Conference on Big Data.

[8] Shi N, Liu XM, et al. (2010). Research on k-means clustering algorithm an improved k-means clustering algorithm. IEEE.

[9] Makadiya KN. (2015). An enhance approach to improve cure clustering using appropriate linkage function for datasets. IJRCCE.

[10] Drias H, Cherif NF, Kechid A. (2016). K-MM: A hybrid clustering algorithm based on k-means and k-medoids. Springer.

[11] Wang HL, Zhou MT. (2012). A reﬁned rough k-means clustering with hybrid threshold. Springer.

[12] Kumar D, Bezdek JC. (2015). A hybrid approach to clustering in big data. IEEE Transactions on Cybernetics.

[13] Fahad A, Alshatri N, Tari Z. (2014). A Survey of clustering algorithms for big data: taxonomy & empirical analysis. IEEE Transactions.

IJHT
MMEP
ACSM
EJEE
ISI
I2M
JESA
RCMA
RIA
TS
IJSDP
IJSSE
IJDNE
JNMES
IJES
EESRJ
RCES
AMA_A
AMA_B
AMA_C
AMA_D
MMC_A
MMC_B
MMC_C
MMC_D

Username
Password
Remember me

Search form

Hybrid Clustering Algorithm ‘KCu’ for Combining the Features of K-Means and CURE Algorithm for Efficient Outliers Handling