A Novel Ensemble Based Decision Tree Model For High Dimensional Biomedicine Data
Citation
Dande Rushalini, Mr. K. Vijay Kumar"A Novel Ensemble Based Decision Tree Model For High Dimensional Biomedicine Data", International Journal of Computer & organization Trends (IJCOT), V6(6):20-24 Nov - Dec 2016, ISSN:2249-2593, www.ijcotjournal.org. Published by Seventh Sense Research Group.
Abstract Knowledge discovery is an essantial mechanism for the intelligent data analysis to transform data in to meaningful information that will support for the decision making data mining approaches support automatic extraction of data and attempts to discover the hidden rules and patterns in data and also detect relevant decision rules from the high dimensional dataset. Classification from imbalanced data is significantly affected the performance of the algorithm due to noise and high dimensionality. Sparsity and high dimensionality of the classifier algorithm becomes a major problem in many traditional decision tree models on medical datasets. A novel decision trees allows estimating on topmost features to assess the class prior probability and estimates the chance of misleading false positive patterns. In this research work, a new framework is proposed by integrating random forest decision tree for pattern analysis. Experimental results show that proposed model has better accuracy compare to existing approaches.
References
[1]Quinlan J R,”Simplifying Decision Tree,” Internet Journal of Man-Machine Studies,1987,27,pp.221-234.
[2] Yang Xue-bing,Zhang Jun, ”Decision Tree Algorithm and its core technology”, techno Logy and development, 2013.
[3] Qu Kai-she ,Wen Cheng-li, Wang Jun-hong, ”An improved algorithm of ID3 algorithm,” Computer Engineering and Applications, 2003,(25),pp.104-107.
[4] Mao Cong-liYi Bo, ”The most simple decision tree generation algorithm based on decision-making degree of coordination ,“ Computer Engineering and Design,2008,29(5),pp.1250-1252.
[5] Huang Ai-hui,”Improvement and application of decision tree C4.5 algorithm ,“ Science Technology and Engineering,2009, (1),pp.34-37.
[6] J. Gehrke, R.Ramakrishnan, and V. Ganti, "Rainforest, a framework for fast decision tree construction of large datasets", in Springer Netherlands-Data mining and knowledge discovery vol.4. Issue(2-3) July 2000.
[7] M. Kantardzic “Data Mining. Concepts, Models, Methods and Algoritms”. John Wiley and Sons Inc, 2003.
[8] Xu.M.Wang, J.and Chen.T. “Improved decision tree algorithm: ID3+” Intelligent Computing in Signal Processing and Pattern Recognition, Vol.345, pp.141-149, 2006.
[9] Quinlan, J. R. “C4.5: Programs for Machine Learning” Morgan Kaufmann, San Mateo, CA 1993.
[10] Lewis, R.J. “An Introduction to Classification and Regression Tree (CART) Analysis” Annual Meeting of the Society for Academic Emergency Medicine, Francisco 2000.
[11] Ruoming Jin, Ge Yang and Gagan Agrawal, “Shared memory parallelization of Data mining algorithms: Techniques, Programming interface and Performance”, IEEE Transactions on Knowledge & data engineering, 2005.
[12] Song Xudong, Cheng Xiaolan “Decision tree Algorithm based on Sampling” IFIP International conference on Netwok and Parallel Computing-Workshops 2007.
Keywords
PSO model, Disease detection, Random forest, classification ,UCI repository.