p.1399
p.1404
p.1408
p.1412
p.1416
p.1420
p.1425
p.1430
p.1436
Feature Selection and Optimization of Random Forest Modeling
Abstract:
Traditional random forest algorithm is difficult to achieve very good effect for the classification of small sample data set. Because in the process of repeated random selection, selection sample is little, resulting in trees with very small degree of difference, which floods right decisions, makes bigger generalization error of the model, and the predict rate is reduced. For the sample size of sepsis cases data, this paper adopts for parameters used in random forest modeling interval division choice; divide feature interval into high correlation and uncertain correlation intervals; select data from two intervals respectively for modeling. Eventually reduce model generalization error, and improve accuracy of prediction.
Info:
Periodical:
Pages:
1416-1419
Citation:
Online since:
November 2014
Authors:
Keywords:
Price:
Permissions: