info@biomedres.us   +1 (502) 904-2126   One Westbrook Corporate Center, Suite 300, Westchester, IL 60154, USA   Site Map
ISSN: 2574 -1241

Impact Factor : 0.548

  Submit Manuscript

Research ArticleOpen Access

Class Based Variable Importance for Medical Decision Making

Volume 1 - Issue 5

Danielle Baghernejad*

Received: September 16, 2017;   Published: October 12, 2017

DOI: 10.26717/BJSTR.2017.01.000431

Full Text PDF

To view the Full Article   Peer-reviewed Article PDF

Abstract

In this paper we explore variable importance within tree-based modeling, discussing its strengths and weaknesses with regard to medical inference and action ability. While variable importance is useful in understanding how strongly a variable influences a tree, it does not convey how variables relate to different classes of the target variable. Given that in the medical setting, both prediction and inference are important for successful machine learning, a new measure capturing variable importance with regards to classes is essential. A measure calculated from the paths of training instances through the tree is defined, and initial performance on benchmark datasets is explored.

Keywords : Machine learning; Tree-based modeling; Decision trees; Variable importance; Class Variable Importance

Abbreviations : CART: Classification and Regression Trees; CVI: Class Variable Importance; ET: Extra Trees; RF: Random Forests; GBT: Gradient Boosted Trees; AUC: Area under the Curve; ROC: Receiver Operating Characteristic

Abstract| Introduction| Generating a Decision Tree| Interpreting a Tree| Class Variable Importance| Performance on Benchmark Data| Conclusion| Acknowledgement| References|