Decision Trees
Definition
- A decision tree is a flow-chart-like tree structure
- Internal node denotes a test on an attribute (feature)
- Branch represents an outcome of the test
- All records in a branch have the same value for the tested attribute
- Leaf node represents class label or class label distribution
| Decision Tree |
Advantages - Common type of classifier
- Predictive accuracy: Captures complex patterns.
- Speed: Faster than many others to build. Very quick to apply model.
- Robustness: Can handle noise and missing values.
- Scalability: Some implementations are scalable.
- Interpretability: Very readable. To classify an instance, walk your way down the tree following the rules.
Decision Tree Classification task
Example
The provided image is an excellent example of a Decision Tree (DT) model, a fundamental concept in machine learning. On the left is the training data table, which contains several features (age, income, student, credit_rating) and a target variable (buys_computer). The goal is to predict whether a person will buy a computer based on their characteristics.
For example, to predict if a person buys a computer, the tree first checks their age. If their age is <=30, it then checks if they are a student. If they are a student, the tree predicts "yes" (they will buy a computer). This process of recursively partitioning the data based on the most informative features is the core mechanism of decision tree algorithms, such as the ID3 method by Quinlan mentioned in the image.
Geometric interpretation
Next up -
Comments
Post a Comment