Why using Feature Scaling ?
Check this data set for example:
Position | Level | Salary |
---|---|---|
Business Analyst | 1 | 45000 |
Junior Consultant | 2 | 50000 |
Senior Consultant | 3 | 60000 |
Manager | 4 | 80000 |
Country Manager | 5 | 110000 |
Region Manager | 6 | 150000 |
Partner | 7 | 200000 |
Senior Partner | 8 | 300000 |
C-level | 9 | 500000 |
CEO | 10 | 1000000 |
The math behind basic Feature Scaling:
Here is the formula for standard scaling (z-score normalization):
$$ z = \frac{x - \mu}{\sigma} $$
where:
Standard scaling, also known as z-score normalization, is a technique used to standardize the range of independent variables or features of data. The goal is to transform the data to have a mean of 0 and a standard deviation of 1. This is particularly important for machine learning algorithms like Support Vector Regression (SVR), which are sensitive to the scale of input features. By applying this transformation, the machine ensure that each feature contributes equally to the distance calculations, improving model performance and convergence.