Model Evaluation

Carlos Salas

Portfolio Manager and Data Scientist

Evaluating model performance for accuracy is crucial. But models are a simplified version of reality and are prone to model error. Join Carlos Salas in this video as he explores the 3 kinds of model error and how to find the right balance.

Data Science

Subscribe to watch

Access this and all of the content on our platform by signing up for a 7-day free trial.

Model Evaluation

7 mins 46 secs

Key learning objectives:

Identify the 3 types of model error
Understand how to fix model errors

Overview:

Measuring a model’s performance is one of the most important aspects of the data science workflow. There are 3 types of model error to be aware of: model bias (originating from erroneous assumptions in the learning algorithm), model variance (how sensitive the model is to small fluctuations in the utilised training dataset) and random error. The right path for a data scientist is to always try to find a model that has a balance between accuracy and precision, called the bias-variance trade-off. Fixing high bias can be achieved by adding more input features or by increasing the complexity of the model. Fixing high variance models can be achieved by using more training data and reducing the number of features so that only the most important are considered.

Subscribe to watch

Access this and all of the content on our platform by signing up for a 7-day free trial.

Summary

What are the 3 types of model error?

1. Model bias. An error originating from erroneous assumptions in the learning algorithm. Models suffering from high bias can cause an algorithm to miss the relevant relations between features and the response variable, commonly referred to as underfitting.

2. Model variance. This is related to the instability of the model - how sensitive the model is to small fluctuations in the utilised training dataset. Models suffering from high variance can cause an algorithm to miss relevant relations, commonly referred to as overfitting.

3. Random error. A model is a simplification of reality and can never be perfect, so it will therefore always suffer from random errors. A different way of interpreting the random error is by looking at it as a measure of the amount of data noise present.

How can you fix model error?

As random error is inevitable, the only two error components that can be minimised are bias and variance. The right path for a data scientist is to always try to find a model that has a balance between accuracy and precision, called the bias-variance trade-off.

Fixing high bias can be achieved by adding more input features or by increasing the complexity of the model. Fixing high variance models can be achieved by using more training data and reducing the number of features so that only the most important are considered.

Subscribe to watch

Access this and all of the content on our platform by signing up for a 7-day free trial.

Carlos Salas

Carlos Salas is a professional investor passionate about the lifelong development of an investment process that blends man and machine. Over the last 15 years, he has worked in investment roles for firms such as Santander AM, BNP Paribas, Jefferies, and LCAM. He is currently pursuing three careers simultaneously - as an investment manager, consultant and lecturer.

There are no available Videos from "Carlos Salas"