Starter quiz
- Extrapolation is when you estimate values that are ______ the range of available data
- 'outside' ✓
- Which values are obtained by interpolation on this scatter diagram?
- A ✓
- B
- C ✓
- D
-
- Select the regions where extrapolation can be used to make estimations on this scatter graph.
- A only
- B only
- C only
- A and C ✓
- B and C
-
- Alex says, “I can always use interpolation to make a valid prediction when my bivariate data has correlation.” Is Alex definitely correct?
- No, interpolation is always completely unreliable.
- No, it has to be used with caution as the trend might not continue.
- Yes, interpolation is always completely reliable.
- Yes, but you can only make accurate predictions about similar situations. ✓
-
- The scatter graph shows data taken from the ONS, where each point represents a region in England and Wales in 2023. Select which of these Oak pupils can use this graph to make an accurate prediction.
- Laura: I want the house price in Oakford where the median income is £19 000
- Alex: I want the house price in Ashfield where the median income is £35 000 ✓
- Jacob: I want the house price in Spain where the median income is £35 000
- Sam: I want the house price in Elmway where the median income is £92 000
-
- The scatter graphs shows information about house prices and income in different regions. Estimate the difference between the house prices in the North and South for a median income of £25 000
- £100 000
- £125 000
- £175 000 ✓
- £250 000
- £375 000
-
Exit quiz
- An ______ is a data point that is extremely large or small compared to the rest of the dataset.
- 'outlier' ✓
- Which of these points are an outlier?
- A ✓
- B ✓
- C
- D ✓
- E
-
- Sam says that it is possible to have a dataset with no outliers. Is Sam correct?
- No, all datasets will have a highest value.
- No, there will always be a datapoint that is lower than the rest.
- No, not all the data points will cluster.
- Yes, there may be no points that are far away from the main cluster of data. ✓
- Yes, not every dataset has errors in it.
-
- The scatter graph shows information about number of children and number of elderly people in a set of towns. Each point represents a town. Match each outlier to the correct statement.
- A⇔town with a low number of children and many elderly people ✓
- B⇔town with a high number of children and many elderly people ✓
- C⇔town with a high number of children and few elderly people ✓
- D⇔town with a low number of children and few elderly people ✓
- Sofia says you should never ignore an outlier. Is Sofia correct?
- It depends, you should investigate each outlier carefully. ✓
- No, if a data point is extremely large then it must be an error.
- Yes, an outlier is always just an unusual result in the data.
-
- Jacob says you should show the outlier on a scatter graph. Is Jacob correct?
- No, outliers are always too far from the rest of the data to show.
- Yes, you may need to zoom in on the graph to use interpolation. ✓
- No, removing it can make the trend of the data easier to see. ✓
- Yes, otherwise the scatter graph is wrong.
- Yes, the outlier may help highlight useful information. ✓
-
Worksheet
Loading worksheet ...
Presentation
Loading presentation ...
Video
Lesson Details
Key learning points
- Outliers can be visually seen on a scatter graph.
- Outliers do not necessarily occur at the start/end of the data set.
- Outliers must be carefully considered and only removed if they were recorded in error.
Common misconception
An outlier always has values that are outside the range of the dataset (i.e. at the start or end of a dataset).
For bivariate data, a data point may be distinct from the rest of the data because the value of one of its variable is extremely less/greater than other points that have a similar value in its other variable.
Keywords
Outlier - An outlier is a data point that is extremely large or small compared to the rest of the dataset. Visually, outliers lie far away from where the majority of the results are clustered.
+