Interval Score Examples
Example Plots to show visually how the interval score calculation behaves in different cases.
Last updated
Was this helpful?
Example Plots to show visually how the interval score calculation behaves in different cases.
Last updated
Was this helpful?
The process used to score and rank the Interval Forecasts is relatively straightforward if observed graphically. Below we’ve drawn three examples of hypothetical forecasts along with 12 “observed” price points to illustrate the calculation. The y-axis below represents “Price (USD)” and the x-axis is “Time (sec)”
The first example shows the score when the forecast price interval is excessively wide. The inclusion-factor is 100%, but the width-factor penalizes the final score:
As shown, the maximum and minimum observed price, along with the predicted max and min are given by:
Therefore, in this case the effective top and bottom are both determined by the observed price:
Leading to a weight factor of 0.8, an inclusion factor of 1.0, and a final Interval Score of 0.8:
By contrast, when the interval forecast does not include the entire price time series, the width-factor can reach 100%, because none of the prediction is “wasted”; however the inclusion factor lowers the final score.
The maximum and minimum observed price, along with the predicted max and min are given by:
Therefore, in this case the effective top and bottom are both determined by the predicted interval:
Leading to a weight factor of 1.0, an inclusion factor of 0.75, and a final Interval Score of 0.75:
As a final example, it is of course possible for both the width and inclusion factors to be sub-optimal at the same time. This happens when part of the Interval Forecast is wasted, but it also fails to include the entire observed time series. Note the reduced score, even though the width of the forecast and the observed range is the same ($8):
The maximum and minimum observed price, along with the predicted max and min are given by:
In this case the effective top and bottom are by the prediction and observed prices, respectively:
Leading to a weight factor of 0.6875, an inclusion factor of 0.8333, and a final Interval Score of approximately 0.573: