You are viewing a javascript disabled version of the site. Please enable Javascript for this site to function properly.
Go to headerGo to navigationGo to searchGo to contentsGo to footer
In content section. Select this link to jump to navigation

Investigating the efficiency of the Asian handicap football betting market with ratings and Bayesian networks

Abstract

Despite the massive popularity of the Asian Handicap (AH) football (soccer) betting market, its efficiency has not been adequately studied by the relevant literature. This paper combines rating systems with Bayesian networks and presents the first published model specifically developed for prediction and assessment of the efficiency of the AH betting market. The results are based on 13 English Premier League seasons and are compared to the traditional market, where the bets are for win, lose or draw. Different betting situations have been examined including a) both average and maximum (best available) market odds, b) all possible betting decision thresholds between predicted and published odds, c) optimisations for both return-on-investment and profit, and d) simple stake adjustments to investigate how the variance of returns changes when targeting equivalent profit in both traditional and AH markets. While the AH market is found to share the inefficiencies of the traditional market, the findings reveal both interesting differences as well as similarities between the two.

1Introduction

Football prediction models have become immen-sely popular over the last couple of decades, and this is due to the increasing popularity of football betting. In the academic literature, such models typically focus on predicting the outcome of a match in terms of home win, draw, or away win; known as the 1X2 distribution. Several types of models have been published for this purpose and include rating systems (Leitner et al., 2008; Hvattum & Arntzen, 2010; Constantinou & Fenton, 2013; Wunderlich & Memmert, 2018), statistical methods (Dixon & Coles, 1997; Rue & Salvesen, 2000; Crowder et al., 2002; Goddard, 2005; Angelini & De Angelis, 2017), machine learning techniques (Huang & Chang, 2010; Arabzad et al., 2014; Pena, 2014), knowledge-based systems (Joseph et al., 2006), and hybrid methods that combine any of the above (Constantinou & Fenton, 2017; Constantinou, 2018; Hubacek et al., 2018). In the recent special issue international competition Machine Learning for Soccer, the models that topped the performance table are hybrid and heavily rely on rating systems (Constantinou, 2018; Hubacek et al., 2018).

In Asia, the most popular form of betting (also common in Europe) is the so-called Asian Handicap (AH). This form of betting introduces a hypothetical handicap (i.e., advantage) typically in favour of the weaker team. Specifically, traditional AH introduces a goal deficit to the team more likely to win before kick-off. The manipulation of the match outcome creates interesting situations in which betting is determined by hypothetical, rather than actual, match outcome. Examples of the various types of AH betting are provided in Section 2. This type of betting has also become popular in the UK over the last couple of decades. Football (soccer) syndicates are rumoured to bet millions per week, often on behalf of clients, on AH outcomes offered by bookmakers in Asia (Williams-Grut, 2016). This is because the Asian markets attract higher volumes of bets and offer greater market liquidity. Estimates suggest that over 70% of the betting turnover for football is recorded with Asian bookmakers (Kerr, 2018).

Whereas there is a large literature analysing traditional betting strategies, with researchers investigating how to optimise their betting, there has been only four published papers involving some analysis related to AH. Specifically, Vlastakis et al (2008) used AH odds as one of their model variables to predict match scores and showed that they are a strong predictor of match outcomes. Grant et al (2018) used AH odds, in conjunction with 1X2 odds, to analyse arbitrage opportunities and showed that these exist across a large number of fixed-odds and exchange market odds. Hofer and Leitner (2017) described how to derive information from live AH and Under/Over odds in order to maximise expected returns. Finally, and in an effort to educate gamblers, Hassanniakalager and Newall (n.d.) investigated the product risk associated with different football odds and showed that the AH odds would generally generate lower losses compared to other popular types of bet such as the 1X2, Under/Over, and correct score. Remarkably, no previous published work involves a model specifically designed for, and assessed with, AH bets.

The purpose of this paper is to investigate the efficiency of the AH market in relation to the 1X2 market. The 1X2 market has been extensively studied and the literature provides mixed empirical evidence regarding its efficiency, with most evidence pointing towards a weak-form efficient market (Giovanni & De Angelis, 2019). In this paper, the efficiency of both markets is measured in terms of the ability of the model in discovering profitable betting opportunities given both average and maximum market odds. The model is based on a modified version of the pi-rating system, which is a previously published football rating system (Constantinou & Fenton, 2013), that generates ratings that reflect team scoring ability. The ratings are provided as input to a novel hybrid Bayesian Network (BN) model specifically constructed to simulate the influential relationships between possession, shots, and goals, to predict both 1X2 and AH outcomes.

A BN is a type of a probabilistic model introduced by Pearl (Pearl, 1985) that consists of nodes and arcs. Nodes represent variables and arcs represent conditional dependencies. A BN that consists of both discrete and continuous variables, such as the one constructed in this study, is called a hybrid BN. Each variable has a corresponding Conditional Probability Distribution (CPD) that captures the magnitude as well as the shape of the relationship between directly linked variables. If we assume that the arcs in the BN represent influential relationships, then such a model can be viewed as a causal graph and represents a unique Directed Acyclic Graph (DAG) that can be used for interventional analysis. Otherwise, the arcs represent conditional dependencies that are not necessarily causal relationships, and such a BN is not a unique DAG but rather a Partial DAG that represents an equivalence class of DAGs. For a quick introduction to BNs, with a focus on football examples, see (Constantinou & Fenton, 2018).

Based on 13 English Premier League seasons and betting simulations under different assumptions, the findings reveal interesting differences as well as similarities between the AH and 1X2 markets. Importantly, the AH market is found to share inefficiencies with the traditional 1X2 market, and this provides opportunities for ‘beating’ the market. The paper is structured as follows: Section 2 describes the rules of the AH betting market, Section 3 describes the model, Section 4 covers the data and the process of model fitting, Section 5 presents the results, and Section 6 provides the concluding remarks.

2Asian handicap betting rules

In what follows, the decimal odds system is used (also known as European odds) for payoff in the event of winning a bet. The decimal odds represent the total return ratio of the stake; implying that the stake is already included in the decimal number. For example, a payoff of ‘3’ returns three times the stake; i.e., a bet of £1 would return 1×3=£3 (£2 profit). Odds also reflect probability that incorporates the bookmakers’ profit margin. An example from data is the Arsenal versus Crystal Palace match played on 21/04/2019 with average 1X2 market odds 1.54, 4.44, 6.03, corresponding to probabilities {64.94%, 22.52%, 16.58% }. Summing up these probabilities gives us 104.04%, and the implied average profit margin of 4.04%.

The AH betting market operates such that adversaries are handicapped according to their difference in strength. The term handicap means that one team is assigned a hypothetical score (including fractional) advantage before the match is played. All types of AH betting offered by the bookmakers involve two possible outcomes. This means that the AH betting market reduces the possible match outcomes from three (i.e., 1X2) to two. Each binary outcome corresponds to each team winning, with the odds adjusted subject to the given handicap.

The standard AH betting involves assigning a hypothetical score advantage to the underdog. This represents the most common type of AH betting, and aims to make the contest equal. That is, the handicap applied is the one 1 that optimises the odds, for both teams to win, towards 2 (or 50% chance of winning) and hence, it maximises the uniformity of the AH payoff distribution. While this represents the most popular type of bet, the AH market offers different types of score advantage, each of which we discuss below, including the possibility to assign a hypothetical score advantage to the favourite rather than the underdog.

The market offers three types of AH betting that need to be modelled explicitly into the model, as well as in the betting simulation. These are:

  • i. Whole goal handicap: A team is given a whole-goal handicap such as -1 or +1. In this case, the possibility of a draw is eliminated by removing the draw outcome from the equation and normalising the probabilities of the residual two outcomes to sum up to 1. If a handicap draw is observed, the bet is voided (refunded).

    Let us revisit the example from data discussed at the beginning of this section, involving Arsenal versus Crystal Palace with average 1X2 market odds {1.54, 4.44, 6.03}. Arsenal was the strong favourite. The bookmakers introduced the handicap of –1, which maximised the uniformity of the AH distribution with odds {1.87, 1.99}. The match ended 2-3; i.e., –1 for Arsenal. The AH winner was Crystal Palace since it won the match by one goal difference, which makes it two goals difference given the handicap; i.e., this made the settlement score, which is the match result after the handicap is considered, equal to 2. Table 1 illustrates how the whole-goal AH is determined based on other hypothetical score lines between Arsenal and Crystal Palace.

  • ii. Half-goal handicap: A team is given a half-goal handicap such as -1.5 or +1.5. Assuming a match between X and Y and a handicap of +1.5 (i.e., X receives a 1.5 goal advantage), and that a bet is placed on X, the bet would win as long X does not lose by more than one goal difference; otherwise the bet is lost. In this case, the possibility of a draw is eliminated by the handicap itself, since it is not possible for the settlement score to be a draw.

    An example from data is the Liverpool versus Wolves match played on 12/05/2019 with average 1X2 market odds {1.30, 5.62, 10.17}. Liverpool was the strong favourite. The bookmakers introduced the handicap of -1.5, which maximised the uniformity of the AH distribution with odds {1.91,1.95}. The match ended 2–0 (i.e., +2) in favour of Liverpool. The AH winner was Liverpool since it won the match by two goals difference; i.e., 0.5 goals more than the handicap. This made the settlement score equal to 0.5. Table 2 illustrates how the half-goal AH is determined based on other hypothetical score lines between Liverpool and Wolves.

  • iii. Quarter-goal handicap: A team is given a quarter-goal handicap such as –0.25 or +0.25. This type of handicap is, in fact, a combined whole-goal and a half-goal handicap. For example, if we bet £10 on the away team to win given AH –0.25 with odds 2 (i.e., 50%), the stake would be divided between the nearest whole-goal and half-goal handicaps. That is, a £5 bet will be placed on the away team to win given AH±0 2 with odds ∼2.5 (i.e., 40%) and another £5 bet on the away team to win given AH –0.5 with odds ∼1.66 (i.e., 60%). Note that the odds for the quarter-goal handicap reflect the average payoff, in terms of probability, of the two nearest handicaps. Since this is a combination of two bets, each bet is executed independently. For example, a score of 0–0 would have resulted in voiding AH±0 (i.e., £5 are returned) and winning AH –0.5 (i.e., £5×1.66 =£8.3 are returned).

    An example from data is the Fulham versus Newcastle match played on 12/05/2019 with average 1X2 market odds {2.50, 3.53, 2.78}. Fulham was the weak favourite. The bookmakers introduced the handicap of –0.25, which maximised the uniformity of the AH distribution with odds {2.15,1.75}. The match ended 0–4(i.e., –4) in favour of Newcastle. The AH winner was Newcastle, since it won the match by four goals difference; i.e., 4.25 goals more than the handicap. This made the settlement score equal to –4.25. Table 3 illustrates how the quarter-goal AH is determined based on other hypothetical score lines between Fulham and Newcastle.

Table 1

The whole-goal AH outcome for different hypothetical scores between Arsenal and Crystal Palace

Arsenal goalsCrystal Palace goalsScore differenceHandicapSettlement scoreAH winner
101–10Void
110–1–1Crystal Palace
312–11Arsenal
413–12Arsenal
000–1–1Crystal Palace
01–1–1–2Crystal Palace
12–1–1–2Crystal Palace
Table 2

The half-goal AH outcome for different hypothetical scores between Liverpool and Wolves

Liverpool goalsWolves goalsScore differenceHandicapSettlement scoreAH winner
101–1.5–0.5Wolves
110–1.5–1.5Wolves
312–1.50.5Liverpool
413–1.51.5Liverpool
000–1.5–1.5Wolves
01–1–1.5–2.5Wolves
12–1–1.5–2.5Wolves
Table 3

The quarter-goal AH outcome for different hypothetical scores between Fulham and Newcastle

Fulham GoalsNewcastle goalsScore differenceHandicapSettlement scoreAH winner
101–0.25 (0 and –0.5)1 and 0.5Fulham
110–0.25 (0 and –0.5)0 and –0.5Void and Newcastle
312–0.25 (0 and –0.5)2 and 1.5Fulham
413–0.25 (0 and –0.5)3 and 2.5Fulham
000–0.25 (0 and –0.5)0 and –0.5Void and Newcastle
01–1–0.25 (0 and –0.5)–1 and –1.5Newcastle
12–1–0.25 (0 and –0.5)–1 and –1.5Newcastle

3The model

The overall model combines ratings with BNs. The rating system captures the skill of teams over time, and provides the ratings as an input into the BN model which captures the magnitude of the relationships between variables of interest. The two subsections that follow describe the rating system and the BN model respectively.

3.1The rating system

The pi-rating is a football rating system that determines team ability based on the relative discrepancies in scores between adversaries. It was first introduced in (Constantinou & Fenton, 2013) and thereafter used in (Constantinou, 2018; Hubacek et al., 2018; 2019; Van Cutsem, 2019; Wheatcroft, 2020). Modified versions of the pi-rating also formed part of the top two performing models in the international competition Machine Learning for Soccer (Constantinou, 2018; Hubacek et al., 2018). This paper makes use of the original pi-rating system (Constantinou & Fenton, 2013), with two modifications described below.

The pi-ratings assign a ‘home’ (H) and an ‘away’ (A) rating to each team, to account for team-specific home advantage and away disadvantage. Therefore, when a team X plays against team Y, the match prediction is determined by team’s X rating H versus team’s Y rating A. The ratings are revised after each match based on two learning rates: a) the learning rate λ which determines to what extent new match results override previous match results in terms of the impact in determining current team ratings, and b) the learning rate γ which determines to what extent performances at home grounds influence a team’s away rating and vice versa. Therefore, at the end of a match between teams X and Y, the new ratings at time t are revised given the most recent ratings at time t - 1 as follows:

X'sHrating:RXHt=RXHt1+eHλX'sArating:RXAt=RXAt1+γ(RXHrRXHt1)nY'sArating:RYHt=RYHt1+eAλnY'sHrating:RYHt=RYHt1+γ(RYAtRYAt1)
where e is the error between the observed goal difference Δo and rating difference Δ _ p which, for home and away teams, is measured as follows:
eH=ΔoH-ΔpHandeA=ΔoA-ΔpA

respectively, where

ΔoH=GoH-GoHandΔoA=GoA-GoHΔpH=GpH-GpAandΔpA=GpA-GpH

where GoH and GoA are goals observed for home and away teams respectively, and similarly GpH and GpA are goals predicted for home and away teams.

While the original pi-ratings represent a diminished expectation of goal difference against the average opponent in the data, in this paper they represent the actual goal difference expectation. Specifically, the rating equation in this paper is simplified not to include the deterministic function Ψ (e) =3 × log 10 (1 + e) defined in the original paper (Constantinou & Fenton, 2013), which is a function that diminishes the importance of each additional goal difference under the assumption that a win is more important than increasing goal difference. The justification for this first modification is that, in AH, we are only interested in goal differences and thus, the motivation here is to optimise for goal difference rather than the ability to win matches.

The second modification involves the learning rate λ. In this paper, λ is multiplied by k when a match involves at least one team which had previously played less than 38 matches, according to available data. This modification aims to increase the speed by which team ratings converge for new teams during their first EPL season (each team plays 38 matches in a season), and is expected to be especially impactful during the very first season in the data since, at that point, all teams are considered ‘new’ by the rating. Therefore, the revised pi-ratings exclude ψ (e), defined above, and include k, as follows:

RXHt=RXHt-1+eHλkandRYAt=RYAt-1+eAλk

where k = 3 for match instances in which both teams had previously played less than 38 matches; otherwise k = 1. The parameter k was optimised in terms of minimising prediction error e. A limitation here is that the k parameter was optimised given integer inputs from 1 to 10. For future work, it is recommended that the k parameter is optimised given real numbers.

3.2The BN model

The graph of a BN model can be automatically discovered from data, determined by knowledge, or a combination of the two. Learning the correct graph of a BN from data remains a major challenge in the fields of probabilistic machine learning and causal discovery. While some structure learning algorithms perform well with synthetic data, it is widely acknowledged that this level of performance does not extend to real-world data which typically incorporate noise and latent confounders.

In disciplines like bioinformatics, applying structure learning algorithms can reveal new insights that would otherwise remain unknown. However, these algorithms are less effective in areas with access to domain knowledge, such as in sports. As a result, the BN model in this paper has had its graphical structure determined by the temporal fact that possession influences the number of shots created, which in turn influence the number of shots on target, and which in turn influence the number of goals scored. Each of these factors is also dependent on the level of rating difference between the two teams, as illustrated in Fig. 1. This type of model can also be characterised as a hierarchical Bayesian model of which the network is the structural representation, and where the nodes represent variables or hyperparameters of statistical distributions.

Fig. 1

The BN model. The Rating Difference (RD) is the only observable node in the network, determined by the pi-ratings, and represents the difference between the home team’s prior home rating and the away team’s prior away rating RXHt-1-RYAt-1 .

The BN model. The Rating Difference (RD) is the only observable node in the network, determined by the pi-ratings, and represents the difference between the home team’s prior home rating and the away team’s prior away rating 
RXHt-1-RYAt-1
.

The temporal order of events in the BN graph naturally captures the importance of each event in predicting goals scored. For example, the graph assumes that shots on target have a direct influence on goals scored, whereas possession has an indirect influence and hence, while influential, it is assumed to be less impactful than shots on target. While the temporal order defines direct and indirect influences, note that the magnitude of direct influences is still determined by data.

For each match, the prior ratings are retrieved and the difference in team ratings is used as an input into the BN model, which is a Hybrid BN model consisting of both discrete and continuous variables, designed in AgenaRisk (Agena, 2019). Specifically, the actual input is the difference between prior home and away ratings, and is passed to the BN model as an observation to node Rating Difference (RD) in the form of

RXHt-1-RYAt-1

To ensure that the BN model is trained accurately with respect to the rating data, the parameterisation of the CPDs is also restricted to match instances in which both teams had previously played at least 38 matches. All the residual variables in the BN model are latent. Specifically,

  • i. The node RD, which represents the observable rating difference between teams, is a mixture of Gaussian probability density functions ∼  N (μ,  σ2); one for each state of node Rating Difference Level (RDL). Specifically, for -∞  < RD <  ∞,

    f(RD|μ,σ2,RDL)=[(12πσ2e-(RD-μ)22σ2)|RDL]

    where parent RDL is a discrete distribution, μ is the average rating difference and σ2 the variance of the rating differences. RDL consists of 23 states3, where each state corresponds to a pre-determined level of rating difference as shown in Table 4. For example, the rating difference level 3 is parameterised based on all historical match instances in which adversaries had rating difference RD=RXHt-1-RYAt ranging from 1.765 to < 1.93. The granularity of the 23 states has been chosen to ensure that for any combination of rating difference there is enough data points (more than 50) for a reasonably well informed prior.

  • ii. The node P, which represents ball possession, is a mixture of probability density functions ∼  Beta (a, β); one for each state of RDL. Specifically, for P  ∈ [0, 1],

    f(P|a,β,RDL)=(Pa-1(1-P)β-1Beta(a,β)|RDL)

    where Beta (α, β) is the Beta function, α is the first shape parameter of the Beta function, also known as the alpha parameter, and represents the number of minutes the home team is in possession of the ball, and β is the second shape parameter of the Beta function, also known as the beta parameter, that represents the number of minutes the away team is in possession of the ball. Thus, P reflects the possession rate associated with the home team, over a Beta distribution, whereas for the possession of the away team the model assumes 1 - P.

  • iii. The node p (SM), which represents the probability to generate a shot per minute spent in possession of the ball, is also a mixture of probability density functions ∼  Beta (a, β) given RDL, where α is the number of shots and β is the number of minutes minus the number of shots.

  • iv. The node S, which represents the expected number of shots, is a Binomial probability mass function ∼  B (n, p),

    f(k,n,p)-Pr(k|n,p)=Pr(S=k)=(n!k!(n-k)!)pk(1-p)n-k

    where n represents the number of minutes in possession of the ball defined as 4 P×90, under the assumption a match lasts 90 playable minutes, and p is p (SM); i.e., the probability to generate a shot per minute spent in possession of the ball, as defined above.

  • v. The node p (ST), which represents the probability for a shot to be on target, is also a mixture of probability density functions ∼  Beta (a, β) given RDL, where a is the number of shots on target, and β is the number of shots off target; i.e., total shots minus shots on target.

  • vi. The node ST, which represents the expected number of shots on target, is also a Binomial probability mass function ∼  B (n, p), where n is the expected number of shots S and p is the probability for a shot to be on target p (ST).

  • vii. The node p (G), which represents the probability to score a goal, is also a mixture of probability density functions ∼  Beta (a, β) given RDL, where α is the number of goals scored, and β is the number of shots on target successfully defended; i.e., total shots on target minus goals scored.

  • viii. The node G, which represents the expected number of goals scored, is also a Binomial probability mass function ∼  B (n, p), where n is the expected number of shots on target ST, and p is the probability to score a goal p (G).

  • ix. The node 1X2 is a discrete distribution with states Home win, Draw, and Away win, determined by the distributions G of both home (H) and away (A) teams; i.e., 1X2 = “Home win” if GoH > GoA, “Away win” if GoH < GoA, “Draw” otherwise.

  • x. The node GD, which represents goal difference, is simply GoH - GoA.

  • xi. The node AH represents a set of nodes corresponding to all the possible AH outcomes with state probabilities for home and away wins, given GD, as defined in Section 2.

It should be clear by this point that for both home and away teams: a) nodes P and p (SM) are hyperparameters of Beta node S, b) nodes S and p (ST) are hyperparameters of Beta node ST, and c) nodes ST and p (G) are hyperparameters of node G; effectively creating a Beta-Binomial Hybrid BN process.

Table 4

Predetermined levels of rating difference

RDL123 . . . 212223
RD≥2.095≥1.93 & < 2.095≥1.765 & < 1.93intervals of 0.165 rating≥ –1.205 & < –1.04≥–1.37 & < –1.205< –1.37
Data points5568107 . . . 935855

4Data and model fitting

4.1Data

The rating method, the BN model, and the betting simulation are based on data collected from www.football-data.co.uk and manually recorded from www.nowgoal.com. Table 5 specifies which of the data variables are used by the rating system, the BN model, and the betting simulation. For example, the rating system only requires information about goal data and hence, it only considers variables Date, Home team, Away team, Home goals, and Away goals. Since the ratings are used as an input into the BN model, they represent an additional BN variable and at the same time make the BN model independent of variables Date, Home team and Away team.

Table 5

The data variables used to train the rating system (R), the BN model (BN), and to simulate betting (B)

VariableDetailsUsed in
DateThe date of the matchR
Home teamThe team playing at home groundsR
Away teamThe team playing at away groundsR
Match outcome (1X2)The outcome of the match in terms of home win, draw, or away winBN, B
Home possession rateThe ball possession rate of the team playing at homeBN
Away possession rateThe ball possession rate of the team playing awayBN
Home shotsThe number of shots created by the home teamBN
Away shotsThe number of shots created by the away teamBN
Home shots on targetThe number of shots on target created by the home teamBN
Away shots on targetThe number of shots on target created by the away teamBN
Home goalsThe number of goals scored by the home teamR, BN, B
Away goalsThe number of goals scored by the away teamR, BN, B
Team ratingsThe difference between home team and away team ratingsBN
HandicapThe AH on which the market odds are basedB
1X2 oddsThe average and maximum (i.e., best available) bookmaker 1X2 oddsB
AH oddsThe average and maximum (i.e., best available) bookmaker AH oddsB

The data are based on the English Premier League (EPL) seasons 1992/93 to 2018/19. However, AH odds data were available only from season 2006/07 onwards, whereas ball possession data which is needed by the BN model were available only from season 2009/10 onwards. As a result, the rating system is trained with up to 27 seasons of data, the BN is parameterised with up to 10 seasons of data (since it requires possession data), and the betting simulation is performed over 13 seasons (since it requires AH odds data).

4.2Model fitting

By definition, the ratings are developed in a temporal manner. That is, for a match prediction at time t the model considers team ratings at time t - 1. For any match prediction, a team’s rating will always be based on the most recent rating prior to the date of the match under prediction, and a team’s rating will always be based on past match results.

Conversely, the BN model functions as a machine learning model independent of time and is validated using leave-one-out cross validation (LOOCV). A prediction between teams that have rating difference Z, where Z is one of the 23 RDLs as defined in Table 4, is derived from all data matches with rating difference Z, excluding the match under prediction during validation.

This combination of model parameterisation and validation with a rating system and a BN model is adopted by (Constantinou, 2018). The validation approach is unconventional because the BN model assumes no temporal relationships. When applied to past matches, it generates predictions at time t based on the whole data set which may include future match results. The reason this approach works well, without overestimating the future accuracy of the model, is because it does not matter whether the data comes from past or future. This is because the model assumes that the relationship between, for example, shots on target and goals scored remains invariant over time for the average EPL team, and empirical results support this claim. These include:

  • i. The results presented in Sections 5.2 and 5.3 which show that predictive accuracy is consistent across all 13 seasons tested, including the three seasons 2006/07 to 2008/09 which did not form part of the BN’s training data;

  • ii. The model in (Constantinou, 2018) which was based on this approach and ranked 2nd in the international Machine Learning for Soccer competition, with a prediction error consistent with the validation error.

The empirical support for the model extends to demonstrating that the model can produce good predictions for matches between teams X and Y even when the prediction is derived from match data that neither X nor Y participated in (Constantinou, 2018). This claim is also supported by the results presented in this paper. Specifically, during Seasons 2006/07 to 2008/09 the following teams have had their performance determined by data that did not include any of their matches: a) Sheffield United, b) Charlton, and c) Derby. The reason this occurred is because, as discussed above, the BN model was trained with data from season 2009/10 onwards, which does not include any match instances associated with these teams. Their performance in terms of possession, shots, shots on target and goals scored was derived by other similar match situations in terms of rating difference between home and away teams.

This approach has advantages and disadvantages. The disadvantage is that, for those who are interested in how such a model would have performed in the past, the results only approximate past performance under the assumption the model would have been trained with at least the same amount of data as the test model. On the other hand, the advantage is that this approach allows us to preserve the sample size of the training data throughout validation, and this enables us to validate how the resulting model would have performed over multiple seasons without modifying its parameterisation (excluding the removal of a single sample; i.e. the match under assessment during validation).

To understand why this is important, consider evaluating match instances five seasons in the past. A temporal model would require the removal of the five most recent seasons from the training data. This would have led to limited samples for some of the predetermined levels of rating difference shown in Table 4. The limited data issue can only be overcome by reducing the number of predetermined levels of rating difference (i.e., the dimensionality of the model); but doing so would produce a different model than the one described. Instead, the approach adopted by (Constantinou, 2018) enables us to address the temporal aspect of the problem through the ratings and to preserve the fitting of the BN across all seasons tested; effectively enabling us to test the current parameterised model on multiple seasons independent of time.

5Results

The results are reported in terms of rating (i.e., goal difference) error, predictive accuracy and profitability. Specifically, Section 5.1 assesses the accuracy of the modified pi-ratings in terms of expected goal difference error, Section 5.2 assesses the accuracy of the overall model in predicting both AH and 1X2 outcomes, and Section 5.3 assesses the capability of the model in terms profitability in both 1X2 and AH markets.

5.1Pi-ratings accuracy and overall model fitting

Figure 2 shows that the optimal λ and γ parameters, that minimise the goal difference error as defined in Section 3, are λ= 0.018 and γ= 0.7. Note that while the results are based on training data from seasons 1992/93 to 2018/19, the optimisation is restricted to match instances in which both teams had previously played at least 38 matches; a total of 9,073 match instances. This is to ensure that the model is optimised on matches in which both teams have had their ratings developed by at least one football season.

Fig. 2

The optimal modified pi-rating learning rates and associated prediction error e, given k = 3, are λ= 0.018 and γ= 0.7. The results are based on training data from seasons 1992/93 to 2018/19. The optimisation is restricted to match instances in which both teams had previously played at least 38 matches; a total of 9,073 match instances.

The optimal modified pi-rating learning rates and associated prediction error e, given k = 3, are λ= 0.018 and γ= 0.7. The results are based on training data from seasons 1992/93 to 2018/19. The optimisation is restricted to match instances in which both teams had previously played at least 38 matches; a total of 9,073 match instances.

The optimal learning rates are fairly consistent with those reported in (Constantinou & Fenton, 2013) (i.e., λ= 0.035 and γ= 0.7) on the basis of minimising goal difference error over five EPL seasons, with those reported in (Van Cutsem, 2019) (i.e., where λ= 0.06 and γ= 0.6) on the basis of minimising mean squared goal difference error over eight EPL seasons, with those reported in (Constantinou, 2018), λ= 0.054 and γ= 0.79, on the basis of minimising the Rank Probability Score (RPS) error metric (Constantinou & Fenton, 2012) over multiple leagues worldwide, and with those reported in (Hubacek et al., 2018), λ= 0.06 and γ= 0.5, where pi-ratings had been used in conjunction with Gradient boosted trees parameters to minimise RPS over multiple leagues worldwide.

However, note that the optimal learning rate λ is lower in this study, and this is likely due to the modification that performs more aggressive rating revisions to the first 38 matches of each team, since it is intended to improve the speed of rating convergence. Interestingly, the overall mean goal difference error shown in Fig. 2, e = 1.2283 (or e2= 1.509), is considerably lower than those reported in (Constantinou & Fenton, 2013) and (Van Cutsem, 2019), where e2= 2.625 and e2= 2.66 respectively, and this suggests that the modifications have had a positive impact on the ratings.

Figure 3 illustrates the expected goal difference for each of the 23 rating difference levels. Level 23 represents the highest rating discrepancy in favour of the away team, where the average expectation of the match is a score difference of –1.38 (or 1.38 goals in favour of the away team), and level 1 represents the highest rating discrepancy in favour of the home team with an expected score difference of 2.18 (or 2.18 goals in favour of the home team). The graph reveals a linear relationship between rating discrepancy and score discrepancy.

Fig. 3

Sensitivity analysis between the 23 states of RDL node and the expected goal difference, with linear trendline superimposed as a dashed red line.

Sensitivity analysis between the 23 states of RDL node and the expected goal difference, with linear trendline superimposed as a dashed red line.

As shown in Table 4, the granularity of the 23 intervals was selected to ensure that for any rating difference state there are at least 50 data points for a reasonably well-informed prior of observed goal difference. As with any discretised variable, different splits produce slightly different results. In the case of the RDL distribution, any changes in discretisation will remain faithful to the linear relationship illustrated in Fig. 3; implying that we should expect minor changes to the interval averages as long as the number of splits remains invariant and data points for each interval are maintained above 50.

Any minor model amendment is naturally expected to have minor impact on the predicted probabilities, and any minor impact is expected to have some influence on the results based on small discrepancies between predicted and published market odds (i.e., small θ values as defined later in subsection 5.3, such as θ = 1 or 2). However, no changes are expected for larger discrepancies. Since the conclusions in this paper are not driven by results that are based on such small differences between predicted and observed odds, any minor modification is not expected to alter concluding remarks.

Table 6 presents the discrepancy in prediction error e for the different hyperparameter combinations of λ and γ, relative to the prediction error generated by the optimal inputs of λ= 0.018 and γ= 0.7. Recommended values for λ and γ, that could be considered as an alternative to the optimal values of λ= 0.018 and γ= 0.7, are the ones that generate up to 0.01% discrepancy in prediction error. A total of 31 different combinations, excluding the optimal combination, generate discrepancy within the 0.01% threshold. By keeping the value for λ static, the recommended hyperparameter combinations are a) λ= 0.017 and γ= 0.65–0.73, b) λ= 0.018 and γ= 0.65–0.74, c) λ= 0.019 and γ= 0.65–0.74, and d) λ= 0.02 and γ= 0.7–0.72.

Table 6

The discrepancy in prediction error e for the different combinations of λ and γ, relative to the optimal inputs of λ= 0.018 and γ= 0.7. Darker green cells represent lower discrepancy, whereas darker red cells represent higher discrepancy. Recommended inputs for λ and γ are the ones that generate up to 0.01% discrepancy

The discrepancy in prediction error e for the different combinations of λ and γ, relative to the optimal inputs of λ= 0.018 and γ= 0.7. Darker green cells represent lower discrepancy, whereas darker red cells represent higher discrepancy. Recommended inputs for λ and γ are the ones that generate up to 0.01% discrepancy

5.2.Predictive accuracy

Predictive accuracy is measured for both 1X2 and AH distributions. The Brier Score is used to measure the accuracy of the binary AH outcome, and the RPS metric (Constantinou & Fenton, 2012) is used to measure the accuracy of the multinomial 1X2 distribution. The RPS can be viewed as a Brier Score extended to multinomial ordinal distributions.

Table 7 shows that the RPS error for the 1X2 outcomes ranges from 0.184 to 0.213 with an average RPS of 0.195 across all 13 EPL seasons. This result compares well relative to previous studies that assumed pi-ratings. Specifically, in (Constantinou, 2018) the RPS ranged from 0.187 to 0.236 for 52 different leagues worldwide, with an average RPS of 0.211 at validation, an average RPS of 0.203 for EPL matches, and an average RPS of 0.208 in the competition. Similarly, the average RPS was ∼0.2 in (Hubacek et al., 2018), according to Fig. 3, and 0.206 in the competition.

Table 7

Predictive accuracy across all seasons, based on the Rank Probability Score (RPS) for multinomial 1X2 predictions and the Brier Score (BS) for binary AH predictions. Lower score indicates higher predictive accuracy for both RPS and BS

SeasonRPS (1X2 accuracy)BS (AH accuracy)
2006/070.1970.252
2007/080.1840.248
2008/090.1920.229
2009/100.1880.199
2010/110.2020.248
2011/120.2050.257
2012/130.1910.258
2013/140.1950.249
2014/150.1990.254
2015/160.2130.254
2016/170.1910.267
2017/180.1920.253
2018/190.1910.260
Overall0.1950.248

These results are consistent with those reported in Section 5.1, which show that the overall error e optimises lower in this study; i.e., the ratings more accurately predict score difference. The results from profitability presented in Section 5.3 are also consistent with these findings.

5.2.1Time-series analysis and sample size requirements

The model is trained with data the covers 13 years of data, and not all the variables could be measured throughout this period. Still, the model seems to work well without evidence of bias. This subsection investigates whether the variables used in the model show any drift over time that the model might have filtered out. Moreover, because the dimensionality of the model has been adjusted relative to the available sample size, this subsection also reports the sample size required for the model priors to be well informed by data.

Figure 4 presents the results from time-series analysis in investigating potential shifts in the predictive outputs of the model over time. The analysis is performed by increasing the training data set by a single football season’s worth of data at a time, and the shift is measured in terms of changes in the expected value of the given distribution. As shown in Fig. 4, the analysis focuses on the four main variables of the BN model; namely possession, shots, shots on target, and goals scored. Moreover, the different levels of rating difference (refer to Table 4) are categorised into four groups, and are measured with reference to the 10 specified seasons.

Fig. 4

Investigating the prediction shift over time, with reference to the four main variables of the BN model. The shift is measured in terms of the expected value of the specified distribution, and by adding a football season’s worth of data, to the training data set, at a time.

Investigating the prediction shift over time, with reference to the four main variables of the BN model. The shift is measured in terms of the expected value of the specified distribution, and by adding a football season’s worth of data, to the training data set, at a time.

The reason the first three, out of the 13, seasons are not considered here is because (as later shown in Table 8) the BN model was not trained with data samples from the first two seasons, and this also means that any results obtained during the third season rely on very low samples. As previously discussed in subsection 3.2, the reason the first two seasons are not considered by the BN model is because the BN is trained with match instances in which both teams had previously played at least 38 matches, to allow for the pi-ratings to converge to reasonably accurate estimates before considered for model training.

Table 8

The number of data samples observed in each of the 23 intervals of Rating Difference Level (RDL) after each subsequent league data set is added to the training data. Darker red cells represent sample size less than 19 (equivalent to less than half a season’s data), orange cells represent sample size less than 38 (equivalent to more than half, and less than, a season’s data), and white cells represent sufficient sample size

jsa-8-jsa200588-g008.jpg

The results are discussed with reference to Table 8, which presents the available samples for each of the 23 levels of rating difference, after each subsequent league data set is added to the training data set that was used to learn the BN model. The results show that many of the shifts occur in the first few seasons, and this is reasonable since the first seasons are the ones which rely on fewer samples. Shifts are also observed after adding the most recent leagues, but these shifts are largely restricted to the outputs of possession and shots on target, and involve matches with a level of rating difference between 1 to 6. The most important output of the model, which is the goal difference, remains essentially unchanged over time. Therefore, while it is reasonable to assume that shifts in playing style might occur naturally from season to season, collectively these shifts appear to have no meaningful impact on the prediction of goal difference, from which the 1X2 and AH distributions are formed.

Lastly, the results in Table 8 suggest that the model described in this paper should be trained with at least five seasons of league data, to ensure that the model priors are well informed by data. Because the model is trained with publicly available data, and which only increases over time, there is no motivation to use less data than what is currently available and hence, this requirement should not be viewed as a limitation.

5.3Profitability

The assessment of profitability is based on 13 EPL seasons and considers both the average and the best available (maximum) market odds. The simulation is evaluated both in terms of maximising profit as well as the Return On Investment (ROI). A standard betting strategy is used where fixed singe-unit bets (e.g., £1) are placed on 1X2 and AH outcomes with payoff that is higher than the model’s estimated unbiased payoff by at least θ, where θ is the discrepancy between the predicted probability and the payoff probability. For example, if the model predicts 51% and the bookmakers’ payoff for that event is 50% (i.e., odds of 2), then θ = 1; i.e., the bookmakers pay 1% more than the model’s estimate. The betting simulation is performed across all payoff decision thresholds θ, for both 1X2 and AH outcomes. The results are first discussed in terms of 1X2 betting performance with reference to Tables 9, 10 and 11; then in terms of AH betting performance with reference to Tables 12, 13 and 14.

Table 9

Profitability based on average (left) and maximum (right) market odds for 1X2 bets simulated over 13 EPL seasons; from 2006/09 to 2018/19

θAverage market oddsMaximum market odds
BetsBets wonOddsWin RateReturnsProfitROIBets wonBetsOddsWin RateReturnsProfitROI
0%433412903.0629.8%3942.5–391.5–9.03%493815143.2230.7%4879.0–59.01–1.20%
1%371211053.0629.8%3375.9–336.1–9.05%479414683.2430.6%4755.3–38.67–0.81%
2%31099373.1130.1%2909.9–199.2–6.41%430513073.1930.4%4167.1–137.91–3.20%
3%25387753.0930.5%2397.8–140.2–5.52%365811223.2330.7%3623.2–34.81–0.95%
4%20726433.0931.0%1987.2–84.9–4.10%30679503.2631.0%3096.429.380.96%
5%16995243.1530.8%1650.9–48.1–2.83%25538023.2731.4%2620.067.022.63%
6%13394153.1931.0%1325.3–13.7–1.02%20766563.2931.6%2157.881.773.94%
7%10363293.2631.8%1074.038.03.67%16825263.3431.3%1756.374.294.42%
8%8142603.3031.9%858.744.75.49%13454223.4231.4%1444.699.567.40%
9%6121983.1732.4%626.814.82.42%10493523.4833.6%1226.2177.1816.89%
10%4521483.1332.7%462.610.62.34%8072673.4033.1%906.699.6312.35%
11%3201013.1231.6%315.2–4.8–1.51%6081993.2832.7%652.344.297.28%
12%241753.1631.1%236.9–4.1–1.68%4511553.3034.4%511.860.7813.48%
13%191653.2734.0%212.321.311.13%3241083.2733.3%352.728.658.84%
14%143503.4535.0%172.529.520.59%241803.3833.2%270.429.3712.19%
15%103373.1035.9%114.811.811.48%184623.4833.7%216.031.9817.38%
16%76273.4035.5%91.815.820.84%132463.4834.8%160.027.9921.20%
17%51173.6433.3%61.810.821.20%101373.3136.6%122.321.3121.10%
18%37123.8132.4%45.78.723.59%76263.4634.2%89.913.9118.30%
19%2473.1029.2%21.7–2.3–9.58%52183.8234.6%68.716.6732.06%
20%1953.4426.3%17.2–1.8–9.37%34113.5132.4%38.64.6113.56%
Table 10

The payoff discrepancies θ that maximise ROI per season (in yellow shading), based on 1X2 bets and for both average (left) and maximum (right) market odds. The optimal θ discrepancy is chosen over all θ that generate at least 30 bets in a single season

SeasonAverage market oddsMaximum market odds
θBetsBets wonOddsWin RateReturnsProfitROIθBetsBets wonOddsWin RateReturnsProfitROI
2006/078%34133.438.2%43.79.6628.41%10%39163.441.0%54.615.640.00%
2007/087%63212.833.3%58.4–4.56–7.24%11%41153.136.6%47.16.1114.90%
2008/093%181723.139.8%224.343.323.92%5%202773.538.1%268.066.0432.69%
2009/102%242573.523.6%199.9–42.09–17.39%13%3093.730.0%33.03.0210.07%
2010/1110%33172.851.5%48.215.1846.00%10%59293.049.2%86.627.5646.71%
2011/120%3261033.331.6%338.712.683.89%1%3701193.632.2%433.463.417.14%
2012/137%65223.533.8%77.912.8919.83%9%71253.835.2%94.523.4533.03%
2013/1410%30103.733.3%36.66.5921.97%12%33133.839.4%49.716.750.61%
2014/157%73333.145.2%101.628.5839.15%9%72333.345.8%109.537.5152.10%
2015/1610%65253.038.5%74.79.6514.85%12%57213.236.8%67.710.7118.79%
2016/179%58144.824.1%67.79.7216.76%10%80195.023.8%94.714.7318.41%
2017/188%85243.928.2%93.38.289.74%13%31113.535.5%38.47.4323.97%
2018/1913%32113.534.4%38.36.2819.63%14%34113.732.4%40.46.3618.71%
Overall4.35%12874223.3832.79%1403.16116.169.03%8.01%11193983.5935.57%1417.62298.6226.69%
Table 11

The payoff discrepancies θ that maximise profit per season (in yellow shading), based on 1X2 bets and for both average (left) and maximum (right) market odds. The optimal θ discrepancy is chosen over all θ that generate at least 30 bets in a single season

SeasonAverage bookmaker oddsMaximum bookmaker odds
θBetsBets wonOddsWin RateReturnsProfitROIθBetsBets wonOddsWin RateReturnsProfitROI
2006/078%34133.438.2%43.79.6628.41%5%166563.433.7%188.422.3913.49%
2007/088%45142.931.1%40.8–4.16–9.24%11%41153.136.6%47.16.1114.90%
2008/093%181723.139.8%224.343.323.92%5%202773.538.1%268.066.0432.69%
2009/1010%41113.126.8%33.7–7.28–17.76%13%3093.730.0%33.03.0210.07%
2010/118%59272.845.8%75.916.8528.56%6%164633.138.4%197.233.1920.24%
2011/120%3261033.331.6%338.712.683.89%1%3701193.632.2%433.463.417.14%
2012/137%65223.533.8%77.912.8919.83%9%71253.835.2%94.523.4533.03%
2013/145%123443.335.8%143.220.2116.43%0%3801273.433.4%438.158.1115.29%
2014/157%73333.145.2%101.628.5839.15%8%91403.344.0%133.242.2246.40%
2015/168%94343.136.2%104.710.6611.34%1%3711303.135.0%400.429.417.93%
2016/179%58144.824.1%67.79.7216.76%10%80195.023.8%94.714.7318.41%
2017/188%85243.928.2%93.38.289.74%9%93264.228.0%108.015.0416.17%
2018/194%188673.135.6%204.516.468.76%5%206733.035.4%222.316.257.89%
Overall4.62%13724783.2834.84%1549.85177.8512.96%4.61%22657793.5234.39%2658.36393.3617.37%
Table 12

Profitability based on average (left) and maximum (right) market odds for AH bets simulated over 13 EPL seasons; from 2006/07 to 2018/19

θAverage market oddsMaximum market odds
BetsBets wonOddsWin RateReturnsProfitROIBetsBets wonOddsWin RateReturnsProfitROI
0%391423291.6259.5%3762.8–151.22–3.86%470328301.6660.2%4707.04.010.09%
1%347120511.6259.1%3326.5–144.52–4.16%422825271.6659.8%4203.2–24.76–0.59%
2%306418171.6159.3%2929.4–134.61–4.39%378822471.6659.3%3736.8–51.25–1.35%
3%266515841.6159.4%2554.4–110.65–4.15%337519981.6759.2%3340.7–34.28–1.02%
4%228613571.6259.4%2201.3–84.71–3.71%297417661.6759.4%2949.3–24.73–0.83%
5%193211571.6259.9%1873.4–58.56–3.03%258615261.6759.0%2545.8–40.17–1.55%
6%16409841.6360.0%1600.9–39.09–2.38%219213041.6759.5%2171.3–20.74–0.95%
7%13868371.6360.4%1368.5–17.52–1.26%188311241.6759.7%1877.9–5.15–0.27%
8%11356881.6360.6%1124.2–10.78–0.95%15879551.6860.2%1600.013.030.82%
9%9365721.6461.1%938.52.450.26%13077901.6860.4%1325.118.061.38%
10%7654601.6560.1%759.0–5.98–0.78%10806581.6960.9%1111.731.732.94%
11%6143681.6659.9%612.1–1.89–0.31%8965411.7060.4%920.624.612.75%
12%4882921.6559.8%480.9–7.12–1.46%7284271.7158.7%728.30.250.03%
13%3942331.6359.1%379.7–14.29–3.63%5713301.7157.8%564.5–6.46–1.13%
14%3121911.6561.2%314.52.490.80%4632791.7160.3%478.315.263.30%
15%2501551.6562.0%255.15.072.03%3692221.7060.2%376.87.762.10%
16%2011261.6762.7%210.29.204.57%3001831.6861.0%308.08.022.67%
17%138831.6760.1%138.80.800.58%2371491.7062.9%253.716.757.07%
18%113701.6861.9%117.74.724.18%1801081.6960.0%182.12.121.18%
19%87531.6660.9%87.90.860.99%136851.7062.5%144.58.486.24%
20%62401.7464.5%69.77.6512.34%108691.7263.9%118.810.8110.01%
21%44311.7470.5%54.010.0122.75%78511.7365.4%88.310.3413.26%
22%30201.8766.7%37.37.3124.37%60401.7766.7%70.810.8118.02%
23%24171.8570.8%31.47.3730.71%41261.9563.4%50.79.7023.66%
24%16111.8468.8%20.24.2026.25%33212.0563.6%43.09.9830.24%
25%1181.8172.7%14.53.4531.36%27192.0470.4%38.711.7343.44%
Table 13

The payoff discrepancies θ that maximise ROI per season (in yellow shading), based on AH bets and for both average (left) and maximum (right) market odds. The optimal θ discrepancy is chosen over all θ that generate at least 30 bets in a single season

SeasonAverage market oddsMaximum market odds
θBetsBets wonOddsWin RateReturnsProfitROIθBetsBets wonOddsWin RateReturnsProfitROI
2006/077%66461.669.7%74.98.94513.55%8%80571.671.3%91.411.38514.23%
2007/088%71471.566.2%72.21.1651.64%10%70471.567.1%72.72.7453.92%
2008/0912%30221.873.3%38.88.7829.27%14%31211.967.7%40.99.8831.87%
2009/1011%52331.563.5%49.6–2.39–4.60%16%31221.471.0%31.60.611.97%
2010/1111%43281.865.1%51.88.7520.35%14%33221.866.7%39.06.0118.21%
2011/1210%51291.856.9%51.70.741.45%14%34201.858.8%36.01.985.82%
2012/139%55401.672.7%65.610.5519.18%13%30231.576.7%35.45.41518.05%
2013/1410%58381.765.5%64.05.9610.28%14%33221.766.7%37.94.9214.91%
2014/158%64391.860.9%68.74.687.31%9%72461.863.9%83.411.4115.85%
2015/168%109681.662.4%107.2–1.76–1.61%11%87551.763.2%91.74.7455.45%
2016/1715%33211.663.6%33.40.4051.23%17%33211.663.6%34.21.183.58%
2017/1814%41251.761.0%43.32.295.59%17%31201.864.5%36.35.30517.11%
2018/192%2671631.761.0%272.55.4852.05%4%2561561.760.9%266.610.594.14%
Overall7.45%9405991.6663.7%993.653.65.70%10.04%8215321.6864.8%897.276.1759.28%
Table 14

The payoff discrepancies θ that maximise profit per season (in yellow shading), based on AH bets and for both average (left) and maximum (right) market odds. The optimal θ discrepancy is chosen over all θ that generate at least 30 bets in a single season

SeasonAverage market oddsMaximum market odds
θBetsBets wonOddsWin RateReturnsProfitROIθBetsBets wonOddsWin RateReturnsProfitROI
2006/071%2271511.666.5%244.217.227.59%1%3082041.766.2%339.531.4610.21%
2007/088%71471.566.2%72.21.1651.64%8%103681.666.0%106.63.643.53%
2008/099%67461.868.7%80.913.8920.73%6%1681021.860.7%188.520.4712.18%
2009/1012%39251.564.1%37.1–1.89–4.85%16%31221.471.0%31.60.611.97%
2010/1111%43281.865.1%51.88.7520.35%0%3562201.761.8%376.720.685.81%
2011/1210%51291.856.9%51.70.741.45%14%34201.858.8%36.01.985.82%
2012/139%55401.672.7%65.610.5519.18%6%1541041.667.5%163.09.025.86%
2013/144%1721111.764.5%188.916.99.83%6%1611041.864.6%183.022.03513.69%
2014/156%95581.861.1%101.66.646.99%9%72461.863.9%83.411.4115.85%
2015/168%109681.662.4%107.2–1.76–1.61%11%87551.763.2%91.74.7455.45%
2016/1715%33211.663.6%33.40.4051.23%17%33211.663.6%34.21.183.58%
2017/1814%41251.761.0%43.32.295.59%0%3692401.665.0%382.613.5653.68%
2018/192%2671631.761.0%272.55.4852.05%2%3181971.761.9%330.512.5153.94%
Overall5.57%12708121.6663.9%1350.480.3856.33%5.55%219414031.6963.9%2347.3153.316.99%

Table 9 presents the profitability generated by 1X2 bets over all possible payoff decision thresholds θ, assuming static θ across all 13 seasons, for both average and maximum market odds. Unsurprisingly, the results show that it is much easier for the model to generate profit when taking advantage of the maximum market odds. Moreover, low thresholds θ (i.e., when the predictions are roughly in agreement with market odds) are not profitable. Interestingly, ROI maximises at much higher thresholds θ compared to profit; i.e., profit maximises at 8% and 9% whereas ROI at 18% and 16%, for average and maximum market odds respectively. This is because lower thresholds θ generate a higher number of bets which can generate higher profit even if ROI is lower.

Tables 10 and 11 show how the profitability changes when we consider the threshold θ that maximises ROI (Table 10) or profit (Table 11) per season, rather than considering a static θ across all seasons (Table 9), for both average and maximum market odds. Profit, once more, tends to maximise on lower thresholds θ compared to ROI. As an example, Table A1 provides the information used during the betting simulation to assess profitability for 1X2 outcomes, based on average odds of season 2010/11 as shown in Table 10.

The results show that the optimal threshold θ varies dramatically between seasons, and there is much to be gained by identifying the optimal θ. However, the high variance of θ suggests that is not reasonable to expect that we will be able to successfully predict the optimal value of θ before a season starts. Moreover, while the results are restricted to cases with 30 or more bets in a single season, it is clear that in many cases the sample size remains insufficient for deriving reliable and robust single-season conclusions. This means that the maximised profitability presented in Tables 10 and 11 is not a realistic expectation of real-world performance; only Table 9 is. These results are important because they highlight the danger when optimising models based on the results of a single season (or few seasons), which is often the case in the literature. This outcome is also discussed in Section 6, point iv.

Interestingly, while maximising profit per season is guaranteed to also maximise profit over all seasons (Table 11), the same does not apply to ROI (see Table 10 and compare it to Table 11). That is, optimising the betting strategy for maximum ROI per season does not necessarily imply that ROI will maximise across all seasons. Table 10 shows that even though ROI is maximised for each season independently, the overall ROI across all 13 seasons is 9.03% in the case of average odds, which is notably lower compared to the respective overall ROI of 12.96% in Table 11. However, this observation does not extend to the case of maximum market odds. This outcome is also discussed in Section 6, point vi.

Tables 12, 13, and 14 repeat the analysis of Tables 9, 10, and 11, but for AH rather than 1X2 betting. Table A2 presents an example of the information used during the betting simulation to assess profitability from AH bets, and it is based on average odds of season 2010/11 as shown in Table 13. Overall, the AH bets appear to generate both lower profit as well as ROI compared to 1X2 bets. As with 1X2 bets, optimising for maximum ROI per season leads to a lower ROI across all seasons, compared to maximising profit. Specifically, Table 13 shows that when maximising ROI per season leads to an overall ROI of 5.7% for average odds, which is lower than the respective average ROI of 6.33% in Table 14 when maximising profit. Once more, this observation only applies to the case of average market odds.

An interesting observation is that AH betting generates a lower number of bets when θ is low, compared to 1X2 betting, and a higher number of bets when θ is high. This suggests that AH betting is less sensitive to the betting decision threshold θ compared to 1X2 betting, for both average and maximum market odds. Furthermore, the optimal threshold θ for AH bets does not vary as much as it did for 1X2 bets. Despite the relatively low variance of θ and the common occurrence of winning 60+ out of 100 AH bets, profitability is still inconsistent between seasons. This is because match bets do not share the same payoff.

5.3.1Odds of bets simulated

When it comes to the bets simulated, the 1X2 bets tend to average odds greater than 3 which suggests that the model tends to recommend bets on outsiders; a behaviour that is consistent with previous studies including the original pi-rating (Constantinou & Fenton, 2013; Constantinou, 2018). Conversely, the AH bets tend to be simulated on favourite outcomes with average season odds typically ranging between 1.6 and 1.8. However, it is important to note that an issue with the AH odds retrieved from www.football-data.co.uk is that they do not always represent the odds associated with the handicap that maximises the uniformity of the AH distribution, as discussed in Section 2. For example, the AH odds for seasons 2009/10 and 2010/11 appear to be predominantly based on ±0 AH; i.e., no handicap, with the outcome of draw eliminated. Examples of this issue can also be viewed in Table A2; e.g., refer to the imbalanced AH odds for dates 14/08, 11/09 and 27/11.

According to Table 15, at least part of the AH odds of the first five seasons do not reflect the standard AH outcome, whereas the eight most recent seasons appear to be correctly based on the standard AH outcome that aims to make the competition equal. Results from predictive accuracy and profitability suggest that there is no meaningful difference between the first five and the last eight seasons. Therefore, we have no reason to assume that this might have influenced the overall conclusions. Finally, the preference of the model to bet on favourite AH outcomes remains consistent across all 13 seasons. This outcome is also discussed in Section 6, point ii.

Table 15

The mean average and mean maximum AH odds for each of the 13 seasons

Average oddsMaximum odds
SeasonHTATHTAT
2006/071.891.971.952.05
2007/081.922.012.002.09
2008/091.852.301.942.50
2009/102.083.012.243.38
2010/111.872.241.942.38
2011/121.931.941.992.01
2012/131.931.951.992.01
2013/141.931.942.002.01
2014/151.921.951.982.01
2015/161.941.931.991.99
2016/171.951.932.011.99
2017/181.951.932.001.99
2018/191.961.942.032.00

A possible limitation here is that, while the AH market offers multiple handicaps for each match, this study has only considered one handicap per match. However, it is reasonable to assume that the results presented in this paper approximate the overall AH market. This is because when the model suggests a bet on team X for a given handicap, then we should expect the model to suggest a bet on team X regardless the handicap, since any handicap must remain faithful to the expected goal difference of the match, which determines θ.

5.3.2Betting stake adjustments

Figure 5 presents the cumulative profit generated over eight different betting scenarios that represent the combinations of the following betting options: a) optimising for maximum ROI or profit, b) optimising θ per season or across all seasons, and c) simulating 1X2 or AH bets. The results illustrate how the difference in profit and ROI evolves across the 13 seasons between 1X2 and AH bets. While AH bets generate considerably lower profit and ROI, the profitability is much less volatile than 1X2 bets and hence, it is subject to a lower risk of loss which can often be detrimental. For example, note the significant losses for the two best performing scenarios during matches 1900 to 2100, which are both based on 1X2 bets. However, the lower risk of loss also limits profits.

Fig. 5

Cumulative profit when the betting procedure is optimised for either profit or ROI, overall or per season, and based on either 1X2 or AH maximum market odds. The results are based on 13 EPL seasons; from 2006/09 to 2018/19. Optimisations for overall profit and ROI, across all 13 seasons, are restricted to θ discrepancies that generate at least 100 bets over those 13 seasons.

Cumulative profit when the betting procedure is optimised for either profit or ROI, overall or per season, and based on either 1X2 or AH maximum market odds. The results are based on 13 EPL seasons; from 2006/09 to 2018/19. Optimisations for overall profit and ROI, across all 13 seasons, are restricted to θ discrepancies that generate at least 100 bets over those 13 seasons.

A fairer assessment of risk between 1X2 and AH profits would be to simply optimise stakes such that, at the end of the betting period, they both produce the same profit. Figure 6 provides these results by extending the scenarios of Fig. 5 to include an additional betting scenario in which AH stakes are increased proportional to the difference in cumulative profit between 1X2 and AH bets. For example, in Fig. 6a the new AH betting scenario assumes an increase of 5.59 times the stakes of AH bets, in order for the cumulative profit to become equal to that generated by 1X2 bets.

Fig. 6

Comparing the volatility of profits when the stakes of AH bets is increased by as much required for the cumulative profit to match that of 1X2 bets.

Comparing the volatility of profits when the stakes of AH bets is increased by as much required for the cumulative profit to match that of 1X2 bets.

Overall, the graphs suggest that if we want AH bets to generate as much profit as 1X2 bets do, then profit from AH bets will likely be subject to a similar risk of loss as with 1X2 bets. Therefore, while AH is often preferred due to the lower variance of returns, this advantage is rather eliminated when we need to bet proportionally larger to match the expected profit produced by the corresponding 1X2 bets. This outcome is also discussed in Section 6, point iii.

6Discussion and concluding remarks

This paper presented a model specifically developed for the prediction and assessment of the AH football betting market. The model is based on a modified version of the pi-ratings system which measures the relative scoring ability between teams. The modified pi-ratings are used as an input into a novel BN model that had its graphical structure determined by the temporal assumption Possession → Shots → ShotsonTarget  → Goalsscored, which captures the natural causal chain of these events via a Beta-Binomial Hybrid BN modelling process. One example of this assumption is that possession occurs before shots (or shots on target) and hence, shots are assumed to be more impactful than possession in terms of determining goals scored.

Using goal scoring data over the last 27 EPL seasons, the modified pi-ratings discovered a strong linear relationship between team rating difference and expected goal difference. However, the linear relationship is oscillatory (refer to Fig. 3) and this suggests that goal data alone may be insufficient in completely explaining team ability. Future work will investigate whether factors beyond goals scored could better explain this relationship. For example, in (Constantinou & Fenton, 2017) it was shown that the three teams who were promoted to the EPL, from the English Championship, tend to perform significantly better than the teams they replace. This is an important factor not taken into consideration by the pi-ratings; i.e., the teams are promoted with either an ignorant rating (if it is their first time in the EPL) or with the rating they had when they were last relegated, which clearly underestimates their performance once they return to the EPL.

AH betting is assessed with reference to the traditional 1X2 betting. The assessment is based on both average and maximum market odds and over all possible betting decision thresholds in terms of discrepancy between predicted and offered market odds. Furthermore, the assessment differentiates between betting strategies that are optimised for ROI and betting strategies that are optimised for profit. Key observations include:

  • i. The previous literature has generally focused on maximum market odds, and this is understandable since professional gamblers aim to maximise payoff. Still, average odds are important because they reveal the expected returns for the average gambler. Moreover, maximum odds are not attainable by everyone since many countries do not allow access to many of the online bookmakers, including exchange-based websites which often offer the best odds (excluding commission). This study shows that the maximum available market odds increase profits by up to four times relative to average odds. Specifically, taking advantage of the maximum market odds can lead to increased profits that range anywhere between 42% (refer to overall profits in Table 13) and 296% (refer to maximised profits in Table 9).

  • ii. The recommended AH bets tend to be on favourite outcomes with odds that typically average between 1.6 and 1.8 per season. Conversely, the recommended 1X2 bets tend to be on outsider outcomes with odds averaging above 3. The reduction of the problem from a three-state multinomial to a binary distribution (i.e., from 1X2 to AH) explains why the odds move from 1-in-3 to 1-in-2, but not why the recommended bets switch from outsiders to favourites.

  • iii. AH bets generate lower profit as well as ROI compared to 1X2 bets. Specifically, 1X2 bets are found to generate ∼2.5 to ∼5.5 times higher profit and ∼2.5 to ∼4 times higher ROI compared to AH bets (refer to Fig. 6). For this reason, returns from AH bets tend to be considerably less volatile and subject to a lower risk of loss. While this outcome is in agreement with (Hassanniakalager & Newall, n.d.), this presumed advantage of AH betting is flawed. This is because, when the betting stakes of AH bets are increased proportional to the difference in cumulative profit between 1X2 and AH bets, the variance of profit from AH bets increases towards the variance of profit from 1X2 bets. This implies that, when aiming for the same profit at the end of the same period of time, AH bets are not necessarily less risky than 1X2 bets.

  • iv. Past studies often focus on a single football season, and profitability tends to be reported based on the betting decision threshold that maximises ROI under the assumption that the optimal betting decision threshold remains invariant between seasons. However, the results in this paper show that the optimal betting decision threshold varies dramatically between seasons, despite predictive accuracy being consistent across the 13 seasons, and this applies to both 1X2 and AH bets; albeit to a lower degree for AH bets.

    This implies that the profitability presented in Tables 10, 11, 13, and 14 is not a realistic expectation of real-world performance. This is because the optimal betting decision threshold is not consistent between seasons, and the high variance suggest that it is unreasonable to assume we will be able to predict the decision threshold that maximises profit or ROI before a season starts. Therefore, the choice of evaluating football models based on the threshold that maximises profitability in a single football season, which is often the case in the literature, should be discouraged. Moreover, the optimal betting decision threshold is also dependent on whether we would like to maximise ROI or profit. On the other hand, Tables 9 and 12 represent a more realistic expectation of real-world performance, even though it is unlikely that we will follow a static betting decision threshold across these many seasons.

  • v. Neither profit nor ROI are consistent between seasons, and this applies to both 1X2 and AH bets. While the overall performance of the model is good enough to beat the market, it is still possible for the best possible betting decision threshold to be lossmaking for a whole season (see Tables 10, 11, 13, 14). While this is true for average market odds, the risk is eliminated when we consider maximum odds; though some seasons were barely profitable.

  • vi. Finally, the results show that choosing to optimise for maximum ROI per season will likely produce undesired results in the long term, and this applies to both 1X2 and AH bets. On the other hand, choosing to optimise for maximum profit (rather than ROI) per season, not only guarantees that the profit is maximised across all seasons, but also often generates a higher overall ROI, across all seasons, compared to the overall ROI generated when optimising for maximum ROI for each season independently. This finding is important since most of the previous studies focus on maximising ROI, often for individual seasons.

Lastly, it is important to note that this paper has considered football data up to season 2018/19. The two subsequent seasons have been partly affected by the COVID-19 pandemic, where many matches were played with fewer or without fans. Relevant studies have shown that this event had an insignificant or a significant negative effect on home advantage (Wunderlich et al., 2021; McCarrick et al., 2021). The model described in this paper does not to consider this event. However, because the model relies on pi-ratings which involve a home and an away rating for each team, it can be easily adjusted to consider such previously unseen events. For example, if we assume that home advantage is not relevant for a particular match, we could consider assigning the ‘away’ ratings to both teams for that match. Future research works could investigate whether such modelling modifications, that take into consideration the effect of playing in empty stadiums, improve predictive accuracy.

Acknowledgments

This research was supported by the ERSRC Fellowship project EP/S001646/1 on Bayesian Artificial Intelligence for Decision Making under Uncertainty, by The Alan Turing Institute in the UK, and by Agena Ltd.

References

1 

Agena. 2019. AgenaRisk: Bayesian Network Software for Risk Analysis and Decision Making. (Agena Ltd) Retrieved August 1, 2019, from https://www.agenarisk.com/

2 

Angelini, G. & De Angelis, L. (2017) . PARX model for football match predictions, Journal of Forecasting, 36: , 795–807.

3 

Arabzad, S. M. , Araghi, M. E. , Sadi-Nezhad, S. M. , & Ghofrani, S. (2014) . Football match results prediction using artificial neural networks; The case of Iran Pro League, International Journal of Applied Research on Industrial Engineering, 1: (3), 159–179.

4 

Constantinou, A. . 2018. Dolores: A model 1110 that predicts football match outcomes from all over the world. Machine Learning, 1-27.

5 

Constantinou, A. , & Fenton, N. (2012) . Solving the Problem of Inadequate Scoring Rules for Assessing Probabilistic Football Forecast Models, Journal of Quantitative Analysis in Sports, 8: (1).

6 

Constantinou, A. , & Fenton, N. (2013) . Determining the level of ability of football teams by dynamic ratings based on the relative discrepancies in scores between adversaries, Journal of Quantitative Analysis in Sports, 9: (1), 37–50.

7 

Constantinou, A. , & Fenton, N. (2017) . Towards smart-data: Improving predictive accuray in long-term football team performance, Knowledge-Based Systems, 124: , 93–104.

8 

Constantinou, A. , & Fenton, N. (2018) . Things to know about Bayesian Networks, Significance, 15: (2), 19–23.

9 

Crowder, M. , Dixon, M. , Ledford, A. , & Robinson, M. (2002) . Dynamic modelling and prediction of English Football League matches for betting, Journal of the Royal Statistical Society: Series D (The Statistician), 51: (2), 157–168.

10 

Dixon, M. , & Coles, S. (1997) . Modelling association football scoresand inefficiencies in the football betting market, Applied Statistics, 46: (2), 265–280.

11 

Giovanni, A. , & De Angelis, L. (2019) . , ., &, . Efficiency of online football betting markets, International Journal of Forecasting, 35: , 712–721.

12 

Goddard, J. (2005) . Regression models for forecasting goals and match results in association football, International Journal of Forecasting, 21: (2), 331–340.

13 

Grant, A. , Oikonomidis, A. , & Bruce, A. J. (2018) . New entry, strategic diversity and efficiency in soccer betting markets: the creation and suppression of arbitrage opportunities, The European Journal of Finance, 24: (18), 1799–1816.

14 

Hassanniakalager, A. , & Newall, P. , (n.d.). A machine learning perspective on responsible gambling. Behavioural Public Policy, 1-24.

15 

Hofer, V. , & Leitner, J. (2017) . Relative pricing of binary options in live soccer betting markets, Journal of Economic Dynamics & Control, 76: , 66–85.

16 

Huang, K. , & Chang, W. (2010) . A neural network method for prediction of 2006World Cup Football Game. In Proceedings of the IEEE 2010 International Joint Conference on Neural Networks (IJCNN). Barcelona.

17 

Hubacek, O. , Gustav, S. , & Zelezny, F. (2019) . Score-based soccer match outcome modeling –an experimental review. Math-Sport International. Athens, Greece.

18 

Hubacek, O. , Sourek, G. , & Zelezny, F. (2018) . Learning to predict soccer results from relational data with gradient boosted trees, Machine Learning, 108: , 29–47.

19 

Hvattum, L. M. , & Arntzen, H. (2010) . Using ELO ratings for match result prediction in association football, International Journal of Forecasting, 26: , 460–470.

20 

Joseph, A. , Fenton, N. , & Neil, M. (2006) . Predicting football results using Bayesian nets and other machine learning techniques, Knowledge-Based Systems, 19: (7), 544–553.

21 

Kerr, J. (2018) . How can legislators protect sport from the integrity threat posed by cryptocurrencies? The International Sports Law Journal, 18: , 79–97.

22 

Leitner, C. , Zeileis, A. , & Hornik, K. (2008) . Forecasting Sports Tournaments by Ratings of (prob)abilities: A comparison for the EURO, International Journal of Forecasting, 26: , 471–481.

23 

McCarrick, D. , Bilalic, M. , Neave, N. , & Wolfson, S. , 2021. Home advantage during the COVID-19 pandemic: Analyses of European football leagues. Psychology of Sport and Exercise, 56, Article 102013.

24 

Pearl, J. 1985. Bayesian Networks: A model of activated memory for evidential reasoning. In Proceedings of the Cognitive Science Society, 329-334.

25 

Pena, J. L. 2014. A Markovian model for association football possession and its outcomes. arXiv:1403.7993 [math.PR].

26 

Rue, H. , Salvesen, O. (2000) . Prediction and retrospective analysis of soccer matches in a league, Journal of the Royal Statistical Society: Series D (The Statistician), 49: (3), 399–418.

27 

Van Cutsem, L. 2019. Pi Ratings. Retrieved July 8, 2019, from https://cran.r-project.org/web/packages/piratings/vignettes/README.html

28 

Vlastakis, N. , Dotsis, G. , & Markellos, R. (2008) . Nonlinear modelling of European football scores using support vector machines, Applied Economics, 40: , 111–118.

29 

Wheatcroft, E. 2020. A profitable model for predicting the over/under market in football. International Journal of Fore-casting.

30 

Williams-Grut, O. (2016) . Inside Starlizard: The story of Britain’s most successful gambler and the secretive company that helps him win. London, UK: Business Insider.

31 

Wunderlich, F. , & Memmert, D. (2018) . The Betting Odds Rating System: Using soccer forecasts to forecast soccer, PLoS ONE, 13: (6), e0198668.

32 

Wunderlich, F. , Weigelt, M. , Rein, R. , & Memmert, D. (2021) . How does spectator presence affect football? Home advantage remains in European top-class football matches played without spectators during the COVID-19 pandemic, PLoS ONE, 16: (3), e0248590.

Appendices

Appendix A: Sample results from betting simulations

Table A1

Details of profitability for case in Table 10: Season = 2010/11, Odds = Average, Bets = 1X2, Optimisation = ROI, and θ =10%

Model predictionsAverage bookmakers’ oddsBookmakers’ unnormalised predictionPayoff discrepancyBets simulatedReturns from bets
DateHTAT1X2p(1)p(X)p(2)Odds(1)Odds(X)Odds(2)p(1)p(X)p(2) θ(1) θ(X) θ(2)Bet(1)Bet(X)Bet(2)Return(1)Return(X)Return(2)Profit
14/08/2010Aston VillaWest Ham10.620.220.161.963.304.030.510.300.250.11–0.08–0.091001.96000.96
14/08/2010WiganBlackpool20.330.260.411.823.454.500.550.290.22–0.22–0.030.19001004.53.5
26/09/2010WolvesAston Villa20.250.250.502.833.252.500.350.310.40–0.10–0.060.10001002.51.5
02/10/2010SunderlandMan UnitedX0.130.200.674.933.451.750.200.290.57–0.08–0.090.10001000–1
23/10/2010BirminghamBlackpool10.390.270.341.853.484.310.540.290.23–0.15–0.020.11001000–1
24/10/2010LiverpoolBlackburn10.730.170.101.663.645.430.600.270.180.12–0.11–0.081001.66000.66
01/11/2010BlackpoolWest Brom10.470.260.262.793.252.540.360.310.390.11–0.04–0.131002.79001.79
10/11/2010Man CityMan UnitedX0.250.250.502.573.222.750.390.310.36–0.14–0.060.14001000–1
27/11/2010BoltonBlackpoolX0.410.270.321.573.965.820.640.250.17–0.220.010.15001000–1
11/12/2010Aston VillaWest Brom10.620.220.162.103.303.530.480.300.280.15–0.08–0.131002.1001.1
11/12/2010StokeBlackpool20.410.270.321.623.865.410.620.260.18–0.210.010.14001005.414.41
12/12/2010TottenhamChelseaX0.250.250.512.843.282.490.350.300.40–0.10–0.060.10001000–1
13/12/2010Man UnitedArsenal10.620.220.161.953.403.920.510.290.260.11–0.07–0.091001.95000.95
28/12/2010SunderlandBlackpool20.410.270.321.603.815.820.630.260.17–0.210.000.15001005.824.82
28/12/2010West BromBlackburn20.360.270.371.823.484.490.550.290.22–0.19–0.020.15001004.493.49
05/01/2011ArsenalMan CityX0.630.210.151.943.493.860.520.290.260.12–0.07–0.11100000–1
05/01/2011EvertonTottenham10.480.260.262.663.242.640.380.310.380.10–0.05–0.121002.66001.66
15/01/2011West BromBlackpool10.330.260.411.783.684.480.560.270.22–0.23–0.010.19001000–1
16/01/2011LiverpoolEvertonX0.630.220.162.203.223.430.450.310.290.17–0.09–0.14100000–1
23/01/2011BlackburnWest Brom10.560.240.212.223.273.240.450.310.310.11–0.07–0.101002.22001.22
01/02/2011West BromWiganX0.410.270.321.713.585.130.580.280.19–0.17–0.010.13001000–1
12/02/2011Man UnitedMan City10.670.200.131.773.574.610.560.280.220.10–0.08–0.081001.77000.77
12/02/2011West BromWest HamX0.360.270.371.933.523.860.520.280.26–0.16–0.020.11001000–1
20/02/2011West BromWolvesX0.360.260.371.863.404.330.540.290.23–0.18–0.030.14001000–1
26/02/2011WolvesBlackpool10.360.260.371.833.584.280.550.280.23–0.18–0.020.14001000–1
19/03/2011Man UnitedBolton10.830.120.061.404.418.540.710.230.120.11–0.11–0.061001.4000.4
19/03/2011West BromArsenalX0.120.200.684.673.591.760.210.280.57–0.09–0.080.11001000–1
09/04/2011WolvesEverton20.250.250.502.553.252.800.390.310.36–0.14–0.060.14001002.81.8
11/04/2011LiverpoolMan City10.550.240.212.593.212.760.390.310.360.16–0.07–0.151002.59001.59
14/05/2011West BromEverton10.260.240.502.633.282.690.380.300.37–0.12–0.060.13001000–1
22/05/2011BoltonMan City20.330.260.414.923.721.690.200.270.590.13–0.01–0.19100000–1
22/05/2011Man UnitedBlackpool10.820.120.061.564.135.570.640.240.180.18–0.12–0.121001.56000.56
22/05/2011StokeWigan20.550.240.212.763.422.450.360.290.410.19–0.05–0.20100000–1
TOTAL1501822.66025.5215.18

Appendices

Table A2

Details of profitability for case in Table 13: Season = 2010/11, Odds = Average, Bets = AH, Optimisation = ROI, and θ= 11%

GoalsGoalModel predictionsAverage bookmakers’Bookmakers’ unnormalizedPayoff discrepancyBets simulatedReturns from bets
DateHTATHTATdiferAHp(1)p(2)Odds(1)Odds(2)p(1)p(2) θ(1) θ(2)Bet(1)Bet(2)Return(1)Return(2)Profit
14/08/2010WiganBlackpool04–400.450.551.323.190.760.31–0.310.240103.192.19
21/08/2010ArsenalBlackpool606–20.280.721.762.120.570.47–0.280.240100–1
11/09/2010NewcastleBlackpool02–200.650.351.204.230.830.24–0.180.110104.233.23
25/09/2010West HamTottenham10100.510.492.541.470.390.680.11–0.19102.5401.54
23/10/2010BirminghamBlackpool202–0.50.390.611.852.010.540.50–0.150.110100–1
24/10/2010LiverpoolBlackburn211–0.750.690.311.852.020.540.500.15–0.19101.42500.425
30/10/2010Man UnitedTottenham202–10.650.352.041.830.490.550.16–0.19102.0401.04
01/11/2010BlackpoolWest Brom21100.640.361.991.810.500.550.14–0.20101.9900.99
10/11/2010Man CityMan United00000.330.671.821.980.550.51–0.220.1601010
20/11/2010BirminghamChelsea1010.750.360.641.882.000.530.50–0.170.140100–1
27/11/2010BoltonBlackpool22000.560.441.204.170.830.24–0.270.2001010
11/12/2010Aston VillaWest Brom21100.800.201.502.490.670.400.13–0.20101.500.5
11/12/2010StokeBlackpool01–1–10.240.762.071.820.480.55–0.240.210101.820.82
26/12/2010Aston VillaTottenham12–100.570.432.221.640.450.610.12–0.181000–1
28/12/2010SunderlandBlackpool02–2–10.250.752.061.810.490.55–0.240.200101.810.81
28/12/2010West BromBlackburn13–2–0.50.360.641.822.050.550.49–0.190.150102.051.05
29/12/2010ChelseaBolton101–1.50.620.381.981.890.510.530.12–0.151000–1
01/01/2011Man CityBlackpool101–1.50.370.631.832.030.550.49–0.170.130102.031.03
05/01/2011ArsenalMan City000–0.50.630.371.941.930.520.520.12–0.151000–1
05/01/2011EvertonTottenham21100.650.351.901.920.530.520.12–0.17101.900.9
15/01/2011West BromBlackpool321–0.750.260.741.961.910.510.52–0.250.220100.5–0.5
16/01/2011LiverpoolEverton22000.800.201.552.410.650.410.16–0.2210100
01/02/2011West BromWigan220–0.750.340.661.891.980.530.51–0.190.160101.980.98
12/02/2011Man UnitedMan City211–0.750.620.381.991.890.500.530.12–0.15101.49500.495
12/02/2011West BromWest Ham330–0.50.360.641.921.960.520.51–0.160.130101.960.96
20/02/2011West BromWolves110–0.50.360.641.882.010.530.50–0.170.140102.011.01
26/02/2011WolvesBlackpool404–0.50.360.641.832.050.550.49–0.180.150100–1
05/03/2011ArsenalSunderland000–10.670.331.822.060.550.490.12–0.161000–1
19/03/2011Man UnitedBolton101–1.250.700.302.011.870.500.530.20–0.23100.50–0.5
19/03/2011West BromArsenal2200.750.370.631.842.030.540.49–0.180.140100–1
03/04/2011FulhamBlackpool303–10.310.691.921.940.520.52–0.210.170100–1
09/04/2011Man UnitedFulham202–10.700.301.862.010.540.500.16–0.20101.8600.86
09/04/2011WolvesEverton03–300.340.661.822.000.550.50–0.210.1601021
10/04/2011BlackpoolArsenal13–21.50.650.351.872.000.530.500.11–0.151000–1
11/04/2011LiverpoolMan City30300.720.281.861.970.540.510.19–0.23101.8600.86
16/04/2011West BromChelsea13–20.750.360.641.951.940.510.52–0.160.130101.940.94
07/05/2011TottenhamBlackpool110–1.50.400.601.812.050.550.49–0.160.120102.051.05
14/05/2011SunderlandWolves13–200.650.351.881.950.530.510.12–0.171000–1
14/05/2011West BromEverton10100.340.661.891.940.530.52–0.190.140100–1
15/05/2011ArsenalAston Villa12–1–1.250.400.601.812.070.550.48–0.150.110102.071.07
22/05/2011BoltonMan City02–20.750.670.332.001.880.500.530.17–0.201000–1
22/05/2011Man UnitedBlackpool422–10.770.232.001.880.500.530.27–0.3110201
22/05/2011StokeWigan01–100.720.282.001.850.500.540.22–0.261000–1
TOTAL202320.1131.648.75

Notes

1 The other handicaps do not share the same market liquidity; implying limited stakes and possibly also higher profit margins, for the bookmaker, due to lower competition.

Notes

2 A zero-goal AH implies no handicap, but that there must be a match winner; otherwise, the bet is voided.

Notes

3 The decision to discretise RDL represents a practical choice for Hybrid BN modelling. In this case, discretising RD into RDL was necessary to capture conditional Beta-Binomial relationships from Possession to Goals scored, given the rating difference between adversaries.

Notes

4 For the away team (i.e., AT) it is (1 - P) × 90.