View Full Version : Sabermetrics applying to Soccer
Pages :
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
[
16]
mpruitt
03 Jun 2009, 08:16 PM
ORLY?
http://sports.espn.go.com/espn/news/story?id=4206427
Dr.Marin
04 Jun 2009, 03:02 AM
I'm a huge sabermetrics buff, and consider myself fairly knowledgeable. Soccer is my favorite sport, and the one I really grew up loving, though. It's clear to me that objective analysis is really needed in the sport, though it's also clear that there are many obstacles in the way just by the nature of the sport. I see the resources thread here, and it's fairly disappointing to be honest. I thought we were further along. Are there any primers or books I should read regarding soccer sabermetrics (we also need a cool term!)?
I do have some ideas of my own, though I'm not sure how stupid they are unrealistic....:o
Dr.Marin
04 Jun 2009, 03:34 AM
And I'm going to read through this thread, as I see it has some fascinating stuff.
voros
04 Jun 2009, 04:21 AM
I do have some ideas of my own, though I'm not sure how stupid they are unrealistic....:o
There's a lot you can do team by team, but individual player stuff is hard. I've always thought a "win shares" like approach could be much more useful here than it actually is in baseball. If you can figure out how good a team is (possible), and then divide up credit within that team (harder but also approachable) it can be done.
blank_frackis
05 Jun 2009, 08:46 AM
There's a lot you can do team by team, but individual player stuff is hard. I've always thought a "win shares" like approach could be much more useful here than it actually is in baseball. If you can figure out how good a team is (possible), and then divide up credit within that team (harder but also approachable) it can be done.
This is pretty difficult to do in practice, but sorely needed. One of the greatest myths in the game today is that a goalscorer is necessarily the most important player in a given team. In terms of value this might well be the case, but there are numerous examples where a player with a great goalscoring record is far less important to a team's success than a creative midfielder, or a central defender, or the forward who plays alongside the main striker.
One of the most glaring examples of this principle is Kris Boyd (striker at Rangers in the Scottish Premier League) who has a phenomenal goalscoring record, but is severely lacking in terms of general play (passing, technique, first touch) to the extent that he undermines much of the good work he does by finishing chances. Indeed, his club side actually has a greater goals per game record without him playing in the last two seasons than they do with him. Try and make this point in the present climate of footballing opinion, however, and you'll be met with a torrent of cliches about "goals winning games" or whatever else.
Some equivalent to win shares would solve this problem, but despite the difficulty in creating such a statistic there's some hope on that front if you consider that the tendency to quote goalscoring records as an indicator of a player's worth to the team is itself an attempt at statistical analysis - just an exceptionally poor one. There is more of a market for statistics than the knee jerk reaction of many football fans would suggest.
przeszczepan
09 Jun 2009, 05:15 AM
This is pretty difficult to do in practice, but sorely needed. One of the greatest myths in the game today is that a goalscorer is necessarily the most important player in a given team. In terms of value this might well be the case, but there are numerous examples where a player with a great goalscoring record is far less important to a team's success than a creative midfielder, or a central defender, or the forward who plays alongside the main striker.
Hi Guys,
You will find an approach addressing this (and other) issues in this article (http://www.ingentaconnect.com/content/uwic/ujpa/2008/00000008/00000002/art00007).
arkanian215
09 Jun 2009, 02:55 PM
I don't see this working for soccer. I'm surprised they have distance covered and passes attempted and converted. however, stats in soccer are rare and often are not independent of the skill of other players around them or their opposition, even if we were to measure every single thing each player did out there.
RichardHKirkando
10 Jun 2009, 12:51 AM
There's a lot you can do team by team, but individual player stuff is hard. I've always thought a "win shares" like approach could be much more useful here than it actually is in baseball. If you can figure out how good a team is (possible), and then divide up credit within that team (harder but also approachable) it can be done.
I'm actually trying to do exactly that. I have some of what I've done on my site (http://championshipatbest.com/), which is mostly Fulham-specific at this point, but I'm in the process of doing another site covering a bit more, and also including a searchable/sortable database with all of my stats.
A lot of the calculations are borrowed from the Win Shares formulas. I've calculated marginal goals for each team, and then broken those down into marginal goals scored, prevented by outfield defence, and prevented by goalkeepers. Then, they are assigned to individuals based on lots of things, but for main areas - goalscoring, playmaking, defensive actions, and a possession score (tendency to retain possession vs giving the ball away). The end result is a score that can be read like "a team of player_x would win y games in a season". So really, you could divide by 11 and get the player's value in terms of actual wins, but I feel that its a little more presentable as-is.
One of the things I've tried really hard to do is eliminate elements of luck and the more team-oriented things. For example, instead of using assists in my playmaking calculations, I use assist attempts (any pass leading to a shot), since that doesn't rely on another player's shooting ability. I've also adjusted each goalkeeper's save percentage based on the quality of shots faced. There's much more refining to be done, but I like the results so far.
I do agree with most of what you've said, and so do my findings. Central midfielders are generally the most valuable players, although the very best goalkeepers will out-produce them slightly. The value of forwards is difficult to explain - I think you're right about forwards whose "general play" is lacking being less valuable. For example, Kevin Davies was the league's 8th leading scorer, but was only the 14th most valuable forward according to my system. Dirk Kuyt and Wayne Rooney had the same number of goals, but both were more than a full win better than Davies. As you can see, I've concentrated on the Premier League so far since there is a lot more data available, but I'll eventually start doing other leagues as well.
Vandervaart
14 Jun 2009, 05:02 PM
Surely strikers are the most valuable players, as the object of the game is to win, and to do that you need to score goals, which are scarse. For instance, I reckon Torres and Villa to be more valuable than Xavi and Iniesta.
voros
15 Jun 2009, 01:25 AM
Surely strikers are the most valuable players, as the object of the game is to win, and to do that you need to score goals, which are scarse. For instance, I reckon Torres and Villa to be more valuable than Xavi and Iniesta.
In terms of this thread and this board, the question I guess would be then to try and examine the evidence for that.
Undoubtedly that's possible and certainly the attacking geniuses _seem_ to command higher transfer fees. I also am somewhat sympathetic to that viewpoint, because I think announcer types can really go overboard on the praise for central midfielders, particularly here in the states.
But the question for the thread is how to do we go about finding out whether this is actually true or not.
RichardHKirkando
16 Jun 2009, 12:10 AM
Surely strikers are the most valuable players, as the object of the game is to win, and to do that you need to score goals, which are scarse. For instance, I reckon Torres and Villa to be more valuable than Xavi and Iniesta.
In some ways, yes. Great forwards are certainly more scarce than great midfielders - according to my findings, only 22 forwards in the Premier League accounted for 1 or more wins on their own, while 50 defenders and 56 midfielders did the same. I think its very likely that having a top 5 striker can give you a bigger competitive advantage over your opponent than having a top 5 midfielder or defender would. Average strikers are more interchangeable, their results depending more on service from the midfield than their own skill.
Example: Aston Villa is playing Fulham. Their top striker, Gabriel Agbonlahor, is worth 2.5 wins over replacement. Fulham's Andy Johnson is worth just under 1 win over replacement. In the midfield, you have Gareth Barry at 3 wins vs Danny Murphy at 2 wins, which is obviously a smaller difference.
So when you're talking about the extreme high-end, those game-changing strikers can be more valuable. If one of those players is not obtainable though, its best to build from the midfield and defence.
przeszczepan
17 Jun 2009, 10:06 AM
Your work is impressive, Colin.
I’ve gone through the recent 7 pages of your blog and there are certainly lots of imaginative concepts and loads of good work done out there.
What I am missing there is a kind of glossary with definitions of the statistics you are putting to practical use throughout your blog. It took me some time to dig through your earlier posts to find what you mean by all the abbreviations. I am not saying it was unpleasant, though :)
I particularly liked the 2008/09 Premier League Shot Quality bit. Have you ever looked at how a logistic regression approach values shots on goal depending on the circumstances (distance from goal, angle, proximity of the defender, previous action, …) it has been taken in?
The paper I have linked to in my previous post uses it as a starting point and takes it a step further to estimate the goal scoring potential in each game situation. Knowing that, each action performed on the field can be assessed based on how much of that goal scoring potential is created or lost by performing it. As a result, players’ performance and various types of team moves can be assessed based on how much they contributed to the chance of scoring a goal by their own team and by the opposition.
Some of the desirable characteristics of this methodology include:
- a goal scored by a striker is not exclusively his own contribution if it has been worked out all the way through the midfield; similarly merits for a goal conceded are distributed as a result of the opposite team’s build up not opposed to the sufficient extent by the defending side and not only goalkeeper’s fault for example;
- a great pass opening the striker’s way to the goal remains a great pass if the latter fails to convert it to a goal; if he does fail, the passer is rewarded for the opportunity he had created irrespective of the striker’s action and the striker is punished for the missed chance (even though he is rewarded for finding himself in the good position); the goalkeeper shares the merits (positive or negative) with the striker if the latter had tested him;
- in preventing the opposite side form getting their goals, defending creates results (victories or losses) as much as the offensive actions do;
- a pass into the penalty area allowing strikers to get into a shooting position is much more valuable than a pass in the middle of the field; similarly a key tackle of the last defender which prevents the one-on-one striker vs. keeper situation is worth more than any other.
One of the things I find the most appealing in this approach is that the goal potential created by all the players on the teams sums up to the number of goals scored and conceded by those teams by definition.
The methodology developed in this paper allows for a wide range of practical applications:
1. Comparison of various options of playing the ball available to a player in a given area of the field under given circumstances. Separation of irrational strategies from those that should be chosen between, given the current result of the game and time remaining to the end of it.
2. Identification of strengths and weaknesses of a given team based on the effectiveness of past performance in a given zone of the field and with a given type of action. Particularly effective patterns of play may also be recognized as a series of productive actions.
3. Quantification of players’ input in the result of a game which combines both the amount and the quality of their effort. In the short run this could form the foundation of a team selection process and in the long run may be one of the fundamentals for pricing players.
Szczepański, Łkasz: International Journal of Performance Analysis in Sport (http://www.ingentaconnect.com/content/uwic/ujpa;jsessionid=es5k6fhhsi8qs.alice), Volume 8, Number 2, July 2008 , pp. 55-66(12)
happyforever
18 Jun 2009, 06:20 PM
There has been a whole thread on this issue before:
http://www.bigsoccer.com/forum/showpost.php?p=5456273&postcount=20
I seriously admire people working on it, but fear that if you want to get the best grading for different players, go and ask the Asian bookies. They simply put their wallet where their mouth is!
Vandervaart
13 Jul 2009, 02:39 PM
Off topic and probably already mentioned dozens of times in the beginning of this thread, read Moneyball last weekend. Great book. Too bad that it was published before Epstein's success at the Red Sox.
ElChupacabra67
30 Sep 2009, 09:53 PM
Your work is impressive, Colin.
I’ve gone through the recent 7 pages of your blog and there are certainly lots of imaginative concepts and loads of good work done out there.
What I am missing there is a kind of glossary with definitions of the statistics you are putting to practical use throughout your blog. It took me some time to dig through your earlier posts to find what you mean by all the abbreviations. I am not saying it was unpleasant, though :)
I particularly liked the 2008/09 Premier League Shot Quality bit. Have you ever looked at how a logistic regression approach values shots on goal depending on the circumstances (distance from goal, angle, proximity of the defender, previous action, …) it has been taken in?
The paper I have linked to in my previous post uses it as a starting point and takes it a step further to estimate the goal scoring potential in each game situation. Knowing that, each action performed on the field can be assessed based on how much of that goal scoring potential is created or lost by performing it. As a result, players’ performance and various types of team moves can be assessed based on how much they contributed to the chance of scoring a goal by their own team and by the opposition.
Some of the desirable characteristics of this methodology include:
- a goal scored by a striker is not exclusively his own contribution if it has been worked out all the way through the midfield; similarly merits for a goal conceded are distributed as a result of the opposite team’s build up not opposed to the sufficient extent by the defending side and not only goalkeeper’s fault for example;
- a great pass opening the striker’s way to the goal remains a great pass if the latter fails to convert it to a goal; if he does fail, the passer is rewarded for the opportunity he had created irrespective of the striker’s action and the striker is punished for the missed chance (even though he is rewarded for finding himself in the good position); the goalkeeper shares the merits (positive or negative) with the striker if the latter had tested him;
- in preventing the opposite side form getting their goals, defending creates results (victories or losses) as much as the offensive actions do;
- a pass into the penalty area allowing strikers to get into a shooting position is much more valuable than a pass in the middle of the field; similarly a key tackle of the last defender which prevents the one-on-one striker vs. keeper situation is worth more than any other.
One of the things I find the most appealing in this approach is that the goal potential created by all the players on the teams sums up to the number of goals scored and conceded by those teams by definition.
I've been working on applying Sabermetrics and linear weights in particular as applied to soccer for the last few months. I've come up with one measure of Player Production: http://nordeckeluchador.blogspot.com/2009/08/introducing-el-luchadors-player.html
but it needs work. The measures of Team production I'm more happy with but am still doing much needed research. Stumbling on this thread was a godsend.
brew42
06 Nov 2009, 05:59 AM
Hey
I have recently developed a Gaelic Football/Hurling Tracker for the iPhone. Check it out in the iTunes Store @ iTunes.com/apps/gaatracker. Its something we use quite a lot for gathering & viewing stats on Gaa games.
Would people use a similar app for soccer. Check out Footy Tacker @ iTunes.com/apps/footytracker
I think it could be useful for minor teams that don't have premiership budgets