Home > Soccer Forum > World of Soccer > Statistics and Analysis

Reply
 
Thread Tools Search this Thread Rate Thread Display Modes
Old 11 Feb 2004, 05:04 PM   #1
ChrisE
BigSoccer Member+
 
Join Date: Jul 2002
Location: Chicago, IL
Default Winning Percentage, Goal Differential, and the Shootout

Quote:
Originally posted by maxim-1
Well the reason I was using PTS/GP was that the ultimate object of soccer is obviously to get points. GF/GA allowed would be interesting to do but I think PTS/G is a little more accurate in terms of teams' success.


Well, there's this idea, in baseball at least, that runs differential is in fact a more accurate evaluation of how good a team is than how many games they win. It's because doing things like winning one-run games presumably has a lot more to do with luck than with skill. Admittedly, strategy may play a more important role in preserving one goal leads in soccer, but I think it's at least something worth looking at.

Quote:

It's easy enough to go back and recalibrate the pre-2000 point numbers, just look at how many SOW they had to figure otu the number of total ties and adjust from there. The better question is however, is it doing a diservice to the data by rearranging it in that way? Before 2000 that's not the way that MLS operated. I'm trying to look at all-time numbers and get consistent results from data by taking away SOW. But in the process it may be affect the true relative success of a team pre-2000. I wonder what you think about that? When doing any kind of historical analysis should you just let the data in terms of points per game stay inconsistent by respecting the infamous Shoot Out Wins?


I don't have any problem taking out shootout wins (and losses), or more particularly just calling them ties, I think they're pretty much random. Furthermore, since a shootout win was only worth one point (and you've only got a 50% chance of winning it), I can't imagine anybody playing for a shootout; you're not distorting anything about how teams played by extracting shootout wins.

Quote:

Also, another way to measure this data wich I didn't include was team winning percentage. Honestly I've got to look how they calibrate that in MLS. It's a league statistic that has always struck me as odd, looking at winning percentages when you're incorporating ties. I imagine the way to do it would be (3W+T)/(3GP) so you have the number of points gained in a season by the number of possiable ones. If anyone knows otherwise then please do tell.


In the spermatological stats and analysis thread, somebody mentioned (or I believed linked to an article) indicating that scoring is not particularly increased whether a team gets 2 points for a win or 3. So I think there's two ways of doing this: one, the way you described, points won/points available, but also something like (W+1/2T)/(W+L+T) more akin to winning percentage.

Last edited by ChrisE; 11 Feb 2004 at 05:19 PM.
ChrisE is offline   Quote 

TRY BIGSOCCER
NOW!
NEWS, SCORES & TABLES FOR 1,300 CLUBS

Connect in the web's largest forums.
Blog about soccer from your point of view.
Shop 17,000 authentic soccer items.




On sale for $64.99
at our soccer store

On sale for $14.99
or buy soccer jerseys

Old 11 Feb 2004, 05:14 PM   #2
ChrisE
BigSoccer Member+
 
Join Date: Jul 2002
Location: Chicago, IL
Default

To begin with, these are the shootout percentage (W/[W+L])of the league from its institution in 1996 to its unfortunate demise in 1999:

Code:

SO Record	1996	1997	1998	1999
				
Colorado	0.333	0.400	0.500	0.667
Columbus	0.444	0.429	0.000	0.600
D.C.	0.250	0.500	0.700	0.625
Dallas	0.625	0.600	0.667	0.333
K. City	0.714	0.778	0.333	0.250
L.Angeles	0.500	0.333	0.500	0.429
Metros	0.600	0.500	1.000	0.429
N. Eng	0.750	0.500	0.333	0.417
San Jose	0.333	0.333	0.375	0.769
Tampa Bay	0.250	0.600	0.167	0.417
Miami			1.000	0.556
Chicago	0.667	0.375		
Running a correlation between a team's performance one year and their performance the subsequent year gives a very very small correlation of 0.043 (I don't know a lot about statistics but I know that's not very good). Now, the problem may just be that we're using extremely small samples here (average of 3.5 shootouts per team per year), I have no idea.

Clubs' all-time shootout percentages look like this:

Code:

Chicago	0.455
Colorado	0.500
Columbus	0.419
D.C.	0.567
Dallas	0.536
K. City	0.533
L.Angeles	0.440
Metros	0.579
N. Eng	0.500
San Jose	0.487
Miami	0.714
Tampa Bay	0.370

which looks like a pretty random distribution to me.
ChrisE is offline   Quote 
Old 11 Feb 2004, 05:26 PM   #3
ChrisE
BigSoccer Member+
 
Join Date: Jul 2002
Location: Chicago, IL
Default

So, assuming that shootouts were pretty much random, I counted shootout wins and shootout losses as ties. Then, for comparison's sake, I drew up two ways of measuring how good a team is, points/points available and winning %(where ties are counted as half win and half loss); if anyone wants me to post the w-l-t numbers, just ask.

Here were those two:

Code:

Win %	1996	1997	1998	1999	2000	2001	2002	2003
Chicago	------	------	0.609	0.594	0.625	0.685	0.464	0.633
Colorado	0.375	0.453	0.500	0.578	0.469	0.346	0.536	0.483
Columbus	0.484	0.484	0.547	0.563	0.422	0.615	0.482	0.467
D.C.	0.531	0.656	0.688	0.688	0.344	0.346	0.411	0.483
Dallas	0.500	0.484	0.438	0.641	0.500	0.481	0.554	0.283
K. City	0.484	0.578	0.406	0.313	0.641	0.463	0.482	0.517
L.Angeles	0.594	0.531	0.750	0.641	0.563	0.635	0.625	0.450
Metros	0.453	0.406	0.422	0.234	0.578	0.558	0.429	0.517
N. Eng	0.406	0.469	0.375	0.406	0.500	0.370	0.464	0.550
San Jose	0.516	0.422	0.438	0.484	0.344	0.615	0.554	0.617
Miami	------	------	0.391	0.391	0.453	0.712	------	------
Tampa Bay	0.656	0.516	0.438	0.469	0.563	0.185	------	------
and:

Code:

Pts	1996	1997	1998	1999	2000	2001	2002	2003
Chicago	------	------	0.594	0.552	0.594	0.654	0.440	0.589
Colorado	0.344	0.427	0.479	0.531	0.448	0.295	0.512	0.444
Columbus	0.438	0.448	0.521	0.510	0.396	0.577	0.452	0.422
D.C.	0.510	0.615	0.635	0.646	0.313	0.333	0.381	0.433
Dallas	0.458	0.458	0.406	0.594	0.479	0.449	0.512	0.256
K. City	0.448	0.531	0.375	0.271	0.594	0.444	0.429	0.467
L.Angeles	0.552	0.500	0.729	0.604	0.521	0.603	0.607	0.400
Metros	0.427	0.385	0.406	0.198	0.563	0.538	0.417	0.467
N. Eng	0.365	0.427	0.344	0.344	0.469	0.333	0.452	0.500
San Jose	0.469	0.375	0.396	0.417	0.302	0.577	0.536	0.567
Miami	------	------	0.365	0.344	0.427	0.679	------	------
Tampa Bay	0.635	0.490	0.406	0.406	0.542	0.173	------	------
They are appreciably different, but they have a .993 correlation, so I don't really think it matters which you use.
ChrisE is offline   Quote 
Old 11 Feb 2004, 05:36 PM   #4
mpruitt
BigSoccer Member+
 
mpruitt's Avatar
 
Join Date: Feb 2002
Location: E. Somerville

Supporter: New England Revolution
Default

Thanks for the response. For my mind the most accurate way to tally winning perentage would be total points by possiable points. While indicating that a tie is a half of a win makes sense, it's not quite in line with how it's recorded. However, I still am wondering how MLS records winning percentage for official purposes.

In terms of SOW, I think that I probably just should re-calibrate the SOW into ties and adjust from there. It's not our fault that MLS had a stupid idea, and really the point would be to analyze historical trends or compare teams over a historic basis. Taking away the SOW certainly would be in line with trying to diminish the occurance of luck as a factor in greatness.

As to running GF/GA as a more accurate means of measuring a teams greatness, thats just fine when comparing anything as a factor related to a teams greatness the more ways the better. It would just tell us something different. Whether goal differential is a better determination of a team's greatness than winning percentage, I think that's getting a bit ahead of ourselves.
mpruitt is online now   Quote 
Old 11 Feb 2004, 05:36 PM   #5
ChrisE
BigSoccer Member+
 
Join Date: Jul 2002
Location: Chicago, IL
Default

Furthermore, I thought I'd compare them to goal differential. So, I used a formula that beineke had suggested, and that seemed simple enough for me to implement, of GF/(GF+GA). This is what you get:

Code:

beineke	1996	1997	1998	1999	2000	2001	2002	2003
Chicago	------	------	0.579	0.586	0.568	0.625	0.531	0.552
Colorado	0.427	0.459	0.473	0.494	0.422	0.434	0.473	0.471
Columbus	0.496	0.506	0.545	0.552	0.453	0.576	0.506	0.500
D.C.	0.525	0.569	0.607	0.602	0.411	0.457	0.437	0.514
Dallas	0.510	0.529	0.422	0.607	0.500	0.505	0.506	0.354
K. City	0.492	0.528	0.474	0.384	0.618	0.384	0.451	0.522
L.Angeles	0.546	0.556	0.659	0.628	0.560	0.591	0.571	0.500
Metros	0.489	0.448	0.462	0.333	0.533	0.521	0.466	0.500
N. Eng	0.434	0.430	0.445	0.418	0.490	0.402	0.500	0.539
San Jose	0.500	0.482	0.444	0.495	0.412	0.618	0.563	0.563
Miami	------	------	0.404	0.416	0.491	0.613	------	------
Tampa Bay	0.564	0.478	0.447	0.505	0.554	0.320	------	------
(correlation to the other two record-based numbers was approximately .92)

So then, in a rather dubious attempt to see which is a more accurate measure of how good a team is, I decided to see how good a club's individual season's percentage was at predicting performance for the club in the next year. Maybe there's a better way, but I've recently been introduced to the correlation function in Excel, so that's what I used. You get:

Code:

Comparison	%-%	Pts-%	b.-%		Pts-Pts	b.-Pts		b.-b.
Correlation	0.181	0.179	0.230		0.191	0.252		0.242
So, in predicting the winning percentage, pt's/p.pt's, or GF/GA ratio of a club from season to season, beineke's GF/GA number was a good deal better (though none was particularly good). I don't know if this is significant or what, but I think it makes a pretty good case that GF/GA is at least as good a predictor as any kind of win%.
ChrisE is offline   Quote 
Old 15 Feb 2004, 06:34 PM   #6
beineke
BigSoccer Member
 
Join Date: Sep 2000
Default

Quote:
Originally posted by ChrisE
I decided to see how good a club's individual season's percentage was at predicting performance for the club in the next year. Maybe there's a better way, but I've recently been introduced to the correlation function in Excel, so that's what I used.
Correlation is a very good way to look at these things ... this is a nice finding.

A success story of goal percentage is Chicago this past season. Even though they had a poor win percentage (.464) in 2002, their goal percentage (.531) was third best in the league. That was a solid predictor of their success in 2003.

Bodes well for DC this coming year, not so well for Colorado.
beineke is offline   Quote 
Old 20 Feb 2004, 01:11 AM   #7
ChrisE
BigSoccer Member+
 
Join Date: Jul 2002
Location: Chicago, IL
Default

As a postscript:

I admittedly don't know a whole lot about correlations, so I was thinking, maybe regression to the mean, the tendency of exceptionally good teams to be worse the next year and exceptionally poor teams to be better, was diluting the strength of the year-year+1 correlations.

So, I divided winning percentages into two categories: greater than or equal to .500, and less than .500. I then did the same calculation (read: typing) as I had previously, comparing the winning percentages to those of the subsequent year. I got what, in my mind, was a truly surprising result.

When winning percentages were above .500, the correlation to winning percentages in the subsequent year was pretty close to what I originally calculated, .26 (vs. an original .18). However, for teams below .500, things were very different. Instead of simply getting a slightly stronger result, I got the opposite: a correlation of -.05.

Results were even stronger using goal differential rather than using win %:
>.500: .217
<.500: -.268

I had a pretty good guess what the deal was with winning percentages: while really good teams tend to be really good in subsequent years, and average teams tend to be pretty much average, the teams that end up at the bottom are propped up with allocations and high draft picks etc. in the name of parity. I've got no answers, however, as to why the worst goal differential teams would tend to be better than the slightly below average ones...

(Of course, the numbers probably mean less than they appear to. Since there have only been 88 team-seasons played in MLS history, we're looking at a really small sample. One significantly different result, e.g., the 2000 Metros sucking about as bad as did the 1999 Metros, would make things look pretty significantly different. I should probably be checking p values or something...)

Last edited by ChrisE; 20 Feb 2004 at 01:32 AM.
ChrisE is offline   Quote 
Old 20 Feb 2004, 02:38 AM   #8
ChrisE
BigSoccer Member+
 
Join Date: Jul 2002
Location: Chicago, IL
Default

Some more semi-related stuff:

Teams who score fewer goals should, one assumes, expect to have more ties. Testing this out using my now ubiquitous correlation function, we find exactly that: we get a correlation of -.379 between total goals scored/game and ties/game.

Likewise, teams who have more ties should expect to have winning percentages closer to .500. So I divided the the total goals/game numbers into a top half and a bottom half. Unsurprisingly, their average winning percentages are both very close to .500, .497 for fewer goals, .503 for more goals. However, their records' standard deviations are markedly different: .0924 for the fewer goals group, .1239 (34% greater) for more goals.

I think that any formula that attempts to predict team performance ought to take this into account.


(further sidenote: It occured to me that there also should be something of a positive correlation between goals scored for per game and goals scored against. Teams who play an aggressive game should not only score more, but should also be scored against more. Likewise, conservative teams should not only concede fewer goals, but also score fewer. Surprisingly, I got a weak negative correlation (-.116) between goals for and goals against. Perhaps team differences in ability are strong enough to drown this out, I really don't know.)

Last edited by ChrisE; 20 Feb 2004 at 03:00 AM.
ChrisE is offline   Quote 
Old 20 Feb 2004, 03:08 PM   #9
ChrisE
BigSoccer Member+
 
Join Date: Jul 2002
Location: Chicago, IL
Default

So most of my previous post was wrong.

I tested the standard deviation win% of high goals/low goals teams because I figured it would be a slam dunk with ties: teams with more ties would have significantly lower standard deviations than teams with more ties. Not the case, apparently:

Code:

	Win %	St. Dev
More Ties	0.511	0.111
Fewer Ties	0.489	0.106
The goals scored effect was created (I think) because teams that are really good tend to score more goals than average teams (so are in the top half) and teams that are really bad tend to be scored on more than average (and so are also in the top half). If you eliminate the teams above .600 and below .400, and divide them into high goal and low goal groups you get almost exactly the same win % variance.

The negative correlation for goals scored disappears when you get rid of the really good and really bad teams, too. For the same .400-.600 range (which consists of 57/88 teams), you get a correlation between goals scored and goals against of .379.
ChrisE is offline   Quote 
Old 20 Feb 2004, 03:17 PM   #10
ChrisE
BigSoccer Member+
 
Join Date: Jul 2002
Location: Chicago, IL
Default

Interested in the >.500/<.500 effect, I divided historical MLS winning percentages into quartiles.

This is what comes out:

Code:

	yr. 1	yr. 2	Correlation
1st	0.375	0.461	-0.437
2nd	0.463	0.517	-0.073
3rd	0.530	0.473	-0.008
4th	0.638	0.549	0.168

Of interest here is that we see some pretty significant recovery by the bottom quartile, and, most interestingly, the worse the team is, the worst teams seem to perform better than the best of the worst. Teams in the middle quartiles seem to drift around aimlessly, with slightly below .500's tending to improve and the above .500's tending towards decline. The fourth quartile was the only group where one year's positive performance was predictive of positive performance in the next year.
ChrisE is offline   Quote 
Share

Reply

  Home > Forums > World of Soccer > Statistics and Analysis


On sale for $102.44
at our soccer store

On sale for $39.99
or buy soccer jerseys

Share
Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes Rate This Thread
Rate This Thread:

Forum Jump

World of Soccer
On The Pitch
Equipment & Gear
Soccer Store
England
Europe
USA
Americas
Asia, Oceania & Africa
Women's Soccer
Not Soccer Related
Customer Service







All times are GMT -5. The time now is 11:24 AM.



 

Copyright © 2009 Big Internet Group, LLC. All rights reserved. PRIVACY POLICY. TERMS OF USE.
The BigSoccer name and logo and 'Share the Passion!' are service marks of Big Internet Group, LLC.
The BIG Network: Soccer | Aussie Rules Football | Travel | Cricket | Lacrosse | Music
Views expressed by the bloggers and users of BigSoccer do not represent the views of Big Internet Group, LLC.