Forums
There is currently 1 person viewing this thread.
frog2
08 Mar 17 10:00
Joined:
Date Joined: 01 Feb 08
| Topic/replies: 4,229 | Blogger: frog2's blog
I have mainly gambled on horseracing over the last couple of decades using my own computer generated ratings. The last 18 months or so I have been working on a pet project to generate automated cricket ratings for every player for every major match (all the ones on TV basically). Cricket is the sport I love whereas racing bets pay the bills.

I have made these available to anyone interested on cricsq dot com. I would be interested what cricket gamblers on here think of methodology I have used and if the ratings look about right or way off the mark. Any opinions would be welcome.

Post your reply

Text Format: Table: Smilies:
Forum does not support HTML
Insert Photo
Cancel
Page 1 of 3  •  Previous 1 | 2 | 3 | Next
sort by:
Show
per page
Replies: 98
By:
pxb
When: 08 Mar 17 12:35
I'll have a look at it later.
By:
frog2
When: 08 Mar 17 13:41
Thanks
By:
Fatslogger
When: 08 Mar 17 18:02
Interesting. I tend to regard the ICC rankings as a pretty good guide to merit, with a few significant caveats, the two largest being overall sample size issues and over weighting of historical performances for obviously deteriorating players (well, apparently obviously - one could argue that saying a player is past it is itself getting very caught up on small sample sizes).
By:
DStyle
When: 08 Mar 17 18:08
this is so hard.

it's an idea i've been mulling over for some time as well.

i'm not suggesting that it's impossible to improve on the current way the data is analysed and compiled to provide better indicators of performance, but i still think that data alone isn't enough.

my problem is that so often the nature of dismissal as well as missed chances and unrewarded spells can all only be compiled from watching a match. and that batting is as dependent on the quality of the bowling as it is on a batsman's skill. I noticed you gave Brathwaite a high rating in the world t20 final, but what choice did he have, and how awful was Stokes' decision making. I couldn't give Brathwaite any more than a third of the rating points I would award Kohli for his semi final knock against Australia. I find that a huge factor in t20. **** or bust situations regularly present themselves and there will always be a risk that noise is mistaken for achievement.

i came to the conclusion that you'd need knowledgeable cricket fans, familiar with cognitive biases, to amend a technically derived score (the structure of which they understood) at the end of each match, and you'd need to keep an audit of amendment process to ensuring consistency.
By:
frog2
When: 08 Mar 17 18:52
I can see your point with Brathwaite. I was not sure about using that example as I was not happy he was given so many more points than Samuels.

The trouble with T20 internationals is there are so few games and the associate teams play the associate teams and the top teams play the top teams. Brathwaite is rated 11th in t20 internationals and 214th in T20 league games! Clearly that is not right but he only had a handful of matches in T20is.

Kolhi is an interesting one. Clearly an amazing player but 9 years with RCB and still no title despite thousands of runs. I think he is too slow in the powerplay and the middle overs when setting totals. He then explodes from 16 onwards. He doesnt seem to increase it much in the middle overs even if there are plenty of wickets in hand.

With ratings I do on horseracing they also sometimes produce strange results but as a whole they are a good guide. I find if you start adding human input you cannot repeat them in an unemotional way. But I can see how for cricket this might be required. Thanks for the feedback.
By:
Fatslogger
When: 08 Mar 17 18:58
Very interesting post DS. I think all you really have are the data. Yes, there is a lot of noise and some serious sample size issues but as you clearly know and imply, there's no objective or reliable way of analysing performance otherwise. Clearly it is important to understand the limitations of the method used though.

The T20I thing is also interesting. I don't rate the ratings for them at all, because there are so few games played. The guys at the top might be the best players but I don't think there's much more than a vague tendency towards having any real idea. As you say, frog2, looking at league games with a far bigger sample size is probably more sensible but even then you're assessing a rather random format.
By:
frog2
When: 08 Mar 17 19:15
I agree the leagues are random and sometimes strange. But due to the number of games I think things tend to balance out. There is so much more data to go on but I cannot find anyone rating the players on the web. So for my T20 league batting you have a top 8 of:

1    AB de Villiers    56.9    Barbados Tridents
2    Andre Russell    53.6    Sydney Thunder
3    David Warner    52.5    Sunrisers Hyderabad
4    Kieron Pollard    48.8    Karachi Kings
5    Kevin Pietersen    47.9    Quetta Gladiators
6    Chris Lynn    47.5    Brisbane Heat
7    Virat Kohli    46.7    Royal Challengers Bangalore
8    Jos Buttler    46.6    Lancashire

Which I think feels reasonable.

I think the point about who is bowling to who etc is important. Maybe the algorithm needs to be more iterative to include this information once the initial ratings are worked out.
By:
Charlie
When: 08 Mar 17 19:22
If I'm reading your ratings correctly then some of the T20 League batting ratings look miles out to me (they're the only ones I've looked at so far). The top ranked players seem reasonable but as you go down the rankings then they seem to deteriorate.

Chris Broad ranked at 68 with so many good batsmen rated below him e.g. Bairstow ranked 99, Stokes 106, Hales 156, Faf 198, Root 220. Common sense tells us that something's deeply wrong with these.
By:
Charlie
When: 08 Mar 17 19:31
Just noticed Dernbach rated slightly higher Sam Billings!
By:
Charlie
When: 08 Mar 17 19:31
higher than Sam
By:
frog2
When: 08 Mar 17 19:42
Broad got one high rating when he won a game for Hobart in the final over. I think he has only batted 3 times in T20 league games since 2014.Thanks for pointing that out. I think there needs to be a min amount of matches to be rated.

Bairstow has had one T20 league win in the last 13 matches he has played in. His last good performance was 92 for Yorkshire off 42 balls in July 2015. His win rate in League matches is 38% since 2014

Stokes' recent scores are 18, 56, 1, 11, 5, 40, 19, 33, 4. Average of 19 since 2014 with a win rate of 45%

Hales averages 23 for Notts which as an opener is not great.

Faf averages 22 in 48 matches with a strike rate of 121. He had a terrible CPL.

Root has only played 5 matches for Yorkshire and they lost 4 of them.

.....


The algorithm can only go on the data supplied but I see your point. These are class international players. Should the algorithm combine league and international performances?
By:
Charlie
When: 08 Mar 17 19:48
Initial thoughts were that combining data from different leagues is quite difficult as not all leagues are equal.

Apologies will poster later as watching footy now.
By:
frog2
When: 08 Mar 17 19:55
Dernbach had 3 bats only. I think you are right there needs to be a min amount.

But Billings is only averaging 17 since 2014
http://www.cricsq.com/players/sam-billings-t20l

How many match winning innings has he played? 78 off 50 v Quetta. 13 matches for Kent last summer he got 142 runs off 129 balls. Top score 55 from 30 when the team had a solid base.

Bowlers that dont bat much need to be taken out I think. But some of the batsmen actually dont have that great a record in T20 league matches.
By:
Charlie
When: 08 Mar 17 20:51
Half-time in footy.

You're covering quite a lot of things very quickly here and need to think about them.

Another early thought I had is what would you do with the ratings? Horse ratings I can see as used to do my own speed ratings for AW meetings only. That was initially why I joined Betfair.

If you had two teams where one team had better ratings for every player than the other then I'm not at all certain that I would back that team. Odds, whose batting first, pitch, weather, need to win, recent form, individual match ups, etc, so many other things to consider than just player ratings.

Not trying to be negative just throwing some thoughts onto the pitch.
By:
frog2
When: 08 Mar 17 21:14
Not negative at all. Very useful thoughts.

What I have been doing is putting the ratings for all the players, pitch info (spin/pace ratings, average winning 1st score etc), who bats first, and so on into a neural network with the target value the result (win lose (or draw)). So I have rating histories use.

Have to wait for the toss and then run it to give a probability for each team. Then look for value in the odds.
By:
pxb
When: 08 Mar 17 21:28
As mentioned you have a sample size issue. In addition, the majority of dismissals are caught which is beyond the control of the bowler (as are stumpings and runouts). Dollies are dropped and crackers are taken. And edges behind sometimes hit the stumps and sometimes go for boundaries. In combination, these factors will give a biased picture of both bowling and batting proficiency. What I would like to see is a weighting for how good dismissal chances are irrespective of whether they are taken or not. Although I suspect that is too difficult.

You say you use a Neural Network, but don't say how. What does a NN give you over a simple summing and averaging of individual players performance?

Otherwise, I'd say you are on the right track, and it would be good enough to give you a betting edge. How much of an edge I can't say.
By:
frog2
When: 08 Mar 17 21:40
What the NN does is slightly a black box. But the batting order is set by the average historical position to go to 1 to 11. So in T20 the NN will probably give less weight to batsmen further down the order (I would hope it does anyway!). This to some extent gets rid of the problem mentioned above of Broad and Dernbach getting too high ratings. For bowlers it uses average overs bowled for T20 so it should, for instance, not weigh much for Joe Root's bowling.

Just averaging the totals would not do this.
By:
frog2
When: 08 Mar 17 21:45
The NN also gives the chance to set the target value when building the model so you get a win probability from it. With averaging you just have, say, Team 1 rating 42, Team 2 rating 25. But what should the odds be?
By:
pxb
When: 08 Mar 17 21:46
What the NN does is slightly a black box.

As we say in the business, 'Magic happens here'.Grin
By:
Fatslogger
When: 08 Mar 17 22:27
How much value are you awarding for "winning" a game? Only I think I'd give roughly equal weighting to a guy getting 30 off 12 setting a total that happened to get overhauled off the last ball versus the same game where the final ball was a dot rather than a 4 so the defending team won. The Braithwaite innings getting that many points smacks a bit of results merchantry. What matters in terms of prediction for future games is the skill that was displayed (as far as you can fathom that from what happened in a match, which may not be all that far), I think, rather than the eventual match outcome. You say something similar about Kohli not winning the IPL but as one of 22 players on the pitch each time, I don't think that's the way you'd work out how good he is.
By:
pxb
When: 09 Mar 17 00:25
Just averaging the totals would not do this.

So weight them.

Personally, I'd use a Monte Carlo simulation.
By:
Charlie
When: 09 Mar 17 07:26
Just another quick observation. On your T20I batting rankings you have Afghans ranked 6 and 7, a Hong Konger at 9 and a Jock at 15. If you ask anyone to name their best twenty T20I batsmen I doubt if they would include those players. I imagine that the NN doesn't take in to account the strength of the opposition so high rankings can be achieved by performing well against relatively weak opponents.
By:
frog2
When: 09 Mar 17 08:00
Good point Charlie. The algorithm needs a second level to take into account who you are playing against. Who is bowling or batting. The major problem is in T20 the associate teams generally just play against themselves and the top teams play against themselves.

They are hard to compare and then when they do play you get results like Afghanistan beating the West Indies.

For the t20 team ratings I split them into associate and full members otherwise Afghanistan would be rated 2nd. Maybe I should have two lists for the player ratings as well?
By:
frog2
When: 09 Mar 17 08:08
Fatslogger,

From what I can make out the machine is doing it does not rate first innings performances on the final result. Scoring 30 off 12 balls in exactly the same situation in the first innings would get the same rating regardless if the team beating second won of lost.

Batting second it would be dependent on keeping up with the required rate and getting the team across the line.
By:
frog2
When: 09 Mar 17 08:12
pbx,

The NN does basically weight the inputs. But it works out those weights from historical data rather than having to guess the weights by hand.

Montesim is something I have tried a few times but I seem to always get slightly strange odds forecasts.
By:
Charlie
When: 09 Mar 17 09:00
Is there a difference in the inputs to the net depending on whether it's say a T20I or a T20 league game?
By:
frog2
When: 09 Mar 17 09:09
Yes. The same algorithm is used for all T20 matches. The reinforcement learning component that works out the inputs has formed 132 inputs based on the match situation.

Currently it does not take account of opposition. Looking at the league team ratings (which is just average ratings of the last 3 teams fielded) IPL teams are in the top 5 of domestic teams which implies the IPL is the best league in the world.
By:
frog2
When: 09 Mar 17 09:11
Very useful feedback. A fair bit of revision is needed to iron out the flaws.
By:
Charlie
When: 09 Mar 17 09:11
Is there a different algorithm (different net?) for calculating Team Rankings and Player Rankings?
By:
Charlie
When: 09 Mar 17 09:24
Presumably the players are some of the inputs? For a T20I if you only have one set of player rankings (as you currently do) then surely associate team players will be overrated when playing against a top team?

If you aggregate each team's players (maybe just a simple sum) does the aggregate correlate with the Team's Ranking?
By:
frog2
When: 09 Mar 17 09:27
There is several layers to it.

So firstly you have player batting and bowling ratings for each match. These are a rough guide to impact each player performance has on the match outcome.

Secondly you have the master bowling and batting ratings for each player for each format. This is calculated from previous match performances for each player.

Then you have the teams. The team ratings do not use NNs at the moment. The teams are the average of the sum of the master ratings for players who played for that team in the last 3 matches they played. Generally this seems to look ok but you get situations where Australia play a weak team V Sri Lanka for a 3 match T20i series so they have a low current rating.

On the day of each match I use the players in the actual team playing rather than the team ratings on that website to calculate probabilities.
By:
frog2
When: 09 Mar 17 09:28
To your last post. Yes. That is how they are calculated at the moment just using the individual players. Yes on the rare occassion like a world t20 when associates play full teams it would not make sense.
By:
frog2
When: 09 Mar 17 09:31
These are very clear issues. Do the test numbers (where this issues shoudl not exist) look weird to you as well?
By:
Charlie
When: 09 Mar 17 09:32
Have only looked at T20 so far!
By:
Charlie
When: 09 Mar 17 09:33
How do you deal with player's making their debut?
By:
Charlie
When: 09 Mar 17 09:36
Does DL screw things up?
By:
frog2
When: 09 Mar 17 09:40
Retrospective DLS messes up slightly but it seems to manage out as it uses target and balls left etc as inputs. So this game for instance:

http://www.cricsq.com/match-ratings/psl-2017-02-17-islamabad-united-v-karachi-kings
The ratings look okish I think
By:
frog2
When: 09 Mar 17 09:41
Debut some as for horseracing I used have one input as 0 for debut and 1 for not debut.
By:
frog2
When: 09 Mar 17 09:41
I have to go out now. But thanks for your advise.
Page 1 of 3  •  Previous 1 | 2 | 3 | Next
sort by:
Show
per page

Post your reply

Text Format: Table: Smilies:
Forum does not support HTML
Insert Photo
Cancel
‹ back to topics
www.betfair.com