[R] machine learning and horse racing
Gerard Smits
g_smits at verizon.net
Tue Sep 18 23:03:43 CEST 2007
Hi Stephen,
Not responding to the R memory question, but to the racing.
I worked on this many years ago and found no way of overcoming the
19% or so paramutual take. That being said, I suggest you take class
into account (based on purse, type of race (maiden claiming, claiming
$, NWxx allowance, etc). Make sure that you are accounting for the
size of the field. it is much easier to win a race of 6 than 12
horses. A similar bias applies to the advantage of inner post
position, if you do not account for number of entries.
Re validation, I would not build a mode on X years of data and then
validate. Patterns change and a model needs to be adaptive. I would
use a hold out day, per week (randomly chosen) and then use that.
good luck in a difficult task.
Gerard
More information about the R-help
mailing list