Thursday, January 1, 2015

Vector Autoregression with Random Forests: New Technqiue

My previous predictions were fairly naive in that it only took previous values from each car make separately. However, other variables should be of importance (economic mostly) and could increase accuracy of predictions.

I've made new predictions based on a vector auto random forests. Like vector auto regressions, it models each time series with lagged values of itself and lag values of all other variables. The nice thing about using random forests, is that you don't necessarily need to worry about model specifications as the algorithm generally gives good predictions.

Below are predictions for December 2014 car sales.

Code

Predicted Values for 2014-12-01
Acura638.471
Audi714.237
BMW1,347.554
Buick782.536
Cadillac580.159
Chevrolet6,122.498
Chrysler1,088.584
Dodge1,796.103
Ford7,841.763
GMC1,898.404
Honda4,646.172
Hyundai2,195.227
Infiniti498.734
Jaguar49.113
Jeep2,445.042
Kia1,695.402
Land.Rover153.189
Lexus1,261.509
Lincoln359.453
Mazda900.577
Mercedes.Benz1,530.815
Mini214.853
Mitsubishi263.551
Nissan3,752.873
Porsche178.117
Subaru2,012.891
Toyota6,486.288
Volkswagen1,307.579
Volvo144.812

1 comment:

  1. Hi Sam,

    Can you share the in-sample and out-of-sample error rates?

    Regards,
    Naveen

    ReplyDelete