machine-learninggoogle-bigqueryarima

How to use external regressors for training Arima_PLUS model in BigQuery?


I created a model on big query, Is it possible to include additional columns as external regressors ?

For example I'd like to include Date, Users, page per session, bounce rate etc. for forecasting users.

create or replace model bqml_tutorial.create_model
options
(model_type= 'ARIMA_PLUS',
time_series_timestamp_col='Date',
time_series_data_col='Users',
auto_arima=True,
data_frequency = 'AUTO_FREQUENCY',
decompose_time_series= True)
as
select Date, cv as Users from `bqml_tutorial.cvrate` ORDER BY Date 

Solution

  • Looking at the documentation this is currently not available. The ARIMA_PLUS model that you can train in BigQuery already does a lot of things (seasonality study, outlier removal, missing data interpolation, etc). But in terms of external regressors, you cannot add specific columns for training your model.

    The only additional data you can put into your model is the holiday information (with HOLIDAY_REGION option). Which is already awesome !

    Note that you can train models for multiple time series at the same time by specifying the columns you want to forecast with the TIME_SERIES_ID_COL parameter. But that will get you forecasts for all these columns from independant models (therefore the effect from one column on the other will not be modelled).