Given a collection of (ideally sparsely correlated) time series, this software provides functions to
make fast and accurate predictions of the future value of a particular time series by using the l1-regularized least squares model and cholesky decomposition
and cholesky update. Interested users should carefully read the documentations and integrate our functions to their Matlab projects.
Given:
s1, ..., sn -- n time series that all start (at t0) and end (at t1) at the same time.
the series number i that will be predicted.
a data source that constantly provide the new values of s1, ..., sn at the next time step.
the size of a sliding window.
the factor mu in the optimization problem |Ax-b|_2^2+mu*|x|_1.
At every future time point t (t>=t1) where we know the values of series s1, ..., sn from t0 to t, return the predicted value of series i at time step t+1.
FPS
operates with a reasonable number (between 10 and 100) of series and possibly a large number (>10000) of time steps, in an efficient and incremental
fashion, so that as new points are appended to the timeseries, the value of series i at next time step can be predicted in real time.
Install Matlab on your computer, and then download our software in the download page and unzip it into a directory of your choice.
Next set up Matlab PATH variable as described in the download page
Note that FPS is not an executable project.
Instead FPS is a package that contains a function to train our l1-regularized least squares model,
and a Matlab object that performs predictions on a particular time series given other time series and itself. There are two Matlab programs
that should be of interest to you. train_model.m is the function that suggests the user
possibly good parameters to use in actual predictions., and IncrementalPredict.m is a Matlab object that
makes future predictions on a particular time series. We assume that the user already have some initial data
for each series so that we can first run train_model.m on these data to obtain some parameters before we
any actual prediction. We also assume that the user will constantly obtain new values for each series from
some data source. Both the initial data and the incoming new data are likely to be part of a large system, so
we expect that both of train_model.m and IncrementalPredict.m will be imbeded in that system. In fact there's
no "main" method in this package. We only provide a Matlab class IncrementalPredict.m for doing prediction and
a Matlab function train_model.m that output suggested parameters based on the training_data. We assume that
interested user will imbed this class and this function into a larger Matlab project. To get a feeling of how to use IncrementalPredict.m and
train_model.m, see the file sample_run.m. The default parameters to run this program should be appropriate for a
start. This sample program will make predictions for a particular series
in a given time interval. The program assumes that the input data is volatile; therefore, it revokes the train_model
function every 100 time steps to get the most up-to-date parameters for actual predictions.
We suggest that interested users carefully read the documentations
in sample_run.m, IncrementalPredict.m and train_model.m to understand how IncrementalPredict.m and train_model.m
may be applied to their Matlab project.
WARNING: This demo program is going to take a long time (a few
hours to about a day) to finish if the difference of start_time and end_time is large (say larger than 3000).
If you have any question
or feedback, please let us know. Email: shasha@cs.nyu.edu or jjx203@cs.nyu.edu.
Maintained by jjx203@cs.nyu.edu Last Updated Dec 17th, 2009
Send
your feedback