Data Sets

Download free sample machine learning data sets for use with Squark. These data sets support Squark’s no-code supervised machine learning capabilities for:

  1. Regression: “How much?” (forecasting)
  2. Classification: “Which of two things?” (binary), or “Which of three or more things?” (multiclass or multinomial)
  3. Time Series Forecasting: “How much for how long?”

Included in each data set download is a training file and a production file. The training file contains historical data. The production file contains new data on which you want to make predictions.

You’ll notice that each training file has one extra column: the target column, also called the target label or dependent variable. This is the column you are training the algorithms to predict. The production file does not contain it, so Squark fills in a predicted value for it on each row of the production file.
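
To make the relationship between the two files concrete, here is a minimal Python sketch that compares the column headers of a training file and a production file and reports the column that appears only in the training file. The sample data and column names (customer_id, tenure_months, monthly_spend, churned) are hypothetical and are not taken from any Squark download. The check is an illustration only, not part of Squark itself.

```python
import csv
import io

# Hypothetical sample data; real downloads will have different columns.
TRAINING_CSV = """customer_id,tenure_months,monthly_spend,churned
1001,12,49.90,no
1002,3,19.90,yes
"""

PRODUCTION_CSV = """customer_id,tenure_months,monthly_spend
2001,7,29.90
2002,24,59.90
"""


def read_header(csv_text):
    """Return the list of column names from the first row of CSV text."""
    reader = csv.reader(io.StringIO(csv_text))
    return next(reader)


def find_target_candidates(training_header, production_header):
    """Return columns present in the training file but absent from the production file."""
    production_columns = set(production_header)
    return [name for name in training_header if name not in production_columns]


training_header = read_header(TRAINING_CSV)
production_header = read_header(PRODUCTION_CSV)
target_candidates = find_target_candidates(training_header, production_header)
print(target_candidates)  # → ['churned']
```

If the comparison returns exactly one column, that column is the likely target. If it returns none or several, the two files probably do not belong to the same data set or have mismatched headers.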

For more information on data sets, watch the video “What Are Training and Production Data Files.”






