Creating query models and making predictions in the DataEditor

Overview

You can use the DataEditor to create machine learning models and make predictions with data stored in the DataEditor.

The DataEditor currently supports the following types of models:

Model	Explanation
Regression	This model predicts for numerical values. Example use cases include using weather/day of the week data to train a model to predict for customer or sales numbers.
Classification	This model classifies data into categories. Example use cases include predicting whether credit care usage is normal or fraudulent and predicting which users are likely to register for membership campaigns.
Clustering	This model groups data into a set number of clusters based on similarities. It can be used for things like market analysis and computer vision.

Model

Explanation

Regression

This model predicts for numerical values.

Example use cases include using weather/day of the week data to train a model to predict for customer or sales numbers.

Classification

This model classifies data into categories.

Example use cases include predicting whether credit care usage is normal or fraudulent and predicting which users are likely to register for membership campaigns.

Clustering

This model groups data into a set number of clusters based on similarities.

It can be used for things like market analysis and computer vision.

There are two types of classification and regression models that you can create: ones made with the Model Generator and one made without the Model Generator.

Models created with the Model Generator can be used to solve more complicated problems, but as a result, training the model takes a longer time. Models created without the Model Generator can be trained more quickly, but are more suited for simple problems.

We’ll refer to models created with the Model Generator as “Model Generator versions” to distinguish them.

Some advantages to using the DataEditor machine learning features include:

All you need to do is create your data in the DataEditor. No difficult machine learning expertise required.
You can prepare your training data, create a model, and evaluate your model all within the DataEditor.

Regression model (Model Generator version) example

This section will use the electricity demand data from Demand forecasting with the regression model to demonstrate how to create a regression model (Model Generator version) from the DataEditor:

Preparing the data

We’ll start by preparing the data we’ll use to train and test the model, which is the same data used in the Demand forecasting with the regression model tutorial.

If you haven’t followed followed that tutorial and need to prepare the data, complete the steps in the Preparing the data as a CSV file and Splitting the data in the DataEditor sections.

Creating the model

Once your training data is ready, create the model by doing the following:

Opening the editor for the training data

Click Electricity Demand_train.

Selecting to create a regression (model generator version) model

Click Create Model.
Click Regression.
Enter a name for the model.
Designate the Google Cloud Storage (GCS) folder that will store your models (only if it’s the first time you are creating a Model Generator).
Click Create Folder (only if it’s the first time you are creating a Model Generator).
Select Automatic Setup or Manual Setup:
- Automatic Setup: Automatically sets the maximum time until timeout to 30 minutes and the maximum number of trials to 20.
- Manual Setup: Allows you to configure the the maximum time until timeout, maximum number of trials, and training data settings before creating the model.
Click Create Model.

You can check on the progress of the model’s training by doing the following:

Click Close.

Click <.

Click Models.
View the progress bar to see the training’s progress.
Click the refresh icon to refresh the progress bar.
Click the name to view the model’s details page.

The following is an example of a model’s details:

In the Schema tab, you can view the model’s schema information (column names and types).
Click < to return to the model list.

Viewing the menu for a model from the model list

You can click a model’s menu icon () (❶) to open a menu with the following options:

Change name
Stop training
Delete

Making predictions

Once the model is created, you can use it with the testing data to make predictions by doing the following:

You can also make predictions from a Flow Designer, which also allows you to set automated schedules and run batch predictions. For more information about making predictions from a Flow Designer, refer to Using Flow templates or the classification/regression tutorial pages (classification predictions/regression predictions).

Open the testing data by doing the following:

Click Data.
Click Electricity Demand_test.

This will open the testing data in the editor.

Selecting the model and making predictions

Click Predict.
Select Predict Electricity Demand Model (Model Generator).
Click Predict.

The prediction results will appear after a moment.

The following chart explains the meaning of each column in the results:

Column	Explanation
output	The predicted value.
key high_temperature low_temperature sunlight_hours average_humidity daytime_minutes	The testing data.

Regression example

This section will use the electricity demand data from Demand forecasting with the regression model to demonstrate how to create a regression model from the DataEditor: