  1. Log in to the ML UI using admin as both the username and password. The Home page will be displayed.
  2. Click ADD DATASET to open the Create Dataset page. 
  3. In the Data Source field, click Choose File and browse for the <ML_HOME>/samples/tuned/naive-bayes/breastCancerWisconsin.csv file. Enter values for the rest of the parameters as shown below.
    [Image]

    Parameter Name            Value
    Dataset Name              Breast_Cancer_Dataset
    Version                   1.0.0
    Description               Breast cancer data in Wisconsin.
    Source Type               File
    Data Format               CSV
    Column Header Available   Yes
  4. Click CREATE DATASET to save your changes. The Datasets page will open and the dataset you entered will be displayed as follows.
    [Image]
    Note that the status of the dataset is Processing.
  5. Click REFRESH. The status of the dataset will change to Processed as shown below.
    [Image]
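
Before uploading a CSV file with Column Header Available set to Yes, it can help to confirm locally that the first line really is a header and that every row has the same number of columns. The following is a minimal sketch of such a check, not part of the ML tool itself. The sample data and the column names feature_a and feature_b are hypothetical; only Class is taken from this guide, where it is used later as the response variable.

```python
import csv
import io

# Hypothetical stand-in for the first rows of a CSV dataset. The column
# names feature_a and feature_b are invented for illustration.
SAMPLE_CSV = """feature_a,feature_b,Class
5,1,0
8,10,1
"""


def read_header_and_rows(text):
    """Return (header, rows) for CSV text whose first line is a header."""
    reader = csv.reader(io.StringIO(text))
    header = next(reader)
    rows = [row for row in reader if row]  # skip completely empty lines
    return header, rows


def find_ragged_rows(header, rows):
    """Return the indexes of rows whose column count differs from the header."""
    return [i for i, row in enumerate(rows) if len(row) != len(header)]


header, rows = read_header_and_rows(SAMPLE_CSV)
ragged = find_ragged_rows(header, rows)
print("columns:", header)
print("data rows:", len(rows), "ragged rows:", ragged)
```

To run the same check on a real file, read its contents with open(path, newline="") and pass the text to read_header_and_rows.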

Step 2: Create a project

Create a new project as follows.

  1. Log in to the ML Management Console if you are not already logged in.
  2. Click ADD PROJECT.
    [Image]

    If you are already logged in, you can click CREATE PROJECT in the DATASETS page as shown below.
    [Image]
  3. In the Create Project page, enter information as shown below.
    [Image]

    Parameter Name   Value
    Project Name     Breast_Cancer_data_analytics_project
    Description      This project performs predictive analysis on the breast cancer data in Wisconsin.
    Dataset          Breast_Cancer_Dataset
  4. Click CREATE PROJECT to save the information. The project will be displayed in the Projects page as follows.
    [Image]

Step 3: Create an analysis and train a model

  1. Log in to the ML UI if you are not already logged in.
  2. Click the You have X projects link as shown below.
    [Image]
  3. Click on the Breast_Cancer_data_analytics_project project to expand it.
  4. Enter breast_cancer_analysis_1 as the analysis name and click CREATE ANALYSIS. The following page will appear displaying the summary statistics.
    [Image]
  5. Click Next without making any changes to the summary statistics.
    [Image]
    The Explore view will open. Parallel Sets and Trellis Chart visualisations are enabled, while Scatter Plot and Cluster Diagram visualisations are disabled. Which visualisations are available is determined by the feature types of the dataset. Select and clear the checkboxes for categorical features as follows.
    [Image]

    [Image]
  6. Click Next. The Algorithms view will be displayed. Enter values as shown below.
    [Image]

    Parameter             Value
    Algorithm name        LOGISTIC REGRESSION L_BFGS
    Response variable     Class
    Train data fraction   0.7
  7. Click Next. The Parameters view will appear. Enter L2 as the reg type.
    [Image]
  8. Click Next. The Model view will appear. Select Breast_Cancer_Dataset-1.0.0 as the dataset version.
    [Image]
  9. Click Run. The analysis will be created and displayed for the project as shown below.
    [Image]
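
To make two of the settings above more concrete, the following sketch illustrates what a train data fraction of 0.7 and an L2 reg type typically mean for a logistic regression model. It is an illustration in plain Python under stated assumptions, not the ML tool's implementation: the function names are hypothetical, the penalty form 0.5 * strength * sum(w^2) is one common convention that the tool may not share, and the L-BFGS optimiser itself is not implemented here (only the objective it would minimise is shown).

```python
import math
import random


def split_by_fraction(rows, train_fraction=0.7, seed=0):
    """Shuffle a copy of rows and split it into (train, test) lists."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = round(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


def l2_logistic_loss(weights, bias, features, labels, reg_strength):
    """Mean log loss plus an L2 penalty on the weights (labels are 0 or 1)."""
    total = 0.0
    for x, y in zip(features, labels):
        z = bias + sum(w * xi for w, xi in zip(weights, x))
        # log(1 + exp(z)) - y * z, written in a numerically stable form.
        total += max(z, 0.0) + math.log1p(math.exp(-abs(z))) - y * z
    # Assumed convention: the bias is not penalised, only the weights.
    penalty = 0.5 * reg_strength * sum(w * w for w in weights)
    return total / len(labels) + penalty


# Hypothetical toy data: two features per row, binary label.
features = [[0.0, 1.0], [1.0, 0.0], [1.0, 1.0], [0.0, 0.0]]
labels = [0, 1, 1, 0]

train, test = split_by_fraction(range(10), train_fraction=0.7)
plain = l2_logistic_loss([1.0, 2.0], 0.0, features, labels, reg_strength=0.0)
regularised = l2_logistic_loss([1.0, 2.0], 0.0, features, labels, reg_strength=0.1)
print(len(train), len(test), regularised - plain)
```

With a fraction of 0.7, roughly 70% of the rows are used for training and the rest are held out for evaluation. The L2 term grows with the squared size of the weights, so larger weights are discouraged during training.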

Step 4: Predict