Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Table of Contents
maxLevel5
minLevel3

General ML configuration related questions

Where can I find the ML configuration file?

The <ML_HOME>/repository/conf/machine-learner.xml file includes all the ML-specific configurations

How can I change the ML datasource name?

The default name is jdbc/WSO2ML_DB. You can change it by changing the value of the <DataSourceName> element in the <ML_HOME>/repository/conf/machine-learner.xml file. For more information, see ML-specific configurations

How can I change the sample points size that ML use to generate summary statistics?

Default size is 10000. You can change it by changing the value of the <SampleSize> property within the <SummaryStatisticsSettings> element in the <ML_HOME>/repository/conf/machine-learner.xml file. For more information, see ML-specific configurations

How can I change the directory which holds datasets?

By default, the <ML_HOME>/datasets/ directory holds datasets, and the default storage type is ‘file’. You can change it by changing the value of the <StorageDirectory> property within the <DatasetStorage> element in the <ML_HOME>/repository/conf/machine-learner.xml file. For more information, see ML-specific configurations

How can I change the directory with models?

By default, the <ML_HOME>/models/ directory holds models, and default storage type is ‘file’You can change it by changing the value of the <StorageDirectory> property within the <ModelStorage> element in the <ML_HOME>/repository/conf/machine-learner.xml fileFor more information, see ML-specific configurations

How can I increase ML thread pool size?

WSO2 ML uses threads in a thread pool to run different tasks such as dataset summary generation and model generation etc. You can control the size of this thread pool by changing the value of the following property in the <ML_HOME>/repository/conf/machine-learner.xml file: <Property name="ml.thread.pool.size" value="100"/> For more information, see ML-specific configurations

Where should I configure the email addresses of recipients whom will be notified upon a model generation?

You can configure WSO2 ML to send emails on the completion of a model generation. You can have a comma-separated set of email addresses as the value of the <EmailNotificationEndpoint> property in the <ML_HOME>/repository/conf/machine-learner.xml file. For more information on  configuring email support, see Enabling Email Notifications.

How can I change datasets storage to HDFS?

If you want to change the dataset storage type to HDFS, change the value of the <StorageType> property within the <DatasetStorage> element to ‘hdfs’. For more information, see ML-specific configurations

How can I change model storage to HDFS?

If you want to change the model storage type to HDFS, change the value of the <StorageType> property within the <ModelStorage> element to ‘hdfs’. For more information, see ML-specific configurations

How can I give a HDFS URL?

If you want to store your datasets and models in an HDFS, you need to enter HDFS URL as the value of the <HdfsURL> property in the <ML_HOME>/repository/conf/machine-learner.xml fileFor more information, see ML-specific configurations

 

 

Data related questions

What should be the format of my dataset?

WSO2 ML currently supports the following formats.

  • CSV with comma separated values
  • TSV with tab separated values
Where can I have my data?

Data can be retrieved from the following sources.

  • File system
  • Hadoop distributed file system
  • WSO2 Data Analytics Server table
Do I need to have a header row in my dataset?

It is not mandatory to have a header row. If the dataset does not have a header row, it will be indicated when you upload the dataset. Then WSO2 ML will generate a header similar to V1, V2 .. Vn. See Creating a Dataset for more information.

Is there a file size limit to my dataset?

Yes, currently it is 100MB. You can change it via the following property in <ML_HOME>/bin/wso2server.bat file (for Windows) or <ML_HOME>/bin/wso2server.sh file (for Linux).

100MB = 100 x 1024 x 1024 = 104857600 Bytes

-Dorg.apache.cxf.io.CachedOutputStream.Threshold=104857600 \

Algorithm related questions

Does ML support numerical prediction?

Yes it does. The following algorithms are available in this version.

  • Linear Regression
  • Ridge Regression
  • Lasso Regression

See How to Select an Algorithm in WSO2 ML for more information 

Does ML support classification algorithms?

Yes it does. The classification algorithms currently supported are as follows.

  • Logistic Regression with Stochastic Gradient Descent
  • Logistic Regression with Limited memory Broyden-Fletcher-Goldfarb-Shanno
  • Decision tree
  • Random forest
  • Naive bayes

See How to Select an Algorithm in WSO2 ML for more information 

Does ML support clustering algorithms?

At present, very primitive support is available for clustering. K-means is the only clustering algorithm supported for this ML version. We have plans to improve on this area. Please contact us if you are someone who is interested to learn more about these plans.

Analysis related questions

What type of data preprocessing does WSO2 ML support?

It supports feature selection and missing value filling/filtering.

Can I get an insight about the dataset before creating an analysis out of it?

Yes, WSO2 ML supports dataset exploration functionality with multiple visualization techniques. See Exploring Data for more information.

Model related questions

How can I find the details of a built model?
How can I calculate the accuracy for a given model?
Can I download a built model?
Does you support exporting models in PMML format?
Can I use a built model in a Java program?
How can I make predictions for a test dataset using ML UI wizard?

Once you build a model, you can make predictions to a test dataset by uploading a csv/tsv file from the Predict page in the ML UI.  See Making Predictions for more information.

Can I use a built model in other WSO2 products?

Yes, you can use the generated models in WSO2 ESB with the Predict mediator and in WSO2 CEP with CEP ML Extension.

 

REST API related questions

What is the root context of the ML REST API?

 

What authentication mechanisms you support?

 

Table of Contents
maxLevel5
minLevel3

...

General ML configuration related questions

Anchor
Where can I find the ML configuration file?
Where can I find the ML configuration file?
Where can I find the ML configuration file?

The <ML_HOME>/repository/conf/machine-learner.xml file includes all the ML-specific configurations

Anchor
How can I change the ML datasource name?
How can I change the ML datasource name?
How can I change the ML datasource name?

The default name is jdbc/WSO2ML_DB. You can change it by changing the value of the <DataSourceName> element in the <ML_HOME>/repository/conf/machine-learner.xml file. For more information, see ML-specific configurations

Anchor
How can I change the sample points size that ML use to generate summary statistics?
How can I change the sample points size that ML use to generate summary statistics?
How can I change the sample points size that ML use to generate summary statistics?

Default size is 10000. You can change it by changing the value of the <SampleSize> property within the <SummaryStatisticsSettings> element in the <ML_HOME>/repository/conf/machine-learner.xml file. For more information, see ML-specific configurations

Anchor
How can I change the directory which holds datasets?
How can I change the directory which holds datasets?
How can I change the directory which holds datasets?

By default, the <ML_HOME>/datasets/ directory holds datasets, and the default storage type is ‘file’. You can change it by changing the value of the <StorageDirectory> property within the <DatasetStorage> element in the <ML_HOME>/repository/conf/machine-learner.xml file. For more information, see ML-specific configurations

Anchor
How can I change the directory with models?
How can I change the directory with models?
How can I change the directory with models?

By default, the <ML_HOME>/models/ directory holds models, and default storage type is ‘file’You can change it by changing the value of the <StorageDirectory> property within the <ModelStorage> element in the <ML_HOME>/repository/conf/machine-learner.xml fileFor more information, see ML-specific configurations

Anchor
How can I increase ML thread pool size?
How can I increase ML thread pool size?
How can I increase ML thread pool size?

WSO2 ML uses threads in a thread pool to run different tasks such as dataset summary generation and model generation etc. You can control the size of this thread pool by changing the value of the following property in the <ML_HOME>/repository/conf/machine-learner.xml file: <Property name="ml.thread.pool.size" value="100"/> For more information, see ML-specific configurations

Anchor
Where should I configure the email addresses of recipients whom will be notified upon a model generation?
Where should I configure the email addresses of recipients whom will be notified upon a model generation?
Where should I configure the email addresses of recipients whom will be notified upon a model generation?

You can configure WSO2 ML to send emails on the completion of a model generation. You can have a comma-separated set of email addresses as the value of the <EmailNotificationEndpoint> property in the <ML_HOME>/repository/conf/machine-learner.xml file. For more information on  configuring email support, see Enabling Email Notifications.

Anchor
How can I change datasets storage to HDFS?
How can I change datasets storage to HDFS?
How can I change datasets 
storage to HDFS?

If you want to change the dataset storage type to HDFS, change the value of the <StorageType> property within the <DatasetStorage> element to ‘hdfs’. For more information, see ML-specific configurations

Anchor
How can I change model storage to HDFS?
How can I change model storage to HDFS?
How can I change 
model storage to HDFS?

If you want to change the model storage type to HDFS, change the value of the <StorageType> property within the <ModelStorage> element to ‘hdfs’. For more information, see ML-specific configurations

Anchor
How can I give a HDFS URL?
How can I give a HDFS URL?
How can I give a HDFS URL?

If you want to store your datasets and models in an HDFS, you need to enter HDFS URL as the value of the <HdfsURL> property in the <ML_HOME>/repository/conf/machine-learner.xml fileFor more information, see ML-specific configurations 

...

Data related questions

Anchor
What should be the format of my dataset?
What should be the format of my dataset?
What should be the format of my dataset?

WSO2 ML currently supports the following formats.

  • CSV with comma separated values
  • TSV with tab separated values
Anchor
Where can I have my data?
Where can I have my data?
Where can I have my data?

Data can be retrieved from the following sources.

  • File system
  • Hadoop distributed file system
  • WSO2 Data Analytics Server table
Anchor
Do I need to have a header row in my dataset?
Do I need to have a header row in my dataset?
Do I need to have a header row in my dataset?

It is not mandatory to have a header row. If the dataset does not have a header row, it will be indicated when you upload the dataset. Then WSO2 ML will generate a header similar to V1, V2 .. Vn. For more information see Exploring Data.

Anchor
Is there a file size limit to my dataset?
Is there a file size limit to my dataset?
Is there a file size limit to my dataset?

Yes, currently it is 100MB. You can change it via the following property in <ML_HOME>/bin/wso2server.bat file (for Windows) or <ML_HOME>/bin/wso2server.sh file (for Linux).

100MB = 100 x 1024 x 1024 = 104857600 Bytes

-Dorg.apache.cxf.io.CachedOutputStream.Threshold=104857600 \

...

Algorithm related questions

Anchor
Does ML support numerical prediction?
Does ML support numerical prediction?
Does ML support numerical prediction?

Yes it does. The following algorithms are available in this version.

  • Linear Regression
  • Ridge Regression
  • Lasso Regression

See How to Select an Algorithm in WSO2 ML for more information 

Anchor
Does ML support classification algorithms?
Does ML support classification algorithms?
Does ML support classification algorithms?

Yes it does. The classification algorithms currently supported are as follows.

  • Logistic Regression with Stochastic Gradient Descent
  • Logistic Regression with Limited memory Broyden-Fletcher-Goldfarb-Shanno
  • Decision tree
  • Random forest
  • Naive bayes

See How to Select an Algorithm in WSO2 ML for more information 

Anchor
Does ML support clustering algorithms?
Does ML support clustering algorithms?
Does ML support clustering algorithms?

At present, very primitive support is available for clustering. K-means is the only clustering algorithm supported for this ML version. We have plans to improve on this area. Please contact us if you are someone who is interested to learn more about these plans.

...

Analysis related questions

Anchor
What type of data preprocessing does WSO2 ML support?
What type of data preprocessing does WSO2 ML support?
What type of data preprocessing does WSO2 ML support?

It supports feature selection and missing value filling/filtering.

Anchor
Can I get an insight about the dataset before creating an analysis out of it?
Can I get an insight about the dataset before creating an analysis out of it?
Can I get an insight about the dataset before creating an analysis out of it?

Yes, WSO2 ML supports dataset exploration functionality with multiple visualization techniques. See Exploring Data for more information.

...

Model related questions

Anchor
How can I find the details of a built model?
How can I find the details of a built model?
How can I find the details of a built model?

Once you built a model, you can view its model summary, in which you find a summary of the model evaluation. For more information, see Evaluating Models.

Anchor
How can I calculate the accuracy for a given model?
How can I calculate the accuracy for a given model?
How can I calculate the accuracy for a given model?

For classification type algorithms, generate an accuracy measurement based on the predictions made by the model for the test dataset. Test dataset is extracted from the uploaded dataset and the proportion is configurable for each analyses.

Anchor
Can I download a built model?
Can I download a built model?
Can I download a built model?

You can download a built model or publish it to WSO2 registry. For more information, see Generating Models.

Anchor
Do you support exporting models in PMML format?
Do you support exporting models in PMML format?
Do you support exporting models in PMML format?

WSo2 ML 1.0.0 does not support PMML format. This is in the roadmap to be provided in future versions.

Anchor
Can I use a built model in a Java program?
Can I use a built model in a Java program?
Can I use a built model in a Java program?

You can use a built model in a Java program. For a sample on how to use a built model in a Java program, see Using ML Models in a Java Client.

Anchor
How can I make predictions for a test dataset using ML UI wizard?
How can I make predictions for a test dataset using ML UI wizard?
How can I make predictions for a test dataset using ML UI wizard?

Once you build a model, you can make predictions to a test dataset by uploading a csv/tsv file from the Predict page in the ML UI. For more information on making predictions, see Making Predictions Using the ML UI

Anchor
Can I use a built model in other WSO2 products?
Can I use a built model in other WSO2 products?
Can I use a built model in other WSO2 products?

Yes, you can use the generated models in WSO2 ESB with the Predict mediator and in WSO2 CEP with CEP ML Extension.

...

REST API related questions

Anchor
What is the root context of the ML REST API?
What is the root context of the ML REST API?
What is the root context of the ML REST API?

WSO2 ML REST API root context is ‘/ml’. For more information, see REST API Guide.

Anchor
What authentication mechanisms you support?
What authentication mechanisms you support?
What authentication mechanisms you support?

WSO2 ML supports basic authentication and cookie based authentication. For more information, see REST API Guide.

Anchor
What are the main APIs you expose?
What are the main APIs you expose?
What are the main APIs you expose?

WSO2 ML exposes five main APIs. They are configurations, datasets, projects, analyses, and models. For more information, see REST API Guide.