Page Comparison

This section describes some recommended performance tuning configurations to optimize BAM. It assumes that you have set up the BAM server on Unix/Linux, which is recommended for a production deployment. If you have high volumes of data with high concurrency, it is recommended to use a distributed BAM setup. For instructions, see product deployment and clustering guide.

Table of Contents

Info

Performance tuning requires you to modify important system files, which affect all programs running on the server. We recommend you to familiarize yourself with these files using Unix/Linux documentation before editing them.
The parameter values we discuss below are just examples. They might not be the optimal values for the specific hardware configurations in your environment. We recommend that you carry out load tests on your environment to tune the product accordingly.

OS-Level Settings

To optimize network and OS performance, configure the following settings in /etc/sysctl.conf file of Linux. These settings specify a larger port range, a more effective TCP connection timeout value, and a number of other important parameters at the OS-level.

Code Block

net.ipv4.tcp_fin_timeout = 30
fs.file-max = 2097152
net.ipv4.tcp_tw_recycle = 1
net.ipv4.tcp_tw_reuse = 1
net.core.rmem_default = 524288
net.core.wmem_default = 524288
net.core.rmem_max = 67108864
net.core.wmem_max = 67108864
net.ipv4.tcp_rmem = 4096 87380 16777216
net.ipv4.tcp_wmem = 4096 65536 16777216
net.ipv4.ip_local_port_range = 1024 65535

Info
When we have the localhost port range configuration lower bound to 1024, there is a possibility that some processes may pick the ports which are already used by WSO2 servers. Therefore, it's good to increase the lower bound as sufficient for production, e.g., 10,000.

To alter the number of allowed open files for system users, configure the following settings in /etc/security/limits.conf file of Linux.
Code Block
* soft nofile 4096 * hard nofile 65535
Optimal values for these parameters depend on the environment.
To alter the maximum number of processes your user is allowed to run at a given time, configure the following settings in /etc/security/limits.conf file of Linux (be sure to include the leading * character). Each carbon server instance you run would require upto 1024 threads (with default thread pool configuration). Therefore, you need to increase the nproc value by 1024 per each carbon server (both hard and soft).
Code Block
* soft nproc 20000 * hard nproc 20000

JDBC Pool Configurations

Within the WSO2 platform, we use Tomcat JDBC pooling as the default pooling framework due to its production ready stability and high performance. The table below indicates some recommendations on how to configure the JDBC pool using the <PRODUCT_HOME>/repository/conf/datasources/master-datasources.xml file. For more details about recommended JDBC configurations, see The Tomcat JDBC Connection Pool.

...

The maximum number of active connections that can be allocated from the connection pool at the same time. The default value is 100.

...

The indication of whether connection objects will be validated before they are borrowed from the pool. If the object validation fails, it will be dropped from the pool, and we will attempt to borrow another connection.

...

Setting this property to 'true' is recommended as it will avoid connection requests from failing. The validationQuery property should be used if testOnBorrow is set to true. To increase the efficiency of connection validation and to improve performance, validationInterval property should also be used.

...

To avoid excess validation, run validation at most at this frequency (time in milliseconds). If a connection is due for validation, but has been validated previously within this interval, it will not be validated again. The default value is 30000 (30 seconds).

...

Note

...

...

WSO2 BAM specific settings

Info
The values discussed below are general recommendations. They might not be optimal for the specific hardware configurations in your environment. We recommend you to carry out load tests on your environment to tune BAM accordingly.

Improvement Area Performance Recommendations

Data receiver nodes

Xms1024m -Xmx1024m -XX:MaxPermSize=512m
Change the /etc/security/limits.conf with the following values:
Code Block
* soft nofile 4096 * hard nofile 65535

Analyzer nodes Xms1024m -Xmx1024m -XX:MaxPermSize=512m

Dashboard nodes Xms1024m -Xmx1024m -XX:MaxPermSize=512m

Hadoop nodes

Recommended OS: Linux
At least 10 GB storage capacity in each node
At least 100 Mbps network bandwidth
Set hadoop.root.logger=ERROR

Cassandra nodes

Have separate disks for your commit log and data dirs (sstables)
Set the Heap memory as follows:
System memory Heap size
Less than 2 GB 1/2 of system memory
2-4 GB 1 GB
More than 4 GB 1/4 system memory, but not more than 8GB
Set following configuration in cassandra.yaml according to your hardware resources:
- concurrent_reads: 4 * no of cores
- concurrent_writes: 8 * no of CPU cores

For more information see Apache Cassandra 1.0 Documentation.

Memory size of Hive

In order to increase the memory size of Hive try the following configurations:

Increase the setting for the Hadoop heapsize. For example increase it to 4096 as shown below:
export HADOOP_HEAPSIZE=4096
Add the below entry in the <PRODUCT<BAM_HOME>/repository/conf/advanced/hive-site.xml file:

Code Block

language	xml

<property>  
   <name>mapred.child.java.opts</name>  
   <value> -server -Xmx512M</value>  
 </property>

Versions Compared

Old Version 9

New Version 10

Key

OS-Level Settings

JDBC Pool Configurations

WSO2 BAM specific settings