This section describes recommended performance tuning configurations for BAM. It assumes that you have set up the BAM server on Unix/Linux, which is recommended for a production deployment. If you have high volumes of data with high concurrency, use a distributed BAM setup. For instructions, see the product deployment and clustering guide.
The values discussed below are general recommendations and might not be optimal for the specific hardware in your environment. We recommend that you carry out load tests in your environment and tune BAM accordingly.
Improvement Area | Performance Recommendations |
---|---|
Data receiver nodes | |
Analyzer nodes | `-Xms1024m -Xmx1024m -XX:MaxPermSize=512m` |
Dashboard nodes | `-Xms1024m -Xmx1024m -XX:MaxPermSize=512m` |
Hadoop nodes | |
Cassandra nodes | Also see http://www.datastax.com/docs/1.0/operations/tuning in the Cassandra documentation. |
Memory size of Hive | To increase the memory available to Hive, increase the Hadoop heap size to 4096 with `export HADOOP_HEAPSIZE=4096`. The `mapred.child.java.opts` property is set as follows: `<property> <name>mapred.child.java.opts</name> <value>-server -Xmx512M</value> </property>` |
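
The memory settings from the table above can be collected in one place as a minimal sketch. The variable name `JVM_MEM_OPTS` is hypothetical and used only for illustration. Where these values are actually applied (for example, a server startup script or the Hadoop environment script) depends on your installation and is not specified here. `HADOOP_HEAPSIZE` is the standard Hadoop environment variable and is expressed in MB.

```shell
#!/bin/sh
# Sketch only: JVM_MEM_OPTS is a hypothetical variable name, not a BAM setting.

# JVM memory flags recommended above for analyzer and dashboard nodes:
# initial heap 1024 MB, maximum heap 1024 MB, permanent generation 512 MB.
JVM_MEM_OPTS="-Xms1024m -Xmx1024m -XX:MaxPermSize=512m"

# Hadoop heap size in MB, raised to give Hive more memory.
export HADOOP_HEAPSIZE=4096

# Print the values so they can be checked before being applied to a node.
echo "JVM options: $JVM_MEM_OPTS"
echo "Hadoop heap size (MB): $HADOOP_HEAPSIZE"
```

Treat these values as a starting point and adjust them based on the results of load tests in your own environment.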