Biowulf High Performance Computing at the NIH

Java

Several Java Development Kits are installed in /usr/local/Java. Older versions are available for applications that require them. The latest version of java is symlinked to /usr/local/java.

When compiling against Java, load the appropriate java module file, for example:
module load java/1.8.0_92

This will set the appropriate JAVA_HOME, PATH and LD_LIBRARY_PATH environment variables.

Executing a Jar file with Java

You may run a jar file directly using the -jar flag:

java [options] -jar jarfile
Common Java Options

All java-based applications can utilize these options.

Specifying Memory

Including these options will configure the amount of memory required to run the java-based application. [size] can be defined in kilobytes (e.g. 5k), megabytes (10m), or gigabytes (8g).

It is very common to include -Xmx4g with calls to java. This requires that 4GB of memory is available to the java instance.

Specifying Scratch Space

Java-based applications will very often require a scratch space for creating temporary files during execution. By default, this is set to /tmp. Unfortunately, many genomic java applications require much more scratch space than is available in /tmp. Worse, running multiple instances of java on a single node may fill up /tmp. In this case, including the option

will configure java to use [TMPDIR] as a scratch space. Typically, this can be set to /scratch:

java -Djava.io.tmpdir=/scratch -jar jarfile

Disabling X11 Display and Keyboard Interaction

Some java applications fail to run under batch conditions because an X11 display is not available, or no keyboard is detected, even though the command should run in batch or in the background. For these situations, running java in so-called "headless" mode may allow batch runs:

java -Djava.awt.headless=true -jar jarfile

For more information about how to configure java-based applications, type

java --help

at the prompt, or go to http://www.oracle.com/technetwork/java.

Specifying the Number of Garbage Collection (GC) threads.

By default, the Java virtual machine (JVM) will calculate the available number of CPUs at runtime. This default behaviour can easily consume most of the available CPUs. In most cases, you should use the following option to limit the number of threads the JVM will use for garbage collection:

This will limit the number of parallel garbage collection threads to 2. Adjust the value as needed for your application.

Specifying the Server VM

The Server VM is intended for long-running Java applications. It is optimized and tuned to maximize peak operating speed. For long running Java processes, it is recommended to use the following option: