Post

Chapter 13: System Configuration for In-Silico Analysis

Chapter 13: System Configuration for In-Silico Analysis

This chapter provides step-by-step instructions for installing Linux (Ubuntu) on Windows using WSL2, accessing the SIUC BigDawg supercomputer, installing and configuring GROMACS for molecular dynamics simulations, and running simulations with the installed software. For detailed information, visit the SIUC Research Computing and Cyberinfrastructure and read the official user documentation.

13.1 Introduction

In-silico molecular modeling, particularly molecular dynamics (MD) simulations, requires significant computational resources. These simulations are most commonly performed in Linux-based environments, which provide a strong ecosystem of open-source scientific software, powerful command-line tools, and well-established high-performance computing (HPC) infrastructure.

Many widely used MD software packages, including GROMACS (GROningen MAchine for Chemical Simulations), are primarily developed and maintained primarily for Linux. This chapter provides intruction to setting up an MD simulation environment on both local computers and HPC systems.

We begin by installing the Ubuntu Linux distribution on a Windows computer using Windows Subsystem for Linux 2 (WSL2). WSL2 allows users to run a full Linux environment directly inside Windows. This approach provides:

  • High compatibility with Linux applications
  • Compatibility with most Linux-based MD applications
  • Easy integration with the Windows operating system

With WSL2, users can keep their Windows workflow while using the Linux tools commonly required for molecular simulations.

After setting up WSL2, we will install GROMACS, a widely used molecular dynamics package for simulating biomolecular systems. The software will be compiled and installed within the Linux environment, along with the required dependencies.

While local computers are suitable for small to medium simulations, larger molecular systems require more computational power. For these cases, researchers typically use high-performance computing (HPC) clusters.

In this chapter, we will setup account at SIU’s BigDawg HPC cluster and installing GROMACS and necessary packages at user level. BigDawg provides:

  • Parallel computing to accelerate simulations
  • Efficient resource allocation for large jobs
  • Scalable computing for complex biomolecular studies

To manage software installations at the user level on the cluster, we will use Miniconda , a lightweight package and environment manager that helps maintain consistent software environments. This chapter includes step-by-step instructions for:

  • Setting up Linux using WSL2
  • Accessing and configuring the BigDawg HPC cluster
  • Installing and configuring GROMACS
  • Running simulations using SLURM, a common workload manager for HPC systems

BBy the end of this chapter, you will have a working MD simulation environment on both local systems (Windows + WSL2) and the BigDawg HPC cluster.

13.2. WSL2

This section provides a detailed guide on installing Ubuntu (A popular Linux distribution) on Windows using Windows Subsystem for Linux 2 (WSL2). WSL2 is a full Linux kernel implementation that runs within a lightweight, virtualized environment on Windows, offering near-native performance for Linux applications. For official documentation, system requirements, and troubleshooting, refer to the Microsoft Windows Subsystem for Linux Documentation .

In many cases you will find yourself needing to a linux command line to manuever around your files and utilize important software. As such, you can install Windows Terminal, an official application designed to handle WSL and all the other command lines you may interact with on your windows machine. You can get all these tools directly from the Microsoft Store:

Description of the image
Terminal after install WSL2 & setting up

Windows Terminal (link, press view in store, if you get a prompt, click Open Microsoft Store, and then click the blue Get button to install)

Ubuntu (same as instructions as Windows Terminal)

Once these are both installed, open up the newly installed Ubuntu app. You should now see some messages appear to set up your user info.

If you aren’t too familiar with how to use a linux command line, read any of these beginner resources here to learn the essential commands.

  • We can access your Windows files from within WSL2 using the following command: cd /mnt/c
  • You can also use Windows applications from the terminal to open files to edit: notepad.exe {{text file in PWD}}
  • You can also view and edit Linux files directly in Windows File Explorer by navigating to: \\wsl.localhost\ (This provides native access to WSL2 files without requiring SSH or third-party tools)

Tip: Store large datasets inside the WSL filesystem (/home/user/) instead of Windows (/mnt/c/) to avoid performance issues.

13.3 Miniconda

Miniconda is an open-source software distribution that provides a minimal installation of Anaconda, including Conda, Python, and essential packages they depend on. It is lightweight and allows users to manage environments efficiently. More information can be found on the official Miniconda documentaion.

Using Conda simplifies package installation and dependency management. Because Miniconda can be installed without root privileges, it is especially useful for personal computers and HPC systems. The following steps describe how to install Miniconda on Linux systems, including WSL2. A similar approach can be used when installing software on the SIU BigDawg HPC cluster.

  1. Download & Install Miniconda
    • Download the installer using curl:
      curl -O https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
  • Run the installer with bash:
    bash Miniconda3-latest-Linux-x86_64.sh
  • Follow the on-screen prompts and accept the license agreement. Press Enter to review the license and type yes to accept.
  1. Initialize Conda After installation, initialize Conda:
    • conda init Restart your terminal or reload the shell configuration through
    • source ~/.bashrc
  2. Verify Installation Check if Conda is installed correctly
    • conda --version - Display a version number like conda 23.x.x or newer
    • conda info - Shows information about the Conda installation, including the active environment, Conda version, and installation paths.
  3. Conda Envirnment Conda allows you to create isolated environments for different projects. This prevents conflicts between software packages and dependencies.
  • conda list - list installed packages in the current environment:

Create a new environment (recommended for molecular dynamic tools):

  • conda create -n gromacs_env python=3.10 - create the environment
  • conda env list - List all the conda environments.
  • conda activate gromacs_env - Activate specific envirnment: biobb.
  • conda activate - Switch back to the baes (default) environment.
  1. Installing Specific Packages Many scientific packages are available through the conda-forge repository. Add the conda-forge channel:
    • conda config --add channels conda-forge
    • conda config --set channel_priority strict

General syntax for installing a package:

  • conda install --name <environment-name> <package-name>

To install GROMACS:

  • conda install gromacs - This installs GROMACS and all required dependencies automatically.

13.3 GROMACS Local Installation

For the local machine which is WSL2. We utilize the package manager.

  • sudo apt-get update && sudo apt-get install gromacs

This method ensures that dependencies are handled automatically and provides a stable version of GROMACS available in the Ubuntu package repository.

13.4 High Performance Computing Setup

13.4.1 Requesting Account in SIU BigDawg

BigDawg, SIUC’s HPC cluster, is available at no cost to faculty, researchers, and students. However, student access requires faculty or researcher supervision. To gain access to SIU’s HPC resources, follow the official instructions at: Request access to SIU’s BigDawg . For further inquiries, contact research computing:

  • Phone: 618-536-2438
  • Email: research-computing@siu.edu

After successfully gaining access to the HPC system, we will learn how to install and configure essesntial packages for in-silicousing Miniconda on the cluster. This includes installing GROMACS and its required dependencies, such as FFTW and MPI, to enable efficient parallel processing.

13.4.2 Connecting to the SIU BigDawg

To connect to the BigDawg cluster and perform remote computing from a Windows system, we will use MobaXterm or Command Prompt.

Description of the image
Overview of Cisco Secure Client connection
  1. Connect Cisco Secure Client
    • open Cisco Secure Client (VPN)
    • Connect to (see Figure 2): private.siu.edu/bigdawg
    • Use your SIU credentials:
    • Username: siu85xxxxx or email@siu.edu
    • Password : xxxx

Connecting BigDawg via SSH: Once connected to the VPN, use MobaXterm or SSH to access the cluster

  • ssh siu853xxx@bigdawg.research.siu.edu

Recommended Working Directory: It is recommended to work in the /scratch/siu85xxxxx directory, as it provides more storage compared to /home/siu85xxxxx.

  • cd /scratch/siu85xxxxx
  1. Useful Commands for Working Bash Shell

File Transfer to and from the Cluster: If you are using MobaXterm, you can easily transfer files between your local computer and the cluster using drag and drop via the built-in SFTP panel.

If you are using Command Prompt, PowerShell, or a Linux/macOS terminal, file transfers are done using the scp (secure copy) command.

  • scp <filename> siu85xxxxx@bigdawg.research.siu.edu:/scratch/siu85xxxxx/1AKI/- Copy a File from Local Computer to BigDawg
  • scp siu85xxxxx@bigdawg.research.siu.edu:/scratch/siu85xxxxx/3HTB ./local_directory/ - Copy a File from BigDawg to Local Computer

Notes:

  • Replace siu85xxxxx with your actual SIU user ID.
  • /scratch is typically used for large, temporary simulation data.
  • ./local_directory/ refers to a folder on your local machine.

To learn more about SLURM, please visit: User Guide for SLURM Scheduler .

To learn more about Linux usage in HPC environments, please visit: Introduction to Linux in HPC .

13.4.3 Installing GROMACS on SIU BigDawg

Many clusters already provide GROMACS via environment modules:

  • module avail : to display available modules.
  • load module package_name such as gromacs-2018-gcc…,
  • which gmx_gmi

If not available, Miniconda is the safest user-level method.

  1. Install GROMACS using miniconda Follow the instruction of installing and setting up miniconda at section 13.3.

Option A — Serial / OpenMP version (most stable)

  • conda install gromacs

This installs:

  • GROMACS
  • FFTW
  • OpenMP support
  • CPU-only binaries

Recommended for beginners and teaching.

Option B — MPI-enabled GROMACS (for parallel jobs)

  • conda install gromacs mpi4py openmpi

This enables:

  • gmx_mpi
  • domain decomposition across nodes Verify:
  • which gmx
  • which gmx_mpi
  • gmx --version - To verify installation
This post is licensed under CC BY 4.0 by the author.