Skip to Content
DocsHoodiniInstallation

Installation

Hoodini can be installed using several methods. Choose the one that best fits your workflow.

⏱️

Note: Hoodini requires downloading several databases on first run (~35GB uncompressed). Initial setup takes 5+ minutes depending on your internet connection and machine specs.

Installation Methods

Mamba is the recommended method for most users. It handles all dependencies automatically.

Hoodini is not yet on Bioconda. For now, clone the repository and install from source.

Install Mamba

If you don’t have Mamba installed, get it via Miniforge :

curl -L -O "https://github.com/conda-forge/miniforge/releases/latest/download/Miniforge3-$(uname)-$(uname -m).sh" bash Miniforge3-$(uname)-$(uname -m).sh

Clone the repository

git clone https://github.com/pentamorfico/hoodini.git cd hoodini

Create environment and install

mamba env create -f environment.yml mamba activate hoodini pip install -e .

Verify installation

hoodini --help

Database Download

On first run, Hoodini will download the required databases. Here are the approximate sizes:

💾

Storage requirements: Make sure you have at least 40GB of free disk space for all databases.

DatabaseSizeDescription
eggNOG-mapper~24 GBFunctional annotation (largest)
MetaCerberus HMMs~4.5 GBDomain annotation (Pfam, COG, KEGG, etc.)
Contig lengths~3.8 GBNCBI assembly metadata
geNomad~1.4 GBVirus/plasmid detection
PADLOC~1 GBDefense system detection
DefenseFinder~350 MBDefense system detection

You can skip specific databases during download:

hoodini download databases --skip-emapper --skip-genomad

Verify Installation

After installation, verify everything works:

# Check version hoodini --version # Run with test data (will download databases on first run) hoodini run -i test.fasta -o test_output --threads 4

Tip: Use the --threads flag to speed up processing. Hoodini scales well with multiple cores.

Last updated on