Installation
Hoodini can be installed using several methods. Choose the one that best fits your workflow.
Note: Hoodini requires downloading several databases on first run (~35GB uncompressed). Initial setup takes 5+ minutes depending on your internet connection and machine specs.
Installation Methods
Mamba (Recommended)
Mamba is the recommended method for most users. It handles all dependencies automatically.
Hoodini is not yet on Bioconda. For now, clone the repository and install from source.
Install Mamba
If you don’t have Mamba installed, get it via Miniforge :
curl -L -O "https://github.com/conda-forge/miniforge/releases/latest/download/Miniforge3-$(uname)-$(uname -m).sh"
bash Miniforge3-$(uname)-$(uname -m).shClone the repository
git clone https://github.com/pentamorfico/hoodini.git
cd hoodiniCreate environment and install
mamba env create -f environment.yml
mamba activate hoodini
pip install -e .Verify installation
hoodini --helpDatabase Download
On first run, Hoodini will download the required databases. Here are the approximate sizes:
Storage requirements: Make sure you have at least 40GB of free disk space for all databases.
| Database | Size | Description |
|---|---|---|
| eggNOG-mapper | ~24 GB | Functional annotation (largest) |
| MetaCerberus HMMs | ~4.5 GB | Domain annotation (Pfam, COG, KEGG, etc.) |
| Contig lengths | ~3.8 GB | NCBI assembly metadata |
| geNomad | ~1.4 GB | Virus/plasmid detection |
| PADLOC | ~1 GB | Defense system detection |
| DefenseFinder | ~350 MB | Defense system detection |
You can skip specific databases during download:
hoodini download databases --skip-emapper --skip-genomadVerify Installation
After installation, verify everything works:
# Check version
hoodini --version
# Run with test data (will download databases on first run)
hoodini run -i test.fasta -o test_output --threads 4Tip: Use the --threads flag to speed up processing. Hoodini scales well with multiple cores.