Enabling collaborative scientific research through interpretable predictive modeling and seamless model sharing.
EMUSES transforms scientific data into predictive insights, supporting research workflows from individual analysis to community-wide collaboration. Built for researchers who need both quick results and deep analytical control across diverse domains including neuroimaging, astronomy, genetics, sociology, economics, and beyond.
- Python 3.11+ (recommended for optimal performance)
- Basic command line familiarity
# 1. Create isolated environment
conda create -n emuses-env python=3.11
conda activate emuses-env
# 2. macOS ONLY: Install OpenMP (one-time setup)
# Required for XGBoost and other ML libraries
conda install -c conda-forge llvm-openmp # macOS only
# 3. Install EMUSES
pip install git+https://github.com/chrisfoulon/emuses.git
# 4. Verify installation
python -m emuses.cli --help
# 5. Run your first analysis (with sample data)
python -m emuses.cli full output_folder docs/examples/sample_data/hcp_input_data.csv --scores docs/examples/sample_data/hcp_labels.csv# 1. Create isolated environment
python -m venv emuses-env
source emuses-env/bin/activate # Linux/macOS
# emuses-env\Scripts\activate # Windows
# 2. macOS ONLY: Install OpenMP via Homebrew (one-time setup)
# brew install libomp # Required for XGBoost on macOS
# 3. Install EMUSES
pip install git+https://github.com/chrisfoulon/emuses.git
# 4. Verify installation
python -m emuses.cli --help
# 5. Run your first analysis (with sample data)
python -m emuses.cli full output_folder docs/examples/sample_data/hcp_input_data.csv --scores docs/examples/sample_data/hcp_labels.csvπ macOS Users: XGBoost requires OpenMP for multi-threading. Install via
brew install libomp(pip/venv) orconda install -c conda-forge llvm-openmp(conda). This is a one-time setup that enables high-performance ML libraries. Important: Conda users must use the conda installation (llvm-openmp) - the brew version won't be found inside conda environments.
β
Success: Your first scientific prediction model is ready in output_folder/!
π Understanding Your Results: Your analysis includes prediction models, statistical heatmaps, correlation analysis, and effect size maps. See results guide β
# Local analysis with your data
python -m emuses.cli full my_results/ my_brain_data.csv --scores my_cognitive_scores.csvPerfect for: Exploratory analysis, method development, personal research projects
# Multi-user collaboration with shared models
python -m emuses.cli models install shared_model.zip
python -m emuses.cli models list --workspace our_labPerfect for: Team collaboration, model validation, reproducible workflows
# Access community models and benchmarks
python -m emuses.cli models search "fMRI working memory"
python -m emuses.cli models info community_model_v2Perfect for: Meta-analyses, benchmarking, scientific reproducibility
- π¬ Research-Optimized: Designed for complex scientific prediction tasks (neuroimaging, astronomy, genetics, sociology, economics, and more)
- π Multi-Mode Flexibility: Local, collaborative, or cloud-based workflows
- π Model Registry: Share, discover, and reproduce predictive models
- π― Research-Focused: Designed for scientific rigor and interpretability
- β‘ Quick Start: From installation to results in under 5 minutes
- π¬ Deep Control: Comprehensive configuration for advanced users
Choose your path based on your needs:
π Quick Start Guide
5-minute path to your first results
β For time-constrained researchers who need immediate results
π Model Registry Guide
Comprehensive model sharing documentation
β For researchers who want to understand model registry capabilities
Scientific use case patterns and methodological examples
β For researchers implementing scientific analysis workflows across diverse domains (neuroimaging, astronomy, genetics, sociology, economics)
π§ API Documentation
Interactive API reference
β For computational scientists integrating EMUSES into workflows
π₯ Developer Guide
Integration and contribution guide
β For extending EMUSES or contributing to development
# Automatic setup - no configuration needed
python -m emuses.cli full output/ input_data.csv --scores scores.csv# Multi-user setup with PostgreSQL
python -m emuses.cli models status # Shows current mode
# See: docs/USER_GUIDE.md#database-mode-setup# Full production deployment
# See: docs/USER_GUIDE.md#cloud-mode-setup# Create environment
conda create -n emuses-env python=3.11
conda activate emuses-env
# macOS: Install OpenMP
conda install -c conda-forge llvm-openmp # macOS only
# Install EMUSES
pip install git+https://github.com/chrisfoulon/emuses.gitgit clone https://github.com/chrisfoulon/emuses.git
cd emuses
# Create conda environment
conda create -n emuses-dev python=3.11
conda activate emuses-dev
# macOS: Install OpenMP
conda install -c conda-forge llvm-openmp # macOS only
# Install in editable mode
pip install -e .# With Docker for full deployment
docker pull ghcr.io/chrisfoulon/emuses:latest
# See: docs/deployment/ for complete setupEMUSES includes real-world sample data from the Human Connectome Project:
- Input: Neuroimaging features from 1068 subjects
- Target: Fluid intelligence prediction task
- Location:
docs/examples/sample_data/
Perfect for testing workflows and learning EMUSES capabilities.
EMUSES enables reproducible scientific research through:
- Model Sharing: Publish and discover predictive models
- Reproducible Workflows: Standardized analysis pipelines
- Community Benchmarks: Compare methods across research groups
- Open Science: Transparent and reproducible research practices
If you use EMUSES in your research, please cite:
@software{emuses2024,
title={EMUSES: Scientific Predictive Modeling Platform},
author={Foulon, Chris and Contributors},
year={2024},
url={https://github.com/chrisfoulon/emuses},
version={0.9.0}
}- π Documentation: Full documentation portal
- π Quick Start: 5-minute tutorial
- π Model Registry: Model sharing guide
- π Issues: GitHub Issues
- π¬ Discussions: GitHub Discussions
Current Version: 0.9.0-dev (Model Registry Complete)
Next Release: 1.0.0 (Production Ready)
Test Coverage: 47.1% (Exceeds research software standards)
Status: Pre-production, active development
Documentation developed using LAD (LLM-Assisted Development) methodology with human oversight.
π§ Built for scientists, by scientists | β‘ Quick results, deep control | π€ Individual to community scale