CraftJarvis

This is the collection of our joint efforts to Craft an open-ended, multitask, generalist agent (Jarvis).

Welcome to Team CraftJarvis

At CraftJarvis, we're a passionate team committed to exploring the vast potential of AI in the dynamic, open-world environment of Minecraft. Our focus is on developing a generalist agent, an AI entity capable of mastering a wide range of tasks and challenges within this virtual world.

Publications

Here is a list of our latest publications on open-world agents (sorted by time).

  • Scalable Multi-Task Reinforcement Learning for Generalizable Spatial Intelligence in Visuomotor Agents (ROCKET-3)

    [Website] [Paper] [Code]

  • JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse (ACL 2025)

    [Website] [Paper] [Videos] [Datasets] [Models]

  • MCU: An Evaluation Framework for Open-Ended Game Agents (ICML 2025)

    [Website] [Paper] [Code]

  • Open-World Skill Discovery from Unsegmented Demonstrations (ICCV 2025)

    [Website] [Paper] [Code]

  • MineStudio: A Streamlined Package for Minecraft AI Agent Development

    [Paper] [Code] [Document]

  • ROCKET-2: Steering Visuomotor Policy via Cross-View Goal Alignment (AAAI 2026)

    [Website] [Paper] [Code] [Demo]

  • ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting (CVPR 2025)

    [Website] [Paper] [Code] [Demo]

  • GROOT-2: Weakly Supervised Multi-Modal Instruction Following Agents (ICLR 2025)

    [Paper]

  • OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents (NeurIPS 2024)

    [Website] [Paper]

  • JARVIS-1: Open-world Multi-task Agents with Memory-Augmented Multimodal Language Models (T-PAMI 2024)

    [Website] [Paper] [Code]

  • GROOT: Learning to Follow Instructions by Watching Gameplay Videos (ICLR 2024)

    [Website] [Paper] [Code]

  • Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents (NeurIPS 2023)

    [Paper] [Code]

  • Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction (CVPR 2023)

    [Paper] [Code]

Popular repositories

  1. JARVIS-1

    JARVIS-1: Open-world Multi-task Agents with Memory-Augmented Multimodal Language Models

    Java, 388 stars, 20 forks

  2. MineStudio

    MineStudio: A Streamlined Package for Minecraft AI Agent Development

    Python, 325 stars, 27 forks

  3. MC-Planner

    Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents"

    Python, 289 stars, 24 forks

  4. RAT

    Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation".

    Python, 248 stars, 27 forks

  5. JarvisVLA

    Official Implementation of "JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse"

    Python, 115 stars, 11 forks

  6. GROOT

    GROOT: Learning to Follow Instructions by Watching Gameplay Videos (ICLR'24, Spotlight)

    Java, 66 stars, 3 forks

Repositories

Showing 10 of 20 repositories
  • OpenHA

    Repo for Paper "OpenHA: A Series of Open-Source Hierarchical Agentic Models in Minecraft"

    Python, 18 stars, MIT license, 0 forks, 1 issue, 0 pull requests. Updated Dec 14, 2025
  • .github

    1 star, 0 forks, 0 issues, 0 pull requests. Updated Nov 8, 2025
  • MCU

    Python, 31 stars, 3 forks, 2 issues, 0 pull requests. Updated Oct 21, 2025
  • MineStudio

    MineStudio: A Streamlined Package for Minecraft AI Agent Development

    Python, 325 stars, MIT license, 27 forks, 6 issues (1 issue needs help), 0 pull requests. Updated Oct 12, 2025
  • SkillDiscovery

    [ICCV 2025] Official implementation of Open-World Skill Discovery from Unsegmented Demonstration Videos

    Python, 10 stars, MIT license, 0 forks, 0 issues, 0 pull requests. Updated Sep 4, 2025
  • JarvisVLA

    Official Implementation of "JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse"

    Python, 115 stars, 11 forks, 8 issues, 0 pull requests. Updated Aug 27, 2025
  • ROCKET-3

    Scalable Multi-Task Reinforcement Learning for Generalizable Spatial Intelligence in Visuomotor Agents

    13 stars, 1 fork, 1 issue, 0 pull requests. Updated Aug 19, 2025
  • webchat

    Python, 1 star, 0 forks, 0 issues, 0 pull requests. Updated Aug 6, 2025
  • ROCKET-2

    Official Implementation of Paper "ROCKET-2: Steering Visuomotor Policy via Cross-View Goal Alignment" (AAAI'26)

    Python, 40 stars, 0 forks, 0 issues, 0 pull requests. Updated Jul 2, 2025
  • craftjarvis.github.io

    HTML, 1 star, MIT license, 0 forks, 0 issues, 0 pull requests. Updated May 25, 2025