- Context-Aware: Uses vector embeddings and semantic search to understand your codebase
- Fast & Local: Runs locally using Ollama - no API keys, no data leaves your machine
- Smart File Inference: Automatically detects which files you want to edit from natural language
- Tree-Sitter Parsing: Function-level code understanding, not just file-level
- Interactive Chat: Streaming responses with syntax highlighting
- File Operations: Create, edit, and modify files with confirmation
- Beautiful UI: Rich terminal interface with syntax highlighting and themes
- Install Ollama (for running models locally)

# macOS
brew install ollama

# Linux
curl -fsSL https://ollama.com/install.sh | sh

# Windows
# Download from https://ollama.com

- Pull a coding model

# Recommended: Balanced speed and quality
ollama pull qwen2.5-coder:3b

# Alternative options:
# ollama pull qwen2.5-coder:1.5b   # Faster, lower quality
# ollama pull deepseek-coder:6.7b  # Slower, higher quality
# Clone the repository
git clone https://github.com/yttrium400/sage.git
cd sage
# Install dependencies
cd cli
python3 -m venv .venv
source .venv/bin/activate # On Windows: .venv\Scripts\activate
pip install -r requirements.txt
# Run setup wizard
python3 main.py setup

# Start interactive chat
python3 main.py chat
# Ask a single question
python3 main.py ask "how does streaming work?"
# Index your codebase
python3 main.py index
# List available models
python3 main.py models

You: create a calculator with add, subtract, multiply, divide
Sage: [Creates calc.py with proper functions]
You: add authentication to the project
Sage: [Creates auth.py with authentication logic]
You: edit model.py to add error handling
Sage: [Finds model.py and adds error handling]
You: How does streaming work?
Sage: [Finds generate_response_streaming in model.py]
You: Show me all file operations
Sage: [Finds write_file, read_file functions]
You: What files use the model?
Sage: [Identifies chat.py imports model.py]
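The file inference shown in the examples above does not need the LLM at all: a couple of regex passes over the request are enough to spot explicit paths like model.py, with a keyword fallback against already-indexed files. A minimal sketch of the idea (the pattern, helper name, and fallback rule here are illustrative, not Sage's exact implementation):

```python
import re
from pathlib import Path

# Hypothetical helper: pull explicit file references like "model.py" out of a
# request such as "edit model.py to add error handling", then fall back to
# matching query keywords against the stems of indexed files.
FILE_PATTERN = re.compile(r"\b[\w./-]+\.(?:py|md|txt|json|toml|yaml)\b")

def infer_target_files(query: str, indexed_files: list[str]) -> list[str]:
    # 1. Explicit paths mentioned in the query
    explicit = FILE_PATTERN.findall(query)
    hits = [f for f in indexed_files if Path(f).name in explicit]
    if hits:
        return hits
    # 2. Keyword fallback: match query words against file stems ("auth" -> auth.py)
    words = set(re.findall(r"[a-z_]+", query.lower()))
    return [f for f in indexed_files if Path(f).stem.lower() in words]

print(infer_target_files("edit model.py to add error handling",
                         ["cli/model.py", "cli/chat.py"]))
# -> ['cli/model.py']
```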
- Vector Search: ChromaDB + sentence-transformers for semantic code search
- Code Parsing: Tree-sitter for AST-based Python parsing
- LLM Backend: Ollama for local model inference
- UI: Rich library for beautiful terminal output
- CLI Framework: Click for command-line interface
- Indexing: Tree-sitter parses your Python files into function/class chunks
- Embedding: Each chunk is converted to a 384-dim vector using sentence-transformers
- Storage: Vectors stored in ChromaDB for fast similarity search
- Query: Your question is embedded and matched against code chunks
- Context: Top-K relevant chunks sent to LLM with your query
- Response: LLM generates answer with full codebase understanding
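In code, the indexing half of this pipeline is essentially "embed each chunk, store it with its metadata". A minimal sketch, assuming the function/class chunks have already been extracted by tree-sitter (the embedding model, collection name, and index path are assumptions, not Sage's actual configuration):

```python
import chromadb
from sentence_transformers import SentenceTransformer

# all-MiniLM-L6-v2 produces the 384-dim vectors mentioned above (assumed model choice).
embedder = SentenceTransformer("all-MiniLM-L6-v2")
client = chromadb.PersistentClient(path=".sage_index")  # illustrative index location
collection = client.get_or_create_collection("code_chunks")

# In Sage, tree-sitter yields one chunk per function/class; two hard-coded stand-ins here.
chunks = [
    {"id": "calc.py::add", "file": "calc.py",
     "code": "def add(a, b):\n    return a + b"},
    {"id": "model.py::generate_response_streaming", "file": "model.py",
     "code": "def generate_response_streaming(prompt):\n    ..."},
]

collection.add(
    ids=[c["id"] for c in chunks],
    documents=[c["code"] for c in chunks],
    embeddings=embedder.encode([c["code"] for c in chunks]).tolist(),
    metadatas=[{"file": c["file"]} for c in chunks],
)
```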
┌───────────────────┐
│    Your Query     │
└─────────┬─────────┘
          │
          ├──► File Inference (regex patterns)
          │
          ├──► Embedding (sentence-transformers)
          │
          ├──► Vector Search (ChromaDB)
          │
          └──► Context Assembly
                 ├──► Top-K Code Chunks
                 ├──► Import Dependencies
                 └──► File Metadata
                          │
                          ▼
                ┌───────────────────┐
                │   LLM + Context   │
                └───────────────────┘
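The query half of the diagram can be sketched the same way: embed the question, pull the top-K chunks from ChromaDB, and hand the assembled context to the local model through Ollama. The prompt wording and top-K value below are illustrative only, not Sage's actual prompts:

```python
import chromadb
import ollama
from sentence_transformers import SentenceTransformer

embedder = SentenceTransformer("all-MiniLM-L6-v2")
collection = chromadb.PersistentClient(path=".sage_index").get_or_create_collection("code_chunks")

query = "How does streaming work?"

# Embed the question and fetch the top-K most similar code chunks.
results = collection.query(
    query_embeddings=embedder.encode([query]).tolist(),
    n_results=5,
)
context = "\n\n".join(results["documents"][0])

# Send the assembled context plus the question to the local model via Ollama.
stream = ollama.chat(
    model="qwen2.5-coder:3b",
    messages=[
        {"role": "system", "content": f"Relevant code from the user's project:\n\n{context}"},
        {"role": "user", "content": query},
    ],
    stream=True,
)
for chunk in stream:
    print(chunk["message"]["content"], end="", flush=True)
```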
- Indexing: ~5 seconds for a typical project (7 files, 40 chunks)
- Query Speed: <300ms per semantic search
- Memory: ~100MB for embedding model + vectors
- Model Size: 1.9GB (qwen2.5-coder:3b)
# Run automated test suite
python3 run_tests.py
# Run manual tests
python3 test_context_awareness.py
# Quick smoke test (5 minutes)
# See docs/QUICK_TEST_REFERENCE.md

Test Coverage:
- Semantic search accuracy: 91.7%
- File inference: Keyword and explicit path detection
- Cross-file understanding: Import dependency tracking
- Multiple file operations
- Context Awareness - Technical implementation details
- Testing Guide - Comprehensive test scenarios
- Quick Test Reference - Fast 5-minute smoke tests
- Architecture - System design and components
sage/
├── cli/
│   ├── main.py             # Entry point (Click commands)
│   ├── chat.py             # Interactive chat interface
│   ├── model.py            # Ollama integration
│   ├── context.py          # Vector search & code parsing
│   ├── file_ops.py         # File read/write/edit operations
│   ├── theme.py            # UI themes and styling
│   ├── run_tests.py        # Automated test runner
│   └── requirements.txt    # Python dependencies
└── docs/                   # Documentation
- context.py: Core intelligence - semantic search, file inference, tree-sitter parsing
- chat.py: User interaction - streaming, syntax highlighting, file proposals
- model.py: LLM integration - prompts, streaming, model management
- file_ops.py: File operations - smart code extraction, diff preview, confirmation
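As one example of the file_ops.py flow, a diff-preview-then-confirm step can be built from the standard library plus Rich. The function below is a hypothetical sketch under those assumptions, not Sage's actual API:

```python
import difflib
from pathlib import Path

from rich.console import Console
from rich.prompt import Confirm
from rich.syntax import Syntax

console = Console()

def propose_edit(path: str, new_content: str) -> bool:
    """Show a unified diff of the proposed change and apply it only on confirmation."""
    old_content = Path(path).read_text() if Path(path).exists() else ""
    diff = "".join(difflib.unified_diff(
        old_content.splitlines(keepends=True),
        new_content.splitlines(keepends=True),
        fromfile=f"{path} (current)",
        tofile=f"{path} (proposed)",
    ))
    console.print(Syntax(diff or "(new file)", "diff", theme="ansi_dark"))
    if Confirm.ask(f"Apply changes to {path}?"):
        Path(path).write_text(new_content)
        return True
    return False
```

Writing to disk only after Confirm.ask keeps the "with confirmation" guarantee from the feature list.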
- Context-aware semantic search
- Smart file inference from natural language
- Tree-sitter AST parsing
- Interactive chat with streaming
- File creation/editing with confirmation
- Multi-language support (JavaScript, TypeScript, Go)
- VSCode extension
- Incremental indexing (watch mode)
- Custom model fine-tuning for Python
Contributions welcome! Please feel free to submit a Pull Request.
MIT License - see LICENSE file for details
Technologies Used:
- Ollama - Local LLM inference
- ChromaDB - Vector database
- sentence-transformers - Text embeddings
- tree-sitter - Code parsing
- Rich - Terminal UI
Built with ❤️ for Python developers