The AI coding teammate that plans before it codes β and runs on your machine, your models, your terms.
English Β· EspaΓ±ol Β· FranΓ§ais Β· δΈζ Β· ΰ€Ήΰ€Ώΰ€¨ΰ₯ΰ€¦ΰ₯ Β· Π ΡΡΡΠΊΠΈΠΉ Β· Italiano
AILIENANT is an autonomous coding agent that lives inside VS Code. You describe what you want in plain language; AILIENANT writes a real plan, makes the edits, runs the code in a sandbox, reads the results, and fixes its own mistakes β while showing you every step of its thinking.
What makes it different from the popular AI assistants is where it runs and how it decides. AILIENANT is local-first: it can run entirely on your own machine with open models (Ollama, LM Studio, and others), reaching for the cloud only when a task genuinely needs it β and it tells you, in dollars, when it does. Your code doesn't have to leave your laptop, and you are never locked into a single vendor.
Your code deserves more than magical autocomplete or autonomous agents that operate blindly. Ailienant is the only agent for VS Code built for engineers who demand rigor, auditability, and total control.
We operate under the Spec-Driven Development philosophy. Ailienant reads your architecture as a living graph (GraphRAG) and uses state-of-the-art reasoningβDreaming, MCTS (Monte Carlo Tree Search), and Harness Systemsβto plan and execute complex tasks across multiple files. This guarantees pinpoint accuracy and drastically fewer errors, even overcoming the context limitations of local models on PCs.
Everything is integrated in an easy-to-use way, but without being a "black box." You dictate the rules: choose the model, change the execution engine, control the budget, and audit every step of the pipeline. Ailienant executes autonomously, but pauses for your approval before any critical action (HITL). True autonomy, zero vendor lock-in. You own your pipeline.
In one line: a private, cost-aware, plan-first AI engineer for your codebase β open source, with no vendor lock-in.
- π§ It plans before it codes. A real team of specialized agents β a Researcher maps your code, a Planner turns the request into a concrete spec and task list and freezes the scope, an Orchestrator drives the steps, a Coder (in one of 8 expert roles) makes the edits, and an Analyst you can chat with explains the codebase. A drift guard keeps the agent from quietly wandering off and rewriting half your project.
- π Your code stays yours. Run 100% locally with your own models. No mandatory cloud, no telemetry phone-home, no training on your repository.
- πΈ You see the cost. Every task has a live token ledger and a hard budget ceiling. Local vs. cloud usage and estimated savings are shown, not hidden.
- πͺ You see the thinking. A live "Thought Box" streams the model's reasoning, and a step-by-step trace shows every file read, command run, and patch proposed.
- βͺ You can rewind. Every step of a task is a durable checkpoint. Branch from any point to explore an alternative β true time-travel debugging for an agent.
- π‘οΈ It runs code safely. Generated commands execute inside a sandbox (Docker, with WebAssembly and human-approval fallbacks), never blindly against your machine.
- π No lock-in. Bring your own model and provider β Ollama, LM Studio, vLLM, llama.cpp, OpenAI, Anthropic, Google, DeepSeek, Mistral, and more β and switch any time.
| AILIENANT | Typical cloud assistant | |
|---|---|---|
| Runs fully on your machine | β Local-first, BYO model | β Cloud-only |
| Plans, researches, codes, self-checks | β A 5-agent team with a drift guard | β One model, one shot |
| Smart localβcloud routing | β Picks the cheapest tier that can do the job | β Fixed |
| Shows cost in real time | β Token ledger + budget ceiling | |
| Time-travel / branch a run | β Durable checkpoints | β Stateless |
| Sandboxed execution | β Docker / Wasm / approval-gated | |
| Vendor lock-in | β None β swap providers freely | β Locked to one |
A fuller technical comparison lives in HowItWorks.md.
AILIENANT isn't one model doing everything β it's a small team of specialists, each with one job, wired together by a stateful LangGraph engine:
| Agent | What it does |
|---|---|
| π Researcher | Builds a "skeleton map" of your codebase β signatures and cross-module relationships β so the Planner reasons over real structure, not guesses. |
| π§ Planner | Turns your request into a concrete, schema-validated spec and task list (a WBS), then freezes the scope so the work can't sprawl. |
| ποΈ Orchestrator | Drives the plan step by step, coordinating state and routing each step to the right model tier. |
| π οΈ Coder | Makes the actual edits β adopting one of 8 expert roles per task. |
| π¬ Analyst (Natt) | A read-only tutor you can chat with. It explains your code and AILIENANT itself, but never touches files β the voice, not the hand. |
The Coder specializes into the role each task needs: core-dev, architect/refactor, devops/infra, secops, qa-tester, doc-manager, vcs-manager, data/ML engineer β each with its own tools, guardrails, and approval triggers (e.g. a .env edit always pauses for you).
When a step fails, a self-healing loop reads the error and proposes a corrected patch before giving up; for open-ended steps, a bounded ReAct cell works against a live terminal until the job is done. The full per-agent breakdown is in HowItWorks.md.
AILIENANT assumes that an autonomous agent will eventually try to do something it shouldn't β and is built to contain it.
- Sandboxed by default. Commands run in an isolated Docker container (read-only workspace, no network, non-root) with WebAssembly and human-in-the-loop fallbacks when Docker isn't available.
- Fail-closed permissions. Every tool is classified by privilege; anything unrecognized is treated as dangerous until proven safe, never the other way around.
- Human approval where it matters. Risky actions and budget overruns pause for your explicit approval.
- Tamper-evident audit trail. Approvals are recorded in a cryptographically chained (blake2b) ledger you can verify.
- Multi-tenant isolation. Every piece of indexed memory is namespaced to its workspace, so projects never leak into each other.
Full walkthrough: HowToUseIt.md
Prerequisites: Python 3.10+ (3.13 recommended), Node.js 20+, VS Code 1.85+, and at least one model source (a local Ollama/LM Studio install, a LiteLLM proxy, or cloud API keys).
# 1. Backend (the orchestration engine)
cd ailienant-core
python -m venv venv
.\venv\Scripts\activate # Unix: source venv/bin/activate
pip install -r requirements.txt
copy ..\.env.example ..\.env # Unix: cp ../.env.example ../.env
# 2. Extension (the VS Code UI)
cd ..\ailienant-extension
npm install
npm run compileThen open the project in VS Code and press F5 to launch the extension. The first time you open an AILIENANT session it starts the backend for you on an automatically-assigned local port (a free 127.0.0.1 port, e.g. http://127.0.0.1:59247/) and wires the UI to it β there's no port to configure. It then begins indexing your workspace. Configure your models from the in-app BYOM panel, type a request, and you're off.
Running the backend by hand (headless / CI)? Launch it with
uvicorn main:app --port 8000and point the extension'sbackendUrlsetting at it. The auto-assigned port is only for the normal in-VS-Code flow.
You ask ββΆ Researcher ββΆ Planner ββΆ drift guard ββΆ Coder ββΆ sandbox runs it
(maps the (spec + (scope (edits β² β
codebase) plan) locked) files) β βΌ
self-heal ββ read the result
Behind the scenes, a stateful LangGraph engine routes each task between local and cloud models using a context-and-complexity score β always picking the cheapest tier that can do the job and only reaching for the cloud when a task genuinely needs it.
It retrieves the right files with GraphRAG: instead of dumping whole files into the prompt, it indexes your code as a dependency graph (Tree-sitter) with vector embeddings, then pulls just the relevant slice via vector search + a k-hop dependency walk ranked by importance (PageRank). That keeps prompts small β a ~70 % mean reduction in prompt size β which is exactly what lets AILIENANT run well on modest hardware: per-tier budgets keep context inside a small local model's window (as little as a 4 K-token window), and the index lives in a fast, RAM-first store. Every step is checkpointed so nothing is lost.
Built on a spec, not a guess. Before any file is touched, the Planner turns your request into a frozen MissionSpecification β outcome, scope, WBS steps, constraints, and acceptance criteria (TDD + DDD terminology included). Once frozen, neither the Planner nor the Coder can silently change scope: a drift_monitor compares every re-plan against the original using a multi-factor similarity metric and escalates to you if they diverge. The spec is the contract; the agent can't self-authorize scope changes.
Failures route, they don't crash. Every agent turn runs inside a structured execution harness: a reflexion_guard intercepts exceptions and routes them to a dedicated repair agent (instead of surfacing a traceback), a deterministic finops_gate enforces your cost ceiling on every graph step, and structured verdicts β not raw stdout β drive all retry decisions. If a node has an unhandled exception it's written to a dead-letter queue before the error propagates, so you can inspect and resume.
The deep version β diagrams, the full spec schema, and the repair routing logic β is in HowItWorks.md.
Not every question needs the agent to do something β sometimes you just want to understand. The Analyst (Natt) is a chat companion living in a side panel: ask it "how does auth flow through this service?", "what would break if I change this function?", or even "how does AILIENANT's routing actually work?" and it answers in plain language.
It's a read-only tutor β the voice, never the hand. It explains, traces, and teaches, but it never edits your files, so you can explore freely without it changing anything.
What makes its answers trustworthy is what it's grounded in β three sources at once: your code's knowledge graph (so it cites the real structure, not a hallucination), your workspace README (so it knows your project's intent), and AILIENANT's own product docs (so it can explain the tool itself). And because explaining is cheaper than coding, you pick the answer model from a small selector β a fast local model for quick questions, a stronger one for a deep architectural walkthrough β without affecting retrieval quality.
AILIENANT's understanding of your codebase isn't a black box. The built-in dashboard renders the GraphRAG index as an interactive knowledge graph β a force-directed map of your files and their dependencies, where the most-connected "hub" files stand out, related modules share a color, and importance (PageRank) drives the layout. A companion 2D vector map projects how the engine semantically groups your code. It's a living picture of what the agent knows, and how it decides what to read.
- π§© MCP servers. AILIENANT speaks the Model Context Protocol, with a curated registry of vetted servers (GitHub, Brave Search, Docker, Postgres) you can enable in a click. Every MCP tool is privilege-classified β unknown tools are treated as dangerous until proven safe β and trusted only for the session after you approve them.
- β‘ Skills. Save reusable instruction snippets β global or per-workspace β and drop them into any prompt. Your own command templates, versioned with the project.
- π§° Tools. Agents act through a typed, role-gated tool registry: reading and tracing code, editing files transactionally, running commands in the sandbox, and asking you when they're unsure. The catalog is growing toward ~56 role-assigned tools (see the roadmap in docs/PROJECT_MANIFEST.md); the full table β which agent uses which tool β is in HowItWorks.md.
Coding is bursty β you step out for lunch, you log off for the night. Dreaming Mode turns that idle time into progress. You point AILIENANT at what to think about β architecture & patterns, refactoring & technical debt, bug fixes, the whole workspace, or a theme you type β and while you're away it works that focus autonomously: studying the code, consolidating what it learns into long-term memory, and exploring improvements. It self-corrects as it goes and stops on its own if errors start to compound.
Crucially, it never wakes on a timer to ambush your machine β you decide when to spend the resources by starting it when you step away. It's budget-capped (it refuses once a session's spend ceiling is hit) and safe: if you come back and save a file mid-pass, that pass aborts cleanly without writing.
Pick the profile that fits the break you're taking β they trade speed, cost, and depth:
| Profile | Best for | Roughly |
|---|---|---|
| Medium | A lunch break β light, fully local | 1 task Β· 3 files Β· ~60 min |
| Big | Overnight β deeper, more files, local | 3 tasks Β· 10 files Β· nightly |
| Cloud | Top-quality reasoning, bounded by tokens | 1 task Β· 5 files Β· token-capped |
| Hybrid | Cloud plans, local model edits β quality at lower cost | 2 tasks Β· 6 files |
The full mechanism β what each profile can actually achieve, the time envelopes, and how the offline tree-search (MCTS) vets candidate changes β is in HowItWorks.md.
The agent works against a persistent, interactive terminal β a real shell session that remembers its working directory and environment across commands, streams output live, and can be interrupted β all inside the sandbox. The control panel (an in-app dashboard, served locally) gives you eleven views over a running session: cost and routing telemetry, hardware and runtime status, the memory graph, BYOM models, MCP servers and skills, governance rules, a staging area to review pending patches, a tamper-evident audit ledger, and crash recovery.
| Doc | For whom |
|---|---|
| HowToUseIt.md | Anyone β install, configure, and run your first task, step by step |
| HowItWorks.md | The curious β architecture, routing, and the safety model explained |
| DEVELOPERS.md | Core developers β deep internals, diagrams, pseudocode, code map |
| CONTRIBUTING.md | Contributors β setup, standards, and how to send a great PR |
| docs/PROJECT_MANIFEST.md | The full phase-by-phase roadmap |
| docs/DEV_JOURNAL.md | Active engineering log (Phase 8.x) |
| docs/DEV_JOURNAL_ARCHIVE.md | Compressed history β Phase 0 through 7.19 |
AILIENANT is open source and contributions are welcome β from fixing a typo to closing a roadmap item. Please start with CONTRIBUTING.md.
One thing to know up front: because the project is dual-licensed (see below), every contributor signs a quick Contributor License Agreement before their first PR is merged. It's a one-time step and you keep the copyright to your work.
AILIENANT is open-core and dual-licensed:
- Community Edition β GNU AGPL-3.0. Free to use, study, modify, and share. If you distribute it or run a modified version as a network service, you share your source under the same license.
- Commercial / Enterprise Edition. For organizations that can't accept the AGPL's terms or want enterprise features and support.
See LICENSING.md for the full picture and how to obtain a commercial license.
The AILIENANT name and logos are trademarks of the project and are not covered by the AGPL.
Built for engineers who want an AI teammate they can actually trust β and audit.
Standing on the shoulders of LangGraph Β· LanceDB Β· Tree-sitter Β· LiteLLM Β· Pydantic.