Vector Mcp

CLI or API | MCP | Agent

Version: 1.39.0

Overview

Vector Mcp is a production-grade Agent and Model Context Protocol (MCP) server designed to interface directly with Integrate RAG into AI Agents via MCP Server. Supports multiple Vector database technologies..

Key Features

Consolidated Action-Routed MCP Tools: Minimizes token overhead and eliminates tool bloat in LLM contexts by grouping methods into optimized, togglable tool modules.
Enterprise-Grade Security: Comprehensive support for Eunomia policies, OIDC token delegation, and granular execution context tracking.
Integrated Graph Agent: Built-in Pydantic AI agent supporting the Agent Control Protocol (ACP) and standard Web interfaces (AG-UI).
Native Telemetry & Tracing: Out-of-the-box OpenTelemetry exports and native Langfuse tracing.

CLI or API

This agent wraps the Integrate RAG into AI Agents via MCP Server. Supports multiple Vector database technologies. API. You can interact with it programmatically or via its integrated execution entrypoints.

Detailed instructions on how to use the underlying API wrappers, extended schema bindings, and developer SDK references are maintained in docs/index.md.

MCP

This server utilizes dynamic Action-Routed tools to optimize token overhead and maximize IDE compatibility.

Available MCP Tools

Tool Module	Toggle Env Var	Enabled by Default	Description & Nested Methods
Collection Management	`COLLECTION_MANAGEMENT_TOOL`	`True`	Manage collection management operations.

    Actions:
      - 'create_collection': Creates a new collection or retrieves an existing one in the vector database.
      - 'add_documents': Adds documents to an existing collection in the vector database.
      - 'delete_collection': Deletes a collection from the vector database.
      - 'list_collections': Lists all collections in the vector database. Action-routed methods: `add_documents`, `create_collection`, `delete_collection`, `list_collections`. |

| Search | SEARCH_TOOL | True | Manage search operations.

    Actions:
      - 'semantic_search': Retrieves and gathers related knowledge from the vector database instance using the question variable.
      - 'lexical_search': This is a lexical or term based search that retrieves and gathers related knowledge from the database instance using the question variable via BM25.
      - 'search': Performs a hybrid search combining semantic (vector) and lexical (BM25) methods. Action-routed methods: `lexical_search`, `search`, `semantic_search`. |

    Actions:
      - 'create_collection': Creates a new collection or retrieves an existing one in the vector database.
      - 'add_documents': Adds documents to an existing collection in the vector database.
      - 'delete_collection': Deletes a collection from the vector database.
      - 'list_collections': Lists all collections in the vector database. Action-routed methods: `add_documents`, `create_collection`, `delete_collection`, `list_collections`. |

| Search | SEARCH_TOOL | True | Manage search operations.

    Actions:
      - 'semantic_search': Retrieves and gathers related knowledge from the vector database instance using the question variable.
      - 'lexical_search': This is a lexical or term based search that retrieves and gathers related knowledge from the database instance using the question variable via BM25.
      - 'search': Performs a hybrid search combining semantic (vector) and lexical (BM25) methods. Action-routed methods: `lexical_search`, `search`, `semantic_search`. |

    Actions:
      - 'create_collection': Creates a new collection or retrieves an existing one in the vector database.
      - 'add_documents': Adds documents to an existing collection in the vector database.
      - 'delete_collection': Deletes a collection from the vector database.
      - 'list_collections': Lists all collections in the vector database. Action-routed methods: `add_documents`, `create_collection`, `delete_collection`, `list_collections`. |

| Search | SEARCH_TOOL | True | Manage search operations.

    Actions:
      - 'semantic_search': Retrieves and gathers related knowledge from the vector database instance using the question variable.
      - 'lexical_search': This is a lexical or term based search that retrieves and gathers related knowledge from the database instance using the question variable via BM25.
      - 'search': Performs a hybrid search combining semantic (vector) and lexical (BM25) methods. Action-routed methods: `lexical_search`, `search`, `semantic_search`. |

Detailed tool schemas, parameter shapes, and validation constraints are preserved in docs/mcp.md.

Dynamic Tool Selection & Visibility

This MCP server supports dynamic toolset selection and visibility filtering at runtime. This allows you to restrict the set of exposed tools in order to prevent blowing up the LLM's context window.

You can configure tool filtering via multiple input channels:

CLI Arguments: Pass --tools or --toolsets (or their disabled counterparts --disabled-tools and --disabled-toolsets) during startup.
Environment Variables: Define standard environment variables:
- MCP_ENABLED_TOOLS / MCP_DISABLED_TOOLS
- MCP_ENABLED_TAGS / MCP_DISABLED_TAGS
HTTP SSE Request Headers: Pass custom headers during transport initialization:
- x-mcp-enabled-tools / x-mcp-disabled-tools
- x-mcp-enabled-tags / x-mcp-disabled-tags
HTTP SSE Request Query Parameters: Append query parameters directly to your transport connection URL:
- ?tools=tool1,tool2
- ?tags=tag1

When query strings or parameters are supplied, an LLM-free Knowledge Graph resolution layer (using DynamicToolOrchestrator) matches query intents against known tool tags, names, or descriptions, with safe fallback and automated 24-hour background cache refreshing.

MCP Configuration Examples

stdio Transport (Recommended for local IDEs e.g., Cursor, Claude Desktop)

Configure your IDE's mcp.json to launch the MCP server via uvx:

{
  "mcpServers": {
    "vector-mcp": {
      "command": "uvx",
      "args": [
        "--from",
        "vector-mcp",
        "vector-mcp"
      ],
      "env": {
        "VECTOR_URL": "your_vector_url_here",
        "EMBEDDING_MODEL_ID": "your_embedding_model_id_here",
        "CHUNK_SIZE": "your_chunk_size_here",
        "VECTOR_API_KEY": "your_vector_api_key_here"
      }
    }
  }
}

Streamable-HTTP Transport (Recommended for production deployments)

Configure your client's mcp.json to launch the Streamable-HTTP server via uvx with explicit host and port definition:

{
  "mcpServers": {
    "vector-mcp": {
      "command": "uvx",
      "args": [
        "--from",
        "vector-mcp",
        "vector-mcp"
      ],
      "env": {
        "TRANSPORT": "streamable-http",
        "HOST": "0.0.0.0",
        "PORT": "8000",
        "VECTOR_URL": "your_vector_url_here",
        "EMBEDDING_MODEL_ID": "your_embedding_model_id_here",
        "CHUNK_SIZE": "your_chunk_size_here",
        "VECTOR_API_KEY": "your_vector_api_key_here"
      }
    }
  }
}

Alternatively, connect to a pre-deployed remote or local Streamable-HTTP instance:

{
  "mcpServers": {
    "vector-mcp": {
      "url": "http://localhost:8000/vector-mcp/mcp"
    }
  }
}

Deploying the Streamable-HTTP server via Docker:

docker run -d \
  --name vector-mcp-mcp \
  -p 8000:8000 \
  -e TRANSPORT=streamable-http \
  -e PORT=8000 \
  -e VECTOR_URL="your_value" \
  -e EMBEDDING_MODEL_ID="your_value" \
  -e CHUNK_SIZE="your_value" \
  -e VECTOR_API_KEY="your_value" \
  knucklessg1/vector-mcp:latest

Agent

This repository features a fully integrated Pydantic AI Graph Agent. It communicates over the Agent Control Protocol (ACP) and interacts seamlessly with the Agent Web UI (AG-UI) and Terminal interface.

Running the Agent CLI

To start the interactive command-line agent:

# Set credentials
export VECTOR_URL="your_value"
export EMBEDDING_MODEL_ID="your_value"
export CHUNK_SIZE="your_value"
export VECTOR_API_KEY="your_value"

# Run the agent server
vector-agent --provider openai --model-id gpt-4o

Docker Compose Orchestration

The following docker/agent.compose.yml configures the Agent, Web UI, and Terminal Interface together:

version: '3.8'

services:
  vector-mcp-mcp:
    image: knucklessg1/vector-mcp:latest
    container_name: vector-mcp-mcp
    hostname: vector-mcp-mcp
    restart: always
    env_file:
      - ../.env
    environment:
      - PYTHONUNBUFFERED=1
      - HOST=0.0.0.0
      - PORT=8000
      - TRANSPORT=streamable-http
    ports:
      - "8000:8000"
    healthcheck:
      test: ["CMD", "python3", "-c", "import urllib.request; urllib.request.urlopen('http://localhost:8000/health')"]
      interval: 30s
      timeout: 10s
      retries: 3
      start_period: 10s
    logging:
      driver: json-file
      options:
        max-size: "10m"
        max-file: "3"

  vector-mcp-agent:
    image: knucklessg1/vector-mcp:latest
    container_name: vector-mcp-agent
    hostname: vector-mcp-agent
    restart: always
    depends_on:
      - vector-mcp-mcp
    env_file:
      - ../.env
    command: [ "vector-agent" ]
    environment:
      - PYTHONUNBUFFERED=1
      - HOST=0.0.0.0
      - PORT=9023
      - MCP_URL=http://vector-mcp-mcp:8000/mcp
      - PROVIDER=${PROVIDER:-openai}
      - MODEL_ID=${MODEL_ID:-gpt-4o}
      - ENABLE_WEB_UI=True
      - ENABLE_OTEL=True
    ports:
      - "9023:9023"
    healthcheck:
      test: ["CMD", "python3", "-c", "import urllib.request; urllib.request.urlopen('http://localhost:9023/health')"]
      interval: 30s
      timeout: 10s
      retries: 3
      start_period: 10s
    logging:
      driver: json-file
      options:
        max-size: "10m"
        max-file: "3"

Detailed graph node architecture explanations, custom skill configurations, and agentic trace guides are available in docs/agent.md.

Security & Governance

Built directly upon the enterprise-ready agent-utilities core, standard security parameters are fully supported:

Access Control & Policy Enforcement

Eunomia Policies: Fine-grained, policy-driven tool authorization. Supports none, local embedded (mcp_policies.json), or centralized remote modes.
OIDC Token Delegation: Compliant with RFC 8693 token exchange for flowing authenticating user credentials from Web UI / ACP → Agent → MCP.
Scoped Credentials: Execution context runs restricted to the specific caller identity.

Runtime Security Grid

Feature	Functionality	Enablement
Tool Guard	Sensitivity inspection with human-in-the-loop validation	Enabled by default
Prompt Injection Defense	Input scanning, repetition monitoring, and recursive loop blocks	Enabled by default
Context Safety Guard	Stuck-loop detectors and contextual overflow preemptive alerts	Enabled by default

Installation

Install the Python package locally:

# Using uv (highly recommended)
uv pip install vector-mcp[all]

# Using standard pip
python -m pip install vector-mcp[all]

Repository Owners

Contribute

Contributions are welcome! Please ensure code quality by executing local checks before submitting pull requests:

Format code using ruff format .
Lint code using ruff check .
Validate type-safety with mypy .
Execute test suites using pytest

Name		Name	Last commit message	Last commit date
Latest commit History 224 Commits
.github/workflows		.github/workflows
.specify/specs		.specify/specs
docker		docker
docs		docs
scripts		scripts
tests		tests
vector_mcp		vector_mcp
.bumpversion.cfg		.bumpversion.cfg
.codespellignore		.codespellignore
.dockerignore		.dockerignore
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
a2a.json		a2a.json
mcp_config.json		mcp_config.json
mkdocs.yml		mkdocs.yml
opencode.json		opencode.json
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
requirements.txt		requirements.txt
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Vector Mcp

CLI or API | MCP | Agent

Overview

Key Features

CLI or API

MCP

Available MCP Tools

Dynamic Tool Selection & Visibility

MCP Configuration Examples

stdio Transport (Recommended for local IDEs e.g., Cursor, Claude Desktop)

Streamable-HTTP Transport (Recommended for production deployments)

Agent

Running the Agent CLI

Docker Compose Orchestration

Security & Governance

Access Control & Policy Enforcement

Runtime Security Grid

Installation

Repository Owners

Contribute

About

Uh oh!

Releases 82

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Vector Mcp

CLI or API | MCP | Agent

Overview

Key Features

CLI or API

MCP

Available MCP Tools

Dynamic Tool Selection & Visibility

MCP Configuration Examples

stdio Transport (Recommended for local IDEs e.g., Cursor, Claude Desktop)

Streamable-HTTP Transport (Recommended for production deployments)

Agent

Running the Agent CLI

Docker Compose Orchestration

Security & Governance

Access Control & Policy Enforcement

Runtime Security Grid

Installation

Repository Owners

Contribute

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 82

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages