gramene-mcp

An MCP server that connects Claude and other AI agents to the Gramene plant genomics database via its public REST API at data.gramene.org. It exposes a suite of tools covering gene search, comparative genomics, expression, loss-of-function germplasm, ontology enrichment, and literature discovery.

Requirements

Node.js v18 or later (ES modules + native fetch)

No local infrastructure (Solr, MongoDB, Docker) is needed — every tool call goes to the public Gramene REST API.

Installation

git clone https://github.com/warelab/gramene-mcp.git
cd gramene-mcp
npm install

Configuration

Variable	Default	Description
`MCP_HOST`	`127.0.0.1`	Interface to listen on
`MCP_PORT`	`8787`	Port to listen on
`MCP_ALLOWED_ORIGINS`	(localhost only)	Comma-separated CORS origins; set to `*` to allow all
`MCP_LOG`	`true`	Write JSON request logs to stderr
`MCP_LOG_FILE`	(none)	Optional path for a persistent log file
`GRAMENE_API_BASE`	`https://data.gramene.org/v69`	Gramene REST API base URL. Override to point at a species-focused stack.

Starting the server

# Default — talks to data.gramene.org/v69
npm start

# Development mode (auto-reloads on file changes)
npm run dev

# Point at a different release or species-focused stack
GRAMENE_API_BASE=https://data.gramene.org/v70 npm start

The server listens for MCP JSON-RPC requests at POST http://<MCP_HOST>:<MCP_PORT>/mcp.

Connecting to MCP clients

The server speaks two transports:

stdio — the client launches node server.mjs --stdio as a child process and communicates over stdin/stdout. No port, no CORS, no HTTP server. Easiest for single-user setups on the same machine.
HTTP — MCP 2025-03-26 Streamable HTTP at POST /mcp. Needed when the server runs on a different machine, when multiple clients share one instance, or when the client only supports HTTP.

The same server.mjs handles both — pass --stdio (or set MCP_STDIO=1) to switch into stdio mode.

Claude Desktop

Edit claude_desktop_config.json (macOS: ~/Library/Application Support/Claude/claude_desktop_config.json):

stdio (recommended for local use):

{
  "mcpServers": {
    "gramene": {
      "command": "node",
      "args": ["/absolute/path/to/gramene-mcp/server.mjs", "--stdio"]
    }
  }
}

HTTP:

{
  "mcpServers": {
    "gramene": {
      "url": "http://127.0.0.1:8787/mcp"
    }
  }
}

Restart Claude Desktop after saving.

Claude Code / Cowork

stdio:

claude mcp add gramene -- node /absolute/path/to/gramene-mcp/server.mjs --stdio

HTTP:

claude mcp add gramene --url http://127.0.0.1:8787/mcp

VS Code (GitHub Copilot)

Create or edit .vscode/mcp.json in your workspace (or add to user settings under mcp.servers for a global entry):

{
  "servers": {
    "gramene": {
      "type": "http",
      "url": "http://127.0.0.1:8787/mcp"
    }
  }
}

Open the Chat view, select Agent mode, and the gramene tools will appear in the tools picker.

Cursor

Create or edit ~/.cursor/mcp.json (global) or .cursor/mcp.json in your project root:

{
  "mcpServers": {
    "gramene": {
      "url": "http://127.0.0.1:8787/mcp"
    }
  }
}

Open Cursor Settings → MCP to verify the server is listed and active.

Zed

Open your Zed settings.json (zed: open settings) and add a context_servers entry:

{
  "context_servers": {
    "gramene": {
      "url": "http://127.0.0.1:8787/mcp"
    }
  }
}

Zed supports the MCP HTTP transport natively. If your server requires authentication, add a headers map:

{
  "context_servers": {
    "gramene": {
      "url": "http://127.0.0.1:8787/mcp",
      "headers": {
        "Authorization": "Bearer <token>"
      }
    }
  }
}

OpenAI Codex

Edit ~/.codex/config.toml (global) or .codex/config.toml in a trusted project:

[mcp_servers.gramene]
url = "http://127.0.0.1:8787/mcp"

Or add it from the CLI:

codex mcp add gramene --url http://127.0.0.1:8787/mcp

Both the Codex CLI and the Codex IDE extension share this config file, so you only need to set it up once.

Continue.dev

Add to .continue/config.yaml (or the JSON equivalent) in your project or home directory:

mcpServers:
  - name: gramene
    transport:
      type: http
      url: http://127.0.0.1:8787/mcp

See the Continue MCP docs for authentication options and tool filtering.

Any MCP-compliant client

The server endpoint is:

POST http://<host>:<port>/mcp
Content-Type: application/json

It accepts standard MCP JSON-RPC 2.0 messages (initialize, tools/list, tools/call, prompts/list, prompts/get). Sessions are tracked via the Mcp-Session-Id header returned on initialize — echo it back on subsequent requests if your client supports session continuity.

If the server is on a remote host, make sure MCP_ALLOWED_ORIGINS is set to allow your client's origin, or set it to * for unrestricted access.

Tools

The server exposes 12 tools. They map onto two API endpoint families: /search and /suggest (Solr-backed) and /<collection> (MongoDB-backed).

Tool	Description
`solr_search`	Full Solr query against `/search` — field lists, filters, facets, sorting, pagination. Use for single-gene cards (`q="id:…"`) and to produce facet-count arrays consumed by the client-side enrichment skill
`solr_suggest`	Calls `/suggest`. Entry point for free-text concepts (gene name, family, pathway, species, ontology, trait) → `fq_field`/`fq_value`. Always start here before `mongo_find`. Returns the Solr grouped envelope (`grouped.category.groups[].doclist.docs[]`).
`solr_search_bool`	Structured AND/OR/NOT boolean queries compiled into `/search` `fq` clauses
`solr_graph`	Single-hop `{!graph}` traversal via `/search` (e.g. genomic neighbourhoods through `compara_neighbors_10`). Multi-hop relationships are expressed by chaining two queries
`genes_in_region`	Find all genes overlapping a chromosomal interval. `taxon_id` is the plain NCBI ID
`expression_for_genes`	Baseline (TPM/FPKM) and differential (log₂FC) expression by tissue and condition. Joins gene expression fields with `/experiments` and `/assays`.
`vep_for_gene`	Germplasm accessions carrying predicted loss-of-function alleles (Ensembl VEP), grouped by consequence, zygosity, and study. Joins with `/germplasm`.
`pubmed_for_genes`	PubMed and DOI cross-references for a set of genes (returns IDs only — pipe to a PubMed-focused MCP for bibliographic detail)
`mongo_find`	Detail lookup by ID against `/<collection>`. Supports `{_id: x}`, `{_id: {$in: […]}}`, and top-level field-equality filters with optional `$in`. Operators like `$gt`/`$lt`/`$regex`/`$where` are NOT supported by the public API — use `solr_search` for those.
`mongo_lookup_by_ids`	Batch-resolve numeric ontology term IDs (or string IDs in other collections) to documents
`mongo_list_collections`	List the fixed set of collections served by the API
`kb_relations`	Return the search-index ↔ collections field crosswalk (schema documentation)

Enrichment / overrepresentation analysis is intentionally not an MCP tool. Build foreground and background facet-count arrays via solr_search (with facet.field on GO__ancestors, PO__ancestors, TO__ancestors, pathways__ancestors, or domains__ancestors) and pass them to a client-side enrichment skill that operates on (ontology, foreground array, background array).

Workflow prompts

The server also exposes workflow prompts that Claude loads on demand to guide multi-step research tasks:

base — Role definition, query routing, species reference table, and critical conventions
gene_lookup — Search by gene/protein name or molecular function
pathway_genes — List genes in a Plant Reactome pathway for a given species
qtl_discovery — Resolve a free-text trait to a TO term and list matching QTLs
qtl_candidate_ranking — Full pipeline for ranking candidate genes within a QTL interval
cross_species_comparison — Compare a gene across orthologs in multiple species
orthologs_paralogs — Field reference for ortholog / paralog / homolog queries
gene_family — Explore a gene family across species
gene_family_expansion — Per-clade copy counts via gene_tree × taxonomy__ancestors faceting
germplasm_lof — Find EMS and natural-diversity knockout lines for target genes
pav_cnv — Analyse presence/absence and copy-number variation via facets or chained graph traversal
literature_search — Collect PubMed/DOI cross-references for a gene (and its orthologs) to hand off to a PubMed-focused MCP

Data model

The public REST API exposes two endpoint families:

/search (Solr-backed gene documents) — one document per gene, across 30+ plant species. Key field groups:

Gene identity: id, name, description, biotype, taxon_id, region, start, end, strand
Species filtering: taxonomy__ancestors (plain NCBI taxon IDs at every rank — preferred)
Ontology ancestors: GO__ancestors, PO__ancestors, TO__ancestors, pathways__ancestors, domains__ancestors
Comparative genomics: gene_tree, homology__all_orthologs, homology__ortholog_one2one/one2many/many2many, homology__within_species_paralog, compara_neighbors_N, compara_idx_multi (PAV/CNV)
Expression linkage: expressed_in_gxa_attr_ss (joins to the /experiments and /assays collection endpoints)
Loss-of-function: VEP__{consequence}__{zygosity}__{species}__{study}__attr_ss, VEP__merged__EMS/NAT__attr_ss
Literature: PUBMED__xrefs

/<collection> (annotation documents) — used for enrichment and metadata lookups:

Collection	Contents
`genes`	Gene-level metadata
`genetrees`	Gene family / homology trees
`taxonomy`	NCBI taxonomy nodes
`GO`, `PO`, `TO`	Ontology term documents
`domains`, `pathways`	InterPro domains and Plant Reactome pathways
`assays`, `experiments`, `expression`	Expression study metadata and values
`qtls`	QTL records with Trait Ontology annotations
`germplasm`	Accession metadata: `pub_id`, `stock_center`, `subpopulation`, genebank URL
`maps`	Genome assembly metadata (`in_compara` flag)

The collection endpoints accept idList, q (text search), and top-level field-equality filters. They do not support range, regex, or $where-style operators — use solr_search for those.

Development

# Auto-reload on file changes
npm run dev

# Run the integration test suite (hits data.gramene.org)
npm test

The default test runner targets a server you start separately on http://127.0.0.1:8787/mcp; override with MCP_URL if you want to point at a different instance.

Conventions

A few quirks to be aware of when using the tools directly:

Gene IDs must be full stable IDs (e.g. SORBI_3006G095600), never abbreviated.
Taxon IDs — filter with taxonomy__ancestors:<plain NCBI id> (e.g. 4558 for sorghum). The taxon_id field on individual gene documents uses NCBI×1000+suffix (e.g. 4558001); avoid filtering on it directly. The genes_in_region taxon_id parameter takes the plain NCBI ID and applies it as taxonomy__ancestors.
solr_graph is single-hop by design; depth is hard-coded server-side. Multi-hop relationships are expressed by chaining two graph queries (e.g. orthologs → neighbors).
Discovery vs detail. Always start free-text discovery with solr_suggest. Reserve mongo_find for fetching detail records once you have a specific ID.
Homology field choice. Default to homology__all_orthologs. Use homology__ortholog_one2one only for tight pairs (e.g. sorghum ↔ rice); maize is a paleopolyploid, and monocot ↔ Arabidopsis is too distant for stable 1:1 mappings.
Species suggestions work best with an exact-name query (q: 'name:"Sorghum bicolor"') rather than a fuzzy term: lookup. The same applies to pathway and Trait Ontology lookups (fq: ['category:Trait Ontology']).
Expression data is richest for sorghum; VEP loss-of-function data covers sorghum, maize, and several rice genomes.

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
skills/qtl-candidate-analysis		skills/qtl-candidate-analysis
test		test
.gitignore		.gitignore
AGENT_PROMPT.md		AGENT_PROMPT.md
AGENT_PROMPT_v2.md		AGENT_PROMPT_v2.md
AGENT_PROMPT_v3.md		AGENT_PROMPT_v3.md
README.md		README.md
ecosystem.config.cjs		ecosystem.config.cjs
enrichment_browser_example.html		enrichment_browser_example.html
package.json		package.json
server.mjs		server.mjs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

gramene-mcp

Requirements

Installation

Configuration

Starting the server

Connecting to MCP clients

Claude Desktop

Claude Code / Cowork

VS Code (GitHub Copilot)

Cursor

Zed

OpenAI Codex

Continue.dev

Any MCP-compliant client

Tools

Workflow prompts

Data model

Development

Conventions

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

gramene-mcp

Requirements

Installation

Configuration

Starting the server

Connecting to MCP clients

Claude Desktop

Claude Code / Cowork

VS Code (GitHub Copilot)

Cursor

Zed

OpenAI Codex

Continue.dev

Any MCP-compliant client

Tools

Workflow prompts

Data model

Development

Conventions

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages