mcp-rag-server (RAG MCP server for any repository)

mcp-rag-server is a lightweight Retrieval‑Augmented Generation helper you can plug into any client that speaks the [Model Context Protocol (MCP)]. GitHub Copilot Agent mode in Visual Studio / VS Code is just one option – you can also use the official MCP Inspector, future MCP‑aware IDEs, or custom tooling.

It indexes a target repository directory, chunks the content (default chunk size 2400 characters, about 800 tokens, with 400 characters of overlap, about 120 tokens; both configurable via CHUNK_SIZE / CHUNK_OVERLAP), builds embeddings using either local inference via @huggingface/transformers or an OpenAI‑compatible embeddings API, and exposes MCP tools:

rag_query – semantic search returning scored snippets (path, score, snippet)
read_file – secure file read (optional line range) constrained to REPO_ROOT. For PDF files, text is automatically retrieved from the unified cache file if available
list_files – list directory contents (files & subdirectories) with optional recursion, depth and extension filtering

Two transports are supported (select with MCP_TRANSPORT=stdio|http):

stdio – simplest integration for IDEs that spawn a process (backward compatible default)
http (Streamable HTTP) – recommended for large repos / first run so you can watch logs & poll readiness before attaching a client. Enable via MCP_TRANSPORT=http. Includes DNS rebinding protection by default.

Features

Embeddings via local inference (@huggingface/transformers) or an OpenAI‑compatible API
Multi‑language source + docs support (configurable via ALLOWED_EXT)
PDF support: Automatically extracts text from PDF files during indexing and caches it in a unified pdf-text-cache.json file (located alongside the index store) for fast retrieval. PDF text is treated like any other text file for semantic search
Excluded folder patterns support (configurable via EXCLUDED_FOLDERS)
Fast glob file discovery and overlapping chunking for better recall
Simple cosine similarity ranking (optionally swap to ANN later)
Pluggable model selection via MODEL_NAME (see guidance below)
Optional persistent index (multi-file storage) + warm start & incremental reindexing via INDEX_STORE_PATH
Incremental change detection (additions / deletions / file size changes) to avoid full rebuilds
Stdio or Streamable HTTP transport (with optional host allow‑list / DNS rebinding protection)
Safe path handling (rejects attempts to escape REPO_ROOT)
Minimal dependencies; quick startup after first local model load or remote API configuration validation
Ready for extension: add new MCP tools or ANN / hybrid retrieval backends

Planned / Nice‑to‑have: hybrid BM25 + embedding search, ANN acceleration (HNSW / IVF), per‑language tokenizer heuristics, batched / parallel embedding, semantic boundary aware chunking.

Requirements

Node.js 20+
Path to your repository (REPO_ROOT)

Optional MCP clients (any one is enough):

The official MCP Inspector
Visual Studio 2022 17.14+ with GitHub Copilot (Agent mode enabled)
VS Code with GitHub Copilot Agent mode
Any other MCP-aware tooling

Install

npm install
npm run build

Run (local test)

Build then start (stdio transport by default). Use either npm start or invoke the built file directly.

Windows PowerShell

npm run build
$env:REPO_ROOT="C:\path\to\your-repo"; node dist/index.js

Or:

$env:REPO_ROOT="C:\path\to\your-repo"; npm start

macOS / Linux (bash/zsh)

npm run build
export REPO_ROOT="/path/to/your-repo"; node dist/index.js

Or:

export REPO_ROOT="/path/to/your-repo"; npm start

Optionally set a model cache to speed up subsequent runs (first start downloads the model once):

export TRANSFORMERS_CACHE="/path/to/cache"   # macOS/Linux

$env:TRANSFORMERS_CACHE="C:\path\to\cache" # Windows PowerShell

OpenAI-compatible API embeddings

Set EMBEDDING_PROVIDER=openai to call a remote /embeddings endpoint instead of loading a local transformer model. The request format follows the OpenAI embeddings API and works with providers that expose a compatible protocol such as OpenAI, Mistral, and Jina AI.

Windows PowerShell:

$env:REPO_ROOT="C:\path\to\your-repo"
$env:EMBEDDING_PROVIDER="openai"
$env:EMBEDDING_API_BASE_URL="https://api.openai.com/v1"
$env:EMBEDDING_API_KEY="<your-api-key>"
$env:MODEL_NAME="text-embedding-3-small"
npm start

macOS / Linux:

export REPO_ROOT="/path/to/your-repo"
export EMBEDDING_PROVIDER="openai"
export EMBEDDING_API_BASE_URL="https://api.openai.com/v1"
export EMBEDDING_API_KEY="<your-api-key>"
export MODEL_NAME="text-embedding-3-small"
npm start

Notes:

EMBEDDING_API_BASE_URL should point to the provider's API base (for example https://api.openai.com/v1), not the /embeddings path itself.
MODEL_NAME is passed verbatim to the remote embeddings API when EMBEDDING_PROVIDER=openai.
EMBEDDING_API_BATCH_SIZE controls how many chunks are sent per remote embeddings request during indexing. Default: 200.
TRANSFORMERS_CACHE is only relevant for local inference.

Streamable HTTP mode (recommended for large initial indexes)

Run the MCP server as an HTTP endpoint and only open your IDE after Embeddings ready. shows (avoids client timeouts on cold start):

npm run build
$env:REPO_ROOT="C:\path\to\your-repo"; $env:MCP_TRANSPORT="http"; npm start

export REPO_ROOT="/path/to/your-repo"; MCP_TRANSPORT=http npm start

Default HTTP bind: http://127.0.0.1:3000/mcp. Override with HOST and MCP_PORT envs. A readiness endpoint is available at http://127.0.0.1:3000/health returning JSON like:

{
	"version": "0.x.y",
	"repoRoot": "C:/abs/path",
	"modelName": "<embedding model>",
	"transport": "stdio" | "http",
	"ready": true | false,
	"startedAt": "2025-01-01T00:00:00.000Z",
	"indexing": {
		"filesDiscovered": 123,
		"chunksTotal": 456,
		"chunksEmbedded": 456
	}
}

ready flips to true only once all discovered chunks have embeddings (post cold build or incremental update completion).

Instructions endpoint

The server also exposes GET /instructions, which serves the Markdown file docs/copilot-instructions.md with all occurrences of <FOLDER_INFO_NAME> replaced by the FOLDER_INFO_NAME value from your environment (default REPO_ROOT).

Notes:

Start the server from the repository root so docs/copilot-instructions.md resolves via the current working directory.
Response content type is text/markdown; charset=utf-8.

Linting & Formatting

Type-check (no emit): npm run typecheck
Run ESLint (check): npm run lint
Auto-fix ESLint issues: npm run lint:fix
Format with Prettier: npm run format
Check formatting: npm run format:check

Test with MCP Inspector (without VS)

Use the MCP Inspector to exercise the server locally and try the tools without Visual Studio.

Windows PowerShell:

npm run build
$env:REPO_ROOT="C:\path\to\your-repo"; npx @modelcontextprotocol/inspector node .\\dist\\index.js

Streamable HTTP via Inspector (Windows):

npm run build
$env:REPO_ROOT="C:\path\to\your-repo"; $env:MCP_TRANSPORT="http"; npx @modelcontextprotocol/inspector http://localhost:3000/mcp --transport http

macOS/Linux (bash/zsh):

export REPO_ROOT="/path/to/your-repo"
npx @modelcontextprotocol/inspector node dist/index.js

Streamable HTTP (macOS/Linux):

export REPO_ROOT="/path/to/your-repo"; MCP_TRANSPORT=http npx @modelcontextprotocol/inspector http://localhost:3000/mcp --transport http

Notes:

First run in local mode downloads the embedding model and builds embeddings; the Inspector will connect only after startup completes. Watch the terminal for progress logs printed to stderr.
You can also put settings in a .env file at the project root (e.g., REPO_ROOT, TRANSFORMERS_CACHE, EMBEDDING_PROVIDER, EMBEDDING_API_BASE_URL).

In the Inspector UI:

Click "List tools" to verify these tools are available: rag_query, read_file, list_files.
Select a tool and click "Call tool". Provide JSON input as shown below.

Examples

Semantic search over the repo

Tool: rag_query
Input JSON:
{
	"query": "protobuf message X schema",
	"top_k": 5
}

The response includes an array of matches with path, score, and snippet.

List files in a directory (non-recursive by default)

Tool: list_files
Input JSON:
{
	"dir": "src",
	"recursive": false
}

Recursive with filters and limits:

Tool: list_files
Input JSON:
{
	"dir": "src",
	"recursive": true,
	"maxDepth": 3,
	"includeExtensions": ["ts", "md"],
	"limit": 200
}

Response shape:

{
	"entries": [
		{ "path": "src/", "type": "dir" },
		{ "path": "src/index.ts", "type": "file", "size": 1234 },
		{ "path": "src/lib/", "type": "dir" }
	]
}

Read a file (optionally with a line range)

Tool: read_file
Input JSON:
{
	"path": "src/path/to/file.txt",  // relative to REPO_ROOT
	"startLine": 1,
	"endLine": 120
}

Troubleshooting

Slow startup: set TRANSFORMERS_CACHE to a fast local folder and (optionally) set ALLOWED_EXT (e.g., ts,tsx,js for TypeScript/JS only, or any list you need).
Path errors: path must be relative to REPO_ROOT. Absolute paths are rejected for safety.
Nothing appears in Inspector for minutes: the server is still initializing (model download + embedding). This is expected on first run.
Slow warm restarts: provide INDEX_STORE_PATH so embeddings persist and only changed files re‑embed.

Environment configuration (.env)

You can configure environment variables via a local .env file.

Steps:

Copy .env.example to .env.
Edit values as needed.

Supported variables:

REPO_ROOT (required): path to the repository to index.
FOLDER_INFO_NAME (optional): display label used inside MCP tool descriptions for the repository root (default REPO_ROOT). This is purely cosmetic for client UX; it does NOT affect which directory is indexed (that is controlled only by REPO_ROOT). Set it if you prefer a friendlier name (e.g., frontend-app or monorepo-root) to appear in tool metadata and path guidance returned to the client.
EMBEDDING_PROVIDER (optional): local (default) or openai. openai means “use an OpenAI-compatible /embeddings API”, not specifically OpenAI as the vendor.
TRANSFORMERS_CACHE (optional): cache folder for local model files.
EMBEDDING_API_BASE_URL (required when EMBEDDING_PROVIDER=openai): base URL for the OpenAI-compatible API, such as https://api.openai.com/v1, https://api.mistral.ai/v1, or your provider-specific equivalent.
EMBEDDING_API_KEY (required when EMBEDDING_PROVIDER=openai): bearer token used for the embeddings API.
EMBEDDING_API_BATCH_SIZE (optional when EMBEDDING_PROVIDER=openai): number of chunks sent per remote embeddings request during indexing. Default: 200.
ALLOWED_EXT (optional): comma-separated list of file extensions to index. Default includes common text/code formats plus pdf. PDF files are automatically processed: text is extracted once during indexing and cached in a unified pdf-text-cache.json file for fast retrieval.
EXCLUDED_FOLDERS (optional): comma-separated list of folder patterns to exclude from indexing. Supports both exact folder names (e.g., node_modules,dist,build,.git) and basic glob patterns (e.g., **/test/**,**/tests/**). Files in these folders will be skipped during indexing. Defaults include common build/dependency folders: node_modules, dist, build, .git, target, bin, obj, .cache, coverage, .nyc_output.
MCP_TRANSPORT (optional): http or stdio.
VERBOSE (optional): true/1/yes/on for more granular progress logs during indexing & embedding.
INDEX_STORE_PATH (optional): base path for persisted embedding index storage (e.g., C:\repo\.mcp-index or /repo/.mcp-index). The index is stored as multiple JSON files with this prefix (e.g., .mcp-index.part0000.json, .mcp-index.part0001.json, etc.) along with a manifest file (.mcp-index.manifest.json) that tracks metadata, compatibility parameters, and the list of data files. Enables fast warm starts + incremental reindex (new / deleted / size‑changed files only).
MODEL_NAME (optional): override the default embedding model (jinaai/jina-embeddings-v2-base-code). For EMBEDDING_PROVIDER=local, this must be a model supported by @huggingface/transformers. For EMBEDDING_PROVIDER=openai, this is passed directly to the remote /embeddings API. Examples:
- MODEL_NAME=jinaai/jina-embeddings-v2-base-code (default) — Balanced multilingual/code embedding model; strong for mixed natural language + source code semantic search.
- MODEL_NAME=Xenova/bge-base-en-v1.5 — High-quality English general-purpose text embeddings (good for documentation/wiki style corpora).
- MODEL_NAME=Xenova/bge-small-en-v1.5 — Faster/lighter English model when latency or memory matters more than a few points of recall. Any compatible sentence / feature-extraction model supported by @huggingface/transformers should work for local mode.
HOST (optional, HTTP mode): bind host (default 127.0.0.1).
MCP_PORT (optional, HTTP mode): TCP port (default 3000).
ENABLE_DNS_REBINDING_PROTECTION (optional, HTTP mode): defaults to true; set to false to disable host allow‑list checks.
ALLOWED_HOSTS (optional, HTTP mode): comma-separated list of hosts allowed when DNS rebinding protection is enabled. Defaults include localhost and 127.0.0.1 with/without port.
CHUNK_SIZE (optional): maximum characters per chunk before embedding (default 2400, roughly 800 tokens depending on tokenizer/model). Larger values reduce total embeddings (faster build, less memory) but can blur fine-grained matches. Typical ranges:
- 2100‑2700 (roughly 700‑900 tokens; balanced default)
- 3000‑4200 (roughly 1000‑1400 tokens; large prose / long functions; fewer vectors)
- 1200‑1800 (roughly 400‑600 tokens; fine‑grained code navigation; more vectors / memory)
CHUNK_OVERLAP (optional): trailing characters carried into the next chunk (default 400, roughly 120 tokens or about 15% of the default chunk size). Recommended 10‑20% of CHUNK_SIZE (for example 240‑540 when CHUNK_SIZE=2400). Increase slightly (up to ~20‑25%) if you observe answers missing cross‑boundary context; decrease to speed up builds.

Safety caps: CHUNK_SIZE is clamped to 8000 and CHUNK_OVERLAP to 4000; if overlap >= size it's automatically reduced (logged) to preserve forward progress.

DOCS_PER_FILE (optional): maximum number of documents to store in a single JSON file when persisting the index (default 10000). This prevents JSON.stringify from creating excessively large strings that could cause memory issues. Lower values create more files but reduce memory pressure during save/load operations. Minimum value is 100.

Persistence & Incremental Reindexing

Set INDEX_STORE_PATH to enable a persisted index storing chunks + embeddings across multiple JSON files. On startup:

If the manifest file exists (<INDEX_STORE_PATH>.manifest.json) and its metadata (model name, chunk size, overlap) matches, the index is loaded from the data files referenced in the manifest.
The repository is rescanned; removed files' chunks are discarded, and new or size‑changed files are re‑chunked & re‑embedded.
The merged index is saved back to disk (cold build path also persists when configured).

The manifest file contains metadata about the index (version, chunk parameters, model name, timestamp) and a list of all data files (<INDEX_STORE_PATH>.part####.json) that comprise the full index.

Benefits:

Dramatically faster warm starts for large repositories.
Avoids re‑embedding unchanged content.

Current limitations:

Change detection uses file size only (content edits keeping identical size won't re‑embed yet).
Embedding generation is not parallelized yet; OpenAI‑compatible providers support request batching, but local embeddings still run one chunk at a time.
Store schema is minimal (version 1); future versions may add hashing or mtime heuristics.

Force a full rebuild by deleting the manifest file (<INDEX_STORE_PATH>.manifest.json) and data files (<INDEX_STORE_PATH>.part*.json) or changing chunk/model/provider parameters.

Visual Studio integration (MCP)

Copy example.mcp.json to:

%USERPROFILE%.mcp.json (Windows), or
your solution root as .mcp.json (recommended for teams)

Adjust the paths in "command"/"args" and the REPO_ROOT env.

For Streamable HTTP, use a config entry like:

{
  "servers": {
    "mcp-rag-server": {
      "url": "http://127.0.0.1:3000/mcp"
    }
  }
}

Open VS -> Copilot Chat -> switch to Agent mode -> enable the "mcp-rag-server" and its tools (you will be asked to grant permission on first use). If using HTTP transport, ensure the server has finished indexing (check /health).

Usage in Agent mode

Sample prompt: "Modify the C# handler for message X. Before you start, use the tool rag_query with the query 'message X schema' and take the found contracts into account. If you get file paths back, read them via read_file."

Notes

First run in local mode will download and cache the model (tens to ~100 MB) and build embeddings — this may take minutes depending on repo size.
Logs are written to stderr (console.error) to keep MCP stdout clean.
For very large repos, consider adding an ANN index (hnswlib-node) or a hybrid BM25+embeddings setup.

Model selection guidance

Choose an embedding setup based on your repository characteristics. There are two orthogonal choices: provider and model.

Provider (EMBEDDING_PROVIDER):

local (default): fully local workflow after the initial model download via @huggingface/transformers. No outbound network calls at query time.
openai: hosted embeddings via any OpenAI-compatible /embeddings API (OpenAI, Mistral, Jina AI, etc.). Use for centralized credentials or larger managed models.

Model (MODEL_NAME):

jinaai/jina-embeddings-v2-base-code (local default): balanced multilingual/code embedding model. Use when your corpus contains a meaningful amount of source code mixed with README / design docs. Strong cross-domain alignment for code-symbol + natural language queries.
Xenova/bge-base-en-v1.5 (local): high-quality English general-purpose text embeddings. Use when content is predominantly English natural language (docs, knowledge base).
Xenova/bge-small-en-v1.5 (local): faster/lighter English model. Use for lower memory or when indexing very large repos where throughput matters more than a few points of recall.
text-embedding-3-small (openai, preferred): fast, cost-efficient hosted model with strong general-purpose quality. Good default when EMBEDDING_PROVIDER=openai.
text-embedding-3-large (openai): higher-dimensional model for maximum recall quality; higher cost and latency than text-embedding-3-small.

Feel free to experiment—swap via MODEL_NAME and rebuild the embedding cache (delete any existing cached vectors if you persist them externally). Changing EMBEDDING_PROVIDER or MODEL_NAME invalidates the persisted index on purpose.

Chunk sizing guidance

Why ~800 / 120 tokens? The implementation chunks by characters, so the defaults are 2400 / 400 characters, which is roughly 800 / 120 tokens for typical code and prose. Empirically this keeps most self‑contained code constructs (functions/classes) and short doc sections in a single chunk while providing enough continuity for cross‑block semantic matches. Adjust based on corpus:

Mostly short functions or config files: smaller chunks (500‑700) aid pinpoint retrieval.
Large narrative docs / design specs: larger chunks (1000‑1400) reduce vector count without much recall loss.
Heavily interdependent code where context spans multiple files: keep default or modestly raise overlap (to ~500‑600 characters) rather than shrinking size.

Rule of thumb: Overlap ≈ 15% of size. Avoid overlap >= size (auto‑corrected) and avoid extremely small sizes (<300) unless you have a downstream re‑ranking stage.

Step‑by‑step: Index a Java repo (ProjectB in IntelliJ) and use it from C# (ProjectA in Visual Studio 2022)

This walkthrough shows how to index a Java project (ProjectB) and make that knowledge available to GitHub Copilot (Agent mode) while you work in a separate C# solution (ProjectA) in Visual Studio 2022.

Assumptions

You’re on Windows and use PowerShell.
ProjectB is a Java codebase you typically open in IntelliJ IDEA (location: C:\path\to\ProjectB). IntelliJ does not need to be open for indexing.
ProjectA is a C# solution you open in Visual Studio 2022 (location: C:\path\to\ProjectA).

1) Build this MCP server

npm install
npm run build

2) Start the server in HTTP mode pointing at ProjectB

Set environment variables once in your PowerShell session, then start. The optional index store speeds up warm starts.

$env:REPO_ROOT = "C:\path\to\ProjectB"
$env:MCP_TRANSPORT = "http"
$env:INDEX_STORE_PATH = "C:\path\to\ProjectB\.mcp-index"   # optional but recommended
$env:ALLOWED_EXT = "java,kt,kts,md,xml,gradle,properties"           # tailor for Java projects
# Optional: cache model files to a fast local folder
# $env:TRANSFORMERS_CACHE = "C:\model-cache"

npm start

Wait until the console prints “Embeddings ready.” You can also confirm readiness:

Health: http://127.0.0.1:3000/health (ready: true)
Tools are exposed at: http://127.0.0.1:3000/mcp (for MCP clients)

Leave this window running.

3) Point Visual Studio (ProjectA) at this server

Create a .mcp.json next to your ProjectA solution file (or place it at %USERPROFILE%\.mcp.json to apply globally). Use the HTTP entry so VS doesn’t need to spawn the server.

{
  "servers": {
    "mcp-rag-server": {
      "url": "http://127.0.0.1:3000/mcp"
    }
  }
}

Open ProjectA in Visual Studio 2022, open Copilot Chat, switch to Agent mode, and enable the "mcp-rag-server". Grant permissions if prompted.

Tips

If this is your first run on a large repo, keep the MCP server window open until indexing completes before connecting from VS. Using HTTP avoids timeouts during the cold build.
For subsequent runs, the INDEX_STORE_PATH makes startup much faster.

4) Use it from Copilot while coding in ProjectA

Ask Copilot to search ProjectB before answering questions or generating code in ProjectA. Example prompts:

“Use the tool rag_query to find the Java service responsible for authentication in ProjectB; then show me the equivalent interface I should implement in C# here.”
“List files under src/main/java that reference ‘Invoice’ in ProjectB, then open the key file.”

Behind the scenes, Copilot will call:

rag_query – to locate relevant snippets from ProjectB
read_file – to fetch exact code/lines
list_files – to navigate directories

5) (Optional) Use MCP Inspector to sanity‑check

If you want to test the tools before involving Visual Studio:

# In a separate PowerShell
$env:REPO_ROOT = "C:\path\to\ProjectB"
$env:MCP_TRANSPORT = "http"
npm run build
npx @modelcontextprotocol/inspector http://127.0.0.1:3000/mcp --transport http

How to use `docs/copilot-instructions.md`

The file docs/copilot-instructions.md contains clear, copy‑pastable guidance that teaches the assistant how to leverage this MCP server effectively (when to call rag_query, read_file, list_files, how to quote code, etc.).

There are two easy ways to use it:

Via the server’s /instructions endpoint (best with HTTP mode)

Ensure the server is running with MCP_TRANSPORT=http.
Optionally set a friendly label for your repo in the UI:
```
$env:FOLDER_INFO_NAME = "ProjectB"
```
Open http://127.0.0.1:3000/instructions in a browser. The page renders the instructions with <FOLDER_INFO_NAME> replaced (e.g., “ProjectB”).
Copy the content into Copilot Chat in Visual Studio and pin it for the current session/conversation to guide the assistant’s behavior.

Sync and store in ProjectA’s .github folder (from /instructions)

Ensure the server is running with MCP_TRANSPORT=http and set a friendly label:
```
$env:FOLDER_INFO_NAME = "ProjectB"
```
Create (if not exists) C:\path\to\ProjectA\.github\.

Pull the latest rendered instructions and save them to the repo:

$dest = "C:\path\to\ProjectA\.github\copilot-instructions.md"
Invoke-RestMethod 'http://127.0.0.1:3000/instructions' | Set-Content -Encoding UTF8 $dest

Commit the file so your team can reuse it. Re-run the command above anytime you update the instructions in this server and want to refresh the checked-in copy.

Notes

These instructions are optional but help keep Copilot disciplined: it will search before answering, cite paths, and fetch exact code lines before quoting.
The server’s tool descriptions also reference FOLDER_INFO_NAME to provide consistent, repo‑specific guidance in tool metadata.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.vscode		.vscode
docs		docs
src		src
.editorconfig		.editorconfig
.env.example		.env.example
.gitignore		.gitignore
.prettierignore		.prettierignore
.prettierrc.json		.prettierrc.json
LICENSE		LICENSE
PDF_SUPPORT.md		PDF_SUPPORT.md
README.md		README.md
eslint.config.mjs		eslint.config.mjs
example.mcp.json		example.mcp.json
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

mcp-rag-server (RAG MCP server for any repository)

Features

Requirements

Install

Run (local test)

Windows PowerShell

macOS / Linux (bash/zsh)

OpenAI-compatible API embeddings

Streamable HTTP mode (recommended for large initial indexes)

Instructions endpoint

Linting & Formatting

Test with MCP Inspector (without VS)

Environment configuration (.env)

Persistence & Incremental Reindexing

Visual Studio integration (MCP)

Usage in Agent mode

Notes

Model selection guidance

Chunk sizing guidance

Step‑by‑step: Index a Java repo (ProjectB in IntelliJ) and use it from C# (ProjectA in Visual Studio 2022)

1) Build this MCP server

2) Start the server in HTTP mode pointing at ProjectB

3) Point Visual Studio (ProjectA) at this server

4) Use it from Copilot while coding in ProjectA

5) (Optional) Use MCP Inspector to sanity‑check

How to use `docs/copilot-instructions.md`

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

mcp-rag-server (RAG MCP server for any repository)

Features

Requirements

Install

Run (local test)

Windows PowerShell

macOS / Linux (bash/zsh)

OpenAI-compatible API embeddings

Streamable HTTP mode (recommended for large initial indexes)

Instructions endpoint

Linting & Formatting

Test with MCP Inspector (without VS)

Environment configuration (.env)

Persistence & Incremental Reindexing

Visual Studio integration (MCP)

Usage in Agent mode

Notes

Model selection guidance

Chunk sizing guidance

Step‑by‑step: Index a Java repo (ProjectB in IntelliJ) and use it from C# (ProjectA in Visual Studio 2022)

1) Build this MCP server

2) Start the server in HTTP mode pointing at ProjectB

3) Point Visual Studio (ProjectA) at this server

4) Use it from Copilot while coding in ProjectA

5) (Optional) Use MCP Inspector to sanity‑check

How to use docs/copilot-instructions.md

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

How to use `docs/copilot-instructions.md`

Packages