MCP Server for Technical Docs

1 · Recipe Purpose

Goal	Outcome
AI-Native Documentation Discovery	Technical documentation becomes discoverable and actionable by AI agents through natural language queries.
Hybrid RAG Implementation	Combines keyword search, vector embeddings, and graph relationships for comprehensive content retrieval.
Real-time Content Synchronization	Heretto CCMS content automatically syncs and indexes for immediate agent availability.
Scalable Knowledge Architecture	Structured DITA content transforms into a queryable knowledge graph supporting multiple retrieval patterns.

Success Criteria

AI agents can discover relevant documentation with >90% accuracy
Search responses delivered in ≤200ms p95 latency
Automatic content synchronization from Heretto within 5 minutes
Support for Vector-RAG, Graph-RAG, and Tool-RAG patterns
Live configuration tuning without system redeployment

This recipe defines an end-to-end architecture for making structured technical documentation—originally authored in Heretto CCMS — discoverable, contextual, and actionable by AI agents like ChatGPT, Perplexity, or custom assistants. It combines the strengths of Conscia's Core Services, DX Graph, Hybrid Search, and DX Engine orchestration, all exposed through a unified Model Context Protocol (MCP) Server. The goal is to implement a real-time, agent-native retrieval flow—where content is indexed using both keyword and vector search, enriched at query time with graph relationships, ranked using business rules (and optionally LLMs), and delivered to agents in a structured, consumable format.

At the heart of this approach is the recognition that structured content is a prerequisite for intelligent, AI-driven retrieval. Formats like DITA XML are not just publishing frameworks—they encode meaning, relationships, topic types, and intent. That structure gives your content machine-understandable semantics: enabling RAG (Retrieval-Augmented Generation) techniques to identify whether a topic is a task, concept, or reference; to resolve prerequisites; and to follow RELATED or CONCEPT_OF links—all without training a model on your domain from scratch.

Why export content into DX Graph?

Exporting Heretto’s structured DITA topics into DX Graph turns static documentation into a live, multi-domain knowledge fabric that your search, chat, and commerce experiences can explore in real time.

Cross-domain linking: Join docs to SKUs, support issues, personas, and any third-party data—well beyond Heretto’s own reltables.
Graph queries at speed: DX Graph answers relationship-heavy queries in less than 50 ms. Deploy API was never built for that load.
Hybrid RAG ready: Blend keyword + vector search with on-the-fly graph enrichment, giving LLMs richer context.
Delivery decoupled from authoring: Content teams keep writing in Heretto; delivery teams tune retrieval rules in DX Engine—no CMS changes.
Governance & observability: Core-Services Jobs, Triggers, and audit logs provide retries, metrics, and policy enforcement.
Easy downstream syndication: Same graph feed can power analytics, help centers, or partner portals via export jobs.

In short, Heretto remains the single source of truth, while DX Graph becomes the high-performance, relationship-aware runtime that unlocks personalization, RAG accuracy, and cross-system reuse.

1 Building‑Block Cheat‑Sheet

Layer	Product / Service	Purpose
Authoring	Heretto CCMS (DITA topics & maps)	Writers craft structured technical docs.
Export	Heretto Deploy API	Streams JSON/DITA OT bundles on every Publish.
Data Ops	Conscia Core Services	Jobs, Buckets, Triggers, Integration Patterns for ETL in/out of DX Graph.
Graph	DX Graph	Unified KG: tech docs, products, relationships.
Search	OpenSearch (keyword + vector)	Hybrid BM‑25 + dense‑vector retrieval.
Flow	DX Engine `discover.docs.rag`	Orchestrates search → graph → ranking (+optional LLM).
API Gateway	Universal MCP Server	Publishes capability `discoverDocs`; proxies to DX Engine.
Agent / UI	ChatGPT, website chat, voice bot, etc.	Calls `discoverDocs` and renders enriched answers.

2 Ingestion Pipeline (Core‑Services)

Step	Core‑Services Component	Action
A1	Incoming Bucket	Nightly Heretto JSON dump (`dita_*.json`) lands in `heretto-dumps` (S3/Azure).
A2	`importDataFiles` Job	Cron `0 2 * * *` parses each file → writes records to `dita-topics` collection in DX Graph.
A3	Transform Job (optional)	`transformDataFiles` normalises metadata, maps Heretto IDs to product SKUs.
A4	DX Graph Event	`DataRecordCreated/Updated` event emitted for each topic.
A5	Trigger ➜ `callDxEngine` Job	Trigger `ditaUpdated` calls DX Engine template `dita-to-opensearch` to: ① enrich with related entities ② embed body text ③ upsert doc in OpenSearch.
A6	Integration Pattern 2.1	If external systems need change notifications, event stream can hit a webhook or Kafka topic.
A7	Scheduled Export (optional)	`exportCollection` → downstream lake / BI via `processCollectionWithWebserviceEndpoint`.

Example Job & Trigger

// importDataFiles Job (runs nightly)
{
  "jobDefinitionCode": "dita-import",
  "jobType": "importDataFiles",
  "schedule": { "cron": "0 2 * * *" },
  "params": {
    "incomingBucketCode": "heretto-dumps",
    "processedBucketCode": "processed",
    "invalidBucketCode": "invalid",
    "filenamePattern": "dita_*.json",
    "recordIdentifierField": "topicId",
    "collectionCode": "dita-topics"
  }
}

// Trigger → DX Engine
{
  "eventType": "DataRecordUpdated",
  "triggerCode": "ditaUpdated",
  "criteria": "`event.data.dataCollectionCode === 'dita-topics'`",
  "job": {
    "jobType": "callDxEngine",
    "params": {
      "templateCode": "dita-to-opensearch",
      "context": "`event`"
    }
  }
}

3 Heretto Deploy API Cheat‑Sheet

Item	Detail
Base URL	`https://deploy.<org‑id>.heretto.com/api/v2/`
Auth	`X-API-Key: <key>` (Simple Key) or `Authorization: Bearer <token>`
Full Map	`GET /deployments/{depId}/maps/{mapId}?locale=en-US&depth=all&format=json`
Single Topic	`GET /deployments/{depId}/topics/{topicId}?locale=en-US&format=json`
Delta Feed	`GET /deployments/{depId}/delta?modifiedSince=2025-07-26T00:00:00Z`
Caching	Responses are prerendered & server‑cached → safe for CDN.

4 Runtime Discovery Algorithm

discoverDocs(q, k) ───► DX Flow discover.docs.rag
    ① hybridSearch(OpenSearch, q, k)          // BM25 + vector
    ② for each hit ➜ DX Graph lookup(id)      // RELATED, PREREQUISITE, etc.
    ③ scoring
        base   = 0.6*bm25 + 0.4*vector
        +10    if topicType == 'Task'
        +2     per RELATED edge 'Troubleshooting'
        LLM‑rerank top 20 (optional)
    ④ return top N enriched docs

Business rules live in DX Engine; change weights or filters without code deploys.

5 Architecture Diagram

6 End‑to‑End Sequence

#	Actor	Action
1	Heretto Publish	Writers release a new map; Deploy API JSON lands in bucket.
2	Job `dita-import`	Parses file; upserts topics into DX Graph.
3	Graph Event → Trigger	`ditaUpdated` fires `callDxEngine` job.
4	DX Engine Flow	Enriches doc, embeds text, indexes in OpenSearch.
5	Agent	Calls `discoverDocs({q, k})` via UMCP.
6	OpenSearch	Returns top k keyword+vector hits.
7	DX Graph Lookup	Graph‑augment step fetches relations per hit.
8	DX Engine Rules Engine	Applies rules / LLM re‑rank; returns enriched list.
9	Agent LLM	Crafts conversational answer & follow‑ups.

7 Orchestration Flow Design for Search Ranking

#	Component (type)	Key config	Purpose
A	Universal API – `search-os`	Method = `POST` `/opensearch/_search`	Hybrid keyword + vector query.
B	Object Mapper – `mapHits`	`hits.hits[*] → $.hits[]`	Extract `id, bm25Score, vectorScore, topicType`.
C	For-Each Loop – `enrichHits`	Iterate over `$.hits[]`	Drives per-hit graph look-up.
C1	Universal API – `graphLookup`	Path `/collections/dita-topics/records/{{hit.id}}?include=related`	Pull RELATED / PREREQUISITE edges.
C2	Data-Transformation Script – `mergeGraph`	JS snippet (see below)	Append `related[]` to the hit object.
D	Data-Transformation Script – `scoreHits`	JS scoring logic	0.6 × bm25 + 0.4 × vector + rule boosts.
E	(Optional) LLM Scorer – `llmRerank`	Model `gpt-4o-mini`	Re-orders top 20 JSON docs.
F	Object Mapper – `responseMap`	Map to MCP schema	Final payload for UMCP.

8 · DX Engine Configuration

The DX Engine orchestration for technical documentation discovery requires sophisticated configuration to handle hybrid search, graph enrichment, and intelligent scoring. The following configurations define the complete discover.docs.rag flow with all necessary components, connections, and context fields.

8.1 · Context Fields

Context fields define the data passed from the MCP server request to the DX Engine components during documentation discovery.

{
  "q": {
    "name": "Search Query",
    "dataType": "string",
    "enriched": false,
    "description": "Natural language query from AI agent"
  },
  "k": {
    "name": "Max Results",
    "dataType": "number",
    "enriched": false,
    "description": "Maximum number of results to return (default: 10)"
  },
  "useLLM": {
    "name": "Use LLM Reranking",
    "dataType": "boolean",
    "enriched": false,
    "description": "Enable optional LLM-based reranking for top results"
  },
  "searchHits": {
    "name": "Search Results",
    "dataType": "array",
    "enriched": true,
    "description": "OpenSearch results with BM25 and vector scores"
  },
  "enrichedHits": {
    "name": "Graph-Enriched Results",
    "dataType": "array",
    "enriched": true,
    "description": "Search results enriched with DX Graph relationships"
  },
  "finalResults": {
    "name": "Scored and Ranked Results",
    "dataType": "array",
    "enriched": true,
    "description": "Final ranked results ready for MCP response"
  }
}

8.2 · Sample Query

Example DX Engine query for documentation discovery:

{
  "query": {
    "components": ["discover-docs-rag-flow"],
    "context": {
      "q": "How do I install the CLI?",
      "k": 10,
      "useLLM": true
    }
  }
}

8.3 · Connections

OpenSearch Connection

{
  "opensearch-connection": {
    "name": "OpenSearch Hybrid Search",
    "connectorCode": "genericWebservice",
    "config": {
      "baseHeaders": [
        {"header": "Authorization", "value": "`'Basic ' + btoa(secret('opensearch-username') + ':' + secret('opensearch-password'))`"},
        {"header": "Content-Type", "value": "application/json"}
      ],
      "baseQueryParams": [],
      "baseUrl": "https://search.acme.io/opensearch"
    }
  }
}

DX Graph Connection

{
  "dx-graph-connection": {
    "name": "DX Graph Knowledge Base",
    "connectorCode": "genericWebservice",
    "config": {
      "baseHeaders": [
        {"header": "Authorization", "value": "`'Bearer ' + secret('dx-graph-api-token')`"},
        {"header": "Content-Type", "value": "application/json"}
      ],
      "baseQueryParams": [],
      "baseUrl": "https://graph.conscia.ai"
    }
  }
}

LLM Connection (Optional)

{
  "llm-connection": {
    "name": "OpenAI GPT-4o Mini",
    "connectorCode": "genericWebservice",
    "config": {
      "baseHeaders": [
        {"header": "Authorization", "value": "`'Bearer ' + secret('openai-api-key')`"},
        {"header": "Content-Type", "value": "application/json"}
      ],
      "baseQueryParams": [],
      "baseUrl": "https://api.openai.com/v1"
    }
  }
}

8.4 · Components

OpenSearch Hybrid Search Performs simultaneous BM25 keyword search and vector similarity search against indexed documentation. Configured with hybrid query support and optimized for sub-200ms response times.

{
  "opensearch-hybrid-search": {
    "componentType": "genericWebserviceCall",
    "name": "OpenSearch Hybrid Search",
    "description": "Hybrid BM25 + vector search against indexed documentation",
    "isAsync": false,
    "noRules": true,
    "mergeConfig": true,
    "config": {
      "connectionCode": "opensearch-connection",
      "path": "/_search",
      "method": "POST",
      "body": "`{
        'size': contextField('k') || 10,
        'query': {
          'hybrid': {
            'query': contextField('q'),
            'vector': {
              'field': 'content_embedding',
              'query_vector': contextField('q') + '_embedded',
              'k': (contextField('k') || 10) * 2
            }
          }
        },
        '_source': ['title', 'topicType', 'content', 'topicId'],
        'highlight': {
          'fields': {
            'content': {
              'fragment_size': 150,
              'number_of_fragments': 2
            }
          }
        }
      }`",
      "returnFullResponse": false
    },
    "componentRules": [],
    "contextFieldEnrichment": {
      "searchHits": "`response.hits.hits.map(hit => ({
        id: hit._id,
        title: hit._source.title,
        topicType: hit._source.topicType,
        content: hit._source.content,
        topicId: hit._source.topicId,
        bm25Score: hit._score,
        vectorScore: hit._score,
        highlights: hit.highlight
      }))`"
    },
    "subComponents": {},
    "trigger": "`contextField('q')`",
    "validity": "`true`",
    "cacheConfig": {
      "cached": true,
      "cacheTags": ["search", "docs"],
      "cacheTtl": 300
    },
    "skipConfig": {
      "skipOnFailedDependency": false,
      "skipOnSkippedDependency": false,
      "skipOnInvalidDependency": false
    },
    "metadata": [],
    "tags": ["search", "opensearch", "hybrid"],
    "attributeConfig": [],
    "responseTransform": "`response.hits`"
  }
}

DX Graph Relationship Lookup Retrieves related documentation topics, prerequisites, and troubleshooting guides from the knowledge graph. Uses the DITA relationship structure to provide contextual connections for each search result.

{
  "dx-graph-lookup": {
    "componentType": "genericWebserviceCall",
    "name": "DX Graph Relationship Lookup",
    "description": "Enriches search results with graph relationships and prerequisites",
    "isAsync": false,
    "noRules": true,
    "mergeConfig": true,
    "config": {
      "connectionCode": "dx-graph-connection",
      "path": "`'/collections/dita-topics/records/' + contextField('hitId')`",
      "method": "GET",
      "queryParams": [
        {"parameter": "include", "value": "related"},
        {"parameter": "depth", "value": "2"}
      ],
      "returnFullResponse": false
    },
    "componentRules": [],
    "contextFieldEnrichment": {
      "relatedTopics": "`response.related || []`",
      "prerequisites": "`response.related.filter(r => r.relationshipType === 'PREREQUISITE')`",
      "troubleshooting": "`response.related.filter(r => r.relationshipType === 'TROUBLESHOOTING')`",
      "concepts": "`response.related.filter(r => r.relationshipType === 'CONCEPT_OF')`"
    },
    "subComponents": {},
    "trigger": "`contextField('hitId')`",
    "validity": "`true`",
    "cacheConfig": {
      "cached": true,
      "cacheTags": ["graph", "relationships"],
      "cacheTtl": 1800
    },
    "skipConfig": {
      "skipOnFailedDependency": false,
      "skipOnSkippedDependency": false,
      "skipOnInvalidDependency": false
    },
    "metadata": [],
    "tags": ["graph", "relationships", "dita"],
    "attributeConfig": [],
    "responseTransform": "`response`"
  }
}

Intelligent Scoring Engine Applies business rules and machine learning to score and rank documentation results. Combines search relevance with topic type preferences, relationship density, and optional LLM reranking.

{
  "intelligent-scoring": {
    "componentType": "dataTransformationScript",
    "name": "Intelligent Scoring Engine",
    "description": "Advanced scoring combining search relevance, business rules, and optional LLM ranking",
    "isAsync": false,
    "noRules": true,
    "mergeConfig": true,
    "config": {
      "script": "`
        const hits = contextField('searchHits') || [];
        const useLLM = contextField('useLLM') || false;
        
        // Apply business rule scoring
        const scoredHits = hits.map(hit => {
          let baseScore = (hit.bm25Score * 0.6) + (hit.vectorScore * 0.4);
          let ruleBoost = 0;
          
          // Topic type boosts
          if (hit.topicType === 'Task') ruleBoost += 10;
          if (hit.topicType === 'Troubleshooting') ruleBoost += 8;
          if (hit.topicType === 'Concept') ruleBoost += 5;
          
          // Relationship density boost
          const relatedCount = hit.relatedTopics?.length || 0;
          ruleBoost += Math.min(relatedCount * 0.5, 5);
          
          // Troubleshooting relationship boost
          const hasTroubleshooting = hit.troubleshooting?.length > 0;
          if (hasTroubleshooting) ruleBoost += 2;
          
          return {
            ...hit,
            baseScore,
            ruleBoost,
            finalScore: baseScore + ruleBoost
          };
        });
        
        // Sort by final score
        scoredHits.sort((a, b) => b.finalScore - a.finalScore);
        
        return {
          hits: scoredHits.slice(0, contextField('k') || 10),
          needsLLMRerank: useLLM && scoredHits.length <= 20
        };
      `"
    },
    "componentRules": [],
    "contextFieldEnrichment": {
      "scoredHits": "`response.hits`",
      "needsLLMRerank": "`response.needsLLMRerank`"
    },
    "subComponents": {},
    "trigger": "`contextField('searchHits')`",
    "validity": "`true`",
    "cacheConfig": {
      "cached": false,
      "cacheTags": [],
      "cacheTtl": 0
    },
    "skipConfig": {
      "skipOnFailedDependency": false,
      "skipOnSkippedDependency": false,
      "skipOnInvalidDependency": false
    },
    "metadata": [],
    "tags": ["scoring", "ranking", "business-rules"],
    "attributeConfig": [],
    "responseTransform": "`response`"
  }
}

8.5 · Component Template (Flow)

Documentation Discovery RAG Flow

{
  "discover-docs-rag-flow": {
    "componentCodes": [
      "opensearch-hybrid-search",
      "dx-graph-lookup",
      "intelligent-scoring",
      "llm-reranking"
    ],
    "elementType": "Template",
    "name": "Documentation Discovery RAG Flow",
    "description": "Complete hybrid RAG flow for technical documentation discovery with graph enrichment and intelligent ranking"
  }
}

8.6 · Secrets Management

{
  "opensearch-username": {
    "name": "OpenSearch Username",
    "description": "Username for OpenSearch cluster authentication"
  },
  "opensearch-password": {
    "name": "OpenSearch Password",
    "description": "Password for OpenSearch cluster authentication"
  },
  "dx-graph-api-token": {
    "name": "DX Graph API Token",
    "description": "Bearer token for DX Graph API access"
  },
  "openai-api-key": {
    "name": "OpenAI API Key",
    "description": "API key for OpenAI GPT-4o Mini access (optional LLM reranking)"
  }
}

9 Key Component Configurations (Legacy Reference)

A Universal API `search-os`

method: POST
url: https://search.acme.io/opensearch/_search
headers:
  Content-Type: application/json
bodyTemplate: |
  {
    "size": {{variables.k}},
    "query": {
      "hybrid": {
        "query": "{{variables.q}}",
        "vector": "{{variables.q | embed 'text-embedding-3'}}"
      }
    },
    "_source": ["title","topicType"]
  }
responsePath: "$.hits"
timeoutMs: 200

B Object Mapper `mapHits`

mappings:
  - path: "$.hits.hits[*]"
    to:   "$.hits[]"
    fields:
      id:          "$._id"
      title:       "$._source.title"
      topicType:   "$._source.topicType"
      bm25Score:   "$._score.keyword"
      vectorScore: "$._score.vector"

C1 Universal API `graphLookup`

method: GET
url: https://graph.conscia.ai/collections/dita-topics/records/{{item.id}}
params:
  include: related
responsePath: "$"
timeoutMs: 100

C2 · Data Transformation Script `mergeGraph`

ctx.item.related = ctx.api.related || [];
return ctx.item;

D · Data Transformation Script `scoreHits`

function ruleBoost(hit){
  let b = 0;
  if (hit.topicType === 'Task') b += 10;
  if (hit.related.some(r => r.edge === 'Troubleshooting')) b += 2;
  return b;
}
ctx.hits.forEach(h => {
  h.baseScore = 0.6*h.bm25Score + 0.4*h.vectorScore;
  h.score = h.baseScore + ruleBoost(h);
});
ctx.hits.sort((a,b) => b.score - a.score);
ctx.hits = ctx.hits.slice(0, ctx.variables.k);
return ctx.hits;

E · LLM Scorer `llmRerank`

enabled: "{{ variables.k <= 20 && variables.useLLM === true }}"
model: gpt-4o-mini
prompt: |
  Rank the following JSON docs by relevance to "{{variables.q}}".
  Return the docs in the same JSON shape, sorted best → worst.
input: "{{ json ctx.hits }}"

F · Object Mapper `responseMap`

mappings:
  - path: "$.hits[*]"
    to:   "$.hits[]"
    fields:
      id:        "$.id"
      title:     "$.title"
      score:     "$.score"
      topicType: "$.topicType"
      related:   "$.related"

9 · Flow Variables

Variable	Source	Used by
`q`	UMCP request	Component A
`k`	UMCP request (default 10)	A, D
`hits[]`	Mapper `mapHits`	Loop C, Scripts D & E
`useLLM`	UMCP flag (bool)	LLM Scorer

10 · Live Tuning

Weights & boosts live in scoreHits — change them in the console; no redeploy.
Search backend swap — just update search-os URL/body.
Disable LLM — set useLLM:false in the UMCP request.

7 · Universal MCP Server Stub

paths:
  /discoverDocs:
    post:
      summary: Hybrid tech‑doc discovery
      requestBody:
        required: true
        content:
          application/json:
            schema:
              type: object
              properties:
                q:
                  type: string
                  description: Natural‑language query
                k:
                  type: integer
                  description: Max hits
                  default: 10
      responses:
        "200":
          description: Enriched docs
          content:
            application/json:
              schema:
                $ref: '#/components/schemas/DocHitList'

8 · Implementation Hints

Scalability – Core‑Services Jobs are stateless → run in parallel pods optimizing data ingestion performance;
Latency Targets – Keep LLM re‑rank to top 20 docs only. Aim for less than 200 ms p95.
Versioning – Name capability paths /v1/discoverDocs so future schema changes won’t break agents.
Monitoring – Core‑Services exposes Job execution logs, retries, and Trigger audit trails—pipe them to Grafana/Datadog or use Conscia's Observability dashboard.

✅ Result

A fully governed, end‑to‑end pipeline that:

Automates ingestion of Heretto DITA exports via Core‑Services Jobs & Buckets.
Enriches & embeds topics in DX Engine then indexes them in OpenSearch.
Serves hybrid, graph‑aware search through a single Universal MCP Server capability (discoverDocs).
Empowers agents & UIs to deliver rich, context‑aware technical answers—in milliseconds.

1 · Recipe Purpose​

1 Building‑Block Cheat‑Sheet​

2 Ingestion Pipeline (Core‑Services)​

Example Job & Trigger​

3 Heretto Deploy API Cheat‑Sheet​

4 Runtime Discovery Algorithm​

5 Architecture Diagram​

6 End‑to‑End Sequence​

7 Orchestration Flow Design for Search Ranking​

8 · DX Engine Configuration​

8.1 · Context Fields​

8.2 · Sample Query​

8.3 · Connections​

8.4 · Components​

8.5 · Component Template (Flow)​

8.6 · Secrets Management​

9 Key Component Configurations (Legacy Reference)​

A Universal API search-os​

B Object Mapper mapHits​

C1 Universal API graphLookup​

C2 · Data Transformation Script mergeGraph​

D · Data Transformation Script scoreHits​

E · LLM Scorer llmRerank​

F · Object Mapper responseMap​

9 · Flow Variables​

10 · Live Tuning​

7 · Universal MCP Server Stub​

8 · Implementation Hints​

✅ Result​