domain-memory-agent

Name: domain-memory-agent
Rating: 4.5 (4 reviews)
Author: Jeremy Longshore

Knowledge base with TF-IDF semantic search and extractive summarization - no ML dependencies required

View on GitHub

Author Jeremy Longshore

Namespace @jeremylongshore/claude-code-plugins

Category productivity

Version 1.0.0

Stars 38

Downloads 4

self.md verified

Table of content

Knowledge base with TF-IDF semantic search and extractive summarization - no ML dependencies required

Installation

npx claude-plugins install @jeremylongshore/claude-code-plugins/domain-memory-agent

Folders: servers, skills, tests

Files: LICENSE, README.md, package.json, server.json, tsconfig.json, vitest.config.ts

Documentation

Knowledge base with semantic search, document storage, and automatic summarization

A lightweight MCP server for domain-specific knowledge management using TF-IDF semantic search (no external ML dependencies). Perfect for building AI memory systems and RAG applications.

Features

Document Storage - Store documents with tags and metadata
Semantic Search - TF-IDF based search (no external dependencies)
Summarization - Automatic extractive summaries with caching
Full CRUD - Create, read, update, delete documents
Tagging System - Organize knowledge by tags
Pagination - Efficient browsing of large knowledge bases

Installation

/plugin install domain-memory-agent@claude-code-plugins-plus

6 MCP Tools

1. `store_document`

Store documents in knowledge base with automatic indexing.

{
  "title": "Machine Learning Basics",
  "content": "Machine learning is a subset of AI...",
  "tags": ["ai", "ml", "tutorial"],
  "metadata": {
    "author": "John Doe",
    "category": "Technical"
  }
}

2. `semantic_search`

Search using TF-IDF relevance scoring.

{
  "query": "machine learning algorithms",
  "limit": 10,
  "tags": ["ai"],
  "minScore": 0.1
}

Returns: Ranked results with relevance scores and excerpts.

3. `summarize`

Generate extractive summaries (cached).

{
  "documentId": "doc123",
  "maxSentences": 5,
  "regenerate": false
}

4. `list_documents`

Browse knowledge base with filtering.

{
  "tags": ["ai"],
  "sortBy": "updated",
  "limit": 50,
  "offset": 0
}

5. `get_document`

Retrieve full document by ID.

{
  "documentId": "doc123"
}

6. `delete_document`

Remove document and unindex.

{
  "documentId": "doc123"
}

Quick Start

// 1. Store knowledge
store_document({
  title: "API Design Best Practices",
  content: "RESTful APIs should be...",
  tags: ["api", "architecture"]
})

// 2. Search knowledge
semantic_search({
  query: "REST API design patterns",
  limit: 5
})

// 3. Get summary
summarize({
  documentId: "doc123",
  maxSentences: 3
})

How Semantic Search Works

Uses TF-IDF (Term Frequency-Inverse Document Frequency):

Tokenization: Text → lowercase words (filter short words)
Term Frequency: Count word occurrences in each document
Document Frequency: Track how many documents contain each term
IDF: Rare terms get higher scores
TF-IDF Score: Rank documents by relevance

Advantages:

No external ML dependencies
Fast and lightweight
Explainable results
Works offline

Architecture

In-Memory Storage:
├── documents: Map<id, Document>
├── tfidfIndex:
│   ├── termFrequencies: Map<term, Map<docId, freq>>
│   ├── documentFrequencies: Map<term, count>
│   └── documentLengths: Map<docId, totalTerms>

Note: Data persists during session but clears on restart. Future versions will add persistence.

Use Cases

RAG Systems - Store domain knowledge for AI retrieval
Documentation Search - Index and search project docs
Research Notes - Organize research with semantic search
Customer Support - Build knowledge bases for support agents
Personal Knowledge - Second brain / Zettelkasten system

Performance

Document Storage: < 10ms per document
Search: < 50ms for 1000 documents
Summarization: < 100ms per document
Indexing: Real-time (synchronous)

Best Practices

Use descriptive titles - Improves search relevance
Tag consistently - Makes filtering effective
Store focused documents - Better than huge files
Cache summaries - Regenerate only when needed
Regular cleanup - Delete outdated documents

Example Workflows

Building a Technical Knowledge Base

# Store API documentation
store_document(title: "REST API Guide", content: "...", tags: ["api", "docs"])

# Store best practices
store_document(title: "Error Handling Patterns", content: "...", tags: ["patterns", "errors"])

# Search when needed
semantic_search(query: "handle API errors", tags: ["api"])

Research Note System

# Store research papers
store_document(title: "Transformer Architecture", content: "...", tags: ["ml", "nlp", "research"])

# Find related research
semantic_search(query: "attention mechanisms", tags: ["ml"])

# Get quick summary
summarize(documentId: "paper123", maxSentences: 5)

License

MIT License - see LICENSE

project-health-auditor - Code quality analysis
conversational-api-debugger - API failure debugging
design-to-code - Figma to components
workflow-orchestrator - Task automation

Made with ️ by Intent Solutions

Source

View on GitHub

Tags: productivity mcp knowledge-base semantic-search tfidf summarization rag documentation

domain-memory-agent

Installation

Contents

Documentation

Features

Installation

6 MCP Tools

1. store_document

2. semantic_search

3. summarize

4. list_documents

5. get_document

6. delete_document

Quick Start

How Semantic Search Works

Architecture

Use Cases

Performance

Best Practices

Example Workflows

Building a Technical Knowledge Base

Research Note System

License

Related Tools

Source

1. `store_document`

2. `semantic_search`

3. `summarize`

4. `list_documents`

5. `get_document`

6. `delete_document`