Overview
The Nadoo AI Knowledge Base provides a complete Retrieval-Augmented Generation (RAG) pipeline that connects your documents to your AI agent workflows. Instead of relying solely on what a language model was trained on, you can ground its responses in your own data — company documents, product manuals, research papers, or any text corpus. The pipeline has four stages: Document Processing, Vector Storage, Retrieval, and Integration.
RAG Pipeline
Document Processing
Upload files and convert them into searchable chunks.
- Upload — Drag and drop files or provide URLs
- Parse — Extract text from PDF, DOCX, TXT, Markdown, Excel, and web pages
- Chunk — Split documents into overlapping segments (default: 1000 characters with 200-character overlap)
- Extract metadata — Capture titles, headings, page numbers, and custom metadata for filtering
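The chunking step above can be sketched as a simple sliding window over characters, using the documented defaults (1000-character chunks, 200-character overlap). The function name and signature are illustrative, not the product's actual API:

```python
def chunk_text(text: str, chunk_size: int = 1000, overlap: int = 200) -> list[str]:
    """Split text into overlapping character windows (hypothetical helper)."""
    if chunk_size <= overlap:
        raise ValueError("chunk_size must exceed overlap")
    chunks = []
    step = chunk_size - overlap  # each window starts `step` chars after the last
    for start in range(0, len(text), step):
        chunk = text[start:start + chunk_size]
        if chunk:
            chunks.append(chunk)
        if start + chunk_size >= len(text):
            break  # the final window already reached the end of the text
    return chunks
```

The overlap means the tail of each chunk is repeated at the head of the next, so a sentence that straddles a boundary is still retrievable as a whole.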
Vector Storage
Generate embeddings and store them for fast similarity search.
- Generate embeddings — Convert each chunk into a vector using a configurable embedding model (OpenAI, HuggingFace, Azure, Bedrock, Google, vLLM, Ollama, Local)
- Store in vector database — Persist vectors via pluggable VectorStore (pgvector default, Milvus/Qdrant planned). Distance metrics: cosine, euclidean, dot product
- Index — Build HNSW or IVFFlat indexes for approximate nearest neighbor search at scale
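The storage contract can be sketched as an in-memory store with a linear cosine scan. This is a hypothetical stand-in for the pluggable VectorStore interface, not its real implementation; production uses pgvector with HNSW or IVFFlat indexes instead of a full scan:

```python
import math

class InMemoryVectorStore:
    """Minimal sketch of a vector store: add vectors, search by cosine similarity."""

    def __init__(self):
        self._rows = []  # (chunk_id, vector, metadata)

    def add(self, chunk_id, vector, metadata=None):
        self._rows.append((chunk_id, vector, metadata or {}))

    @staticmethod
    def _cosine(a, b):
        dot = sum(x * y for x, y in zip(a, b))
        norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
        return dot / norm if norm else 0.0

    def search(self, query_vector, top_k=5):
        # Score every stored vector, then keep the top_k highest scores.
        scored = [(self._cosine(query_vector, v), cid) for cid, v, _ in self._rows]
        scored.sort(reverse=True)
        return scored[:top_k]
```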
Retrieval
Find the most relevant chunks for a given query.
- Query embedding — Convert the user’s question into a vector using the same embedding model
- Similarity search — Find the closest vectors in the index
- Rerank — Optionally re-score results with a cross-encoder reranker for higher precision
- Context assembly — Combine the top chunks into a context window, respecting token limits
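The last two steps, filtering by similarity score and assembling a bounded context window, can be sketched as follows. Names and parameters are illustrative, and a real pipeline would count model tokens rather than characters:

```python
def assemble_context(scored_chunks, score_threshold=0.5, max_chars=3000):
    """Join top chunks into one context string (hypothetical helper).

    scored_chunks: list of (score, text) pairs, highest score first.
    """
    parts, used = [], 0
    for score, text in scored_chunks:
        if score < score_threshold:
            continue  # drop weakly related chunks
        if used + len(text) > max_chars:
            break  # respect the context budget
        parts.append(text)
        used += len(text)
    return "\n\n".join(parts)
```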
Integration
Inject retrieved context into your AI agent’s prompt.
- Prompt injection — Insert retrieved chunks into the system prompt or user message
- Citation tracking — Record which documents contributed to the response for transparency
- Feedback loop — Use user feedback to improve retrieval quality over time
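Prompt injection with citation tracking can be sketched like this. The message shape follows the common chat-completions convention; the function and the `[source_id]` tagging scheme are illustrative assumptions, not the product's actual format:

```python
def build_prompt(question, retrieved,
                 system="Answer using only the provided context."):
    """Inject retrieved chunks into the system prompt and record citations.

    retrieved: list of (source_id, chunk_text) pairs.
    """
    # Tag each chunk with its source id so answers can cite it.
    context = "\n\n".join(f"[{sid}] {text}" for sid, text in retrieved)
    citations = [sid for sid, _ in retrieved]
    messages = [
        {"role": "system", "content": f"{system}\n\nContext:\n{context}"},
        {"role": "user", "content": question},
    ]
    return messages, citations
```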
Supported Document Formats
| Format | Extensions | Notes |
|---|---|---|
| PDF | .pdf | OCR support for scanned documents |
| Microsoft Word | .docx, .doc | Preserves heading structure |
| Plain Text | .txt | Direct ingestion |
| Markdown | .md, .mdx | Preserves heading hierarchy |
| Excel | .xlsx, .xls | Each sheet processed separately |
| Web Pages | URL | Fetches and parses HTML content |
Search Modes
The knowledge base supports three search modes that you can configure per query or per knowledge base.
- Vector Search — Semantic similarity. Finds documents whose meaning is closest to the query, even if the exact words differ. Uses cosine similarity on embedding vectors. Best for natural language questions where the user's phrasing may not match the document's exact wording.
- BM25 (Keyword) — Ranks documents by exact term matches using the BM25 scoring function. Best for queries containing specific identifiers, names, or jargon.
- Hybrid — Combines vector and BM25 results into a single ranking.
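Hybrid mode must merge two differently scaled ranked lists. Reciprocal rank fusion is one common strategy for this; the document does not specify which fusion method the knowledge base actually uses, so treat this as an illustrative sketch:

```python
def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document ids into one ranking.

    Each document scores 1 / (k + rank + 1) per list it appears in;
    k=60 is the conventional damping constant from the RRF literature.
    """
    scores = {}
    for ranked in rankings:
        for rank, doc_id in enumerate(ranked):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank + 1)
    return sorted(scores, key=scores.get, reverse=True)
```

Because only ranks are used, the vector scores (cosine, 0 to 1) and BM25 scores (unbounded) never need to be normalized against each other.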
Configuration
Embedding Model
Choose the embedding model used to generate vectors. The model must be consistent between indexing and querying. Embedding providers include OpenAI, HuggingFace, Local models, Azure OpenAI, AWS Bedrock, Google AI Studio, Google Vertex AI, vLLM, and Ollama. The embedding model is set at the knowledge base level and applies to all documents within it.
Chunking
Control how documents are split into segments.

| Parameter | Default | Description |
|---|---|---|
| chunk_size | 1000 | Maximum number of characters per chunk |
| chunk_overlap | 200 | Number of overlapping characters between consecutive chunks |
| separator | \n\n | Primary split boundary (falls back to sentence/word boundaries) |
Retrieval Settings
Fine-tune how documents are fetched at query time.

| Parameter | Default | Description |
|---|---|---|
| top_k | 5 | Number of chunks to retrieve |
| score_threshold | 0.5 | Minimum similarity score (0.0 to 1.0) |
| reranking | false | Enable cross-encoder reranking for higher precision |
| rerank_model | — | Model to use for reranking (e.g., cohere-rerank-v3) |
| rerank_top_k | 3 | Number of chunks to keep after reranking |
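As a plain configuration object, these settings might look like the following. The key names simply mirror the table; the actual configuration surface (keys, file format, API) is not specified here:

```python
# Hypothetical per-query retrieval settings mirroring the documented defaults.
retrieval_settings = {
    "top_k": 5,
    "score_threshold": 0.5,
    "reranking": False,
    "rerank_model": None,   # e.g. "cohere-rerank-v3" when reranking is enabled
    "rerank_top_k": 3,
}
```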
Advanced Features
Contextual Retrieval
Enhance each chunk with a brief AI-generated summary of its context within the full document. This improves retrieval accuracy by embedding each chunk with awareness of its surrounding content.
Knowledge Graphs
Extract entities and relationships from documents to build a knowledge graph. This enables graph-based queries that traverse relationships rather than relying solely on text similarity.
Multi-Hop Reasoning
For complex questions that require information from multiple documents, multi-hop reasoning chains together several retrieval steps:
- Retrieve initial context for the question
- Identify follow-up sub-questions based on the initial context
- Retrieve additional context for each sub-question
- Synthesize all retrieved information into a comprehensive answer
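The loop above can be sketched with caller-supplied callables for retrieval, follow-up generation, and synthesis. All names are illustrative; in practice the follow-up and synthesis steps would be LLM calls:

```python
def multi_hop_answer(question, retrieve, propose_followups, synthesize, max_hops=2):
    """Chain retrieval steps: fetch context, ask sub-questions, fetch more, synthesize.

    retrieve(q) -> list of context chunks
    propose_followups(q, context) -> list of sub-questions ([] to stop)
    synthesize(q, context) -> final answer
    """
    context = list(retrieve(question))  # step 1: initial retrieval
    for _ in range(max_hops):
        followups = propose_followups(question, context)  # step 2: sub-questions
        if not followups:
            break
        for sub in followups:
            context.extend(retrieve(sub))  # step 3: retrieve per sub-question
    return synthesize(question, context)  # step 4: synthesize the answer
```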