Architecture

GNO is a local knowledge indexing and search system built on SQLite.

System Overview

┌─────────────────────────────────────────────────────────────────────────────┐
│                                   User                                      │
│                       (developer, researcher, writer)                       │
└─────────────────────────────────────────────────────────────────────────────┘
                                      │
              ┌─────────────────┬─────┴───────────┬───────────┐
              │                 │                 │           │
              ▼                 ▼                 ▼           ▼
        ┌──────────┐     ┌──────────────┐   ┌───────────┐ ┌───────────┐
        │   CLI    │     │  MCP Server  │   │  AI Agent │ │  Web UI   │
        │  (gno)   │     │  (gno mcp)   │   │  (Claude) │ │(gno serve)│
        └──────────┘     └──────────────┘   └───────────┘ └───────────┘
              │                 │                 │           │
              └─────────────────┼─────────────────┴───────────┘
                                │
                                ▼
       ┌───────────────────────────────────────────────────────────────┐
       │                           GNO Core                            │
       │  ┌──────────────┐  ┌────────────┐  ┌────────────────────────┐ │
       │  │  Ingestion   │  │  Pipeline  │  │   LLM Layer            │ │
       │  │  (walker,    │  │  (search,  │  │   (embed, rerank, gen) │ │
       │  │   chunker)   │  │   fusion)  │  │   (node-llama-cpp)     │ │
       │  └──────────────┘  └────────────┘  └────────────────────────┘ │
       └───────────────────────────────────────────────────────────────┘
                                      │
                                      ▼
       ┌───────────────────────────────────────────────────────────────┐
       │                         Storage Layer                         │
       │  ┌──────────────┐  ┌──────────────┐  ┌──────────────────────┐ │
       │  │   SQLite     │  │  FTS5 +      │  │    sqlite-vec        │ │
       │  │  (documents, │  │  Snowball    │  │   (vector KNN)       │ │
       │  │   chunks)    │  │  (20+ langs) │  │   (optional)         │ │
       │  └──────────────┘  └──────────────┘  └──────────────────────┘ │
       └───────────────────────────────────────────────────────────────┘
                                      │
                                      ▼
       ┌───────────────────────────────────────────────────────────────┐
       │                          File System                          │
       │          ~/notes    ~/work/docs    ~/papers                   │
       │           (collections configured by user)                    │
       └───────────────────────────────────────────────────────────────┘

Data Flow

Ingestion Pipeline

File on disk
    │
    ▼ Walker (glob patterns, exclude lists)
    │
    ▼ Hash source content (SHA-256 → sourceHash)
    │
    ├─[ sourceHash unchanged ]─► Skip (file not modified)
    │
    ▼ Converter (MIME detection → Markdown)
    │
    ▼ Canonicalize (NFC, normalize whitespace)
    │
    ▼ Hash canonical markdown (→ mirrorHash)
    │
    ├─[ mirrorHash exists ]─► Reuse content (deduplication)
    │
    ▼ Chunker (~800 tokens, 15% overlap)
    │
    ▼ Store (SQLite: documents, content, chunks, document-level FTS)
    │
    ▼ [Optional] Embed chunks with title context (llama.cpp → vectors)
    │   Format: "title: Doc Title | text: chunk content..."
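
The chunking and embedding-input steps can be sketched roughly as follows. This is a minimal illustration, not GNO's actual chunker: whitespace-separated words stand in for real tokens, and the names `chunkText` and `embeddingInput` are hypothetical. Only the ~800-token window, the 15% overlap, and the "title: ... | text: ..." format come from the pipeline above.

```typescript
// Sketch only: whitespace-separated words approximate tokens (assumption).
interface Chunk {
  seq: number; // position of the chunk within the document
  text: string;
}

// Sliding window of maxTokens with a fractional overlap between neighbours.
function chunkText(text: string, maxTokens = 800, overlapRatio = 0.15): Chunk[] {
  const tokens = text.split(/\s+/).filter((t) => t.length > 0);
  const overlap = Math.round(maxTokens * overlapRatio);
  const step = Math.max(1, maxTokens - overlap);
  const chunks: Chunk[] = [];
  for (let start = 0; start < tokens.length; start += step) {
    chunks.push({
      seq: chunks.length,
      text: tokens.slice(start, start + maxTokens).join(" "),
    });
    if (start + maxTokens >= tokens.length) break; // this window reached the end
  }
  return chunks;
}

// Prefix each chunk with its document title before embedding.
function embeddingInput(title: string, body: string): string {
  return `title: ${title} | text: ${body}`;
}
```

With the defaults, consecutive chunks share about 120 tokens, so a sentence that straddles a boundary still appears whole in at least one chunk.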

Search Pipeline

User query
    │
    ▼ Detect query language (franc, 30+ languages)
    │
    ├─[ Structured query modes provided ]─► Use provided term/intent/hyde entries
    │
    ├─[ BM25-only mode ]─► searchBm25 only (document-level)
    │
    ▼ Strong signal check (skip expansion if confident BM25 match)
    │
    ▼ [Optional] Query expansion (LLM variants + HyDE)
    │
    ▼ Document-level BM25 Search (FTS5 + Snowball stemmer)
    │
    ▼ Chunk-level Vector Search (sqlite-vec KNN)
    │
    ▼ RRF Fusion (k=60, 2× weight for original query, tiered bonus)
    │
    ▼ [Optional] Rerank best chunk per document (Qwen3, 4K chars)
    │
    ▼ Results (sorted by blended score)
    │
    ▼ [Optional] Answer stage (adaptive source selection + citation hygiene)
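
For the fusion step, here is a minimal sketch of weighted Reciprocal Rank Fusion, assuming the standard formula score(d) = Σ weight / (k + rank). It uses k=60 and a 2× weight for lists produced by the original query, as stated above. The tiered bonus is not specified in this document and is left out. `RankedList` and `rrfFuse` are hypothetical names, not GNO's API.

```typescript
interface RankedList {
  docids: string[]; // best match first
  weight: number; // 2 for lists from the original query, 1 for expansion variants
}

// Weighted RRF: each list contributes weight / (k + rank) per document.
function rrfFuse(lists: RankedList[], k = 60): Array<{ docid: string; score: number }> {
  const scores = new Map<string, number>();
  for (const list of lists) {
    list.docids.forEach((docid, index) => {
      const rank = index + 1; // ranks are 1-based
      scores.set(docid, (scores.get(docid) ?? 0) + list.weight / (k + rank));
    });
  }
  return Array.from(scores, ([docid, score]) => ({ docid, score })).sort(
    (a, b) => b.score - a.score,
  );
}

// BM25 and vector lists for the original query, plus one expansion variant.
const fused = rrfFuse([
  { docids: ["a", "b"], weight: 2 },
  { docids: ["b", "c"], weight: 2 },
  { docids: ["c"], weight: 1 },
]);
```

Because RRF uses ranks rather than raw scores, BM25 and vector results can be combined without normalizing their score scales.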

Retrieval V2 Controls

Structured query modes: callers can pass explicit term, intent, and hyde entries.
Compatibility: existing query calls still work; structured modes are opt-in.
Mode behavior: when structured modes are present, generated expansion is skipped for that query.
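
The opt-in behavior above could be modeled roughly like this. Every type and function name here is hypothetical and chosen for illustration; the actual request shape is defined by the GNO specs, not by this sketch.

```typescript
// Hypothetical shapes; the real query-mode types may differ.
type QueryModeKind = "term" | "intent" | "hyde";

interface QueryModeEntry {
  kind: QueryModeKind;
  text: string;
}

interface ExpansionPlan {
  source: "structured" | "generated" | "none";
  entries: QueryModeEntry[];
}

// Caller-provided entries take precedence, and generated expansion is skipped.
function planExpansion(
  structured: QueryModeEntry[] | undefined,
  generate: (() => QueryModeEntry[]) | undefined,
): ExpansionPlan {
  if (structured && structured.length > 0) {
    return { source: "structured", entries: structured };
  }
  if (generate) {
    return { source: "generated", entries: generate() };
  }
  return { source: "none", entries: [] };
}
```

A plain query with no structured entries falls through to the generated path, which matches the compatibility note above.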

Observability Surfaces

--explain includes per-stage timings (lang, expansion, bm25, vector, fusion, rerank, assembly, total).
Explain output includes fallback and cache counters for retrieval diagnostics.
Result explain lines include score components (bm25/vector/fusion/rerank/blended).
gno ask --json may include meta.answerContext with explain details for selected and dropped sources.

Code Architecture

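GNO uses “Ports without DI”, a pragmatic simplification of hexagonal architecture in which each entry point constructs its adapters directly instead of receiving them from a dependency-injection container: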

CLI/MCP/Web UI → new Adapter() → adapter.createPort() → Port interface → Pipeline

Port interfaces (in src/llm/types.ts):

EmbeddingPort - vector embeddings
GenerationPort - LLM text generation
RerankPort - cross-encoder reranking
VectorIndexPort - vector search (in src/store/vector)

Adapters (instantiate ports):

LlmAdapter - creates LLM ports via node-llama-cpp
SqliteAdapter - SQLite storage

Why not full hexagonal?

Single implementation per port (no swappable backends)
CLI tool with fixed dependencies - DI adds complexity without benefit
Pipeline code still testable via port interfaces
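
As a rough illustration of the pattern, the sketch below wires a port by hand and passes it to pipeline code. The names `StubLlmAdapter`, `createEmbeddingPort`, and `embedChunks` are hypothetical, and the port is synchronous only to keep the example short. The real interfaces live in src/llm/types.ts and may look different.

```typescript
// Port: the narrow interface that pipeline code depends on.
// Hypothetical signature, kept synchronous for brevity.
interface EmbeddingPort {
  embed(text: string): number[];
}

// Adapter: constructed directly by the entry point, with no DI container.
class StubLlmAdapter {
  createEmbeddingPort(): EmbeddingPort {
    return {
      // Toy "embedding" (length, vowel count) so the example runs without a model.
      embed: (text) => [text.length, (text.match(/[aeiou]/g) ?? []).length],
    };
  }
}

// Pipeline code receives the port as a plain argument.
function embedChunks(port: EmbeddingPort, chunks: string[]): number[][] {
  return chunks.map((chunk) => port.embed(chunk));
}

// Wiring as in the flow above: new Adapter() -> adapter.createPort() -> pipeline.
const port = new StubLlmAdapter().createEmbeddingPort();
const vectors = embedChunks(port, ["abc", "hello"]);
```

Since `embedChunks` only sees the interface, a test can pass any object with an `embed` method, which is how the pipeline stays testable without a container.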

Key Components

Storage

Table	Purpose
documents	Source file tracking (path, hash, docid)
content	Canonical markdown by mirrorHash
content_chunks	Chunked text (~800 tokens each, 15% overlap)
documents_fts	Document-level FTS5 with Snowball stemmer
content_vectors	Chunk embeddings with title context (optional)
doc_tags	Document tags (frontmatter and user-added)
doc_links	Wiki and markdown links between documents

Content Addressing

GNO uses content-addressed storage:

sourceHash = SHA-256 of original file content
mirrorHash = SHA-256 of canonical markdown

Source files whose canonical markdown is identical share one mirrorHash, and therefore the same chunks and vectors. This deduplicates storage and avoids re-chunking and re-embedding content that is already indexed.
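
A minimal sketch of the two hashes, using Node's built-in crypto module. The canonicalization rules here (NFC, LF line endings, stripped trailing spaces, collapsed blank lines) are an assumed reading of "NFC, normalize whitespace", and the helper names are hypothetical.

```typescript
import { createHash } from "node:crypto";

function sha256Hex(data: string): string {
  return createHash("sha256").update(data).digest("hex");
}

// Assumed canonical form: NFC, LF newlines, no trailing spaces,
// at most one blank line in a row, exactly one trailing newline.
function canonicalize(markdown: string): string {
  return (
    markdown
      .normalize("NFC")
      .replace(/\r\n?/g, "\n")
      .replace(/[ \t]+$/gm, "")
      .replace(/\n{3,}/g, "\n\n")
      .trim() + "\n"
  );
}

// sourceHash changes whenever the file bytes change.
const sourceHash = (raw: string): string => sha256Hex(raw);
// mirrorHash changes only when the canonical markdown changes.
const mirrorHash = (markdown: string): string => sha256Hex(canonicalize(markdown));
```

Two files that differ only in line endings or trailing whitespace get different sourceHash values but the same mirrorHash, so they can share stored content.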

LLM Models

All models run locally via node-llama-cpp:

Model	Purpose	Default
Embed	Generate vector embeddings	bge-m3-Q4 (1024 dims)
Rerank	Cross-encoder scoring	Qwen3-Reranker-0.6B-Q8 (32K context)
Gen	Answer generation	Qwen3-1.7B-Q4

Models are GGUF-quantized for efficiency. First use triggers automatic download.

Search Modes

Mode	Description
BM25	Document-level keyword matching via FTS5 + Snowball
Vector	Chunk-level semantic similarity with contextual embeddings
Hybrid	BM25 + vector with RRF fusion (2× weight for the original query, tiered bonus)
Reranked	Hybrid + full-document cross-encoder (32K context)

Graceful Degradation

GNO works with reduced capabilities when components are missing:

Component	Missing When	Behavior
sqlite-vec	Extension not loaded	BM25 search only
Embed model	Not downloaded	Vector search disabled
Rerank model	Not downloaded	Skip reranking
Gen model	Not downloaded	`--answer` disabled

Run gno doctor to check component status.
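
The table above can be read as a capability check performed before each search. The sketch below is one hypothetical way to express it; `Capabilities`, `SearchPlan`, and `planSearch` are illustrative names, and it assumes vector search needs both the sqlite-vec extension and the embedding model.

```typescript
interface Capabilities {
  sqliteVec: boolean; // sqlite-vec extension loaded
  embedModel: boolean; // embedding model downloaded
  rerankModel: boolean; // rerank model downloaded
  genModel: boolean; // generation model downloaded
}

interface SearchPlan {
  bm25: true; // keyword search is always available
  vector: boolean;
  rerank: boolean;
  answer: boolean;
}

// Each optional stage is enabled only when its dependencies are present.
function planSearch(caps: Capabilities): SearchPlan {
  return {
    bm25: true,
    vector: caps.sqliteVec && caps.embedModel,
    rerank: caps.rerankModel,
    answer: caps.genModel,
  };
}
```

With everything missing, the plan reduces to BM25 alone, which is the baseline that needs nothing beyond SQLite.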

File Locations

Run gno doctor to see resolved paths.

Link System

GNO extracts and tracks links between documents:

Link Types

Type	Syntax	Example
Wiki	`[[Target]]`	`[[My Note]]`
Wiki	`[[Target|Display]]`	`[[My Note|click here]]`
Wiki	`[[Target#Heading]]`	`[[My Note#Section]]`
Wiki	`[[collection:Target]]`	`[[work:Project Plan]]`
Wiki	`[Display]([[Target]])`	`[Plan]([[Project Plan]])`
Markdown	`[text](path.md)`	`[docs](./README.md)`

External URLs (https://) are not stored; only internal document links are tracked.
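
To make the wiki syntaxes concrete, here is a small parser sketch. It is not GNO's extractor: the regular expression and the `WikiLink` shape are assumptions, it treats any colon in the target as a collection prefix, and it does not attach display text for the `[Display]([[Target]])` form.

```typescript
interface WikiLink {
  collection?: string;
  target: string;
  anchor?: string;
  display?: string;
}

// Parses [[Target]], [[Target|Display]], [[Target#Heading]], [[collection:Target]].
function parseWikiLinks(text: string): WikiLink[] {
  const pattern = /\[\[([^\[\]|#]+)(?:#([^\[\]|]+))?(?:\|([^\[\]]+))?\]\]/g;
  const links: WikiLink[] = [];
  let match: RegExpExecArray | null;
  while ((match = pattern.exec(text)) !== null) {
    const [, rawTarget, anchor, display] = match;
    const colon = rawTarget.indexOf(":");
    const link: WikiLink = {
      target: (colon === -1 ? rawTarget : rawTarget.slice(colon + 1)).trim(),
    };
    if (colon !== -1) link.collection = rawTarget.slice(0, colon).trim();
    if (anchor !== undefined) link.anchor = anchor.trim();
    if (display !== undefined) link.display = display.trim();
    links.push(link);
  }
  return links;
}

const parsed = parseWikiLinks(
  "See [[My Note]], [[My Note|click here]], [[My Note#Section]] and [[work:Project Plan]].",
);
```

A real extractor would also need to skip frontmatter and code blocks and record line and column positions.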

Resolution

Links are resolved at query time rather than stored with target document IDs, so a renamed document does not leave stale references. Resolution depends on the link type:

Wiki links: Normalized title match with path-style fallbacks (basename/rel_path, optional .md)
Cross-collection: [[collection:Note]] syntax with explicit collection prefix
Markdown links: Resolved path stored for matching

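Note: Case-insensitive matching relies on SQLite lower(), which folds only ASCII characters unless SQLite is built with the ICU extension. Titles that differ only in the case of non-ASCII letters may therefore fail to match.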
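
Query-time resolution of a wiki link might look roughly like the sketch below, which runs in memory rather than against SQLite. The `Doc` shape, the normalization rules, and the fallback order (title, then rel_path, then basename) are assumptions for illustration. Unlike SQLite lower(), JavaScript's `toLowerCase()` is Unicode-aware, so the sketch does not reproduce the ASCII-only caveat.

```typescript
interface Doc {
  docid: string;
  title: string;
  relPath: string;
}

const normalize = (s: string): string => s.trim().toLowerCase().replace(/\s+/g, " ");
const stripMd = (s: string): string => s.replace(/\.md$/i, "");
const basename = (p: string): string => p.slice(p.lastIndexOf("/") + 1);

// Title match first, then path-style fallbacks with an optional .md suffix.
function resolveWikiLink(target: string, docs: Doc[]): Doc | undefined {
  const wantedTitle = normalize(target);
  const wantedPath = stripMd(wantedTitle);
  return (
    docs.find((d) => normalize(d.title) === wantedTitle) ??
    docs.find((d) => stripMd(normalize(d.relPath)) === wantedPath) ??
    docs.find((d) => stripMd(normalize(basename(d.relPath))) === wantedPath)
  );
}
```

Because nothing is resolved ahead of time, moving a file to a new path changes only the path fallbacks; a link that matches by title keeps resolving.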

Storage

The doc_links table stores:

Source document reference
Link type (wiki/markdown)
Target reference (raw and normalized)
Position (line/column for editor integration)
Optional anchor (#section) and display text

Links are extracted from original source content during sync, excluding frontmatter and code blocks.

Technical Notes

For implementation details, see:

How Search Works - Deep dive into query expansion, HyDE, and RRF fusion
spec/cli.md - CLI specification
spec/mcp.md - MCP specification