dc-api-v2 chatbot

Local development setup

Follow the instructions in the main README to deploy or sync a development stack that includes the CHAT feature.

Websocket Communication

The DC API uses websockets to enable real-time, bi-directional communication between the frontend application and the chat API. This is particularly important for streaming chat responses where messages need to be delivered incrementally.

Connection

The websocket endpoint is available at:

wss://[API_ENDPOINT]/chat

Client to Server Messages

To initiate a chat conversation, send:

{
  "question": "Your question here",
  "docs": ["work-id-1", "work-id-2"],
  "auth": "jwt-token",
  "stream_response": true,
  "ref": "abc123"
}

Additional optional parameters:

forget: boolean (default: false) - Start a new conversation
model: string - Specify the LLM model to use (superuser only)
k: number - Number of documents to retrieve (superuser only)
temperature: number - Model temperature (superuser only)
facets: array - Apply search filters/facets (see Facets/Filters section below)
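
As a minimal sketch of assembling the request above, the helper below serializes a message using only the field names documented in this section. The function name `build_chat_request` and its defaults are illustrative assumptions, not part of the API; superuser-only parameters are left out for brevity.

```python
import json
from typing import Any, Optional


def build_chat_request(
    question: str,
    auth: str,
    ref: str,
    docs: Optional[list[str]] = None,
    facets: Optional[list[dict[str, Any]]] = None,
    forget: bool = False,
    stream_response: bool = True,
) -> str:
    """Serialize a chat request using the documented field names (hypothetical helper)."""
    if not question.strip():
        # Blank questions are documented as a 400 Bad Request, so fail early.
        raise ValueError("question must not be blank")
    payload: dict[str, Any] = {
        "question": question,
        "auth": auth,
        "stream_response": stream_response,
        "ref": ref,
    }
    # Optional fields are included only when the caller sets them.
    if docs:
        payload["docs"] = docs
    if facets:
        payload["facets"] = facets
    if forget:
        payload["forget"] = True
    return json.dumps(payload)
```

The resulting string can be sent as a single text frame over the websocket connection by whichever client library the frontend uses.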

Server to Client Messages

Server Message Format

Messages sent by the server follow this general structure:

{
  "type": "string",      // Type of message
  "message": "string",   // Content of the message (a string, object, or array, depending on type; omitted for some types)
  "ref": "string"        // Reference ID for tracking the conversation
}

The server sends different types of messages:

Start Message:

{
  "type": "start",
  "message": {
    "model": "model_name"
  },
  "ref": "conversation-id"
}

Token Updates:

{
  "type": "token",
  "message": "partial response text",
  "ref": "conversation-id"
}

Stop Message:

{
  "type": "stop",
  "ref": "conversation-id"
}

Answer Message:

{
  "type": "answer",
  "message": "response content",
  "ref": "conversation-id"
}

Final Message:

{
  "type": "final_message",
  "ref": "conversation-id"
}

Tool Start:

{
  "type": "tool_start",
  "message": {
    "tool": "tool_name",
    "input": "tool input"
  },
  "ref": "conversation-id"
}

Aggregation Result:

{
  "type": "aggregation_result",
  "message": {
    // example aggregation result object
    "buckets": [
      {
        "key": "bucket-key",
        "doc_count": 10
      }
    ],
    "sum_other_doc_count": 34,
    "doc_count_error_upper_bound": 0
  },
  "ref": "conversation-id"
}

Search Result:

{
  "type": "search_result",
  "message": [
    {
      "id": "work-id",
      "title": "work title",
      "visibility": "visibility status",
      "work_type": "type",
      "thumbnail": "thumbnail url"
    }
  ],
  "ref": "conversation-id"
}

Final Completion:

{
  "type": "final",
  "message": "Finished", // Hard-coded value for the message
  "ref": "conversation-id"
}

Error Messages:

{
  "type": "error",
  "message": "error description",
  "ref": "conversation-id"
}
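
One way a client might consume these messages is to dispatch on the `type` field and accumulate state as messages arrive. The sketch below is an assumption about client-side handling, not code from this repository: `ChatTranscript` and `handle_message` are hypothetical names, and the README does not specify the order in which message types arrive, so the handler makes no ordering assumptions.

```python
import json
from dataclasses import dataclass, field
from typing import Any, Optional


@dataclass
class ChatTranscript:
    """Client-side accumulator for one response (hypothetical helper)."""
    model: Optional[str] = None
    tokens: list[str] = field(default_factory=list)
    answer: Optional[str] = None
    search_results: list[dict[str, Any]] = field(default_factory=list)
    errors: list[str] = field(default_factory=list)
    done: bool = False


def handle_message(transcript: ChatTranscript, raw: str) -> ChatTranscript:
    """Apply one server message to the transcript, keyed on its "type"."""
    msg = json.loads(raw)
    kind = msg.get("type")
    body = msg.get("message")
    if kind == "start":
        transcript.model = body["model"]
    elif kind == "token":
        transcript.tokens.append(body)
    elif kind == "answer":
        transcript.answer = body
    elif kind == "search_result":
        transcript.search_results.extend(body)
    elif kind == "error":
        transcript.errors.append(body)
    elif kind == "final":
        transcript.done = True
    # stop, final_message, tool_start, and aggregation_result are ignored
    # in this sketch; a real client would likely surface them in the UI.
    return transcript
```

Joining `transcript.tokens` yields the text streamed so far, which is useful for rendering partial responses.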

Error Handling

401 Unauthorized: Returned when the authentication token is missing or invalid
400 Bad Request: Returned when the question is blank or missing
Connection errors will emit an "error" type message through the websocket

Security and Authentication

The chat service uses JWT-based authentication inherited from the main DC API. Each websocket connection requires a valid JWT, supplied in the auth field of the request payload.

Token Requirements

Tokens must be signed with the shared API secret
Tokens contain user entitlements and authentication status
Standard tokens expire after 12 hours
Anonymous access is supported with limited capabilities
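
To illustrate what shared-secret validation with expiry involves, here is a standard-library sketch. It assumes HS256 signing and the standard `exp` claim; this README does not state the algorithm or claim names the DC API actually uses, and `sign_token` and `verify_token` are hypothetical helpers rather than the `ApiToken` implementation. In practice a maintained JWT library is preferable to hand-rolled verification.

```python
import base64
import hashlib
import hmac
import json
import time
from typing import Any, Optional


def _b64url_decode(segment: str) -> bytes:
    # JWT segments omit base64 padding; restore it before decoding.
    return base64.urlsafe_b64decode(segment + "=" * (-len(segment) % 4))


def _b64url_encode(data: bytes) -> str:
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode("ascii")


def _signature(header: str, payload: str, secret: str) -> bytes:
    signing_input = f"{header}.{payload}".encode("ascii")
    return hmac.new(secret.encode(), signing_input, hashlib.sha256).digest()


def sign_token(claims: dict[str, Any], secret: str) -> str:
    """Create an HS256 JWT (assumed algorithm; used here to produce sample input)."""
    header = _b64url_encode(json.dumps({"alg": "HS256", "typ": "JWT"}).encode())
    payload = _b64url_encode(json.dumps(claims).encode())
    return f"{header}.{payload}.{_b64url_encode(_signature(header, payload, secret))}"


def verify_token(token: str, secret: str, now: Optional[float] = None) -> dict[str, Any]:
    """Return the claims if the signature is valid and the token is unexpired."""
    parts = token.split(".")
    if len(parts) != 3:
        raise ValueError("malformed token")
    header, payload, signature = parts
    if json.loads(_b64url_decode(header)).get("alg") != "HS256":
        raise ValueError("unsupported algorithm")
    # Constant-time comparison avoids leaking signature bytes through timing.
    if not hmac.compare_digest(_signature(header, payload, secret), _b64url_decode(signature)):
        raise ValueError("invalid signature")
    claims = json.loads(_b64url_decode(payload))
    current = time.time() if now is None else now
    if "exp" in claims and claims["exp"] <= current:
        raise ValueError("token expired")
    return claims
```

A token issued with `exp` set 12 hours after issuance matches the expiry window described above.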

Security Features

Token validation occurs on every connection
User privileges are enforced via the ApiToken class
Advanced features like model selection and debug mode require superuser status
Temperature and context window size limits are enforced for non-superusers
All chat interactions are logged for auditing purposes

Environment Configuration

The following security-related environment variables must be configured:

API_TOKEN_NAME - Name of the JWT cookie/header
API_TOKEN_SECRET - Shared secret for validating JWTs

Authorization Levels

The chat service implements the following authorization levels:

Unauthorized: No access to chat functionality
Authenticated Users (config.is_logged_in=true):
- Basic chat functionality
- Default model settings
- Standard context window limits
- Fixed temperature settings
Dev Team (config.is_dev_team=true):
- Same access as authenticated users
- Flagged in metrics logs for filtering development traffic
- No additional feature permissions
Superusers (config.is_superuser=true):
- Custom prompts
- Model selection
- Debug mode
- Temperature control
- Unrestricted context window
- Ability to override system defaults
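
The gating described above can be sketched as a merge step in which superuser-only overrides are honored only for superusers. The default values below are placeholders, since this README does not state the real ones, and whether the service silently ignores or rejects disallowed overrides is also an assumption; `effective_settings` is a hypothetical name.

```python
from typing import Any

# Placeholder defaults; the real values are not stated in this README.
DEFAULTS: dict[str, Any] = {"model": "default-model", "k": 10, "temperature": 0.2}

# Request parameters documented as "superuser only".
SUPERUSER_ONLY = ("model", "k", "temperature")


def effective_settings(is_superuser: bool, requested: dict[str, Any]) -> dict[str, Any]:
    """Merge requested overrides into the defaults, honoring superuser-only keys."""
    settings = dict(DEFAULTS)
    if is_superuser:
        for key in SUPERUSER_ONLY:
            if requested.get(key) is not None:
                settings[key] = requested[key]
    # For everyone else the overrides are dropped and the defaults apply.
    return settings
```

Keeping the superuser-only keys in one tuple makes it easy to audit which parameters are restricted.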

Facets and Filters System

The chat system supports filtering search results using facets, which allows users to narrow down their searches to specific subsets of the digital collections. The system implements a dual-layer approach that maintains both functional filtering and LLM awareness.

Overview

When facets are applied to a chat conversation, the system:

Injects facets into tool calls - Ensures search and aggregation tools receive the filter parameters
Makes the LLM aware of filters - Adds context to the system message so the LLM understands it's working with filtered data
Persists facets in conversation state - Maintains filter context throughout the entire conversation

Usage

To apply facets to a chat conversation, include them in the initial request:

{
  "question": "What photography is available?",
  "facets": [
    {"subject.label": "Nigeria"},
    {"collection.title.keyword": "E. H. Duckworth Photograph Collection"},
    {"work_type.keyword": "Image"}
  ],
  "auth": "jwt-token",
  "ref": "abc123"
}

Implementation Details

State Management (`SearchAgentState`)

Extends LangGraph's MessagesState to include a facets field
Persists facets throughout the conversation using LangGraph's state management
Allows facets to be accessed at any point in the conversation flow

from typing import List, Optional
from langgraph.graph import MessagesState

class SearchAgentState(MessagesState):
    """Extended state that includes facets context for the search agent."""
    facets: Optional[List[dict]] = None

Tool Injection (`FacetsToolNode`)

Automatically injects facets into search and aggregate tool calls
Uses facets from state if available; otherwise falls back to instance facets
Preserves existing facets if already present in tool arguments
Only injects into tools that support facet filtering
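
The injection rules listed above can be sketched as a pure function over a single tool call. This is not the `FacetsToolNode` source: it assumes tool calls are dictionaries with `name` and `args` keys, that the facet-aware tools are named `search` and `aggregate`, and that an empty facets list counts as unavailable.

```python
import copy
from typing import Any, Optional

# Assumed tool names; the README only says "search and aggregate tool calls".
FACET_TOOLS = frozenset({"search", "aggregate"})


def inject_facets(
    tool_call: dict[str, Any],
    state_facets: Optional[list[dict[str, Any]]],
    instance_facets: Optional[list[dict[str, Any]]] = None,
) -> dict[str, Any]:
    """Return a tool call with facets added when the rules above allow it."""
    # Prefer facets from conversation state, then fall back to instance facets.
    facets = state_facets if state_facets else instance_facets
    args = tool_call.get("args") or {}
    if tool_call.get("name") not in FACET_TOOLS:
        return tool_call  # tool does not support facet filtering
    if not facets or args.get("facets"):
        return tool_call  # nothing to inject, or the call already has facets
    # Copy so the original tool call and the stored facets are not mutated.
    updated = copy.deepcopy(tool_call)
    updated["args"] = {**args, "facets": copy.deepcopy(facets)}
    return updated
```

Returning a copy rather than mutating in place keeps the original message history unchanged, which makes the behavior easier to test.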

LLM Context Awareness

When facets are present, the system automatically enhances the system message with context like:

IMPORTANT CONTEXT: The user's search is currently filtered/scoped to specific content. 
Active filters: Subject: Nigeria; Collection Title: E. H. Duckworth Photograph Collection; Work Type: Image

When answering, be aware that:
- All search results are already filtered by these criteria
- You should acknowledge this context in your responses (e.g., "In the filtered results..." or "Among the Work Type content...")
- Do NOT attempt to broaden the search or remove these filters
- If results seem limited, explain that this is due to the applied filters rather than suggesting to broaden the search
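
The "Active filters" line in the context above could be derived from the facets array roughly as follows. The label rule here (drop a trailing `.keyword` or `.label`, then title-case the rest) is a heuristic chosen to reproduce the example, not necessarily the rule the service applies; `facet_label` and `describe_facets` are hypothetical names.

```python
from typing import Any


def facet_label(field_name: str) -> str:
    """Turn a field such as "collection.title.keyword" into "Collection Title"."""
    for suffix in (".keyword", ".label"):
        if field_name.endswith(suffix):
            field_name = field_name[: -len(suffix)]
    return field_name.replace(".", " ").replace("_", " ").title()


def describe_facets(facets: list[dict[str, Any]]) -> str:
    """Build a one-line, human-readable summary of the active filters."""
    parts = []
    for facet in facets:
        for field_name, value in facet.items():
            # A facet value may be a single value or a list of values.
            values = value if isinstance(value, list) else [value]
            joined = ", ".join(str(v) for v in values)
            parts.append(f"{facet_label(field_name)}: {joined}")
    return "Active filters: " + "; ".join(parts)
```

Applied to the three facets from the Usage example, this yields the same "Active filters" line shown in the context above.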

Benefits

Improved Response Quality: The LLM provides contextually appropriate responses that acknowledge the applied filters
Prevents Search Broadening: The LLM won't attempt to remove filters when results seem limited
Better User Experience: Responses are worded appropriately for filtered contexts
Backward Compatibility: Existing tool injection continues to work

Facet Format

Facets should be provided as an array of objects where each object represents a filter:

[
  {"field_name": "single_value"},
  {"field_name": ["multiple", "values"]},
  {"nested.field.keyword": "exact_match"}
]

Common facet fields include:

subject.label - Subject terms
collection.title.keyword - Collection names
work_type.keyword - Work types (Image, Text, etc.)
genre.label - Genre classifications
language.label - Languages
date.created - Creation dates
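
Because a facet value may be either a single value or a list, callers may find it convenient to validate and normalize the array before sending it. The following is an illustrative sketch only; `normalize_facets` is a hypothetical helper, and the service may apply different validation.

```python
from typing import Any


def normalize_facets(facets: Any) -> list[tuple[str, list[Any]]]:
    """Validate the facet array and return (field, values) pairs."""
    if not isinstance(facets, list):
        raise ValueError("facets must be an array")
    normalized: list[tuple[str, list[Any]]] = []
    for index, facet in enumerate(facets):
        if not isinstance(facet, dict) or not facet:
            raise ValueError(f"facet {index} must be a non-empty object")
        for field_name, value in facet.items():
            # Wrap single values so downstream code handles one shape only.
            values = value if isinstance(value, list) else [value]
            normalized.append((field_name, values))
    return normalized
```

Normalizing to lists means downstream code never has to branch on whether a filter holds one value or many.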

Architecture Notes

The facets system leverages LangGraph's state management patterns:

Static Runtime Context: Immutable data passed at startup (not used for facets)
Dynamic Runtime Context: Mutable conversation state (used for facets persistence)
Tool Node Integration: Custom tool wrapper that intercepts and modifies tool calls

This approach ensures facets are both applied to every search and aggregation call and reflected in the LLM's system message, so the filtering and the wording of responses stay consistent throughout a filtered conversation.
