Gideon Discord Bot - Documentation

Gideon is a feature-rich Discord bot that leverages AI capabilities to provide conversation, image generation, thread management, trivia games, and more. It integrates with multiple AI services and providers (like OpenRouter, OpenAI, AI Horde, Cloudflare Workers) to deliver a comprehensive AI assistant experience within Discord.

The bot follows a modular architecture using Py-Cord’s cogs system, with state management via a singleton pattern that provides persistence across restarts using a SQLite database. It supports multiple AI models from various providers and maintains conversation context for natural interactions.

Overview
Getting Started (User/Administrator Guide)
Bot Usage (User/Administrator Guide)
Architecture and Development (Developer Guide)
Contributing
License

1. Overview

Gideon is a versatile Discord bot designed to enhance server interactions with advanced AI capabilities. Its core functionalities include:

AI Conversation: Engage in natural language conversations with the bot, powered by various AI models from multiple providers (e.g., OpenRouter, OpenAI).
Thread Management: Utilize Discord’s native threads for focused AI conversations with per-thread configuration overrides for models and system prompts.
Image Generation: Create images using different AI backends (AI Horde, Cloudflare Worker, OpenAI DALL-E, ComfyUI, OpenRouter) via a unified command interface.
Trivia Games: Play interactive trivia with AI-generated questions on any topic, featuring solo and competitive modes, scoring, achievements, and leaderboards.
Flexible Configuration: Customize bot behavior globally, per channel, or per thread for AI providers, models, and system prompts using organized command groups.

The bot is built on the Py-Cord library, using a modular “cog” system for organizing commands and features. State persistence (conversation history, configurations, etc.) is handled through a SQLite database, ensuring data survives bot restarts and updates.

2. Getting Started (User/Administrator Guide)

This section guides users and administrators through setting up and running the Gideon bot.

Prerequisites

Before you begin, ensure you have the following:

Python 3.8 or higher: Required for Method 1 (Python).
Docker: Required for Method 2 (Docker).
A Discord Account: You need a Discord account to create a bot application and invite it to your server.
Server Permissions: You need “Manage Server” permissions on the Discord server where you want to add the bot.
API Keys: You will need API keys for the AI services you wish to use. At a minimum, you need a Discord bot token. You will also need API keys for the specific AI providers you intend to use (e.g., OpenRouter, OpenAI).

Installation

You can install and run Gideon using either a direct Python environment or Docker.

Method 1: Using Python

Clone the Repository:

git clone https://github.com/eoko-dev/gideon
cd gideon

Install Dependencies:
```
pip install -r requirements.txt
```

Method 2: Using Docker

Ensure Docker is Installed: Download and install Docker Desktop or Docker Engine for your operating system if you haven’t already.

Clone the Repository:

git clone https://github.com/eoko-dev/gideon
cd gideon

Build the Docker Image:
```
docker build -t gideon-bot .
```
This command builds the Docker image based on the Dockerfile in the repository.

Configuration

Gideon is configured primarily through environment variables, typically stored in a .env file in the root directory of the project.

Create a .env file: Copy the example environment file:
```
cp .env.example .env
```
Edit the .env file: Open the newly created .env file in a text editor and fill in the required and optional variables.
- Essential Variables:
  - DISCORD_TOKEN: Your Discord bot token from the Discord Developer Portal.
- API Keys (At least one provider is needed for chat):
  - OPENROUTER_API_KEY: Your API key from OpenRouter.ai. Allows access to many models.
  - OPENAI_API_KEY: Your API key for OpenAI. Needed for OpenAI models (chat and DALL-E image generation).
  - AI_HORDE_API_KEY: Your API key for AI Horde (optional, for image generation and potentially text models).
  - CLOUDFLARE_WORKER_URL: The URL of your Cloudflare Worker for image generation (optional).
  - CLOUDFLARE_API_KEY: API key for your Cloudflare Worker (optional, if your worker requires authentication).
  - COMFYUI_URL: URL of your ComfyUI server for image generation (optional, e.g., http://127.0.0.1:8188).
- Optional Variables:
  - SYSTEM_PROMPT: Customize the bot’s default personality. This prompt is sent to the AI model at the beginning of conversations.
  - DEFAULT_MODEL: Set the default AI model to use (e.g., openrouter/openai/gpt-4o-mini). Use the format provider/model_name. The provider part must match the keys used internally (e.g., openrouter, openai, ai_horde).
  - DATA_DIRECTORY: The directory where the SQLite database (gideon_state.db) and other data (like temporary image files) will be stored.
    - If using Python: Set this to a local path, e.g., ./data.
    - If using Docker: The default /app/data is recommended as it aligns with the Dockerfile and docker-compose.yml. You will typically map a local directory to this path when running the container to persist data.
  - INTENT_DISCOVERY: Enable or disable natural language intent detection for @mentions (default: true). Set to false to disable.
  - INTENT_DETECTION_MODEL: The AI model used for intent classification (default: anthropic/claude-3.5-haiku). Must be in provider/model format.
  - INTENT_CONFIDENCE_THRESHOLD: Minimum confidence score (0.0-1.0) required to route to specialized handlers (default: 0.7).
Example .env:
```
# Discord Bot Token
DISCORD_TOKEN=your_discord_token_here

# --- API Keys (Provide keys for the services you want to use) ---
# OpenRouter API Key (Recommended for broad model access)
OPENROUTER_API_KEY=your_openrouter_api_key_here

# OpenAI API Key (for OpenAI models and Dall-E image generation)
OPENAI_API_KEY=your_openai_api_key_here

# AI Horde API Key (optional, for image/text generation)
AI_HORDE_API_KEY=your_ai_horde_api_key_here

# Cloudflare Worker (optional, for image generation)
CLOUDFLARE_WORKER_URL=https://your-worker-url.workers.dev/
CLOUDFLARE_API_KEY=your_cloudflare_api_key_here  # Optional

# ComfyUI (optional, for image generation)
COMFYUI_URL=http://127.0.0.1:8188

# --- Optional Settings ---
# System prompt for the AI assistant
SYSTEM_PROMPT=You are Gideon, a helpful and friendly AI assistant.

# Default model to use (provider/model_name format)
DEFAULT_MODEL=openrouter/openai/gpt-4o-mini # Default uses OpenRouter

# Data directory path. ONLY CHANGE THIS PATH IF YOU KNOW WHAT YOU ARE DOING! IT IS SET TO FUNCTION WITH DOCKER BY DEFAULT
DATA_DIRECTORY=./data # Or /app/data if using Docker

# --- Intent Detection Settings ---
# Enable natural language intent detection (default: true)
INTENT_DISCOVERY=true
# Model for intent classification (default: anthropic/claude-3.5-haiku)
INTENT_DETECTION_MODEL=anthropic/claude-3.5-haiku
# Confidence threshold for intent routing (default: 0.7)
INTENT_CONFIDENCE_THRESHOLD=0.7
```

Running the Bot

Running with Python

Navigate to the bot’s root directory in your terminal and run:

python src/__main__.py

Running with Docker

You can run the Docker image directly or use docker-compose. Using docker-compose is recommended as it handles mounting the data directory for persistence.

Using docker-compose (Recommended):

Ensure you have edited the .env file. From the bot’s root directory, run:

docker-compose up -d

This will build the image (if not already built) and start the container in detached mode. The docker-compose.yml file is configured to mount a local ./data directory to /app/data inside the container, ensuring your database and other data persist.

Running Docker Image Directly:

Ensure you have edited the .env file. From the bot’s root directory, run:

docker run -d --env-file .env -v ./data:/app/data gideon-bot

This command runs the gideon-bot image in detached mode (-d), passes environment variables from the .env file (--env-file .env), and mounts the local ./data directory to /app/data inside the container (-v ./data:/app/data).

The bot should connect to Discord and appear online in your server.

Adding to Your Server

To add the bot to your Discord server, you need to generate an invite link from the Discord Developer Portal.

Go to the Discord Developer Portal.
Select your bot application.
Go to “OAuth2” -> “URL Generator”.
Under “SCOPES”, select bot.
Under “BOT PERMISSIONS”, select the necessary permissions. Recommended permissions include:
- Read Messages/View Channels
- Send Messages
- Embed Links
- Attach Files
- Manage Messages (for clearing history, etc.)
- Use Slash Commands
- Manage Threads (if using native threads)
- Read Message History
Copy the generated URL and paste it into your browser. Select the server you want to add the bot to and authorize it.

3. Bot Usage (User/Administrator Guide)

Gideon uses Discord’s slash commands (/) for all interactions.

General Concepts

Slash Commands: Type / in the chat bar to see a list of available commands. Commands are organized into logical groups (e.g., /dream_manage).
Interactive Help: Use /help to access an interactive menu that lets you browse commands by category. The help system is role-aware and shows admin commands only to administrators.
Command Visibility: Admin commands (/settings, /channel, /admin, /dream_manage) are hidden from regular users in Discord’s command picker. Administrators will see all commands.
Permissions: Some commands are restricted to server administrators or the bot owner. If a command doesn’t appear or doesn’t work, you may lack the necessary permissions.
Providers: Gideon can use different backend services (providers) for AI tasks like chat (OpenRouter, OpenAI, AI Horde) and image generation (AI Horde, Cloudflare, OpenAI, ComfyUI). Administrators can configure which provider is used globally or per channel.

Help Command

/help: Opens an interactive help menu with category-based navigation. Categories include:
- 💬 Chat & AI - Conversation commands (/chat, /search, /reset, /summarize, /memory)
- 🔗 URL Tools - URL summarization (/summarizeurl)
- 🧵 Threads - Thread management (/thread *)
- 🎮 Trivia - Trivia games (/trivia *)
- 🎨 Images - Image generation (/dream, plus admin management commands)
- ⏰ Reminders - Reminder management (/remind *)
- ⚙️ Settings - Global and channel configuration (Admin only)
- 🔐 Admin - Administrative tools (Admin only)

AI Chat Commands

These commands are for general conversation and interaction with the AI using the configured provider and model.

/chat <prompt> [attachment]: Engage in a conversation with the AI. Provide your text prompt. You can also attach an image when using a model that supports vision (e.g., openai/gpt-4o).
/reset: Clears the conversation history for the current channel or thread. This is useful if the conversation goes off track or you want to start fresh.
/summarize: Generates a summary of the recent conversation history in the current channel or thread.
/memory: Shows statistics about the conversation history being stored for the current channel or thread, including message count and time window.

Natural Language Intent Detection

Gideon features an AI-powered intent detection system that allows users to interact with the bot through natural language @mentions instead of remembering specific slash command syntax. When you @mention Gideon, the bot analyzes your message using AI to determine what you want to do and automatically routes to the appropriate feature.

How It Works

Detection Process: When you @mention the bot (e.g., “@Gideon remind me to check the server tomorrow”), the message is sent to an AI model configured for intent detection.
Intent Classification: The AI analyzes your message and returns:
- Intent Type: The category of action you want (reminder, image_generation, search, conversation, or unknown)
- Confidence Score: A value from 0.0 to 1.0 indicating how certain the AI is about the classification
- Extracted Data: Structured parameters extracted from your natural language (e.g., reminder time, image prompt, search query)
Routing Decision:
- If confidence ≥ threshold (default 0.7), routes to the specialized handler
- If confidence < threshold, falls back to normal conversation
Execution: The appropriate handler executes with the extracted parameters

Supported Intents

1. Reminder Intent

Purpose: Schedule notifications for future events
Indicators: “remind me”, “set a reminder”, “notification”, time expressions
Examples:
- “@Gideon remind me to check the server logs tomorrow at 3pm”
- “@Gideon set a reminder for the meeting on Friday at 2pm”
- “@Gideon notification to restart the bot in 2 hours”
Extracted Data:
- reminder_text: What to be reminded about
- reminder_time: When the reminder should trigger (natural language or timestamp)

2. Image Generation Intent

Purpose: Create AI-generated images from text descriptions
Indicators: “draw”, “generate an image”, “create a picture”, “make an image”, “paint”, “illustrate”, “sketch”, “render”
Examples:
- “@Gideon draw a sunset over mountains with vibrant colors”
- “@Gideon generate an image of a futuristic cityscape”
- “@Gideon create a picture of a cat wearing a top hat, but no background”
Extracted Data:
- prompt: Main description of what to generate (required)
- negative_prompt: What to exclude (optional, detected from “no”, “without”, “avoid”, “but not”)
- size: Dimensions if specified (e.g., “1024x1024”, “landscape”, “portrait”)
- quality: Quality setting (“hd”, “high quality”, “standard”)
- style: Style preference (“vivid”, “natural”, “realistic”, “artistic”)
Provider Behavior: Uses the currently active image generation provider from the database (configured via /dream manage set_provider)
Error Handling: If image generation fails, the bot prefixes the response with “Image generation failed.” and then continues with a conversational response about the request

3. Search Intent

Purpose: Fetch current, real-time information from the web
Indicators:
- Direct keywords: “search for”, “look up”, “find information about”, “google”
- Time-sensitive language: “what’s the latest”, “current”, “today’s”, “recent”, “now”
- Real-time data requests: “weather”, “news”, “stock price”, “live scores”, “breaking”
Examples:
- “@Gideon what are the current best games on Xbox Game Pass”
- “@Gideon what’s the weather in Seattle today”
- “@Gideon latest AI news”
- “@Gideon current stock price of NVIDIA”
Extracted Data:
- query: The search query/question (required)
Provider Requirement: Search ONLY works with the OpenRouter provider (uses web search plugin)
Provider Fallback: If the current provider is not OpenRouter, the bot will:
1. Inform you that web search requires OpenRouter
2. Automatically fall back to conversation mode
3. Attempt to answer your question using the AI’s knowledge (without web search)
Distinguishing from Conversation:
- Search: Time-sensitive queries, current events, real-time data
  - Examples: “latest Python release”, “current weather in NYC”, “today’s news”
- Conversation: General knowledge, explanations, opinions, timeless topics
  - Examples: “what is Python”, “explain photosynthesis”, “tell me about history”

4. Conversation Intent

Purpose: General chat and Q&A
Indicators: Questions, statements, or requests that don’t fit other intents
Examples:
- “@Gideon what is machine learning?”
- “@Gideon tell me a joke”
- “@Gideon help me understand this code”
Behavior: Uses the configured provider and model for the channel to generate a conversational response

5. Unknown Intent

Purpose: Fallback when the AI cannot determine intent
Behavior: Defaults to conversation mode

Configuration

Intent detection is configured through environment variables in your .env file:

# Enable or disable intent detection (default: true)
INTENT_DISCOVERY=true

# AI model used for intent classification (default: anthropic/claude-3.5-haiku)
INTENT_DETECTION_MODEL=anthropic/claude-3.5-haiku

# Minimum confidence threshold (0.0-1.0) to trigger intent routing (default: 0.7)
INTENT_CONFIDENCE_THRESHOLD=0.7

Configuration Details:

INTENT_DISCOVERY: Set to false to completely disable intent detection. All @mentions will be treated as conversation.
INTENT_DETECTION_MODEL: The AI model used for analyzing intent. Must be in provider/model format. Claude 3.5 Haiku is recommended for its speed and accuracy at this task.
INTENT_CONFIDENCE_THRESHOLD: Controls how certain the AI must be before routing to a specialized handler. Lower values (0.5-0.6) will route more aggressively but may misclassify. Higher values (0.8-0.9) will be more conservative but may miss valid intents.

Implementation Details (for Developers)

The intent detection system is implemented in src/cogs/mention_commands.py:

System Prompt Engineering (lines 94-151): A detailed system prompt instructs the AI on how to classify messages, what indicators to look for, and what data to extract for each intent type.

Structured JSON Output (line 161): The AI returns a JSON object with intent, confidence, and data fields:

{
  "intent": "search",
  "confidence": 0.9,
  "data": {"query": "what are the current best games on Xbox Game Pass"}
}

Intent Handlers: Each intent has a dedicated handler method:
- handle_reminder_request() (existing)
- handle_image_generation_request() (lines 287-524)
- handle_search_request() (lines 605-773)
Routing Logic (lines 947-965): After intent detection, the appropriate handler is called based on the intent type, or falls back to conversation.
Conversation History: All intent-triggered actions (reminders, images, searches) are added to the channel’s conversation history, enabling contextual follow-up questions.

Thread Management Commands

Gideon leverages Discord’s native threads for focused conversations, allowing per-thread configuration.

/thread new <name>: Creates a new public thread with the AI in the current channel. The AI will join the thread and you can chat within it.
/thread message <thread> <prompt>: Sends a message to a specific thread by mentioning the thread (e.g., #thread-name). Useful if you are not currently viewing that thread.
/thread list: Lists all active threads the bot is aware of in the current server, showing their names and IDs.
/thread show: View the current thread’s configuration settings (model, system prompt, statistics).
/thread model [model]: Sets the AI model for the current thread. Provide the model ID in provider/model_name format (e.g., openrouter/openai/gpt-4o).
/thread system [prompt]: Sets a custom system prompt for the current thread. Provide the desired prompt text.
/thread rename <thread> <new_name>: Renames a specific thread in the bot’s database.
/thread delete <thread>: Deletes a specific thread and its associated history from the bot’s database.

Image Generation Commands

The /dream command provides a unified interface for generating images using different AI providers configured in the .env file.

/dream <prompt> [negative_prompt]: Generate an image using the currently configured AI backend.
- prompt: Describe the image you want to create. Be as descriptive as possible.
- negative_prompt: (Optional) Describe elements or styles you want to avoid in the generated image. Support for negative prompts varies by provider (e.g., OpenAI DALL-E does not support it directly).
Available Providers:
- AI Horde: A decentralized, crowdsourced image generation service. Requires AI_HORDE_API_KEY in .env. Offers various Stable Diffusion models.
- Cloudflare Worker: Allows using a custom image generation backend hosted on Cloudflare Workers. Requires CLOUDFLARE_WORKER_URL and optionally CLOUDFLARE_API_KEY in .env. The specific models and parameters supported depend on the worker implementation.
- OpenAI: Uses OpenAI’s DALL-E models (DALL-E 2, DALL-E 3). Requires OPENAI_API_KEY in .env.
- ComfyUI: Powerful node-based UI for advanced image generation workflows. Requires COMFYUI_URL in .env pointing to a running ComfyUI server (local or remote). Supports custom workflows, multiple checkpoint models, and advanced features like LoRA.
- OpenRouter: Access various image generation models through OpenRouter’s unified API. Uses the same OPENROUTER_API_KEY as chat. Supports Google Gemini image models (e.g., google/gemini-2.5-flash-image), FLUX models, and other providers with image output capabilities. Offers Gemini-specific features like aspect_ratio (1:1, 16:9, 9:16, 4:3, 3:4) and image_size (1K, 2K, 4K).
Admin Management Commands (/dream_manage): (Administrator Only)
- /dream_manage set_provider <provider>: Set the active image generation provider for the bot (ai_horde, cloudflare, openai, comfyui, openrouter). The bot will use this provider for all /dream commands until changed.
- /dream_manage view_config: View the current active provider and its default configuration settings (model, size, steps, etc.) as stored in the database.
- /dream_manage comfyui_test: Test connection to ComfyUI server and view system stats (ComfyUI only).
- /dream_manage comfyui_models: List available checkpoint models from ComfyUI server (ComfyUI only).
- /dream_manage comfyui_workflow <workflow_json>: Set a custom ComfyUI workflow JSON or use ‘reset’ to restore default (ComfyUI only).
Admin Configuration Commands (/dream_manage configure): (Administrator Only)
- /dream_manage configure ai_horde <model> <size> <steps>: Configure default settings for the AI Horde provider.
  - model: The default AI Horde model name (e.g., stable_diffusion_xl).
  - size: The default image resolution (e.g., 1024x1024). Must be a multiple of 64.
  - steps: The default number of generation steps (e.g., 30). Higher steps can increase detail but take longer.
- /dream_manage configure cloudflare <size> <steps> [seed]: Configure default settings for the Cloudflare Worker provider.
  - size: The default image resolution (e.g., 768x768).
  - steps: The default number of generation steps (e.g., 25).
  - seed: (Optional) A default random seed for reproducibility. Leave blank for a random seed each time.
- /dream_manage configure openai <model> [quality] [style]: Configure default settings for the OpenAI (DALL-E) provider.
  - model: The default OpenAI model (dall-e-2 or dall-e-3).
  - quality: (Optional, DALL-E 3 only) Image quality (standard or hd).
  - style: (Optional, DALL-E 3 only) Image style (vivid or natural).
- /dream_manage configure comfyui <size> <steps> [model]: Configure default settings for the ComfyUI provider.
  - size: The default image resolution (e.g., 512x512, 1024x1024, 768x512).
  - steps: The default number of generation steps (10-150).
  - model: (Optional) Default checkpoint model name. Use /dream_manage comfyui_models to list available models.
- /dream_manage configure openrouter <model> [aspect_ratio] [image_size]: Configure default settings for the OpenRouter provider.
  - model: The OpenRouter model ID (e.g., google/gemini-2.5-flash-image, black-forest-labs/flux.2-pro). Use autocomplete to see available models.
  - aspect_ratio: (Optional, Gemini only) Image aspect ratio (1:1, 16:9, 9:16, 4:3, 3:4). Default: 1:1.
  - image_size: (Optional, Gemini only) Image resolution (1K, 2K, 4K). Default: 1K.

Trivia Commands

Play interactive trivia games powered by AI-generated questions on any topic. The trivia system creates dedicated thread-based games where players simply type their answers naturally.

Game Modes:

Solo Mode: Personal trivia session where you test your knowledge
Competitive Mode: Race against other players - first correct answer wins each round

Available Commands:

/trivia start [mode] [category] [difficulty] [questions]: Start a new trivia game
- mode: Choose solo (personal session) or competitive (multiplayer race)
- category: Any topic you want - the AI generates questions dynamically (e.g., “science”, “80s movies”, “pokemon”, “cooking”)
- difficulty: Choose easy, medium, or hard (default: medium)
- questions: Number of questions (1-50, default: 10)
- Creates a dedicated Discord thread where the game takes place
/trivia stop: End the current trivia game early (use in a trivia thread)
- Only the game host can stop a solo game
- Anyone can stop a competitive game
/trivia stats [user]: View trivia statistics
- Shows total games played, accuracy, points earned, streaks, and achievements
- Omit the user parameter to view your own stats
/trivia leaderboard [timeframe]: View server rankings
- timeframe: Choose daily, weekly, monthly, or all_time (default: all_time)
- Shows top 10 players with their stats
/trivia achievements: Display all your earned achievement badges
- Shows achievement names, descriptions, and unlock dates

How to Play:

Start a Game: Use /trivia start and select your mode, category, and difficulty
Join the Thread: The bot creates a dedicated thread and posts the first question
Answer Questions: Simply type your answer in the thread:
- Type the letter (A, B, C, or D)
- Or type the full answer text
- The bot validates answers with fuzzy matching for spelling variations
Earn Points: Score is based on:
- Difficulty: Easy (100), Medium (200), Hard (300) base points
- Speed Bonus: Answer faster for up to +50% bonus points
- Streak Multiplier: Consecutive correct answers multiply your score (up to 2x)
Track Progress: View your stats and compete on leaderboards
Unlock Achievements: Earn badges for milestones like perfect games, speed records, and win streaks

Scoring System:

Base points: 100 (easy), 200 (medium), 300 (hard)
Speed bonus: < 5s (+50%), < 10s (+30%), < 20s (+10%)
Streak multiplier: 3-4 correct (1.1x), 5-9 (1.25x), 10-19 (1.5x), 20+ (2.0x)

Example Achievements:

🎯 First Blood - Answer your first question correctly
🔥 On Fire - Get 5 correct answers in a row
⚡ Unstoppable - Get 10 correct answers in a row
💯 Perfectionist - Complete a game with 100% accuracy
💨 Speed Demon - Answer 10 questions in under 3 seconds each
🏆 Champion - Win 25 competitive rounds
And many more!

Settings Commands (Administrator)

Manage global bot configuration settings. All commands in this group require administrator permissions.

/settings show: View all current global bot settings (provider, model, memory limits, etc.).
/settings model <model>: Set the global default AI model. Use format provider/model (e.g., openrouter/openai/gpt-4o-mini).
/settings system <prompt>: Set the global default system prompt that defines the bot’s personality.
/settings provider <provider>: Set the global AI provider (openrouter or openai). Requires corresponding API key in .env.
/settings memory <limit>: Set the maximum number of messages to remember per channel/thread.
/settings window <hours>: Set the time window (in hours) for message history retention.
/settings restore: Reset all settings to their default values.

Channel Commands (Administrator)

Configure channel-specific overrides for AI behavior. All commands in this group require administrator permissions.

/channel show: View the current channel’s configuration settings and active overrides.
/channel model <model>: Set the AI model for the current channel, overriding the global default.
/channel system <prompt>: Set a custom system prompt for the current channel.
/channel provider <provider>: Set the AI provider for the current channel.
/channel reset: Clear all channel-specific overrides and revert to global settings.
/channel list: List all channels that have custom configuration overrides.

Admin Commands

Administrative tools and diagnostics. Most commands require administrator permissions, some require bot owner permissions.

/admin sync: (Owner Only) Manually sync slash commands with Discord. Use this only if commands are not appearing.
/admin debug: Display debug information about the bot’s current state.
/admin state: Show database state information (message counts, threads, configurations).
/admin diagnostic: Run system diagnostics to check connectivity to Discord and AI providers.
/admin vision_models: List all available vision-capable AI models from the configured providers.

Deprecated Configuration Commands

These commands are deprecated and will be removed in a future update. Please use the new grouped commands (/settings, /channel, /admin) instead.

/setprovider → Use /settings provider
/setmodel → Use /channel model or /settings model
/model → Use /settings show or /channel show
/setsystem → Use /channel system or /settings system
/setchannelmodel → Use /channel model
/setchannelsystem → Use /channel system
/setmemory → Use /settings memory
/setwindow → Use /settings window

Troubleshooting Common Issues

Bot is offline:
- If using Python: Check if the Python script is still running. Check the console for errors during startup.
- If using Docker: Check if the Docker container is running (docker ps). Check the container logs (docker logs <container_id>).
- Verify the DISCORD_TOKEN in your .env file is correct.
Bot not responding to commands:
- Ensure the bot is online in Discord.
- Ensure the bot has the “Use Slash Commands” permission in the server.
- Discord may take a few minutes to register new slash commands after the bot starts.
- Check the bot’s console output for errors when you try to use a command.
Chat commands failing:
- Verify that the required API key for the currently configured provider (check /settings show) is present and correct in the .env file.
- Check the console logs for API errors from the provider (OpenRouter, OpenAI).
Image generation failing:
- Check the bot’s console output for specific API errors from AI Horde, Cloudflare, or OpenAI.
- Verify the API keys and URLs in your .env file for the selected image provider (/dream manage view_config shows the active provider).
- Ensure the bot has “Attach Files” and “Embed Links” permissions in the channel to send images.
- Some providers may have rate limits or require sufficient credits (e.g., AI Horde Kudos).
Commands requiring Admin/Owner permissions fail:
- Ensure your Discord user has the necessary permissions in the server (Administrator role or the bot owner).
- Use /admin sync (owner only) if slash commands are not appearing correctly.
Database Errors: If you encounter errors related to the database, ensure the DATA_DIRECTORY is correctly configured and that the bot process has write permissions to this directory. If using Docker, ensure the volume is correctly mounted.

4. Architecture and Development (Developer Guide)

This section provides a deeper dive into the bot’s architecture and codebase for developers and contributors.

High-Level Architecture

The Gideon bot follows a modular design centered around Py-Cord’s Cogs, with support for multiple AI providers.

graph TD
    A[Discord API] --> B(Bot Instance);
    B --> C(Event Handlers);
    B --> D(Command Dispatcher);
    D --> E(Cogs);
    E --> F(BotStateManager);
    F --> G(DatabaseManager);
    E --> H(API Clients);
    H --> I(External AI Services<br/>OpenRouter, OpenAI, AI Horde, Cloudflare);
    E --> J(Background Tasks<br/>Auto-save, Pruning);
    G --> K(SQLite Database);
    B --> L(Configuration<br/>.env, DB Config);
    L --> F;
    L --> H;

Discord API: The interface for receiving events (messages, command interactions, etc.) and sending responses (messages, embeds, files).
Bot Instance: The main application object (bot.py) that manages the connection to Discord, registers event handlers, loads cogs, and controls the bot’s lifecycle.
Event Handlers: Asynchronous functions decorated with @bot.event (in bot.py) or @commands.Cog.listener() (in cogs) that respond to specific Discord events (e.g., on_ready, on_application_command).
Command Dispatcher: Py-Cord’s built-in mechanism that parses incoming slash command interactions and routes them to the appropriate command function within a cog based on the command name and subgroups.
Cogs: Modular classes (src/cogs/) that inherit from commands.Cog. They encapsulate related commands, listeners, and state. Examples include ChatCommands, SettingsCommands, ChannelCommands, AdminCommands, ThreadCommands, UnifiedImageCommands, and MentionCommands. Each cog is loaded by the bot instance during startup. The MentionCommands cog specifically handles @mentions and implements the AI-powered intent detection system.
BotStateManager: A singleton utility class (src/utils/state_manager.py) that acts as a central point of access for the bot’s dynamic runtime state. It holds configuration overrides (channel/thread specific), caches some data, and orchestrates persistence by calling methods on the DatabaseManager. It also contains pruning logic and manages the selection of the active AI provider and model based on configuration.
DatabaseManager: Handles direct interactions with the SQLite database file (data/gideon_state.db). It manages the database connection, ensures the schema is initialized and migrated, and provides low-level methods for inserting, querying, updating, and deleting data in the various tables.
API Clients: Dedicated utility classes (src/utils/) responsible for abstracting the communication details with external AI services. Each client (openrouter_client.py, ai_horde_client.py, cloudflare_client.py, openai_client.py) handles request formatting, sending HTTP requests (using aiohttp), processing responses, and basic error handling specific to that API. These clients support both chat and/or image generation depending on the service.
External AI Services: The third-party APIs that provide the core AI capabilities (language models, image generation).
Background Tasks: Asynchronous tasks implemented using discord.ext.tasks (e.g., auto-save task and data pruning task in bot.py) that run periodically in the background without blocking the main bot loop.
SQLite Database: The file-based database (data/gideon_state.db) used for persistent storage of conversation history, configurations (including provider/model settings), thread data, and channel configurations.
Configuration: Settings are loaded from environment variables (.env) via config.py on startup. Runtime configuration changes (via admin commands using the new grouped command structure) are stored persistently in the GLOBAL_CONFIG, CHANNEL_CONFIG, and THREADS tables in the database.

Project Structure

gideon/
├── src/                    # Source code
│   ├── bot.py              # Main bot initialization and event handling
│   ├── config.py           # Loads configuration from environment variables
│   ├── __main__.py         # Application entry point
│   ├── cogs/               # Command modules (Cogs)
│   │   ├── __init__.py             # Makes cogs directory a Python package
│   │   ├── admin_commands.py       # Admin tools (/admin group)
│   │   ├── channel_commands.py     # Channel settings (/channel group)
│   │   ├── chat_commands.py        # AI conversation commands (/chat, /reset, etc.)
│   │   ├── config_commands.py      # Legacy commands (deprecated)
│   │   ├── diagnostic_commands.py  # Diagnostic utilities
│   │   ├── help_commands.py        # Interactive help system (/help)
│   │   ├── mention_commands.py     # Handles bot mentions
│   │   ├── settings_commands.py    # Global settings (/settings group)
│   │   ├── thread_commands.py      # Thread management (/thread group)
│   │   ├── unified_image_commands.py # Image generation (/dream, /dream_manage)
│   │   └── url_commands.py         # URL handling commands (/summarizeurl)
│   └── utils/              # Utility functions and classes
│       ├── __init__.py             # Makes utils directory a Python package
│       ├── ai_horde_client.py    # Client for AI Horde API (Image/Text)
│       ├── cloudflare_client.py  # Client for Cloudflare Worker API (Image)
│       ├── comfyui_client.py     # Client for ComfyUI API (Advanced Image Generation)
│       ├── database.py           # Handles SQLite database interactions and schema
│       ├── model_manager.py      # Manages available AI models (fetches from providers)
│       ├── openai_client.py      # Client for OpenAI API (DALL-E, Chat/Vision)
│       ├── openrouter_client.py  # Client for OpenRouter API (Chat, Vision, Web Search)
│       ├── permissions.py        # Utility functions for Discord permissions
│       └── state_manager.py      # Singleton for centralized state access and management
├── .env.example            # Template for environment variables
├── requirements.txt        # Project dependencies
├── documentation.md        # This documentation file
├── LICENSE                 # Project license
├── README.md               # Project README
├── index.md                # GitHub Pages source file
├── _config.yml             # Jekyll config (if using GitHub Pages)
├── docker-compose.yml      # Docker Compose file for easy setup
├── Dockerfile              # Defines the Docker image
└── assets/                 # Static assets (images, etc.)
    └── images/
        └── ... (screenshots and logo)

Core Components

bot.py: This is the heart of the bot application. It initializes the discord.Bot instance with necessary intents, loads all cogs found in the src/cogs directory, sets up signal handlers for graceful shutdown (ensuring state is saved), registers a global error handler to catch unhandled exceptions, and manages the connection to the Discord API. It also initializes the BotStateManager and ModelManager instances, making them available to cogs.
config.py: This module is responsible for loading configuration values from the environment, primarily from the .env file using the python-dotenv library. It provides access to critical settings like DISCORD_TOKEN, API keys, SYSTEM_PROMPT, DEFAULT_MODEL, and paths like DATA_DIRECTORY.
__main__.py: This is the script executed to start the bot. It typically performs minimal setup, such as loading the configuration and then calling the run() method on the bot instance defined in bot.py to start the Discord event loop.

State Management

The bot’s operational state, including conversation histories, configurations, and feature-specific data, is managed in memory by the BotStateManager and persistently stored in a SQLite database using the DatabaseManager.

BotStateManager (src/utils/state_manager.py): Implemented as a singleton, this class provides a unified, high-level interface for accessing and modifying the bot’s state. It holds runtime configuration settings (like maximum history length, time window for pruning, global model, global provider) and delegates the actual reading and writing of data to the DatabaseManager. It includes logic for managing conversation history per channel/thread, handling thread-specific overrides, determining the effective provider and model for a given context, and orchestrating the pruning of old data. It also interacts with the ModelManager for model validation.
DatabaseManager (src/utils/database.py): This class is responsible for all direct interactions with the SQLite database file (data/gideon_state.db). It manages the database connection, ensures the schema is initialized and migrated, and provides low-level methods for inserting, querying, updating, and deleting data in the various tables.
- GLOBAL_CONFIG: Stores key-value pairs for global bot settings that can be changed at runtime (e.g., max_channel_history, time_window_hours, global_model, global_provider, news_update_frequency). Values are stored as TEXT along with a type indicator (string, int, float, bool, json).
- CHANNELS: Stores a list of channel IDs the bot has interacted with, primarily to support foreign key constraints and channel-specific configurations.
- CHANNEL_CONFIG: Stores channel-specific overrides for model, provider, and system_prompt. Linked to CHANNELS via a foreign key.
- THREADS: Stores information about Discord threads (ID, parent channel ID, name, creation timestamp) and also includes columns for thread-specific model and system_prompt overrides. Linked to CHANNELS via a foreign key.
- MESSAGES: Stores individual chat messages (role, content, timestamp, user_id). Each message is linked to either a channel_id or a thread_id via foreign keys. Indexes are used to optimize querying history by channel or thread and by timestamp.
- NEWS_FEEDS: Stores details about configured RSS feeds (feed_id - typically the URL, url, name, category, last_checked timestamp).
- NEWS_CHANNEL_SUBSCRIPTIONS: A linking table (channel_id, feed_id) to manage which channels are subscribed to which news feeds. Uses foreign keys to CHANNELS and NEWS_FEEDS.
- NEWS_ARTICLES_HISTORY: Records the article_identifier and feed_id of articles that have been processed to prevent duplicate posts. Includes a processed_at timestamp for pruning.
- article_summaries: Caches AI-generated summaries of news articles. Stores article_id, feed_id, article_title, article_link, article_published_date, summary_text, denormalized feed_name and feed_category, and timestamp_summarized.
- user_article_summaries: Stores AI-generated summaries for articles saved by individual users in their personal feeds. Includes user_id, article_link, feed_url, article_title, article_published_date, summary_text, and timestamp_summarized.
- USERS: Stores basic user information (user_id) and a preferences_json column to hold user-specific settings (like personal RSS feed URLs).
Persistence Logic: The BotStateManager’s state is loaded from the database when the bot connects to Discord (on_ready event in bot.py). Changes to configuration and the addition of new messages, threads, etc., are written to the database via DatabaseManager methods. An auto-save background task in bot.py periodically triggers the saving of the current state to the database. State is also saved during graceful shutdown.
Pruning Logic: The BotStateManager includes a prune_old_data method (called periodically by the auto-save task) that instructs the DatabaseManager to delete old records from the MESSAGES, THREADS, NEWS_ARTICLES_HISTORY, article_summaries, and user_article_summaries tables based on configured time windows (time_window_hours, summary_retention_days) and limits (max_channel_history). This prevents the database file from growing indefinitely.

API Integrations

The bot interacts with external AI services through dedicated client classes in the src/utils/ directory, typically using the aiohttp library for asynchronous HTTP requests. A key aspect is the support for multiple providers for both chat and image generation.

OpenRouterClient (openrouter_client.py): Handles communication with the OpenRouter API. Primarily used for chat completions, accessing a wide variety of models. Supports vision and web search.
AIHordeClient (ai_horde_client.py): Interacts with the AI Horde API. Used for image generation and can also be used for text generation if configured as the provider.
CloudflareWorkerClient (cloudflare_client.py): Communicates with a custom image generation API hosted on Cloudflare Workers.
OpenAIClient (openai_client.py): Interacts directly with the OpenAI API. Used for DALL-E image generation and can also be used directly for chat completions (including vision models).
ComfyUIClient (comfyui_client.py): Communicates with a ComfyUI server for advanced image generation. Supports custom workflows, model selection, and polling-based generation. Handles workflow parameter injection and image retrieval.

The ChatCommands cog determines the effective provider and model for the current context (channel/thread/global) via BotStateManager and then uses the corresponding client’s send_message_with_history method. Similarly, UnifiedImageCommands selects the appropriate image generation client based on its separate configuration.

Command System

Gideon utilizes Py-Cord’s modern application command (slash command) framework for all user interactions.

Command Definition: Commands are defined as asynchronous methods within cog classes. They are decorated with @discord.slash_command to specify the command name and description. Parameters are defined using the @option decorator, allowing for type hinting, descriptions, required status, and choices.
Cog Structure: Each major feature or group of related commands resides in its own class that inherits from commands.Cog. This promotes modularity and organization. The bot discovers and loads these cogs during startup by iterating through the files in the src/cogs/ directory. Each cog file must have a setup function that the bot calls to add the cog instance (bot.add_cog(MyCog(bot))).
Command Groups and Subgroups: Commands can be nested into hierarchical structures using discord.SlashCommandGroup and the create_subgroup() method. This helps organize commands logically (e.g., /dream manage set_provider).
Permissions: Access control for commands is implemented using Py-Cord’s built-in checks, applied as decorators to command methods. Common checks include @commands.is_owner() (restricts command to the bot’s owner) and @commands.has_permissions(administrator=True) (restricts command to users with administrator permissions in the server).

Adding New Features/Cogs

To contribute a new feature or set of commands to Gideon:

Create a new Python file in the src/cogs/ directory (e.g., my_new_feature_cog.py).
Define a class within this file that inherits from commands.Cog.
Add commands as asynchronous methods within your new cog class. Use @discord.slash_command and @option decorators to define the command interface.
Implement the logic for your commands within these methods. You can access the bot instance via self.bot and interact with the BotStateManager (self.bot.state_manager) or other utilities as needed.
Include a setup function at the end of your new cog file. This function is required by Py-Cord for the bot to load your cog:
```
def setup(bot):
    bot.add_cog(MyNewFeatureCog(bot))
```
Ensure your new cog file is placed in the src/cogs/ directory so the bot can discover and load it on startup.

Error Handling and Logging

Error Handling Strategy: The bot implements a multi-layered approach to error handling. Specific exceptions are caught and handled within try...except blocks in individual command functions. Cog-level error handlers (methods decorated with @commands.Cog.listener("on_application_command_error")) can catch errors from commands within that cog. A global error handler is registered in bot.py (@bot.event) to catch any exceptions not handled at lower levels. This layered approach ensures that errors are caught, logged, and user-friendly error messages are provided where possible, preventing the bot from crashing due to unhandled exceptions.
Logging System: The standard Python logging module is used throughout the codebase. Loggers are configured in bot.py to output messages to the console with different severity levels (DEBUG, INFO, WARNING, ERROR, CRITICAL). Logs are typically output to the console and can be configured for file output. Detailed tracebacks are logged for errors to aid in debugging. Developers should use the appropriate logger (logging.getLogger(__name__) in each module) and logging levels (logger.info, logger.error, logger.debug, logger.exception) to provide useful information about the bot’s operation and any issues encountered.

Security Considerations

API Key Management: All sensitive API keys (Discord, OpenRouter, AI Horde, Cloudflare, OpenAI) are stored exclusively in environment variables, typically within the .env file. They are loaded into the application via config.py and accessed at runtime. This prevents hardcoding credentials directly in the source code, which is a critical security practice.
Permission Controls: Access to sensitive or administrative commands is strictly controlled using Discord’s built-in permission system and enforced by Py-Cord decorators (@commands.is_owner(), @commands.has_permissions()). This ensures that only authorized users can perform actions like changing bot configuration or managing feeds.
Input Validation: Command inputs and data received from external APIs are validated where necessary to prevent unexpected behavior, injection vulnerabilities, or crashes due to malformed data.
Data Protection: The primary persistent data is stored in the SQLite database file (data/gideon_state.db). While the bot itself does not implement database-level encryption, it is highly recommended that the host system where the bot is running utilizes filesystem-level encryption to protect this data at rest, especially if sensitive conversation history is being stored. Automatic pruning helps limit the amount of historical data stored, reducing the potential impact of a data breach.

Configuration System

Configuration in Gideon is managed through a combination of static environment variables and dynamic, persistent settings stored in the database.

Environment Variables: Loaded on bot startup from the .env file via config.py. These are typically used for settings that define the bot’s connection to external services (API keys, URLs) and initial default behaviors (SYSTEM_PROMPT, DEFAULT_MODEL, DATA_DIRECTORY). These values are loaded once at startup.
Database Configuration (GLOBAL_CONFIG table): Stores global settings that can be modified at runtime via administrator commands (e.g., /setprovider, /setmemory, /setwindow, /setfeedfrequency, /dream manage set_provider, /dream configure). These settings are loaded into the BotStateManager on startup and saved back to the database when changed, ensuring they persist across bot restarts. The global_provider and global_model keys determine the default AI service for chat.
Channel/Thread Overrides (CHANNEL_CONFIG, THREADS tables): Allow administrators to override the global default provider, model, and system prompt for specific channels or threads using commands like /setchannelmodel, /setchannelsystem, /thread setmodel, /thread setsystem. These overrides are stored in the database and loaded by the BotStateManager to determine the effective configuration for a given conversation context.

5. Contributing

Contributions to the Gideon Discord Bot are welcome! If you’d like to contribute, please follow these steps:

Fork the repository on GitHub: https://github.com/eoko-dev/gideon
Clone your forked repository to your local machine.
Create a new branch for your feature or bug fix.
Make your changes, following the project’s coding style and guidelines.
Write clear and concise commit messages.
Test your changes thoroughly.
Push your changes to your fork on GitHub.
Open a Pull Request from your branch to the main branch of the original repository.
Provide a clear description of your changes and why they are necessary.

Please also feel free to open issues on the GitHub repository to report bugs, suggest new features, or discuss potential improvements.

6. License

This project is licensed under the MIT License. See the LICENSE file in the repository for the full text.

Gideon Discord Bot - Documentation

Table of Contents

1. Overview

2. Getting Started (User/Administrator Guide)

Prerequisites

Installation

Method 1: Using Python

Method 2: Using Docker

Configuration

Running the Bot

Running with Python

Running with Docker

Adding to Your Server

3. Bot Usage (User/Administrator Guide)

General Concepts

Help Command

AI Chat Commands

Natural Language Intent Detection

How It Works

Supported Intents

Configuration

Implementation Details (for Developers)

Thread Management Commands

Image Generation Commands

Trivia Commands

Settings Commands (Administrator)

Channel Commands (Administrator)

Admin Commands

Deprecated Configuration Commands

Troubleshooting Common Issues

4. Architecture and Development (Developer Guide)

High-Level Architecture

Project Structure

Core Components

State Management

API Integrations

Command System

Adding New Features/Cogs

Error Handling and Logging

Security Considerations

Configuration System

5. Contributing

6. License