docs: update README for current architecture, remove stale docs

Rewrote README.md to reflect the actual project state: dual Discord bot + FastAPI web UI, WorkflowInspector-based dynamic node injection, no SerialJobQueue, full env var table, frontend build steps, web auth setup, and updated credits/troubleshooting.

Removed three files that were actively misleading:
- DEVELOPMENT.md and QUICK_START.md referenced deleted modules (job_queue.py, upload.py) and old WorkflowManager/StateManager APIs
- backfill_image_data.py was a completed one-shot migration script

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-02 12:00:13 +07:00
parent 1ed3c9ec4b
commit f6907d2c39
4 changed files with 254 additions and 1411 deletions
--- a/README.md
+++ b/README.md
@@ -1,345 +1,320 @@
-# Discord ComfyUI Bot
+# ComfyUI Discord Bot + Web UI

-A Discord bot that integrates with ComfyUI to generate AI images and videos through Discord commands.
+A Discord bot and web interface that integrates with [ComfyUI](https://github.com/comfyanonymous/ComfyUI) to generate AI images and videos. Requests can be submitted through Discord commands or a browser-based UI with real-time progress updates.

 ## Features

- 🎨 **Image Generation** - Generate images using simple prompts or complex workflows
- 🎬 **Video Generation** - Support for video output workflows
- 📝 **Workflow Management** - Load, modify, and execute ComfyUI workflows
- 📤 **Image Upload** - Upload reference images directly through Discord
- 📊 **Generation History** - Track and retrieve past generations
- ⚙️ **Runtime Workflow Modification** - Change prompts, negative prompts, and input images on the fly
- 🔄 **Job Queue System** - Sequential execution prevents server overload
+**Discord Bot**
+- Image and video generation via simple prompts or full ComfyUI workflows
+- Runtime workflow parameter changes (prompt, negative prompt, input images, seeds)
+- Preset management — save and recall workflow configurations
+- Generation history with full output retrieval
+- Server control (start/stop ComfyUI via NSSM service)
+- Automatic image compression to fit Discord's 8 MiB limit

-## Quick Start
+**Web UI**
+- Invite-token authentication with JWT session cookies
+- Simple generate form and full dynamic workflow form (auto-discovers all node inputs)
+- Input image library — upload, browse, and select images for generation
+- Generation history with image preview
+- Preset management
+- Real-time status dashboard (ComfyUI connection, queue depth)
+- WebSocket-based live progress updates during generation
+- Admin panel for token and server management

-### Prerequisites
+## Architecture

- Python 3.9+
- Discord Bot Token ([create one here](https://discord.com/developers/applications))
- ComfyUI Server running and accessible
- Required packages: `discord.py`, `aiohttp`, `websockets`, `python-dotenv`
+```
+the-third-rev/
+├── bot.py                   # Entry point — Discord bot + Uvicorn run via asyncio.gather
+├── config.py                # BotConfig dataclass, loads all env vars
+├── comfy_client.py          # ComfyUI REST + WebSocket client
+├── workflow_inspector.py    # Dynamic node discovery and override injection
+├── workflow_manager.py      # Workflow template storage (get/set)
+├── workflow_state.py        # Runtime override dict with file persistence
+├── generation_db.py         # SQLite generation history + file BLOBs
+├── input_image_db.py        # SQLite input image storage
+├── token_store.py           # SHA-256 hashed invite tokens
+├── preset_manager.py        # Workflow preset CRUD
+├── user_state_registry.py   # Per-user workflow state for web sessions
+├── image_utils.py           # PIL-based image compression
+├── media_uploader.py        # Optional external media upload
+├── status_monitor.py        # Background status polling and Discord log channel
+├── discord_utils.py         # Discord helpers, decorators, argument parsing
+├── commands/                # Discord command handlers
+│   ├── __init__.py
+│   ├── generation.py        # generate, workflow-gen
+│   ├── workflow.py          # workflow-load
+│   ├── history.py           # history, get-history
+│   ├── input_images.py      # input image upload/management
+│   ├── presets.py           # preset save/load/delete
+│   ├── server.py            # ComfyUI server control
+│   ├── utility.py           # test, info, misc
+│   ├── workflow_changes.py  # get/set workflow overrides
+│   └── help_command.py      # custom help
+├── web/                     # FastAPI application
+│   ├── app.py               # App factory, middleware, static file serving
+│   ├── auth.py              # JWT create/verify
+│   ├── deps.py              # Shared FastAPI dependencies (bot reference)
+│   ├── login_guard.py       # Auth decorators
+│   ├── ws_bus.py            # Per-user WebSocket broadcast bus
+│   └── routers/             # API endpoints (generate, history, inputs, presets, ...)
+└── frontend/                # React + TypeScript + Vite + Tailwind source
+    └── src/
+        ├── pages/           # GeneratePage, WorkflowPage, HistoryPage, ...
+        ├── components/      # Layout, DynamicWorkflowForm, LazyImage
+        ├── hooks/           # useAuth, useWebSocket, useStatus
+        └── context/         # GenerationContext (pending count badge)
+```

-### Installation
+ComfyUI's own queue handles job ordering — no separate job queue exists in this bot. Generation callbacks are matched to requests via a `prompt_id → callback` map in `ComfyClient`.

-1. **Clone or download this repository**
+## Requirements

-2. **Install dependencies**:
-   ```bash
-   pip install discord.py aiohttp websockets python-dotenv
-   ```
+- Python 3.10+
+- Node.js 18+ (for building the frontend)
+- ComfyUI running and accessible
+- Discord bot token with **Message Content Intent** enabled

-3. **Create `.env` file** with your credentials:
-   ```bash
-   DISCORD_BOT_TOKEN=your_discord_bot_token_here
-   COMFY_SERVER=localhost:8188
-   ```
+**Python dependencies:**
+```bash
+pip install discord.py aiohttp websockets python-dotenv fastapi uvicorn pillow
+```

-4. **Run the bot**:
-   ```bash
-   python bot.py
-   ```
+## Installation

-## Configuration
+### 1. Clone and install Python dependencies
+
+```bash
+git clone <repo-url>
+cd the-third-rev
+pip install discord.py aiohttp websockets python-dotenv fastapi uvicorn pillow
+```
+
+### 2. Build the frontend
+
+```bash
+cd frontend
+npm install
+npm run build
+cd ..
+```
+
+The build output lands in `web-static/` and is served automatically by FastAPI.
+
+### 3. Configure environment

 Create a `.env` file in the project root:

 ```bash
-# Required
+# ── Required ────────────────────────────────────────
 DISCORD_BOT_TOKEN=your_discord_bot_token
 COMFY_SERVER=localhost:8188

-# Optional
-WORKFLOW_FILE=wan2.2-fast.json
+# ── ComfyUI paths ───────────────────────────────────
+COMFY_OUTPUT_PATH=C:\path\to\ComfyUI\output
+COMFY_INPUT_PATH=C:\path\to\ComfyUI\input
 COMFY_HISTORY_LIMIT=10
-COMFY_OUTPUT_PATH=C:\Users\YourName\Documents\ComfyUI\output
+
+# ── Startup workflow ────────────────────────────────
+WORKFLOW_FILE=workflows/my_workflow.json   # optional; loaded at startup
+
+# ── Web UI ──────────────────────────────────────────
+WEB_ENABLED=true
+WEB_HOST=0.0.0.0
+WEB_PORT=8080
+WEB_SECRET_KEY=change-me-to-a-random-secret
+WEB_JWT_EXPIRE_HOURS=720
+WEB_SECURE_COOKIE=false       # set true if serving over HTTPS
+
+# ── Admin ───────────────────────────────────────────
+ADMIN_PASSWORD=your_admin_password
+
+# ── ComfyUI server control (optional) ───────────────
+COMFY_SERVICE_NAME=ComfyUI         # NSSM service name
+COMFY_START_BAT=C:\path\to\run_nvidia_gpu.bat
+COMFY_LOG_DIR=C:\path\to\logs
+COMFY_AUTOSTART=false              # auto-start ComfyUI on bot launch
+
+# ── Discord status log channel (optional) ────────────
+LOG_CHANNEL_ID=123456789012345678
+
+# ── External media upload (optional) ────────────────
+MEDIA_UPLOAD_USER=
+MEDIA_UPLOAD_PASS=
 ```

-### Configuration Options
+### Full configuration reference

 | Variable | Required | Default | Description |
-|----------|----------|---------|-------------|
-| `DISCORD_BOT_TOKEN` | ✅ Yes | - | Discord bot authentication token |
-| `COMFY_SERVER` | ✅ Yes | - | ComfyUI server address (host:port) |
-| `WORKFLOW_FILE` | ❌ No | - | Path to workflow JSON to load at startup |
-| `COMFY_HISTORY_LIMIT` | ❌ No | `10` | Number of generations to keep in history |
-| `COMFY_OUTPUT_PATH` | ❌ No | `C:\Users\...\ComfyUI\output` | Path to ComfyUI output directory |
+|---|---|---|---|
+| `DISCORD_BOT_TOKEN` | Yes | — | Discord bot token |
+| `COMFY_SERVER` | Yes | — | ComfyUI address (`host:port`) |
+| `COMFY_OUTPUT_PATH` | No | `...\ComfyUI\output` | ComfyUI output directory |
+| `COMFY_INPUT_PATH` | No | `...\ComfyUI\input` | ComfyUI input directory |
+| `COMFY_HISTORY_LIMIT` | No | `10` | Generations kept in history |
+| `WORKFLOW_FILE` | No | — | Workflow JSON to load at startup |
+| `WEB_ENABLED` | No | `true` | Enable web UI |
+| `WEB_HOST` | No | `0.0.0.0` | Web server bind address |
+| `WEB_PORT` | No | `8080` | Web server port |
+| `WEB_SECRET_KEY` | No | — | JWT signing secret (set in production) |
+| `WEB_JWT_EXPIRE_HOURS` | No | `720` | Session expiry in hours |
+| `WEB_SECURE_COOKIE` | No | `false` | Mark session cookie as Secure (HTTPS only) |
+| `WEB_TOKEN_FILE` | No | `invite_tokens.json` | Invite token storage path |
+| `ADMIN_PASSWORD` | No | — | Admin panel password |
+| `COMFY_SERVICE_NAME` | No | — | NSSM service name for server control |
+| `COMFY_START_BAT` | No | — | ComfyUI launch script path |
+| `COMFY_LOG_DIR` | No | — | Directory for ComfyUI logs |
+| `COMFY_AUTOSTART` | No | `false` | Auto-start ComfyUI on bot launch |
+| `LOG_CHANNEL_ID` | No | — | Discord channel ID for status messages |

-## Usage
+### 4. Create a web UI invite token
+
+The web UI requires an invite token to register an account:
+
+```bash
+python -c "from token_store import create_token; print(create_token('username'))"
+```
+
+Copy the printed token — it is shown only once. Give it to the user who will register.
+
+### 5. Run
+
+```bash
+python bot.py
+```
+
+The bot starts the Discord client and the web server concurrently. Navigate to `http://localhost:8080` to access the web UI.
+
+## Discord Commands

 All commands use the `ttr!` prefix.

-### Basic Commands
+### Generation

-```bash
-# Test if bot is working
-ttr!test
+| Command | Alias | Description |
+|---|---|---|
+| `ttr!generate prompt:<text>` | `ttr!gen` | Generate using prompt mode |
+| `ttr!generate prompt:<text> negative_prompt:<text>` | | Generate with negative prompt |
+| `ttr!workflow-gen` | `ttr!wfg` | Execute loaded workflow with current overrides |
+| `ttr!workflow-gen queue:5` | | Queue 5 workflow runs |

-# Generate an image with a prompt
-ttr!generate prompt:a beautiful sunset over mountains
+### Workflow management

-# Generate with negative prompt
-ttr!generate prompt:a cat negative_prompt:blurry, low quality
-
-# Execute loaded workflow
-ttr!workflow-gen
-
-# Queue multiple workflow runs
-ttr!workflow-gen queue:5
-```
-
-### Workflow Management
-
-```bash
-# Load a workflow from file
-ttr!workflow-load path/to/workflow.json
-
-# Or attach a JSON file to the message:
-ttr!workflow-load
-[Attach: my_workflow.json]
-
-# View current workflow changes
-ttr!get-current-workflow-changes type:all
-
-# Set workflow parameters
-ttr!set-current-workflow-changes type:prompt A new prompt
-ttr!set-current-workflow-changes type:negative_prompt blurry
-ttr!set-current-workflow-changes type:input_image input/image.png
-```
-
-### Image Upload
-
-```bash
-# Upload images to ComfyUI
-ttr!upload
-[Attach: image1.png, image2.png]
-
-# Upload to specific folder
-ttr!upload type:temp
-[Attach: reference.png]
-```
+| Command | Alias | Description |
+|---|---|---|
+| `ttr!workflow-load <path>` | `ttr!wfl` | Load workflow from file path |
+| `ttr!workflow-load` (+ attachment) | | Load workflow from attached JSON |
+| `ttr!get-current-workflow-changes type:all` | `ttr!gcwc` | Show current overrides |
+| `ttr!set-current-workflow-changes type:prompt <text>` | `ttr!scwc` | Set prompt override |
+| `ttr!set-current-workflow-changes type:negative_prompt <text>` | | Set negative prompt |

 ### History

-```bash
-# View recent generations
-ttr!history
+| Command | Alias | Description |
+|---|---|---|
+| `ttr!history` | | List recent generations |
+| `ttr!get-history <id>` | `ttr!gh` | Retrieve output from a past generation |

-# Retrieve images from a past generation
-ttr!get-history <prompt_id>
-ttr!get-history 1  # By index
-```
+### Presets

-### Command Aliases
+| Command | Description |
+|---|---|
+| `ttr!preset-save <name>` | Save current workflow overrides as a preset |
+| `ttr!preset-load <name>` | Apply a saved preset |
+| `ttr!preset-list` | List all presets |
+| `ttr!preset-delete <name>` | Delete a preset |

-Many commands have shorter aliases:
+### Server control

- `ttr!generate` → `ttr!gen`
- `ttr!workflow-gen` → `ttr!wfg`
- `ttr!workflow-load` → `ttr!wfl`
- `ttr!get-history` → `ttr!gh`
- `ttr!get-current-workflow-changes` → `ttr!gcwc`
- `ttr!set-current-workflow-changes` → `ttr!scwc`
+| Command | Description |
+|---|---|
+| `ttr!server-start` | Start the ComfyUI NSSM service |
+| `ttr!server-stop` | Stop the ComfyUI NSSM service |
+| `ttr!server-status` | Show ComfyUI service status |
+| `ttr!server-log` | Tail the ComfyUI log file |

-## Architecture
+### Utility

-The bot is organized into focused, maintainable modules:
-
-```
-the-third-rev/
-├── config.py              # Configuration and constants
-├── job_queue.py           # Job queue system
-├── workflow_manager.py    # Workflow manipulation
-├── workflow_state.py      # Runtime state management
-├── discord_utils.py       # Discord utilities
-├── bot.py                 # Main entry point (~150 lines)
-├── comfy_client.py        # ComfyUI API client (~650 lines)
-└── commands/              # Command handlers
-    ├── generation.py      # Image/video generation
-    ├── workflow.py        # Workflow management
-    ├── upload.py          # File uploads
-    ├── history.py         # History retrieval
-    └── workflow_changes.py # State management
-```
-
-### Key Design Principles
-
- **Dependency Injection** - Dependencies passed via constructor
- **Single Responsibility** - Each module has one clear purpose
- **Configuration Centralization** - All config in `config.py`
- **Command Separation** - Commands grouped by functionality
- **Type Safety** - Modern Python type hints throughout
-
-## Development
-
-### Adding a New Command
-
-See `QUICK_START.md` for quick examples or `DEVELOPMENT.md` for comprehensive guide.
-
-Basic example:
-
-```python
-# commands/your_module.py
-
-def setup_your_commands(bot, config):
-    @bot.command(name="hello")
-    async def hello(ctx):
-        await ctx.reply("Hello!", mention_author=False)
-```
-
-Register in `commands/__init__.py`:
-
-```python
-from .your_module import setup_your_commands
-
-def register_all_commands(bot, config):
-    # ... existing ...
-    setup_your_commands(bot, config)
-```
-
-### Documentation
-
- **README.md** (this file) - Project overview and setup
- **QUICK_START.md** - Quick reference for common tasks
- **DEVELOPMENT.md** - Comprehensive development guide
- **CLAUDE.md** - Architecture documentation for Claude Code
+| Command | Description |
+|---|---|
+| `ttr!test` | Verify bot is online |
+| `ttr!help` | Show command list |

 ## Workflow System

-The bot supports two generation modes:
+### How node injection works

-### 1. Prompt Mode (Simple)
+`workflow_inspector.py` dynamically discovers all controllable inputs in any workflow:

-Uses a workflow template with a KSampler node:
+- **Prompt** — `CLIPTextEncode` node with title containing "Positive Prompt"
+- **Negative prompt** — `CLIPTextEncode` node with title containing "Negative Prompt"
+- **Input image** — `LoadImage` nodes (first one = `input_image` key; additional ones get slugified title keys)
+- **Seed** — any node with `inputs.seed` or `inputs.noise_seed` (auto-randomized unless explicitly set)
+- **Steps, CFG, checkpoint, LoRA** — discovered and injectable via the web workflow form
+
+No hardcoded node IDs. Workflows only need to follow standard ComfyUI node title conventions.
+
+### Workflow overrides persist across restarts
+
+Runtime changes are saved to `current-workflow-changes.json` automatically and restored on startup.
+
+### Loading a custom workflow
+
+1. Design and export your workflow in ComfyUI (Save → API Format)
+2. Load it in Discord: `ttr!workflow-load path/to/workflow.json`
+   or via the web UI: Workflow page → Upload
+3. Set overrides and run: `ttr!workflow-gen` or use the web Generate/Workflow page
+
+## Development
+
+### Frontend development

 ```bash
-ttr!generate prompt:a cat negative_prompt:blurry
+cd frontend
+npm run dev   # HMR dev server on :5173, proxies /api + /ws to :8080
 ```

-The bot automatically finds and replaces:
- Positive prompt in CLIPTextEncode node (title: "Positive Prompt")
- Negative prompt in CLIPTextEncode node (title: "Negative Prompt")
- Seed values (randomized each run)
+For production, rebuild with `npm run build`.

-### 2. Workflow Mode (Advanced)
+### Adding a Discord command

-Execute full workflow with runtime modifications:
+1. Add your handler to the appropriate module in `commands/` (or create a new one)
+2. Register it in `commands/__init__.py` → `register_all_commands()`
+3. Use `@require_comfy_client` from `discord_utils.py` if the command needs `bot.comfy`

-```bash
-# Set workflow parameters
-ttr!set-current-workflow-changes type:prompt A beautiful landscape
-ttr!set-current-workflow-changes type:input_image input/reference.png
+See `CLAUDE.md` for full architectural details.

-# Execute workflow
-ttr!workflow-gen
-```
+### Adding a web API endpoint

-The bot:
-1. Loads the workflow template
-2. Applies runtime changes from WorkflowStateManager
-3. Randomizes seeds
-4. Executes on ComfyUI server
-5. Returns images/videos
-
-### Node Naming Conventions
-
-For workflows to work with dynamic updates, nodes must follow naming conventions:
-
- **Positive Prompt**: CLIPTextEncode node with title containing "Positive Prompt"
- **Negative Prompt**: CLIPTextEncode node with title containing "Negative Prompt"
- **Input Image**: LoadImage node (any title)
- **Seeds**: Any node with `inputs.seed` or `inputs.noise_seed`
+1. Create a router in `web/routers/`
+2. Register it in `web/app.py` via `app.include_router()`
+3. Use `require_auth` / `require_admin` from `web/auth.py` for protected routes

 ## Troubleshooting

-### Bot won't start
+**Web UI shows a blank page in production**
+Windows may serve `.js` files as `text/plain`. This is fixed in `web/app.py` with explicit MIME type registration — ensure you are running the latest version.

-**Issue**: `AttributeError: module 'queue' has no attribute 'SimpleQueue'`
+**ComfyUI connection refused**
+Check `COMFY_SERVER` in `.env` and confirm ComfyUI is running. Test with `curl http://localhost:8188`.

-**Solution**: This was fixed by renaming `queue.py` to `job_queue.py`. Make sure you're using the latest version.
+**Commands not responding**
+Ensure the bot has **Message Content Intent** enabled in the Discord Developer Portal and has sufficient channel permissions.

-### ComfyUI connection issues
+**Videos not delivered**
+Set `COMFY_OUTPUT_PATH` to the correct ComfyUI output directory. The bot reads video files directly from disk.

-**Issue**: `ComfyUI client is not configured`
-
-**Solution**:
-1. Check `.env` file has `DISCORD_BOT_TOKEN` and `COMFY_SERVER`
-2. Verify ComfyUI server is running
-3. Test connection: `curl http://localhost:8188`
-
-### Commands not responding
-
-**Issue**: Bot online but commands don't work
-
-**Solution**:
-1. Check bot has Message Content Intent enabled in Discord Developer Portal
-2. Verify bot has permissions in Discord server
-3. Check console logs for errors
-
-### Video files not found
-
-**Issue**: `Failed to read video file`
-
-**Solution**:
-1. Set `COMFY_OUTPUT_PATH` in `.env` to your ComfyUI output directory
-2. Check path uses correct format for your OS
-
-## Advanced Usage
-
-### Batch Generation
-
-Queue multiple workflow runs:
-
-```bash
-ttr!workflow-gen queue:10
-```
-
-Each run uses randomized seeds for variation.
-
-### Custom Workflows
-
-1. Design workflow in ComfyUI
-2. Export as API format (Save → API Format)
-3. Load in bot:
-   ```bash
-   ttr!workflow-load path/to/workflow.json
-   ```
-4. Modify at runtime:
-   ```bash
-   ttr!set-current-workflow-changes type:prompt My prompt
-   ttr!workflow-gen
-   ```
-
-### State Persistence
-
-Workflow changes are automatically saved to `current-workflow-changes.json` and persist across bot restarts.
-
-## Contributing
-
-We welcome contributions! Please:
-
-1. Read `DEVELOPMENT.md` for coding guidelines
-2. Follow existing code style and patterns
-3. Test your changes thoroughly
-4. Update documentation as needed
-
-## License
-
-[Your License Here]
-
-## Support
-
-For issues or questions:
- Check the troubleshooting section above
- Review `DEVELOPMENT.md` for implementation details
- Check ComfyUI documentation for workflow issues
- Open an issue on GitHub
+**Web UI WebSocket disconnects immediately**
+Set a proper `WEB_SECRET_KEY` — an empty secret causes JWT validation failures.

 ## Credits

 Built with:
- [discord.py](https://github.com/Rapptz/discord.py) - Discord API wrapper
- [ComfyUI](https://github.com/comfyanonymous/ComfyUI) - Stable Diffusion GUI
- [aiohttp](https://github.com/aio-libs/aiohttp) - Async HTTP client
- [websockets](https://github.com/python-websockets/websockets) - WebSocket implementation
+- [discord.py](https://github.com/Rapptz/discord.py) — Discord API
+- [ComfyUI](https://github.com/comfyanonymous/ComfyUI) — AI image/video generation backend
+- [FastAPI](https://fastapi.tiangolo.com/) — Web API framework
+- [Uvicorn](https://www.uvicorn.org/) — ASGI server
+- [React](https://react.dev/) + [Vite](https://vitejs.dev/) + [Tailwind CSS](https://tailwindcss.com/) — Web frontend
+- [aiohttp](https://github.com/aio-libs/aiohttp) — Async HTTP client
+- [Pillow](https://python-pillow.org/) — Image compression
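
Note on the "How node injection works" section of the new README: the discovery rules it lists (title-matched `CLIPTextEncode` nodes, `LoadImage` nodes, and any node with `inputs.seed` or `inputs.noise_seed`) can be illustrated with a rough sketch. This is not the code in `workflow_inspector.py`; the function names `discover_inputs` and `randomize_seeds` and the sample workflow are hypothetical. It assumes the workflow is a ComfyUI API-format export, where each node is keyed by its ID and carries `class_type`, `inputs`, and optionally `_meta.title`.

```python
import random
from typing import Any

# API-format workflow: node ID -> node definition (assumed shape).
Workflow = dict[str, dict[str, Any]]


def discover_inputs(workflow: Workflow) -> dict[str, list[str]]:
    """Group node IDs by the kind of override they could accept (hypothetical helper)."""
    found: dict[str, list[str]] = {
        "prompt": [], "negative_prompt": [], "input_image": [], "seed": [],
    }
    for node_id, node in workflow.items():
        class_type = node.get("class_type", "")
        title = node.get("_meta", {}).get("title", "")
        inputs = node.get("inputs", {})
        if class_type == "CLIPTextEncode" and "Positive Prompt" in title:
            found["prompt"].append(node_id)
        elif class_type == "CLIPTextEncode" and "Negative Prompt" in title:
            found["negative_prompt"].append(node_id)
        elif class_type == "LoadImage":
            found["input_image"].append(node_id)
        # Seed detection is independent of node type.
        if "seed" in inputs or "noise_seed" in inputs:
            found["seed"].append(node_id)
    return found


def randomize_seeds(workflow: Workflow, seed_nodes: list[str], rng: random.Random) -> None:
    """Replace literal integer seeds in place (hypothetical helper)."""
    for node_id in seed_nodes:
        inputs = workflow[node_id]["inputs"]
        for key in ("seed", "noise_seed"):
            # Linked inputs are lists pointing at another node's output; leave them alone.
            if isinstance(inputs.get(key), int):
                inputs[key] = rng.randrange(2**32)


# Made-up four-node workflow for illustration only.
example: Workflow = {
    "3": {"class_type": "KSampler", "inputs": {"seed": 1, "steps": 20}},
    "6": {"class_type": "CLIPTextEncode", "inputs": {"text": "a cat"},
          "_meta": {"title": "Positive Prompt"}},
    "7": {"class_type": "CLIPTextEncode", "inputs": {"text": "blurry"},
          "_meta": {"title": "Negative Prompt"}},
    "10": {"class_type": "LoadImage", "inputs": {"image": "ref.png"}},
}

found = discover_inputs(example)
randomize_seeds(example, found["seed"], random.Random(0))
print(found)
```

Matching on class type and title rather than on node IDs is what lets one set of rules apply to any exported workflow, at the cost of requiring the title conventions the README describes. The 32-bit seed range here is an arbitrary choice for the sketch.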