feat(webchat): support image attachments
This commit is contained in:
@@ -0,0 +1,423 @@
|
||||
# Webchat Attachment Support Implementation Plan
|
||||
|
||||
**Date:** 2026-02-13
|
||||
**Status:** Planning
|
||||
**Scope:** Minimal — Image attachments first (audio as follow-up)
|
||||
|
||||
## Overview
|
||||
|
||||
Enable webchat users to send image attachments through the gateway to the agent. This completes the deferred "webchat" phase from the P4 media pipeline (phase `6c_gateway_protocol_attachments` marked the protocol-level support as complete, but UI integration was deferred).
|
||||
|
||||
## Current State
|
||||
|
||||
- **Protocol:** `GatewayAttachment` type exists in `src/gateway/protocol.ts` (added in phase `6c_gateway_protocol_attachments`)
|
||||
- **Backend:** `agent.send` handler in `src/gateway/handlers/agent.ts` already accepts optional `attachments` parameter and converts them to channel `Attachment[]` format
|
||||
- **Media pipeline:** All model clients support multimodal messages (images via base64 or URL)
|
||||
- **Other channels:** Telegram, Discord, Slack, WhatsApp all extract and pass image attachments
|
||||
- **Webchat:** Currently text-only
|
||||
|
||||
## Scope
|
||||
|
||||
### In Scope
|
||||
- Image attachments (jpeg, png, gif, webp)
|
||||
- File input UI in webchat
|
||||
- Client-side base64 encoding
|
||||
- Gateway protocol attachment parameter
|
||||
- Basic error handling (file size, type validation)
|
||||
- Tests for gateway attachment handling
|
||||
- Update `docs/plans/state.json` (mark phase `6d_webchat_attachments` complete)
|
||||
|
||||
### Out of Scope (Follow-up)
|
||||
- Audio attachments (requires different UX — separate mic button, audio preview)
|
||||
- Document attachments (PDF, etc.)
|
||||
- Image preview/thumbnail UI in message list
|
||||
- Drag-and-drop file upload
|
||||
- Multiple file selection
|
||||
- Image compression/resizing client-side
|
||||
|
||||
## Implementation Plan
|
||||
|
||||
### Phase 1: UI Changes (Frontend)
|
||||
|
||||
**File:** `src/gateway/ui/pages/chat.js`
|
||||
|
||||
#### 1.1 Add File Input Element
|
||||
Add a hidden file input and an "attach" button next to the send button.
|
||||
|
||||
```html
|
||||
<!-- Inside the chat input wrapper -->
|
||||
<input type="file" id="file-input" accept="image/jpeg,image/png,image/gif,image/webp" style="display:none" />
|
||||
<button id="attach-btn" class="btn-icon" title="Attach image">📎</button>
|
||||
<button id="send-btn" class="btn-primary">Send</button>
|
||||
```
|
||||
|
||||
**Tasks:**
|
||||
- Add file input and attach button to `render()` function
|
||||
- Store references in `_elements` object
|
||||
- Wire click handler: attach button triggers file input
|
||||
- Accept only image MIME types
|
||||
|
||||
#### 1.2 File Selection Handler
|
||||
Read selected file, validate, encode to base64, store in component state.
|
||||
|
||||
```js
|
||||
let _selectedFile = null; // { mimeType, data, filename }
|
||||
|
||||
async function handleFileSelect(event) {
|
||||
const file = event.target.files?.[0];
|
||||
if (!file) return;
|
||||
|
||||
// Validate type
|
||||
const supportedTypes = ['image/jpeg', 'image/png', 'image/gif', 'image/webp'];
|
||||
if (!supportedTypes.includes(file.type)) {
|
||||
showSystemMessage('❌ Unsupported file type. Only JPEG, PNG, GIF, and WebP images are supported.');
|
||||
return;
|
||||
}
|
||||
|
||||
// Validate size (5MB limit)
|
||||
const MAX_SIZE = 5 * 1024 * 1024;
|
||||
if (file.size > MAX_SIZE) {
|
||||
showSystemMessage('❌ File too large. Maximum size is 5MB.');
|
||||
return;
|
||||
}
|
||||
|
||||
// Read and encode
|
||||
const reader = new FileReader();
|
||||
reader.onload = () => {
|
||||
const base64 = reader.result.split(',')[1]; // Strip data URI prefix
|
||||
_selectedFile = {
|
||||
mimeType: file.type,
|
||||
data: base64,
|
||||
filename: file.name,
|
||||
};
|
||||
updateAttachmentUI();
|
||||
};
|
||||
reader.onerror = () => {
|
||||
showSystemMessage('❌ Failed to read file.');
|
||||
};
|
||||
reader.readAsDataURL(file);
|
||||
}
|
||||
```
|
||||
|
||||
**Tasks:**
|
||||
- Implement `handleFileSelect()`
|
||||
- Validate MIME type against `['image/jpeg', 'image/png', 'image/gif', 'image/webp']`
|
||||
- Validate file size (5MB max)
|
||||
- Read file as base64 using `FileReader`
|
||||
- Store in `_selectedFile` state variable
|
||||
- Clear file input after reading
|
||||
|
||||
#### 1.3 Attachment UI Indicator
|
||||
Show selected file name and allow removal before sending.
|
||||
|
||||
```js
|
||||
function updateAttachmentUI() {
|
||||
const indicator = _elements.attachmentIndicator;
|
||||
if (!indicator) return;
|
||||
|
||||
if (_selectedFile) {
|
||||
indicator.textContent = `📎 ${_selectedFile.filename}`;
|
||||
indicator.classList.remove('hidden');
|
||||
} else {
|
||||
indicator.textContent = '';
|
||||
indicator.classList.add('hidden');
|
||||
}
|
||||
}
|
||||
|
||||
function clearAttachment() {
|
||||
_selectedFile = null;
|
||||
const fileInput = _elements.fileInput;
|
||||
if (fileInput) fileInput.value = '';
|
||||
updateAttachmentUI();
|
||||
}
|
||||
```
|
||||
|
||||
**UI element (add to input wrapper):**
|
||||
```html
|
||||
<div id="attachment-indicator" class="attachment-indicator hidden">
|
||||
<span class="attachment-name"></span>
|
||||
<button class="btn-clear" title="Remove">×</button>
|
||||
</div>
|
||||
```
|
||||
|
||||
**Tasks:**
|
||||
- Add attachment indicator element to UI
|
||||
- Implement `updateAttachmentUI()` to show/hide indicator
|
||||
- Implement `clearAttachment()` to reset state
|
||||
- Add clear button handler
|
||||
|
||||
#### 1.4 Update Send Handler
|
||||
Pass `attachments` array to `agent.send` RPC call.
|
||||
|
||||
```js
|
||||
async function sendMessage() {
|
||||
const text = _elements.input.value.trim();
|
||||
const attachment = _selectedFile;
|
||||
|
||||
if (!text && !attachment) return; // Nothing to send
|
||||
|
||||
// ... existing command parsing ...
|
||||
|
||||
// Prepare RPC params
|
||||
const params = { message: text };
|
||||
if (attachment) {
|
||||
params.attachments = [attachment];
|
||||
}
|
||||
|
||||
// Add user message to UI (with attachment indicator if present)
|
||||
let displayText = text;
|
||||
if (attachment) {
|
||||
displayText = `📎 ${attachment.filename}\n${text}`;
|
||||
}
|
||||
_elements.messages.appendChild(createMessageEl('user', displayText));
|
||||
|
||||
// Clear input and attachment
|
||||
_elements.input.value = '';
|
||||
clearAttachment();
|
||||
|
||||
// Send RPC
|
||||
try {
|
||||
await client.call('agent.send', params, handleStreamingEvent);
|
||||
} catch (err) {
|
||||
// ... error handling ...
|
||||
}
|
||||
}
|
||||
```
|
||||
|
||||
**Tasks:**
|
||||
- Modify `sendMessage()` to include `attachments` in RPC params if `_selectedFile` is set
|
||||
- Update user message display to show attachment indicator
|
||||
- Clear `_selectedFile` after sending
|
||||
- Allow sending attachment-only messages (text optional)
|
||||
|
||||
#### 1.5 CSS Styling
|
||||
|
||||
**File:** `src/gateway/ui/style.css`
|
||||
|
||||
```css
|
||||
/* File input and attach button */
|
||||
#attach-btn {
|
||||
background: transparent;
|
||||
border: none;
|
||||
font-size: 1.2rem;
|
||||
cursor: pointer;
|
||||
padding: 0.5rem;
|
||||
opacity: 0.7;
|
||||
}
|
||||
|
||||
#attach-btn:hover {
|
||||
opacity: 1;
|
||||
}
|
||||
|
||||
/* Attachment indicator */
|
||||
.attachment-indicator {
|
||||
display: flex;
|
||||
align-items: center;
|
||||
gap: 0.5rem;
|
||||
padding: 0.5rem;
|
||||
background: var(--primary-bg);
|
||||
border: 1px solid var(--border);
|
||||
border-radius: 4px;
|
||||
margin-bottom: 0.5rem;
|
||||
font-size: 0.9rem;
|
||||
}
|
||||
|
||||
.attachment-indicator.hidden {
|
||||
display: none;
|
||||
}
|
||||
|
||||
.attachment-indicator .btn-clear {
|
||||
background: transparent;
|
||||
border: none;
|
||||
color: var(--text-muted);
|
||||
cursor: pointer;
|
||||
font-size: 1.2rem;
|
||||
line-height: 1;
|
||||
padding: 0 0.25rem;
|
||||
}
|
||||
|
||||
.attachment-indicator .btn-clear:hover {
|
||||
color: var(--text-primary);
|
||||
}
|
||||
```
|
||||
|
||||
**Tasks:**
|
||||
- Add styles for attach button
|
||||
- Add styles for attachment indicator
|
||||
- Add styles for clear button
|
||||
- Ensure responsive layout
|
||||
|
||||
### Phase 2: Testing
|
||||
|
||||
#### 2.1 Backend Integration Tests
|
||||
|
||||
**File:** `src/gateway/handlers/handlers.test.ts` (extend existing tests)
|
||||
|
||||
Add tests for `agent.send` with attachments:
|
||||
|
||||
```ts
|
||||
describe('agent.send with attachments', () => {
|
||||
it('should accept image attachments and convert to channel format', async () => {
|
||||
const params = {
|
||||
message: 'What is this?',
|
||||
attachments: [
|
||||
{
|
||||
mimeType: 'image/jpeg',
|
||||
data: 'base64data',
|
||||
filename: 'photo.jpg',
|
||||
},
|
||||
],
|
||||
};
|
||||
// ... test that agent.process receives correct Attachment[]
|
||||
});
|
||||
|
||||
it('should allow attachment-only messages', async () => {
|
||||
const params = {
|
||||
message: '',
|
||||
attachments: [{ mimeType: 'image/png', data: 'base64' }],
|
||||
};
|
||||
// ... test that message is processed
|
||||
});
|
||||
|
||||
it('should handle missing data/url gracefully', async () => {
|
||||
const params = {
|
||||
message: 'test',
|
||||
attachments: [{ mimeType: 'image/jpeg' }], // no data or url
|
||||
};
|
||||
// ... test graceful handling (attachment ignored or error)
|
||||
});
|
||||
});
|
||||
```
|
||||
|
||||
**Tasks:**
|
||||
- Add tests for `agent.send` with `attachments` parameter
|
||||
- Verify conversion from `GatewayAttachment[]` to `Attachment[]`
|
||||
- Test attachment-only messages (empty text)
|
||||
- Test malformed attachments (missing data/url)
|
||||
|
||||
#### 2.2 Manual Testing Checklist
|
||||
|
||||
**Test cases:**
|
||||
1. Select and send a JPEG image → agent receives image, responds with description
|
||||
2. Select and send a PNG image → agent receives image
|
||||
3. Select an invalid file type (e.g., .txt) → error message shown, file rejected
|
||||
4. Select a large file (>5MB) → error message shown, file rejected
|
||||
5. Select an image, then clear it before sending → no attachment sent
|
||||
6. Send attachment-only message (no text) → agent receives image
|
||||
7. Send text + image → agent receives both
|
||||
8. Send message, then send another with attachment → both messages processed correctly
|
||||
9. Test on mobile (touch events for attach button)
|
||||
|
||||
**Tasks:**
|
||||
- Run full test suite: `pnpm test:run`
|
||||
- Run typecheck: `pnpm typecheck`
|
||||
- Manual testing in browser (Desktop Chrome, Firefox)
|
||||
- Manual testing on mobile (Android/iOS)
|
||||
|
||||
### Phase 3: Documentation Updates
|
||||
|
||||
#### 3.1 Update `docs/plans/state.json`
|
||||
|
||||
Add new phase entry under `p6-enhanced-media-pipeline`:
|
||||
|
||||
```json
|
||||
"6d_webchat_attachments": {
|
||||
"priority": "P6",
|
||||
"status": "completed",
|
||||
"description": "Webchat UI for image attachments: file input, base64 encoding, attachment indicator, send handler, gateway protocol integration",
|
||||
"files_modified": [
|
||||
"src/gateway/ui/pages/chat.js",
|
||||
"src/gateway/ui/style.css",
|
||||
"src/gateway/handlers/handlers.test.ts"
|
||||
],
|
||||
"test_status": "manual + integration tests passing"
|
||||
}
|
||||
```
|
||||
|
||||
**Tasks:**
|
||||
- Add `6d_webchat_attachments` phase
|
||||
- Update `p6-enhanced-media-pipeline` summary to include webchat
|
||||
- Update overall test count if applicable
|
||||
|
||||
#### 3.2 Update README or User Docs (if needed)
|
||||
|
||||
**File:** `README.md` (webchat section)
|
||||
|
||||
Add note about attachment support:
|
||||
|
||||
```md
|
||||
### Webchat Features
|
||||
|
||||
- Real-time chat with streaming responses
|
||||
- Markdown rendering with syntax highlighting
|
||||
- Slash commands (/help, /reset, /compact, /usage, /status, /model)
|
||||
- Web search toggle
|
||||
- **Image attachments** (JPEG, PNG, GIF, WebP up to 5MB)
|
||||
- Message actions (copy, edit)
|
||||
```
|
||||
|
||||
**Tasks:**
|
||||
- Update README webchat section (if exists)
|
||||
- Add attachment feature to feature list
|
||||
|
||||
## File Changes Summary
|
||||
|
||||
### Modified Files
|
||||
1. `src/gateway/ui/pages/chat.js`
|
||||
- Add file input element and attach button
|
||||
- Implement `handleFileSelect()` for base64 encoding
|
||||
- Add `_selectedFile` state variable
|
||||
- Add `updateAttachmentUI()` and `clearAttachment()` helpers
|
||||
- Modify `sendMessage()` to include attachments in RPC params
|
||||
- Update user message display to show attachment indicator
|
||||
|
||||
2. `src/gateway/ui/style.css`
|
||||
- Styles for attach button
|
||||
- Styles for attachment indicator
|
||||
- Styles for clear button
|
||||
|
||||
3. `src/gateway/handlers/handlers.test.ts`
|
||||
- Add tests for `agent.send` with attachments
|
||||
- Test attachment-only messages
|
||||
- Test malformed attachments
|
||||
|
||||
4. `docs/plans/state.json`
|
||||
- Add `6d_webchat_attachments` phase entry
|
||||
|
||||
5. `README.md` (optional)
|
||||
- Update webchat feature list
|
||||
|
||||
### No Changes Needed
|
||||
- `src/gateway/protocol.ts` (already has `GatewayAttachment` type)
|
||||
- `src/gateway/handlers/agent.ts` (already handles attachments parameter)
|
||||
- `src/models/media.ts` (already has conversion logic)
|
||||
- `src/channels/types.ts` (already has `Attachment` type)
|
||||
|
||||
## Follow-up Work (Not in Scope)
|
||||
|
||||
### Audio Attachments
|
||||
- Requires different UX: separate mic button, recording UI, audio waveform preview
|
||||
- Audio transcription via Whisper (already supported in backend)
|
||||
- Estimate: 1-2 days
|
||||
|
||||
### Enhanced UX
|
||||
- Image preview/thumbnail in message list
|
||||
- Drag-and-drop file upload
|
||||
- Multiple file selection
|
||||
- Client-side image compression/resizing (reduce bandwidth)
|
||||
- Copy/paste image from clipboard
|
||||
- Estimate: 2-3 days
|
||||
|
||||
## Notes
|
||||
|
||||
- **Security:** Client-side validation is not sufficient. Backend should also validate MIME types and file sizes if needed (currently delegated to model providers).
|
||||
- **Performance:** 5MB limit keeps base64 payloads reasonable for WebSocket messages. Larger files should use URL-based attachments (out of scope).
|
||||
- **Compatibility:** `FileReader` API is widely supported (IE10+, all modern browsers).
|
||||
- **Mobile:** File input works on mobile browsers (triggers camera/gallery picker).
|
||||
|
||||
## Estimated Effort
|
||||
|
||||
- **Phase 1 (UI):** 2-3 hours
|
||||
- **Phase 2 (Testing):** 1 hour
|
||||
- **Phase 3 (Docs):** 30 minutes
|
||||
- **Total:** ~4 hours
|
||||
+10
-3
@@ -444,8 +444,15 @@
|
||||
"files_modified": ["src/channels/whatsapp/adapter.ts"]
|
||||
},
|
||||
"webchat": {
|
||||
"status": "deferred",
|
||||
"description": "Requires gateway protocol update for WebSocket attachment messages"
|
||||
"status": "completed",
|
||||
"description": "Webchat UI can send image attachments via gateway agent.send attachments parameter; agent handler accepts attachment-only messages",
|
||||
"files_modified": [
|
||||
"src/gateway/ui/pages/chat.js",
|
||||
"src/gateway/ui/chat.html",
|
||||
"src/gateway/ui/style.css",
|
||||
"src/gateway/handlers/agent.ts",
|
||||
"src/gateway/handlers/handlers.test.ts"
|
||||
]
|
||||
}
|
||||
}
|
||||
}
|
||||
@@ -1827,7 +1834,7 @@
|
||||
},
|
||||
|
||||
"overall_progress": {
|
||||
"total_test_count": 1617,
|
||||
"total_test_count": 1625,
|
||||
"all_tests_passing": true,
|
||||
"p0_completion": "3/3 (100%)",
|
||||
"p1_completion": "4/4 (100%)",
|
||||
|
||||
Reference in New Issue
Block a user