Commit Graph

7 Commits

Author SHA1 Message Date
15c650f942 Allow extra fields in ChatCompletionRequest to fix 422 errors
- Add model_config with extra='allow' to ChatCompletionRequest
- Fixes 422 validation errors when OpenAI-compatible clients send unknown fields
- Maintains backward compatibility with all existing fields
- Tested with unknown_field and extra_param - request succeeds
2026-02-23 13:03:09 -05:00
27d9bfbad8 Add support for 'developer' role in chat messages
- Add 'developer' to accepted role literals in ChatMessage model
- Existing normalization in chat.py converts developer → system before watsonx API call
- Fixes 422 validation errors when clients send developer role messages
- Maintains backward compatibility with all existing roles
2026-02-23 12:50:23 -05:00
0ed0ae9ce8 Update DEPLOYMENT.md with critical .env format requirements
- Emphasize that inline comments cause service startup failures
- Show correct .env format without inline comments
- Add example error message when inline comments are present
- Include sed command to fix existing .env files with inline comments
- Add model mapping examples in correct format
2026-02-23 12:20:16 -05:00
d924b7c45f Add vLLM message normalization for OpenAI content format compatibility
- Normalize 'developer' role to 'system' (vLLM doesn't support developer role)
- Flatten array content to string for text-only messages
- Preserve mixed content (text + images) as array
- Add comprehensive unit tests for normalization logic

Fixes HTTP 422 errors when clients send OpenAI multi-content format
2026-02-23 11:59:23 -05:00
debfb466ad Add comprehensive deployment guide with systemd service setup and LXC configuration 2026-02-23 11:14:40 -05:00
92bc754316 Add MODELS.md with comprehensive list of available watsonx.ai models 2026-02-23 10:14:25 -05:00
2e2b817435 Add AGENTS.md documentation for AI agent guidance 2026-02-23 09:59:52 -05:00