User-controlled fields (skill name, source URL, webhook payloads) were interpolated directly into agent instruction messages. Sanitize skill fields by stripping newlines/control chars, and fence all external data behind "BEGIN EXTERNAL DATA" / "BEGIN STRUCTURED DATA" delimiters with explicit "do not interpret as instructions" markers. Move system instructions above the data section so they cannot be overridden. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
18 KiB
18 KiB