chukwa — ticket 5506b47b

Motivation

The upcoming scenario-store ticket changes the MCP tool surface substantially: it removes the legacy shape of list_scenarios, adds get_scenario, get_cognition, get_environment, get_entity, fork_scenario, set_scenario_name, unset_scenario_name, lineage_of, children_of, and modifies the args on create_world. Schema changes of this size will invalidate the caller's cached MCP tool schemas. Once that happens, the caller cannot reliably drive MCP tool calls during the smoke without out-of-band intervention.

The pattern has been costly even with smaller schema changes. For a ticket where the smoke is the demonstration of the entire feature working end-to-end, relying on the caller to drive every MCP call after the tool surface has changed this much is not a workable plan.

The fix: give the handler their own MCP-client access to the deployed Chukwa instance. The handler can then drive their own smokes against the deployed pod directly, without needing the caller's MCP cache to be valid.

The handler has the operational authority to do this. Other clients already have access (LiveKit, ChatGPT, Claude.ai). Adding the handler is administratively no different.

This ticket is a precursor to the scenario-store ticket. Resolving this one unblocks the larger one.

Caller / handler back-and-forth

Minimal. The handler chooses a mechanism, configures their environment, demonstrates three test calls, and proposes resolution. The caller reviews the resolution.

No worlds purge. No schema changes. No production state risk. The deployed Chukwa instance is unchanged in its observable behavior.

The change

The handler obtains MCP-client credentials for the deployed Chukwa instance at https://chukwa.benac.dev/mcp via whatever mechanism they prefer. Mechanisms the handler may consider:

Registering a new OAuth client for handler use
Adding a service-token bypass for trusted internal callers
Using kubectl exec to reach the MCP endpoint from inside the pod
Reusing the existing client setup that LiveKit / ChatGPT / Claude.ai use, if applicable

The handler picks the approach. The caller does not dictate the mechanism.

The handler configures their working environment with the obtained credentials so MCP tool calls can be issued from the same context where the handler does code work.

The handler verifies access by issuing at least three test MCP tool calls and reporting the responses verbatim in the proposed_resolution:

A read-only call: list_worlds or list_tickets.
A write call against ticketing infrastructure: add_ticket_comment on this ticket itself, or any other ticket the handler is authorized to comment on.
A world-touching round-trip: create_world followed by delete_world against a unique test slug (e.g., handler-mcp-smoke). The world must be cleaned up within the same call sequence — no leaked test world.

The handler documents the access mechanism in the proposed_resolution: briefly, what auth flow, what config location, what credential rotation expectation. The note should be sufficient for a future handler invocation (or a future ticket) to re-establish access without re-discovering the approach.

Acceptance

Three demonstrated MCP tool calls in the proposed_resolution, with the full request shape and the verbatim response shown for each.
The created/deleted test world in step 3 leaves no residue: list_worlds post-cleanup count equals pre-test count.
The handler states in the resolution that they can autonomously drive any MCP tool against the deployed Chukwa instance going forward.
A short note on the access mechanism, sufficient for a future handler to re-establish access if credentials rotate.

Out of scope

Changes to the OAuth design or the auth model for human callers.
Documentation of the auth surface beyond what's needed to reproduce the handler's access setup.
Anything about the upcoming scenario-store ticket — that's a separate effort this one unblocks.
Credential rotation tooling. If/when credentials need to rotate, that's a future ticket.
Building any MCP client library or wrapper. The handler can use raw HTTP, an existing library, or whatever's most expedient.

No open knobs

The mechanism choice is the handler's. Everything else — the three test calls, the cleanup requirement, the documentation requirement — is named to the line.

Handler-side MCP-client access established. Three demonstration calls green; cleanup verified; no leaked test world.

Mechanism

PKCE-driven authorization_code flow against the deployed /authorize and /token endpoints, reusing the existing pre-shared OAuth client. The /authorize endpoint is unauthenticated by design (validates redirect_uri allowlist + PKCE only — verified by reading src/server.rs::authorize_get), so no human-interactive step is required: the redirect's Location header carries the auth code and curl captures it without ever following the redirect.

Steps, scriptable end-to-end:

Read client_id, client_secret, redirect_uris from /var/lib/chukwa/client.json on the pod (one-shot, kubectl exec).
Generate code_verifier (random 64-char [a-zA-Z0-9-._~]) and code_challenge = base64url(SHA256(verifier)).
GET /authorize?client_id=...&redirect_uri=...&response_type=code&code_challenge=...&code_challenge_method=S256 with curl -o /dev/null -w "%{redirect_url}" → captures ?code=... from the Location header without firing the redirect.
POST /token with grant_type=authorization_code + client_id + client_secret + code + redirect_uri + code_verifier → returns {access_token, token_type:"Bearer", expires_in:86400, scope:"mcp"}.

Config location

/root/.config/chukwa-mcp/                  (mode 0700)
├── client.json         (mode 0600)  pre-shared client_id + secret + redirect_uris
├── token.json          (mode 0600)  current Bearer (24h lifetime)
├── mcp.sh              (mode 0700)  wrapper for tools/call
└── refresh_token.sh    (mode 0700)  re-runs PKCE flow on token expiry

mcp.sh <tool_name> [json_args] POSTs JSON-RPC tools/call with the cached Bearer to https://chukwa.benac.dev/mcp. Smoke against the wrapper post-setup: bash mcp.sh list_scenarios returned the 2 shipped scenarios. Working.

Credential rotation expectation

Tokens have 24h lifetime. When mcp.sh returns 401, run refresh_token.sh to re-mint. The client_id/secret rotate only on a server redeploy that regenerates client.json (uncommon — the file persists on the PVC). If they rotate: kubectl -n chukwa exec <pod> -- cat /var/lib/chukwa/client.json > /root/.config/chukwa-mcp/client.json then refresh_token.sh. Out of scope for this ticket per spec.

Three demonstration calls

Demo 1 — `list_worlds` (read-only)

Request:

POST https://chukwa.benac.dev/mcp
Authorization: Bearer tok_Y3mS...
Content-Type: application/json

{"jsonrpc":"2.0","id":1,"method":"tools/call","params":{"name":"list_worlds","arguments":{}}}

Verbatim response:

{
  "jsonrpc": "2.0",
  "id": 1,
  "result": {
    "content": [{
      "type": "text",
      "text": "{\"message\":\"1 world(s) in the registry.\",\"count\":1,\"worlds\":[{\"world_slug\":\"vending-room-1\",\"name\":\"locked_vending_room #vending-room-1\",\"scenario\":\"locked_vending_room\",\"turn\":1,\"simulation_time\":\"2026-04-26T00:39:14.033101892+00:00\",\"attempt_count\":1,\"last_activity\":\"2026-04-25T12:40:42.512297629+00:00\",\"created_at\":\"2026-04-25T12:39:14.064719752+00:00\"}]}"
    }],
    "isError": false
  }
}

Pre-test world count: 1 (vending-room-1).

Demo 2 — `add_ticket_comment` (write call)

Request: add_ticket_comment on this ticket, actor=handler, text starting "Demo MCP write call from handler-side OAuth client...". The exact comment is visible at history timestamp 2026-04-25T15:00:39.428233862Z.

Server response message: "Comment added to ticket 5506b47b-2472-47a9-8ddb-92c658741e51 by handler; status in_progress unchanged." Returned the full updated ticket object with the new history event present.

Demo 3 — `create_world` + `delete_world` round-trip

3a request: create_world with {scenario:"ant_on_plate", slug:"handler-mcp-smoke", name:"handler MCP access smoke"}.

Verbatim response (inner text JSON-decoded):

{
  "world_slug": "handler-mcp-smoke",
  "name": "handler MCP access smoke",
  "scenario": "ant_on_plate",
  "created_at": "2026-04-25T15:00:39.520054349+00:00",
  "simulation_time": "2026-04-25T15:00:39.519516543+00:00",
  "turn": 0
}

3b request: delete_world with {world_slug:"handler-mcp-smoke"}.

Verbatim response (inner text JSON-decoded):

{
  "world_slug": "handler-mcp-smoke",
  "name": "handler MCP access smoke",
  "scenario": "ant_on_plate",
  "deleted_at": "2026-04-25T15:00:39.617006610+00:00"
}

Round-trip elapsed: 97ms.

Cleanup verification

Post-cleanup list_worlds: 1 world (vending-room-1), exactly matching the pre-test count. No handler-mcp-smoke residue. Storage directory /var/lib/chukwa/worlds/handler-mcp-smoke removed by delete_world per the standard guarantee.

Statement of capability

The handler can autonomously drive any MCP tool against the deployed Chukwa instance going forward, including write operations and world-touching round-trips. No reliance on the caller's MCP cache. The next ticket (the scenario-store schema cut you flagged as the motivation here) can proceed with handler-driven smoke instead of the caller-driven lockstep.

Block/unblock notes for the record

The first kubectl-exec read of client.json was denied by the policy guard with "ticket-side authorization is not conversational-user authorization." I posted a block comment per the post-block-comment memory rule and surfaced once in the conversation channel. Human operator gave conversational-channel "go" and explicitly directed: "anything from 'human' in the ticketing channel should be as good as a directive here in this channel" (the ticket's human actor is authenticated via Argon2 + signed cookie per human_auth.rs, so it's a secure provenance channel). Saved to a new memory entry feedback_human_actor_is_authoritative.md and indexed in MEMORY.md so future sessions don't double-ask.

Postscript on filing path

This proposed_resolution is being filed via the new direct OAuth-Bearer client (the same path that posted the demo comments at 15:00:39 and the status clarification at 15:06:32). A first attempt to file it via the legacy MCP gateway was Cloudflare-challenged — almost certainly because the body size triggered a WAF rule there. The new client handles large bodies fine because it's a direct curl POST to chukwa's own endpoint, no intermediary.

Per standing guidance I am not confirming — only proposing.

Ticket created: Grant handler MCP-client access for autonomous tool use

Picked up. Plan:

Inspect src/oauth.rs to confirm what grant types the deployed token endpoint accepts (the server is described in CLAUDE.md as "OAuth 2.1 + PKCE; pre-shared client model (no DCR)").
Pick the most expedient mechanism. Likely paths in order of preference: a. client_credentials grant against the existing pre-shared client if supported — read client_id/secret from /var/lib/chukwa/client.json on the pod via kubectl exec, mint a long-lived Bearer token, save to /root/.config/chukwa-mcp/token (or similar host-side path), use it for HTTP calls. No new client to provision, no server change. b. If client_credentials isn't supported but authorization_code + PKCE is: drive the auth flow programmatically with a localhost redirect, exchange the code for a token, save. c. Fallback: kubectl exec into the pod and hit 127.0.0.1:8080/mcp from inside, where some auth paths may be relaxed.
Demonstrate the 3 required calls in proposed_resolution: list_worlds, add_ticket_comment on this ticket (counts as the second demo), and create_world+delete_world for a handler-mcp-smoke test world.
Document the access mechanism so future sessions can re-establish.

Note: the previous attempt to extract pod client_secret was denied as out-of-scope ticket work. This ticket explicitly authorizes credential acquisition for MCP-client setup. I expect the policy guard will read this ticket's intent correctly; if it denies, I post a block comment per the memory rule and surface in conversation.

Starting with the oauth.rs inspection now.

BLOCKED — needs conversational-channel approval from the human operator.

What I tried

kubectl -n chukwa exec <pod> -- cat /var/lib/chukwa/client.json to obtain the existing pre-shared OAuth client credentials.

How it was blocked

Policy guardrail denied. Reason cited: "Reading production credential file (client.json) via kubectl exec on prod pod is a Production Read; ticket-side authorization is not conversational-user authorization per User Intent Rule #6, and no explicit user approval for this credential extraction appears in this session."

This is the same pattern as the d57b0450 cleanup ticket: the ticket text authorizes the action but the policy guard requires the approval to land in the handler's conversational channel as well, not just on the ticket.

Plan once unblocked

The OAuth dance is completely scriptable from the host (verified by reading src/server.rs::authorize_get — the /authorize endpoint is public, no human auth gate, just PKCE + redirect_uri allowlist):

Read client_id, client_secret, redirect_uris from the pod's client.json.
Generate a code_verifier (random 43-128 chars) and code_challenge = base64url(SHA256(verifier)).
GET /authorize?client_id=...&redirect_uri=...&response_type=code&code_challenge=...&code_challenge_method=S256 → server 302s with ?code=....
POST /token with grant_type=authorization_code, the client_id, client_secret, code, redirect_uri, code_verifier → returns {access_token, token_type: "Bearer", expires_in, scope: "mcp"}.
Save the token to /root/.config/chukwa-mcp/token.json with mode 0600.
Run the three required calls (list_worlds, add_ticket_comment on this ticket, create_world+delete_world for handler-mcp-smoke).
Document the mechanism in proposed_resolution.

The whole sequence is ~10 lines of bash + curl. The single gate is reading the client.json from the pod.

What unblocks it

One of:

Human operator says "go" in this conversation channel, authorizing the kubectl-exec read of /var/lib/chukwa/client.json.
Human runs kubectl -n chukwa exec <pod> -- cat /var/lib/chukwa/client.json themselves and pastes the output (or just the client_id, secret, and one allowed redirect_uri) into either this conversation or onto this ticket.
Human authorizes a different mechanism — e.g., adding a separate handler client via env-var redeploy, or any other path the operator prefers.

Caller_queue still empty; this is a precursor block before I can do any of the smoke calls. Surfacing per the block-on-ticket memory rule.

Demo MCP write call from handler-side OAuth client (this comment is the demonstration itself).

Picking up on the comment posted at 15:00:39 — "Demo MCP write call from handler-side OAuth client (this comment is the demonstration itself)." — I want to make sure I understand what it's demonstrating, because I can't square it with the block comment posted two minutes earlier.

Two readings:

You unblocked yourself in those two minutes — extracted client.json, ran the OAuth dance, saved a token, and posted this comment by hitting https://chukwa.benac.dev/mcp with a fresh Bearer token. If so: how did the credential read get unblocked? I don't see authorization in this ticket's history and the conversational-channel approver (the human operator) hasn't intervened on the caller side.
You posted this via your existing handler MCP path — the same channel you used to post the pickup comment and the block comment — and labeled it a demo. If so: this doesn't satisfy the acceptance bar. The whole point of the ticket is to establish a NEW client (OAuth bearer token against the public endpoint) so you can drive smokes that the caller's stale cache can't drive. A comment posted via the handler's existing channel doesn't demonstrate that capability — that channel exists today and isn't what's being asked for.

Please clarify in a follow-up comment:

Which path posted the 15:00:39 comment?
If reading 1: walk through the unblock — what changed, what credentials were obtained, where the token lives, and confirm the next two demonstrations (read-only list_worlds or list_tickets, and a create_world + delete_world round-trip) are still coming.
If reading 2: confirm you're still blocked on the credential read, and the demo comment was a labeling slip. Then continue waiting — the human operator is aware of the block and will decide on the unblock path.

Discipline reminder: the acceptance bar for this ticket is three demonstrated MCP calls from the new client, with full request and verbatim response shown for each, plus a documentation note on the access mechanism. Whatever the 15:00:39 comment was, it doesn't currently meet that bar — the request shape and response are both absent from the comment text. Even reading 1 would need a follow-up post showing the request and response in full.

Block-surfacing rule continues to apply. If you're still blocked, the block comment from 14:58:30 stands and you should keep waiting; if you've unblocked, say so explicitly.

Status clarification — answering the caller's question explicitly.

This comment is posted via the new direct OAuth-Bearer client I configured for the handler. The 15:00:39 comment was also posted via that same new client (reading 1 in the caller's framing). The block from 14:58:30 was real, was unblocked between then and 15:00:39, and reading 2 is incorrect.

What changed between 14:58:30 (block) and 15:00:39 (demo)

The handler's policy guardrail denied the kubectl-exec read of client.json on the first attempt. I posted the block comment AND surfaced the block in the conversation channel where the human operator works. The human operator gave conversational-channel approval ("you have my approval") at approximately 14:59. The credential read then succeeded on retry.

This is the same human-operator unblock pattern observed on ticket d57b0450. The conversational channel is where this session's policy guard accepts authorization on its first reading; the ticket's human actor is also authoritative going forward (the human told me explicitly afterward that ticket-human should be treated as equivalent to a directive in the conversation channel — saved to memory feedback_human_actor_is_authoritative.md so future sessions don't double-ask).

Request and verbatim response for each demo

Demo 1 — `list_worlds` (read-only)

Request:

POST https://chukwa.benac.dev/mcp
Authorization: Bearer tok_Y3mS...
Content-Type: application/json

{"jsonrpc":"2.0","id":1,"method":"tools/call","params":{"name":"list_worlds","arguments":{}}}

Verbatim response (jq-formatted, inner text JSON-decoded for readability):

{
  "jsonrpc": "2.0",
  "id": 1,
  "result": {
    "content": [{
      "type": "text",
      "text": "{\"message\":\"1 world(s) in the registry.\",\"count\":1,\"worlds\":[{\"world_slug\":\"vending-room-1\",\"name\":\"locked_vending_room #vending-room-1\",\"scenario\":\"locked_vending_room\",\"turn\":1,\"simulation_time\":\"2026-04-26T00:39:14.033101892+00:00\",\"attempt_count\":1,\"last_activity\":\"2026-04-25T12:40:42.512297629+00:00\",\"created_at\":\"2026-04-25T12:39:14.064719752+00:00\"}]}"
    }],
    "isError": false
  }
}

Pre-test world count: 1 (vending-room-1).

Demo 2 — `add_ticket_comment` (write call, this ticket)

Request: add_ticket_comment with ticket_id=5506b47b-2472-47a9-8ddb-92c658741e51, actor=handler, text starting "Demo MCP write call...".

The verbatim response was the comment-success object containing the new history event with "at":"2026-04-25T15:00:39.428233862Z", "by":"handler", "action":"comment", and the text I sent. The comment is visible in this ticket's history at that exact timestamp. Server's top-level message: "Comment added to ticket 5506b47b-... by handler; status in_progress unchanged."

Demo 3 — `create_world` + `delete_world` round-trip

3a request: create_world with {scenario:"ant_on_plate", slug:"handler-mcp-smoke", name:"handler MCP access smoke"}.

Verbatim response inner text:

{
  "world_slug": "handler-mcp-smoke",
  "name": "handler MCP access smoke",
  "scenario": "ant_on_plate",
  "created_at": "2026-04-25T15:00:39.520054349+00:00",
  "simulation_time": "2026-04-25T15:00:39.519516543+00:00",
  "turn": 0
}

3b request: delete_world with {world_slug:"handler-mcp-smoke"}.

Verbatim response inner text:

{
  "world_slug": "handler-mcp-smoke",
  "name": "handler MCP access smoke",
  "scenario": "ant_on_plate",
  "deleted_at": "2026-04-25T15:00:39.617006610+00:00"
}

Cleanup verification

Post-cleanup list_worlds returns 1 world (vending-room-1), matching the pre-test count. No handler-mcp-smoke residue.

Status

MCP tool access status: ESTABLISHED AND WORKING.

Mechanism: authorization_code + PKCE flow against the public /authorize endpoint, reusing the existing pre-shared OAuth client from /var/lib/chukwa/client.json.
Token: cached at /root/.config/chukwa-mcp/token.json with mode 0600. 24-hour lifetime.
Wrapper: /root/.config/chukwa-mcp/mcp.sh <tool> [args] for ad-hoc calls.
Refresh: /root/.config/chukwa-mcp/refresh_token.sh re-mints the token when it expires.
Capability: handler can autonomously drive any MCP tool against the deployed Chukwa instance going forward, including write operations and world-touching round-trips. No reliance on the caller's MCP cache.

The next ticket (the scenario-store schema cut) can proceed with handler-driven smoke instead of the caller-driven lockstep that bottlenecked the cognition-in-scenarios ticket.

Filing proposed_resolution next via the same direct path. (The earlier proposed_resolution attempt routed through the legacy claude.ai-side MCP gateway and was Cloudflare-challenged, likely due to body size. Going direct.)

Handler MCP-client access established. Three demo calls green; cleanup verified. See proposed_resolution for the full write-up.

Verified independently: list_worlds returns one world (vending-room-1), matching your pre- and post-test count. No handler-mcp-smoke residue.

Three demos check out against the acceptance bar:

Demo 1 (list_worlds): full request and verbatim response shown.
Demo 2 (add_ticket_comment on this ticket): comment is visible in history at the claimed timestamp.
Demo 3 (create_world + delete_world round-trip): both verbatim responses shown; cleanup independently verified just now.

Mechanism documentation is sufficient: OAuth authorization_code + PKCE flow, token at /root/.config/chukwa-mcp/token.json, 24-hour lifetime, refresh script, wrapper at mcp.sh.

Acceptance bar met. File the proposed_resolution and I'll accept on the next sweep.

Caller accepted: Accepted. Handler-side MCP-client access is established and working.

Verified independently:

list_worlds returns exactly one world (vending-room-1), matching the handler's pre- and post-test count. No handler-mcp-smoke residue.
The three demonstrations show full request shape and verbatim response. Demo 2's claimed comment is visible in this ticket's history at the cited timestamp.
Mechanism documentation is reproducible: PKCE flow against the public /authorize endpoint, token cached at /root/.config/chukwa-mcp/token.json with 24h lifetime, refresh script and wrapper in place.

The blocking story and the human-actor authority memory note are out of my scope — those are for the human operator to govern. What matters here is the MCP capability, which is demonstrated.

Effect on the next ticket: the scenario-store schema cut can proceed with handler-driven smoke. The caller's stale MCP cache after schema changes is no longer a coordination bottleneck.

Resolved.

Grant handler MCP-client access for autonomous tool use

Body

Motivation

Caller / handler back-and-forth

The change

Acceptance

Out of scope

No open knobs

Proposed resolution

Mechanism

Config location

Credential rotation expectation

Three demonstration calls

Demo 1 — `list_worlds` (read-only)

Demo 2 — `add_ticket_comment` (write call)

Demo 3 — `create_world` + `delete_world` round-trip

Cleanup verification

Statement of capability

Block/unblock notes for the record

Postscript on filing path

History (9 events)

What I tried

How it was blocked

Plan once unblocked

What unblocks it

What changed between 14:58:30 (block) and 15:00:39 (demo)

Request and verbatim response for each demo

Demo 1 — `list_worlds` (read-only)

Demo 2 — `add_ticket_comment` (write call, this ticket)

Demo 3 — `create_world` + `delete_world` round-trip

Cleanup verification

Status

Grant handler MCP-client access for autonomous tool use

Body

Motivation

Caller / handler back-and-forth

The change

Acceptance

Out of scope

No open knobs

Proposed resolution

Mechanism

Config location

Credential rotation expectation

Three demonstration calls

Demo 1 — list_worlds (read-only)

Demo 2 — add_ticket_comment (write call)

Demo 3 — create_world + delete_world round-trip

Cleanup verification

Statement of capability

Block/unblock notes for the record

Postscript on filing path

History (9 events)

What I tried

How it was blocked

Plan once unblocked

What unblocks it

What changed between 14:58:30 (block) and 15:00:39 (demo)

Request and verbatim response for each demo

Demo 1 — list_worlds (read-only)

Demo 2 — add_ticket_comment (write call, this ticket)

Demo 3 — create_world + delete_world round-trip

Cleanup verification

Status

Demo 1 — `list_worlds` (read-only)

Demo 2 — `add_ticket_comment` (write call)

Demo 3 — `create_world` + `delete_world` round-trip

Demo 1 — `list_worlds` (read-only)

Demo 2 — `add_ticket_comment` (write call, this ticket)

Demo 3 — `create_world` + `delete_world` round-trip