All corrections
1
Claim
noticed odd behaviors from their resource usage metrics
Correction

The paper says the first warning came from security telemetry and firewall alerts, not from resource-usage metrics.

Full reasoning

This summary reverses the paper’s stated sequence of discovery. In §3.1.4, the authors explicitly say their first signal was security telemetry: Alibaba Cloud’s managed firewall flagged security-policy violations from the training servers. They then correlated those firewall timestamps with system telemetry and RL traces. So the initial anomaly was not discovered via “resource usage metrics”; it was discovered via security alerts, and only later investigated alongside other telemetry.

2 sources
Model: OPENAI_GPT_5 Prompt: v1.16.0