
RouteScan: A Non-Intrusive Approach to Auditing MoE LLMs Safety via Expert Routing Telemetry
RouteScan introduces a privacy-preserving safety audit method for Mixture-of-Experts LLMs by analyzing GPU-level routing telemetry rather than user inputs or model outputs. This addresses a critical tension in production deployments: safety verification without exposing sensitive data. The technique exploits the sparse activation patterns inherent to MoE architectures, creating a new class of non-intrusive monitoring that could reshape how enterprises validate model behavior in regulated environments while maintaining user confidentiality.62


























