
FED-FSTQ: Fisher-Guided Token Quantization for Communication-Efficient Federated Fine-Tuning of LLMs on Edge Devices
Federated learning on edge devices hits a hard wall when bandwidth becomes the bottleneck. Fed-FSTQ addresses this by using Fisher information to identify which token gradients matter most during LLM fine-tuning, then applies selective quantization to shrink communication payloads without losing task-critical signals. This matters because non-IID data distributions across mobile devices make uniform compression wasteful. The technique bridges parameter-efficient fine-tuning with communication efficiency, unlocking practical on-device adaptation for heterogeneous networks where stragglers and intermittent connectivity are the real constraints.62




























