EXPERT PARALLEL MEMORY BUG FIXED, LIGHTEVAL GAINS IMAGE SUPPORT
By RepoJournal · Filed · About Hugging Face
Transformers shipped a critical fix for distributed model loading, which overestimated how much device memory to preallocate in expert parallel runs, while lighteval patched a missing-dependency failure in task initialization.
The transformers team fixed a memory allocation bug [1] in which the caching allocator's warmup phase read the wrong tensor parallelism plan and tried to preallocate memory as if expert weights were not sharded. On multi-GPU setups running expert parallel inference, that overestimate could exceed device memory limits.
In a separate fix, lighteval addressed a ModuleNotFoundError [2]: the built-in MathVista task imported PIL without declaring Pillow as a dependency. As a result, running lighteval accelerate failed during task registry initialization for any user who did not already have Pillow installed.
The hub-docs team also shipped automated Inference Providers documentation updates [3] that reflect the latest @huggingface/tasks and @huggingface/inference package versions. Dev version bumps also landed in transformers in preparation for the next release.
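The size of the warmup overestimate described above is easy to see with arithmetic. The following is a minimal Python sketch, not the transformers implementation: `ParamSpec`, `is_expert`, `estimate_warmup_bytes`, and the parameter sizes are all hypothetical names and numbers chosen for illustration. It compares an estimate that treats expert weights as unsharded with one that divides them across the expert parallel group.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ParamSpec:
    """Hypothetical description of one parameter tensor (illustration only)."""
    name: str
    numel: int           # number of elements in the full, unsharded tensor
    bytes_per_elem: int  # e.g. 2 for bfloat16
    is_expert: bool      # assumed flag: True if the tensor holds expert weights


def estimate_warmup_bytes(params, ep_world_size, shard_experts):
    """Estimate per-device bytes to preallocate.

    With shard_experts=False, expert tensors are counted at full size, which
    mirrors the kind of overestimate the article describes. With
    shard_experts=True, each expert tensor is divided across ep_world_size
    devices (rounded up).
    """
    total = 0
    for p in params:
        nbytes = p.numel * p.bytes_per_elem
        if shard_experts and p.is_expert:
            # Ceiling division so an uneven split is never underestimated.
            nbytes = (nbytes + ep_world_size - 1) // ep_world_size
        total += nbytes
    return total


# Made-up sizes: a small dense block plus a much larger set of expert weights.
params = [
    ParamSpec("attention.weight", numel=1_000_000, bytes_per_elem=2, is_expert=False),
    ParamSpec("experts.weight", numel=8_000_000, bytes_per_elem=2, is_expert=True),
]

unsharded = estimate_warmup_bytes(params, ep_world_size=8, shard_experts=False)
sharded = estimate_warmup_bytes(params, ep_world_size=8, shard_experts=True)
print(unsharded, sharded)
```

With these made-up numbers, the unsharded estimate is 18,000,000 bytes per device against 4,000,000 once expert sharding is taken into account, a 4.5x gap. The real ratio depends on the model and the parallel layout.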
Action items
- Review distributed model loading if you run expert parallel inference, and upgrade transformers for the warmup fix (huggingface/transformers)
- Upgrade lighteval if you use task evaluation; the Pillow dependency is now declared (huggingface/lighteval)
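Before upgrading, it may help to check what is currently installed. The sketch below uses only the Python standard library; the helper names `has_module` and `installed_version` are hypothetical, and the distribution names passed to them are assumed to match the names the packages are published under.

```python
import importlib.util
from importlib import metadata


def has_module(module_name):
    """Return True if a top-level module can be found without importing it."""
    return importlib.util.find_spec(module_name) is not None


def installed_version(dist_name):
    """Return the installed version string of a distribution, or None."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


# PIL is the import name that Pillow provides.
print("PIL importable:", has_module("PIL"))
for dist in ("transformers", "lighteval", "Pillow"):
    print(dist, installed_version(dist) or "not installed")
```

If `has_module("PIL")` reports False alongside an older lighteval install, that is consistent with the ModuleNotFoundError described in [2].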
References
- [1] Fix caching allocator warmup byte estimation for EP model loading (#46149) huggingface/transformers
- [2] Add Pillow dependency huggingface/lighteval
- [3] Update Inference Providers documentation (automated) (#2498) huggingface/hub-docs
FAQ
- What changed in Hugging Face on May 25, 2026?
- Transformers shipped a critical fix for distributed model loading, which overestimated how much device memory to preallocate in expert parallel runs, while lighteval patched a missing-dependency failure in task initialization.
- What should Hugging Face teams do about it?
- Review distributed model loading if you run expert parallel inference, and upgrade transformers for the warmup fix. Upgrade lighteval if you use task evaluation; the Pillow dependency is now declared.
- Which Hugging Face repositories shipped on May 25, 2026?
- huggingface/transformers, huggingface/lighteval, huggingface/hub-docs