Who contributed to Hugging Face on June 8, 2026?

3 developers shipped this update, including Wauplin, davanstrien, and assafvayner.

What were the notable Hugging Face updates?

Fix: interleaved RoPE application for MLA and Support Index Cache DSA indexer skip-topk sharing for GLM5 (#46372), Triton finegrained fp8/fp4 (#46407), and Use torchvision's native LANCZOS interpolation instead of PIL fallback (#46496).

@huggingface

Transformers, Datasets, and the open AI-model layer

github ↗

Pick a date

Topics: Python AI / ML Full archive →

The Wire · Showcase

TRANSFORMERS SHIPS ROPE FIXES AND FINE-GRAINED QUANTIZATION WHILE HUB PATCHES PYTHON 3.15 BREAKAGE

By RepoJournal · Filed 22:43 UTC on June 8, 2026 · About Hugging Face

3 people shipped this

Wauplin @Wauplin 1 cited

davanstrien @davanstrien 1 cited

assafvayner @assafvayner 1 cited

Transformers landed critical correctness fixes for GLM sparse attention and new Triton-backed fp8/fp4 quantization, while huggingface_hub addressed a hard dependency break on Python 3.15.

The transformers team shipped interleaved RoPE fixes for MLA and DSA indexer caching on GLM5 [1], which accelerates sparse attention by reusing previous layers' top-k indices instead of recomputing them. Alongside that, fine-grained fp8/fp4 quantization via Triton landed [2], giving you native support for sub-tensor quantization with torch compile compatibility. Image processing got a speed bump with native torchvision LANCZOS interpolation replacing the PIL fallback [3], which matters for batch inference throughput. On the hub side, huggingface_hub fixed a critical import error on Python 3.15 where the private _MISSING_TYPE disappeared from dataclasses [4], causing immediate startup failures. The same release improves auth precedence in Colab environments [5], so user-provided tokens now take priority over Colab's vault token. CLI quiet mode now actually stays quiet [6]. Over in xet-core, russh bumped to 0.61 [7] and the team is working through an sdist release issue where LICENSE wasn't being included in the tarball [8].

Action items

→ If you support Python 3.15, update huggingface_hub immediately to avoid import failures on startup huggingface/huggingface_hub [immediate]
→ Test transformers upgrade if you use GLM5 or sparse attention workflows to validate RoPE behavior huggingface/transformers [plan]
→ Evaluate new Triton fp8/fp4 quantization for inference performance gains in your pipelines huggingface/transformers [monitor]
→ Watch xet-core 1.5.1 release for sdist fix completion huggingface/xet-core [monitor]

References

[1] Fix: interleaved RoPE application for MLA and Support Index Cache DSA indexer skip-topk sharing for GLM5 (#46372) huggingface/transformers
[2] Triton finegrained fp8/fp4 (#46407) huggingface/transformers
[3] Use torchvision's native LANCZOS interpolation instead of PIL fallback (#46496) huggingface/transformers
[4] [Fix] Remove private _MISSING_TYPE import from dataclasses module (#4322) huggingface/huggingface_hub
[5] [Auth] Take google colab token from env first ↗ huggingface/huggingface_hub
[6] [CLI] Suppress hints in quiet output mode ↗ huggingface/huggingface_hub
[7] chore: bump russh from 0.60 to 0.61 ↗ huggingface/xet-core
[8] Try to fix sdist release due to LICENSE missing from tarball root directory (#867) huggingface/xet-core

Quick answers

What shipped in Hugging Face on June 8, 2026?: Transformers landed critical correctness fixes for GLM sparse attention and new Triton-backed fp8/fp4 quantization, while huggingface_hub addressed a hard dependency break on Python 3.15. In total, 26 commits and 27 pull requests landed.
Who contributed to Hugging Face on June 8, 2026?: 3 developers shipped this update, including Wauplin, davanstrien, and assafvayner.
What were the notable Hugging Face updates?: Fix: interleaved RoPE application for MLA and Support Index Cache DSA indexer skip-topk sharing for GLM5 (#46372), Triton finegrained fp8/fp4 (#46407), and Use torchvision's native LANCZOS interpolation instead of PIL fallback (#46496).

TRANSFORMERS OVERHAULS LINEAR ATTENTION WHILE DEPRECATING LEGACY RESPONSE SCHEMA

The transformers library is retiring its fragile response_schema prototype in favor of streaming-compatible parsing, while simultaneously refactoring every linear attention model to use standardized convolution patterns.

python 70 shipped 2-min read

@huggingface 1 day ago

TRANSFORMERS SHIPS FSDP DISTRIBUTED TRAINING STACK, HUB LIBRARY PLUGS REDOS HOLE

Hugging Face landed distributed training orchestration in transformers while plugging a regex vulnerability that could stall untrusted card parsing for minutes.

+10

python 61 shipped 1-min read

@huggingface 2 days ago

TRANSFORMERS HARDENS AGAINST PYTORCH FRAGMENTATION WHILE TRL SIMPLIFIES DISTILLATION

Transformers plugged a cascading import failure that breaks downstream CI on older PyTorch versions, while TRL rips out dead code to lock DistillationTrainer into prompt-only datasets.

python 70 shipped 2-min read

@huggingface 3 days ago

DATASET VIEWER LOCKS DOWN ARROW, FUNES SHIPS GROUNDED ASK

Hugging Face security teams moved overnight to contain a critical Arrow IPC parsing vulnerability in dataset-viewer while shipping three production hardening releases across Repo2RLEnv, optimum-executorch, and funes.

python 91 shipped 2-min read

Elsewhere on the wire

AI Agents about 9 hours ago

CLAUDE OPUS 5 LANDS ACROSS THE STACK

The newest Anthropic model is now live in langchain, Cline, and llama-index, with native support for extended reasoning and 1M context windows.

ai-agents 28 shipped 1-min read

Local LLMs about 9 hours ago

OLLAMA LANDS LAGUNA SUPPORT AND CRUSHES MEMORY LEAKS WHILE SGLANG HITS V0.5.16 WITH CONFIDENCE-DRIVEN SPECULATIVE DECODING

Ollama shipped three critical performance and reliability fixes for Metal residency and concurrent access patterns, while SGL-Lang released 0.5.16 with a new speculative algorithm hitting 383.7 tok/s on DeepSeek-V4.

+11

llms 210 shipped 2-min read

@CachyOS about 9 hours ago

HYPRLAND V0.56 FIXES LAND, PACKAGE ECOSYSTEM ROLLS FORWARD

Hyprland configuration updated for v0.56 compatibility across multiple desks, while the AUR-derived ecosystem locked in four automated package bumps.

infra 85 shipped 1-min read

Elixir & Phoenix about 9 hours ago

LIVEVIEW ASYNC CLEANUP FIX SHIPS ALONGSIDE RANGE OPTIMIZATIONS

Phoenix LiveView closes a critical async task test failure while Elixir cuts unnecessary abs calls from Range operations.

elixir 19 shipped 1-min read

Want every project, not just this one?

Follow @huggingface