Who contributed to Hugging Face on May 26, 2026?

4 developers shipped this update, including aazizyan, rycerzes, sayakpaul, and sywangyi.

What were the notable Hugging Face updates?

v1.5.0, Add Qwen3.5 Think/NoThink training chat templates with generation markers, and Fix `OpenRewardSpec` omitting task‑scoped tools during rollout binding (fixes #5727).

@huggingface

Transformers, Datasets, and the open AI-model layer

TRL 1.5.0 SHIPS QWEN TEMPLATES AND FIXES OPENREWARD TOOL BINDING

By RepoJournal · Filed 06:03 UTC on May 26, 2026 · About Hugging Face

4 people shipped this

aazizyan @aazizyan 2 cited

rycerzes @rycerzes 1 cited

sayakpaul @sayakpaul 1 cited

sywangyi @sywangyi 1 cited

TRL's latest release adds training-ready chat templates for three model families while fixing a critical bug where task-scoped tools were silently omitted during rollout binding.

TRL v1.5.0 [1] is the headline: Phi-3.5, Qwen3-VL, and Qwen3.5 Think/NoThink now have training chat templates with generation markers, so assistant_only_loss=True now works out of the box across these model families [1]. The Qwen3.5 Think/NoThink templates [2] follow the approach already used for Qwen3, wrapping assistant output in generation markers and preserving thinking blocks. Separately, a critical fix to OpenRewardSpec [3] makes it discover and bind task-scoped tools during rollout binding; previously those tools were silently omitted and only shared tools were wired up.

On the stability front, diffusers has fixed a determinism problem in ZImageTransformer2DModel [5] by initializing pad tokens with torch.zeros() instead of torch.empty(), eliminating NaNs that could surface in layerwise casting tests. The diffusers team is also documenting torch.empty footguns [4] to prevent similar issues downstream.

A fourth template set, for Qwen2.5-VL [6], is in flight in TRL, with both the original and training chat templates ready to land.
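For readers unfamiliar with why generation markers matter: they let the tokenizer report which tokens belong to assistant turns, and an assistant-only loss then averages over those tokens alone. The following is a minimal conceptual sketch in plain Python, not TRL's implementation; the function name `assistant_only_mean_loss` and the sample numbers are hypothetical and chosen only for illustration.

```python
from typing import Sequence


def assistant_only_mean_loss(
    token_losses: Sequence[float],
    assistant_mask: Sequence[int],
) -> float:
    """Average per-token losses over assistant tokens only (mask == 1)."""
    if len(token_losses) != len(assistant_mask):
        raise ValueError("token_losses and assistant_mask must have equal length")
    # Keep only the losses at positions the mask marks as assistant output.
    kept = [loss for loss, keep in zip(token_losses, assistant_mask) if keep]
    if not kept:
        raise ValueError("mask selects no assistant tokens")
    return sum(kept) / len(kept)


# Hypothetical six-token sequence: three prompt tokens, then three assistant tokens.
token_losses = [2.0, 4.0, 6.0, 1.0, 2.0, 3.0]
assistant_mask = [0, 0, 0, 1, 1, 1]

masked = assistant_only_mean_loss(token_losses, assistant_mask)  # assistant tokens only
unmasked = sum(token_losses) / len(token_losses)                 # every token counts
```

Without markers in the chat template there is no reliable way to build `assistant_mask`, which is presumably why the new training templates are what make assistant_only_loss usable for these families.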

Action items

→ Upgrade TRL to v1.5.0 if you're training Phi-3.5, Qwen3-VL, or Qwen3.5 variants with assistant_only_loss huggingface/trl [plan]
→ Test OpenRewardSpec bindings if you're using rollout integration, and verify that task-scoped tools are now discoverable huggingface/trl [monitor]
→ Update diffusers if you're using ZImage or other vision transformers to stabilize determinism huggingface/diffusers [plan]
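
The diffusers item above comes down to a general PyTorch property: torch.empty() allocates memory without initializing it, while torch.zeros() always yields the same finite values. A minimal sketch of the difference follows; the tensor shape and variable names are hypothetical and are not taken from the ZImageTransformer2DModel code.

```python
import torch

hidden_dim = 8  # hypothetical size, chosen only for illustration

# torch.empty allocates memory without initializing it, so the contents are
# arbitrary: they can differ between runs and may include NaN or inf.
pad_uninitialized = torch.empty(1, hidden_dim)

# torch.zeros always produces the same, finite contents.
pad_deterministic = torch.zeros(1, hidden_dim)

is_all_zero = bool((pad_deterministic == 0).all())
is_finite = bool(torch.isfinite(pad_deterministic).all())
```

If a tensor created with torch.empty() is read before being overwritten, any NaN it happens to contain can propagate through later arithmetic, which is consistent with the intermittent test failures the fix targets.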

References

[1] v1.5.0 ↗ huggingface/trl
[2] Add Qwen3.5 Think/NoThink training chat templates with generation markers ↗ huggingface/trl
[3] Fix `OpenRewardSpec` omitting task‑scoped tools during rollout binding (fixes #5727) ↗ huggingface/trl
[4] note: torch.zeros -> torch.empty ↗ huggingface/diffusers
[5] Initialize ZImage pad tokens deterministically ↗ huggingface/diffusers
[6] Add Qwen2.5-VL original and training chat template with generation markers ↗ huggingface/trl

Quick answers

What shipped in Hugging Face on May 26, 2026?: TRL's latest release adds training-ready chat templates for three model families while fixing a critical bug where task-scoped tools were silently omitted during rollout binding. In total, 8 commits, 8 pull requests, and 1 release landed.
