Who contributed to Hugging Face on May 20, 2026?

1 developer shipped this update, including danieldk.

What were the notable Hugging Face updates?

FSDP + TP & native save/load distributed (#45028), Parakeet tdt (#44171), and [Model] Add PP-OCRv6 Models Support (#45838).

@huggingface

Transformers, Datasets, and the open AI-model layer

github ↗

Pick a date

Topics: Python AI / ML Full archive →

The Wire · Showcase

FSDP DISTRIBUTED TRAINING AND PARAKEET ASR LAND IN TRANSFORMERS

By RepoJournal · Filed 06:04 UTC on May 20, 2026 · About Hugging Face

1 person shipped this

danieldk @danieldk 2 cited

Transformers shipped native FSDP and tensor parallelism support with distributed save/load in one release, while kernels team hardened security auditing and operation registration validation across all repos.

The transformers FSDP integration [1] adds fully sharded data parallel training with auto/manual mode detection, shard-on-read loading via DtensorShardOperation, and FSDP-aware flash attention checks. This is the distributed training foundation teams have been waiting for. Parakeet ASR models [2] now support Token-and-Duration Transducer decoding with per-token timestamps and full AutoModel integration, extending beyond CTC-only to match production ASR pipelines. PP-OCRv6 [3] arrives as the new standard for document recognition, adding four new model variants with updated backbones and dedicated image processors. Across kernels and kernels-community, the team locked down security workflows [4] [5] [6] and rolled out operation registration validation [7] to catch misconfigured custom ops at build time. Flash attention variable-length backward pass now works on XPU [8], removing a gap for Intel accelerator users. AsyncGRPOTrainer [9] now handles models with final logits softcapping like Gemma 2, fixing a trainer compatibility issue.

Action items

→ Test FSDP integration in staging if you run distributed training at scale - this replaces DDP for large model training huggingface/transformers [plan]
→ Pin kernels builds and run security audit workflow on your fork to verify the remediation took effect [ref:13] huggingface/kernels-community [immediate]
→ Update any custom ops to follow the naming prefix convention documented in the nix-builder hook [ref:2] huggingface/kernels [plan]
→ Upgrade to latest TRL if you're training with Gemma 2 or other softcapped models using AsyncGRPOTrainer huggingface/trl [monitor]

References

[1] FSDP + TP & native save/load distributed (#45028) huggingface/transformers
[2] Parakeet tdt (#44171) huggingface/transformers
[3] [Model] Add PP-OCRv6 Models Support (#45838) huggingface/transformers
[4] feat: mention maintainers in the slack security auditing. (#567) huggingface/kernels
[5] fix(security): remediate workflow vulnerability in .github/workflows/security-audit.yml (#884) huggingface/kernels-community
[6] feat: mention maintainers in the slack security auditing. (#881) huggingface/kernels-community
[7] nix-builder: add a hook to detect incorrect op registrations ↗ huggingface/kernels
[8] flash-attn2: Add flash_attn_varlen_func backward support for XPU ↗ huggingface/kernels-community
[9] Final logits softcapping support for async GRPO Trainer (#5691) huggingface/trl

Quick answers

What shipped in Hugging Face on May 20, 2026?: Transformers shipped native FSDP and tensor parallelism support with distributed save/load in one release, while kernels team hardened security auditing and operation registration validation across all repos. In total, 20 commits and 20 pull requests landed.
Who contributed to Hugging Face on May 20, 2026?: 1 developer shipped this update, including danieldk.
What were the notable Hugging Face updates?: FSDP + TP & native save/load distributed (#45028), Parakeet tdt (#44171), and [Model] Add PP-OCRv6 Models Support (#45838).

TRANSFORMERS OVERHAULS LINEAR ATTENTION WHILE DEPRECATING LEGACY RESPONSE SCHEMA

The transformers library is retiring its fragile response_schema prototype in favor of streaming-compatible parsing, while simultaneously refactoring every linear attention model to use standardized convolution patterns.

python 70 shipped 2-min read

@huggingface 1 day ago

TRANSFORMERS SHIPS FSDP DISTRIBUTED TRAINING STACK, HUB LIBRARY PLUGS REDOS HOLE

Hugging Face landed distributed training orchestration in transformers while plugging a regex vulnerability that could stall untrusted card parsing for minutes.

+10

python 61 shipped 1-min read

@huggingface 2 days ago

TRANSFORMERS HARDENS AGAINST PYTORCH FRAGMENTATION WHILE TRL SIMPLIFIES DISTILLATION

Transformers plugged a cascading import failure that breaks downstream CI on older PyTorch versions, while TRL rips out dead code to lock DistillationTrainer into prompt-only datasets.

python 70 shipped 2-min read

@huggingface 3 days ago

DATASET VIEWER LOCKS DOWN ARROW, FUNES SHIPS GROUNDED ASK

Hugging Face security teams moved overnight to contain a critical Arrow IPC parsing vulnerability in dataset-viewer while shipping three production hardening releases across Repo2RLEnv, optimum-executorch, and funes.

python 91 shipped 2-min read

Elsewhere on the wire

AI Agents about 9 hours ago

CLAUDE OPUS 5 LANDS ACROSS THE STACK

The newest Anthropic model is now live in langchain, Cline, and llama-index, with native support for extended reasoning and 1M context windows.

ai-agents 28 shipped 1-min read

Local LLMs about 9 hours ago

OLLAMA LANDS LAGUNA SUPPORT AND CRUSHES MEMORY LEAKS WHILE SGLANG HITS V0.5.16 WITH CONFIDENCE-DRIVEN SPECULATIVE DECODING

Ollama shipped three critical performance and reliability fixes for Metal residency and concurrent access patterns, while SGL-Lang released 0.5.16 with a new speculative algorithm hitting 383.7 tok/s on DeepSeek-V4.