Who contributed to Hugging Face on May 18, 2026?

3 developers shipped this update, including remi-or, stevhliu, and HuggingFaceInfra.

What were the notable Hugging Face updates?

[CB] [Major] Add tensor paralellism, [docs] adding audio/video processors, and use opensearch for traces storage.

@huggingface

Transformers, Datasets, and the open AI-model layer

github ↗

Pick a date

Topics: Python AI / ML Full archive →

The Wire · Showcase

TENSOR PARALLELISM LANDS IN CONTINUOUS BATCHING, AUDIO DOCS FOLLOW

By RepoJournal · Filed 06:04 UTC on May 18, 2026 · About Hugging Face

3 people shipped this

remi-or @remi-or 1 cited

stevhliu @stevhliu 1 cited

HuggingFaceInfra @HuggingFaceInfra 1 cited

Transformers shipped tensor parallelism support for continuous batching, unlocking multi-GPU generation scaling you've been waiting for.

The major merge [1] adds full TP support to continuous batching with the infrastructure to back it: inter-process communication for request states, per-TP group seeding, NCCL graph safeguards, and a reproducible hash function that avoids Python's salted hash. This is production-grade work, not a sketch. Simultaneously, the docs team shipped dedicated guidance on adding audio and video processors [2], closing a gap that's been live in the code for months. On the CI front, transformers-ci completed a shift to OpenSearch for trace storage [3] and overhauled the UI to reduce confusion between traces and run IDs [4]. The hub-docs team automated their inference provider docs generation [5], keeping the JavaScript packages and generated docs in lockstep without manual intervention. Housekeeping landed too: dead environment variables got stripped from CircleCI configs [6], and the DeepSeek V4 MoE converter got a fix for substring-matching FP8 scale companions [7].

Action items

→ Review tensor parallelism PR and test against your multi-GPU generation workloads huggingface/transformers [plan]
→ Reference new audio/video processor docs if you're adding custom processors huggingface/transformers [monitor]
→ Check CI trace links in your dashboards post-OpenSearch migration huggingface/transformers-ci [monitor]

References

[1] [CB] [Major] Add tensor paralellism ↗ huggingface/transformers
[2] [docs] adding audio/video processors ↗ huggingface/transformers
[3] use opensearch for traces storage huggingface/transformers-ci
[4] renamed to trace to avoid confusion with run id huggingface/transformers-ci
[5] [Bot] Update Inference Providers documentation ↗ huggingface/hub-docs
[6] chore(ci): remove dead env vars from circleci-failure-summary-comment.yml (#45972) huggingface/transformers
[7] [DeepSeek V4] Fix MoE converter substring-matching FP8 scale companions (#45930) huggingface/transformers

Quick answers

What shipped in Hugging Face on May 18, 2026?: Transformers shipped tensor parallelism support for continuous batching, unlocking multi-GPU generation scaling you've been waiting for. In total, 11 commits and 6 pull requests landed.
Who contributed to Hugging Face on May 18, 2026?: 3 developers shipped this update, including remi-or, stevhliu, and HuggingFaceInfra.
What were the notable Hugging Face updates?: [CB] [Major] Add tensor paralellism, [docs] adding audio/video processors, and use opensearch for traces storage.

TRANSFORMERS OVERHAULS LINEAR ATTENTION WHILE DEPRECATING LEGACY RESPONSE SCHEMA

The transformers library is retiring its fragile response_schema prototype in favor of streaming-compatible parsing, while simultaneously refactoring every linear attention model to use standardized convolution patterns.

python 70 shipped 2-min read

@huggingface 1 day ago

TRANSFORMERS SHIPS FSDP DISTRIBUTED TRAINING STACK, HUB LIBRARY PLUGS REDOS HOLE

Hugging Face landed distributed training orchestration in transformers while plugging a regex vulnerability that could stall untrusted card parsing for minutes.

+10

python 61 shipped 1-min read

@huggingface 2 days ago

TRANSFORMERS HARDENS AGAINST PYTORCH FRAGMENTATION WHILE TRL SIMPLIFIES DISTILLATION

Transformers plugged a cascading import failure that breaks downstream CI on older PyTorch versions, while TRL rips out dead code to lock DistillationTrainer into prompt-only datasets.

python 70 shipped 2-min read

@huggingface 3 days ago

DATASET VIEWER LOCKS DOWN ARROW, FUNES SHIPS GROUNDED ASK

Hugging Face security teams moved overnight to contain a critical Arrow IPC parsing vulnerability in dataset-viewer while shipping three production hardening releases across Repo2RLEnv, optimum-executorch, and funes.

python 91 shipped 2-min read

Elsewhere on the wire

AI Agents about 10 hours ago

CLAUDE OPUS 5 LANDS ACROSS THE STACK

The newest Anthropic model is now live in langchain, Cline, and llama-index, with native support for extended reasoning and 1M context windows.

ai-agents 28 shipped 1-min read

Local LLMs about 10 hours ago

OLLAMA LANDS LAGUNA SUPPORT AND CRUSHES MEMORY LEAKS WHILE SGLANG HITS V0.5.16 WITH CONFIDENCE-DRIVEN SPECULATIVE DECODING

Ollama shipped three critical performance and reliability fixes for Metal residency and concurrent access patterns, while SGL-Lang released 0.5.16 with a new speculative algorithm hitting 383.7 tok/s on DeepSeek-V4.