
Posts by Albert Villanova del Moral

Preview
Release v0.29.0 · huggingface/trl Features Add environment_factory to GRPOTrainer GRPOTrainer now accepts an environment_factory argument, allowing users to specify a custom environment class for training. This enables more flexibl...

If you’re building AI agents, RLHF pipelines, or training infra: give trl-training a spin in v0.29.0 and let us know what you think. We are working on improving it! 🤗

The era of agent-native ML tooling is here 🚀

🔗 github.com/huggingface/...

1 month ago 1 1 0 0

Under the hood:
- A dedicated trl-training skill definition
- CLI-first design
- Clean integration with existing TRL commands
- Structured documentation so LLM agents can reason over it

1 month ago 0 0 1 0
Preview
Overview - Agent Skills A simple, open format for giving agents new capabilities and expertise.

What does this mean?

Instead of “figuring out” how to train with TRL…

Agents now have a clearly defined interface to:
✅ Run SFT
✅ Run DPO
✅ Run GRPO
✅ Launch training jobs
✅ Work with the TRL CLI safely

All through a structured SKILL.md.
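
As a rough illustration of what driving the TRL CLI from an agent could look like, the sketch below only builds the argument list for a `trl sft`, `trl dpo`, or `trl grpo` invocation and does not run it. The helper name `build_trl_command` and the model, dataset, and output values are placeholders chosen for this example, not part of the trl-training skill; real runs may need additional method-specific flags.

```python
import shlex

# Subcommands named in the post; the TRL CLI may offer others.
SUPPORTED_METHODS = ("sft", "dpo", "grpo")


def build_trl_command(method, model, dataset, output_dir):
    """Build (but do not run) an argument list for the TRL CLI.

    Hypothetical helper for illustration only.
    """
    if method not in SUPPORTED_METHODS:
        raise ValueError(f"unsupported method: {method!r}")
    return [
        "trl", method,
        "--model_name_or_path", model,
        "--dataset_name", dataset,
        "--output_dir", output_dir,
    ]


# Placeholder model and dataset identifiers.
cmd = build_trl_command("sft", "Qwen/Qwen2.5-0.5B", "trl-lib/Capybara", "sft-output")
print(shlex.join(cmd))
```

Restricting the subcommand to a fixed allow-list is one simple way to keep an agent's CLI use predictable; the resulting list could be passed to `subprocess.run` once reviewed.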

1 month ago 0 0 1 0

🚀 Big news for TRL users with AI agents:

We just released the trl-training SKILL in TRL v0.29.0 🤗

Agents can now use the TRL CLI as a structured capability:

1 month ago 2 2 1 0

5 years already working on democratizing AI 🤗
Grateful to be part of such an awesome team making it happen every day.

2 months ago 2 0 0 0

Latest smolagents release supports GPT-5: build agents that think, plan, and act.
⚡ Upgrade now and put GPT-5 to work!

8 months ago 1 0 0 0
Preview
Release v1.21.0 · huggingface/smolagents Enhancements ✨ Tool & Model Improvements Add Tool to_code_prompt and to_tool_calling_prompt by @albertvillanova in #1599 Improve function body extraction in tools by @HairlessVillager in #1627 S...

🚀 smolagents v1.21.0 is here!
Now with improved safety in the local Python executor: dunder calls are blocked!
⚠️ Still not fully isolated: for untrusted code, use a remote executor instead (Docker, E2B, Wasm).
✨ Many bug fixes: more reliable code.
👉 github.com/huggingface/...
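
To give a feel for what blocking dunder access can mean, here is a minimal sketch that statically scans code with Python's `ast` module and rejects any attribute or name of the form `__x__`. This is not the smolagents implementation; the function names `find_dunder_use` and `check_code` are invented for this example.

```python
import ast


def find_dunder_use(source):
    """Return the dunder names referenced in `source` (attribute or bare name)."""
    found = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Attribute):
            name = node.attr
        elif isinstance(node, ast.Name):
            name = node.id
        else:
            continue
        if name.startswith("__") and name.endswith("__"):
            found.append(name)
    return found


def check_code(source):
    """Raise if the code touches any dunder name; otherwise return True."""
    dunders = find_dunder_use(source)
    if dunders:
        raise PermissionError(f"dunder access is not allowed: {sorted(set(dunders))}")
    return True


print(check_code("total = sum([1, 2, 3])"))
try:
    check_code("().__class__.__bases__[0].__subclasses__()")
except PermissionError as err:
    print(err)
```

A static check like this misses indirect access such as `getattr(obj, "__class__")`, which is one reason a remote executor remains the safer choice for untrusted code.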

8 months ago 4 1 1 0
Preview
Release v1.20.0 · huggingface/smolagents New Features ✨ Remote Execution: Implement remote Python WasmExecutor by @albertvillanova in #1261 Callbacks: Support callbacks after planning step by @albertvillanova in #1445 Support multi...

🔥 This feature is live in smolagents v1.20.0!

Try it out.
Break things. Extend it. Give us feedback.

Let's build safer, smarter agents; together 🧠⚙️

👉 github.com/huggingface/...

#smolagents #opensource #AIagents #WebAssembly #Deno #Pyodide #HuggingFace #AgenticAI

9 months ago 2 1 0 0
Preview
Implement remote Python WebAssemblyExecutor by albertvillanova · Pull Request #1261 · huggingface/smolagents Implement remote Python WebAssemblyExecutor.

Check out the PR 👉 github.com/huggingface/...

Let's reimagine what agent-driven code execution can look like: remote, safe, and Wasm-powered.

9 months ago 0 0 1 0

👷 Want to help?

This is a foundation we're inviting the community to build on:
• Add feature support
• Expand compatibility to new use cases
• Help make agents even more secure + capable

Contribute, test, break things, and PR back 🚀

9 months ago 0 0 1 0

🙅‍♂️ Why no network calls (for now)?

Because Pyodide in Node.js needs experimental flags for stack switching, and some APIs (like urllib3) aren’t yet compatible out of the box.

But this is just the beginning.

9 months ago 0 0 1 0

🚧 What’s supported right now?

✅ Basic Python logic
✅ Standard control flow
✅ Limited stdlib support
❌ No HTTP requests (yet)
❌ No advanced I/O or external packages

It's early: a solid MVP for secure use cases. 💡

9 months ago 0 0 1 0

Why WebAssembly?

WebAssembly = portable, fast, and secure.
It enables:
• Safe remote execution of Python code
• Zero access to the host machine
• Consistent, deterministic environments
• Serverless + edge-compatible agent workflows

9 months ago 0 0 1 0

💥 Big milestone just dropped in smolagents:

The CodeAgent can now execute Python code remotely in WebAssembly, powered by Pyodide + Deno!

Your agent-generated Python runs in a secure, sandboxed Wasm runtime: no local execution required.

Here's why this changes the game 🧵👇

9 months ago 0 0 1 0
Preview
Release v1.19.0 · huggingface/smolagents Enhancements 🛠️ Agent Upgrades: Support managed agents in ToolCallingAgent by @albertvillanova in #1456 Support context managers for agent cleanup by @tobiasofsn in #1422 Change code tags to xml...

7/
🧵 That’s a wrap for smolagents v1.19.0!

Grab the latest, test the improvements, and tell us what you build.
👇👇👇
github.com/huggingface/...
#smolagents #AIagents #OSS #HuggingFace

9 months ago 0 0 0 0

6/
📚 Docs Game: Leveled Up

* ✨ Quickstart got tool usage examples
* 💡 Added “Open in Colab” buttons!
* 🧠 Reference docs now include AgentMemory and better docstrings

9 months ago 0 0 1 0

5/
🐛 Bug Fixes Blitz
From final answer logic to Docker logs, this release patched up:

* Planning logic bugs 🧠
* Type-safe wrappers 🛡️
* Executor output matching 🧵
* Prompt fixes for additional_args 📝

9 months ago 0 0 1 0

4/
📋 Agent Output Tracking

* CodeAgent outputs are now stored in ActionStep
📂 More structured traceability of actions & results!

9 months ago 0 0 1 0

3/
📡 Streaming Just Got Smarter

* Streaming event aggregation was moved off the Model class
➕ Cleaner structure, better separation of concerns

9 months ago 0 0 1 0

2/
🎛️ UI Improvements

* New reset_agent_memory feature in GradioUI 💾
- Handy for debugging or starting fresh mid-run!

9 months ago 0 0 1 0

1/
🧠 Agent Upgrades

* You can now use managed agents with ToolCallingAgent 🔧
* Agent cleanup supports context managers: easier resource handling!
* Switched code output tags to XML for clarity
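
For readers unfamiliar with the pattern, the sketch below shows how context-manager cleanup works in plain Python. `ManagedResourceAgent` is a hypothetical stand-in written for this example, not a smolagents class, and its `cleanup` method only flips a flag where a real agent would release executors or connections.

```python
class ManagedResourceAgent:
    """Hypothetical stand-in for an agent that owns resources needing cleanup."""

    def __init__(self):
        self.closed = False

    def run(self, task):
        if self.closed:
            raise RuntimeError("agent already cleaned up")
        return f"done: {task}"

    def cleanup(self):
        # A real agent would release executors, connections, temp files, etc.
        self.closed = True

    def __enter__(self):
        return self

    def __exit__(self, exc_type, exc, tb):
        # Runs on normal exit and when an exception escapes the block.
        self.cleanup()
        return False  # do not swallow exceptions


with ManagedResourceAgent() as agent:
    result = agent.run("summarize the release notes")

print(result, agent.closed)
```

The benefit is that cleanup happens even if `run` raises, without the caller having to remember a `try`/`finally`.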

9 months ago 0 0 1 0

🚀 New release: smolagents v1.19.0 is out!

This one's packed with agent upgrades, smoother streaming, better docs, and bug squashing galore. 🧵👇

9 months ago 0 0 1 0
Preview
Release v1.17.0 · huggingface/smolagents New Features ✨ Structured Generation in CodeAgent: Add optional support for structured outputs in CodeAgent, enabling more reliable and consistent generation patterns Adding optional structured g...

New in smolagents v1.17.0:
- Structured generation in CodeAgent 🧱
- Streamable HTTP MCP support 🌐
- Agent.run() returns rich RunResult 📦

Smarter agents, smoother workflows.
Try it now: github.com/huggingface/...

10 months ago 1 0 0 0
Preview
TinyAgents: A Minimal Experiment with Code Agents and MCP Tools A Blog post by Albert Villanova del Moral on Hugging Face

Read the blog post 👉 huggingface.co/blog/albertv...

11 months ago 0 0 0 0

🚀 Experiment with Tiny Agents: minimalist LLM agents powered by async MCP tools, inspired by @julien_c

🔍 A side-by-side comparison of tool-calling vs. code agent

💡 See how code agents handle complex workflows in one step, reducing model calls and boosting efficiency!
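
A toy comparison can make the "fewer model calls" point concrete. The sketch below does not call any model or agent library: it assumes, for illustration, that a tool-calling agent spends one model turn per tool call plus one for the final answer, while a code agent emits a single snippet that chains the tools. The `get_weather` tool and its canned data are invented for this example.

```python
def get_weather(city):
    # Stand-in tool with canned data (hypothetical values).
    return {"Paris": 18, "Madrid": 27}[city]


CITIES = ("Paris", "Madrid")

# Tool-calling style: one model turn per tool call, then a final-answer turn.
observations = {}
tool_calling_turns = 0
for city in CITIES:
    observations[city] = get_weather(city)
    tool_calling_turns += 1
tool_calling_answer = max(observations, key=observations.get)
tool_calling_turns += 1  # turn that produces the final answer

# Code-agent style: a single model turn emits code that chains every tool call.
code_action = """
temps = {city: get_weather(city) for city in ("Paris", "Madrid")}
answer = max(temps, key=temps.get)
"""
namespace = {"get_weather": get_weather}
exec(code_action, namespace)
code_agent_turns = 1

print(tool_calling_answer, tool_calling_turns)
print(namespace["answer"], code_agent_turns)
```

Both styles reach the same answer; under these assumptions the difference is only in how many model turns are needed, and the gap grows with the number of chained tool calls.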

11 months ago 0 0 1 0
GitHub - huggingface/smolagents: 🤗 smolagents: a barebones library for agents that think in Python code.

New in smolagents v1.16.0:
🔍 Bing support in WebSearchTool
🐍 Custom functions & executor_kwargs in LocalPythonExecutor
🔧 Streaming GradioUI fixes
🌐 Local web agents via api_base & api_key
📚 Better docs

👉 github.com/huggingface/smolagents

11 months ago 1 0 0 0
Preview
Building Code Agents with Hugging Face smolagents Build agents that write and execute code to perform complex tasks, using Hugging Face’s smolagents.

Code Agents generate full code blocks and execute them in one go, instead of calling tools one-by-one. Learn how to build smarter, faster agents in a new @deeplearningai.bsky.social course by @hf.co.

Build with smolagents → sandboxed, reliable, powerful.

🔗 www.deeplearning.ai/short-course...

11 months ago 1 0 0 0
Preview
Release v1.14.0 · huggingface/smolagents New features 🚀 SmolAgents MCP Client by @grll in #1200 Introduce a new MCPClient class that allows users to manage connections to one or more MCP servers, making integration with SmolAgents more ...

Full release 👉 github.com/huggingface/...
#smolagents #LLM #AgenticAI

11 months ago 1 0 0 0

smolagents v1.14.0 is out! 🚀
🔌 MCPClient: A sleek new client for connecting to remote MCP servers, making integrations more flexible and scalable.
🪨 Amazon Bedrock: Native support for Bedrock-hosted models.
SmolAgents is now more powerful, flexible, and enterprise-ready. 💼

11 months ago 1 1 1 0

International Women's Day (IWD) 2025: www.internationalwomensday.com

1 year ago 0 0 0 0