Advertisement · 728 × 90

Posts by Allen Nie

Check out Tianwei’s latest work on using unlikelihood objective to distill search traces back to base model to boost reasoning capabilities of LLMs!

11 months ago 2 0 0 0

For all the RL PhDs and people interested in Planning and MDPs, there's a summer internship opportunity at AWS Science that specializes in LLM post-training, RLHF, LLM agents, and benchmarks like WebArena. Interested students can send their CV to fakoor@amazon.com

1 year ago 3 0 0 0

For education and psychometrics people, this dataset is very useful!

1 year ago 1 0 0 0

I credit Omar @lateinteraction.bsky.social for this beautiful summary of the difference 🤣

1 year ago 1 0 0 0

Hi Tim — Trace can optimize the control flow, whereas DSPy optimizes the modules in a fixed control flow (for now) 🙂 I would use DSPy for a supervised learning setup and Trace for an RL-like task (when there’s a clear definition of reward and feedback).

1 year ago 1 0 1 0

Trace performs inference-time optimization — not directly updating weights of the underlying neural network. It updates the agentic workflow (python functions, prompts to LLMs and etc)

1 year ago 0 0 1 0
Post image

People say Ching-an and I are indistinguishable…is that true 🤣

1 year ago 1 0 0 0
Advertisement
Post image Post image Post image Post image

Come check us out near the Tesla Booth in West Exhibition Hall A 3-5pm! Come and claim your mug 🤣 we have an identity crisis — people keep thinking we are from IBM for some reason…

1 year ago 3 0 0 0

We are happy to give a talk or have a 1:1 chat if you are interested in learning what Trace is and/or how to use it! Trace has already been presented at the UW Robotics Colloquium and ServiceNow. #foundermode for Open-Source Software! Time to build 🔧 and ship 🚀!

1 year ago 1 0 0 0
Preview
GitHub - microsoft/Trace: End-to-end Generative Optimization for AI Agents End-to-end Generative Optimization for AI Agents. Contribute to microsoft/Trace development by creating an account on GitHub.

This open-source project is a joint effort with
@chinganc_rl
and Adith, the MSR RL group. We are presenting Trace at the NeurIPS Expo Demo this afternoon 3pm-5pm PT. We have MUGs, T-SHIRTs, and STICKERs!

🌐 microsoft.github.io/Trace/
👨‍💻 github.com/microsoft/Tr...

1 year ago 0 0 1 0
Post image

Once you build an agent with Trace, you can use ANY LLM optimizer you want. With the release of Trace 0.1.3, we introduce TextGrad (github.com/microsoft/Tr...) as an optimizer for the RL agent, along with OPRO and OptoPrime.

1 year ago 0 0 1 0
Post image

What enables Trace to be an RL-style agentic library? We use **Generative Optimization** techniques (LLM as an optimizer) to derive an analog to RL's policy gradient algorithm. The agent makes a move, receives feedback/reward, and updates its parameters.

1 year ago 0 0 1 0
Post image

In Trace, you define an Agent with declarative Python functions using Trace primitives. Trace provides flexible ways to mark what you want to change -- for example, we mark two prompts and two functions below as trainable.

1 year ago 0 0 1 0

True RL agents learn online -- continuously changing themselves to improve upon the feedback (reward) from a user or an environment. Why haven't people done this in the LLM "Agentic" libraries? We wondered the same and developed Trace -- a true *RL-style* agentic framework.

1 year ago 0 0 1 0
Preview
Trace Overview This is "Trace Overview" by Allen Nie on Vimeo, the home for high quality videos and the people who love them.

Unveiling Trace v0.1.3 at NeurIPS 2024, a library for building an RL-style AI Agent that learns from the environment and human feedback. Today's LLM Agent libraries are not RL agents. They specify a workflow, and it remains unchanged regardless of user feedback. #NotRL vimeo.com/1036224270

1 year ago 4 0 2 0

An honor to have you here!! Welcome 🙏🙏

1 year ago 1 0 0 0
Advertisement
Preview
Anytime Acceleration of Gradient Descent This work investigates stepsize-based acceleration of gradient descent with {\em anytime} convergence guarantees. For smooth (non-strongly) convex optimization, we propose a stepsize schedule that all...

arxiv.org/abs/2411.17668 Our postdoc zihan slays another COLT open problem! proceedings.mlr.press/v247/kornows...

1 year ago 68 11 1 3

For people who like RL theory, this is a must follow!

1 year ago 2 0 0 0

📌

1 year ago 0 0 0 0

Can I get added? Not NLP but still working with LLMs on the RL side.

1 year ago 1 0 0 0

Hello...world?

Trying to reconstruct my academic networks over here :) Follow me if we know each other or if you're interested in machine learning for healthcare/social equity! Please retweet, or resky, or whatever they call it over here.

1 year ago 73 9 3 0

📌

1 year ago 0 0 0 0

Totally — it’s a great list 😊

1 year ago 1 0 0 0

Here is a list of ML OSS & Open Source / Science enthusiasts I found on Bluesky 🦋

go.bsky.app/8MFcfXd

Let me know if you find such people here!

I'm still new here and probably the list misses many must-add people, so let's built it together💪

1 year ago 112 49 40 4
Preview
GitHub - microsoft/Trace: End-to-end Generative Optimization for AI Agents End-to-end Generative Optimization for AI Agents. Contribute to microsoft/Trace development by creating an account on GitHub.

Hi, I’m one of the main maintainers of Trace: github.com/microsoft/Tr... and will use this platform to promote it and engage with the OSS community 🫡

1 year ago 1 0 1 0
Advertisement

This is kinda cool honestly

1 year ago 0 0 1 0

I see…well…hope they’ll include it soon 😕

1 year ago 1 0 0 0

How to save/bookmark posts on 🦋?

1 year ago 1 0 4 0

Filled out so fast 😫 but I saw some friends who made to the list — happy for them instead 🥳

1 year ago 0 0 0 0

I wanted to contribute to "Starter Pack Season" with one for Stanford NLP+HCI: go.bsky.app/VZBhuJ5

Here are some other great starter packs:

- CSS: go.bsky.app/GoEyD7d + go.bsky.app/CYmRvcK
- NLP: go.bsky.app/SngwGeS + go.bsky.app/JgneRQk
- HCI: go.bsky.app/p3TLwt
- Women in AI: go.bsky.app/LaGDpqg

1 year ago 25 10 2 2