Advertisement · 728 × 90

Posts by BlackboxNLP

BlackboxNLP is back once again at EMNLP'26! Very happy to be part of the team again, and excited for our new reproducibility track! Check it out ⬇️

1 month ago 14 3 0 0
BlackboxNLP 2026 The Ninth Workshop on Analyzing and Interpreting Neural Networks for NLP

🔗 Check out our website for details, updates, and important dates:
blackboxnlp.github.io/2026/

1 month ago 3 0 0 0
Post image

BlackboxNLP will be co-located with EMNLP 2026 in 🇭🇺 Budapest 🇭🇺 this October!

This edition will feature a special reproducibility track, investigating generalization and robustness of established results from interpretability research 👷‍♂️

Stay tuned for more details!

1 month ago 16 7 1 2

Nicolò & Mingyang: Can we understand which circuits emerge in small models and reasoning-tuned systems, and how do they compare with default systems? Are there methods that generalize better across all tasks?

5 months ago 0 0 0 0

Q: What's next for interpretability benchmarks? Michal: People sitting together and planning how to extend tests to multimodal, diverse contexts. @michaelwhanna.bsky.social: For circuit finding, integrating sparse features circuits could help us better understand our models.

5 months ago 0 0 1 0

Nicolò & Mingyang: Starting to explore notebooks and public libraries can be very helpful in gaining early intuitions about what's promising.

5 months ago 0 0 1 0

@michaelwhanna.bsky.social: Don't try to read everything. Find Qs you really care about, and go a level deeper to answer meaningful questions.

5 months ago 0 0 1 0

Q: How would one go about approaching interpretability research these days? Michal: "When things don't work out of the box, it's a sign to double down and find out why. Negative results are important!"

5 months ago 1 0 1 0

@danaarad.bsky.social: As deep learning research converges on similar architectures for different modalities, it will be interesting to determine which interpretability method will remain useful across various models and tasks.

5 months ago 1 0 1 0

@michaelwhanna.bsky.social, Nicolò & Mingyang: Counterfactuals in minimal settings can be helpful, but they do not capture the whole story. Extending current methods to long contexts, and finding practical applications in safety-related areas are exciting challenges ahead.

5 months ago 1 0 1 0
Advertisement

Michal: Mechanistic interpretability has heavily focused on toy tasks and text-only models. The next step is scaling to more complex tasks that involve real-world reasoning.

5 months ago 1 0 1 0
Post image

Our panel moderated by @danaarad.bsky.social
"Evaluating Interpretability Methods: Challenges and Future Directions" just started! 🎉 Come to learn more about the MIB benchmark and hear the takes of @michaelwhanna.bsky.social, Michal Golovanevsky, Nicolò Brunello and Mingyang Wang!

5 months ago 9 1 1 1
Post image

Next up: Kentaro Ozeki presenting "Normative Reasoning in Large Language Models: A Comparative Benchmark from Logical and Modal Perspectives" aclanthology.org/2025.blackbo...

5 months ago 1 0 0 0
Post image

After a productive poster session, BlackboxNLP returns with the second keynote "Memorization: Myth or Mystery?" by @vernadankers.bsky.social!

5 months ago 7 0 0 0
Post image

Nadav Shani is giving the first oral presentation of the day: Language Dominance in Multilingual Large Language Models. Find the paper here: aclanthology.org/2025.blackbo...

5 months ago 3 0 0 0
Post image

Next up: Circuit-Tracer: A New Library for Finding Feature Circuits presented by @michaelwhanna.bsky.social! Paper: aclanthology.org/2025.blackbo...

5 months ago 3 0 0 0

I'll be presenting this work at @blackboxnlp.bsky.social in Suzhou, happy to chat there or here if you are interested !

5 months ago 1 1 1 0

Nov 9, @blackboxnlp.bsky.social , 11:00-12:00 @ Hall C – Interpreting Language Models Through Concept Descriptions: A Survey (Feldhus & Kopf) @lkopf.bsky.social

🗞️ aclanthology.org/2025.blackbo...

bsky.app/profile/nfel...

5 months ago 4 2 1 1
Advertisement
Post image

Quanshi Zhang is giving the first keynote of the day: Can Neural Network Interpretability Be the Key to Breaking Through Scaling Law Limitations in Deep Learning?

5 months ago 0 0 0 0
Post image

BlackboxNLP is up and running! Here's the topics covered by this year's edition at a glance. Excited to see so many interesting topics, and the growing interest in reasoning!

5 months ago 2 0 0 1
Post image

📢 Call for Papers! 📢
#BlackboxNLP 2025 invites the submission of archival and non-archival papers on interpreting and explaining NLP models.

📅 Deadlines: Aug 15 (direct submissions), Sept 5 (ARR commitment)
🔗 More details: blackboxnlp.github.io/2025/call/

8 months ago 9 1 0 3

Writing your technical report for the MIB shared task?
Take a look at the task page for guidelines and tips!

8 months ago 2 0 0 0

The report deadline was also extended to August 10th!
Note that this is a final extension. We look forward to reading your reports! ✍️

8 months ago 2 1 0 0
Post image

Just 5 days left to submit your method to the MIB Shared Task at #BlackboxNLP!

Have last-minute questions or need help finalizing your submission?
Join the Discord server: discord.gg/n5uwjQcxPR

8 months ago 1 1 0 0
BlackboxNLP 2025 The Eight Workshop on Analyzing and Interpreting Neural Networks for NLP

Results + technical report deadline: August 8, 2025
Full task details: blackboxnlp.github.io/2025/task/

8 months ago 0 0 0 0
Post image

With the new extended deadline, there's still plenty of time to submit your method to the MIB Shared Task!

We welcome submissions of existing methods, experimental POCs, or any approach addressing circuit discovery or causal variable localization 💡

8 months ago 2 1 1 0
Post image

Results deadline extended by one week!
Following requests from participants, we’re extending the MIB Shared Task submission deadline by one week.

🗓️ New deadline: August 8, 2025
Submit your method via the MIB leaderboard!

8 months ago 3 1 0 2
Advertisement
Post image

📝 Technical report guidelines are out!

If you're submitting to the MIB Shared Task at #BlackboxNLP, feel free to take a look to help you prepare your report: blackboxnlp.github.io/2025/task/

8 months ago 3 1 0 1
Post image

Just 10 days to go until the results submission deadline for the MIB Shared Task at #BlackboxNLP!

If you're working on:
🧠 Circuit discovery
🔍 Feature attribution
🧪 Causal variable localization
now’s the time to polish and submit!

Join us on Discord: discord.gg/n5uwjQcxPR

8 months ago 3 1 0 1

Are you attending ICML? 👀

I'm sadly not, but if you are, you should check out the MIB 🕶️poster at 11AM: icml.cc/virtual/2025...

The benchmark is used as the shared task at this year's
@blackboxnlp.bsky.social (blackboxnlp.github.io/2025/task/) - there's still time to participate 🏆

9 months ago 4 1 0 0