Learn more: research.google/blog/small-models-big-re...
#GenerativeAI #EMNLP2025 #MachineLearning (2/2)
Will the influx of synthetic data lead to uniform #ModelCollapse across the internet?
Our recent #EMNLP2025 (Oral) paper suggests a nuanced picture: different collapse dynamics might emerge in different internet domains based on the properties of human data in those domains! 🧵
Thanks again all for a wonderful #EMNLP2025! For future reference, you can now find all accepted papers, award winners, and outstanding reviewers, area chairs, and senior area chairs on the conference website: 2025.emnlp.org/program/awar...
We're happy to have @veraneplenbroek.bsky.social at our lab this week! She presented her #EMNLP2025 work "Reading Between the Prompts: How Stereotypes Shape LLM's Implicit Personalization" and shared more of her exciting ongoing work.
#NLProc
What an inspiring week at #EMNLP2025 in Suzhou🇨🇳!
Huge thanks to the organizers and everyone who stopped by our poster/talk!
With MCP, your agent has access to tools. Great!
But that doesn't mean your agent knows how to use those tools.
Tool access ≠ Tool use ability
That's why we created MCPEval (#EMNLP2025), a new framework for evaluating agent performance on any MCP server.
ICYMI: A highlight from Julien Knafou’s PhD thesis, presented at #EMNLP2025
TransBERT, trained on the auto-translated PubMed dataset, achieves state-of-the-art results in French biomedical NLP. TransCorpus also provides scalable solutions for low-resource languages.
aclanthology.org/2025.finding...
Two weeks ago, Julien Knafou presented his work at #EMNLP2025 in Suzhou.
TransBERT is a framework for pre-training LMs using synthetically translated text, and TransCorpus is our toolkit for creating large-scale translated corpora.
Try it here
huggingface.co/jknafou/Tran...
github.com/jknafou/Tran...
Heading home tired but very happy after a fantastic #EMNLP2025 (and some well-deserved vacation 😎).
We were super busy presenting 5 papers! It was fantastic catching up with colleagues, exchanging ideas, and seeing all the amazing work in the #NLProc community!
(1/5)
EMNLP 2025. Suzhou, China.
Our computer scientists presented fascinating #NLP research at @emnlpmeeting.bsky.social last week—check out some of their work in this thread! 🧵 (1/7) #EMNLP2025
Crowd of researchers at EMNLP 2025, exchanging ideas and research results.
📸 Snapshots from last week’s EMNLP 2025 — an inspiring week of research, collaboration, and conversations!
Proud of our researchers for contributing to such a vibrant and innovative NLP community.
#EMNLP2025 #AIResearch #NLProc #HealthNLP #InformationRetrieval #Interpretability
Honored that our paper “Detecting Legal Citations in United Kingdom Court Judgments” was featured in the Senior Area Chair Highlights at #EMNLP2025!
Thanks to my co-authors Andreas Östling and @mansmag.bsky.social and to the reviewers and SACs for the recognition.
aclanthology.org/2025.emnlp-m...
🌟 Update: Honoured that our paper was selected as a Senior Area Chair Highlight Paper. This award recognises papers that the conference’s senior reviewers consider particularly influential or promising in their field. #EMNLP2025
Back after a successful #EMNLP2025 conference in Suzhou, China -- some impressions ⤵️
Our papers: www.copenlu.com/news/8-paper...
@apepa.bsky.social @rnv.bsky.social @siddesh.bsky.social @kirekara.bsky.social @shoejoe.bsky.social @zainmujahid.me @lucasresck.bsky.social @copenlu.bsky.social
#NLProc
EMNLP 2025 is over... and Milan Straka is bringing home an award! 🏆
CorPipe triumphed in the prestigious CRAC25 Shared Task, focusing on multilingual coreference resolution.
Did Milan just CRACk it? We certainly think so! 😉
🔗 Find out more at arxiv.org/abs/2509.17858
#EMNLP2025 #CorPipe #CRAC25
Finally, #EMNLP2025 recognizes the following Outstanding Senior Area Chairs:
Ashiqur R. KhudaBukhsh
Cassandra L. Jacobs
Debora Nozza
Luciana Benotti
Miryam de Lhoneux
Richard Sproat
Sachin Kumar
Usman Naseem
Wenpeng Yin
Full house at BlackboxNLP at #EMNLP2025!! Getting ready for my 1.45PM keynote 😎 Join us in A102 to learn about "Memorization: myth or mystery?"
At #EMNLP2025? Come to #WMT2025 today: Abderrahmane will be presenting DTW-Align 🌀, a method that uses Dynamic Time Warping to align speech + text embeddings during training for E2E-ST.
🧠 Boaster: 14:00–15:30
📊 Poster: 16:00–17:00
📄 Paper: aclanthology.org/2025.wmt-1.1...
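For readers new to Dynamic Time Warping, here is a minimal sketch of the classic algorithm on two toy sequences of vectors. It is plain textbook DTW with Euclidean distance, not the DTW-Align training procedure from the paper; the function name `dtw_align` and the toy "speech" and "text" sequences are made up for illustration.

```python
import math


def dtw_align(a, b):
    """Textbook DTW between two sequences of equal-dimension vectors.

    Returns (total_cost, path), where path is a list of (i, j) index pairs
    matching a[i] to b[j]. Illustrative only; not the paper's method.
    """
    n, m = len(a), len(b)
    # D[i][j] = minimal cost of aligning a[:i] with b[:j]
    D = [[math.inf] * (m + 1) for _ in range(n + 1)]
    D[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])  # Euclidean distance
            D[i][j] = d + min(D[i - 1][j], D[i][j - 1], D[i - 1][j - 1])

    # Backtrack from the end to recover one optimal warping path.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        candidates = [
            (D[i - 1][j - 1], i - 1, j - 1),  # advance both sequences
            (D[i - 1][j], i - 1, j),          # advance a only
            (D[i][j - 1], i, j - 1),          # advance b only
        ]
        _, i, j = min(candidates)
    path.reverse()
    return D[n][m], path


# Hypothetical toy data: 4 "speech" frames vs. 3 "text" tokens, 1-d each.
speech = [(0.0,), (0.0,), (1.0,), (2.0,)]
text = [(0.0,), (1.0,), (2.0,)]
cost, path = dtw_align(speech, text)
print(cost, path)
```

In this toy case the first two speech frames both map to the first text token, which is the kind of many-to-one matching that makes DTW a natural fit for sequences of different lengths.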
We are kicking off this year’s workshop in Suzhou at #EMNLP2025! Come join us in room A106-107 or online!
What an incredible EMNLP experience — truly the most fulfilling conference I’ve ever attended!
✅ Oral presentation
✅ SAC Highlights Award
✅ Panel discussion
Grateful to my amazing collaborators and to all the friends I had the chance to meet! 🌟
#EMNLP2025 #NLP
An image of the best paper slide at the EMNLP2025 conference, with the audience in the background
🎉 Congratulations to all #EMNLP2025 award winners 🎉
Starting with the ✨Best Paper award ✨:
"Infini-gram mini: Exact n-gram Search at the Internet Scale with FM-Index"
by Hao Xu, Jiacheng Liu, Yejin Choi, Noah A. Smith, and Hannaneh Hajishirzi
aclanthology.org/2025.emnlp-m...
1/n
The EU's 🇪🇺 HPLT project, coordinated by @ufal.mff.cuni.cz, is at #EMNLP2025! HPLT has supported the conference as a silver sponsor, disseminating its results from our booth and through several papers. We'll continue to shape the future of multilingual datasets and models here and in @openeurollm.bsky.social!
@ufal.mff.cuni.cz members, alumni and friends.
Excited to share our work at #EMNLP2025! Our team is presenting 12 papers across the main conference and workshops, covering multilingual NLG, LLM agents, coreference resolution, and machine translation.
A thread with highlights 🧵👇
Thrilled to see this work recognized at #EMNLP2025!
This framework and approach to measuring CoT faithfulness have been hugely influential for how I think about reasoning evaluation, and I'm so lucky to have worked with such brilliant collaborators. Huge credit to @mtutek.bsky.social
#EMNLP2025 Reminder! #NLLP2025 kicks off tomorrow at 09:00 (China time) in A202.
Check below for program and streaming info #nlproc
👇👇👇
If you were at #EMNLP2025 and you missed it, or if you are going through the published papers looking for cool stuff, have a look at our work on comparing persuasive arguments by humans vs. LLMs! aclanthology.org/2025.emnlp-m...
#EMNLP2026 will be in Budapest 🇭🇺 24-29/October/2026 (earlier than ever?) #EMNLP2025 #nlp #nlproc
🧠 🕵️ LLMs are mastering the "imitation game"... but can we still tell who wrote what?
📝 At #EMNLP2025, we present OpenTuringBench — 500K+ texts, 7 open LLMs, 7 challenging tasks, and a new method for machine-generated text detection and attribution!
📍 Today at 2 PM in Hall C
Fact decomposition for interpretable and robust NLI without the need for an LLM?
Let me tell you how!
At 2pm today, I will be presenting “Extractive Fact Decomposition for Interpretable Natural Language Inference in One Forward Pass” at #EMNLP2025!