Advertisement ยท 728 ร— 90

Posts by Maitrey Mehta

Post image

Been less than a year since I started my lab at @utah.edu and we already have a ton of new stuff that I canโ€™t wait to talk about soon.

Iโ€™ll start today by sharing that our updated Computer Use Survey blog has been accepted to ICLR Blogposts 2026.
iclr-blogposts.github.io/2026/blog/20...

1 month ago 5 2 1 0

The post has the spirit of the many incredible chats I've had with Vivek. So happy that what was enjoyed by a few of us till now, in a way, is open for everyone to appreciate! Do give it a read.

4 months ago 1 0 0 0
Preview
Measuring Chain of Thought Faithfulness by Unlearning Reasoning Steps Martin Tutek, Fateme Hashemi Chaleshtori, Ana Marasovic, Yonatan Belinkov. Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing. 2025.

Outstanding paper (5/7):

"Measuring Chain of Thought Faithfulness by Unlearning Reasoning Steps"
by Martin Tutek, Fateme Hashemi Chaleshtori, Ana Marasovic, and Yonatan Belinkov
aclanthology.org/2025.emnlp-m...

6/n

5 months ago 11 3 1 0
Post image

1/ ๐ŸšจNEW PAPER: "BriefMe: A Legal NLP Benchmark for Assisting with Legal Briefs", accepted to ACL Findings 2025!
We introduce the first benchmark specifically designed to help LLMs assist lawyers in writing legal briefs ๐Ÿง‘โ€โš–๏ธ

๐Ÿ“„ arxiv.org/abs/2506.06619
๐Ÿ—‚๏ธ huggingface.co/datasets/jw4...

10 months ago 7 4 1 2
Preview
What Has Been Lost with Synthetic Evaluation? Large language models (LLMs) are increasingly used for data generation. However, creating evaluation benchmarks raises the bar for this emerging paradigm. Benchmarks must target specific phenomena, pe...

๐–๐ก๐š๐ญ ๐‡๐š๐ฌ ๐๐ž๐ž๐ง ๐‹๐จ๐ฌ๐ญ ๐–๐ข๐ญ๐ก ๐’๐ฒ๐ง๐ญ๐ก๐ž๐ญ๐ข๐œ ๐„๐ฏ๐š๐ฅ๐ฎ๐š๐ญ๐ข๐จ๐ง?

(arxiv.org/abs/2505.22830)

I'm happy to announce that the preprint release of my first project is online! Developed with the amazing support of @lasha.bsky.social & @anamarasovic.bsky.social

10 months ago 11 4 1 1

๐Ÿ™‹

1 year ago 1 0 0 0