Maitrey Mehta (@my-tray) Bsky

Been less than a year since I started my lab at @utah.edu and we already have a ton of new stuff that I can’t wait to talk about soon.

I’ll start today by sharing that our updated Computer Use Survey blog has been accepted to ICLR Blogposts 2026.
iclr-blogposts.github.io/2026/blog/20...

1 month ago

The post has the spirit of the many incredible chats I've had with Vivek. So happy that what only a few of us got to enjoy until now is, in a way, open for everyone to appreciate! Do give it a read.

4 months ago

Measuring Chain of Thought Faithfulness by Unlearning Reasoning Steps Martin Tutek, Fateme Hashemi Chaleshtori, Ana Marasovic, Yonatan Belinkov. Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing. 2025.

Outstanding paper (5/7):

"Measuring Chain of Thought Faithfulness by Unlearning Reasoning Steps"
by Martin Tutek, Fateme Hashemi Chaleshtori, Ana Marasovic, and Yonatan Belinkov
aclanthology.org/2025.emnlp-m...

6/n

5 months ago

1/ 🚨NEW PAPER: "BriefMe: A Legal NLP Benchmark for Assisting with Legal Briefs", accepted to ACL Findings 2025!
We introduce the first benchmark specifically designed to help LLMs assist lawyers in writing legal briefs 🧑‍⚖️

📄 arxiv.org/abs/2506.06619
🗂️ huggingface.co/datasets/jw4...

10 months ago

What Has Been Lost with Synthetic Evaluation? Large language models (LLMs) are increasingly used for data generation. However, creating evaluation benchmarks raises the bar for this emerging paradigm. Benchmarks must target specific phenomena, pe...

𝐖𝐡𝐚𝐭 𝐇𝐚𝐬 𝐁𝐞𝐞𝐧 𝐋𝐨𝐬𝐭 𝐖𝐢𝐭𝐡 𝐒𝐲𝐧𝐭𝐡𝐞𝐭𝐢𝐜 𝐄𝐯𝐚𝐥𝐮𝐚𝐭𝐢𝐨𝐧?

(arxiv.org/abs/2505.22830)

I'm happy to announce that the preprint release of my first project is online! Developed with the amazing support of @lasha.bsky.social & @anamarasovic.bsky.social

10 months ago

🙋

1 year ago
