#NLLP
Do LLMs Truly Understand When a Precedent Is Overruled?
Large language models (LLMs) with extended context windows show promise for complex legal reasoning tasks, yet their ability to understand long legal documents remains insufficiently evaluated. Developing long-context benchmarks that capture realistic, high-stakes tasks remains a significant challenge in the field, as most existing evaluations rely on simplified synthetic tasks that fail to represent the complexity of real-world document understanding. Overruling relationships are foundational to common-law doctrine and commonly found in judicial opinions. They provide a focused and important testbed for long-document legal understanding that closely resembles what legal professionals actually do. We present an assessment of state-of-the-art LLMs on identifying overruling relationships from U.S. Supreme Court cases using a dataset of 236 case pairs. Our evaluation reveals three critical limitations: (1) era sensitivity -- the models show degraded performance on historical cases compared to modern ones, revealing fundamental temporal bias in their training; (2) shallow reasoning -- models rely on shallow logical heuristics rather than deep legal comprehension; and (3) context-dependent reasoning failures -- models produce temporally impossible relationships in complex open-ended tasks despite maintaining basic temporal awareness in simple contexts. Our work contributes a benchmark that addresses the critical gap in realistic long-context evaluation, providing an environment that mirrors the complexity and stakes of actual legal reasoning tasks.

Looking forward to reading this paper that won the Best Student Paper award at #Jurix2025: "Do LLMs Truly Understand When a Precedent Is Overruled?" by Li Zhang, Jaromir Savelka, and Kevin Ashley. https://doi.org/10.48550/arXiv.2510.20941

#Precedent #LegalNLP #LNLP #NLLP #LLM #LegalAI #LawAndAI


🍽️ Lunch is on us - huge thanks to our sponsor HumanAds (ERC project)
🙌🏆 Best Presentation Award sponsored by @bloomberglp.bsky.social
🧠✨🎉 Get ready for an amazing day of #NLLP #LegalTech discussions - both in-person & online!

EVERYBODY READY? 🫳🎤


🎉 We have an (appropriately colored) poster! 🇨🇳📜

📢 Check the CfP:
🔗 nllpw.org/workshop/call/

📄 Direct submissions (🗓️ deadline: 26/Aug): 🔗 openreview.net/group?id=EMN...

📝 ARR commits (🗓️ deadline: 2/Sep):
🔗 openreview.net/group?id=EMN...

✍️ Did you already start writing?

#NLLP #legaltech #NLProc

NLLP Workshop 2025
The seventh workshop on Natural Legal Language Processing (NLLP 2025) explores methods and applications of Natural Language Processing for the Legal Domain by focusing on legal text and text with lega...

☀️ We're here too - getting ready for the 7th NLLP workshop (8/Nov @emnlpmeeting.bsky.social) 🚀✨
The updated Call for Papers is coming soon - stay tuned! 📣📄

💙 Follow us & spread the word about NLLP 💬

🔗 Keep an eye on the website for updates! 🖥️🔍 nllpw.org/workshop/

#NLLP #legaltech #nlproc #nlp
