Advertisement ยท 728 ร— 90

Posts by Patrick Haller

Excited to share that our group will present 9 papers at this year's ACM Symposium on Eye Tracking Research & Applications (ETRA) in Tokyo!

We will post summaries of each paper in the coming weeks, but here's a quick sneak peek ๐Ÿ‘€

1 year ago 7 3 1 0
ACL 2025 Tutorial: Eyetracking and NLP ACL 2025 Tutorial on Eyetracking and NLP

At this year's ACL in Vienna, @lenajaeger.bsky.social and David Reich from our group, together with @whylikethis.bsky.social and Omer Shubi, will be hosting a tutorial on EyeTracking and NLP ๐Ÿ‘€ ๐Ÿ–ฅ๏ธ Be there to join us!

More information can be found here: acl2025-eyetracking-and-nlp.github.io

1 year ago 6 3 0 1
Preview
Sometimes I am a Tree: Data Drives Unstable Hierarchical Generalization Language models (LMs), like other neural networks, often favor shortcut heuristics based on surface-level patterns. Although LMs behave like n-gram models early in training, they must eventually learn...

Transformer LMs get pretty far by acting like ngram models, so why do they learn syntax? A new paper by sunnytqin.bsky.social, me, and @dmelis.bsky.social illuminates grammar learning in a whirlwind tour of generalization, grokking, training dynamics, memorization, and random variation. #mlsky #nlp

1 year ago 142 31 5 4
Tannon Kew presenting during his viva.

Tannon Kew presenting during his viva.

Congratulations to Dr. @tannonk.bsky.social, who just successfully defended his thesis on "Leveraging Data, Decoding, and Context for Controlling Text Generation from Pretrained Language Models". Special thanks to the external examiner @feralvam.bsky.social!

1 year ago 18 5 0 0

@echodroff.bsky.social

1 year ago 1 0 1 0