Advertisement ยท 728 ร— 90

Posts by Patrick Haller

Preview
BabyHGRN: Exploring RNNs for Sample-Efficient Language Modeling Patrick Haller, Jonas Golde, Alan Akbik. The 2nd BabyLM Challenge at the 28th Conference on Computational Natural Language Learning. 2024.

Are transformers really all we need? I doubt it. We tested alternative backbones for language models in low-resource scenarios โ€” #Mamba, #xLSTM, and #HGRN2 โ€” and they work surprisingly well!

๐Ÿ“„ Paper: aclanthology.org/2024.conll-b...

Thanks for being part of the #BabyLM Challenge! ๐Ÿ‘ถ

1 year ago 2 0 1 0