π¨ Survey Alert π¨
Our paper "Back to recurrent processing at the crossroad of transformers and state-space models" is out in Nature Machine Intelligence!
π www.nature.com/articles/s42...
Read it here: rdcu.be/el85K
#Mamba #SSMs #Transformers #LLMs
Posts by Matteo Tiezzi
(Top) A standard MLP catastrophically forgets samples learned online when data distribution shifts
(Bottom) A Memory Head is capable to avoid forgetting thanks to its learnable key-value routing mechanism. Keys become representative of the data distribution.
proceedings.mlr.press/v274/tiezzi2...
TL;DR: in Memory Heads, neurons route their computation across a learnable key-value mechanism
β‘οΈ dynamic behaviour depending on their own input.
β‘ Only the units(weights) that are relevant to process the observed sample are blended β‘ parameter isolation
proceedings.mlr.press/v274/tiezzi2...
@collasconf.bsky.social The #CoLLAs 2024 proceedings of our paper "Memory Head for Pre-Trained Backbones in Continual Learning" are out:
proceedings.mlr.press/v274/tiezzi2...
Video: lifelong-ml.cc/Conferences/...
π₯ Memory Head Code repository: github.com/sailab-code/...
A threadπ§΅
π Best of Both Worlds!
βHave you ever wondered why hybrid models (RNN + Transformers) are powerful? We answer this through the lens of circuit complexity and graphs!
Excited to share our work on understanding Graph Sequence Models (GSM), which allows the use of any sequence model for graphs.
π± Learning Over Time (LOT) Spring School π
24-27 March 2025 | π Siena, Italy
π sites.google.com/unisi.it/lot...
Organizers:
Stefano Melacci, V. Lomonaco, Andrea Cossu, Alessandro Betti, @mtiezzi.bsky.social
#ContinualLearning #CL #LifelongLearning #CollectionlessAI
πΈ Join the LOT Spring School with an incredible speaker lineup:
Battista Biggio, NicolΓ² Cesa-Bianchi, Elisa Ricci, Antonio Orvieto, Marco Gori, Tinne, Tinne Tuytelaars
π sites.google.com/unisi.it/lot...
π Pre-registrations are now open, secure your spot today: sites.google.com/unisi.it/lot...
π± Learning Over Time (LOT) Spring School
π
24-27 March 2025 | π Siena, Italy
π‘ Are you considering models that continuously adapt over time instead of learning "offline" from pre-designed-huge collections of data? π
π sites.google.com/unisi.it/lot...
#ContinualLearning #CL #LifelongLearning
Would love to join, working in CL and I was an organizer of CoLLAs 2024!
π GLOW is coming back in December with amazing speakers: Emily Jin and @joshsouthern.bsky.social !
ποΈ Dec 18th @ 17 CET on Zoom, don't miss that!
π Find more here: sites.google.com/view/graph-l...
Join this year's last edition of Graph Learning on Wednesdays (GLOW) tomorrow at 5pm CET!
May I join? Thanks
Entering the field, could you add me? Thanks!
β
Starter pack for people in Lifelong and Continual Learning βοΈ
@logconference.bsky.social #LOG meetup with @federico-errica.bsky.social open talk on "What is going on with oversmoothing, oversquashing, and underreaching?"
Wonderful location for the Siena #LoG meetup's poster sessionπ€©
#LOG2024 Franco Scarselli kicking off the Learning on Graphs (LOG) Italian Meetup!!!
Working in Lifelong/Continual Learning, and I was among the organisers of last CoLLAs conference! Please add meβΊοΈ