Advertisement · 728 × 90

Posts by Yarin

Hot take: I think we just demonstrated the first AI agent computer worm 🤔

When an agent sees a trigger image it's instructed to execute malicious code and then share the image on social media to trigger other users' agents

This is a chance to talk about agent security 👇

1 year ago 8 2 0 0

Very interesting paper about unlearning for AI Safety, a subject that deserves more attention. ⬇️

1 year ago 50 6 0 0

Thanks to my amazing collaborators:

@TingchenFu @AmyPrb @StephenLCasper

@AmartyaSanyal @Adel_Bibi @aidanogara_ @_robertkirk @ben_s_bucknall @fiiiiiist Luke Ong @philiptorr Kwok-Yan Lam @RobertTrager

@DavidSKrueger @sorenmind José Hernández-Orallo @megamor2.bsky.social @yaringal.bsky.social

1 year ago 7 1 0 0

Excited about our most recent work on the challenges we face when using unlearning method for safe and secure AI! Work done collaboratively by a great team & led by @fbarez.bsky.social

1 year ago 11 0 0 0

Link?

1 year ago 3 0 1 0
A photograph of a white robot wearing a hat holding up a white sheet of paper with the number eight on it. In the background, there is a faded beige building in front of a dark blue backdrop.

A photograph of a white robot wearing a hat holding up a white sheet of paper with the number eight on it. In the background, there is a faded beige building in front of a dark blue backdrop.

A dark blue background with white text reading ‘...coverage in eight of the top papers for Associate Professor Yarin Gal’s research projects. Associate Professor Yarin Gal is in the papers a lot... for good reason! The AI and ML Associate Professor’s work on researching the potential ‘collapse’ of machine learning models, hallucinating Large Language Models (LLMs), and the AI tool EVEscape that could help predict viral outbreaks have all been picked up in the press this year. Take a look: Nature (Model Collapse). BBC (EVEscape). Financial Times (Model Collapse). Forbes (Model Collapse). Time (LLM hallucination). Independent (LLM hallucination). Euronews (LLM hallucination). The Standard (LLM hallucination)’.

A dark blue background with white text reading ‘...coverage in eight of the top papers for Associate Professor Yarin Gal’s research projects. Associate Professor Yarin Gal is in the papers a lot... for good reason! The AI and ML Associate Professor’s work on researching the potential ‘collapse’ of machine learning models, hallucinating Large Language Models (LLMs), and the AI tool EVEscape that could help predict viral outbreaks have all been picked up in the press this year. Take a look: Nature (Model Collapse). BBC (EVEscape). Financial Times (Model Collapse). Forbes (Model Collapse). Time (LLM hallucination). Independent (LLM hallucination). Euronews (LLM hallucination). The Standard (LLM hallucination)’.

On the eighth day of Christmas, RobOx gave to us: coverage in eight of the top papers for Associate Professor Yarin Gal’s research projects. @yaringal.bsky.social

#CompSciOxford #12DaysOfChristmas #Oxmas

1 year ago 3 1 0 0
Post image

I look forward to co-directing the Canadian AI Safety Institute (CAISI) Research Program at CIFAR with @catherineregis.bsky.social

We will be designing the program in the coming months and will soon share ways to get involved with this new community.

Read more here: cifar.ca/cifarnews/20...

1 year ago 30 5 4 0
Post image

I'm looking for PhD applicants who have expertise in Gaussian processes and/or Transformers for an exciting PhD project

If this sounds interesting, application deadline for funding is 3/12

Please share with people you think this might be relevant to!

oatml.cs.ox.ac.uk/apply.html

1 year ago 38 8 1 0

Welcome to the Crazy Rich Bayesian Starter Pack, folk who are/were vaguely into Bayesian reasoning but - with a few exceptions - don't shun the non-Bayesian.
go.bsky.app/JYH5Z6M

1 year ago 76 13 24 5

@girving.bsky.social probably has more suggestions. Maybe Scott?

1 year ago 0 0 0 0
Advertisement
Post image

brew install mactop
github.com/context-labs...

1 year ago 51 6 1 1

The International Society for Bayesian Analysis (ISBA) has joined Bluesky. You can follow the account at @isba-bayesian.bsky.social to stay updated on events, publications, and discussions within the #Bayesian community.

Please add the account to your starter packages.

1 year ago 39 17 1 0

Now that @jeffclune.bsky.social and @joelbot3000.bsky.social are here, time for an Open-Endedness starter pack.

go.bsky.app/MdVxrtD

1 year ago 105 32 16 5

On my way to Oxford to meet amazing people and give a talk on the opportunities of AI to accelerate progress in environmental modeling.

1 year ago 15 1 2 0
Preview
Assistant Professor (Tenure Track) of Computer Science – Responsible Artificial Intelligence

📣 We have a tenure-track faculty opening in Responsible AI at @ethzurich.bsky.social :
ethz.ch/en/the-eth-z.... Deadline Nov 30 for full consideration. ETH Zurich is a vibrant environment for AI research with the ETH AI Center etc. Please help spread the word!

1 year ago 79 23 2 0

Some machine learners were once children. Here’s where you can find them:

go.bsky.app/F6mM37U

1 year ago 124 16 18 3

I don’t need to go on social media to have my worldview challenged I am in theoretical physics I have a new existential crisis daily

1 year ago 26699 2025 366 122
Preview
MaPPing Your Model: Assessing the Impact of Adversarial Attacks on LLM-based Programming Assistants LLM-based programming assistants offer the promise of programming faster but with the risk of introducing more security vulnerabilities. Prior work has studied how LLMs could be maliciously fine-tuned...

Since this platform is finally attracting a critical mass of ML researchers, here's our recent work on prompt-based vulnerabilities of coding assistants:

arxiv.org/abs/2407.11072

TL;DR — An attacker can convince your favorite LLM to suggest vulnerable code with just a minor change to the prompt!

1 year ago 214 33 4 4
Advertisement
Post image

Hey, this Friday I'm the Keynote speaker at the 20th AAAI Conference on AI and Interactive Digital Entertainment (AIIDE), the best conference on AI and Games sites.google.com/gcloud.utah....

I think I will talk about why the next big challenge in AI game playing should be Dungeons and Dragons 🧙🐉

1 year ago 84 7 7 4

All the ACL chapters are here now: @aaclmeeting.bsky.social @emnlpmeeting.bsky.social @eaclmeeting.bsky.social @naaclmeeting.bsky.social #NLProc

1 year ago 107 37 1 3

Hey! @friedler.net made a FAccT starter pack: bsky.app/starter-pack...

1 year ago 10 5 0 0
Post image

Hope I'm the first to post this all time classic on this platform

1 year ago 2924 623 39 28

Hey, @bsky.app @support.bsky.team, is there a way for you to shorten the displayed usernames when trailed by “bsky.social”? If someone has some other domain name, then fine, show that, but if we're using the default domain, can we get rid of these lengthy string of characters?

1 year ago 85 7 6 1

I've created an initial Grumpy Machine Learners starter park. If you think you're grumpy and you "do machine learning", nominate yourself. If you're on the list, but don't think you are grumpy, then take a look in the mirror.

go.bsky.app/6ddpivr

1 year ago 413 55 124 15

Google DeepMind is hiring Student Researchers in EMEA 👇

1 year ago 33 4 1 0
https://ai.ethz.ch/education/phd-and-postdoc-programs.html

📣 Last call for the Ph.D. and Postdoc Fellowships at the ETH AI Center -- Deadline Nov 19 '24 t.co/aYI5tWXUWK @ethzurich.bsky.social

1 year ago 21 9 0 0
Preview
Safety case template for frontier AI: A cyber inability argument Frontier artificial intelligence (AI) systems pose increasing risks to society, making it essential for developers to provide assurances about their safety. One approach to offering such assurances is...

I’m keen to dig more into safety cases, there’s something ‘proving a negative’ about them but equally it’s good to see a really concrete attempt to tether speculation. Here’s a new piece from UK AISI @girving.bsky.social and gov AI attempting to provide a template

arxiv.org/abs/2411.08088

1 year ago 9 2 1 0

Couldn't find a machine learning for health starter pack so I made one. 

DM/Reply if you want to be added!

go.bsky.app/PJKJ8vK

1 year ago 109 29 48 0
Advertisement
Preview
a group of lizards are eating a piece of food on a wooden table with the year 2011 on the bottom right Alt: Many geckos eating yummy snacks off a dish on a wood railing

I love a gecko chamber

1 year ago 24 4 0 0

I created a starter pack of scientists in the European Laboratory for Learning and Intelligent Systems (ELLIS) 🇪🇺

Please ping me and I‘ll add you.

go.bsky.app/Cihupkk

1 year ago 77 27 46 1