I have recently launched the Relational Cognition Lab at UC Irvine: relcoglab.org!
We study learning and memory in minds, brains, and machines. I am open to collaborations and hiring a lab technician (lab manager/junior specialist). Job ad & application here: recruit.ap.uci.edu/JPF09400.
Posts by RJ Skerry-Ryan
Huh, interesting
(small explosives)
I'm just wondering how you'd weaponize them. I assume they can only carry a small payload, and mounting a gun on them would probably never work.
Sounds like dropping explosives is the easiest way to weaponize: kstatelibraries.pressbooks.pub/drone-delive...
Earnest Q: What's the connection between this (very impressive) blinkenlight drone swarm (srsly, I am dying with envy of whoever got to build this) and military applications of drones? The drones the US has been bombing people with for decades now have nothing to do with this type of drone, no?
A die photo of the Pentium chip. An arrow points to a location on the die, with the text "FDIV bug". The chip itself has a complex pattern of circuitry with brownish rectangles and lines of various sizes.
In 1994, a math professor discovered that Intel's Pentium chip sometimes gave the wrong answer when dividing. Fixing this "FDIV" bug cost Intel $475 million. I analyzed the Pentium chip and found the bug. 1/N
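Side note for anyone who wants to poke at the numbers: here's a minimal Python sketch of the classic failing division. The buggy quotient is hard-coded from the widely reported value, since a modern FPU will of course compute it correctly.

```python
# Sketch of the classic Pentium FDIV test case (4195835 / 3145727).
# On a correct FPU, x - (x / y) * y is essentially 0; the flawed Pentium's
# quotient made this residue come out around 256.
x, y = 4195835.0, 3145727.0

print("correct quotient:", x / y)                # ~1.333820449...
print("residue, correct FPU:", x - (x / y) * y)  # ~0

# Widely reported buggy quotient (hard-coded as an assumption, since a
# modern FPU won't reproduce it):
flawed = 1.33373906
print("residue, flawed quotient:", round(x - flawed * y))  # ~256
```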
Se Jin Park, Julian Salazar, Aren Jansen, Keisuke Kinoshita, Yong Man Ro, RJ Skerry-Ryan
Long-Form Speech Generation with Spoken Language Models
https://arxiv.org/abs/2412.18603
Here's Veo 2, the latest version of our video generation model, as well as a substantial upgrade for Imagen 3.
(Did I mention we are hiring on the Generative Media team, btw?)
blog.google/technology/g...
My team @GoogleDeepMind in Tokyo is looking for a talented research scientist to work on audio generative models!
Please consider applying if you have expertise in the domain or related areas such as multimodal models, video generation, etc.
boards.greenhouse.io/deepmind/job...
Nice! I'm surprised because the training window size is only 2.5 seconds, and the left context of the transformer is much longer than that, right?
Very nice! Does it generalize to arbitrary length inputs?
Don't forget text-to-speech!
Hm I think I was confused and thought the author of this dataset was banned:
huggingface.co/datasets/not...
That dataset is an instance of "You object to the use of your posts as data? I will make a dataset specifically of you."
Maybe a bad analogy, but IIUC a bunch of people said "I don't want you to do X," some subset of those people were uncivil about it, and the response was to do X.
Totally true for domains where you are a good enough verifier (and you don't get lulled into a false sense of security with it), but a problem I've seen is that you end up trusting it in domains where you're not a verifier, because it tends to be correct in the domains where you are.
A picture of Alexander Fleming.
Thanksgiving shout out to this legend, as I wait in line at the pharmacy to pick up antibiotics for the second time in 2 weeks.
Seems like bullying tbh.
I have to work hard to teach my kids that just because someone hits you doesn't mean you get to hit them back.
f-GAN is an absolute banger: arxiv.org/abs/1606.00709
The theory that developed around GANs is rich and (for me) was transformative.
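If you haven't read it: the core result is the variational lower bound on an f-divergence that the GAN minimax game generalizes. Paraphrasing the bound (f* is the Fenchel conjugate of f, T the critic):

```latex
% f-GAN variational lower bound (Nowozin et al., 2016):
% the critic T is trained to tighten the bound,
% the generator (which defines Q) is trained to minimize it.
D_f(P \,\|\, Q) \;\ge\; \sup_{T \in \mathcal{T}}
  \left( \mathbb{E}_{x \sim P}\!\left[ T(x) \right]
       - \mathbb{E}_{x \sim Q}\!\left[ f^{*}\!\left( T(x) \right) \right] \right)
```

Different choices of f recover the original GAN objective (up to constants) along with KL, reverse KL, and several other divergences.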
Eric Battenberg, RJ Skerry-Ryan, Daisy Stanton, Soroosh Mariooryad, Matt Shannon, Julian Salazar, David Kao
Very Attentive Tacotron: Robust and Unbounded Length Generalization in Autoregressive Transformer-Based Text-to-Speech
https://arxiv.org/abs/2410.22179
arXiv sharing reminder
pdf ❌
abs ✅
Related: bsky.app/profile/rjsr...