Patrick Pérez (@ptrkprz) Bsky

Unmute adds ears and vocal cords to your favorite text-based language model. A seamless plug-and-play augmentation with easy personalisation through voice conditioning and text instructions. We will open-source it shortly.

10 months ago 12 3 0 0

After its preview release last January, Helium 1 now reaches its full scale, with 2 billion well-used open parameters. 🇧🇬 🇭🇷 🇨🇿 🇩🇰 🇳🇱 🇬🇧 🇪🇪 🇫🇮 🇫🇷 🇩🇪 🇬🇷 🇭🇺 🇮🇪 🇮🇹 🇱🇻 🇱🇹 🇲🇹 🇵🇱 🇵🇹 🇷🇴 🇸🇰 🇸🇮 🇪🇸 🇸🇪

11 months ago 4 0 0 0

One virtue of open models is that anyone can adapt them to their needs. This is even more impactful when finetuning is data- and compute-efficient, which is something we strive for at Kyutai. Let’s start with Moshi, our groundbreaking multi-stream spoken dialogue model.

1 year ago 2 0 0 0

🔥🔥🔥 CV Folks, I have some news! We're organizing a 1-day meeting in central Paris on June 6th before CVPR called CVPR@Paris (similar to NeurIPS@Paris) 🥐🍾🥖🍷

Registration is open (it's free) with priority given to authors of accepted papers: cvprinparis.github.io/CVPR2025InPa...

Big 🧵👇 with details!

1 year ago 136 52 7 11

Diplomacy dies on live TV as Trump and Vance gang up to bully Ukraine leader
US president said his horrific blow-up would make ‘great television’ – the White House has never seen anything like it

I wish it were a disgraceful video generated by an unhinged AI. Unfortunately, it is the disgraceful new reality. Shame on Trump and Vance.
www.theguardian.com/us-news/2025...

1 year ago 4 0 0 0

Pushing testing dedication to the next level.

1 year ago 5 0 0 0

Simultaneous speech-to-speech translation on mobile is a world premiere. In the near future, no one will ever be lost in translation (at least for linguistic reasons).

1 year ago 7 1 0 0

New sharing step on our journey towards easy-to-use fully-open models.

1 year ago 15 7 0 0
