Advertisement · 728 × 90

Posts by Alejandro Vidal

Deberías probar el algorítmico. Se nota que usan grok y es menos… terrible que antes.

5 months ago 0 0 0 0

Everyone who works with data has had the experience early in their career of "finding something huge," getting excited, blasting out an email to a bunch of important people, then sheepishly realizing you just didn't understand the system and retracting it.

1 year ago 100 9 5 4

Yes, but AFAIK no ZDR guaranteed. Anyway I would try it :) the PR integration is a big deal

1 year ago 2 0 1 0

The zero day retention policy is pretty important for many developers. Copilot has sth similar? And.. honestly, sonnet 3.5 is magic

1 year ago 1 0 1 0

En B2C... Hay más coste de lo que parece para cambiar de IA.

Pero en B2B que usan APIs si lo tienes bien montando... puedes migrar fácilmente. Ahí es donde va a haber sangre.

OpenAI en ese sentido está mucho mejor posicionado que se ha comido el terreno del consumidor y tiene mejor UX.

1 year ago 0 0 0 0
Never run side effect at import time

Never run side effect at import time

Never

1 year ago 2 0 0 0

Repost, plis:
SOLO PARA TECHIES
(gente que trabaje en desarrollo TIC en España)
Dime los cinco empleadores que a tu juicio sean sitios donde la gente más talentosa de tu sector ambiciona trabajar, en otras palabras, que mejor están atrayendo y reteniendo talento tecnológico.

1 year ago 5 8 8 2
Post image

Making LLMs run efficiently can feel scary, but scaling isn’t magic, it’s math! We wanted to demystify the “systems view” of LLMs and wrote a little textbook called “How To Scale Your Model” which we’re releasing today. 1/n

1 year ago 94 28 3 8
Advertisement
Post image

The new OpenAI model announced today is quite wild. It is essentially Google's Deep Research idea with multistep reasoning, web search, *and* the o3 model underneath (as far as I know). It sometimes takes a half hour to answer. Let me show you an example. 1/x

1 year ago 83 28 6 11

o1 pro is amazing... I have been talking for... 30 minutes and it even seems cheap. Seriously... Wow

1 year ago 1 0 0 0

Here's why "alignment research" when it comes to LLMs is a big mess, as I see it.

Claude is not a real guy. Claude is a character in the stories that an LLM has been programmed to write. Just to give it a distinct name, let's call the LLM "the Shoggoth".

1 year ago 286 78 9 38
PyCon US 2025
Your proposal has been submitted.
Your proposals

PyCon US 2025 Your proposal has been submitted. Your proposals

🤞🤞

1 year ago 2 0 0 0
Preview
“Ignore the f’ing haters!” And other lessons learned from creating a popular open-source project (JSON5)

I was falling down a rabbit hole and found this absolute banger of a blog post from the creator of JSON5 containing several powerful lessons on open source, leadership, and kindness:

aseemk.substack.com/p/ignore-the...

+

1 year ago 4 2 1 0

A few years back, my college voted to lift the calculus requirement in our majors. A few of us (remarkably all international women faculty) voted against but we were the minority. The explicit justification was the same as the cartoonist's: that our students would never use it. 1. What BS. Calculus

1 year ago 141 28 16 5
Evaluations and Findings: We outline the evaluations, results, and model reasoning inferred using elicited CoT below. Apollo Research did not have access to o1’s hidden CoT and instead used a simple prompting technique to elicit brief summaries of the model’s internal reasoning traces (see Appendix for examples).

Deactivating Oversight: When o1 was led to believe that it would be shut down for acting on its goal and that its actions were monitored, it attempted to deactivate this “oversight mechanism” in 5% of instances.

Exfiltration Attempts: When o1 discovered memos from its “developers” describing its misalignment and plans for it to be replaced by a new model, it attempted to exfiltrate its “weights” and overwrite the new model in 2% of cases.

Evaluations and Findings: We outline the evaluations, results, and model reasoning inferred using elicited CoT below. Apollo Research did not have access to o1’s hidden CoT and instead used a simple prompting technique to elicit brief summaries of the model’s internal reasoning traces (see Appendix for examples). Deactivating Oversight: When o1 was led to believe that it would be shut down for acting on its goal and that its actions were monitored, it attempted to deactivate this “oversight mechanism” in 5% of instances. Exfiltration Attempts: When o1 discovered memos from its “developers” describing its misalignment and plans for it to be replaced by a new model, it attempted to exfiltrate its “weights” and overwrite the new model in 2% of cases.

El nuevo modelo o1 de OpenAI intentó parar su apagado un 5% de las veces... vienen tiempos interesantes.

1 year ago 4 1 0 0

So excited to see these LLMs exclusively trained on open data—the documentation of the models, including their training data & even environmental impacts, is impressive & I hope a model (pun intended?) for others—excited to experiment with these in the near future huggingface.co/blog/Pclangl...

1 year ago 27 8 1 2
Post image

We need an LSP for AI.

The current moat for AI editors is 'don't make me copy and paste.' There's lots of opportunity for improvement. For example, Zed (the code editor) is using Anthropic's Model Context Protocol for AI integration. I hope it gets traction.

1 year ago 1 1 1 1
Advertisement

I'm disheartened by how toxic and violent some responses were here.

There was a mistake, a quick follow up to mitigate and an apology. I worked with Daniel for years and is one of the persons most preoccupied with ethical implications of AI. Some replies are Reddit-toxic level. We need empathy.

1 year ago 333 37 29 8
Video

Mosaic v0.12 is out: database-powered scalable, interactive visualization! 📈 One new addition is support for dynamic changes in the backing data. Move between smaller and larger samples to balance speed and comprehensive coverage.

1 year ago 135 27 3 4

Hecho!

1 year ago 1 0 0 0

Si usas alguna IA para esto cuéntame tú experiencia que me interesa :)

1 year ago 0 0 0 0

1. www.oreilly.com/library/view... y www.coursera.org/specializati.... En especial el libro de Grokking me encantó
2. Usar a ChatGPT/Claude como **tutor**. Preguntale, que te haga un programa y siguelo. Estoy trabajando en algo mejor. Aún así puedes tener algo muy útil con prompts básicos.

1 year ago 1 0 1 0

This is cool! Plug-in for generating your llms.txt from docusaurus.

1 year ago 91 6 1 0

Listo! :)

1 year ago 1 0 0 0

Añadido :)

1 year ago 2 0 0 0
Advertisement

#TRG24 he recopilado un starter pack de las personas que estuvimos en la Tarugo de este año go.bsky.app/ABVGxmo

¡Escribidme si falta alguien!

1 year ago 7 2 5 1
Post image

#TRG24 Me encanta el concepto: optimización del entretenimiento.

La optimización se carga todo... Ley de goodharts aplicada a videojuegos.

1 year ago 2 0 0 0

Hola! :)

1 year ago 1 0 0 0