Advertisement Β· 728 Γ— 90

Posts by Maria Gorinova

The Population Test Your Agent Must Pass | Maria Gorinova
The Population Test Your Agent Must Pass | Maria Gorinova YouTube video by AI Native Dev

Learn more about our work on abstraction adherence of coding agents! www.youtube.com/watch?v=nx5_...

#coding #aicoding #agenticai #codegen #devtools #cursor #claudecode

4 months ago 3 0 0 0
Video

Had such a great chat with @sjmaple.bsky.social about abstraction adherence of coding agents, as well as what matters when it comes to evaluation. Check it out in the new @ainativedev.io podcast episode!

YouTube: ainativedev.co/ble
Apple Podcasts: ainativedev.co/exx
Spotify: ainativedev.co/2mn

4 months ago 1 0 0 0

At the risk of starting the flame war to end all flame wars...

Modern LLMs (GPT-5.1, Claude 4.5, Gemini 3) produce excellent code and can be a significant productivity boost to software engineers who take the time to learn how to effectively apply them - especially if used with coding agent tools

4 months ago 772 76 90 55
Preview
AI Lovable Hackathon London: Community Tools Edition Β· Luma 🧠 AI Hackathon: Community Tools Edition Hosted by Led by Community Powered by Lovable & Tessl Join us for a hands-on hackathon to build AI-powered tools that…

Build AI tools for community growth at our hackathon with Lovable & Led By Community

One day to collaborate with developers, designers, and community builders. Focus on real problems: onboarding, engagement, moderation, and events.

πŸ“… Dec 3rd, 9 AM-5 PM
πŸ“ Tessl HQ, London

luma.com/ai-hackathon...

4 months ago 6 3 1 0
Preview
A Proposed Evaluation Framework for Coding Agents: Specs Enhance Proper Use of Public APIs by ~35% This article proposes an evaluation framework highlighting how specifications enhance coding agents' effective use of public APIs.

Super excited to share what we've been doing at @tessl.io to improve the quality of code generated by AI agents! πŸ€–

We introduce a new way to measure abstraction adherence and show how Tessl's usage specs significantly boost it.

Check out the full article!

tessl.io/blog/propose...

5 months ago 5 0 0 0

It's funny to me how British digital newspapers are crying out loud what an invasion of privacy a state-issued digital ID is, but force me to accept tracking cookies or pay a fee.

6 months ago 4 0 0 0
Preview
a raccoon laying on a bed with a bowl of blueberries in its mouth Alt: a raccoon laying on a bed with a bowl of blueberries in its mouth; labelled "bluwbewwy"

"I really need a tool that helps me count the number of letters in 'blueberry'. It will change my life."

... said no one ever.

8 months ago 2 0 0 0
Preview
a pirate hat with a skull and crossbones and a sword on it Alt: a cat with a pirate hat with a skull and crossbones and a sword on it
8 months ago 1 0 0 0
Advertisement

I stole this for the linkedin post πŸ˜‚ Gratitude 🫑

8 months ago 1 0 1 0

Aaaaaaa! I'm regretting my choices πŸ₯²

8 months ago 2 0 0 0

I actually like the em dash. Am I an LLM? πŸ€”

8 months ago 18 0 2 2
A chatgpt generated take on the iconic kiss on the Berlin wall. But with Elon Musk kissing the Twitter blue bird. The text reads "My God, help me survive this tweet love"

A chatgpt generated take on the iconic kiss on the Berlin wall. But with Elon Musk kissing the Twitter blue bird. The text reads "My God, help me survive this tweet love"

A few years late but I was finally able to generate this

10 months ago 6 0 0 0
Preview
a man is holding a piece of bread over a woman 's face and asking what are you ? ALT: a man is holding a piece of bread over a woman 's face and asking what are you ?

What I imagine open ai feels like after reading that s1 paper

1 year ago 2 0 0 0

Wait... So s1 cost only $50 to fine-tune and beat o1-preview. And the secret sauce is... forcing the model to generate "Wait" instead of end-of-sequence???

Ahahhahahhaha this is so cool

1 year ago 2 0 1 0

🚨 MAJOR ALERT 🚨

The people wanted verification badges on Bluesky and @guan.dk and I have teamed up to bring you verification badges.

Behold! It is the Official Verified Labeller!
bsky.app/profile/veri...

1 year ago 2772 635 242 148

Strong characters sounds amazing + I think there is a lot to learn from that time if portrayed well. Thanks for the recommendation, I will give it a go!

1 year ago 2 0 0 0

Hi John,

That’s what all the bots say πŸ˜³πŸ˜‚

Not really my area of expertise, I know of the very obvious ones like input sanitation, regular finetuning on problematic examples, etc. The folks at lakera.ai have been on it for some time and regularly host hacking challenges, which I enjoy following

1 year ago 2 0 2 0

I love Silo and For All Mankind. Maybe I should try Halt & Catch Fire next!

1 year ago 1 0 1 0
Advertisement

Thank you! Indeed, what a debacle πŸ˜‚ 🍿

1 year ago 1 0 0 0

What's the drama with Byzantine? Did someone raise it during reviewing in the past? Is there a public discussion? 🍿

1 year ago 1 0 1 0

Bluesky's firehose is a treasure trove of public data for researchers and developers, and it's completely free. Check out our developer docs: docs.bsky.app

1 year ago 7889 1526 318 166
Book outline

Book outline

Over the past decade, embeddings β€” numerical representations of
machine learning features used as input to deep learning models β€” have
become a foundational data structure in industrial machine learning
systems. TF-IDF, PCA, and one-hot encoding have always been key tools
in machine learning systems as ways to compress and make sense of
large amounts of textual data. However, traditional approaches were
limited in the amount of context they could reason about with increasing
amounts of data. As the volume, velocity, and variety of data captured
by modern applications has exploded, creating approaches specifically
tailored to scale has become increasingly important.
Google’s Word2Vec paper made an important step in moving from
simple statistical representations to semantic meaning of words. The
subsequent rise of the Transformer architecture and transfer learning, as
well as the latest surge in generative methods has enabled the growth
of embeddings as a foundational machine learning data structure. This
survey paper aims to provide a deep dive into what embeddings are,
their history, and usage patterns in industry.

Over the past decade, embeddings β€” numerical representations of machine learning features used as input to deep learning models β€” have become a foundational data structure in industrial machine learning systems. TF-IDF, PCA, and one-hot encoding have always been key tools in machine learning systems as ways to compress and make sense of large amounts of textual data. However, traditional approaches were limited in the amount of context they could reason about with increasing amounts of data. As the volume, velocity, and variety of data captured by modern applications has exploded, creating approaches specifically tailored to scale has become increasingly important. Google’s Word2Vec paper made an important step in moving from simple statistical representations to semantic meaning of words. The subsequent rise of the Transformer architecture and transfer learning, as well as the latest surge in generative methods has enabled the growth of embeddings as a foundational machine learning data structure. This survey paper aims to provide a deep dive into what embeddings are, their history, and usage patterns in industry.

Cover image

Cover image

Just realized BlueSky allows sharing valuable stuff cause it doesn't punish links. 🀩

Let's start with "What are embeddings" by @vickiboykis.com

The book is a great summary of embeddings, from history to modern approaches.

The best part: it's free.

Link: vickiboykis.com/what_are_emb...

1 year ago 651 101 22 6

πŸ’― no fake science please 😬😬😬 This sort of accounts can also be used to build up a following base and then activate as part of an influence campaign

1 year ago 4 0 1 0

Personally, I like Quiet Posters: it surfaces posts I wouldn't have seen otherwise!

1 year ago 0 0 0 0

By the authority vested in me by the decentralised gods (πŸ’…πŸ») I accept this proposition.

(πŸ’…πŸ» Sarcasm. I possess no such authority. The decentralised gods would never grant it to me (πŸ’…πŸ»))

1 year ago 1 1 1 0
Advertisement

Maybe we need a special sarcasm tag here on bsky

1 year ago 1 0 1 0

My question was a joke!

I know what the word means 🀣

1 year ago 2 0 2 0

What do you mean "once"? 🀨

1 year ago 1 0 1 0