Advertisement · 728 × 90

Posts by John Kutay

Fun conversation with @jkxosound.com about where I think things are going in the AI agents space, Strands Agents, and rats 🐀

9 months ago 7 2 0 0
Retrieval as a Tool, Not a Destination – with Clare from AWS
Retrieval as a Tool, Not a Destination – with Clare from AWS YouTube video by Striim

youtu.be/mDFuNhehIOc?...

9 months ago 0 0 0 0
Preview
Strands Agents: A Model-Driven Approach to AI Agents with Clare Liguori (Senior Principal Engineer at AWS) What's New In Data · Episode

Link to the full episode:

open.spotify.com/episode/79PK...

9 months ago 0 0 0 0
Video

@clare.dev is one of the leaders making AI engineering simple and scalable at AWS. Had a great time chatting with her as we discussed Strands Agents and patterns like “Retrieval as a Tool”.

9 months ago 1 0 2 1
Preview
I'm sorry, but those are vanity evals | Hex Using GPT-4.1 as a case study of our framework for impactful LLM evaluation

hex.tech/blog/im-sorr...

9 months ago 0 0 0 0
Post image

Really appreciate the depth at which the Hex team broke down their Text-to-SQL implementation. Everyone's trying to teach LLMs SQL like it's a training problem but it's really a graph traversal problem.

9 months ago 1 0 1 0
Preview
Beyond Materialized Views: Using DuckDB for In-Process Columnar Caching In this post we will talk about using DuckDB as the operational analytics store for the control plane of Striim Developer — a serverless…

medium.com/striim/beyon...

1 year ago 0 0 0 0
Post image

In my post about #DuckDB I digress into the role of the database buffer cache to discuss how we segregate transactional workloads from analytical. DuckDB turned out to be a natural, lightweight approach to offloading analytical queries ensuring our application upheld performance requirements.

1 year ago 2 1 1 0
Preview
Beyond Materialized Views: Using DuckDB for In-Process Columnar Caching In this post we will talk about using DuckDB as the operational analytics store for the control plane of Striim Developer — a serverless…

Instead of materialized views, we built in-process DuckDB caching in the control plane of Striim Developer — improving query performance 5–10x with zero added infra.

PostgreSQL for OLTP, DuckDB for Operational OLAP. But I won't call it HTAP 🤐

medium.com/striim/beyon...

1 year ago 3 1 0 0
Distributed PostgreSQL with Aurora DSQL
Distributed PostgreSQL with Aurora DSQL YouTube video by Striim

@marcbrooker.bsky.social breaks down how they've architected a fully ACID-compliant database service that combines simple, serverless management with high availability and massive scale on AWS Aurora DSQL.

youtube.com/shorts/dScUi...

1 year ago 10 2 0 0
Advertisement

Thanks Marc! Super fun to learn how you combined the best parts of PostgreSQL and your own distributed processing engine.

1 year ago 0 0 0 0

This was actually my longest podcast ever at over 70 minutes. Not sure I could have made it any shorter because nerding out on databases with Andy Pavlo was too fun.

1 year ago 3 1 0 0
Andy Pavlo on Vector Databases
Andy Pavlo on Vector Databases YouTube video by Striim

Was super fun chatting with @andypavlo.bsky.social
to kick off the new season of What's New in Data. We dive into vector databases, text to sql, trends in data infrastructure, and Andy's awesome (and open) database course.

youtube.com/shorts/tjLmx...

1 year ago 4 0 0 1

A side effect of LLMs: I'm taking on way more than I ever have in my life. I don't know if this is more productive or diluting myself. tbd!

1 year ago 0 0 0 0

Just found out one of the internal b2b CRUD app vendors is more like CRD because it doesn't support updating submissions. AI gonna cook that sector so hard.

1 year ago 0 0 0 0
Post image

and that’s why I’m working on a Saturday morning 🫠

1 year ago 4 1 0 0

Your adversaries are taking (not my) Presidents Day off. Time to ship. 🚀

1 year ago 1 0 0 0

I’ll never forget where I was the day I learned oats could be milked.

1 year ago 0 0 1 0
Advertisement

Them: Wait so you're saying I don't need to deploy Kafka?
Me: No
Them: Kinesis?
Me: No
Them: Zookeeper? YARN?
Me: No
Them: Will you write every record to disk and replicate it?
Me: No

Unfortunately the bar of complexity for streaming has been set so high. I'm calling it Streamholm Syndrome.

1 year ago 4 0 2 0
I am trying to escape the Fivetran price increase

I'm not sure whether to be more amazed at the hate for FiveTran's price increase or the fact that Reddit doesn't know Striim exists and are proposing batch solutions to this persons obvious streaming CDC use case.

www.reddit.com/r/dataengine...

1 year ago 5 0 1 0

They really gave the smell of rain an epic name: petrichor. They really did that.

1 year ago 3 1 0 0
Sign Up - Striim Developer

If you don't have that type of scale, but simply want a reliable, real-time streaming service, you can use Striim Developer for free 🤘
signup-developer.striim.com

1 year ago 0 0 0 0
Post image

A single Striim cluster (multi-node for scalability and fault tolernace) can handle 35k, very wide, very active databases that produce millions of DML per hour hour and dozens of DDL per day. The 'intelligence' layer or Striim was able to apply rule based logic on how to handle complex DDL.

1 year ago 0 0 1 0

I will die on this hill but MySQL's 'Alter Table Add Column AFTER' DDL is pointless. It doesn't change the layout on disk. If you care about order of the columns, that's purely a read side construct and you should address it in your query not your DDL!

1 year ago 1 0 0 0
Preview
Real-Time RAG: Streaming Vector Embeddings and Low-Latency AI Search Imagine searching for products on an online store by simply typing “best eco-friendly toys for toddlers under $50” and getting instant, accurate results—while the inventory is synchronized seamlessly ...

We’ve shifted embedding generation and transformers left into the streaming layer to support near real-time RAG. Take a read if you want to hear the optimizations we made for change data capture and incremental embedding generation.

www.striim.com/blog/real-ti...

1 year ago 3 0 0 0
Post image

Remind me: 1,000 days.

1 year ago 4 0 0 0

Driving from SF to LA talking to ChatGPT about Kafka. I think that’s how schizo starts.

1 year ago 1 0 0 0
Advertisement

Nice screenshot !

1 year ago 2 0 1 0
Post image

I always love seeing clovers in the wild! Knowing every transaction is streamed to Snowflake in real time with Striim’s streaming CDC service. We do a lot of work to ensure transactions are bound reliably and replicated with no duplicates while maintaining low latency.

1 year ago 1 0 0 0
Preview
Escape from the Palisades: Split-second decision making, confusing responses YouTube video by Los Angeles Times

youtu.be/9YSU-M0m1Jk?...

When I saw the fire up the hill, I knew we wouldn't have much time to get out before the only escape road would become gridlocked. I texted my friends up on Lachman Ln an hour before they got an alert from the city to leave.

1 year ago 0 0 0 0