Advertisement · 728 × 90

Posts by Ananth Packkildurai

I just realized: if Airflow and other orchestration engines emit open-lineage data to S3 Files and enable Claude to search them, you've got a data catalog.

1 week ago 1 0 2 0
Preview
Semantic Layer vs. Text-to-SQL: 2026 Benchmark Update | dbt Developer Blog With 2026's best models, the dbt Semantic Layer hits near-100% accuracy for covered queries. Here's what changed and what didn't in our updated benchmark.

So, are we debating the semantic layer again?

docs.getdbt.com/blog...

What do you call a semantic layer from your perspective?

1 week ago 2 0 0 0

Whether you like or dislike Apache Kafka, its KIPs are among the best learning materials for distributed systems. KIP-848 is an excellent read
cwiki.apache.org/con...

1 week ago 2 0 0 0
Preview
The Missing Interface in Data Platform Engineering How data leaders should design the boundary between platforms and dependent teams.

Most data platform failures don’t start with bad infra. They start at the team boundary. My new post argues platforms scale through operating interfaces: contracts, ownership, communication, and adoption design, not tooling alone.

2 weeks ago 2 0 0 0
Preview
ETL is Dead Why the shift from human-operated to agent-operated data warehouses demands a new architecture

More ETL pipelines will run next year than ever before. And ETL is still dead. Not dead, like nobody uses it. Dead like landlines — they work, but nobody builds their strategy around one.

1 month ago 1 0 0 0
Preview
Data Engineering After AI Moving Data Was Never the Point. Meaning It Is.

The data engineer job title is due for an update.

Not because AI is replacing the role, but because AI is finally revealing what the role was always actually about.

Moving data was never the point. Meaning it is.

Read more:

1 month ago 1 0 0 0

At the end of 2026, we will talk about "AI Fan Effect" [en.wikipedia.org/wik...] and the invention of a new field: Psychology for AI. Perhaps, I feel this is the future of software engineering.

1 month ago 1 0 0 0
Preview
The Missing Layer in Your AI Stack: Context, Not Just State From SQL to Semantics: The Rise of the Context Graph for AI Agents

As we move from dashboards to autonomous agents, something breaks.

Systems of record capture what happened, not why.

Why data platforms need Truth Registries + Context Graphs for the agentic era 👇
www.dataengineeringw...

#DataEngineering #AgenticAI #Graphs #LLMs

2 months ago 3 0 0 0
Post image

Data Engineering Weekly's 254th edition is out. Context Graph is the new talk of the town!!

2 months ago 0 0 0 0
Post image

The companies that build the most boring data stack often win the market!!!

Prove me wrong.

2 months ago 1 0 0 0
Advertisement
Preview
Data Contracts: A Missed Opportunity The Conversation We Should Have Had—Before Thought Leadership Replaced System Design

Data Contract: There was no shortage of activity around the topic. Definitions were proposed and refined. Conceptual boundaries were drawn and redrawn.

I pen down a reflection of the Data Contracts here

www.dataengineeringweekly.com/p/data-contr...

3 months ago 1 0 0 0

How to build a scalable shopping agent?
Here's a wild thought:
What if—and hear me out—we let humans click that Buy Now button? Just throwing ideas out there.

3 months ago 1 0 0 0
Preview
Data Engineering Weekly #252 The Weekly Data Engineering Newsletter

This week, it is mostly about Multi-Agent Architecture. Do you think the data infrastructure is ready for a multi-agent architecture? Where is the gap?

3 months ago 1 0 0 0
Preview
A Critique of Iceberg REST Catalog: A Classic Case of Why Semantic Spec Fails How a Semantically Correct API Becomes Operationally Unreliable at Scale

Is semantic Spec Good enough to run an enterprise system? I listed challenges to adopting the Iceberg Rest Catalog

3 months ago 0 0 0 0
Preview
DEW - The Year in Review 2025 From Digital Plumbers to Architects of Intelligence: The 7 Paradigm Shifts That Defined 2025

Continuing our yearly tradition of Year in Review Data Engineering Weekly, we published the 2025 Year in Review. What do you think is the most notable trend of 2025?

3 months ago 0 0 0 0
Post image

www.dataengineeringw...

4 months ago 3 0 0 0
Post image
4 months ago 6 0 0 0

Look at the tech stack IBM now controls:

🐧 Compute: Red Hat (Linux/OpenShift)
☁️ IaC: HashiCorp (Terraform)
💰 FinOps: Kubecost
🌊 Streaming: Confluent (Kafka)
🧠 Vector/AI: DataStax (Cassandra)
⚡ Query Engine: Ahana (Presto)
🔄 Ingest: StreamSets

4 months ago 4 0 1 0
Preview
Data Engineering Weekly #247 The Weekly Data Engineering Newsletter

LinkedIn moves FishDB to Rust, DoorDash builds AI swarms, and Dropbox masters context engineering. 🤯 Data Engineering Weekly #247 is packed with system design deep dives from the best engineering teams.

4 months ago 3 0 0 0
Advertisement

If the Data Catalog is the answer for AI, the question was wrong.

4 months ago 1 0 0 0
Preview
The Dark Data Tax: How Hoarding is Poisoning Your AI Storage is cheap. Attention is finite. Hallucinations are expensive. It’s time to stop building Data Lakes and start managing Data Metabolism

We stopped asking if data was useful because storage got cheap. Now, "Dark Data" is actively poisoning your AI context windows with hallucination vectors.

Read about the Data Sustainability index

5 months ago 5 0 0 0
Post image

The open source companies built their success on top of open-source platforms, benefited from community contributions and adoption, but now must abandon open-source principles to survive commercially.

5 months ago 1 0 0 0
Preview
Data Engineering Weekly #244 The Weekly Data Engineering Newsletter

🚀 The 244th edition of Data Engineering Weekly dives into:

AI agents as execution engines, LLM inference economics, databases for AI, personalization, and product evidence.

Read more 👉 www.dataengineeringw...

#DataEngineering #AI #LLMs

5 months ago 2 0 0 0
Post image

Cricket has been India’s greatest force in overcoming centuries of colonial suppression. Today’s Women’s World Cup win echoes the spirit of 1983 — a triumph that will inspire generations to come. 🇮🇳🏆

5 months ago 0 0 0 0
Preview
Thinking Like a Data Engineer A Journey Beyond Code — Toward Systems, Curiosity, and Confidence

This is the most personal essay that I have written in Data Engineering Weekly. I shared a few key moments in my life and how fortunate I was to meet mentors along my professional journey, which shaped my career.

5 months ago 9 0 0 1
Preview
Revisiting Medallion Architecture: Data Vault in Silver, Dimensional Modeling in Gold How to Balance Flexibility and Performance in a Modern Data Platform

🚀 Data Vault vs. Dimensional Modeling vs. Medallion Architecture — When viewed through a modern enterprise data lens, these techniques interlock.

I break down how in Part 2 of my “Revisiting the Medallion Architecture” series.

6 months ago 4 0 0 0
Advertisement

Fivetran and dbt form a strong foundation for modern data infrastructure, known for bringing simplicity to complex engineering workflows. That said, calling it “open” data infrastructure feels like a stretch.

6 months ago 5 0 3 0

Should we update the definition of an "Analytical Engineer"?

6 months ago 4 0 0 0
Preview
Engineering Growth: The Data Layers Powering Modern GTM Building privacy-preserving pipelines that unify zero-, first-, second-, third-, and fourth-party data into a coherent GTM ecosystem.

As a data engineer, you can't treat zero-party (consent) and third-party (inferred) data the same way. This distinction is critical for building systems that are scalable, private, and trustworthy.

Here’s my guide:

6 months ago 5 0 0 0

Could be. Composable CDP has not gained significant market share, as identity resolution is a key component that is often proprietary.

6 months ago 1 0 0 0