Advertisement · 728 × 90

Posts by Christopher Finlan

The Security Feature That’s Actually About Speed Here's the thing nobody's saying about Fabric Eventstream's new Custom CA and mTLS support: it isn't really a security feature. Or rather, it is, but the teams who'll benefit most aren't security teams. They're Spark engineers who've been running shadow pipelines for months because the "secure" path was also the "impossible" path. Let me explain. The Workaround Tax If you've been running Spark workloads against Kafka clusters in any regulated environment (banking, healthcare, telecom), you already know the drill. Your Kafka brokers sit behind certificates signed by an internal Certificate Authority.

The Security Feature That’s Actually About Speed

Here's the thing nobody's saying about Fabric Eventstream's new Custom CA and mTLS support: it isn't really a security feature. Or rather, it is, but the teams who'll benefit most aren't security teams. They're Spark engineers who've been running…

1 week ago 0 0 0 0
What the March 2026 Fabric Influencers Spotlight Tells Us About Where Spark Architecture Is Actually Heading The Fabric community has a habit of telling you the future before the product team ships it. You just have to know where to listen. Microsoft's March 2026 Fabric Influencers Spotlight dropped last week. Most people will scan it like a LinkedIn roundup — a nice acknowledgment for community contributors, maybe a bookmark, then back to whatever pipeline is on fire. That's a mistake. Buried in those blog posts and videos are concrete signals about where Fabric's data engineering surface is shifting, and if you're building Spark workloads today, several of those signals should change how you plan your next quarter.

What the March 2026 Fabric Influencers Spotlight Tells Us About Where Spark Architecture Is Actually Heading

The Fabric community has a habit of telling you the future before the product team ships it. You just have to know where to listen. Microsoft's March 2026 Fabric Influencers Spotlight…

1 week ago 0 0 0 0
Workspace Customer-Managed Keys for BYOK in Microsoft Fabric: Where Native Acceleration Pays Off and Where Fallbacks Bite Somewhere inside a regulated financial services firm, a platform engineer stared at two Fabric capacities on her screen. One ran Power BI semantic models encrypted with BYOK keys she controlled. The other held Spark notebooks, Lakehouses, and pipelines in CMK-enabled workspaces, also encrypted with keys she controlled, on entirely separate infrastructure. Same compliance mandate. Same Azure Key Vault. Two capacity bills. Until this month's preview, you could not enable workspace-level CMK on a capacity that already had BYOK turned on. The two encryption modes were oil and water. If compliance demanded customer-owned keys for everything, you paid for the privilege of running parallel universes.

Workspace Customer-Managed Keys for BYOK in Microsoft Fabric: Where Native Acceleration Pays Off and Where Fallbacks Bite

Somewhere inside a regulated financial services firm, a platform engineer stared at two Fabric capacities on her screen. One ran Power BI semantic models encrypted with BYOK…

2 weeks ago 0 0 0 0
Your Fabric Spark Jobs Have Been Failing in Silence. That Just Changed. You schedule a notebook. You schedule a pipeline. You walk away. That's the deal. Set it and forget it. Except "forget it" has a dark side nobody warns you about. When a scheduled Spark job dies at 2 AM, it dies quiet. No call. No text. No alarm. The data just stops moving. Downstream reports go stale. Dashboards freeze mid-number. And you find out Monday morning when your VP asks why the revenue figure hasn't budged since Friday. That silence just got a fix. Scheduled job failure notifications hit General Availability in Microsoft Fabric.

Your Fabric Spark Jobs Have Been Failing in Silence. That Just Changed.

You schedule a notebook. You schedule a pipeline. You walk away. That's the deal. Set it and forget it. Except "forget it" has a dark side nobody warns you about. When a scheduled Spark job dies at 2 AM, it dies quiet. No…

2 weeks ago 0 0 0 0
What “Upgrade your Synapse pipelines to Microsoft Fabric with confidence (Preview)” actually means for Fabric Spark teams in production Preview posts are written to soothe. Production teams read them like incident reviewers. They want to know what moves, what stays off, and what still needs proof before anyone re-enables a trigger. This new migration experience is useful because it has brakes. It lets you assess Synapse pipelines, see compatibility gaps, migrate supported pipelines into a Fabric workspace, map Synapse linked services to Fabric connections, and keep execution under control while you validate the result. That is not a one-click estate conversion. Good. One-click migration promises are how people end up explaining themselves on a call at 6 a.m.

What “Upgrade your Synapse pipelines to Microsoft Fabric with confidence (Preview)” actually means for Fabric Spark teams in production

Preview posts are written to soothe. Production teams read them like incident reviewers. They want to know what moves, what stays off, and what still needs proof…

2 weeks ago 0 0 0 0
Fabric notebook resources in Git give Spark teams a real release boundary Most Spark notebook trouble is not caused by the notebook. It is caused by the little files loitering around it. A notebook runs fine in dev. In test, it quietly depends on a config file somebody copied by hand. In prod, a helper script lives in a workspace folder nobody remembers creating. Then the team spends an afternoon acting baffled. This happens so often it barely even feels embarrassing anymore. That is why Resources folder support in Git for Fabric notebooks matters. The official notebook source control and deployment docs now describe notebook resources living with the notebook in source control.

Fabric notebook resources in Git give Spark teams a real release boundary

Most Spark notebook trouble is not caused by the notebook. It is caused by the little files loitering around it. A notebook runs fine in dev. In test, it quietly depends on a config file somebody copied by hand. In prod, a…

3 weeks ago 2 1 0 0
What the February 2026 gateway release really means for Fabric Spark teams Monthly gateway release posts are usually the corporate equivalent of dry toast. A version number appears. Power BI Desktop compatibility gets a polite bow. Then everyone goes back to moving data and arguing with refresh logs. The February 2026 on-premises data gateway release is mostly that kind of update. Microsoft says the build is 3000.306, and the point is simple: keep the gateway aligned with the February 2026 Power BI Desktop release so reports refreshed through the gateway use the same query execution logic and runtime as Desktop. Useful? Yes. Dramatic?

What the February 2026 gateway release really means for Fabric Spark teams

Monthly gateway release posts are usually the corporate equivalent of dry toast. A version number appears. Power BI Desktop compatibility gets a polite bow. Then everyone goes back to moving data and arguing with refresh…

3 weeks ago 0 0 0 0
Bulk Import and Export Item Definitions Are the Fabric APIs Ops Teams Needed Most Fabric deployment pain is not dramatic. It is slow, dumb, and expensive in the worst way. Somebody asks you to move a workspace full of notebooks, pipelines, reports, and models. Then the afternoon disappears into portal clicking, second-guessing, and the private terror that you forgot one dependency that will blow up later. That is why the new bulk item-definition APIs matter. Not because they are flashy. They are not. Not because they are finished. The official docs call both APIs beta and say they are for evaluation and development purposes, not recommended for production use.

Bulk Import and Export Item Definitions Are the Fabric APIs Ops Teams Needed

Most Fabric deployment pain is not dramatic. It is slow, dumb, and expensive in the worst way. Somebody asks you to move a workspace full of notebooks, pipelines, reports, and models. Then the afternoon disappears into…

3 weeks ago 0 0 0 0
The API layer that wasn’t supposed to matter The strangest platform announcements are usually the boring ones. Nobody throws a party for source control. Nobody leans back and says, "Hell yes, deployment pipelines," with a straight face. The applause goes to the flashy stuff: faster engines, new runtimes, clever demos. Then a quiet release slips past and changes the quality of production systems more than all the fireworks did. That is what just happened with the general availability of source control and CI/CD support for the API for GraphQL in Microsoft Fabric. On the surface, this looks minor.

The API layer that wasn’t supposed to matter

The strangest platform announcements are usually the boring ones. Nobody throws a party for source control. Nobody leans back and says, "Hell yes, deployment pipelines," with a straight face. The applause goes to the flashy stuff: faster engines, new…

3 weeks ago 0 0 0 0
Advertisement
The Moment Your Spark Pipeline Stops Being a Pet There's a team, a data engineering squad at a mid-size financial services firm, who spent eighteen months building their Lakehouse architecture on Microsoft Fabric. They had notebooks. They had Spark job definitions. They had a deployment process that consisted of one senior engineer named David who clicked buttons in the Fabric portal at 6 AM on Tuesdays. David was their CI/CD. David was their change management. David was their single point of failure. Last week, Microsoft shipped three preview capabilities alongside the general availability of the Fabric Extensibility Toolkit that would, in a kinder world, have arrived before David started updating his LinkedIn.

The Moment Your Spark Pipeline Stops Being a Pet

There's a team, a data engineering squad at a mid-size financial services firm, who spent eighteen months building their Lakehouse architecture on Microsoft Fabric. They had notebooks. They had Spark job definitions. They had a deployment process…

4 weeks ago 0 0 1 0
DeltaFlow just changed the CDC conversation for Fabric Spark teams DeltaFlow preview in Fabric Eventstreams can turn raw Debezium CDC into analytics-ready streams for four supported database sources. For Spark teams, the opportunity is obvious and the caveat is too: test it now, but do not cut over blindly.

DeltaFlow just changed the CDC conversation for Fabric Spark teams

DeltaFlow preview in Fabric Eventstreams can turn raw Debezium CDC into analytics-ready streams for four supported database sources. For Spark teams, the opportunity is obvious and the caveat is too: test it now, but do not cut…

4 weeks ago 0 0 0 0
Operationalizing Fabric’s February 2026 feature drop: what actually matters for Spark teams Operationalizing Fabric's February 2026 feature drop: what actually matters for Spark teams Microsoft's monthly feature summaries have a familiar problem. They flatten every change into the same cheerful pitch. A new cell editor mode gets about the same oxygen as a moving security boundary. If you run Spark seriously on Fabric, that is useless. You need to know which items change architecture, which clean up the daily notebook grind, and which quietly add a new failure mode. February's release has all three. The headline is not "more features." The headline is that Fabric keeps removing excuses for portal-driven, manually operated Spark environments.

Operationalizing Fabric’s February 2026 feature drop: what actually matters for Spark teams

Operationalizing Fabric's February 2026 feature drop: what actually matters for Spark teams Microsoft's monthly feature summaries have a familiar problem. They flatten every change into the same cheerful…

1 month ago 0 0 0 0
Operationalizing the semantic model permissions update for Fabric data agents Operationalizing the semantic model permissions update for Fabric data agents Permissions in data platforms have a remarkable talent for turning a two-minute job into a small municipal drama. You want one ordinary thing. The system hands you a form, a role, a workspace, another role, and, sooner or later, a person named Steve who is out until Thursday. Starting April 6, 2026, Microsoft Fabric removes one of those little absurdities. Creators and consumers of Fabric data agents need only Read access on the semantic model to use it through a data agent.

Operationalizing the semantic model permissions update for Fabric data agents

Operationalizing the semantic model permissions update for Fabric data agents Permissions in data platforms have a remarkable talent for turning a two-minute job into a small municipal drama. You want one ordinary thing.…

1 month ago 0 0 0 0
ExtractLabel just changed how your Spark pipelines should handle unstructured data ExtractLabel just changed how your Spark pipelines should handle unstructured data Every data engineer eventually inherits the same cursed pipeline. Upstream sends you a blob of human text. Somewhere in that blob are the exact facts your downstream systems need: product name, issue category, requested resolution, timeline, who did what, and when. The facts are there. They are just buried in prose written by sleep-deprived humans, copied from emails, and occasionally typed from a phone in an airport parking lot. For years, we handled this with a pile of hacks:

ExtractLabel just changed how your Spark pipelines should handle unstructured data

ExtractLabel just changed how your Spark pipelines should handle unstructured data Every data engineer eventually inherits the same cursed pipeline. Upstream sends you a blob of human text. Somewhere in that blob…

1 month ago 0 0 0 0
What “Recent data” in Fabric means for Spark teams when time is the real bottleneck Microsoft Fabric's February 2026 update shipped a small feature called Recent data in Dataflow Gen2. For Spark data engineering teams, it removes repeated navigation friction at the ingestion layer -- if you pair it with the right discipline.

What “Recent data” in Fabric means for Spark teams when time is the real bottleneck

Microsoft Fabric's February 2026 update shipped a small feature called Recent data in Dataflow Gen2. For Spark data engineering teams, it removes repeated navigation friction at the ingestion layer -- if you pair…

1 month ago 1 0 0 0
From CDC to Lakehouse: Making Fabric Eventstreams SQL Survive Contact with Production Spark From CDC to lakehouse: making Fabric Eventstreams SQL survive contact with production Spark Every data team eventually has the same bright idea: "Let's do CDC so everything is real time." What follows is usually less bright. Somebody wires up connectors, somebody else stands up Kafka, somebody definitely provisions a VM that nobody can later identify, and before long your "modern architecture" has one person who understands it, one person who is afraid of it, and one person who is on call for it. Usually the same person. So yes, Fabric Eventstreams supporting native CDC connectors for Azure SQL, PostgreSQL, MySQL, and SQL Server sources matters.

From CDC to Lakehouse: Making Fabric Eventstreams SQL Survive Contact with Production Spark

From CDC to lakehouse: making Fabric Eventstreams SQL survive contact with production Spark Every data team eventually has the same bright idea: "Let's do CDC so everything is real time." What follows is…

1 month ago 0 0 0 0
Fabric Spark’s Native Execution Engine: What Speeds Up, What Falls Back, and What to Watch You have been running Spark on the JVM for years. It works. Your pipelines finish before the SLA alarm fires, your data scientists get their DataFrames, and you have learned to live with the garbage collector the way one learns to coexist with a roommate who occasionally rearranges all the furniture at 3 AM. Then Microsoft shipped the Native Execution Engine for Fabric Spark, and the pitch is seductive: swap the JVM's row-at-a-time processing for a vectorized C++ execution layer built on Meta's Velox and Apache Gluten, get up to 6x faster query performance on compute-heavy workloads, change zero lines of code, pay nothing extra.

Fabric Spark's Native Execution Engine: What Speeds Up, What Falls Back, and What to Watch

1 month ago 0 0 0 0
Open Mirroring + OneLake: Spark patterns that keep latency from eating your weekends Open Mirroring is deceptively simple to set up and tricky to run well with Spark at scale. Here are the architecture choices, anti-patterns, and validation checks that keep your pipelines from falling apart in production.

Open Mirroring + OneLake: Spark patterns that keep latency from eating your weekends

Open Mirroring is deceptively simple to set up and tricky to run well with Spark at scale. Here are the architecture choices, anti-patterns, and validation checks that keep your pipelines from falling apart in…

1 month ago 0 0 0 0
Advertisement
What “Execute Power Query Programmatically” Means for Fabric Spark Teams What “Execute Power Query Programmatically” Means for Fabric Spark Teams Somewhere in a Fabric workspace right now, two teams are maintaining the same transformation twice. The BI team owns it in Power Query. The Spark team rewrote it in PySpark so a notebook could run it on demand. Both versions work. Both versions drift. Both versions break at different times. That was normal. Microsoft’s new Execute Query API (preview) is the first real shot at ending that duplication. It lets you execute Power Query (M) through a public REST API from notebooks, pipelines, or any HTTP client, then stream results back in Apache Arrow format.

What “Execute Power Query Programmatically” Means for Fabric Spark Teams

What “Execute Power Query Programmatically” Means for Fabric Spark Teams Somewhere in a Fabric workspace right now, two teams are maintaining the same transformation twice. The BI team owns it in Power Query. The Spark team…

1 month ago 1 0 0 0
What the February 2026 Fabric Influencers Spotlight means for your Spark team What the February 2026 Fabric Influencers Spotlight means for your Spark team Microsoft published its February 2026 Fabric Influencers Spotlight last week. Twelve community posts. MVPs and Super Users. Most people skim the list. Maybe bookmark a link. Move on. Don't. Three of those posts carry signals that should change how your Spark data-engineering team operates in production. Not next quarter. Now. Signal 1: Get your production code out of notebooks Matthias Falland's Fabric Friday episode makes the case plainly: notebooks are great for development but risky in production. That framing resonates with a lot of production teams—and for good reason.

What the February 2026 Fabric Influencers Spotlight means for your Spark team

What the February 2026 Fabric Influencers Spotlight means for your Spark team Microsoft published its February 2026 Fabric Influencers Spotlight last week. Twelve community posts. MVPs and Super Users. Most people skim…

1 month ago 0 0 0 0
Fabric Spark failure playbook: OneLake and mirroring under real production pressure A field-tested runbook for the failures that hide between Spark, OneLake, and mirroring in Microsoft Fabric: detection signals, triage sequences, and remediation tradeoffs from real production incidents.

Fabric Spark failure playbook: OneLake and mirroring under real production pressure

A field-tested runbook for the failures that hide between Spark, OneLake, and mirroring in Microsoft Fabric: detection signals, triage sequences, and remediation tradeoffs from real production incidents.

1 month ago 0 0 0 0
Preview
The most boring technology announcement might be the most important one for your Fabric Spark team Microsoft's new ODBC Driver for Fabric Data Engineering looks like a checkbox feature. It isn't. Here's what it means for production Spark teams, the migration risks nobody's talking about, and a concrete rollout checklist.

The most boring technology announcement might be the most important one for your Fabric Spark team

Microsoft's new ODBC Driver for Fabric Data Engineering looks like a checkbox feature. It isn't. Here's what it means for production Spark teams, the migration risks nobody's talking about, and a…

1 month ago 0 0 1 0
Preview
fabric-cicd Is Now Officially Supported — Here’s Your Production Deployment Checklist Three days ago, Microsoft promoted fabric-cicd from community project to officially supported tool. That Python library your team has been running in a “don’t look too closely at our deployment process” sort of way now carries Microsoft’s name and their support commitment. That shift matters. Your compliance team can stop asking “is this thing even supported?” You can open Microsoft support tickets when it breaks. The roadmap is no longer a volunteer effort, so features will land faster and bugs will get fixed on a schedule. But here’s where most teams trip.

fabric-cicd Is Now Officially Supported — Here's Your Production Deployment Checklist

1 month ago 0 0 0 0
Preview
The Spark-to-Warehouse Connector in Fabric: What It Does, How It Breaks, and When to Use It The Spark connector for Fabric Data Warehouse lets your notebooks read from and write to Warehouse tables with one line of code. Here's what it does, how it breaks, and when to use Warehouse vs Lakehouse as your serving layer.

The Spark-to-Warehouse Connector in Fabric: What It Does, How It Breaks, and When to Use It

2 months ago 0 0 0 0
Preview
Fabric Spark billing just got clearer. Here’s how to make the most of it. Microsoft split AI function consumption out of Spark billing into its own meter. Your total cost stays the same, but your alerting, thresholds, and capacity plans probably need updating. Here's a concrete checklist.

Fabric Spark billing just got clearer. Here's how to make the most of it.

2 months ago 0 0 0 0
Preview
From Demo to Production: ML-Enriched Power BI in Microsoft Fabric Microsoft's new end-to-end pattern for enriching Power BI reports with ML in Fabric looks clean in the demo. Here's the production migration checklist for Spark teams crossing the gap from notebook to ops.

From Demo to Production: ML-Enriched Power BI in Microsoft Fabric

Microsoft's new end-to-end pattern for enriching Power BI reports with ML in Fabric looks clean in the demo. Here's the production migration checklist for Spark teams crossing the gap from notebook to ops.

2 months ago 0 0 0 0
Preview
Microsoft Fabric Warehouse + Spark: Interoperability Patterns That Actually Work If you’ve spent any time in a Fabric workspace with both Data Engineering (Spark) and Data Warehouse, you’ve probably had this moment: Spark is great for big transformations, complex parsing, and “just let me code it.” The Warehouse is great for a curated SQL model, concurrency, and giving the BI world a stable contract. And yet… teams still end up copying data around like they’re paid by the duplicate. The good news: Fabric’s architectural bet is that OneLake + Delta is the contract surface across engines. That means you can design a pipeline where Spark and Warehouse cooperate instead of competing.

Microsoft Fabric Warehouse + Spark: Interoperability Patterns That Actually Work

2 months ago 0 0 0 0
Preview
What SQL database in Fabric actually means for your Spark pipelines There is a particular kind of excitement that sweeps through data engineering teams when Microsoft announces a new database option. It is the same mixture of curiosity and low-grade dread you might feel upon learning that your neighborhood is getting a new highway interchange. Useful, probably. Disruptive, definitely. Someone is going to have to figure out the on-ramps. SQL database in Fabric went generally available in November 2025. Built on the same SQL Database Engine that powers Azure SQL Database, it is the first fully SaaS-native operational database living inside Microsoft Fabric.

What SQL database in Fabric actually means for your Spark pipelines

There is a particular kind of excitement that sweeps through data engineering teams when Microsoft announces a new database option. It is the same mixture of curiosity and low-grade dread you might feel upon learning that your…

2 months ago 0 0 0 0
Advertisement
Preview
Microsoft Fabric Table Maintenance Optimization: A Cross-Workload Survival Guide Your Delta tables are drowning. Thousands of tiny Parquet files pile up after every streaming microbatch. Power BI dashboards stall on cold-cache queries. SQL analytics endpoints grind through fragmented row groups. And somewhere in the middle of the medallion architecture, a Spark job is rewriting perfectly good files because nobody told it they were already compacted. This is the small-file problem at scale — and in Microsoft Fabric, where a single Delta table can serve Spark, SQL analytics endpoint, Power BI Direct Lake, and Warehouse simultaneously, it becomes a cross-workload survival situation.

Microsoft Fabric Table Maintenance Optimization: A Cross-Workload Survival Guide

Your Delta tables are drowning. Thousands of tiny Parquet files pile up after every streaming microbatch. Power BI dashboards stall on cold-cache queries. SQL analytics endpoints grind through fragmented row groups.…

2 months ago 0 0 0 0
Preview
Optimizing Spark Performance with the Native Execution Engine (NEE) in Microsoft Fabric Spark tuning often starts with the usual suspects (shuffle volume, skew, join strategy, caching)… but sometimes the biggest win is simply executing the same logical plan on a faster engine. Microsoft Fabric’s Native Execution Engine (NEE) does exactly that: it keeps Spark’s APIs and control plane, but runs a large portion of Spark SQL / DataFrame execution on a vectorized C++ engine. What NEE is (and why it’s fast) NEE is a vectorized native engine that integrates into Fabric Spark and can accelerate many SQL/DataFrame operators without you rewriting your code.

Optimizing Spark Performance with the Native Execution Engine (NEE) in Microsoft Fabric

2 months ago 1 0 0 0