Elad Leev (@leevs.dev) Bsky

Deep Dive into Apache Iceberg with Flink CDC - tech.kakao.com 이 글은 를 영어로 작성한 문서입니다. 다른 번역본 보기: 🇰🇷 한국어: h...

Some excellent deep-dive blogs from Kakao on their use of Flink CDC and Iceberg:

tech.kakao.com/posts...
tech.kakao.com/posts...
tech.kakao.com/posts...

10 months ago 11 4 0 0

I'm excited to be speaking at SREday London 2025!

Join me on March 27-28 to hear how we built a foundational, cloud-native, Streaming Platform using the operator pattern (and some magic ✨)

Register here:
sreday.com/2025-london-...
Don't forget to use the code 𝗟𝗗𝗡𝟭𝟬 for 10% off!

1 year ago 3 0 0 0

[FLINK-34440][formats][protobuf-confluent] Protobuf confluent dynamic format by dmariassy · Pull Request #25114 · apache/flink What is the purpose of the change Add support for deserializing protobuf messages using the Confluent wire format and whose schemas can be fetched from Confluent Schema Registry Add support for se...

Can’t believe this will be left out:
github.com/apache/flink...
:(

1 year ago 0 0 0 0

Curious to know what connectors are mostly in use, and how many of them are in-house/forks?

1 year ago 0 0 1 0

Robert Metzger - Tiny Flink — Minimizing the memory footprint of Apache Flink YouTube video by Plain Schwarz

Other frameworks: Apache Flink is too heavy and it requires a cluster

Apache Flink: let's process 100MB/s of data in Kafka with only 100 MB of memory heap and only one single JVM instance 😎

Great (and useful) talk! www.youtube.com/watch?v=04pw...

1 year ago 6 2 0 0

Effective Communication in Slack for Engineering teams Transform Slack from a distraction into a productivity powerhouse for engineering teams with proven strategies for channel organization, automation, and data-driven insights

Slightly different from my usual posts, but I wrote a thing!
This time, on how to transform Slack from a distraction into a productivity powerhouse for engineering teams with proven strategies for channel organisation, automation, and data-driven insights -
leevs.dev/effective-co...

1 year ago 2 0 0 0

A 5-10% performance improvement with every new Java stable release is a dopamine hit... for CFOs. If you work with the JVM ecosystem, you should upgrade often, it's easy money

1 year ago 12 1 1 0

What’s your take on this implementation? I have love-hate relationship with it. Totally get why it was built that way (consumer offsets, Schema Registry topic, kStreams state, etc etc), but does it stand the test of time?

1 year ago 0 0 2 0

Introducing the new Prometheus connector We are excited to announce a new sink connector that enables writing data to Prometheus (FLIP-312). This articles introduces the main features of the connector, and the reasoning behind design decisio...

flink.apache.org/2024/12/05/i...

1 year ago 2 0 0 0

The History and Evolution of Open Table Formats From Hive to High Performance: A Journey Through the Evolution of Data Management on Data Lakes

Fantastic post on the history and evolution of OTFs

alirezasadeghi1.medium.com/the-history-...

1 year ago 2 0 0 0

Jepsen: Bufstream 0.1.0

jepsen.io/analyses/buf...
A new Jepsen report, and as always, super interesting. 🔥

1 year ago 0 0 0 0

This is really useful: Awesome Data Catalogs and Observability Platforms: github.com/opendatadisc...

(h/t @adipolak.bsky.social for sharing this with me)

#dataBS

1 year ago 11 2 0 0

[FLINK-34440][formats][protobuf-confluent] Protobuf confluent dynamic format by dmariassy · Pull Request #25114 · apache/flink What is the purpose of the change Add support for deserializing protobuf messages using the Confluent wire format and whose schemas can be fetched from Confluent Schema Registry Add support for se...

Unfortunately, not really. We have a very (very) raw version of it open sourced, but it’s missing 7 months of advancement.
I do know there is an implementation which about to get merged:

github.com/apache/flink...

Worth giving it a try!

1 year ago 2 0 0 0

Also:

go.bsky.app/U1hZhum

1 year ago 2 2 0 0

BigTech is moving to object storage too, but not cuz it's cheaper. The idea is you no longer need every system to be distributed and durable. Everything can be stateless and simple, cuz your storage is distributed and durable.

1 year ago 24 4 3 1

definitely the coolest feature so far on 🦋

1 year ago 2 0 1 0

For those who are interested in keeping track of Apache Flink's disaggregated state storage work, this is the umbrella FLIP:

cwiki.apache.org/con...

1 year ago 4 3 0 0

Having said that, everything is a trade off, and although it put more load (in terms of eng time) on the Data Platform teams, our clients are extremely happy with it. Proto tooling (code gen, grpc, protovalidte) are just superior to Avro

1 year ago 2 0 0 0

We’ve made the decision to go full Proto this time (always used Avro as main format in past companies). The amount of forks, bugs, and code changes we had to make to support it is insane. Most notably, we had to build our own Flink format to pull from registry, as Flink currently doesn’t support it

1 year ago 5 1 3 1

I made an infra engineer starter pack. Folks posting about databases, stream processing, durable execution, orchestrators, service meshes, and more.

go.bsky.app/SCZe42X

1 year ago 288 74 44 15

Posts by Elad Leev