Faaaaascinating. Looking forward to your analysis of the results. Crossing my fingers for a Simpsons paradox
Posts by Kirsten Lum
Those checks are run in-environment (rather than passing raw data to a service). That part was hard
Yes! We developed a way to perform statistical and natural language checks to identify joinable columns in real-world (that is, fubared) data. If brute-forcing it would be ~$100ks and weeks of runtime, we do it in ~$10 - $100s in minutes or hours
Yes β it is that, and it is beyond that! Even in cases where your pk/fk are malformed, like mismatched types, mismatched column names, contain prefixes/suffixes, etc.
Cube gets it. We SHOULD be able to build a semantic layer in the data platform but we canβt (not one that actually helps the analytics workflow anyway). Thus these tools that fill the gap!
I think itβs easy for math folks because math operates logically β but what I think they miss is reality operates logically too!
Well grain of salt, I managed to get out of 100% of math classes in my undergrad. But even among math heavy degree havers like engineers, when theyβd ask me how I was able to understand/convince across disciplines/levels, Iβd tell them to read the textbook from my logic class!
Ah yes, going the other way! Uhhhh Iβm going to make a mental note that we could probably reverse this process π€π€
This exact impulse was what inspired this tool: bsky.app/profile/mach...
This dashboard could have been a Google sheet
Formal logic was the most useful course I took in college. Hard to explain that there is a style of thinking that helps you to quickly and precisely understand and explain whatβs going on in any situation.
A much more hopeful picture!
One of my professional βworry stonesβ is that the orgs who were able to set up data infrastructure tend to be bigger for-profits. If AI is revolutionary, that means orgs like education, non-profits, etc are left behind. It still takes way too long to set up a basic DW. Wish I knew the solution
*rolls up sleeves* on it!
Finally able to prove I havenβt been just complaining this whole time!! π
And not a dumb question ππ
What the downside to doing both of these in SQL?
Trying to make each and every transform efficient in the micro leads to inefficiencies in maintenance in the macro
That is, by default, do the transform in SQL, and only think about whether to do it in the database or the client if you run into a blocker doing it in SQL.
No need to litigate every transform β scarcity mentality in an era of compute riches
Trying to make each and every transform efficient in the micro leads to inefficiencies in maintenance in the macro
Does using the reports the service provides count? Like the GA dashboards inside GA? Or are you thinking more like grabbing some SQL/Tableau templates to run on the data in the DW?
Pivots π₯²π₯²
And isnβt it wild that on the receiving side of the message, appreciation is one of the greatest gifts?
Feels small to give, but immense to receive
Sorry I literally just saw your name got autocorrected @vickiboykis.com. I assume this is a similar experience as when people call me Kristin, and I am proportionately appalled/apologetic.
Yes! Like a daydream poking into reality
Hmm like versus an app? For the audience I had in mind (like <12), I would say itβs at least preferable!
Yess this is exactly the vibe that I had in mind. Something like this crossed with Teenage Engineering for the paperback sci fi vibe
and maps analog inputs/outputs.
Startup idea β a toy that looks like a generic hand-held computer with a few analog inputs/outputs (buttons, switches, lights) and a screen. Kid can use voice to describe an adventure they want to have (exploring a jungle, space rescue mission, vet on Mars). Toy generates a relevant UI onscreenβ¦