amybell.info/puzzles/puzz...
my partner made her own version so she could share puzzles with some friends
Posts by Jason
Chelsea are you minute-cryptic pilled
In somebody else's words:
Tidier is a great idea, but tbh last time i tried it (2yrs ago?) the small diffs between it and og tidyverse drove me crazy. If that's still the case, it might be easier to just write R with RCall: avt.im/blog/archive.... IIRC the only downside here was the lack of support for the vscode plot pane.
part of this is that Pkg is so, so much better than pip etc, and another part of this is that there’s no AD / c++ backend conflict weirdness b/c Julia packages are mostly just pure Julia.
a very, very short Bayesian HMM in Julia
turinglang.org/docs/tutoria...
This example I added to the docs I think really shows off how well Julia packages tend to work together! If you think about the total call stack here it’s kinda crazy that I’ve never seen package conflict issues, but it’s the truth!
I also haven’t touched Julia in a minute since I started working an R job, but in the past I’ve had a great time with HiddenMarkovModels.jl + Turing!
github.com/gdalle/Hidde...
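Not the Turing.jl example from the link, but for anyone who hasn't met HMMs before: the likelihood computation that sits underneath any Bayesian HMM is just the forward algorithm. A toy plain-Python sketch (made-up parameters, two states, discrete emissions):

```python
import math

def hmm_loglik(obs, init, trans, emit):
    """Log-likelihood of an observation sequence under an HMM.

    init[s]     -- P(state_0 = s)
    trans[s][t] -- P(state_{k+1} = t | state_k = s)
    emit[s][o]  -- P(obs = o | state = s)
    """
    n_states = len(init)
    # alpha[s] is proportional to P(obs_0..obs_k, state_k = s)
    alpha = [init[s] * emit[s][obs[0]] for s in range(n_states)]
    loglik = 0.0
    for o in obs[1:]:
        # rescale each step to avoid underflow; the scales telescope,
        # so their logs sum to the total log-likelihood at the end
        scale = sum(alpha)
        loglik += math.log(scale)
        alpha = [a / scale for a in alpha]
        alpha = [
            sum(alpha[s] * trans[s][t] for s in range(n_states)) * emit[t][o]
            for t in range(n_states)
        ]
    return loglik + math.log(sum(alpha))
```

The Bayesian part in Turing is then just putting priors on `init` / `trans` / `emit` and letting the sampler run against this likelihood.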
I have never had a single issue installing a Julia library, and I’ve installed a lot of really random low star count stuff. What issues are you referencing?
Python is a different story tho— plenty of issues even with giant, popular packages.
wonder if the programming languages we have now are basically all we're ever going to get. it feels like the more people get used to coding via prompt, the higher the cost of switching to a language that doesn't exist in the training set.
this is also like 60% of my LinkedIn lol
actual interviews have been mercifully light on this stuff but the total quantity of related slop online is not great
Partner is interviewing rn and I really feel this. The standard data science interview prep deck is full of the same trivia questions about, like, bagging v boosting or whatever.
really sad and unproductive way to interact with work!
1. I try an llm and am impressed by the improvements since I last tried an llm
2. I try harder tasks over the next few weeks until the LLMs start to produce garbage
3. “Damn, this sucks”
[wait four months]
1. I try an llm and am impressed by the improvements since I last tried an llm
feel like youd like the diaconis book "ten great ideas about chance"
I have one separating my half of the office from my partner's — underrated feature is that it slowly accumulates doodles and nice messages / jokes from guests. At this point less than half the board is actually usable, but it makes for good decor!
x <- val -> y  # valid R: assigns val to both x and y
A more definitive answer on A/B testing run times if you're Bayesian
dpananos.github.io/posts/2026-0...
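For anyone who hasn't seen the Bayesian version: the simplest setup is Beta-Binomial conversion rates and a Monte Carlo estimate of P(B beats A). A minimal sketch with made-up numbers (not the model or the run-time argument from the linked post):

```python
import random

def prob_b_beats_a(succ_a, n_a, succ_b, n_b, draws=100_000, seed=0):
    """Monte Carlo estimate of P(rate_B > rate_A) under uniform priors.

    Posterior for each arm is Beta(1 + successes, 1 + failures).
    """
    rng = random.Random(seed)
    wins = 0
    for _ in range(draws):
        pa = rng.betavariate(1 + succ_a, 1 + n_a - succ_a)
        pb = rng.betavariate(1 + succ_b, 1 + n_b - succ_b)
        wins += pb > pa
    return wins / draws
```

e.g. `prob_b_beats_a(100, 1000, 130, 1000)` gives the posterior probability that B's true rate is higher, which is the quantity the run-time discussion is about.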
lmao
in the reproducing kernel hilbert spaces approach, as the name implies, we... uh...
you can play around with the tokenizer here if you want to see this in action
platform.openai.com/tokenizer
that something is (mostly, IIRC) tokenization! Instead of seeing the string "a strawberry", the llm is trained on token ids [64, 101830].
"strawberry" alone tokenizes to [302, 1618, 19772]. the chatbot only learns statistical associations between tokens – it never sees the actual strings.
looks pretty cool 👀
lol
not sure what people do to get around this? just follow a bunch of people on twitter? live in sf and go to in person meetups? any thoughts @cameron.stream (basically the only AI person I follow on here lol)
so many terrible medium blogs, seo optimized articles from companies selling their services, etc.
meanwhile searching up topics in bayes stats will still surface some pretty great personal blogs in the first few results. also: no branded search!
one thing i appreciate about gp / bayesian stats stuff being fairly niche is the lack of seo spam. been looking at some llm stuff lately and it's interesting that if i want to find the state of the art on, like, curating pretraining datasets or something this will be essentially impossible on google–
the dumbest possible cico model has pretty killer holdout accuracy (n = 1)
possiblywrong.wordpress.com/2015/01/01/c...
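My guess at what "dumbest possible" means here: cumulative calorie balance divided by the old ~3500 kcal per pound rule of thumb. Constants and numbers below are made up, not taken from the linked post:

```python
# Naive CICO model: weight change = cumulative energy balance / 3500 kcal per lb.
# The 3500 figure is the classic rule of thumb; data below is illustrative only.
KCAL_PER_LB = 3500.0

def predict_weights(start_weight_lb, intakes_kcal, tdee_kcal):
    """Predicted daily weights assuming a flat daily energy expenditure."""
    weights = [start_weight_lb]
    for intake in intakes_kcal:
        weights.append(weights[-1] + (intake - tdee_kcal) / KCAL_PER_LB)
    return weights
```

e.g. a steady 500 kcal/day deficit predicts about a pound a week, which is the kind of trajectory you'd compare against the scale for the holdout check.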
It even shows up in probability textbooks sometimes (Whittle)
There’s a refrain you see online re: DSA coding screens that, yeah, some of this stuff is useful– but not dynamic programming! that’s just there to weed ppl out, DP never shows up in prod code etc.
I’ve always found this odd b/c DP flavoured stuff is, like, the *only* DSA topic I’ve seen at work?
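For a concrete example of the kind of DP that does turn up in prod-adjacent code: edit distance, which is all over fuzzy matching and dedup pipelines. Generic textbook version, not from any particular codebase:

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance via the classic DP, rolled one row at a time."""
    # prev[j] holds the distance between the processed prefix of a and b[:j]
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,               # delete ca from a
                curr[j - 1] + 1,           # insert cb into a
                prev[j - 1] + (ca != cb),  # substitute (free if chars match)
            ))
        prev = curr
    return prev[-1]
```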
geolift and tidysynth both coming out of meta / industry instead of academia might be relevant? not sure tho 🤔