
Posts by Calvin McCarter

Post image

yeah so, assuming that where civs meet, they just stop expanding and respect their neighbors, you get the following distribution over civ size. because the x axis is log-scaled, this implies a fairly high level of inequality in the distribution of civ size.

1 week ago 3 1 1 0

i think i was subconsciously combining "my own personal vietnam" with bsky.app/profile/norv...

1 week ago 2 0 0 0

this is my sigmoid

1 week ago 2 0 1 0
Inverse distance weighting attention We report the effects of replacing the scaled dot-product (within softmax) attention with the negative-log of Euclidean distance. This form of attention simplifies to inverse distance weighting interp...

I've previously found interesting results from using -log(Euclidean distance) attention. On shallow networks, it seems to train more easily and learn interpretable keys. arxiv.org/abs/2310.18805
github.com/calvinmccart...

2 weeks ago 0 0 0 0
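
A minimal sketch of why -log(Euclidean distance) logits inside a softmax reduce to inverse distance weighting: exp(-log d) = 1/d, so the softmax weights are just normalized inverse distances. The function names, the `eps` guard, and the single-query, single-head setup are assumptions made for illustration; this is not the paper's or the repository's implementation.

```python
import math

def idw_attention(query, keys, values, eps=1e-9):
    """Attention with logits -log(Euclidean distance) (hypothetical helper)."""
    # Distance from the query to each key; eps (an assumption) avoids log(0).
    dists = [math.dist(query, k) + eps for k in keys]
    logits = [-math.log(d) for d in dists]
    # Standard softmax, shifted by the max logit for numerical stability.
    m = max(logits)
    exps = [math.exp(l - m) for l in logits]
    total = sum(exps)
    weights = [e / total for e in exps]
    # Output is the attention-weighted average of the value vectors.
    dim = len(values[0])
    out = [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]
    return weights, out

def idw_weights(query, keys, eps=1e-9):
    """Plain inverse distance weighting: w_j = (1/d_j) / sum_k (1/d_k)."""
    inv = [1.0 / (math.dist(query, k) + eps) for k in keys]
    total = sum(inv)
    return [x / total for x in inv]

if __name__ == "__main__":
    q = [0.0, 0.0]
    ks = [[1.0, 0.0], [0.0, 2.0], [3.0, 4.0]]  # distances 1, 2, 5
    vs = [[1.0], [2.0], [3.0]]
    w, out = idw_attention(q, ks, vs)
    print(w, idw_weights(q, ks), out)
```

The two weight vectors agree up to floating-point error, so the nearest key dominates and the weights fall off as 1/d rather than exponentially in the dot product.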

exactly. (having said all this, wearing my substack special purpose vehicle shareholder hat, i am rather pleased...)

3 weeks ago 3 0 0 0
Post image

so I'm purely a browser user, but yes it does (screenshot). the problem is that the feed surfaces "substack notes" and replies thereto, but not "substack articles" and replies thereto.

3 weeks ago 4 0 2 0

it's also hard for me to get the full list of people i follow -- like what if i decide i want to subscribe instead?

3 weeks ago 3 0 1 0

another bad thing is that if i follow someone, i don't find out about their comments on various articles. an even worse thing is that if someone follows me, they don't find out about my comments on various articles.

3 weeks ago 4 0 1 0
Post image Post image

they've reached parity on the dual problem (unfortunately with degenerate solutions)

3 weeks ago 3 1 1 0
Post image

Of all sad words of LLM,
The saddest are these: "Failed to fetch arxiv.org" again

3 weeks ago 0 0 0 0
Post image

more seriously, this one:

3 weeks ago 1 0 0 0
Post image

does this count?

3 weeks ago 2 0 1 0

yes, allegedly the Munich Conference was good because it bought time for GB to prepare for war, while clarifying that the next takeover in Eastern Europe would bring GB to intervene

1 month ago 1 0 1 0

interesting article: big for Neville Chamberlain if true

1 month ago 1 1 1 0

speaking of which, it would be cool for TMLR to be a place which tries out various improved reviewing ideas!

1 month ago 0 0 0 0

a similar tweak would be to require submissions be posted on arxiv (to be verified in some automated fashion), while still keeping the venue itself double-blinded. authors shouldn't be able to force reviewers to read something, if they don't want anyone else to see that they wrote it.

1 month ago 1 0 1 0

"SDG"

1 month ago 1 1 1 0

tangential, but "set the mood for people to have communal religious experiences at church" would almost certainly make Bach flip out. if we're going to insist that his motivations matter, then we should at least be honest about what those were

1 month ago 2 0 2 0

well, but the lesson from that is that you need to time the market correctly in order to make money; and normal people shouldn't try to time the market.

1 month ago 2 0 1 0
Post image

a metaphor for x (formerly twitter) @norvid-studies.bsky.social

1 month ago 5 2 1 1
Post image

both of these have wikipedia articles featuring what TR thought about it -- coincidence?

1 month ago 2 0 1 0

2026: celebrating 250 years of having taxes that are "very low by european standards"

1 month ago 4 0 0 0

what's the incentive here? there are no creator payouts...

1 month ago 3 0 2 0

i suspect that GDM has a solution that people in academia are either unable to invent or unable to afford (guessing the latter)

1 month ago 1 0 0 0

it's hard to learn what CDRs do due to their messy structures and the lack of evolutionary info about them in standard ab sequence datasets

1 month ago 0 0 1 0

we are so back (to dalle2 trve art)

1 month ago 1 0 1 0

obnoxiously tagging @kevinkaichuang.bsky.social @delalamo.xyz @austinjtripp.bsky.social about our new PLM paper -- hope you find it interesting and/or useful!

1 month ago 1 0 2 0
How to make the most of your masked language model for protein engineering A plethora of protein language models have been released in recent years. Yet comparatively little work has addressed how to best sample from them to optimize desired biological properties. We fill th...

When supervised models were included via multi-objective guidance, we achieved a 100% synthesizability-and-binding success rate in vitro. But a cautionary note: guidance improved target objectives at the cost of reduced humanness. All the details are here: arxiv.org/abs/2603.10302

1 month ago 4 1 0 0

Another surprise: ESM-2, trained on generic proteins, was highly competitive with antibody-specialized models for antibody optimization. Meanwhile, AbLang2 — trained on human antibodies — sometimes produced less human sequences than ESM-2. Training data ≠ output bias in the ways you'd expect. 6/7

1 month ago 0 0 1 0

Perhaps the most provocative finding from the in vitro experiments: choice of sampling algorithm matters at least as much as choice of model. Beam search consistently outperformed Gibbs across every model where both were tested. 5/7

1 month ago 1 0 1 0