Advertisement · 728 × 90

Posts by

the people angry about the huggingface scraping are not long term thinkers. we are talking about people who have not even begun to think about what the second order consequences of highly centralized information access look like

1 year ago 4 0 0 0
Post image

(context)

1 year ago 4 0 0 0

Elon the type of guy to actually consider building the Absolutely Safe Capsule from Mother 3

1 year ago 9 0 1 0

i always come fast

1 year ago 2 0 0 0

"free culture for me but not for thee"

1 year ago 11 2 1 0

*the lack of learned optimization

there is a ton of information in the past changes of gradients that we could be doing more with than running averages!

1 year ago 1 0 0 0
Post image

i also think learned optimization may end up being far more of a bottleneck in the long term compared to the architectural structure of neural networks in terms of sample efficient learning

arxiv.org/abs/1606.04474

1 year ago 1 0 1 0
Preview
Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process Recent advances in language models have demonstrated their capability to solve mathematical reasoning problems, achieving near-perfect accuracy on grade-school level math benchmarks like GSM8K. In thi...

arxiv.org/abs/2407.20311

1 year ago 1 0 1 0
Advertisement
Post image

clearly we can observe, that the deeper the network is, the better the heuristics that form in the network when it comes to generalizing to "like data".

so blanketly describing the solutions that dnns make as "poorly generalizable" is a little bizarre to me tbh

1 year ago 1 0 2 0

yeah i respect him a lot more than Gary Marcus in this regard. i just think there's a lack of humility involved when we abstract things to reductionist observations on the level of, "its just curve fitting bro"

1 year ago 2 0 1 0
Post image

i mean, how can you coherently assess the formation of approximated functions that don't actually exist in the data as a form of "memorization" if the internal heuristics of the network look nothing like the data but are formed by an attempt to match it?

1 year ago 1 0 1 0

yeah i think more broadly this is a useful thing to think about for the record

there are some people who are extremely dead set on their current interpretation of how dnns "actually" work (Gary Marcus, François Chollet) and refuse to look through any other lens

pretty unfortunate

1 year ago 1 0 2 0
Post image

a what now?

1 year ago 9 0 1 0

does attending to the attention scores of the "happy" token imbue a consistent expression of this semantically? does it only imbue it for a small subspace of attention comparisons?

assessing this seriously forces you to unpack a black box of unfalsifiable metaphysics questions

1 year ago 2 0 1 0

my objection to this is not on the basis of humans being "special" but on the basis of Transformers being state simulators

whenever Claude says it is "happy", if most of the distribution it did a PRNG diceroll over is an equal spread of emotions, how can we say that there was intent to express it?

1 year ago 1 0 1 0

the world is a ghetto with big guns and picket signs,

but it can do whatever it want, whenever it want, i don't mind

1 year ago 2 0 0 0

the dynamics of public posts on a decentralized social media network are nothing like the dynamics of physically owned private property

1 year ago 8 0 1 0
Advertisement

The Basilisk Cometh

1 year ago 3 0 1 0

can someone please eloquently explain to me how copying the posts from a database into another database is violence, or otherwise allows for exorbitant harms?

1 year ago 8 0 1 0

the vague insinuations of this being a threat to people's safety is especially ???
wtf is this leap in logic?

1 year ago 21 0 1 0

getting mad at the thing relative to what harm it actually causes instead of on the principle of it being for AI would require critical thinking instead of acting like a reactionary and rolling with my gut instinct though.

1 year ago 21 0 1 0

this is the logical consequence of the design of the website. it was always meant to be open. the firehose in particular is completely free

1 year ago 1 0 0 0

you guys do realize that all the leading companies were gonna do this privately regardless whether or not you asked politely, right?
and the only difference is, this is public and researchers/hobbyists can tinker with it more easily?

1 year ago 50 0 1 0

ever since i read the bluesky api was completely open, i knew someone was gonna do this and people would respond with the most artificial faux outrage possible

1 year ago 47 0 2 0

"publicly available data was used by people when i posted it online. this is an ethical crime,"

1 year ago 54 0 1 0
Preview
a cartoon character wearing a purple hat and gloves is jumping over a checkered floor . ALT: a cartoon character wearing a purple hat and gloves is jumping over a checkered floor .

deep learning is hitting a wall but what if if hit on ME instead?

1 year ago 12 0 5 0
Advertisement