Posts by Max Noichl

One-armed bandit curious? Never heard of the Zollman Effect? Join us Friday next week for the new episode of Conversations at the Center, the podcast of the @center4philsci.bsky.social, with our guest @kevinzollman.com

1 day ago 11 3 0 0

1/
"Silicon samples" are becoming more and more common in research and polling.

One problem: depending on the analytic decisions made, you can basically get these samples to show any effect you want.

The updated version of this preprint is now online!

THREAD🧵

arxiv.org/abs/2509.13397

2 days ago 85 42 5 4
MetFuse: Figurative Fusion between Metonymy and Metaphor Metonymy and metaphor often co-occur in natural language, yet computational work has studied them largely in isolation. We introduce a framework that transforms a literal sentence into three figurativ...

This is a bit niche, but for those interested in metaphor and metonymy research, here is one of the first articles I have seen using LLMs as a research tool! #cogling #metaphor #metonymy arxiv.org/abs/2604.12919 Oh, they have also done something on visual metonym: arxiv.org/abs/2601.17706

4 days ago 12 2 0 0
Data Engineer #00246 - Richmond, Virginia, United States Title: Data Engineer #00246 State Role Title: Information Tech Spec III Hiring Range: $100,000 - $125,000 Pay Band: 6 Agency: The Library of Virginia Location: Library of Virginia Building Agency Web...

The Library of Virginia in Richmond seeks a data engineer ($100k-$125k) to transform data practices at a 200-year-old cultural heritage org with an eye towards the future.

Looking for someone to imagine & collaboratively implement tomorrow's data infrastructure.

Apply by May 1! Tell your friends!

1 week ago 78 77 4 7
ICML 2026 Workshop GenAICreativity Welcome to the OpenReview homepage for ICML 2026 Workshop GenAICreativity

How can generative AI better support human creativity, without limiting it? If you have thoughts, we invite submissions to our ICML workshop on Generative AI, Creativity, and Human-AI Co-Creation

📍 July 2026, Seoul
📄 Submit by: April 24 (AOE)
🔗 Submission link: openreview.net/group?id=ICM...

1 week ago 20 8 0 0

I made some playable philosophy simulations:

-Oxford 1952
-Republic Book I
-Jena 1799
-Paris 1945

www.ux-phi.com

1 week ago 13 4 1 2
Screenshot of plot showing ELO vs parameter count for different OCR models

There is no best VLM OCR model - rankings can flip completely by document type.

I built ocr-bench: run open OCR models on YOUR documents, get a per-collection leaderboard.

VLM-as-judge with Bradley-Terry ELO, all running on @hf.co. No local GPU needed.

1 month ago 52 11 1 1

i'm trying out the novel writing project with Claude in Claude Code, using Pangram to break it out of writing in a clearly identifiable AI-writing style. it's going... interesting so far. i despaired at the beginning but am now cautiously optimistic. not so much at the structural level though.

2 months ago 29 1 2 0

this is cool

tbh all i want is an LLM that sits atop my Zotero library and lets me talk to it tho

2 months ago 5 1 2 0
Current Workshop CFA: 8th Scientific Understanding and Representation (SURe) annual workshop. Call for abstracts: We invite authors to submit abstracts of up to 750 words for the upcoming...

Final CFA for the 8th Scientific Understanding and Representation (SURe) annual workshop, which will take place May 27-29, 2026, at the IFIS PAN in Warsaw.
Submission deadline: 20 January 2026.
More info: shorturl.at/AUoye
@philsci.bsky.social @eenphilsci.bsky.social @epsaphilsci.bsky.social

3 months ago 9 6 0 0
A four-panel figure showing the probability of predicting articles from The Journal of Philosophy versus PMLA using quarter-century models. Each panel represents a different training period (1925-1950, 1950-1975, 1975-2000, 2000-2025). Gray shaded regions indicate training periods. The model trained on early C21 philosophy vs literature cannot accurately distinguish early C20 philosophy vs literature, but the reverse is not true.

Hierarchical cluster of syntactic features predicting philosophy (blue) vs criticism (red).

Top 2 distinctive features for Philosophy vs Criticism.

An example of the importance of the "marker" feature in philosophy.

Analytic philosophy can be distinguished from literary criticism with 90-95% accuracy via syntax alone. Moreover, a classifier trained to separate them in early C20 does better predicting future separations than a C21 one predicts past ones, suggesting philosophy syntax narrows/specializes in ~C21.

3 months ago 34 8 0 0
OpenAlex integrated into Web of Science, or the capture of the work of the “commoners”. This announcement went relatively unnoticed, but it deserves a moment's attention. Clarivate recently announced the integration of OpenAlex as a new database...

OpenAlex integrated into Web of Science, or the capture of the work of the “commoners” | carnetist.hypotheses.org/2572

4 months ago 29 28 1 2
Three scatterplots of colorful points.
titles = ['Color Space', 'Text Space', 'Image Space']
subtitles = ['Embeddings of color features', 'Text embedding of color names', 'Image embeddings of color swatches']

Three different ways to represent colo(u)r. Work in progress, inspired by an old post by Kat Zhang / The Poet Engineer.

5 months ago 5 1 1 0

"there is a part of human intelligence which operates in a continuous generalization of the space of words, and other parts entirely which do things which are less well understood" is a perfectly reasonable position which apparently has no adherents

5 months ago 63 5 2 0
Generative Aesthetics: On formal stuckness in AI verse | Published in Journal of Cultural Analytics By Ryan Heuser. This paper examines the formal and aesthetic patterns of AI-generated poems through a series of computational experiments.

Excited to share my latest publication, "Generative Aesthetics: On formal stuckness in AI verse." It's published in a special issue in the Journal of Cultural Analytics, expertly edited by Tess McNulty and Laura Chapot, on "Computation and Form, Reconsidered."
culturalanalytics.org/article/1448...

6 months ago 45 17 2 2

Tomorrow we will have a keynote from Charles Pence (UC Louvain).

Thanks to the Dutch Philosophy Research School (OZSW) for supporting this event, and @mnoichl.bsky.social for organizing this with me!

6 months ago 3 1 0 0
academic presentation in a baroque university environment. A group of researchers are gathered around a conference table

Gregor Betz (KIT) kicking off our "Data Driven Philosophy" Hackathon in Utrecht with his talk: "Doing Philosophy with and for LLMs". Besides input about the state of research and new directions, we're spending three days kicking off new projects.

6 months ago 7 1 1 0

i am going to try to give a framework of my own understanding which laypeople can understand.

6 months ago 383 53 6 20
The Big LLM Architecture Comparison YouTube video by Sebastian Raschka

Updated & turned my Big LLM Architecture Comparison article into a video lecture.

The 11 LLM archs covered in this video:
1. DeepSeek V3/R1
2. OLMo 2
3. Gemma 3
4. Mistral Small 3.1
5. Llama 4
6. Qwen3
7. SmolLM3
8. Kimi 2
9. GPT-OSS
10. Grok 2.5
11. GLM-4.5/4.6

www.youtube.com/watch?v=rNlU...

6 months ago 51 9 0 1

For the first episode of Ping Pong Philosophy I had the absolute pleasure to speak with Greg Restall, one of the most renowned philosophical logicians and an absolutely great guy to have a chat with. Thank you for your time, Greg, I had a blast.
We are also on Spotify!

6 months ago 4 1 0 0

Christopher Colón Lugo uses 3D U-net to capture patterns in the Game of Life
#DistributedCiphers
#ALIFE2025

6 months ago 5 3 0 0
Job Posting I-390/25: Research Associate - salary grade E13 TV-L Berliner Hochschulen – Job Postings at Technische Universität Berlin Faculty I - Humanities and Educational Sciences, Institute of History and Philosophy of Science, Technology, and Literature / History and Philosophy of Modern Science

#Postdoc at Technische Universität Berlin in digital humanities & history/philosophy/sociology of science #philsci #STS. ERC project investigates digital communication within the ATLAS collaboration at CERN

Deadline: October 13, 2025
www.jobs.tu-berlin.de/en/job-posti...
#PhilJobs

6 months ago 28 19 0 1

Upshot:
NNES report needing twice as long to read English-language papers and to prepare English presentations. Even among highly proficient NNES (C1–C2 level), ~60% report having avoided asking questions at events due to concerns about their English (compared to 16% of NES). #philsky

6 months ago 24 10 0 0
Heat map of St Petersburg

How do literary communities actually form?
@maria-lev.bsky.social analyzes the networks of collaboration and aesthetic affinity that are documented through cultural events — e.g. readings, book launches, festivals. These real-world networks often remain invisible in text-based literary history.

7 months ago 10 4 1 1

In a new work with Joseph Rich and Conrad Oakes we tackle the problem of how to best organize alluvial plots. We formalize two optimization problems and develop a solution for them based on the neighbornet algorithm, implemented in the program wompwomp: github.com/pachterlab/w...

7 months ago 32 9 3 0
Max Noichl | Patterns, Pathways & Surprises Our poster for EPSA 2025, introducing OpenAlex mapper

Had a great time last week at #epsa2025! I've put the poster up here, if anyone wants to take a closer look: maxnoichl.eu/blog/2025/ep...

7 months ago 4 0 0 0
A Gaussian process showing that the allowed time series are forced to be compatible with data

I’m especially proud of this article I wrote about Gaussian Processes for the Recast blog! 🥳

GPs are super interesting, but it’s not easy to wrap your head around them at first 🤔

This is a medium level (more intuition than math) introduction to GPs for time series.

getrecast.com/gaussian-pro...

7 months ago 80 23 2 1
The participants of Dagstuhl Seminar 24122 standing on steps outside (from https://www.dagstuhl.de/24122)

Multiple types of embeddings (UMAP, t-SNE, Laplacian Eigenmaps, PHATE, PCA, MDS) of Wikipedia text data labelled by text summaries generated by an LLM. Methods like UMAP and t-SNE show cluster structure that reflects shared subject matter in text, while other methods show more continuous structure.

Multiple embedding methods (PCA, Laplacian Eigenmaps, t-SNE, MDS, PHATE, UMAP) of primate brain organoids at different time periods. Different methods highlight different aspects of development, such as clusters of similar cell types or time courses of cell development.

Multiple embedding methods (PCA, Laplacian Eigenmaps, t-SNE, MDS, PHATE, UMAP) of 1000 Genomes Project genotypes. Different methods reflect different aspects of demographic history of populations.

Last year I met a bunch of great researchers who work with high-dimensional data at a Dagstuhl seminar. This week we put out a preprint about the history and philosophy of low-dimensional embedding methods, their applications, their challenges, and their possible future arxiv.org/abs/2508.15929

7 months ago 15 7 1 1