Data Science (@data4sci) Bsky

WeatherNext 2 â Google DeepMind DeepMind’s WeatherNext 2 is a useful look at how AI forecasting is improving, especially on accuracy. Worth skimming if you care about what’s changing in short-term weather prediction and its practical limits.

WeatherNext 2 â Google DeepMind — DeepMind’s WeatherNext 2 is a useful look at how AI forecasting is improving, especially on accuracy. Worth skimming if you care about what’s changing in short-term weather prediction and its practical limits. https://deepmind.google/science/weathernext/

1 hour ago 0 0 0 0

LangChain for Generative AI Pipelines — T-1d Topic: LangChain for Generative AI Pipelines\nStage: T-1d\nPlatform: LinkedIn + Twitter + Substack\n\n📝 Edit your post: https://docs.google.com/document/d/1OeRtRdhKa88bO3hKVS0DHJ2hgMJopUz8uUh_pH2VSc8/edit\n\n🔗 Registration: https://learning.oreilly.com/live-events/-/0642572002267

Tomorrow — bring your questions; we'll keep it hands-on. LangChain for Generative AI Pipelines. Register: learning.oreilly.com/live-events/-/0642572002...

13 hours ago 0 0 0 0

LangChain for Generative AI Pipelines — T-1d Topic: LangChain for Generative AI Pipelines\nStage: T-1d\nPlatform: LinkedIn + Twitter + Substack\n\n📝 Edit your post: https://docs.google.com/document/d/1OeRtRdhKa88bO3hKVS0DHJ2hgMJopUz8uUh_pH2VSc8/edit\n\n🔗 Registration: https://learning.oreilly.com/live-events/-/0642572002267

Tomorrow — bring your questions; we'll keep it hands-on. LangChain for Generative AI Pipelines. Register: learning.oreilly.com/live-events/-/0642572002...

13 hours ago 0 0 0 0

Scientists stunned by ‘fundamentally new way’ life produces DNA Worth a read; the useful bits are usually in the examples and edge cases.

Scientists stunned by ‘fundamentally new way’ life produces DNA — Worth a read; the useful bits are usually in the examples and e… www.science.org/content/article/scientis...

1 day ago 0 0 0 0

Stanford's AI Index for 2026 Shows the State of AI - IEEE Spectrum Stanford’s 2026 AI Index is a useful snapshot of where AI is heading, with clear numbers on compute growth, emissions, and shifting public trust. Worth skimming if you want data to ground strategy conversations beyond anecdotes.

Stanford's AI Index for 2026 Shows the State of AI - IEEE Spectrum — Stanford’s 2026 AI Index is a useful snapshot of where AI is heading, with clear numbers on compute growth, emissions, and shifting public trust. Worth skimming if you want data to… https://spectrum.ieee.org/state-of-ai-index-2026

2 days ago 1 0 0 0

What are skiplists good for? | Antithesis Blog A clear, practical look at where skiplists actually shine, plus a grounded comparison to what you’d otherwise end up doing with lots of SQL JOINs. Worth a read if you’re weighing data structure choices for ordered data and range queries.

What are skiplists good for? | Antithesis Blog — A clear, practical look at where skiplists actually shine, plus a grounded comparison to what you’d otherwise end up doing with lots of SQL JOINs. Worth a read if you’re weighing data structure choices for… https://antithesis.com/blog/2026/skiptrees/

2 days ago 0 0 0 0

Building Large Language Models (LLMs) A clear, practical overview of what goes into building an LLM, from data and training to evaluation and deployment. Worth a skim if you want the end-to-end picture without getting lost in theory.

Building Large Language Models (LLMs) — A clear, practical overview of what goes into building an LLM, from data and training to evaluation and deployment. Worth a skim if you want the end-to-end picture without getting lost in theory. https://m.youtube.com/watch?v=9vM4p9NN0Ts

3 days ago 0 0 0 0

How We Build Effective Agents: Barry Zhang, Anthropic A clear look at the practical engineering choices behind building effective AI agents, straight from Anthropic. Worth a skim if you’re thinking about tool use, evaluation, and where agent reliability actually comes from.

How We Build Effective Agents: Barry Zhang, Anthropic — A clear look at the practical engineering choices behind building effective AI agents, straight from Anthropic. Worth a skim if you’re thinking about tool use, evaluation, and where agent reliability… https://m.youtube.com/watch?v=D7_ipDqhtwk

3 days ago 0 0 0 0

Maximum entropy temporal networks Worth a read; the useful bits are usually in the examples and edge cases.

Maximum entropy temporal networks — Worth a read; the useful bits are usually in the examples and edge cases. https://journals.aps.org/pre/abstract/10.1103/78vv-hs72

3 days ago 0 0 0 0

[2604.14228] Dive into Claude Code: The Design Space of Today's and Future AI Agent Systems A useful overview of how Claude Code frames the design space for agentic systems—what to build into the agent versus the environment. Worth a skim if you’re thinking about tool use, autonomy boundaries, and evaluation tradeoffs.

[2604.14228] Dive into Claude Code: The Design Space of Today's and Future AI Agent Systems — A useful overview of how Claude Code frames the design space for agentic systems—what to build into the agent versus the environment. Worth a skim if you’re thinking about… https://arxiv.org/abs/2604.14228

3 days ago 1 0 1 0

[2604.13018] Toward Autonomous Long-Horizon Engineering for ML Research A useful look at what it would take for agents to handle long-horizon engineering work in ML research, beyond short coding tasks. Worth skimming for the problem framing and where current tooling still falls short.

[2604.13018] Toward Autonomous Long-Horizon Engineering for ML Research — A useful look at what it would take for agents to handle long-horizon engineering work in ML research, beyond short coding tasks. Worth skimming for the problem framing and where current… https://arxiv.org/abs/2604.13018

3 days ago 0 0 0 0

The Abstraction Fallacy: Why AI Can Simulate But Not Instantiate Consciousness Worth a read; the useful bits are usually in the examples and edge cases.

The Abstraction Fallacy: Why AI Can Simulate But Not Instantiate Consciousness — Worth a read; the useful bits are usually in the examples and edge cases. https://philpapers.org/rec/LERTAF

3 days ago 1 0 0 0

I Measured Claude 4.7's New Tokenizer. Here's What It Costs You. Useful reality check on Claude 4.7’s tokenizer: the author measured ~1.47× token inflation on real text versus the 1.0–1.35× range in the docs. Worth a read if you’re budgeting context windows or estimating API costs.

I Measured Claude 4.7's New Tokenizer. Here's What It Costs You. — Useful reality check on Claude 4.7’s tokenizer: the author measured ~1.47× token inflation on real text versus the 1.0–1.35× range in th… www.claudecodecamp.com/p/i-measured-claude-4-7-...

3 days ago 0 0 0 0

Introduction to Spherical Harmonics for Graphics Programmers Clear, programmer-focused intro to spherical harmonics, with just enough math to make the common graphics uses (especially lighting approximation) feel approachable. Good primer before diving into SH-based papers or engine code.

Introduction to Spherical Harmonics for Graphics Programmers — Clear, programmer-focused intro to spherical harmonics, with just enough math to make the common graphics uses (especially lighting approximation) feel approachable. Good primer before diving into… https://gpfault.net/posts/sph.html

5 days ago 0 0 0 0

Slop is text you haven't read, not text you haven't written Good framing on the “slop” debate: the real failure mode is sharing text you didn’t read, regardless of whether a human or an LLM wrote it. A useful reminder to treat review and accountability as the bottleneck, not authorship.

Slop is text you haven't read, not text you haven't written — Good framing on the “slop” debate: the real failure mode is sharing text you didn’t read, regardless of whether a human or an LLM wrote it. A useful reminder to treat review… dwyer.co.za/static/slop-is-text-you-...

5 days ago 0 0 0 0

David J. Chalmers, What we talk to when we talk to language models - PhilArchive Chalmers offers a clear way to think about what we’re actually engaging with when we “talk” to an LLM, especially around whether it makes sense to attribute mental states. Useful framing for anyone using these systems beyond simple Q&A.

David J. Chalmers, What we talk to when we talk to language models - PhilArchive — Chalmers offers a clear way to think about what we’re actually engaging with when we “talk” to an LLM, especially around whether it makes sense to attribute mental states. Useful… https://philarchive.org/rec/CHAWWT-8

6 days ago 0 0 0 0

Tool calling, open source, and the M×N problem Clear overview of why tool calling is easy with closed models but messy in open-source setups—the M×N integration problem is real. Worth a read if you’re building function/tool interfaces and want to avoid maintaining a pile of brittle adapters.

Tool calling, open source, and the M×N problem — Clear overview of why tool calling is easy with closed models but messy in open-source setups—the M×N integration problem is real. Worth a read if you’re building function/tool… www.thetypicalset.com/blog/grammar-parser-main...

6 days ago 1 0 0 0

LangChain for Generative AI Pipelines — T-1w Topic: LangChain for Generative AI Pipelines\nStage: T-1w\nPlatform: LinkedIn + Twitter + Substack\n\n📝 Edit your post: https://docs.google.com/document/d/1YqpDvjA1pU2vVfbNlePZT3zb7owPdZCEp1amRb6T968/edit\n\n🔗 Registration: https://learning.oreilly.com/live-events/-/0642572002267

One week out — last chance to plan it into your calendar. LangChain for Generative AI Pipelines. Register: learning.oreilly.com/live-events/-/0642572002...

6 days ago 0 0 0 0

LangChain for Generative AI Pipelines — T-1w Topic: LangChain for Generative AI Pipelines\nStage: T-1w\nPlatform: LinkedIn + Twitter + Substack\n\n📝 Edit your post: https://docs.google.com/document/d/1YqpDvjA1pU2vVfbNlePZT3zb7owPdZCEp1amRb6T968/edit\n\n🔗 Registration: https://learning.oreilly.com/live-events/-/0642572002267

One week out — last chance to plan it into your calendar. LangChain for Generative AI Pipelines. Register: learning.oreilly.com/live-events/-/0642572002...

6 days ago 0 0 0 0

Claude API for Python Developers — T-2w Topic: Claude API for Python Developers\nStage: T-2w\nPlatform: LinkedIn + Twitter + Substack\n\n📝 Edit your post: https://docs.google.com/document/d/1J6uZkUbfqDTrGu9bL7oilIYKZfsd8FE6OFm40wntxAw/edit\n\n🔗 Registration: https://learning.oreilly.com/live-events/-/0642572255893/

Two weeks out — here's the practical angle (tools, patterns, gotchas). Claude API for Python Developers. Register: learning.oreilly.com/live-events/-/0642572255...

6 days ago 0 0 0 0

Claude API for Python Developers — T-2w Topic: Claude API for Python Developers\nStage: T-2w\nPlatform: LinkedIn + Twitter + Substack\n\n📝 Edit your post: https://docs.google.com/document/d/1J6uZkUbfqDTrGu9bL7oilIYKZfsd8FE6OFm40wntxAw/edit\n\n🔗 Registration: https://learning.oreilly.com/live-events/-/0642572255893/

Two weeks out — here's the practical angle (tools, patterns, gotchas). Claude API for Python Developers. Register: learning.oreilly.com/live-events/-/0642572255...

6 days ago 0 0 0 0

My understanding is that they’re essentially cron jobs on steroids

1 week ago 0 0 0 0

Automate work with routines - Claude Code Docs Clear overview of Claude Code routines—how to schedule jobs, trigger them via API, or hook into GitHub events. Practical starting point if you’re looking to automate repeatable workflows with managed infrastructure.

Automate work with routines - Claude Code Docs — Clear overview of Claude Code routines—how to schedule jobs, trigger them via API, or hook into GitHub events. Practical starting point if you’re looking to automate repeatable workflows with managed… https://code.claude.com/docs/en/routines

1 week ago 4 0 1 0

Stanford Artificial Intelligence Index Report 2026 Useful snapshot of where AI is actually heading in 2026—metrics on research, investment, regulation, and real-world adoption in one place. Worth skimming for grounded numbers you can cite in planning and policy discussions.

Stanford Artificial Intelligence Index Report 2026 — Useful snapshot of where AI is actually heading in 2026—metrics on research, investment, regulation, and real-world adoption in one place. Worth skimming for grounded numbers you can… hai.stanford.edu/assets/files/ai_index_re...

1 week ago 0 0 0 0

[2604.07709] IatroBench: Pre-Registered Evidence of Iatrogenic Harm from AI Safety Measures Worth a look if you care about the trade-offs in AI safety: this paper sets up a pre-registered benchmark to test whether safety interventions can inadvertently cause harm. Useful framing for anyone evaluating guardrails beyond just “does it block bad outputs.”

[2604.07709] IatroBench: Pre-Registered Evidence of Iatrogenic Harm from AI Safety Measures — Worth a look if you care about the trade-offs in AI safety: this paper sets up a pre-registered benchmark to test whether safety interventions can inadvertently cause… https://arxiv.org/abs/2604.07709

1 week ago 0 0 0 0

[2604.08224] Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering Useful unified review of how agent “externalization” actually gets built in practice—memory stores, skill libraries, protocols, and harness tooling. Worth skimming if you’re designing LLM agents and want a clearer taxonomy of what to implement vs. what to leave inside the model.

[2604.08224] Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering — Useful unified review of how agent “externalization” actually gets built in practice—memory stores, skill libraries, protocols, and harness tooling.… https://arxiv.org/abs/2604.08224

1 week ago 0 0 0 0

Center for Responsible, Decentralized Intelligence at Berkeley A useful reality check on AI agent benchmarks: Berkeley researchers show how top leaderboards can be gamed to get near-perfect scores without doing the task. Worth reading if you rely on benchmark numbers for model selection or evaluation.

Center for Responsible, Decentralized Intelligence at Berkeley — A useful reality check on AI agent benchmarks: Berkeley researchers show how top leaderboards can be gamed to get near-perfect scores without doing the task. Worth reading if… rdi.berkeley.edu/blog/trustworthy-benchma...

1 week ago 3 0 1 0

[2604.06425] Neural Computers Useful overview of the “neural computers” idea—how models can be structured to compute with learned representations rather than just pattern-match. Worth a skim if you’re tracking where architecture and algorithm design are converging.

[2604.06425] Neural Computers — Useful overview of the “neural computers” idea—how models can be structured to compute with learned representations rather than just pattern-match. Worth a skim if you’re tracking where architecture and algorithm design are converging. https://arxiv.org/abs/2604.06425

1 week ago 0 0 0 0

Anthropic Will Use CoreWeave’s AI Capacity to Power Claude - Bloomberg Worth a read; the useful bits are usually in the examples and edge cases.

Anthropic Will Use CoreWeave’s AI Capacity to Power Claude - Bloomberg — Worth a read; the useful bits are usually in the examples and edge cases. www.bloomberg.com/news/articles/2026-04-10...

1 week ago 0 0 0 0

Towards transparency and knowledge exchange in AI-assisted data analysis code generation | Nature Computational Science A concise perspective on why AI-generated analysis code needs clearer provenance, documentation, and sharing norms to be trustworthy and reusable. Useful if you’re thinking about reproducibility and collaboration in AI-assisted data workflows.

Towards transparency and knowledge exchange in AI-assisted data analysis code generation | Nature Computational Science — A concise perspective on why AI-generated analysis code needs clearer provenance, documentation, and sharing norms to be… https://www.nature.com/articles/s43588-025-00781-1

1 week ago 0 0 0 0

Posts by Data Science