Jan (@janschulte.com) Bsky

11 Tricks to Make Your AI Application Fast Practical techniques to reduce latency and improve perceived performance in AI applications — from model selection to UI patterns.

→ Validate structured output mid-stream, fail fast
→ Buffer rapid requests into single LLM calls

Full writeup with interactive visualisations: betalyra.com/blog/11-tric...

1 week ago 1 0 0 0

→ Structure prompts for caching (static top, dynamic bottom)
→ Disable thinking — it's on by default, adds seconds
→ Compact conversation history every turn
→ Strip your system prompt to the bone
→ Push everything except the response to background queues

1 week ago 0 0 1 0

Integrated AI into your SaaS but it feels slow? 11 tricks to make it fast:

→ Use small models, route up when needed
→ Minimise output tokens — they're 2-4x more expensive
→ Stream everything, including tool call results
→ Design use-case tools, not CRUD wrappers

1 week ago 1 0 2 0

Tiles Changelog All notable changes and releases for Tiles.

Tiles version 0.4.3 Alpha 7 has been released.

tiles.run/download

Turn-based chats with SQLite persistence. Release artifacts now include a .pkg bundle for easier installs. Support for non-Harmony models such as qwen 3.5-4b. Bug fix for venvstack generating multiple tar archives.

1 month ago 2 3 0 0

Introducing ChatGPT Atlas The browser with ChatGPT built in.

OpenAI just announced their own browser Atlas.

After Perplexity’s Comet and Claude’s Chrome extension OpenAI joins the AI-native browser wars. This will put some serious pressure on Google search..

#aio #geo #ai #llms #aiso
openai.com/index/introd...

5 months ago 3 1 0 0

Introducing ChatGPT Atlas The browser with ChatGPT built in.

OpenAI just announced their own browser Atlas.

After Perplexity’s Comet and Claude’s Chrome extension OpenAI joins the AI-native browser wars. This will put some serious pressure on Google search..

#aio #geo #ai #llms #aiso
openai.com/index/introd...

5 months ago 3 1 0 0

Instead of a useless AI emoji generator I wish #apple had used AI to create intelligent printing that doesn't print the whole page black or print in portrait mode even though you selected landscape

6 months ago 2 0 0 0

Architecting and Evaluating an AI-First Search API Building a scalable Search API that handles 200 million daily queries using hybrid retrieval and intelligent context curation for AI models

So pretty similar to a typical RAG pipeline but on a giant scale.

The search api announcement:
www.perplexity.ai/hub/blog/int...

The insights article
research.perplexity.ai/articles/arc...

hashtag#aio hashtag#aiso hashtag#geo hashtag#ai hashtag#llm hashtag#aeo

6 months ago 0 0 0 0

The gist is:
- they start with semantic embeddings and lexical retrieval (probably BM25 and similar) for a first very fast retrieval
- then they apply heuristics based filtering and base filters for things like stale content
- then using cross-encoders aka rerankers to rank the final result sets

6 months ago 1 0 1 0

if true this would make it an amazing api for AIO research.

They also published an article giving some super interesting insights into how their index is constructed.

6 months ago 0 0 1 0

This is huge! Perplexity just released their search index as an api and with it a blog article giving some insights into how they rank. According to their announcement the search api is the exact same as the one powering the perplexity application.

6 months ago 1 0 1 0

Yes exactly, there is no need for a special treatment of schemas

7 months ago 0 0 1 0

So the model "knows" that these tokens belong together in this context of a json schema.
Here is an example of how that looks in practise for the prompt "What organization is that?" where you see that the model looks at the context & type

7 months ago 0 0 1 0

The tokens are transformed into embeddings with a positional encoding added to them (e.g. RoPE), so tokens close to each other are more "similar" than tokens further away. This way the attention mechanism learns to attend to these tokens together (being trained on thousands of schema examples).

7 months ago 0 0 1 0

New captcha idea: let #ai generate a heatmap scale. #claude at least is surprisingly uncapable of doing it:

9 months ago 2 1 0 0

Can recommend better-auth which has organization support

9 months ago 0 0 0 0

TANK vs TESLA

"We've crushed fascism before and we'll crush it again"

- ⁠WW2 veteran Ken, 98, in the tank

11 months ago 56062 20130 2259 3624

There are many good corporate blogs that are highly informative, blog.logrocket.com eg is one I have in high esteem

11 months ago 3 0 0 0

Portugal with full blackout and bad internet. No jira, no teams, no AWS. Life is good

11 months ago 1 0 0 0

When war crimes go unanswered, and handshakes with smiles replace consequences, the message is clear: kill more — we allow it.

1 year ago 922 330 27 14

Building Effect 4.0 | Michael Arnaldi (Effect Days 2025) YouTube video by Effect | TypeScript at Scale

Effect 4.0 is on the way – smaller, faster, and more refined. In his closing remarks, Michael Arnaldi, creator of Effect, shared what’s coming:

▸ ~20× faster streams & batching
▸ Smaller bundles
▸ STM integrated into core

Big changes, same focus.

www.youtube.com/watch?v=nyvB...

1 year ago 15 5 0 1

import { oc } from '@orpc/effect' ;)

1 year ago 1 0 1 0

Introduction to Streams Learn the fundamentals of streams, a powerful tool for emitting multiple values, handling errors, and working with finite or infinite sequences in your applications.

awesome! would be great if you could add support for Effect- and Effect<Stream>-based handlers ;)

1 year ago 0 0 1 0

AI generated marketing videos saas

1 year ago 1 0 0 0

fleetwood.dev fleetwood.dev

Want an in depth exploration of the different hardware architectures within AI?
Of course you do :)
Another great article by Chris Fleetwood:
fleetwood.dev/posts/domain...

1 year ago 1 1 0 0

Yeah not saying this is the case with every traditional bank and some even have fast transfers. But there are also the traditional banks where you wait 3 days for an intra-EU transfer, speaking from experience, while with my modern bank the transfer arrives in seconds.

1 year ago 0 0 1 0

just give me real-time indexing. It's 2025!! The technology is there. Perplexity & co can leverage this to appear always up to date in comparison to Google, Bing & Co

1 year ago 0 0 0 0

This feels like traditional banking vs. modern banking like revolut. Google feels like traditional banks that rely on some batch COBOL jobs that some dude in the 70s wrote and that run once a night and if it fails, will run again the night after. also not running on the weekends.

1 year ago 0 0 2 0

search engines newcomers like perplexity.ai, chatgpt.com, you.com outperform google with their AI, but there is another attack vector that they can leverage: indexing. Google & Bing take like ages to index a page, even if you submit sitemaps & stuff.

1 year ago 0 0 1 0

If the United States is willing to extort with Starlink, assume AWS, Google, and Azure clouds are next.

Ukraine today, whose turn is it tomorrow?

Any US technology is now a liability.

We need European solutions as fast as possible.

1 year ago 35 18 3 0

Posts by Jan