Posts by Pete Cheslock

Post image

That time I gave some quick legal advice to Afroman before he won his court case.

3 weeks ago 5 0 0 0
vLLM Inference Meetup · Boston · Luma Deep technical sessions. Live demos. Real conversations. If you're deploying, or scaling LLM inference, this is the room to be in. Join Red Hat AI, IBM,…

Hey Boston friends, we're cooking up another great event in the area.

Workshop + evening sessions covering:
- vLLM project update
- Model compression and speculative decoding
- Agentic AI with vLLM
- Distributed inference at scale with llm-d and k8s

luma.com/4rmkrrb7

3 weeks ago 1 1 0 0

📢 The State of Model Serving Communities: March Edition is out!

We launched our newsletter publicly last year to share our contributions to upstream communities from our Red Hat AI teams. We've gained over 1300 subscribers!

1 month ago 2 2 2 0

Jayson Tatum looking like Jayson Tatum is a horrifying development for the rest of the East.

1 month ago 55 3 1 3

I'm going to be in NYC next week, come and join me at the first llm-d meetup.

If you're looking to learn more about distributed inferencing on kubernetes, this is going to be the place to be.

1 month ago 0 0 0 0
Optimizing LLM Workloads: A Deep Dive into the GPU Recommendation Tool & Configuration Explorer YouTube video by llm-d Project

In the latest llm-d release, we're tackling high hardware costs with the new GPU Recommendation Tool! 📈

Evaluate throughput, latency, and cost-effectiveness before requesting expensive cluster resources.

Check out the full demo: www.youtube.com/watch?v=Y26i...

1 month ago 2 1 0 0

Come and join us for the first llm-d meetup in NYC!

1 month ago 1 0 0 0
Distributed Inference Meetup NYC · Luma llm-d Distributed Inference Meetup NYC Hosted by Red Hat AI, IBM Research, and AMD, this event takes place on March 11, 2026 in New York City. What to…

The agenda is still evolving, and we've got even more awesomeness in the works! 📈

Whether you're running GenAI in production or building the platforms to support it, this is the room to be in.

📅 March 11 | 4:30 PM
📍 1 Madison Ave, NYC
🎟️ RSVP: luma.com/0crwqwg4

1 month ago 0 1 0 1
[Announcement] WG Serving Has Succeeded and Will Be Disbanded

We'd like to announce that @kubernetes.io WG Serving has succeeded and will be disbanded! Thank you everyone who have participated and contributed to the discussions and initiatives!

More details: groups.google.com/a/kubernetes...

2 months ago 4 2 1 1

The most cursed venn diagram

2 months ago 5 0 1 0

In case you missed it, last week the llm-d community shipped the v0.5 release.

Check out the post from the llm-d project owners to learn more about all the features we've included in this release.

llm-d.ai/blog/llm-d-v...

2 months ago 1 1 0 0

📢 The State of Model Serving Communities: February Edition is out!

We launched our newsletter publicly last year to share our contributions to upstream communities from our Red Hat AI teams. We've gained over 1200 subscribers!

2 months ago 1 1 1 0
llm-d 0.5: Sustaining Performance at Scale | llm-d Announcing the llm-d 0.5 release

🏗️ llm-d v0.5: Sustaining Performance at Scale

In our last release, we focused on breaking latency records.

With v0.5, we're shifting from peak performance to the operational rigor required to sustain those gains in production.

🧵👇

llm-d.ai/blog/llm-d-v...

2 months ago 1 1 1 1
Inside vLLM's New KV Offloading Connector: Smarter Memory Transfer for Maximizing Inference Throughput In this post, we will describe the new KV cache offloading feature that was introduced in vLLM 0.11.0. We will focus on offloading to CPU memory (DRAM) and its benefits to improving overall inference…

Standardizing high-performance inference requires deep ecosystem collaboration. 🚀

Huge shoutout to @vllm_project and @IBMResearch on the new KV Offloading Connector. We're seeing up to 9x throughput gains on H100s and massive TTFT reductions. 🧵

blog.vllm.ai/2026/01/08/k...

3 months ago 0 1 1 0
LLM-D Explained: Building Next-Gen AI with LLMs, RAG & Kubernetes YouTube video by IBM Technology

AI inference is like a busy airport: without a controller, you get gridlock. ✈️

Check out this breakdown by Cedric Clyburn from Red Hat on how llm-d intelligently routes distributed LLM requests.

🔹 Solves "round robin" congestion
🔹 Disaggregates P/D to save costs

www.youtube.com/watch?v=CNKG...

3 months ago 1 1 0 0

If you stop to think about it, Geysers are just Earth farts.

11 months ago 2 1 0 0

I'm unaware of any alternative pronunciations for it.

11 months ago 1 0 1 0

We're so young and full of life!

1 year ago 7 0 0 0
GAGGIUINO Gaggiuino is a community-driven project to add profiling, temp control, and other high-end features to Gaggia espresso machines

Honestly, if you want a project you could buy a Gaggia Classic Pro for $499 (or a used one cheaper) and go the gaggiuino route.
gaggiuino.github.io#/?id=home
aftermath.site/gaggiuino-ga...

1 year ago 0 0 1 0

Yea - they are shockingly close to ones I've seen in the past. Maybe if I get some time in the future I'll go and find some live ones. One I remember used the word "crunchy" to describe the headphones and I was reminded of a wine review for "chewy tannins". Hilarious

1 year ago 0 0 0 0
Post image

Well... that's the joke! They are all written to be basically as generic as possible so that they could equally apply to either wine or headphones. I seeded the vote counts out of the gate so that you wouldn't have like 1 or 2 votes swing the graph too much, but here's an example post-vote.

1 year ago 0 0 1 0
Video

So I decided to make a game out of this. Welcome to "Bottles or Cans" where you can read a review and guess if it's for a bottle of wine or for a pair of headphones (aka cans).

bottlesorcans.com

1 year ago 14 2 2 1

So a long time ago when buying new headphones and reading reviews, I noticed how the reviews often sounded similar to reviews for a bottle of wine. Like:

"Rich and full-bodied with excellent depth. The bass notes are particularly impressive, with a smooth finish that lingers pleasantly."

1 year ago 2 1 2 0

not that i'm sure you need ANOTHER one to look at, BUT i'm a big fan of the recteq smokers too. I agree to pass on the Traegers, i've seen too many literally go up in flames.

1 year ago 1 0 2 0
Post image Post image Post image Post image
2 years ago 6668 1756 43 45
How do you pronounce “www” the abbreviation for “World Wide Web”? #shorts #www #sysadmin #sysadminlife #devops #sre #pronunciation #tutorial #software #opensource #developers #techtok

How do you pronounce “www” the abbreviation for “World Wide Web”?

https://youtube.com/shorts/MxuX7M661Hg

#www #sysadmin #devops #sre #pronunciation #tutorial #software #developers

2 years ago 3 2 1 1
How do you pronounce "sudo" the #linux/#unix command? #shorts #sudo #sysadmin #sysadminlife #devops #sre #pronounciation #tutorial #software #opensource #developers #...

How do you pronounce "sudo" the #linux/#unix command?

So are you team "Su DOUGH" or team "Su DOOOO"

https://www.youtube.com/shorts/qpi5wYblQfY

2 years ago 3 1 0 0
There is no agreed upon way to pronounce \ #techtips #data #softwareengineer #developer #devops #sysadmin #software #linux #techlife #tutorial #pronounciation #shorts #linux

This is probably the most requested pronunciation video i've gotten.

How do you say: "fsck" - a.k.a - File System Check.

There is no agreed upon pronunciation of this one!

https://youtube.com/shorts/7b-X6MJGkdA

#linux #sysadmin #devops #sre

2 years ago 1 0 0 0
Have you heard of JWT before? But how would YOU pronounce it? #shorts #software #howdoyousay #tech #softwareengineering #softwaretutorials

Another episode of “How do you say”. This one is definitely one of my favorites.

How do you say JWT (JSON Web Token)?

https://youtube.com/shorts/D2D9umQMKhA?feature=share

2 years ago 2 2 1 0
petecheslock on TikTok Did you know there are 3 ways to pronounce #SQL? #techtok #data #softwareengineer #developer #tech #code

Did you know there are at least 3 (THREE) different ways to say SQL?

https://www.tiktok.com/t/ZTRK9rNSh/

2 years ago 1 1 0 0