Advertisement ยท 728 ร— 90

Posts by CloudThrill

Nice example of a production #vLLM setup on ๐—ก๐—ฒ๐—ฏ๐—ถ๐˜‚๐˜€ with terraform, managed K8s, inference, and observability all in one place.

This can be a ref stack builders can use without reinventing the basics ๐Ÿ’ก.
๐Ÿ‘จ๐Ÿปโ€๐Ÿ’ป full code on our repo.
github.com/CloudThrill/vllm-production-stack-terraform

2 months ago 1 0 0 0
Preview
kv_cache Explained: How It Enhances vLLM Inference - Cloudthrill This blog is my attempt to break it down simply, without drowning in dark math :). If youโ€™ve ever wondered what kv_cache actually does, youโ€™re in the right place. Letโ€™s make it click. explore how KV cache enhances vllm inference

๐Ÿ†And our #1 - 2025 blog-post on @Cloudthrill isโ€ฆ KV Cache explained (๐—Ÿ๐—ถ๐—ธ๐—ฒ ๐—œโ€™๐—บ ๐Ÿฑ)

Ever wondered what #KVCache really is in LLM inference?
Here's the simplest analogy for beginners plus an overview of popular KV cache optimization techniques!

๐Ÿ“– cloudthrill.ca/kv_cache-exp...

3 months ago 1 0 0 0
Post image

๐Ÿ†Ranked #2 most-read in 2025 - #vLLM for Beginners (Key features)
2๏ธโƒฃ Hereโ€™s the most exhaustive list of VLLM features you wish you knew. ๐Ÿ‘‡
๐Ÿ“– cloudthrill.ca/what-is-vllm...

Learn what makes #vllm the ๐—ฅ๐—ผ๐—น๐—น๐˜€ ๐—ฅ๐—ผ๐˜†๐—ฐ๐—ฒ of Inference in productionโœจ. #vLLM #AIForBeginners

3 months ago 0 0 1 0
LLM Quantization: All You Need to Know! - Cloudthrill We curated enough data to provide a foundational understanding of quantization principles, addressing common confusions and answering questions you might have hesitated to ask. So hereโ€™s everything I wish someone had told me a year ago.

๐Ÿ†Ranked #3: ๐—Ÿ๐—Ÿ๐—  ๐—ค๐˜‚๐—ฎ๐—ป๐˜๐—ถ๐˜‡๐—ฎ๐˜๐—ถ๐—ผ๐—ป, because cheaper inference is never just about INT8 vs FP16.

Hereโ€™s everything you wish you knew about LLM quantization.๐Ÿ‘‡
๐Ÿ“– cloudthrill.ca/llm-quantization-all-you-need-to-know
๐ŸŽ™๏ธPodcast (YouTube):"from ๐˜Ž๐˜Ž๐˜œ๐˜ ๐˜ต๐˜ฐ enterprise ๐˜˜๐˜ถ๐˜ข๐˜ฏ๐˜ตization"โ™ฅ๏ธ
๐Ÿ“บ youtube.com/watch?v=XTE0oS7b6fM

3 months ago 0 0 1 0
Post image

๐ŸŽ„This week, weโ€™re counted down CloudThrillโ€™s ๐—ง๐—ผ๐—ฝ ๐Ÿฏ ๐—บ๐—ผ๐˜€๐˜-๐—ฟ๐—ฒ๐—ฎ๐—ฑ blog posts of ๐Ÿฎ๐Ÿฌ๐Ÿฎ๐Ÿฑ๐Ÿ†.
Three posts. Three lessons. First reveal drops this Monday ๐Ÿ‘€

#CloudThrill #LLM #AIInfrastructure #LLM #OpenSourceAI #AIEngineering

3 months ago 0 1 1 0

CloudThrill is a proud sponsor of Tech Beats Unplugged podcast๐ŸŽ™๏ธ. ๐Ÿ”ฅNew episode out with Michael (WebScale) Webster- breaking down the VMwareโ€“Broadcom chaos, Nutanix , and real exit strategies. Listen now๐ŸŽง๐Ÿ‘‡๐Ÿผ.

3 months ago 1 0 0 0

This terraform stack delivers a production-ready vLLM serving environment On @awscloud.bsky.social #EKS, supporting both CPU/GPU inference with operational best practices embedded in AWS Integration and Automation (๐—ฎ๐˜„๐˜€-๐—ถ๐—ฎ). A One Click Deploy๐Ÿ”ฅCheck the repo and Blog below ๐Ÿ‘‡๐Ÿป

4 months ago 0 0 0 0
Advertisement

This terraform stack delivers a production-ready vLLM serving environment On ๐—š๐—ผ๐—ผ๐—ด๐—น๐—ฒ ๐—–๐—น๐—ผ๐˜‚๐—ฑ ๐—š๐—ž๐—˜, supporting both ๐—–๐—ฃ๐—จ/๐—š๐—ฃ๐—จ inference with operational best practices embedded in #Terraform #GKE Module. A One Click Deploy๐Ÿ”ฅCheck the repo and blog below ๐Ÿ‘‡๐Ÿป

4 months ago 0 0 0 0

๐Ÿ”ฅCheck out whatโ€™s cooking in #vLLM for ๐Ÿฎ๐Ÿฌ๐Ÿฎ๐Ÿฒ and beyond. From the project leader himself ๐—ฆ๐—ถ๐—บ๐—ผ๐—ป ๐— ๐—ผ! #๐—ข๐—ฝ๐—ฒ๐—ป๐—ฆ๐—ผ๐˜‚๐—ฟ๐—ฐ๐—ฒ๐—”๐—œ #LeadingTheway ๐Ÿ’ช #RaySummit2025 #Anyscale

4 months ago 0 0 0 0
Post image

Still thinking of hosting your on AI Backend?
our FREE vLLM POC is still live - but not forever.
๐Ÿ“ข๐—”๐—ฝ๐—ฝ๐—น๐˜† ๐—ป๐—ผ๐˜„ โ†’ cloudthrill.ca/ai-poc

Run AI assistants, RAG, or open models privately in the cloud:
โœ… No external APIs
โœ… No vendor lock-in
โœ… Total data control

Your Infra. Your Models. Your rules.๐Ÿ†๐Ÿ

5 months ago 0 1 0 0

๐Ÿ’ก In this 5-min read you'll learn:
โœ… How embeddings work โ€“ in the simplest way possible
๐Ÿ” Chunk sizes, overlaps, and text splitters
๐Ÿ“ฆ Vector DBs, popular embedding models used today

๐Ÿ’กOh,& donโ€™t forget, our Private ๐—”๐—œ ๐—œ๐—ป๐—ณ๐—ฒ๐—ฟ๐—ฒ๐—ป๐—ฐ๐—ฒ campaign is still running, with a ๐‹๐ˆ๐Œ๐ˆ๐“๐„๐ƒ FREE ๐๐Ž๐‚ cloudthrill.ca/ai-poc

5 months ago 0 0 0 0
Post image

๐ŸWeโ€™re excited to share that CloudThrill has been awarded a ๐๐ซ๐จ๐’๐ž๐ซ๐ฏ๐ข๐œ๐ž๐ฌ ๐ฉ๐ซ๐ž๐ช๐ฎ๐š๐ฅ๐ข๐Ÿ๐ข๐œ๐š๐ญ๐ข๐จ๐ง๐Ÿ‘๐Ÿผ with ๐๐ฎ๐›๐ฅ๐ข๐œ ๐’๐ž๐ซ๐ฏ๐ข๐œ๐ž๐ฌ ๐š๐ง๐ ๐๐ซ๐จ๐œ๐ฎ๐ซ๐ž๐ฆ๐ž๐ง๐ญ ๐‚๐š๐ง๐š๐๐š !
๐Ÿ‘‹๐Ÿป Work with a ๐ฉ๐ฎ๐›๐ฅ๐ข๐œ ๐š๐ ๐ž๐ง๐œ๐ฒ ? Letโ€™s talk about your challenges - weโ€™d love to hear from you! cloudthrill.ca/contact-us
#CloudThrill #ProServices #GovernmentOfCanada

6 months ago 0 0 0 0

Check out our new ๐ฏ๐‹๐‹๐Œ ๐๐ซ๐จ๐๐ฎ๐œ๐ญ๐ข๐จ๐ง ๐’๐ญ๐š๐œ๐ค blog๐Ÿ‘‡๐Ÿผ
๐Ÿ’ŽWe cover:
โœ… What is ๐ฏ๐‹๐‹๐Œ production stack ?
โœ… Request Flow & Architecture breakdown
โœ… Serving Engine, Request Router & KV-Cache Netwrk
โœ… Autoscaling & built-in fault-tolerance
โœ… One-click Helm install

#LLMs #Kubernetes #Cloudthrill #vLLM

6 months ago 1 0 0 0

Hereโ€™s a full recap from #CloudThrill team of the vLLM beginners series, broken in 3 parts ๐Ÿ’ซ share and enjoy!

7 months ago 0 0 0 0

๐Ÿ” Learn the key to easy and production-grade secret management on K8s ๐Ÿ‘‡๐Ÿผ
๐‹๐ข๐ค๐ž ๐ญ๐ก๐ข๐ฌ ๐ค๐ข๐ง๐ ๐จ๐Ÿ ๐ฌ๐ญ๐ฎ๐Ÿ๐Ÿ? Subscribe here ๐Ÿ‘‰ tinyurl.com/CloudThrillBlogs

8 months ago 0 0 0 0

Get your teams to level up their CI/CD skills with this GithubActions cert guide ๐Ÿ‘‡๐Ÿป

8 months ago 0 0 0 0
Advertisement

#NewBlog: final part of our #VLLM blog series๐Ÿ”ฅ
๐Ÿ’ŽThis, we shift from theory to practice, covering #vLLM installs across platforms? check our new blog, where we break it down in 5 sections๐Ÿ˜Ž#TherYouGo

8 months ago 0 0 0 0

New week, new blog! ๐Ÿ‘‡๐Ÿผ

8 months ago 0 0 0 0

๐Ÿ‘‹๐ŸผSee you next Thursday! #Livestream #LLMs #Quantization

9 months ago 1 0 0 0

#NewBlog part 2 of our #VLLM blog series ๐Ÿ”ฅ
๐Ÿ’ŽWhat makes #VLLM the Rolls Royce of inference? ๐Ÿ‘‡๐Ÿปcheck our new blog, where we break it down in 5 performance-packed layers๐Ÿ˜Ž#TherYouGo

9 months ago 0 0 0 0
Preview
Terraform Pipelines for Dummies Part3: GitHub Actions Azure Deploy with OIDC - Cloudthrill Struggling with Azure credential management in your CI/CD pipelines? Both Azure and GitHub Actions now supportย OpenID Connect (OIDC) for secure deployments by simplifying the process, and aligning with modern security practices. With GitHubโ€™s OIDC provider, you can authenticate directly from your workflows using managed identities without the need for static access keys.

๐Ÿš€#NewBlog ๐†๐ข๐ญ๐‡๐ฎ๐› actions Azure deploy with ๐Ž๐ˆ๐ƒ๐‚!
๐Ÿ’กOver ๐Ÿ๐Ÿ‘ ๐ฆ๐ข๐ฅ๐ฅ๐ข๐จ๐ง๐ฌ secrets were exposed in #GitHub last year๐Ÿ’€ & ๐Ÿ“๐ŸŽK+ #Huggingface tokens leaks every month!
๐Ÿ›ก๏ธSwitch to ๐ฌ๐ž๐œ๐ซ๐ž๐ญ๐ฅ๐ž๐ฌ๐ฌ with Pipeline identity now!
๐Ÿ‘‰We show you how: cloudthrill.ca/github-actio...
#Azure #NHI #CICD #Terraform #ManagedIdentity

9 months ago 1 1 0 0

Want to learn about @VLLm ? start here ๐Ÿ‘‡๐Ÿป

10 months ago 0 0 0 0

๐Ÿšจ As proud sponsors, we're excited to share the latest episode of #TechBeatsUnplugged! ๐ŸŽงTune in as Steve Giguere digs through every attack vectorโ˜ข๏ธon your GitHub workflows and how to protect you๐Ÿ›ก๏ธfrom them.

10 months ago 0 0 0 0

New Blog drop!! ๐Ÿ‘‹๐Ÿป
๐Ÿง Your AI workloads are nothing without securing credentials.

10 months ago 0 0 0 0
Post image

๐Ÿšจ@CloudThrill is excited to announce its membership in the NVIDIA Inception Program! ๐Ÿ‘๐Ÿป๐Ÿ‘๐Ÿป๐Ÿ‘๐Ÿป
Read full statement: cloudthrill.ca/cloudthrill-...
#NVIDIAInception Program for Startups!

11 months ago 0 0 0 1
Preview
TAICO May 2025 meetup!, Wed, May 7, 2025, 5:30 PM | Meetup The TAICO team is proud to announce our next meetup at the Adaptavist office in Toronto. Much thanks to [Adaptavist](https://www.adaptavist.com/ "https://www.adaptavist.com

๐Ÿšจ#AI & #CyberSec heads in #Toronto!
Join us on Wednesday, ๐Œ๐š๐ฒ ๐Ÿ•๐ญ๐ก from 5:30pm-8pm EST for another exciting #TAICO Meetup (Toronto AI and Cybersecurity Organization).
#Cloudthrill #ProudSponsor๐Ÿ”ฅ
www.meetup.com/taico-toront...

11 months ago 0 1 0 0
Advertisement

Check out our team's new article how to Ace your #CNCF Certified Kubernetes Administrator exam๐Ÿ”ฅ #CKA

11 months ago 0 0 0 0
Post image

๐Ÿง  ๐†๐๐“ ๐Œ๐จ๐๐ž๐ฅ๐ฌ ๐Ÿ๐จ๐ซ ๐๐ฎ๐ฆ๐ฆ๐ข๐ž๐ฌ #cheatsheet
๐Ÿค” If youโ€™ve opened #ChatGPT lately and thought:
โ€œ๐–๐š๐ข๐ญโ€ฆ ๐ฐ๐ก๐š๐ญโ€™๐ฌ ๐จ๐Ÿ‘? ๐€๐ง๐ ๐ฐ๐ก๐ฒ ๐š๐ซ๐ž ๐ญ๐ก๐ž๐ซ๐ž ๐ฌ๐จ ๐ฆ๐š๐ง๐ฒ ๐ฆ๐จ๐๐ž๐ฅ๐ฌ ๐ง๐จ๐ฐ?โ€ Youโ€™re not alone. Today #openAI finally answered๐Ÿ™‹๐Ÿปโ€โ™€๏ธ
๐Ÿ‘‰๐Ÿปhttps://platform.openai.com/docs/models/compare

1 year ago 0 1 0 0

Check out our team's new article how to expose you web apps demos through Zero trust ๐Ÿš€ @openziti.bsky.social

1 year ago 2 1 0 0
Post image

๐Ÿ“ขNext week, learn more about
๐Ÿง ๐๐ซ๐ข๐ง๐ ๐ข๐ง๐  ๐€๐ˆ ๐ˆ๐ง๐Ÿ๐ž๐ซ๐ž๐ง๐œ๐ž ๐‚๐ฅ๐จ๐ฌ๐ž๐ซ ๐ญ๐จ ๐ฒ๐จ๐ฎ๐ซ ๐๐ž๐ฏ๐ฌ: deploy ๐€๐ˆ ๐„๐ง๐๐ฉ๐จ๐ข๐ง๐ญ๐ฌ ๐ข๐ง ๐Ž๐Š๐„ from our own๐Ÿ”ฅOracle #ACE @clouddude.bsky.social

โ™ ๏ธCheck out the entire agenda ๐Ÿ‘‰๐Ÿป social.ora.cl/60170pPYS
โ™ ๏ธ Register for free ๐Ÿ‘‰๐Ÿป social.ora.cl/60120pPYa

@oracleace.bsky.social #AIInference #K8s #ollama

1 year ago 1 0 0 1