Firethering (@firethering.com) Bsky

Gen-Searcher: An Open Source AI That Searches the Web Before Generating Images - Firethering Your image generator has never seen today. It was trained months ago, maybe longer, and everything it draws comes from that frozen snapshot of the world. Ask it to generate a current news moment, a pr...

GenSearcher does something none of the mainstream tools do. Before it draws a single pixel, it goes & looks things up. It searches web, browses sources, pulls visual references. Then it generates. The Result is an image grounded in actual current info. #opensource #ai
firethering.com/gen-searcher...

18 hours ago 2 1 0 0

File Converter Pro offline file converter for images audio video and documents - Firethering Most file converters still push you to upload your files somewhere. Even for basic stuff like changing a PDF or converting an image. It works, but it’s not something you feel great about, especially w...

File Converter Pro is an #opensource Windows App that works like a simple offline converter. You drop files in, pick what you want, and it converts everything locally. No uploads on any server.
firethering.com/file-convert...

23 hours ago 2 0 0 0

MiniMax M2.7: The Agentic Model That Helped Build Itself - Firethering MiniMax handed an internal version of M2.7 a programming scaffold and let it run unsupervised. Over 100 rounds it analyzed its own failures, modified its own code, ran evaluations, and decided what to...

MiniMax handed an internal version of M2.7 a programming scaffold and let it run unsupervised. Over 100 rounds it analyzed its own failures, modified its own code, & decided what to keep and what to revert. The result was a 30% performance improvement #technews #ai
firethering.com/minimax-m2-7...

2 days ago 4 1 0 0

Marco MoE Uses 5% of Its Parameters but Outperforms Models 3× Its Size - Firethering Most AI models are what they appear to be. A 12B parameter model uses 12B parameters. What you see is what runs. Marco MoE does not work that way. Alibaba built two models, Marco Nano and Marco Mini, that carry billions of parameters but wake up only a tiny fraction of them for each request. Marco Nano activates 0.6 billion out of 8 billion. Marco Mini activates 0.86 billion out of 17.3 billion. Less than 5% of either model is actually working at any moment. The part that makes this worth paying attention to is what that 5% manages to do against models running at full capacity.

Most AI models are what they appear to be. A 12B parameter model uses 12B parameters. What you see is what runs.
Marco MoE doesn't work that way. #Alibaba built 2 models, Marco Nano & Marco Mini, that carry billions of parameters but wake up only a tiny fraction. #ai
firethering.com/marco-moe-na...

2 days ago 1 0 0 0

VoxCPM2 lets you create voices just by describing them and it is open source - Firethering Most AI voice tools give you two options. Clone an existing voice or pick from a list of defaults. If neither works for what you need, you are stuck. VoxCPM2 adds a third option. You describe what you...

firethering.com/voxcpm2-voic...

3 days ago 1 0 0 0

VoxCPM2 lets you create voices just by describing them and it is open source - Firethering Most AI voice tools give you two options. Clone an existing voice or pick from a list of defaults. If neither works for what you need, you are stuck. VoxCPM2 adds a third option. You describe what you want. A young woman, gentle tone, slightly slow pace. A deep male voice with a formal cadence. Whatever you can put into words, it generates from scratch, no recording needed. That alone would make it interesting. But it also does voice cloning, supports 30 languages without needing a language tag, outputs 48kHz audio, runs on 8GB of VRAM, and ships under Apache 2.0. The whole thing is two billion parameters and installs with a single pip command. I tried the audio samples and the results are genuinely good. Not fully human, but natural enough that you stop noticing the model and start paying attention to what it is saying. Mixed languages, different emotions, and you can steer all of it.

Most AI voice tools give you two options. Clone an existing voice or pick from a list of defaults.
VoxCPM2 adds a third option. You describe what you want. A young woman, gentle tone, Whatever you can put into words, it generates #opensource #tts #ai
firethering.com/voxcpm2-voic...

3 days ago 3 0 0 0

Meta’s Muse Spark: A Closed Bet on Multimodal, Multi-Agent AI - Firethering Meta has a new AI model and for the first time in years it is not called Llama. Muse Spark launched yesterday under Meta Superintelligence Labs, a new internal division Meta quietly formed by bringing together researchers from Google DeepMind and other frontier labs. It is natively multimodal, supports multi-agent reasoning, and is available right now at meta.ai. It is also not being released as open weights. That last part is worth sitting with for a second. Meta built one of the most trusted brands in open source AI through Llama. Developers built on it, researchers published with it. Muse Spark continues none of that. No weights, no HuggingFace release, private API preview only. What you get instead is a genuinely capable multimodal model with some benchmark numbers that are hard to ignore and a new reasoning mode called Contemplating that puts it in conversation with Gemini Deep Think and GPT Pro. Whether that trade is worth it depends entirely on what you were using Meta AI for in the first place.

Muse Spark launched yesterday under Meta Superintelligence Labs, a new internal division Meta quietly formed by bringing together researchers from Google DeepMind and other frontier labs. #ai #meta
firethering.com/meta-muse-sp...

4 days ago 1 0 0 0

GLM 5.1: The open source model that gets better the longer you run it - Firethering Give an AI agent a hard problem and it usually figures out the easy wins fast. After that, more time does not help. It just sits there, trying the same things. ZhipuAI ran GLM-5.1 on a vector database...

ZhipuAI ran #GLM-5.1 on a vector database optimization problem & let it go for 600 iterations. It did not run out of ideas. At iteration 50 it was sitting at roughly the same performance as the best single-session result any model had achieved #opensource
firethering.com/glm-5-1-open...

5 days ago 2 0 0 0

Bonsai 8B: A 1-Bit LLM That Delivers 8B-Class Performance at 1/14th the Size - Firethering Nobody expected a 1.15 GB model to score competitively against full precision 8B models. That is not how this usually goes. PrismML released Bonsai 8B last month and the headline number is almost absu...

PrismML released Bonsai 8B last month & the whole model, weights and all, fits in 1.15 GB. For context, the standard FP16 version of a comparable 8B model sits at around 16 GB. Bonsai beats or matches several of them on benchmarks #opensource #ai
firethering.com/bonsai-8b-1b...

6 days ago 3 1 0 0

Llamafile: Run AI Models Locally on Your PC with Just One File - Firethering Running a local LLM usually means a Python environment, CUDA drivers, and at least one Stack Overflow tab open before you've even started. llamafile skips all of that. Mozilla.ai packaged the whole ru...

Running a local #LLM usually means a Python environment, CUDA drivers, and at least one Stack Overflow tab open before you’ve even started. llamafile skips that. Mozilla packaged the whole runtime like model weights and everything into a single executable. #opensource
firethering.com/llamafile-ru...

1 week ago 2 0 0 0

Google AI Edge Gallery: Run LLMs Offline on Your Phone Google AI Edge Gallery lets you run open-source LLMs straight on your phone. No cloud. Once you download the models, you're offline. You get chat, image analysis, audio transcription, prompt testing. ...

Now you can run AI models like Gemma 4 Directly in your phones easily with Google's #opensource App Google AI Edge Gallery , It supports multiple #AI models you download in the app itself from Hugging face and it work fully Offline.
firethering.com/google-ai-ed...

1 week ago 2 0 0 0

4 Open-Source TTS Models That Can Clone Voices and Actually Sound Human Voice cloning used to mean expensive studio software, proprietary APIs with per-character pricing, or models so heavy they needed server infrastructure just to run. That changed quietly over the last ...

Four #opensource models exist right now that do something the previous generation struggled with. They can clone a voice from a short audio sample and produce output that is genuinely difficult to compare from the original speaker. #ai

firethering.com/open-source-...

1 week ago 3 0 0 0

VOID: Netflix's open source AI removes objects and fixes the physics they break Netflix has a visual effects budget most film studios would kill for. They do not release open source AI tools for fun. When they do ship something publicly, it is worth paying attention. VOID is thei...

#Netflix has a visual effects budget most film studios would kill for. They do not release #opensource #AI for fun thus its worth paying attention.
VOID is their latest release. Video Object & Interaction Deletion. Point at an object in a video, & VOID removes it.
firethering.com/void-ai-vide...

1 week ago 2 0 1 0

Onyx: Open-Source AI Platform for RAG, Agents & LLM Apps Most LLM tools feel like demos. You ask something, get an answer, and that’s about it. Onyx feels more like something you’d actually build on. It sits between you and the model and adds the stuff you ...

Most LLM tools feel like demos. You ask something, get an answer, and that’s about it.
Onyx sits between you & the model & adds the stuff you end up needing anyway. Search, agents, file output, even running code.
#opensource #ai #trending #artificialintelligence
firethering.com/onyx-ai-plat...

1 week ago 4 1 0 0

Trinity-Large-Thinking: the open source brain your AI agents have been missing Most open source models that claim agentic capability are really just instruction-tuned models with tool calling bolted on. They can call a function. They cannot think across ten steps, remember what ...

Meet Trinity-Large-Thinking an #opensource AI Model with 398B total parameters, but only 13B active during inference. That MoE architecture means it runs closer to a 13B model in practice while carrying the knowledge of something nearly 30 times larger. #aiagent
firethering.com/trinity-larg...

1 week ago 4 0 0 0

EmDash: Cloudflare rebuilt WordPress for the agent-first web WordPress has a problem it cannot fix from the inside. Not a performance problem. Not a features problem. A structural one. 96% of its security vulnerabilities come from plugins, and the reason is sim...

EmDash is a CMS written entirely in TypeScript, serverless by default, powered by Astro under the hood. It is MIT licensed & fully #opensource. You can deploy it to Cloudflare in one click or run it on any Node.js server you already have. #trending
#wordpress
firethering.com/emdash-cloud...

1 week ago 3 0 0 0

Gemma 4 Makes Local AI Agents Actually Practical Gemma 4 is a family of four models. Two dense models built for phones and laptops, E2B and E4B. One MoE model at 26B A4B for consumer GPUs. One dense 31B for workstations and servers. All four are mul...

Gemma 4 is different. Not because #Google said so. Because of 3.8 billion active parameters inside a 26 billion parameter model. The short version is that for the first time, running a genuinely capable AI agent on a consumer GPU is not a compromise.
#opensource
firethering.com/gemma-4-loca...

1 week ago 2 0 0 0

6 Open Source Developer Tools Worth Switching to Paid developer tools have gotten expensive. Postman wants a subscription. DataGrip wants a subscription. Design tools, API clients, database managers, recording tools. Everything is moving to SaaS and...

Six #OpenSource Developers Tools That can replace you Subscription based SaaS, Worth checking out!

firethering.com/open-source-...

1 week ago 2 0 2 0

Claude Code's leaked source code reveals what Anthropic is actually building Anthropic accidentally leaked Claude Code's full source code revealing unreleased features including KAIROS persistent agents, dream mode and multi agent orchestration.

Developers who dug into the #leaked code found more than just the current #ClaudeCode architecture. They found fully built features that have not shipped yet, complete with internal names and system prompts.

firethering.com/claude-code-...

1 week ago 1 0 0 0

Another – Open Source Android Screen Mirror & Controller for Desktop Another puts your Android screen directly on your desktop and lets you control it entirely from your keyboard and mouse. It mirrors in real-time over USB or WiFi, forwards audio, lets you type directl...

Another puts your Android screen directly on your desktop & lets you control it entirely from your keyboard and mouse.
It mirrors in real-time over USB or WiFi, forwards audio, lets you type directly into the device, and records your screen as a .webm file #opensource
firethering.com/another-andr...

1 week ago 2 0 0 0

5 open source AI agentic models built for real autonomous work Getting an AI agent to start a task is easy. Getting it to finish one properly is a different story. Most agents fall apart somewhere in the middle. A tool returns unexpected output, the model misread...

The five #Agentic AI models handle complex multi-step tasks, real browser control, deep research and coding workflows. All #opensource & self hostable.
firethering.com/best-open-so...

2 weeks ago 2 0 0 0

daVinci-MagiHuman Finally Makes Open-Source AI Video Feel Real daVinci-MagiHuman processes text, video and audio inside a single unified transformer simultaneously. No separate models, no post processing alignment. The lip sync and facial dynamics are not correct...

Finally #opensource Community gave us another realistic #AI video Generation Model called daVinci-Magihuman
firethering.com/davinci-magi...

2 weeks ago 2 0 0 0

MiroThinker 1.7 Finally Brings Deep Research AI Agents to Open Source For deep research tasks, the options are mostly proprietary. Perplexity, ChatGPT DeepResearch, paid tools that do the job but keep your data on their servers and charge you monthly for the privilege. ...

MiroThinker 1.7 is an #opensource AI research agent built specifically for long horizon tasks. Not a chatbot or a reasoning model you prompt manually. An agent that searches, verifies, cross references and works through complex multistep research problems autonomously
firethering.com/mirothinker-...

2 weeks ago 2 0 0 0

Voxtral TTS: Mistral Is Pushing Voice AI Off the Cloud Voxtral TTS supports nine languages: English, French, German, Spanish, Dutch, Portuguese, Italian, Hindi, and Arabic. That by itself isn’t unusual anymore. A lot of models claim multilingual support. ...

Mistral AI is getting into voice now. They’ve put out Voxtral TTS, and yeah, on the surface it sounds like just another text-to-speech model. But once you look a bit closer, it’s not that simple.
#opensource #ai #mistral #tts
firethering.com/voxtral-tts-...

2 weeks ago 2 0 0 0

Modly: Open Source Local AI Image-to-3D Model Generator You've got a photo and you want a 3D model. Normally that means paying per generation on some cloud service that uploads your image to a server you'll never see. Modly skips all of that. It's a deskto...

You’ve got a photo and you want a #3D model. Normally that means paying per generation on some cloud service that uploads your image to a server you’ll never see. Modly skips all of that.
It’s #opensource app that convert any photo into a fully usable 3D mesh locally
firethering.com/modly-local-...

2 weeks ago 2 0 0 0

NVIDIA NemoClaw runs OpenClaw inside a secure sandbox and setup takes one command NemoClaw is an open source reference stack built by NVIDIA that runs OpenClaw inside a secure sandboxed environment. Think of it as a controlled container where your AI agent can work freely without b...

If you're a curious about Nemoclaw, like what is this and why its diffrent than open claw and looking for setting it up?
Here's everything you need to know #nemoclaw #opensource #openclaw

firethering.com/nvidia-nemoc...

2 weeks ago 3 0 0 0

Lore: Local AI Note Manager with Smart Recall & Private Second Memory Lore is a lightweight, privacy-first desktop app that lives quietly in your system tray and gives you a pop-up chat interface to capture thoughts the moment they happen. Powered entirely by a local LL...

Lore is a lightweight, privacy-first desktop app that lives quietly in your system tray and gives you a pop-up chat interface to capture thoughts the moment they happen. Powered entirely by a local #LLM through #Ollama #opensource
firethering.com/lore-local-a...

3 weeks ago 2 0 0 0

SparkVSR lets you control AI video upscaling with just a few keyframes A research team from Texas A&M and YouTube quietly dropped SparkVSR on GitHub. No big announcement or hype cycle. Just a repo and a paper. Everyone right now is chasing text to video. Sora, Kling, Wan...

SparkVSR is an #opensource video super resolution tool that takes low resolution video and restores it to high quality, but with one difference that separates it from everything else in this space. You can control the output using keyframes. #ai
firethering.com/sparkvsr-vid...

3 weeks ago 1 0 0 0

Recordly: Open-Source Screen Recorder & Editor for Windows, macOS & Linux Recordly is an open-source screen recorder and editor built for creating polished, professional-grade screen recordings without juggling multiple tools. Designed for developers, educators, and content...

Recordly is an #opensource #screenrecorder built for creating polished, screen recordings without juggling multiple tools. Designed for developers, educators, & content creators, lets you record screen or a specific window and jump straight into a built-in #editor
firethering.com/recordly-ope...

3 weeks ago 2 0 0 0

Microsoft MAI Image 2 is impressive, but it comes with serious limitations you should know Microsoft's MAI Image 2 just ranked third globally on Arena.ai. Here is what it genuinely does well, where it falls short, and what this launch actually signals about Microsoft's direction

#MAI-Image-2 is impressive in ways that are hard to ignore, but if you are a designer, a creative professional, or someone thinking about fitting this into a real workflow, there are a few things worth knowing before you get excited. #microsoft #ai
firethering.com/microsoft-ma...

3 weeks ago 1 0 0 0

Posts by Firethering