AI Primer
TOOL · 17 stories

Open Source

Stories, products, and related signals connected to this tag in Explore.

RELEASE · 1w ago
Presenton adds self-hosted prompt-to-PPTX exports with Ollama support

Presenton was highlighted as an open-source presentation generator that turns prompts and documents into editable PPTX or PDF decks and can run self-hosted with bring-your-own-key (BYOK) model access and Ollama support. Use it if you want slide generation outside locked SaaS editors while keeping exportable files.

RELEASE · 1w ago
SenseNova U1 open-sources unified image-text generation with 2K images in ~15s

Posts report that SenseTime open-sourced SenseNova U1, a unified text-image model with interleaved generation, an 8-step distilled LoRA, and ComfyUI workflows. They cite 2K image generation in about 15 seconds, with inference reportedly dropping to about 2 seconds on an H100, so compare it against your current image pipeline.

RELEASE · 1w ago
OmniVoice Studio opens local dubbing for 600 languages from one MP4

A community post spotlights OmniVoice Studio, an open-source local dubbing pipeline that transcribes, translates, clones a voice from 3 seconds of audio, and remixes the dubbed audio back into the video. Running locally keeps voice data on device and removes subscription costs, so it may fit privacy-sensitive dubbing workflows.

RELEASE · 1w ago
Supertone opens Supertonic with ONNX on-device TTS

Supertone open-sourced Supertonic, a local TTS engine that runs faster than real time on phone CPUs with ONNX models and cross-language runtimes. Voice apps and audiobook workflows can use it to avoid per-character API billing and keep audio generation private.

RELEASE · 1w ago
BenchLocal releases v0.2.6 with offline-skip runs for tight-VRAM tests

BenchLocal v0.2.6 adds reachability checks so offline local models are skipped and resumed instead of breaking side-by-side tests. The update is aimed at tight-VRAM setups where creators and tinkerers load providers one after another on the same machine.
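
The skip-and-resume behavior described above can be pictured with a small sketch. This is not BenchLocal's code or API: the function names `is_reachable` and `run_pass`, the provider URLs, and the TCP-connect check are all assumptions made for illustration, using only the Python standard library.

```python
import socket
from urllib.parse import urlparse


def is_reachable(url, timeout=1.0):
    """Return True if a TCP connection to the URL's host and port succeeds.

    Hypothetical check: a real tool might call a health endpoint instead.
    """
    parsed = urlparse(url)
    port = parsed.port or (443 if parsed.scheme == "https" else 80)
    try:
        with socket.create_connection((parsed.hostname, port), timeout=timeout):
            return True
    except OSError:
        return False


def run_pass(providers, pending, results, run_one, check=is_reachable):
    """Run every pending provider that is reachable; return those still offline.

    providers: dict mapping provider name -> base URL (illustrative values)
    pending:   list of provider names not yet benchmarked
    results:   dict updated in place with name -> run_one(name, url)
    """
    still_pending = []
    for name in pending:
        url = providers[name]
        if check(url):
            results[name] = run_one(name, url)
        else:
            # Offline provider: skip it now and keep it queued for a later pass.
            still_pending.append(name)
    return still_pending
```

Calling `run_pass` again with the returned list resumes only the providers that were offline earlier, which matches the one-provider-at-a-time pattern on a tight-VRAM machine. Passing a custom `check` makes the loop easy to exercise without any network access.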

RELEASE · 1w ago
Harbor releases v0.4.18 with Open Design and Voicebox

Harbor 0.4.18 added one-command access to Open Design and Voicebox, bundling a local-first design app and a voice cloning and TTS studio inside one homelab layer. The release cuts setup friction, so users can migrate both tools into a single local install path.

WORKFLOW · 1w ago
Open toolkit turns one image into an interactive 3D world with meshes and audio

Posts show an open-source toolkit that turns one reference image into an interactive 3D scene with generated meshes, lighting, physics, and sound. The demo stack chains World Labs, Hunyuan 3D, ElevenLabs, and fal rather than a single native model.

RELEASE · 3w ago
Recordly launches free AGPL screen recorder with auto-zoom on macOS, Windows, and Linux

Recordly launched as a free open-source alternative to Screen Studio with auto-zoom, cursor effects, local editing, MP4 and GIF export, and an extension marketplace. It matters for tutorial and product-demo creators because capture and export stay on-device, though current evidence is still mostly repo promotion and reposts.

NEWS · 3w ago
Blender reports Anthropic sponsorship after €240K payment draws AI takeover backlash

An 80.lv-cited report said Anthropic paid €240K to sponsor Blender, while Blender leadership said the deal was not an AI takeover and creator posts said the public patronage was later pulled. The connector still appears to ship, but the funding tie-in became a flashpoint inside open 3D communities.

RELEASE · 4w ago
LTX releases HDR beta with scene-linear EXR output and ComfyUI support

Posts say LTX HDR can generate HDR video or convert SDR footage to scene-linear EXR through an API, ComfyUI, and Hugging Face. The beta matters because EXR output is described as workable in Resolve and Nuke, but the release is still tagged v0.9.

NEWS · 4w ago
MeiGen launches free prompt gallery with Claude MCP and Figma hooks

Posts describe MeiGen as a free prompt gallery for GPT Image 2, Nano Banana 2, Seedance 2.0, Veo 3.1 and Midjourney, with drag-to-canvas generation and reverse prompting. The thread says the dataset is open source and already wired into Claude, Figma and OpenClaw workflows.

RELEASE · 1mo ago
DeepSeek V4 Preview opens 1M context with Flash and Pro variants

DeepSeek V4 Preview surfaced as an open-source 1M-context model family, with early docs and community testing pointing to Flash and Pro variants. The release matters for creators and vibe coders looking at self-hosted options, but most performance claims are still coming from first-wave community benchmarks.

RELEASE · 1mo ago
Google opens DESIGN.md draft spec with CLI validator in progress

Google published the draft DESIGN.md specification so colors, typography, components, and rules can live in one AI-readable file, with a CLI validator and components support in progress. That matters because design agents and handoff tools can point to one structured source of truth instead of inferring UI rules from scattered docs.

RELEASE · 1mo ago
OpenGame releases prompt-to-web-game agent with playable demos

OpenGame released an open-source agent that turns prompts into playable web games, with public demos spanning shooters, quiz battlers and 90s-style fighters. The release matters because game ideas now arrive as runnable browser prototypes rather than static mockups, though the current proof points are demo-heavy.

RELEASE · 1mo ago
LTX 2.3 adds distilled LoRA v1.1 for better motion-audio sync

Stable Diffusion and VFX creators say LTX 2.3's distilled LoRA v1.1 improves motion and custom-audio sync. Posts show local short-film and flight-shot workflows running through ComfyUI and Resolve on consumer GPUs.

RELEASE · 1mo ago
Tencent releases HY-World 2.0 with persistent 3D world export

Tencent released HY-World 2.0 with WorldMirror 2.0 code and weights for turning text, images, or video into persistent 3D scenes. The output includes navigable geometry and camera data instead of disposable video frames.

RELEASE · 1mo ago
VoxCPM releases 2B voice model with 3-second cloning and 30-language support

OpenBMB released VoxCPM on GitHub with text-described voice design, 3-second cloning, 48kHz audio, and 30-language support. The Apache 2.0 release makes multilingual voice work and local self-hosting cheaper.

Your daily guide to AI tools, workflows, and creative inspiration.

© 2026 AI Primer. All rights reserved.