TOPIC · 23 stories

Open Source

Stories, products, and related signals connected to this tag in Explore.

Stories

AgenticSeek guide ships local agent stack with Ollama, SearXNG, and Docker

Hasantoxr documents AgenticSeek with Ollama, SearXNG, and Docker, including install steps, model config, and a locked WORK_DIR. The stack keeps models, chats, and files on the user's machine.

RELEASE · 2w ago

Wan-Streamer v0.1 opens real-time video agents with live voice demos

Wan-Streamer v0.1 surfaced with paper links and demos showing real-time video conversations, live recording, and spoken avatar responses. That matters for interactive characters and live creator tools because multimodal generation moves from rendered clips to low-latency back-and-forth.

WORKFLOW · 2w ago

Krea 2 Turbo community releases GGUF ports: RTX 3090 tests report 1.9x int8 speedups

Builders published GGUF conversions, loader nodes, and local benchmarks for Krea 2 Turbo after yesterday’s open-weights release, alongside new multi-style and watercolor tests. The follow-up matters because creators now have clearer ways to run, tune, and style-push Krea locally on smaller VRAM budgets.

RELEASE · 2w ago

Krea 2 Turbo releases open weights with ComfyUI workflows and 8 GB community ports

Krea 2 Turbo arrived with open weights, commercial rights, ComfyUI workflows, and community GGUF and FP8 ports that users say can run locally on modest hardware. Early benchmarks praise speed and style range, while some testers flag drift toward recognizable IP.

RELEASE · 1mo ago

Ideogram 4.0 releases as open-weight image model with JSON layout control

Ideogram 4.0 shipped as an open-weight image model with JSON prompting, bounding boxes, stronger text rendering, and native 2048px output. The release targets layout-heavy creative work, and teams can test early fal and Leonardo integrations in production flows.

NEWS · 1mo ago

Runway joins NVIDIA Cosmos Coalition on a codeveloped base world model

Runway joined NVIDIA's new Cosmos Coalition as a founding member, and Runway says the group's first project is a base model it is codeveloping with NVIDIA. NVIDIA also says Cosmos 3 is fully open with weights and post-training recipes, so teams can track the shared world-model stack.

RELEASE · 1mo ago

Presenton adds self-hosted prompt-to-PPTX exports with Ollama support

Presenton was highlighted as an open-source presentation generator that turns prompts and documents into editable PPTX or PDF decks and can run self-hosted with BYOK and Ollama. Use it if you want slide generation outside locked SaaS editors while keeping exportable files.

RELEASE · 1mo ago

SenseNova U1 open-sources unified image-text generation with 2K images in ~15s

Posts report that SenseTime open-sourced SenseNova U1, a unified text-image model with interleaved generation, an 8-step distilled LoRA, and ComfyUI workflows. They cite 2K image times of around 15 seconds, dropping to about 2 seconds on an H100, so compare it against your current image pipeline.

RELEASE · 1mo ago

OmniVoice Studio opens local dubbing for 600 languages from one MP4

A community post spotlights OmniVoice Studio, an open-source local dubbing pipeline that transcribes, translates, clones a voice from 3 seconds of audio, and remixes the dubbed audio back into the video. Running locally keeps voice data on device and removes subscription costs, so it may fit privacy-sensitive dubbing workflows.

RELEASE · 1mo ago

BenchLocal releases v0.2.6 with offline-skip runs for tight-VRAM tests

BenchLocal v0.2.6 adds reachability checks so offline local models are skipped and resumed instead of breaking side-by-side tests. The update is aimed at tight-VRAM setups where creators and tinkerers load providers one after another on the same machine.

RELEASE · 1mo ago

Supertone opens Supertonic with ONNX on-device TTS

Supertone open-sourced Supertonic, a local TTS engine that runs faster than real time on phone CPUs with ONNX models and cross-language runtimes. Voice apps and audiobook workflows can use it to avoid per-character API billing and keep audio generation private.

RELEASE · 1mo ago

Harbor releases v0.4.18 with Open Design and Voicebox

Harbor 0.4.18 added one-command access to Open Design and Voicebox, bundling a local-first design app and a voice cloning and TTS studio inside one homelab layer. The release cuts setup friction, so users can migrate both tools into a single local install path.

WORKFLOW · 1mo ago

Open toolkit turns one image into an interactive 3D world with meshes and audio

Posts show an open-source toolkit that turns one reference image into an interactive 3D scene with generated meshes, lighting, physics, and sound. The demo stack chains World Labs, Hunyuan 3D, ElevenLabs, and fal rather than a single native model.

RELEASE · 2mo ago

Recordly launches free AGPL screen recorder with auto-zoom on macOS, Windows, and Linux

Recordly launched as a free open-source alternative to Screen Studio with auto-zoom, cursor effects, local editing, MP4 and GIF export, and an extension marketplace. It matters for tutorial and product-demo creators because capture and export stay on-device, though current evidence is still mostly repo promotion and reposts.

NEWS · 2mo ago

Blender reports Anthropic sponsorship after €240K payment draws AI takeover backlash

An 80.lv-cited report said Anthropic paid €240K to sponsor Blender, while Blender leadership said the deal was not an AI takeover and creator posts said the public patronage was later pulled. The connector still appears to ship, but the funding tie-in became a flashpoint inside open 3D communities.

NEWS · 2mo ago

MeiGen launches free prompt gallery with Claude MCP and Figma hooks

Posts describe MeiGen as a free prompt gallery for GPT Image 2, Nano Banana 2, Seedance 2.0, Veo 3.1 and Midjourney, with drag-to-canvas generation and reverse prompting. The thread says the dataset is open source and already wired into Claude, Figma and OpenClaw workflows.

RELEASE · 2mo ago

LTX releases HDR beta with scene-linear EXR output and ComfyUI support

Posts say LTX HDR can generate HDR video or convert SDR footage to scene-linear EXR through API, ComfyUI and Hugging Face. The beta matters because EXR output is described as workable in Resolve and Nuke, but the release is still tagged v0.9.

RELEASE · 2mo ago

DeepSeek V4 Preview opens 1M context with Flash and Pro variants

DeepSeek V4 Preview surfaced as an open-source 1M-context model family, with early docs and community testing pointing to Flash and Pro variants. The release matters for creators and vibe coders looking at self-hosted options, but most performance claims are still coming from first-wave community benchmarks.

RELEASE · 2mo ago

Google opens DESIGN.md draft spec with CLI validator in progress

Google published the draft DESIGN.md specification so colors, typography, components, and rules can live in one AI-readable file, with a CLI validator and components support in progress. That matters because design agents and handoff tools can point to one structured source of truth instead of inferring UI rules from scattered docs.

RELEASE · 2mo ago

OpenGame releases prompt-to-web-game agent with playable demos

OpenGame released an open-source agent that turns prompts into playable web games, with public demos spanning shooters, quiz battlers and 90s-style fighters. The release matters because game ideas now arrive as runnable browser prototypes rather than static mockups, though the current proof points are demo-heavy.

RELEASE · 2mo ago

LTX 2.3 adds distilled LoRA v1.1 for better motion-audio sync

Stable Diffusion and VFX creators say LTX 2.3's distilled LoRA v1.1 improves motion and custom-audio sync. Posts show local short-film and flight-shot workflows running through ComfyUI and Resolve on consumer GPUs.

RELEASE · 2mo ago

Tencent releases HY-World 2.0 with persistent 3D world export

Tencent released HY-World 2.0 with WorldMirror 2.0 code and weights for turning text, images, or video into persistent 3D scenes. The output includes navigable geometry and camera data instead of disposable video frames.

RELEASE · 3mo ago

VoxCPM releases 2B voice model with 3-second cloning and 30-language support

OpenBMB released VoxCPM on GitHub with text-described voice design, 3-second cloning, 48kHz audio, and 30-language support. The Apache 2.0 release makes multilingual voice work and local self-hosting cheaper.