NVIDIA releases SANA-WM, a 2.6B open-source AI that turns text into 1-minute 720p videos.
- NVIDIA built SANA-WM with 2.6 billion parameters
- Generates one-minute 720p videos from text prompts
- Runs on a single RTX 4090 GPU
NVIDIA just open-sourced SANA-WM, a breakthrough AI model that turns text prompts into one-minute 720p videos. The model, built with 2.6 billion parameters, runs locally on a single NVIDIA RTX 4090 graphics card and doesn’t need cloud servers. Researchers and developers can download it for free from the NVIDIA SANA GitHub page. This isn’t the first text-to-video model, but it’s one of the few that runs without data-center hardware or an internet connection. Most alternatives still rely on supercomputers or paid APIs, making SANA-WM a rare affordable option.
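As a rough back-of-the-envelope check on the "runs on a single RTX 4090" claim (this calculation is not from the article, and fp16 weight storage is an assumption), the weights of a 2.6-billion-parameter model are actually quite compact:

```python
def weight_memory_gb(num_params: float, bytes_per_param: int = 2) -> float:
    """Estimate model weight memory in GB (fp16 = 2 bytes per parameter)."""
    return num_params * bytes_per_param / 1024**3

# 2.6 billion parameters in fp16:
print(round(weight_memory_gb(2.6e9), 2))  # -> 4.84
```

Under that assumption, the weights alone occupy under 5 GB, leaving most of a 4090's 24 GB of VRAM for activations and working memory, which is consistent with the single-card claim.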
SANA-WM comes from NVIDIA’s research team, the same group behind the Sana image generator. The model builds on that previous work but adds motion generation, turning static images into short video clips with only a few seconds of processing per clip. Unlike older video models that take minutes to render even low-res footage, SANA-WM keeps clips at 720p resolution and stays smooth at 24-30 frames per second. For context, that’s roughly the quality of a YouTube video uploaded a decade ago.
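For a sense of scale, the clip length and frame rates above imply a large amount of raw pixel data per generated clip. A quick sketch (uncompressed RGB at 3 bytes per pixel is an illustrative assumption; the article says nothing about encoding):

```python
def clip_stats(seconds: int = 60, fps: int = 24,
               width: int = 1280, height: int = 720):
    """Frame count and uncompressed RGB size (GB) for one clip."""
    frames = seconds * fps
    raw_gb = frames * width * height * 3 / 1024**3  # 3 bytes per RGB pixel
    return frames, round(raw_gb, 2)

print(clip_stats())        # 24 fps -> (1440, 3.71)
print(clip_stats(fps=30))  # 30 fps -> (1800, 4.63)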
The team tested SANA-WM on a variety of prompts, from simple scenes like “a cat sitting on a windowsill” to more complex ones like “a futuristic city at sunset with flying cars.” In each case, the model produced a one-minute clip that matched the prompt reasonably well. The videos aren’t Pixar quality, but they’re good enough for social media, presentations, or quick prototype testing. The model also handles motion better than most open-source alternatives. Where other models stutter or blur objects in motion, SANA-WM stays relatively stable, though fine details like hair and fabric still look a bit plastic.
NVIDIA didn’t just release the model; they included a full training dataset and code to let others tweak it. That means if you’ve got the skills, you can fine-tune SANA-WM for your own projects. The team even published benchmarks showing it outperforms similar open-source models like Phenaki and Make-A-Video in some areas, especially when it comes to motion coherence. The catch? You’ll need a beefy GPU. While the model runs on a single RTX 4090, older cards or lower-end GPUs will struggle or fail entirely.
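One way to read the “beefy GPU” caveat is that a card must hold the weights plus working memory for inference. A crude sketch of that check (the 8 GB headroom figure is an illustrative assumption, not a published requirement for SANA-WM):

```python
def fits_on_gpu(vram_gb: float, num_params: float = 2.6e9,
                bytes_per_param: int = 2, headroom_gb: float = 8.0) -> bool:
    """Crude check: fp16 weights plus working-memory headroom vs. VRAM."""
    weights_gb = num_params * bytes_per_param / 1024**3
    return vram_gb >= weights_gb + headroom_gb

print(fits_on_gpu(24.0))  # RTX 4090, 24 GB -> True
print(fits_on_gpu(8.0))   # typical 8 GB card -> False
```

Under these assumed numbers, 24 GB cards clear the bar comfortably while 8 GB cards fall short, matching the article's claim that lower-end GPUs will struggle or fail.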
This release fits into NVIDIA’s push to make AI tools more accessible. The company has been open-sourcing models left and right lately, from 3D scene generators to real-time voice changers. SANA-WM continues that trend, but with a twist: it’s one of the few models that doesn’t require you to pay per generation or rent cloud time. For indie developers, researchers, or even hobbyists, that’s a big deal. It lowers the barrier to experimenting with video generation, which could lead to some interesting indie games, YouTube content, or even training datasets for other AI models.
The big question now is how this affects the video generation market. Right now, most high-quality video AI still lives in closed ecosystems like Runway or Pika Labs. Those services charge per clip and often limit quality to attract users. SANA-WM doesn’t have those limits—but it also doesn’t have their polish. For now, it’s a tool for people who value control and cost over perfection. If NVIDIA keeps refining it, though, it could start to rival those paid services in quality too.
What You Need to Know
- Source: Hacker News
- Published: May 16, 2026 at 12:06 UTC
- Category: Technology
- Topics: #hackernews · #programming · #tech
Read the Full Story
This is a curated summary. For the complete article, original data, quotes, and full analysis, see the original story at Hacker News.
All reporting rights belong to the respective author(s) at Hacker News. GlobalBR News summarizes publicly available content to help readers discover the most relevant global news.
Curated by GlobalBR News · May 16, 2026
🇧🇷 Summary (translated from Portuguese)
NVIDIA has just launched a revolution in the world of artificial intelligence: SANA-WM, the world’s first open-source video world model, capable of generating realistic videos of up to one minute at 720p from simple text prompts. With 2.6 billion parameters, the tool democratizes visual content creation, previously restricted to studios and large companies, letting anyone with an ordinary computer produce high-quality videos at no cost.
The launch comes at a crucial moment for Brazil and the Portuguese-language market, where demand for digital content is growing exponentially, especially with the rise of digital marketing, social media, and online education. Until now, similar solutions were out of reach for many content creators because of high costs and technical complexity. With SANA-WM freely available, Brazilian developers, artists, and small businesses now have a powerful tool to innovate in explainer videos, tutorials, advertising campaigns, and even artistic productions, boosting local creativity.
The next frontier is adapting the model for Portuguese, ensuring the technology is truly accessible to speakers of the language, and NVIDIA has already signaled that it plans to explore that direction soon.
🇪🇸 Summary (translated from Spanish)
NVIDIA has taken a revolutionary step in artificial intelligence with the release of SANA-WM, an open-source video world model that promises to transform audiovisual content generation. The tool, capable of producing one-minute clips at 720p resolution from simple text prompts, arrives at a time when demand for digital content is growing exponentially, especially in the Spanish-speaking world, where access to advanced technologies still faces economic and technical barriers.
SANA-WM’s relevance lies not only in its technical capability but also in its open, free distribution model, which democratizes access to high-quality video creation. For Spanish-speaking users, this means an unprecedented opportunity to develop creative, educational, or commercial projects without depending on expensive software or proprietary platforms. And because it is open source, it fosters collaborative innovation in a region where local talent can be constrained by limited economic resources, opening doors to new industries and forms of cultural expression.