nvidia/chrono-edit
nvidia
image-to-image
NVIDIA Chrono Edit is a state-of-the-art image-to-image AI editor that turns photos into stylized edits and retouches with a few clicks. Ready-to-use REST inference API, best performance, no cold starts.
wavespeed-ai/ltx-2.3/lipsync
wavespeed-ai
digital-human
LTX-2.3 Lipsync is an audio-driven digital human model that generates synchronized talking head videos from a reference image and audio input. It leverages the LTX-2.3 19B DiT architecture to produce high-fidelity lip-synced videos with natural head movements.
wavespeed-ai/wan-2.2/animate
wavespeed-ai
motion-control
Wan2.2-Animate is a unified character animation and replacement model that replicates movement and expression and generates 720p videos up to 120s long. Ready-to-use REST inference API, best performance, no cold starts.
google/nano-banana-2/text-to-image
google
text-to-image
Google's Nano Banana 2 (Gemini 3.1 Flash Image) is a cutting-edge text-to-image model that enables 4K image generation optimized for phones. Ready-to-use REST inference API, best performance, no cold starts.
bytedance/seedance-2.0/text-to-video
bytedance
text-to-video
Seedance 2.0 (Text-to-Video) generates Hollywood-grade cinematic videos from text prompts with native audio-visual synchronization, director-level camera and lighting control, and exceptional motion stability. Built on ByteDance Seed's unified multimodal architecture, it leads on instruction adherence, motion quality, and visual aesthetics. Ready-to-use REST inference API, best performance, no cold starts.
x-ai/grok-imagine-video/text-to-video
x-ai
text-to-video
Generate videos from text descriptions using xAI's Grok Imagine Video model. Create high-quality videos with customizable duration, aspect ratio, and resolution.