First AI Video-to-Video Editor
Kling O1
Experience the world's first AI model that edits existing videos with text prompts. Kling O1 combines multi-modal intelligence with unified processing to transform video creation—from simple edits to complex 2-minute productions with advanced reference support.
Revolutionary Video-to-Video Editing
Kling O1 is the first AI model to edit existing videos using natural language prompts. Transform your video content by adding elements, changing styles, or modifying scenes—all through simple text commands. This groundbreaking capability lets you iterate on video ideas faster than ever, turning rough drafts into polished productions without traditional editing software.
Unified Multi-Modal Intelligence Engine
Process text, images, and video simultaneously with Kling O1's unified multi-modal engine. Unlike other AI models that handle inputs separately, Kling combines all modalities into a single coherent understanding. Use up to 7+ simultaneous inputs including text prompts, reference images, and video clips to create complex scenes with natural physics, realistic motion, and seamless transitions.
Advanced Reference Support with Native @ Syntax
Combine multiple assets effortlessly using Kling O1's intuitive @ reference syntax. Simply type @ in your prompt to reference images, videos, or other elements, and the AI seamlessly integrates them into your generation. This native support for asset combination enables professional workflows where you can maintain character consistency, reference specific styles, and build complex scenes by orchestrating multiple inputs with precision.
Kling O1

Kling O1
Enhanced version optimized for professional workflows, offering superior quality, advanced physics simulation, and extended multi-modal processing capabilities.
How to Use Kling O1 on Auralume AI
Create or edit videos in three simple steps with Kling O1's intuitive workflow.
Add Your Prompts and Assets
Start by typing text prompts or uploading existing videos, images, and clips. Kling O1 accepts up to 7+ simultaneous inputs across all modalities. Use the native @ syntax to reference specific assets in your prompt, enabling precise control over how elements combine in your final video.
Configure Generation Settings
Select Kling O1 as your model and choose between video-to-video editing or text-to-video generation modes. Set your preferred aspect ratio, resolution, and video duration (up to 2 minutes). The unified engine automatically optimizes processing based on your selected configuration.
Generate and Export Your Video
Click generate and watch as Kling O1 processes your multi-modal inputs with advanced physics understanding. Videos typically render in 1-2 minutes depending on complexity. Export your AI-generated videos in high-quality formats ready for social media, marketing campaigns, or professional productions.
Frequently Asked Questions
Start Creating with Kling O1 Today
Experience the first AI video-to-video editor with multi-modal intelligence and 2-minute generation capabilities