SwitchX is a professional video-to-video AI generation tool designed for filmmakers, VFX artists, and creators. Unlike standard video-to-video models that hallucinate or alter the entire frame, SwitchX uses your original footage’s pixels as direct control signals.
The core principle of SwitchX is simple: Switch anything, keep what matters.
- Masked Areas: Completely generated based on your prompts and reference images.
- Unmasked Areas: Retained from the original footage, but intelligently relit and restyled to blend seamlessly into the newly generated environment.
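The masked/unmasked split above can be pictured as a per-pixel blend. The following is a conceptual sketch only, not SwitchX's actual implementation: the `composite` and `warm_relight` functions are hypothetical, pixels are reduced to single grayscale values, and the convention that a mask value of 1.0 means "generate" is an assumption made for illustration.

```python
# Conceptual illustration only: this is NOT SwitchX's actual implementation.
# Pixels are single grayscale values in [0.0, 1.0] to keep the sketch small.

def composite(original, generated, mask, relight):
    """Blend two frames per pixel.

    Assumed convention for this sketch:
    mask value 1.0 -> take the generated pixel (masked area)
    mask value 0.0 -> keep the original pixel, after relighting (unmasked area)
    """
    out = []
    for orig_row, gen_row, mask_row in zip(original, generated, mask):
        row = []
        for o, g, m in zip(orig_row, gen_row, mask_row):
            kept = relight(o)  # original pixel, restyled to match the new scene
            row.append(m * g + (1.0 - m) * kept)
        out.append(row)
    return out


def warm_relight(value):
    # Hypothetical stand-in for relighting: brighten by 10%, clamp to 1.0.
    return min(1.0, value * 1.1)


original = [[0.5, 0.5], [0.5, 0.5]]   # source footage
generated = [[0.9, 0.9], [0.9, 0.9]]  # newly generated content
mask = [[1.0, 0.0], [0.0, 1.0]]       # 1.0 = masked, 0.0 = unmasked

result = composite(original, generated, mask, warm_relight)
```

In this toy example the masked pixels take the generated value outright, while the unmasked pixels keep their original value with only the lighting adjustment applied.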
Alpha Masks
The alpha mask tells SwitchX exactly what to generate and what to preserve. SwitchX offers four masking modes:

| Mode | Description | Best For |
|---|---|---|
| Auto | Auto-detects and isolates the main subject. No manual input needed. | Background replacement, relighting, virtual production |
| Select | Manually select specific elements to keep or generate. | Targeted changes (e.g., new outfit, inpainting) |
| Fill | Selects the entire frame. Keeps geometry, alters aesthetics. | Full-frame relighting or restyling |
| Upload | Upload a custom alpha matte from external software. | Pixel-perfect control from Nuke, After Effects, etc. |
Auto Mode
Best for: Standard background replacements, relighting a subject, and virtual production.

Select Mode
Best for: Precision control, changing specific elements (e.g., changing clothes while keeping the face and hands intact), and advanced inpainting.


Fill Mode
Best for: Complete scene relighting and restyling.

Upload Mode
Best for: VFX professionals using external compositing software.
Reference Images
The reference image is your visual blueprint. It should contain all the details you want in your final video: background, lighting, mood, and costumes. SwitchX reads this image and uses it as a guide:
- Masked areas: SwitchX generates the background and content of the masked region from the reference image.
- Unmasked areas: SwitchX extracts the style and lighting from the reference image and applies them to your original footage, preserving the original pixels.



Imperfect References Are Fine
Your reference image doesn’t need to perfectly match your source footage. For example, if you generate a reference with Nano Banana and the face changes, that’s fine. SwitchX knows which pixels belong to your original footage and brings over only the lighting and style from the reference, applying them on top of your actual footage.

Creating a Reference Image
You can upload any image, or use the built-in Create with AI tool. This sends your video’s first frame to our Nano Banana or Flux models to generate a matching reference.

Prompting Guidelines
If you are struggling with art direction, use a text AI like ChatGPT to generate five detailed prompt variations based on your core concept.

The Professional Workflow (Iteration)
For pixel-perfect results, do not rely solely on the initial AI-generated image:
- Generate a reference image using the Create with AI tool.
- Download the generated image to your local machine.
- Refine in Photoshop or similar — adjust color grading, add props, fix details.
- Re-upload the edited image into SwitchX. The final video will strictly follow this tailored reference.
Settings & Export
Video Prompt
Resolution and Project Specs
| Setting | Details |
|---|---|
| Resolution | 720p or 1080p |
| Aspect Ratio | Always preserved |
| Frame Rate | Always preserved |
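
To estimate the pixel dimensions of an export before rendering, the table above can be turned into a small calculation. This is a rough sketch under stated assumptions, not documented SwitchX behavior: the `output_size` helper is hypothetical, it assumes "720p" and "1080p" refer to the length of the shorter side, and it assumes dimensions are rounded to even integers.

```python
def output_size(src_width, src_height, target):
    """Return (width, height) for a target of "720p" or "1080p".

    Assumptions (not confirmed by the SwitchX documentation):
    - the target number is the length of the shorter side
    - dimensions are rounded to the nearest even integer
    The aspect ratio of the source is preserved.
    """
    short_side = {"720p": 720, "1080p": 1080}[target]
    scale = short_side / min(src_width, src_height)

    def even(x):
        # Round to the nearest even integer.
        return int(round(x / 2)) * 2

    return even(src_width * scale), even(src_height * scale)


landscape = output_size(3840, 2160, "1080p")  # 16:9 source
vertical = output_size(1080, 1920, "720p")    # 9:16 source
print(landscape, vertical)
```

Under these assumptions, a 3840x2160 source exported at 1080p comes out at 1920x1080, and a 1080x1920 vertical source exported at 720p comes out at 720x1280.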