SwitchX is a professional video-to-video AI generation tool designed for filmmakers, VFX artists, and creators. Unlike standard video-to-video models that hallucinate or alter the entire frame, SwitchX uses your original footage’s pixels as direct control signals. The core principle of SwitchX is simple: Switch anything, keep what matters.
  • Masked Areas: Completely generated based on your prompts and reference images.
  • Unmasked Areas: Retained from the original footage, but intelligently relit and restyled to blend seamlessly into the newly generated environment.
To get the best results from SwitchX, master two concepts: the Alpha Mask controls what gets changed, and the Reference Image controls how it looks.

Alpha Masks

The alpha mask tells SwitchX exactly what to generate and what to preserve. SwitchX offers four masking modes:
| Mode | Description | Best For |
| --- | --- | --- |
| Auto | Auto-detects and isolates the main subject. No manual input needed. | Background replacement, relighting, virtual production |
| Select | Manually select specific elements to keep or generate. | Targeted changes (e.g., new outfit, inpainting) |
| Fill | Selects the entire frame. Keeps geometry, alters aesthetics. | Full-frame relighting or restyling |
| Upload | Upload a custom alpha matte from external software. | Pixel-perfect control from Nuke, After Effects, etc. |

Auto Mode

Best for: Standard background replacements, relighting a subject, and virtual production.
The engine automatically detects and isolates the main subject in your first frame, then propagates and tracks that mask throughout the rest of the video. No manual selection is required.

Select Mode

Best for: Precision control, changing specific elements (e.g., changing clothes while keeping the face and hands intact), and advanced inpainting.
You manually choose exactly which parts of the frame to mask. Only the selected areas will be generated; everything else stays untouched from your original footage. Select mode is powered by an interactive AI masking tool (SAM3): click to select specific objects on the first frame. The Alpha Editor lets you fine-tune your selections:
  • Multiple Selections: Select multiple objects. For the cleanest results, add distinct parts (like a face and a hand) as separate objects rather than grouping them.
  • Deselect: Right-click to remove an unintended selection.
  • Invert: Invert the mask of a specific object. For example, to turn a hand into a robot arm, select the hand, click Invert, and the engine will only generate within that area.

Fill Mode

Best for: Complete scene relighting and restyling.
This mode selects the entire frame. Use Fill when you want to keep the original scene’s geometry and composition intact but alter the overall lighting, mood, or artistic style.

Upload Mode

Best for: VFX professionals using external compositing software.
Upload a precise alpha matte created in software like Nuke or After Effects. SwitchX reads the black-and-white alpha video pixel-by-pixel for absolute precision.
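Because the matte is read pixel by pixel, it can help to inspect a frame before uploading. The sketch below is a hypothetical pre-upload check, not part of SwitchX: the function name `check_matte_frame`, the `tolerance` parameter, and the idea that the matte should match the source dimensions are all assumptions made for illustration. It also does not assume which of black or white marks the generated region, since that is not stated here.

```python
def check_matte_frame(matte, source_width, source_height, tolerance=8):
    """Inspect one matte frame given as rows of 0-255 gray values.

    Returns (size_ok, gray_fraction). Both checks are illustrative
    assumptions, not documented SwitchX requirements.
    """
    height = len(matte)
    width = len(matte[0]) if height else 0
    # Assumption: the matte should have the same dimensions as the source.
    size_ok = (width, height) == (source_width, source_height)

    total = width * height
    # Count pixels that are neither near-black nor near-white.
    gray = sum(
        1
        for row in matte
        for value in row
        if tolerance < value < 255 - tolerance
    )
    gray_fraction = gray / total if total else 0.0
    return size_ok, gray_fraction


# A tiny 4x2 example frame with one mid-gray pixel.
frame = [
    [0, 0, 255, 255],
    [0, 128, 255, 255],
]
size_ok, gray_fraction = check_matte_frame(frame, source_width=4, source_height=2)
print(size_ok, gray_fraction)
```

A high gray fraction usually means soft edges or compression artifacts in the exported matte, which is worth reviewing in your compositing software before upload.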

Reference Images

The reference image is your visual blueprint. It should contain all the details you want in your final video — background, lighting, mood, and costumes. SwitchX reads this image and uses it as a guide:
  • Masked areas: SwitchX generates the masked region’s background and content from the reference image.
  • Unmasked areas: SwitchX extracts the style and lighting from the reference image and applies it to your original footage, preserving the original pixels.
Figure panels: Source, Reference Image, SwitchX Result.

Imperfect References Are Fine

Your reference image doesn’t need to perfectly match your source footage. For example, if you generate a reference with Nano Banana and the face changes, that’s fine. SwitchX keeps your original pixels and takes only the lighting and style from the reference, applying them on top of your actual footage.

Creating a Reference Image

You can upload any image, or use the built-in Create with AI tool. This sends your video’s first frame to our Nano Banana or Flux models to generate a matching reference.

Prompting Guidelines

Avoid vague prompts. Prompts like “Take me to heaven” will yield poor results. AI requires precise art direction.
Be highly specific. Specify the environment, lighting, and mood — for example: “A dramatic cliff in Ireland, soft overcast lighting, highly detailed props.”
If you are struggling with art direction, use a text AI like ChatGPT to generate five detailed prompt variations based on your core concept.
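One way to keep prompts specific is to assemble them from named parts so that nothing is left out. The helper below is a minimal sketch for your own scripts, not a SwitchX feature: `build_prompt` and its parameters are hypothetical names chosen for this example.

```python
def build_prompt(environment, lighting, details=()):
    """Join art-direction components into one comma-separated prompt.

    Hypothetical helper: raises ValueError if any component is blank,
    so a vague prompt with missing parts is caught early.
    """
    parts = [environment, lighting, *details]
    cleaned = [part.strip().rstrip(".") for part in parts]
    if not all(cleaned):
        raise ValueError("every prompt component must be non-empty")
    return ", ".join(cleaned) + "."


prompt = build_prompt(
    environment="A dramatic cliff in Ireland",
    lighting="soft overcast lighting",
    details=["highly detailed props"],
)
print(prompt)
```

With the values above, the helper reproduces the example prompt from this section; mood or costume notes can be passed as extra entries in `details`.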

The Professional Workflow (Iteration)

For pixel-perfect results, do not rely solely on the initial AI-generated image:
  1. Generate a reference image using the Create with AI tool.
  2. Download the generated image to your local machine.
  3. Refine in Photoshop or similar — adjust color grading, add props, fix details.
  4. Re-upload the edited image into SwitchX. The final video will strictly follow this tailored reference.

Settings & Export

Video Prompt

Autopilot recommended. Leave the Video Prompt setting on Autopilot — the engine handles prompt engineering behind the scenes. Turn this off only if you need to manually override an unexpected result.

Resolution and Project Specs

| Setting | Details |
| --- | --- |
| Resolution | 720p or 1080p |
| Aspect Ratio | Always preserved |
| Frame Rate | Always preserved |
SwitchX will never distort your original aspect ratio or frame rate. The engine only resizes the shortest side of your video to match the chosen resolution, so you can iterate quickly without breaking your project specifications.
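The shortest-side rule can be worked through with a little arithmetic. The sketch below estimates output dimensions under that rule; the function name `output_size` is hypothetical, and rounding to the nearest whole pixel is an assumption, since the exact rounding SwitchX applies is not documented here.

```python
def output_size(width, height, target):
    """Scale a frame so its shortest side equals `target` (720 or 1080).

    The aspect ratio is preserved. Rounding to the nearest pixel is an
    assumption made for this illustration.
    """
    if target not in (720, 1080):
        raise ValueError("target must be 720 or 1080")
    shortest = min(width, height)
    new_width = round(width * target / shortest)
    new_height = round(height * target / shortest)
    return new_width, new_height


# A 3840x2160 landscape source at 1080p, and a 1080x1920 vertical source at 720p.
print(output_size(3840, 2160, 1080))
print(output_size(1080, 1920, 720))
```

Under these assumptions a 3840x2160 source becomes 1920x1080 at 1080p, and a vertical 1080x1920 source becomes 720x1280 at 720p, so both keep their original 16:9 and 9:16 shapes.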