SwitchX is available exclusively on Beeble (Cloud app). It is not
available in Beeble Studio.
- Masked Areas: Completely generated based on your prompts and reference images.
- Unmasked Areas: Retained from the original footage, but intelligently relit and restyled to blend seamlessly into the newly generated environment.
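
The split between masked and unmasked areas can be pictured as a per-pixel blend. The sketch below is only a conceptual model, not SwitchX's actual pipeline: the `relight` helper, the `gain` parameter, and the mask polarity (1.0 = masked, 0.0 = unmasked) are all assumptions made for illustration.

```python
# Conceptual model only. SwitchX's internals are not documented here.
# Assumption: mask 1.0 = masked (generate), mask 0.0 = unmasked (keep).

def relight(pixel, gain):
    """Hypothetical stand-in for relighting: scale each channel, clamp to 0..255."""
    return tuple(min(255, round(c * gain)) for c in pixel)


def composite(original, generated, mask, gain=1.0):
    """Blend pixel lists: masked pixels take generated content,
    unmasked pixels keep the (relit) original footage."""
    out = []
    for orig_px, gen_px, m in zip(original, generated, mask):
        kept = relight(orig_px, gain)
        out.append(tuple(round(m * g + (1.0 - m) * k) for g, k in zip(gen_px, kept)))
    return out


original = [(200, 180, 160), (40, 40, 40)]   # two RGB pixels from the source footage
generated = [(10, 20, 30), (90, 120, 200)]   # two RGB pixels of generated content
mask = [0.0, 1.0]                            # first pixel kept, second pixel generated

result = composite(original, generated, mask, gain=0.5)
print(result)
```

The first pixel keeps its original content with the lighting adjusted, while the second is replaced entirely by generated content.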
Alpha Masks
The alpha mask tells SwitchX exactly what to generate and what to preserve. SwitchX offers four masking modes:

| Mode | Description | Best For |
|---|---|---|
| Auto | Auto-detects and isolates the main subject. No manual input needed. | Background replacement, relighting, virtual production |
| Select | Manually select specific elements to keep or generate. | Targeted changes (e.g., new outfit, inpainting) |
| Fill | Selects the entire frame. Keeps geometry, alters aesthetics. | Full-frame relighting or restyling |
| Upload | Upload a custom alpha matte from external software. | Pixel-perfect control from Nuke, After Effects, etc. |
Auto Mode
Best for: Standard background replacements, relighting a subject, and
virtual production.

Select Mode
Best for: Precision control, changing specific elements (e.g., changing
clothes while keeping the face and hands intact), and advanced inpainting.


Fill Mode
Best for: Complete scene relighting and restyling.

Upload Mode
Best for: VFX professionals using external compositing software.
Camera Tracking
SwitchX only sees the unmasked (foreground) region of your video. The masked area is completely hidden from the model, meaning SwitchX must infer all camera movement solely from the visible foreground pixels. This has a direct impact on camera tracking quality:

- Where it excels: Shots where the foreground contains rich visual data to infer motion, such as a subject with complex movement, organic camera shake, or visible depth changes.
- Where it struggles: Simple, linear camera movements (like lateral trucking/panning shots) where the isolated foreground lacks sufficient parallax cues to estimate motion accurately.
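
To see why a lateral truck can leave few parallax cues, consider the standard pinhole-camera approximation: a point at depth Z shifts by roughly f * t / Z pixels when the camera moves sideways by t. The sketch below uses that textbook relation with made-up numbers (focal length, move distance, and depths are all hypothetical) and says nothing about how SwitchX estimates motion internally.

```python
# Pinhole approximation of parallax under a lateral camera move.
# All numeric values are hypothetical and chosen only for illustration.

def image_shift_px(focal_px, truck_m, depth_m):
    """Approximate horizontal image shift (pixels) of a point at depth_m
    when the camera translates sideways by truck_m."""
    return focal_px * truck_m / depth_m


def parallax_px(focal_px, truck_m, near_m, far_m):
    """Difference in image shift between the nearest and farthest visible points."""
    return image_shift_px(focal_px, truck_m, near_m) - image_shift_px(focal_px, truck_m, far_m)


focal_px = 1000.0   # focal length expressed in pixels
truck_m = 0.5       # sideways camera move in metres

# Foreground only: a subject spanning 3.0 m to 3.2 m from the camera.
narrow = parallax_px(focal_px, truck_m, near_m=3.0, far_m=3.2)

# Full scene: the same subject plus a wall 10 m away.
wide = parallax_px(focal_px, truck_m, near_m=3.0, far_m=10.0)

print(f"foreground only: {narrow:.1f} px, full scene: {wide:.1f} px")
```

With only a shallow foreground visible, every pixel shifts by nearly the same amount, so the relative motion that reveals a camera move is much smaller than it would be with the background in view.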
Reference Images
The reference image is your visual blueprint. It should contain all the details you want in your final video: background, lighting, mood, and costumes. SwitchX reads this image and uses it as a guide:

- Masked areas: SwitchX generates the background and content from the reference image into the masked region.
- Unmasked areas: SwitchX extracts the style and lighting from the reference image and applies it to your original footage, preserving the original pixels.



Imperfect References Are Fine
Your reference image doesn’t need to perfectly match your source footage. For example, if you generate a reference with Nano Banana and the face changes, that’s fine. SwitchX understands what your original pixels are and will only bring the lighting and style from the reference, applying it on top of your actual footage.

Creating a Reference Image
You can upload any image, or use the built-in Create with AI tool. This sends your video’s first frame to our Nano Banana or Flux models to generate a matching reference.

Prompting Guidelines
If you are struggling with art direction, use a text AI like ChatGPT to generate five detailed prompt variations based on your core concept.

The Professional Workflow (Iteration)
For pixel-perfect results, do not rely solely on the initial AI-generated image:

- Generate a reference image using the Create with AI tool.
- Download the generated image to your local machine.
- Refine in Photoshop or similar — adjust color grading, add props, fix details.
- Re-upload the edited image into SwitchX. The final video will strictly follow this tailored reference.
Settings & Export
Video Prompt
Resolution and Project Specs
| Setting | Details |
|---|---|
| Resolution | 720p or 1080p |
| Aspect Ratio | Always preserved |
| Frame Rate | Always preserved |
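
Because the aspect ratio is always preserved, the output dimensions follow from the source dimensions and the chosen resolution. The sketch below is one way to estimate them ahead of time. It assumes that "720p" and "1080p" refer to the shorter side of the frame and that dimensions are rounded to even numbers; neither assumption is confirmed by this page, and `output_size` is a hypothetical helper, not part of any Beeble tool.

```python
# Estimate output dimensions for a given source size and target resolution.
# Assumptions (not confirmed): the label names the shorter side of the frame,
# and width/height are rounded to the nearest even number.

SHORT_SIDE = {"720p": 720, "1080p": 1080}


def output_size(src_w, src_h, target):
    """Return (width, height) scaled so the shorter side matches the target,
    keeping the source aspect ratio."""
    scale = SHORT_SIDE[target] / min(src_w, src_h)

    def even(x):
        # Round to the nearest even integer, never below 2.
        return max(2, 2 * round(x / 2))

    return even(src_w * scale), even(src_h * scale)


landscape = output_size(3840, 2160, "1080p")   # UHD landscape source
portrait = output_size(1080, 1920, "720p")     # vertical phone footage

print(landscape, portrait)
```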