Video

Luma Ray: how to write prompts the model actually understands

Luma · Updated:

Luma Ray is the Luma video-model family in Dream Machine: Ray 1.x, Ray 2, Ray 3.14, and Ray 3 Reasoning. All Ray models share the same formula — subject + mid-action verb + setting + secondary motion + camera + lighting. These are positive-only models: negative prompts are counter-productive. Duration is 5–10 seconds, extend up to 30 seconds.

What the Ray family does

All Luma Ray models support text-to-video, image-to-video with keyframes (start + end frame), extend for continuation, modify for transforming existing video, and loop for seamless looping. Aspect ratios: 9:16, 3:4, 1:1, 4:3, 16:9, 21:9.

Key feature of the family: Ray models are trained directly on video data, so they understand natural motion and physically correct interactions. They do NOT accept negative instructions — describe only what you want to see. The words «vibrant», «whimsical», «hyper-realistic», «beautiful», «amazing», «stunning» degrade quality — empirically confirmed by Luma.

  • 5–10 seconds per run, extend up to ~30 seconds
  • 720p and 1080p native, 4K via upscale
  • Keyframes: start frame + end frame for I2V
  • Loop for seamless product videos
  • Positive-only model — no negative prompts

Prompt structure

Universal formula across all Ray models: [Subject] + [Mid-Action Verb] + [Setting/Environment] + [Secondary Motion/Consequences] + [Camera Movement] + [Lighting/Mood].

Key rule — present continuous. Use mid-action verbs: «running» instead of «begins to run», «pouring» instead of «starts to pour», «spinning» instead of «will spin». The model doesn't understand future tense or action sequences.

Optimal prompt length is around 100 words focused on the action. Under 15 words — the model fills in too much and results are unpredictable. Over 200 words — detail overload and the model loses focus.

Camera and secondary motion

A concrete camera description significantly improves the result. Movement categories: dolly forward/back, tracking shot, orbit, crane shot, push in, pull out, pan left/right, tilt up/down. Shot size: close-up, extreme close-up, medium shot, wide shot, establishing shot, macro. Angle: low angle, high angle, bird's eye view, Dutch angle, POV, over-the-shoulder.

Secondary motion is the consequence of the main action that makes video feel alive: «ears flapping in the wind», «dust particles catching golden hour sunlight», «fabric billowing outward», «hair flowing», «reflections shimmering on wet pavement», «ripples spreading on water». Without them, video looks static even with a moving subject.

Avoid conflicting camera moves (zoom + pan + orbit simultaneously) — the model tries to execute everything at once and produces chaos.

Modes and keyframes

Text-to-Video — generation from text, the main mode. Image-to-Video — animating an image or transitioning between start/end keyframes. With keyframes the aspect ratio is taken automatically from the uploaded image. Describe only what CHANGES — don't re-describe static elements.

Extend — continuation of existing video. Limit ~30 seconds total, beyond that quality drops. Each extension is a separate prompt describing the new content.

Modify (V2V) — transforming existing video by prompt. Three modes: Adhere (preserve original, minimal change), Flex (balance), Reimagine (freedom). Critical rule: describe the END STATE, not commands. «Cyberpunk neon city at night, rain-slicked streets» works; «change the sky to blue» doesn't.

Loop — seamless looped video for product showcases. Activated via the ∞ icon in the prompt box.

Common mistakes

  1. 1. Forbidden words in the prompt

    «Vibrant», «whimsical», «hyper-realistic», «beautiful», «amazing», «stunning» empirically degrade Ray quality. These words give the model no visual information, take up prompt space, and reduce results. Replace with concrete description: «warm orange light», «soft pastel palette», «sharp detail», «cinematic lighting».

  2. 2. Temporal phrases and future tense

    «Begins to run», «starts to spin», «will pour», «then transforms» — Ray doesn't understand temporal sequence and works in present continuous. Use mid-action verbs: «running», «spinning», «pouring», «transforming». For sequential actions, split into separate prompts with extend.

  3. 3. Negative prompts

    «No text», «without people», «remove watermark», «no clouds» — Ray is a «positive only» model. Negative instructions are either ignored or inverted. Replace with positive description of the desired result: instead of «no clouds» write «clear blue sky»; instead of «without people» write «empty street».

  4. 4. Prompt too short

    A prompt under 15 words gives the model too little information and it makes up most of the scene. Optimal length is around 100 words with subject, action, setting, secondary motion, camera, and light. Over 200 words leads to detail overload.

  5. 5. Re-describing static elements with keyframes

    When using image-to-video with keyframes, don't re-describe what's already in the frames. Describe only the CHANGES between start and end frame: «hair starts to flow», «camera dollies forward», «light shifts from cool to warm». Detailed descriptions of static elements confuse the model.

Before / after examples

Example 1

Before

a beautiful golden retriever runs through a field, stunning visuals

After

A golden retriever running through a wheat field at sunset, ears flapping in the wind, dust particles catching golden hour sunlight, camera tracking alongside at the dog's level, warm cinematic light.

Forbidden words («beautiful», «stunning visuals») removed. «Runs» (present simple) replaced with «running» (mid-action). Added secondary motion (ears flapping, dust particles), concrete camera (tracking alongside at dog's level), and light (golden hour, warm cinematic).

Example 2

Before

espresso pouring, no spills, no mess

After

Espresso pouring into a white ceramic cup, dark liquid swirling and forming crema, steam rising slowly, macro close-up, soft warm morning light from the left, shallow depth of field.

«No spills, no mess» is a negative prompt that Ray ignores or inverts. The fix: positive description of the clean process (swirling, forming crema), macro proximity, specific light (warm morning light from the left).

Example 3

Before

change the sky from cloudy to blue (for Modify)

After

Clear blue sky, bright daylight, soft white clouds at the horizon, warm sunlight, golden hour atmosphere — same composition as original.

For Modify (V2V), commands («change», «remove», «transform») don't work. The fix: describe the END STATE as a standalone scene. Start with Adhere 1–2 intensity and increase as needed.

Frequently asked

How is the Ray family different from other video models?
Ray is trained directly on video data, giving it an understanding of natural motion and physically correct interactions. It's a «positive only» model — negative prompts don't work. Supports keyframes (start + end frame), extend up to ~30 seconds, modify (V2V with three modes), and loop. The family includes several models: Ray 1.x, Ray 2, Ray 3.14, Ray 3 Reasoning.
Which words degrade Ray quality?
Luma's empirically confirmed list: «vibrant», «whimsical», «hyper-realistic», «beautiful», «amazing», «stunning». They give the model no visual information and lower quality. Temporal phrases («begins to», «starts to», «will», «then») and negative prompts («no», «without», «remove») also degrade output. Replace with concrete description and present continuous verbs.
How many seconds can I generate at once?
5 or 10 seconds per run (selectable in Dream Machine settings). For longer videos use extend — continuation of the existing clip with a description of new content. Total limit is around 30 seconds, beyond that quality drops with each extension. Each extension is a separate prompt.
How do keyframes work in Ray?
Image-to-Video mode accepts a start frame (required) and an end frame (optional). The model generates the transition between them. With keyframes the aspect ratio is taken automatically from the image. Key rule: describe only what CHANGES between frames, don't re-describe static elements. This produces the smoothest motion.
What is Modify mode?
Modify (Video-to-Video) transforms existing video by prompt: changes style, lighting, environment, time of day. Three control levels: Adhere (1–3, preserves the original), Flex (1–3, balance), Reimagine (1–3, creative freedom). Describe the END STATE, not commands: «cyberpunk neon city at night» works; «change to cyberpunk» doesn't.
Should I write prompts in languages other than English?
No, Ray models are optimized for English and cinematic terminology. In other languages results are noticeably worse — the model loses the photographic and cinematographic anchors it responds to best. For production work always translate to English; for short experiments other languages are fine, but expect lower quality.
Does Opten support Luma Ray?
Yes, the Opten extension auto-detects models in the Ray family (Ray 1.x, Ray 2, Ray 3.14, Ray 3) and scores prompts by family-specific rules. It checks for absence of forbidden words and negative prompts, presence of mid-action verbs, concrete camera and light descriptions, and secondary motion. One click gives you a rewrite in the positive-only format with the correct structure.

Related models

Ready to write Luma Ray (general family) prompts in one click?

  • Auto-detects the model inside its native interface
  • Scores every line of your prompt
  • One-click rewrite into the correct structure
ChromeYandex BrowserChrome / Yandex BrowserInstall extension

Pro — $2.99/month or ₽199/month · cancel anytime

Stop Guessing. Generate
On The First Try.

Install Opten in 30 seconds and score your next prompt.

Opten is a Chrome extension that scores AI prompts for the specific model. Supports 60+ image and video models — Midjourney, GPT Image 2, Kling, Sora, Nano Banana, Flux — and rewrites them in one click inside the Syntx, Higgsfield, and Freepik interfaces. From $2.99/month.

© 2026 Opten · IE Nikolai Shupletsov · Tax ID 306389672