LogoAcceptPrompt
  • 功能
  • FAQ
  • 博客
  • 文档
Google Veo 3.1 提示词指南:技巧、方法与提示词模板

Photo by Kaitlyn Baker on Unsplash

2026/03/09

Google Veo 3.1 提示词指南:技巧、方法与提示词模板

Google Veo 3.1 全面指南:提示词技巧、真实皮肤生成、去字幕方法、费用价格详解,以及 Veo 3.1 时长限制说明。含可直接复制的提示词模板。

Google Veo 3.1 是 Google DeepMind 推出的新一代 AI 视频生成模型。它能够生成高保真视频,并原生支持音频——包括对话、音效、环境音和背景音乐——全部在一次生成中完成。本指南将详细介绍如何有效地为 Veo 3.1 编写提示词,并提供可直接复制使用的提示词模板。

什么是 Veo 3.1?

Veo 3.1 是 Google Veo 系列视频生成模型的最新版本。核心亮点包括:

  • 原生音频生成:直接生成与视频同步的对话、音效、环境音和音乐——无需额外的音频处理步骤。
  • 行业领先的真实感:重新设计以实现更高的真实度和保真度,内置真实世界物理模拟。
  • 顶级提示词遵循能力:改进了对复杂、详细指令的准确执行能力。
  • 专业级分辨率:支持 1080p 和 4K 分辨率输出。
  • 视频延展:生成 8 秒片段后可扩展为更长的连贯场景,保持视觉和音频的一致性。

在哪里使用 Veo 3.1

平台说明链接
Gemini基于对话的界面,快速生成视频gemini.google.com
Flow为创作者打造的 AI 电影制作工具labs.google/flow
Google AI Studio开发者友好的提示词实验平台aistudio.google.com
Gemini API程序化接入,用于构建应用ai.google.dev
Vertex AI Studio企业级部署方案cloud.google.com/vertex-ai

优秀 Veo 3.1 提示词的构成要素

提示词中加入的细节越多,你对最终视频的控制力就越强。一个出色的 Veo 提示词通常包含以下要素:

要素控制什么示例
镜头/构图景别、角度、运镜"中景"、"无人机跟踪拍摄"、"镜头缓缓推进"
视觉风格艺术方向、类型、媒介"电影感"、"定格动画"、"35mm 胶片黑色电影"
光线氛围、气氛、时间"温暖的灯光"、"黄金时段"、"霓虹灯"
角色外貌、服装、表情"一位戴着太阳镜、穿着佩斯利衬衫的灰胡子老人"
环境场景、景色、道具"夜晚烟雾弥漫的爵士俱乐部"、"霓虹灯闪烁的赛博朋克城市"
动作角色行为、场景事件"在岩石上奔跑"、"做后空翻"
对话角色的台词""这座城市总有故事,"老人低声说"
音频音效设计、音乐、效果"Audio: 翅膀振动、鸟鸣、轻柔的管弦乐配乐"

提示词编写技巧

1. 用具体细节塑造角色

不要只说"一个女人"——描述她的外貌、服装、表情和声音。越具体,角色就越独特和一致。

❌ 模糊的:

一个棕色头发的女人在说话。

✅ 具体的:

A medium shot opens on a seasoned, grey-bearded man in sunglasses and a paisley shirt, his gaze fixed off-camera with a contemplative expression. His gold chain glints subtly. Beside him, a younger man in a tank top, also looking forward, suggests a shared moment of observation or reflection.

2. 用感官语言构建沉浸式世界

使用富有感染力的感官描述来勾勒完整的画面。思考光线、质感、声音和氛围。

A snow-covered plain of iridescent moon-dust under twilight skies. Thirty-foot crystalline flowers bloom, refracting light into slow-moving rainbows. A fur-cloaked figure walks between these colossal blossoms, leaving the only footprints in untouched dust.

3. 自然地加入对话

Veo 3.1 可以原生生成语音对话。你可以给角色具体的台词,或描述一个让他们讨论的主题。使用引号将对话直接嵌入提示词中。

A medium shot frames an old sailor, his knitted blue sailor hat casting a shadow over his eyes, a thick grey beard obscuring his chin. He holds his pipe in one hand, gesturing with it towards the churning, grey sea beyond the ship's railing. "This ocean, it's a force, a wild, untamed might. And she commands your awe, with every breaking light"

4. 明确设计音频

你可以在提示词中内嵌描述音效、环境音和音乐,也可以在末尾使用单独的 Audio: 部分来指定。

A follow shot of a wise old owl high in the air, peeking through the clouds in a moonlit sky above a forest. The wise old owl carefully circles a clearing looking around to the forest floor. After a few moments, it dives down to a moonlit path and sits next to a badger. Audio: wings flapping, birdsong, loud and pleasant wind rustling and the sound of intermittent pleasant sounds buzzing, twigs snapping underfoot, croaking. A light orchestral score with woodwinds throughout with a cheerful, optimistic rhythm, full of innocent curiosity.

5. 用极致细节控制复杂动作

对于快节奏或技术要求高的场景,不要留任何想象空间。详细描述事件的确切顺序、摄像机行为和时间安排。

The scene explodes with the raw, visceral, and unpredictable energy of a hardcore off-road rally, captured with a dynamic, almost found-footage or embedded sports documentary aesthetic. The camera is often shaky, seemingly mounted inside one of the vehicles or held by a daring spectator very close to the action, frequently splattered with mud or water, catching unintentional lens flares from the natural, often harsh, sunlight filtering through trees or reflecting off wet surfaces. Within an 8-second sequence, one of the lead vehicles, a low-slung, open-cockpit buggy so caked in thick, brown mud that its original color is a mystery, approaches a wide, shallow river crossing at incredible speed. Without the slightest hesitation, its unseen driver powers straight into the water. The impact sends an enormous, almost solid, opaque sheet of muddy water spectacularly high into the air, completely engulfing the small buggy for a terrifying moment.

6. 定义独特的视觉风格和基调

在提示词开头指定视频的媒介和风格——真实感、卡通、黏土动画、定格动画、VHS 风格、动漫等。用角色对话来设定情感基调。

Camping (Stop Motion): Camper: "I'm one with nature now!" Bear: "Nature would prefer some personal space."

7. 将视觉与声音设计融合

将特定的音频线索与视觉描述配对,打造多感官体验。使用 Audio: 前缀来指定专门的声音指令。

A keyboard whose keys are made of different types of candy. Typing makes sweet, crunchy sounds. Audio: Crunchy, sugary typing sounds, delighted giggles.
A handheld shot follows a wok as it's expertly flicked, sending vibrant, sizzling vegetables tumbling over themselves in a flash of motion and steam. Audio: a metallic clank and a sharp whoosh.

8. 围绕日常事件构建叙事

你不需要史诗级角色来讲述引人入胜的故事。赋予简单物体一个使命,编写一个有开头、中间和结尾的完整叙事。

A paper boat sets sail in a rain-filled gutter. It navigates the current with unexpected grace. It voyages into a storm drain, continuing its journey to unknown waters.

9. 使用时间戳精确控制时序

为了对视频中的事件序列实现终极控制,你可以使用时间戳标记来描述每个时刻发生的事情。

A meticulously detailed scene opens, displaying a small, pale yellow, humanoid figure crafted from wax. This figure stands centered in a warm, ethereal landscape composed entirely of molten wax. In its raised hand, a delicate, bright flame flickers with a vibrant glow. (0-1 seconds) The camera initiates a smooth, tracking shot, maintaining an eye-level perspective with the small wax person. As the figure begins to gently walk forward, its small feet creating subtle ripples in the viscous, pale yellow wax terrain, the camera gracefully follows its movement. (1-7 seconds) The wax person continues its quiet journey, steadily progressing across the glowing, soft landscape. The camera holds its smooth, tracking motion, subtly receding slightly to reveal a broader view. (7-8 seconds)

可直接复制的提示词模板

以下是完整的、可直接使用的提示词,你可以直接粘贴到 Gemini、Flow 或 Google AI Studio 中。

🎬 电影对话场景

A medium shot opens on a seasoned, grey-bearded man in sunglasses and a paisley shirt, his gaze fixed off-camera with a contemplative expression. His gold chain glints subtly. Beside him, a younger man in a tank top, also looking forward, suggests a shared moment of observation or reflection. The camera slowly pushes in, subtly emphasizing their quiet focus. In the background, a vibrant mural splashes across a wall, hinting at an urban setting. Faint city murmurs and distant chatter drift in, accompanied by a mellow, soulful hip-hop beat that adds a contemplative yet grounded atmosphere. "The city always got a story," the older man murmurs, a slight nod of his head. "Just gotta listen."

🦉 动物奇幻叙事

A follow shot of a wise old owl high in the air, peeking through the clouds in a moonlit sky above a forest. The wise old owl carefully circles a clearing looking around to the forest floor. After a few moments, it dives down to a moonlit path and sits next to a badger. Audio: wings flapping, birdsong, loud and pleasant wind rustling and the sound of intermittent pleasant sounds buzzing, twigs snapping underfoot, croaking. A light orchestral score with woodwinds throughout with a cheerful, optimistic rhythm, full of innocent curiosity.

🎭 历史剧情

A medium shot, historical adventure setting: Warm lamplight illuminates a cartographer in a cluttered study, poring over an ancient, sprawling map spread across a large table. Cartographer: "According to this old sea chart, the lost island isn't myth! We must prepare an expedition immediately!"

😂 喜剧 / 荒诞风格

A detective interrogates a nervous-looking rubber duck. "Where were you on the night of the bubble bath?!" he quacks. Audio: Detective's stern quack, nervous squeaks from rubber duck.

🕵️ 间谍惊悚

A close up of spies exchanging information in a crowded train station with uniformed guards patrolling nearby "The microfilm is in your ticket" he murmured pretending to check his watch "They're watching the north exit" she warned casually adjusting her scarf "Use the service tunnel" Commuters rush past oblivious to the covert exchange happening amid announcements of arrivals and departures

🎻 音乐演奏

A woman, classical violinist with intense focus plays a complex, rapid passage from a Vivaldi concerto in an ornate, sunlit baroque hall during a rehearsal. Their bow dances across the strings with virtuosic speed and precision. Audio: Bright, virtuosic violin playing, resonant acoustics of the hall, distant footsteps of crew, conductor's occasional soft count-in (muffled), rustling sheet music.

🍳 美食 / 烹饪

A close up in a smooth, slow pan focuses intently on diced onions hitting a scorching hot pan, instantly creating a dramatic sizzle. Audio: distinct sizzle.

🎨 动画艺术风格(浮世绘 / 日本木版画)

A breathtaking, painterly 2D animated continuous visual narrative, rendered with the lush, vibrant, and slightly surreal, almost dreamlike, infused with the intricate, delicate detail of traditional Japanese woodblock prints (Ukiyo-e), follows a young, adventurous, and kind-hearted girl as she befriends a colossal, gentle, ancient Forest Spirit. The Spirit is a magnificent, awe-inspiring creature, its form a harmonious blend of animal and plant – perhaps with moss-covered, antler-like branches, fur like shimmering leaves that change color with its mood, and eyes like deep, tranquil forest pools. They meet in a sun-dappled, sacred grove deep within an ancient, primeval forest, where impossibly tall, gnarled trees form a living cathedral and tiny, glowing, friendly forest sprites peek from behind mossy rocks.

🏠 奢华内饰 / 商业广告

The camera begins with a slow, elegant track along the richly paneled walls of a dimly lit, sophisticated hallway, the warm glow of the ornate wall sconces casting inviting reflections on the polished floor. Soft jazz music plays in the background. As we approach an arched entryway, the camera performs a graceful push-in, revealing a grand mirror and flickering candles, then smoothly pivots to the right, opening up to a luxurious home bar. The clinking of ice and the murmur of conversation become audible. The camera settles on a close-up of a perfectly crafted cocktail. "Welcome," a smooth, baritone voice says. "Care for a taste?"

🏔️ 史诗级景观 / 自然

The camera slowly pushes forward into a breathtaking ice cave, its jagged walls sculpted by nature into intricate patterns of blues and whites, reflecting the ethereal light from an opening ahead. The crunch of ice underfoot and the drip-drip of melting water create a serene, echoing soundscape. As the camera moves closer, a gentle, ambient melody begins, swelling with the light from the cave's exit. The camera emerges from the narrow opening into a vast, sun-drenched valley, revealing a group of polar bears playfully sliding down an ice slope, their roars echoing with joy.

进阶功能:文本到视频之外

Veo 3.1 还通过 Flow 和 API 支持高级创作控制:

素材引导生成(参考图片)

提供场景、角色或物体的参考图片来引导 Veo 的生成。这确保视频符合你的创意愿景。

Prompt: Camera dramatically dollies around the subject in this striking cinematic scene. It captures a high-tension moment within a long, sterile, monochromatic green corridor. A lone woman, dressed in a dark, flowing trench coat and trousers that billow dramatically, is suspended mid-air in a powerful, graceful leap.

+ 附上你的角色/场景参考图片

风格匹配

提供一张风格参考图片,Veo 将生成具有相同视觉美学的视频——从绘画到电影级色彩分级。

Prompt: Rendered in an intricate origami art style using complex, angular folds and crisp creases. A multi-layered diorama depicts a cute neighborhood street entirely from folded paper – houses with sharp rooflines, precise white picket fences, and layered, geometric flowers and rose bushes in vibrant paper hues.

+ 附上风格参考图片

角色一致性

提供角色的参考图片,以在不同场景中保持角色外观的一致性。

Prompt: a cute monster walking towards the camera
Prompt: a cute monster swimming underwater
Prompt: a cute monster walking in a candy wonderland

+ 为每个提示词附上相同的角色参考图片

场景延展

将短片扩展为更长的视频。使用第一个镜头的最后一秒来继续故事,同时保持视觉和音频的一致性。

Prompt 1: Graceful dancer is slowly dancing to classical music.
Prompt 2: A male dancer comes in, gracefully dancing with the woman as classical music plays.
Prompt 3: More dancers show up on the stage.
Prompt 4: The classical music continues, and the dancers continue to dance

其他控制

  • 摄像机控制:精确控制构图和摄像机运动(后退、推进、上移、右移)。
  • 首尾帧:在两张提供的图片之间创建流畅的过渡。
  • 外绘扩展:将视频扩展到原始画面之外,适配任何屏幕尺寸。
  • 添加/移除物体:无缝插入或移除物体,保持逼真的阴影和比例。
  • 角色控制:使用你的身体、面部和声音来驱动角色动画。
  • 运动控制:定义物体的精确运动路径。

进阶技巧

  1. 从风格开始:在提示词开头定义视觉媒介(电影感、卡通、定格动画等)。
  2. 角色要具体:"一位二十多岁、有着波浪棕发和淡雅雀斑的年轻女性"远胜于"一个棕色头发的女人"。
  3. 使用电影术语:如"中景"、"推拉变焦"、"跟踪拍摄"和"推镜"等术语能给模型明确的摄像指令。
  4. 单独描述音频:使用 Audio: 部分来描述复杂的声音设计,保持提示词条理清晰。
  5. 使用时间戳:添加 (0-1 seconds)、(1-7 seconds) 标记来精确编排事件时序。
  6. 用 Gemini 协助扩展:使用 Gemini 帮助将你的初始创意扩展为更详细的提示词描述。
  7. 大胆实验:长提示词和短提示词都能产生惊人的效果——尝试不同的方法!

安全与水印

所有使用 Veo 生成的视频都会标记 SynthID,这是 Google 用于标记 AI 生成内容的水印技术。Veo 还包含安全评估和内容检查,以防止滥用、隐私侵犯和偏见。


常见问题解答(FAQ)

Veo 3.1 的时长限制是多少?

Veo 3.1 单次生成的时长限制为 8 秒。但你可以通过 Flow Scene Builder 或 Gemini API 的 Frames-to-Video 扩展工具大幅延长视频:

  • 每次扩展追加 7 秒
  • 最多可扩展 20 次
  • 最大可达约 148 秒(约 2.5 分钟)

建议做法:生成简短的 8 秒片段后,在剪辑软件中拼接,而不是一次性推到最大时长。Veo 3.1 专门优化了跨片段的一致性,比 Veo 3.0 有显著提升。


Veo 3 的费用是多少?

Veo 3 的价格取决于访问方式:

方案价格访问内容
Google AI Pro$19.99/月Veo 3 Fast(via Gemini & Flow)
Google AI Ultra$249.99/月完整 Veo 3(约 12,500 积分/月)
Gemini API(Veo 3 标准版)约 $0.40/秒按秒计费,1080p 含音频
Gemini API(Veo 3 Fast)约 $0.15/秒更经济的选项
免费 Gemini 计划免费每月 100 积分,可生成短视频

提示: 通过 API 生成一个 8 秒的 Veo 3 标准版视频约需 $3.20。普通用户使用免费 Gemini 计划或 Google AI Pro 订阅更为划算。


如何用 Veo 3 生成无字幕视频?

Veo 3 的一个常见问题是自动添加字幕或文字叠加——尤其是在有对话的场景中。这是因为训练数据中包含大量带字幕的视频。以下方法可以避免:

在提示词末尾添加明确的否定指令:

[你的场景描述]。No subtitles. No captions. No on-screen text. No text overlay. No typography.

无字幕 Veo 3 提示词的额外技巧:

  • 对话内容避免使用引号——用 一个男人说:你好 替代 "你好"
  • 在提示词中加入 clean screen(干净画面)或 no text elements(无文字元素)
  • 提示词越详细,模型越少发挥——减少文字填充的可能性
  • 如果字幕仍然出现,尝试去掉对话描述中的标点符号
A warm kitchen scene. A mother and daughter cook together, laughing. The daughter stirs a large pot, steam rising softly. The mother tastes from a spoon and smiles. Natural window light. No subtitles. No captions. No on-screen text.

如何在 Veo 3 中生成真实感皮肤?

AI 视频模型默认生成的皮肤常有蜡质感或塑料感。以下方法可在 Veo 3 中实现更自然、高保真的真实感皮肤:

在角色描述中加入皮肤专属关键词:

[角色描述],visible pores(可见毛孔),natural skin texture(自然皮肤纹理),peach fuzz detail(桃绒毛细节),subtle subsurface scattering(细微次表面散射),realistic skin tone with natural color variation(自然色调变化),no retouching(无修图),photorealistic skin(照片级真实感皮肤)

真实感皮肤完整示例提示词:

A medium close-up of a woman in her late thirties, visible pores, natural skin texture, peach fuzz lit by morning sunlight, subtle subsurface scattering, no retouching, soft shadows that sculpt her features. She looks quietly out a window. Warm natural key light with cool fill. No subtitles.

提升皮肤真实感的关键词:

  • visible pores(可见毛孔)——防止皮肤过于光滑无纹理
  • peach fuzz(桃绒毛)——在近景中增加微细节
  • subsurface scattering(次表面散射)——让皮肤在光线下呈现透亮感
  • natural skin tone with color variation(含色调变化的自然肤色)——避免均匀、平坦的着色
  • no retouching(无修图)——提示模型保留自然不完美之处
  • 使用柔和、有方向感的打光:强光会使皮肤显平;漫射光或黄金时段光线能更好地体现纹理

总结

Veo 3.1 代表了 AI 视频生成的重大飞跃,在单一模型中结合了照片级真实感视频和原生音频。获得出色结果的关键在于编写详细、结构化的提示词,明确指定摄像机运动、角色、环境、动作、对话和音频设计。

无论你是电影制作人、内容创作者还是开发者,掌握 Veo 3.1 的提示词技巧将为你解锁全新的创作水平。从上面的现成模板开始,探索高级控制功能,让你的想象力引领一切。

准备好了吗? 现在就在 Gemini、Flow 或 Google AI Studio 上试用 Veo 3.1。

全部文章

作者

avatar for Accept Prompt
Accept Prompt

分类

  • 产品
什么是 Veo 3.1?在哪里使用 Veo 3.1优秀 Veo 3.1 提示词的构成要素提示词编写技巧1. 用具体细节塑造角色2. 用感官语言构建沉浸式世界3. 自然地加入对话4. 明确设计音频5. 用极致细节控制复杂动作6. 定义独特的视觉风格和基调7. 将视觉与声音设计融合8. 围绕日常事件构建叙事9. 使用时间戳精确控制时序可直接复制的提示词模板🎬 电影对话场景🦉 动物奇幻叙事🎭 历史剧情😂 喜剧 / 荒诞风格🕵️ 间谍惊悚🎻 音乐演奏🍳 美食 / 烹饪🎨 动画艺术风格(浮世绘 / 日本木版画)🏠 奢华内饰 / 商业广告🏔️ 史诗级景观 / 自然进阶功能:文本到视频之外素材引导生成(参考图片)风格匹配角色一致性场景延展其他控制进阶技巧安全与水印常见问题解答(FAQ)Veo 3.1 的时长限制是多少?Veo 3 的费用是多少?如何用 Veo 3 生成无字幕视频?如何在 Veo 3 中生成真实感皮肤?总结

更多文章

Seedance 2.0 提示词指南:技巧、方法与提示词模板
产品

Seedance 2.0 提示词指南:技巧、方法与提示词模板

关于 Seedance AI 的全面指南——字节跳动视频生成模型。涵盖 Seedance 1.0 与 2.0 对比、bytedance/seedance-v1-pro-i2v-480p API 使用方法,以及附带可复制模板的专业提示词技巧。

avatar for Accept Prompt
Accept Prompt
2026/03/09
Runway Gen-4 提示词指南:如何获得最佳生成效果
产品

Runway Gen-4 提示词指南:如何获得最佳生成效果

学习如何为 Runway Gen-4 视频生成模型编写高效提示词。本指南涵盖主体运动、镜头运动、场景运动、风格描述符及最佳实践,助你生成高质量 AI 视频。

avatar for Accept Prompt
Accept Prompt
2026/03/26
可灵 3.0 提示词指南:技巧、方法与提示词模板
产品

可灵 3.0 提示词指南:技巧、方法与提示词模板

一份全面、实用的可灵 3.0 AI 视频生成模型提示词编写指南。包含可直接复制的多镜头叙事、对话、运镜、音频设计等提示词模板。

avatar for Accept Prompt
Accept Prompt
2026/03/09

等待列表

抢先体验

成为第一批体验 AcceptPrompt 的用户。注册以获取早期访问和独家更新。

成为第一批体验用户。免费抢先体验,订阅即享五折优惠,绝不发送垃圾邮件。

LogoAcceptPrompt

AcceptPrompt 助你一次生成惊艳的 AI 视频。

Built withAUAI Company
产品
  • 功能
  • 价格
  • 常见问题
资源
  • 博客
  • 文档
  • 更新日志
公司
  • 关于我们
  • 联系我们
法律
  • Cookie政策
  • 隐私政策
  • 服务条款
© 2026 AcceptPrompt All Rights Reserved.