game-audio

Compare original and translation side by side

🇺🇸

Original

English

🇨🇳

Translation

Chinese

When this skill is activated, always start your first response with the 🧢 emoji.

激活本技能后，首次回复请以🧢表情开头。

Game Audio

游戏音频

Game audio encompasses every sound a player hears - from UI clicks to orchestral scores that shift with gameplay. Unlike film audio, game audio is non-linear and reactive: sounds must respond to player actions, environmental state, and game events in real time. This skill covers sound design fundamentals, adaptive music composition and implementation, spatial (3D) audio systems, and professional middleware tools (FMOD Studio and Audiokinetic Wwise) that power audio in most shipped titles.

游戏音频涵盖玩家听到的所有声音——从UI点击声到随游戏进程变化的管弦乐配乐。与电影音频不同，游戏音频是非线性且具有响应性的：声音必须实时响应用户操作、环境状态和游戏事件。本技能涵盖声音设计基础、自适应音乐创作与实现、空间（3D）音频系统，以及为大多数已发布游戏提供音频支持的专业中间件工具（FMOD Studio和Audiokinetic Wwise）。

When to use this skill

何时使用本技能

Trigger this skill when the user:

Needs to design or implement a sound effects system for a game
Wants to create adaptive or dynamic music that responds to gameplay
Is setting up spatial/3D audio with HRTF or attenuation curves
Needs to integrate FMOD Studio or Wwise into a game engine
Wants to architect a sound event system or audio manager
Asks about audio mixing, buses, ducking, or signal routing
Needs to optimize audio for memory, CPU, or streaming performance
Wants to build dynamic soundscapes or ambient audio layers

Do NOT trigger this skill for:

Music composition theory or notation (use a music theory skill)
Non-game audio production like podcast editing or mastering (use audio-production)

当用户有以下需求时触发本技能：

需要设计或实现游戏音效系统
想要创建能响应游戏进程的自适应或动态音乐
正在设置基于HRTF或衰减曲线的空间/3D音频
需要将FMOD Studio或Wwise集成到游戏引擎中
想要构建声音事件系统或音频管理器
询问音频混音、总线、闪避或信号路由相关问题
需要针对内存、CPU或流传输性能优化音频
想要构建动态音景或环境音频层

以下情况请勿触发本技能：

音乐创作理论或记谱（使用音乐理论技能）
非游戏类音频制作，如播客编辑或母带处理（使用音频制作技能）

Key principles

核心原则

Audio is reactive, not scripted - Game audio cannot be authored linearly like film. Every sound must be designed as a response to an event. Think in terms of event-to-sound mappings, not timelines.
Less is more in the mix - Players process audio subconsciously. A clean mix with 4-6 prominent sounds is more impactful than 20 competing layers. Use priority systems, ducking, and voice limits to keep the mix focused.
Variation prevents fatigue - Any sound heard more than twice needs randomization. Use round-robin containers, pitch/volume randomization, and multiple samples to prevent repetition fatigue.
Spatial audio sells immersion - Proper 3D spatialization, distance attenuation, and environmental effects (reverb zones, occlusion) make players feel present in the world without them consciously noticing.
Middleware is your friend - FMOD and Wwise exist so audio designers and programmers can work in parallel. Sound designers author in the middleware tool; programmers fire events from code. This separation is non-negotiable on any team larger than one person.

音频是响应式的，而非脚本化的 - 游戏音频不能像电影音频那样线性制作。每个声音都必须设计为对事件的响应。要从事件到声音的映射角度思考，而非时间线。
混音中少即是多 - 玩家会下意识地处理音频。一个包含4-6个突出声音的简洁混音，比20个相互竞争的音层更有冲击力。使用优先级系统、闪避和语音限制来保持混音的焦点。
变化避免听觉疲劳 - 任何被听到超过两次的声音都需要随机化处理。使用循环容器、音高/音量随机化以及多采样来避免重复疲劳。
空间音频增强沉浸感 - 恰当的3D空间化、距离衰减和环境效果（混响区域、遮挡）能让玩家身临其境，而无需他们刻意察觉。
中间件是好帮手 - FMOD和Wwise的存在让音频设计师和程序员可以并行工作。音频设计师在中间件工具中创作；程序员从代码中触发事件。对于人数超过一人的团队，这种分工是必不可少的。

Core concepts

核心概念

The event-driven audio model

事件驱动的音频模型

Game audio is built on an event system. Game code fires named events (e.g.,

Player/Footstep

Weapon/Fire

Music/EnterCombat

), and the audio middleware resolves those events into actual sounds. This decoupling means:

Sound designers can swap, layer, or randomize sounds without code changes
Programmers don't need to know which .wav file plays for a footstep
Events can carry parameters (surface type, velocity, health percentage) that the middleware uses to select or modulate sounds

游戏音频基于事件系统构建。游戏代码触发命名事件（例如

Player/Footstep

、

Weapon/Fire

、

Music/EnterCombat

），音频中间件将这些事件解析为实际声音。这种解耦意味着：

音频设计师无需修改代码即可替换、叠加或随机化声音
程序员无需知道脚步声对应的是哪个.wav文件
事件可以携带参数（表面类型、速度、生命值百分比），中间件会用这些参数来选择或调制声音

Adaptive music architecture

自适应音乐架构

Adaptive music uses one or more of these techniques to respond to gameplay:

Technique	Description	Best for
Horizontal re-sequencing	Rearranges musical sections in real time	Exploration, open-world
Vertical layering	Adds/removes instrument layers based on intensity	Combat escalation
Stinger/transition	Plays a short musical phrase to bridge states	State changes (win, lose, discovery)
Branching	Pre-authored alternate paths at decision points	Story-driven moments

Most shipped games combine 2-3 of these. Vertical layering + stingers is the most common pattern for action games.

自适应音乐使用以下一种或多种技术来响应游戏进程：

技术	描述	适用场景
水平重排	实时重新排列乐段	探索、开放世界
垂直分层	根据强度添加/移除乐器层	战斗升级
过渡短曲/转场	播放简短乐段来衔接不同状态	状态变化（胜利、失败、发现）
分支结构	在决策点预制作备选路径	剧情驱动的时刻

大多数已发布游戏会结合2-3种这些技术。垂直分层+过渡短曲是动作游戏最常用的模式。

Spatial audio pipeline

空间音频流程

The spatial audio pipeline processes each sound source through:

Distance attenuation - Volume decreases with distance (linear, logarithmic, or custom curve)
Spatialization - Panning across speakers/headphones using HRTF (head-related transfer function) or simple panning
Occlusion/obstruction - Raycast from listener to source; muffle if geometry blocks the path
Environmental effects - Apply reverb, echo, or filtering based on the room/space the sound is in (reverb zones)

空间音频流程会通过以下步骤处理每个声源：

距离衰减 - 音量随距离降低（线性、对数或自定义曲线）
空间化 - 使用HRTF（头部相关传递函数）或简单声像在扬声器/耳机上进行声像定位
遮挡/阻碍 - 从听者向声源发射射线；如果几何体阻挡路径，则降低声音清晰度
环境效果 - 根据声音所在的房间/空间应用混响、回声或滤波（混响区域）

FMOD vs Wwise at a glance

FMOD与Wwise概览

Aspect	FMOD Studio	Wwise
Pricing	Free under $200K revenue	Free under 1000 sound assets
Learning curve	Lower - familiar DAW-like UI	Steeper - more powerful, more complex
Strength	Rapid prototyping, indie-friendly	Large-scale projects, AAA pipelines
Unity integration	First-class plugin	First-class plugin
Unreal integration	Community plugin	Built-in integration
Scripting	C/C++ API, C# wrapper	C/C++ API, Wwise Authoring API

See

references/fmod-guide.md

and

references/wwise-guide.md

for setup and API details.

方面	FMOD Studio	Wwise
定价	营收低于20万美元时免费	声音资产少于1000个时免费
学习曲线	较低 - 类似DAW的熟悉界面	较陡 - 功能更强大，也更复杂
优势	快速原型开发，独立开发者友好	大型项目，AAA级流水线
Unity集成	一流插件	一流插件
Unreal集成	社区插件	内置集成
脚本	C/C++ API，C#包装器	C/C++ API，Wwise创作API

有关设置和API的详细信息，请参阅

references/fmod-guide.md

和

references/wwise-guide.md

。

Common tasks

常见任务

Set up an audio event system

设置音频事件系统

Create a centralized audio manager that maps game events to middleware calls.

csharp

// Unity + FMOD example
public class AudioManager : MonoBehaviour
{
    public static AudioManager Instance { get; private set; }

    private void Awake()
    {
        if (Instance != null) { Destroy(gameObject); return; }
        Instance = this;
        DontDestroyOnLoad(gameObject);
    }

    public void PlayOneShot(string eventPath, Vector3 position)
    {
        FMODUnity.RuntimeManager.PlayOneShot(eventPath, position);
    }

    public FMOD.Studio.EventInstance CreateInstance(string eventPath)
    {
        return FMODUnity.RuntimeManager.CreateInstance(eventPath);
    }

    public void SetGlobalParameter(string name, float value)
    {
        FMODUnity.RuntimeManager.StudioSystem.setParameterByName(name, value);
    }
}

// Usage from game code:
AudioManager.Instance.PlayOneShot("event:/SFX/Explosion", transform.position);

创建一个集中式音频管理器，将游戏事件映射到中间件调用。

csharp

// Unity + FMOD example
public class AudioManager : MonoBehaviour
{
    public static AudioManager Instance { get; private set; }

    private void Awake()
    {
        if (Instance != null) { Destroy(gameObject); return; }
        Instance = this;
        DontDestroyOnLoad(gameObject);
    }

    public void PlayOneShot(string eventPath, Vector3 position)
    {
        FMODUnity.RuntimeManager.PlayOneShot(eventPath, position);
    }

    public FMOD.Studio.EventInstance CreateInstance(string eventPath)
    {
        return FMODUnity.RuntimeManager.CreateInstance(eventPath);
    }

    public void SetGlobalParameter(string name, float value)
    {
        FMODUnity.RuntimeManager.StudioSystem.setParameterByName(name, value);
    }
}

// Usage from game code:
AudioManager.Instance.PlayOneShot("event:/SFX/Explosion", transform.position);

Implement adaptive music with vertical layering

用垂直分层实现自适应音乐

Layer instrument stems that activate based on a game parameter (e.g., threat level).

FMOD Studio setup:
1. Create a Music Event with multiple audio tracks (stems):
   - Track 1: Ambient pad (always playing)
   - Track 2: Percussion (activates at threat > 0.3)
   - Track 3: Brass/strings (activates at threat > 0.6)
   - Track 4: Full orchestra (activates at threat > 0.9)

2. Create a parameter "ThreatLevel" (0.0 to 1.0) on the event

3. Add volume automation on each track tied to ThreatLevel:
   - Track 1: Volume 1.0 across full range
   - Track 2: Fade in from 0.0 to 1.0 between threat 0.3-0.4
   - Track 3: Fade in between 0.6-0.7
   - Track 4: Fade in between 0.9-1.0

csharp

// Code side - update the parameter from game state
private FMOD.Studio.EventInstance musicInstance;

void StartMusic()
{
    musicInstance = AudioManager.Instance.CreateInstance("event:/Music/Exploration");
    musicInstance.start();
}

void Update()
{
    float threat = CalculateThreatLevel();
    musicInstance.setParameterByName("ThreatLevel", threat);
}

根据游戏参数（例如威胁等级）激活乐器分层音轨。

FMOD Studio setup:
1. Create a Music Event with multiple audio tracks (stems):
   - Track 1: Ambient pad (always playing)
   - Track 2: Percussion (activates at threat > 0.3)
   - Track 3: Brass/strings (activates at threat > 0.6)
   - Track 4: Full orchestra (activates at threat > 0.9)

2. Create a parameter "ThreatLevel" (0.0 to 1.0) on the event

3. Add volume automation on each track tied to ThreatLevel:
   - Track 1: Volume 1.0 across full range
   - Track 2: Fade in from 0.0 to 1.0 between threat 0.3-0.4
   - Track 3: Fade in between 0.6-0.7
   - Track 4: Fade in between 0.9-1.0

csharp

// Code side - update the parameter from game state
private FMOD.Studio.EventInstance musicInstance;

void StartMusic()
{
    musicInstance = AudioManager.Instance.CreateInstance("event:/Music/Exploration");
    musicInstance.start();
}

void Update()
{
    float threat = CalculateThreatLevel();
    musicInstance.setParameterByName("ThreatLevel", threat);
}

Configure spatial audio with attenuation and occlusion

配置带衰减和遮挡的空间音频

csharp

// FMOD: Set 3D attributes on a looping sound source
public class AudioEmitter : MonoBehaviour
{
    [SerializeField] private string eventPath = "event:/SFX/Generator_Hum";
    private FMOD.Studio.EventInstance instance;

    void Start()
    {
        instance = FMODUnity.RuntimeManager.CreateInstance(eventPath);
        instance.set3DAttributes(FMODUnity.RuntimeUtils.To3DAttributes(transform));
        instance.start();
    }

    void Update()
    {
        instance.set3DAttributes(FMODUnity.RuntimeUtils.To3DAttributes(transform));
    }

    void OnDestroy()
    {
        instance.stop(FMOD.Studio.STOP_MODE.ALLOWFADEOUT);
        instance.release();
    }
}

For occlusion, raycast from the listener to the source and set a low-pass filter parameter based on hit count:

csharp

void UpdateOcclusion()
{
    Vector3 listenerPos = Camera.main.transform.position;
    Vector3 direction = transform.position - listenerPos;
    float distance = direction.magnitude;

    int hits = Physics.RaycastNonAlloc(listenerPos, direction.normalized,
        raycastHits, distance, occlusionMask);

    float occlusion = Mathf.Clamp01(hits * 0.3f);
    instance.setParameterByName("Occlusion", occlusion);
}

csharp

// FMOD: Set 3D attributes on a looping sound source
public class AudioEmitter : MonoBehaviour
{
    [SerializeField] private string eventPath = "event:/SFX/Generator_Hum";
    private FMOD.Studio.EventInstance instance;

    void Start()
    {
        instance = FMODUnity.RuntimeManager.CreateInstance(eventPath);
        instance.set3DAttributes(FMODUnity.RuntimeUtils.To3DAttributes(transform));
        instance.start();
    }

    void Update()
    {
        instance.set3DAttributes(FMODUnity.RuntimeUtils.To3DAttributes(transform));
    }

    void OnDestroy()
    {
        instance.stop(FMOD.Studio.STOP_MODE.ALLOWFADEOUT);
        instance.release();
    }
}

对于遮挡，从听者向声源发射射线，并根据命中次数设置低通滤波器参数：

csharp

void UpdateOcclusion()
{
    Vector3 listenerPos = Camera.main.transform.position;
    Vector3 direction = transform.position - listenerPos;
    float distance = direction.magnitude;

    int hits = Physics.RaycastNonAlloc(listenerPos, direction.normalized,
        raycastHits, distance, occlusionMask);

    float occlusion = Mathf.Clamp01(hits * 0.3f);
    instance.setParameterByName("Occlusion", occlusion);
}

Design sound variation for repeated events

为重复事件设计声音变化

Prevent repetition fatigue by using containers and randomization:

FMOD Studio:
1. Create a Multi Instrument inside your event
2. Add 3-5 sound variants (e.g., footstep_01.wav through footstep_05.wav)
3. Set playlist mode to "Shuffle" (avoids consecutive repeats)
4. Add pitch randomization: -2 to +2 semitones
5. Add volume randomization: -1 to +1 dB

Wwise equivalent:
1. Create a Random Container
2. Add sound variants as children
3. Enable "Avoid Repeating Last" with a value of 2-3
4. Add Randomizer on Pitch (-200 to +200 cents) and Volume (-1 to +1 dB)

使用容器和随机化避免重复疲劳：

FMOD Studio:
1. Create a Multi Instrument inside your event
2. Add 3-5 sound variants (e.g., footstep_01.wav through footstep_05.wav)
3. Set playlist mode to "Shuffle" (avoids consecutive repeats)
4. Add pitch randomization: -2 to +2 semitones
5. Add volume randomization: -1 to +1 dB

Wwise equivalent:
1. Create a Random Container
2. Add sound variants as children
3. Enable "Avoid Repeating Last" with a value of 2-3
4. Add Randomizer on Pitch (-200 to +200 cents) and Volume (-1 to +1 dB)

Set up audio buses and mixing

设置音频总线与混音

Organize all game audio into a bus hierarchy for clean mixing:

Master Bus
  |- Music Bus        (baseline: -6 dB)
  |    |- Combat Music
  |    |- Ambient Music
  |- SFX Bus          (baseline: 0 dB)
  |    |- Player SFX
  |    |- Enemy SFX
  |    |- Environment SFX
  |- UI Bus           (baseline: -3 dB)
  |- Voice Bus        (baseline: +2 dB, duck Music by -12 dB when active)

Key mixing practices:

Duck music when dialogue plays (sidechain the Voice bus to Music bus)
Set voice limits per event (e.g., max 8 simultaneous footsteps)
Use snapshot/state systems for context switches (underwater, pause menu)
Keep headroom: master should peak at -3 to -6 dBFS

将所有游戏音频组织为总线层级，以实现清晰的混音：

Master Bus
  |- Music Bus        (baseline: -6 dB)
  |    |- Combat Music
  |    |- Ambient Music
  |- SFX Bus          (baseline: 0 dB)
  |    |- Player SFX
  |    |- Enemy SFX
  |    |- Environment SFX
  |- UI Bus           (baseline: -3 dB)
  |- Voice Bus        (baseline: +2 dB, duck Music by -12 dB when active)

关键混音实践：

播放对话时降低音乐音量（将语音总线侧链到音乐总线）
为每个事件设置语音限制（例如，最多同时播放8个脚步声）
使用快照/状态系统处理场景切换（水下、暂停菜单）
保留头部空间：主总线峰值应控制在-3至-6 dBFS之间

Optimize audio for performance

优化音频性能

Technique	Memory savings	CPU savings	When to use
Compressed formats (Vorbis/Opus)	80-90%	Slight CPU cost	All SFX except very short sounds
Streaming from disk	~100% per sound	Disk I/O cost	Music, long ambiences (>5 seconds)
Voice limiting	Proportional	Proportional	Any sound that can overlap heavily
Sample rate reduction (22kHz)	50%	Minor	Ambient, background, low-frequency
Sound pooling	Avoids alloc spikes	Avoids alloc spikes	Rapid-fire sounds (bullets, particles)

Rules of thumb:

Stream music and long ambiences; load short SFX into memory
Set voice limits: 4-8 for common SFX, 1-2 for music events
Use compressed in-memory format for SFX banks
Budget: aim for under 50 MB total audio memory on console, 100 MB on PC

技术	内存节省	CPU节省	使用场景
压缩格式（Vorbis/Opus）	80-90%	轻微CPU开销	除极短音效外的所有SFX
从磁盘流传输	每个声音约100%	磁盘I/O开销	音乐、长环境音（超过5秒）
语音限制	成比例	成比例	任何可能大量重叠的声音
采样率降低（22kHz）	50%	微小	环境音、背景音、低频音
声音池	避免内存分配峰值	避免内存分配峰值	快速触发的声音（子弹、粒子）

经验法则：

音乐和长环境音采用流传输；短音效加载到内存中
设置语音限制：普通SFX为4-8个，音乐事件为1-2个
对SFX库使用压缩的内存内格式
预算：主机平台总音频内存目标低于50MB，PC平台低于100MB

Anti-patterns / common mistakes

反模式/常见错误

Mistake	Why it's wrong	What to do instead
Hardcoding audio file paths in game code	Tightly couples code to specific assets; impossible to iterate on sounds without recompiling	Use event-based middleware; reference events by name, never by file
No sound variation	Players notice repeated identical sounds within 3 occurrences; breaks immersion	Use random containers with 3-5 variants plus pitch/volume randomization
Music cuts abruptly on state change	Jarring transitions destroy mood; players notice bad music transitions immediately	Use transition timelines, crossfades, or stinger/bridge segments
Ignoring voice limits	100 simultaneous explosion sounds will clip, distort, and destroy CPU budgets	Set per-event voice limits with steal behavior (oldest, quietest, or farthest)
Flat 3D audio (no occlusion)	Sound passing through walls breaks spatial awareness and immersion	Implement raycast-based occlusion with low-pass filtering
Mixing everything at 0 dB	No headroom causes clipping; no hierarchy means players can't prioritize important sounds	Structure buses with headroom; duck less important buses when critical sounds play
Loading all audio into memory	Game runs out of memory or has huge load times	Stream long audio; compress short SFX; only load banks needed for current level

错误	错误原因	正确做法
在游戏代码中硬编码音频文件路径	代码与特定资产紧密耦合；不重新编译就无法迭代声音	使用基于事件的中间件；通过名称引用事件，绝不直接引用文件
无声音变化	玩家在3次内就会注意到重复的相同声音；破坏沉浸感	使用包含3-5个变体的随机容器，再加上音高/音量随机化
状态变化时音乐突然切断	突兀的过渡会破坏氛围；玩家会立刻注意到糟糕的音乐过渡	使用过渡时间线、交叉淡入淡出或过渡短曲/衔接段
忽略语音限制	同时播放100个爆炸音会导致削波、失真，并耗尽CPU预算	为每个事件设置语音限制，并配置抢占规则（最早、最安静或最远的声音）
平面3D音频（无遮挡）	声音穿墙会破坏空间感知和沉浸感	实现基于射线检测的遮挡，并配合低通滤波
所有音频都混音在0 dB	�无头部空间会导致削波；无层级意味着玩家无法区分重要声音	构建带头部空间的总线结构；播放关键声音时降低次要总线的音量
将所有音频加载到内存中	�游戏会耗尽内存或加载时间过长	长音频采用流传输；短音效压缩；仅加载当前关卡所需的音频库

References

参考资料

For detailed content on specific topics, read the relevant file from

references/

```
references/fmod-guide.md
```
- FMOD Studio setup, C# API, event authoring, parameter automation
```
references/wwise-guide.md
```
- Wwise project setup, SoundBank workflow, RTPC, state/switch groups
```
references/spatial-audio.md
```
- HRTF, ambisonics, reverb zones, occlusion algorithms, platform differences
```
references/adaptive-music.md
```
- Horizontal re-sequencing, vertical layering, transition matrices, implementation patterns

Only load a references file if the current task requires deep detail on that topic.

如需特定主题的详细内容，请阅读

references/

目录下的相关文件：

```
references/fmod-guide.md
```
- FMOD Studio设置、C# API、事件创作、参数自动化
```
references/wwise-guide.md
```
- Wwise项目设置、SoundBank工作流、RTPC、状态/开关组
```
references/spatial-audio.md
```
- HRTF、Ambisonics、混响区域、遮挡算法、平台差异
```
references/adaptive-music.md
```
- 水平重排、垂直分层、过渡矩阵、实现模式

仅当当前任务需要该主题的深入细节时，才加载对应的参考文件。

game-audio

Original

Translation

Game Audio

游戏音频

When to use this skill

何时使用本技能

Key principles

核心原则

Core concepts

核心概念

The event-driven audio model

事件驱动的音频模型

Adaptive music architecture

自适应音乐架构

Spatial audio pipeline

空间音频流程

FMOD vs Wwise at a glance

FMOD与Wwise概览

Common tasks

常见任务

Set up an audio event system

设置音频事件系统

Implement adaptive music with vertical layering

用垂直分层实现自适应音乐

Configure spatial audio with attenuation and occlusion

配置带衰减和遮挡的空间音频

Design sound variation for repeated events

为重复事件设计声音变化

Set up audio buses and mixing

设置音频总线与混音

Optimize audio for performance

优化音频性能

Anti-patterns / common mistakes

反模式/常见错误

References

参考资料

Related skills

相关技能