Skip to main content

Audio Descriptions for Video Content

Audio descriptions make video content accessible to people who are blind or have low vision by narrating important visual information that isn’t conveyed through dialogue or sound. This comprehensive guide covers when audio descriptions are needed, what to include, and how to implement them effectively.

What Are Audio Descriptions?

Audio descriptions are narrated explanations of key visual elements in video content. A narrator speaks during natural pauses in dialogue and sound, describing:

  • Visual actions: What characters are doing
  • Scene settings: Locations, environments, and changes in setting
  • Facial expressions: Emotions conveyed visually
  • Body language: Gestures, posture, and mannerisms
  • On-screen text: Important titles, captions, or graphics
  • Visual transitions: Scene changes, time passage, dream sequences

The goal is to provide enough information so that viewers who can’t see the video can follow and understand the story or content as completely as sighted viewers.

Who Benefits from Audio Descriptions?

People who are blind: Audio descriptions provide access to visual information they can’t perceive.

People with low vision: Descriptions clarify details that may be difficult to see clearly.

People who are deafblind: Combined with captions presented on refreshable braille displays, audio descriptions make video content accessible.

People with certain cognitive disabilities: Additional verbal information can help with comprehension and following complex visual narratives.

When Are Audio Descriptions Required?

Prerecorded Multimedia (Video + Audio)

Required when: Visual content contains essential information not available through audio alone.

If your video includes charts, graphs, demonstrations, important actions, or visual details critical to understanding, you must provide audio descriptions.

WCAG Level A requirement: Provide either audio descriptions or a full text alternative (like a transcript) for prerecorded multimedia.

WCAG Level AA requirement: Provide audio descriptions for all prerecorded multimedia content.

Prerecorded Video-Only Content

Required: Provide either audio descriptions or a full text transcript.

Best practice: Provide both. Audio descriptions make the video accessible while playing, and transcripts provide an alternative way to access content for users who prefer reading or need to search for specific information.

Live Multimedia Content

Not required under WCAG 2.1, Section 508, or the 21st Century Video Communications and Accessibility Act (CVAA), but audio descriptions may be provided if technically feasible.

Live audio description requires trained describers and specialized technical platforms to deliver alternate audio tracks in real-time. This is both an art and a science.

Important: If you record live content and make it available later, it becomes prerecorded content and must meet prerecorded accessibility requirements, including audio descriptions.

When Audio Descriptions Aren’t Needed

You may not need separate audio descriptions if:

  • All important visual information is already described in the spoken dialogue
  • The video contains no essential visual-only information
  • The content can be fully understood through audio alone

Example: A “talking head” video where someone speaks directly to the camera with no important visual elements doesn’t require additional audio description—the speaker is already describing everything verbally.

What to Include in Audio Descriptions

Effective audio descriptions convey all important visual information that dialogue and other sounds cannot. The level of detail depends on available time between dialogue.

Essential Elements

Settings and locations: Establish where scenes take place and describe significant location changes:

  • “A busy city street at night”
  • “Inside a small, cluttered office”

Characters: Introduce characters when they first appear, noting:

  • Physical appearance when relevant to the story
  • Age (approximate when significant)
  • Ethnicity when essential to plot or characterization
  • Distinctive features or clothing
  • Facial expressions conveying emotion

Actions: Describe what’s happening visually:

  • Character movements and interactions
  • Physical demonstrations
  • Important gestures or reactions

On-screen text: Read important text that appears visually:

  • Titles and credits
  • Location or time indicators
  • Important signs, notes, or messages
  • Subtitles in foreign languages

Visual transitions: Note changes that might not be obvious:

  • Passage of time: “Three years later”
  • Dream sequences: “In his imagination”
  • Flashbacks: “She remembers the accident”
  • Scene changes: “At the hospital”

General Principles

Match the soundtrack: Descriptions should fit naturally into pauses in dialogue and sound. Don’t talk over important audio.

Present tense: Describe actions as they happen: “She walks away” not “She walked away.”

Clear, simple language: Use straightforward descriptions. Avoid technical jargon unless the content requires it.

Neutral tone: Remain objective. Describe what’s visible without interpreting or adding opinion.

Timing: Descriptions should coincide with the actions being described, not come significantly before or after.

No spoilers: Don’t reveal information that sighted viewers wouldn’t know yet. Maintain suspense and surprise.

Essential information only: Describe what’s important, not every minor detail. Prioritize information that affects understanding of plot, character, or meaning.

What NOT to Include

Camera techniques: Avoid phrases like:

  • “The camera zooms in”
  • “A close-up of”
  • “We see”

Instead, describe what’s shown: “Her eyes fill with tears.”

Obvious audio cues: If sound clearly indicates what’s happening, don’t redundantly describe it:

  • If you hear a door slam, you don’t need “The door slams”
  • If dialogue says “Look at that sunset,” don’t describe the sunset unless details matter

Overwhelming detail: Don’t fill every silence. Music, ambient sound, and natural pauses contribute to the viewing experience.

Offensive language: Never use racist or offensive terms. Note ethnicity only when essential to plot or characterization, using respectful language.

Writing Style Guidelines

Blend naturally: Descriptions shouldn’t draw attention to themselves. They should feel like a natural part of the audio experience.

Match the tone: Adapt your descriptive style to match the content:

  • Action films: Brief, punchy descriptions
  • Dramas: More detailed emotional observations
  • Documentaries: Informative, educational tone
  • Comedies: Light touch that doesn’t interfere with timing

Create atmosphere: Give viewers a feel for settings and mood without overwhelming them with unnecessary details.

Describe sound sources: When the source of an unidentified sound or voice isn’t obvious from context, clarify who or what is making the sound.

Special Considerations

Songs and music: Avoid describing over important lyrics. Note the music’s presence if significant to the scene, but let lyrics be heard.

Subtitles and foreign language: Note subtitles on first instance: “Subtitle: Where are you going?” After establishing they’re present, you can simply prefix: “He says in Spanish…”

Credits: Read opening and closing credits, at least for major contributors (director, main cast, etc.).

Sounds with visual context: If a sound occurs that needs visual context, describe either just before or just after. It’s better to describe before when possible so viewers can anticipate what they’re hearing.

Extended Audio Descriptions

Sometimes there isn’t enough natural pause in the audio to adequately describe important visual information. Extended audio descriptions solve this problem.

What Are Extended Audio Descriptions?

Extended audio descriptions pause the video temporarily to provide the narrator with additional time to describe complex visual content. After the description is complete, the video resumes.

When to Use Extended Audio Descriptions

WCAG Level AAA: Extended audio descriptions are required when regular audio descriptions can’t provide adequate description due to insufficient pauses in foreground audio.

Use extended audio descriptions when:

  • Visual information is too complex for brief description
  • Natural pauses aren’t long enough for necessary descriptions
  • The pace of dialogue doesn’t allow adequate description time
  • Critical visual details require more explanation

Implementation

User control: Extended audio descriptions should be a feature users can toggle on and off. Some viewers may prefer the faster-paced regular version even if it means less detailed description.

Clear indication: Make it obvious when the video has paused for extended description. Some viewers with partial vision may be confused if they don’t realize the pause is intentional.

Minimize disruption: While extended descriptions add time to videos, they’re necessary for complete access. Try to minimize the number of pauses while still providing adequate description.

Methods for Providing Audio Descriptions

Method 1: Separate Version with Embedded Descriptions

Create two versions of your video:

  1. Standard version (no audio descriptions)
  2. Audio-described version (descriptions embedded in the audio track)

Presentation options:

Link to separate page:

<p>
  <a href="/video-standard">Watch video</a> |
  <a href="/video-audio-described">Watch with audio descriptions</a>
</p>

Toggle on same page:

<button>Switch to audio-described version</button>

Dialog/modal:

<button>Open video with audio descriptions</button>

Advantages:

  • Works with any video player
  • No special player features required
  • Descriptions are permanent and can’t fail to load

Disadvantages:

  • Requires maintaining two video files
  • Takes more storage and bandwidth
  • Multiple language versions require many files

Important: Include the audio description narration in closed captions so deaf-blind users can access descriptions via braille displays.

Method 2: Separate Audio Track

Create one video file with multiple audio tracks:

  1. Standard audio track
  2. Audio description track (includes standard audio plus descriptions)

Users toggle between tracks using player controls, similar to how they toggle captions on and off.

Button indicators:

  • “AD” button
  • “D” button
  • “Audio descriptions” in settings menu

Advantages:

  • Single video file for all users
  • Users control their own experience
  • More efficient storage and bandwidth
  • Easier to maintain and update

Disadvantages:

  • Requires player support for multiple audio tracks
  • Not all players support this feature
  • May require custom player implementation

Best practice: This is the preferred method when your platform and player support it, as it provides the best user experience and most efficient content management.

Method 3: Transcript

While not technically an audio description, a full text transcript describing all visual and audio content can serve as an alternative:

<h2>Video Transcript</h2>
<p><strong>Visual:</strong> A woman in a blue jacket enters a coffee shop.</p>
<p><strong>Barista:</strong> Good morning! What can I get for you?</p>
<p><strong>Woman:</strong> Just a black coffee, please.</p>
<p>
  <strong>Visual:</strong> She smiles and hands the barista her credit card.
</p>

Transcripts benefit:

  • Deaf-blind users
  • Users who prefer reading to watching
  • Search engines (SEO)
  • Translation services
  • Quick reference and skimming

Recommendation: Provide both audio descriptions AND transcripts when possible. They serve different user needs and usage patterns.

Quality Checklist

Before publishing audio-described content, verify:

  • All important visual information is described
  • Descriptions fit naturally into pauses in dialogue
  • Language is clear, simple, and in present tense
  • Tone is neutral and objective
  • Descriptions match the style and pace of the content
  • New speakers and scenes are identified
  • Visual transitions and time passages are noted
  • On-screen text is read
  • Descriptions don’t spoil upcoming events
  • Camera movements aren’t described unnecessarily
  • Offensive language is avoided
  • Extended descriptions are used when needed
  • Users can control audio description availability
  • Captions include audio description narration

Conclusion

Audio descriptions make video content accessible to people who are blind or have low vision, ensuring they can enjoy and understand visual media fully. While creating quality audio descriptions requires skill and practice, the impact is profound—you’re providing equal access to information, entertainment, and education.

Key takeaways:

  • Audio descriptions narrate important visual information during natural pauses
  • They’re required for prerecorded multimedia under WCAG Level AA
  • Describe actions, settings, characters, expressions, and on-screen text
  • Use clear, present-tense language that matches the content’s tone
  • Provide user control over audio description availability
  • Use extended descriptions when standard pauses aren’t sufficient
  • Combine with transcripts for maximum accessibility

Remember: Audio descriptions aren’t just about compliance—they’re about ensuring everyone can participate fully in our increasingly visual digital world. Invest in quality audio descriptions, and you’ll create content that truly serves all users.

Resources

Audio description services:

  • Audio Description Associates
  • WGBH Media Access Group
  • Descriptive Video Works

Tools and platforms:

  • YouTube - Supports multiple audio tracks
  • Vimeo - Supports audio description tracks
  • HTML5 video - Can implement multiple audio tracks
  • YouDescribe - Crowdsourced audio descriptions

Training and guidelines:

  • Audio Description Coalition - Best practices and standards
  • DCMP - Described and Captioned Media Program
  • ACB - American Council of the Blind audio description guidelines