Broadcast News

Bookmark and Share
05/06/2014

A Viable Economic Alternative to AD...

News Image
Both Spoken Subtitling and Audio Description may be candidate services that could benefit from the application of Text-To-Speech technology, says John Birch, Strategic Partnerships Manager, Screen Systems

Speech synthesis has improved dramatically in recent years, with computer generated voices that can convey emphasis and emotion, without excessive need for 'mark-up' within the text used as input. In addition, recent research indicates that users may be happy with computer generated speech for Audio Description if it leads to more provision. As provision of Audio Description is being increasingly mandated, it is suggested that Text To Speech is a viable economic alternative to using voice talents, particularly for non-premium channels.

Audio Description
Traditionally, a trained 'describer' identifies appropriate points in the audio timeline where a description is needed (and can be placed) and produces a script. This process has much in common with captioning, but, perhaps for historical reasons, is almost always performed separately. Typically, the ‘describer’ voices and records the descriptions, although sometimes the ‘voicing’ is performed by a separate 'voice talent'. The (mono) recordings of each description are used to produce a full-length audio description track, which may be 'pre-mixed' to create a separate audio track in advance, or live mixed with the original audio at play-out.
Live mixing generally uses the mono description track and a control track. This control track (sometimes termed a ‘warble track’ because of its sound) contains a low rate digital signal that encodes pan and fade information. This information defines how the descriptive audio should be mixed with the original audio, allowing the balance between the description and the original audio to be controlled.

Spoken Subtitles
Spoken Subtitles are currently provided by national television broadcasters in some European countries as a service to provide accessibility for the blind and partially sighted viewers.
Spoken Subtitles supplement an Audio Description service and replace the inaccessible foreign language narrative provided as text subtitles. Unlike Audio Description, Spoken Subtitles are traditionally provisioned using Text To Speech, because the textual data is already available in the form of subtitle files and adding machine ‘reading’ of these is not operationally challenging. It is highly unusual for Spoken Subtitles to receive any special preparatory effort (for example, to match the voice with the gender of the speaker).
It should be understood that the 'quality' requirements of Spoken Subtitles may be different to Audio Description. Spoken Subtitles can be easily provided automatically for all programmes that have translation subtitles by default. In some regions and channels this is ALL programmes. As the spoken subtitles are automatically derived from the subtitle texts, the original spoken audio (in a foreign language) may be left audible, as it carries hint information such as the mood and gender of speaker. (The typical Audio Description practise of muting the original audio would detract from the quality of the viewing experience as audible cues would then be removed).

Technical Implementation
Both Spoken Subtitles and Audio Description have a common root in a timed script file. From this timed script file, audio is created. The main difference between the two practices is the 'typical' method of audio creation, using ‘voice talents’ for Audio Description and using Text To Speech for Spoken Subtitles. Additionally there may be a difference in the mixing of audio tracks due to a desire to retain connected information in the original program audio for Spoken Subtitles. From a technical perspective, both Spoken Subtitles and Audio Description may be provisioned using the same Text To Speech 'engine'.

Live insertion at programme playout
Screen has developed an output driver for our Polistream subtitle and caption transmission system that acts like all other Polistream output encoders. This specialist Polistream module receives 'subtitle texts' and renders them (using a programming interface called SAPI 5) to drive a Text to Speech engine and produces audio snippets.
The module will attempt to fit the generated audio snippets into the available time by re-rendering audio that is too long (by speeding up the spoken rate). If a delay in audio snippets occurs then the module will cut and fade the generated audio snippets. 'Live' Audio Description can also be rendered using the module, but in this case, the duration is unknown ahead of time, so there is no rate modification.
The specialist module can also detect the presence of an audio filename (hidden as metadata) in the subtitle file, in which case the identified sound file is loaded instead of performing a Text To Speech operation. This allows for traditional 'voice-talent' produced Audio Description to be supported, allowing a combination of Text To Speech and ‘voice-talent’ produced Audio Description, or the playout of pre-recorded voiceovers or other short audio prompts.
The same technology is also available as a module for our MediaMate product, to allow offline processing for file based workflows.

The article is also available in BFV online

(IT)
VMI.TV Ltd

Top Related Stories
Click here for the latest broadcast news stories.

02/11/2009
Itfc Provides Subtitling And Audio Description For Michael Jackson Film
itfc, the leading London-based media access services provider, has completed work on This is It, a film that followed Michael Jackson as he prepared f
24/03/2009
Screen Subtitling Systems Wins Turner Contract For HD Subtitling In Argentina
Screen Subtitling Systems has won a contract with Turner to provide two HD subtitling systems for Imagen Satelital S.A. its affiliate in Argentina. Pr
02/05/2006
Screen Subtitling Systems unveil latest HD subtitling technology at NAB
Screen Subtitling’s latest innovations mean that all types of subtitling services are available in HD. From preparation through transmission to compre
21/11/2018
Subtitling Is A Profit-Boosting Opportunity For Broadcasters
They are not limited to just being used as translation devices for foreign films or only of benefit to the hearing-impaired. Therefore, why is it that
19/10/2018
Captionmax Buys Licenses For Starfish's Advantage Audio Description Workstation
Captionmax has bought and implemented multiple licenses for Starfish's Advantage Audio Description workstation and linear audio processing software. T
16/10/2015
Production News : French Drama First To Be Broadcast With Audio Description
French drama series The Returned is to become the first drama broadcast by Channel 4 with audio description. The audio description of the eight-part d
10/09/2004
BBC Broadcast to provide audio description and visual signing to five
BBC Broadcast, one of the commercial arms of the BBC, has won a four and a half year audio description and visual signing contract with terrestrial ch
24/11/2003
Five launch 'audio description' on digital satellite
Five are to mark the International Day of Disabled Persons on December 3 2003 by becoming the first UK public service broadcaster to offer audio descr
11/11/2016
What Is The Future For Immersive Audio?
Peter Poers, Managing Director at Jünger Audio, looks at production efforts versus consumer experience. Introduction Along with the evolution of highe
18/11/2024
W E Audio Invests In Martin Audio WPL
Long-term Martin Audio rental partner, W E Audio, recently underlined its commitment to the British manufacturer, by making a massive investment acros
20/02/2024
NADiV Audio Introduces Range Of Dante Audio And Control Devices
NADiV Audio has launched its NADiV range of Dante-enabled audio interface and control devices for portable and installed AV and pro audio environments
28/07/2023
DHD audio Unveils XS3 Core Audio Processor
DHD audio has announced a new addition to its modular range of audio studio equipment and systems. The XS3 core audio processor supports up to 20 ster
17/07/2023
ES-Pro Audio Appointed To Handle Prism Sound's Range Of Audio Converters
Prism Sound has appointed ES-Pro Audio to handle its entire range of audio converters to the professional market in Germany. Formerly a Prism Sound re
22/05/2023
Synthax Audio Appointed Distributor For TIERRA Audio
Synthax Audio UK has been appointed UK and Ireland distributor for TIERRA Audio's range of professional audio products. Founded in 2018 in Madrid, Spa
12/04/2023
Audio-Technica's BP3600 Immersive Audio Microphone Now Available
Audio-Technica has announced scheduled availability in Europe and the UK for its recently launched BP3600 Immersive Audio Microphone. A premium broadc