Audio Categorization

Audio categorization identifies and classifies audio segments, for example as "speech", "silence", "noise", or "music".

Configuration Parameter Description
AudioCategorizeServers A comma-separated list of IDOL Speech Servers to use for audio categorization.
Input The audio track to analyze.
MaxConsecutiveTries The maximum number of attempts that Media Server makes to connect to the servers listed in the AudioCategorizeServers parameter.
SampleFrequency The sample frequency of the audio to send to the IDOL Speech Server.
SegmentDuration The maximum duration of a record in the SegmentedResult track.
Type The analysis engine to use. Set this parameter to AudioCategorize.

Output Tracks

Output track Description
Result Contains one record, describing the analysis results, for each identified audio segment.
SegmentedResult Contains one or more records, describing the analysis results, for each identified audio segment. The maximum duration of records in this track is limited by the parameter SegmentDuration, so multiple records are created with the same ID when an audio segment continues for longer than this duration.

_FT_HTML5_bannerTitle.htm