Audio categorization identifies and classifies audio segments, for example as "speech", "silence", "noise", or "music".
Configuration Parameter | Description |
---|---|
AudioCategorizeServers | A comma-separated list of IDOL Speech Servers to use for audio categorization. |
Input | The audio track to analyze. |
MaxConsecutiveTries | The maximum number of attempts that Media Server makes to connect to the servers listed in the AudioCategorizeServers parameter. |
SampleFrequency | The sample frequency of the audio to send to the IDOL Speech Server. |
SegmentDuration | The maximum duration of a record in the SegmentedResult track. |
Type | The analysis engine to use. Set this parameter to AudioCategorize . |
Output track | Description |
---|---|
Result
|
Contains one record, describing the analysis results, for each identified audio segment. |
SegmentedResult
|
Contains one or more records, describing the analysis results, for each identified audio segment. The maximum duration of records in this track is limited by the parameter SegmentDuration, so multiple records are created with the same ID when an audio segment continues for longer than this duration. |
|