Object Class Recognition

Object class recognition detects objects that belong to predefined classes when they appear in media. For example, if you are processing video captured by a CCTV camera, you could train Media Server to return the locations of pedestrians, cyclists, and cars that appear in the video.

| Configuration Parameter | Description |
|---|---|
| DetectionThreshold | The minimum confidence score necessary for Media Server to output a result. |
| Input | The track to analyze. |
| MaxImageSize | The maximum image size for analysis. |
| MaxInputQueueLength | Can be used to place a limit on latency. |
| NumParallel | The maximum number of video frames to analyze simultaneously. |
| Recognizer | The recognizer to use for recognizing objects. |
| Region | The region of the frame to restrict processing to. |
| RegionUnit | The units to use when setting the Region parameter. |
| RestrictToInputRegion | Specifies whether to analyze a region of the input image or video frame that is specified in the input record, instead of the entire image. |
| SampleInterval | The interval at which frames are selected to be analyzed. |
| SegmentDuration | The maximum duration of a record in the SegmentedResult or SegmentedResultWithSource track. |
| SyncDatabase | Specifies whether to synchronize with the training database before starting analysis. |
| Type | The analysis engine to use. Set this parameter to ObjectClassRecognition. |
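
As an illustration of how these parameters fit together, the following Python sketch writes a single task section in `Key=Value` form. Only `Type=ObjectClassRecognition` comes from the table above. The section name `ObjectClassTask`, the recognizer name `roadscene`, and the `NumParallel` value are hypothetical placeholders, so treat this as a sketch, not a tested Media Server configuration.

```python
import configparser
import io

# Build one task section. Values marked "hypothetical" are placeholders
# chosen for illustration; they are not taken from the documentation.
config = configparser.ConfigParser()
config.optionxform = str  # keep the original case of parameter names

config["ObjectClassTask"] = {               # hypothetical section name
    "Type": "ObjectClassRecognition",       # value required by the Type parameter
    "Recognizer": "roadscene",              # hypothetical recognizer name
    "NumParallel": "2",                     # hypothetical: analyze up to 2 frames at once
}

# Render the section as text without spaces around "=".
buffer = io.StringIO()
config.write(buffer, space_around_delimiters=False)
section_text = buffer.getvalue()
print(section_text)
```

Any of the other parameters in the table could be added to the dictionary in the same way, once you know the values that suit your deployment.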

Output Tracks

| Output track | Description | Output [1] |
|---|---|---|
| Data | Contains one record, describing the analysis results, per recognized object, per video frame. | No |
| DataWithSource | The same as the Data track, but each record also includes the source frame. | No |
| Result | Contains one record, describing the analysis results, for each recognized object. When the same object appears in many consecutive frames, Media Server produces a single result. | Yes |
| ResultWithSource | The same as the Result track, but each record also includes the best source frame. | No |
| SegmentedResult | (Available only when processing video). Contains one or more records, describing the analysis results, for each recognized object. The maximum duration of records in this track is limited by the parameter SegmentDuration, so multiple records are created with the same ID when an object remains in the scene for longer than this duration. | Yes |
| SegmentedResultWithSource | (Available only when processing video). The same as the SegmentedResult track, but each record also includes the best source frame. | No |
| Start | The same as the Data track, except it contains only the first record of each event. | No |
| End | The same as the Data track, except it contains only the last record of each event. | No |

[1] This column indicates whether the information contained in the track is included by default in the output created by an output task (when you don't set the Input parameter for the output task).
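
The relationship between SegmentDuration and the SegmentedResult track can be pictured with a small toy model. The Python sketch below is not Media Server code. The `Segment` class and the `segment_result` function are hypothetical names, and the exact segment boundaries that Media Server chooses are an assumption. It only shows the idea that one object which stays in the scene longer than the limit yields several records that share an ID.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """Toy stand-in for one SegmentedResult record (hypothetical type)."""
    object_id: int
    start: float  # seconds
    end: float    # seconds


def segment_result(object_id, start, end, segment_duration):
    """Split one object's time in the scene into records that are
    no longer than segment_duration, all carrying the same ID."""
    if segment_duration <= 0:
        raise ValueError("segment_duration must be positive")
    segments = []
    t = start
    while t < end:
        seg_end = min(t + segment_duration, end)
        segments.append(Segment(object_id, t, seg_end))
        t = seg_end
    return segments


# An object with ID 7 is visible for 25 seconds; the limit is 10 seconds.
records = segment_result(object_id=7, start=0.0, end=25.0, segment_duration=10.0)
for record in records:
    print(record)
```

In this toy model the object produces three records (two of 10 seconds and one of 5 seconds), whereas the Result track would describe the same object with a single record.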
