detect

Detect objects using ACF object detector configured for monocular camera

Syntax

bboxes = detect(detector,I)

[bboxes,scores]
= detect(detector,I)

[___]= detect(detector,I,roi)

[___] = detect(___,Name,Value)

Description

bboxes = detect(detector,I) detects objects within image I using an aggregate channel features (ACF) object detector configured for a monocular camera. The locations of objects detected are returned as a set of bounding boxes.

example

[bboxes,scores] = detect(detector,I) also returns the detection confidence scores for each bounding box.

[___]= detect(detector,I,roi) detects objects within the rectangular search region specified by roi, using any of the preceding syntaxes.

[___] = detect(___,Name,Value) specifies options using one or more Name,Value pair arguments. For example, detect(detector,I,'WindowStride',2) sets the stride of the sliding window used to detect objects to 2.

Examples

collapse all

Detect Vehicles Using Monocular Camera and ACF

Open Script

Configure an ACF object detector for use with a monocular camera mounted on an ego vehicle. Use this detector to detect vehicles within video frames captured by the camera.

Load an acfObjectDetector object pretrained to detect vehicles.

detector = vehicleDetectorACF;

Model a monocular camera sensor by creating a monoCamera object. This object contains the camera intrinsics and the location of the camera on the ego vehicle.

focalLength = [309.4362 344.2161];    % [fx fy]
principalPoint = [318.9034 257.5352]; % [cx cy]
imageSize = [480 640];                % [mrows ncols]
height = 2.1798;                      % height of camera above ground, in meters
pitch = 14;                           % pitch of camera, in degrees
intrinsics = cameraIntrinsics(focalLength,principalPoint,imageSize);

monCam = monoCamera(intrinsics,height,'Pitch',pitch);

Configure the detector for use with the camera. Limit the width of detected objects to a typical range for vehicle widths: 1.5–2.5 meters. The configured detector is an acfObjectDetectorMonoCamera object.

vehicleWidth = [1.5 2.5];
detectorMonoCam = configureDetectorMonoCamera(detector,monCam,vehicleWidth);

Load a video captured from the camera, and create a video reader and player.

videoFile = fullfile(toolboxdir('driving'),'drivingdata','caltech_washington1.avi');
reader = VideoReader(videoFile);
videoPlayer = vision.VideoPlayer('Position',[29 597 643 386]);

Run the detector in a loop over the video. Annotate the video with the bounding boxes for the detections and the detection confidence scores.

cont = hasFrame(reader);
while cont
   I = readFrame(reader);

   % Run the detector.
   [bboxes,scores] = detect(detectorMonoCam,I);
   if ~isempty(bboxes)
       I = insertObjectAnnotation(I, ...
                           'rectangle',bboxes, ...
                           scores, ...
                           'Color','g');
   end
   videoPlayer(I)
   % Exit the loop if the video player figure is closed.
   cont = hasFrame(reader) && isOpen(videoPlayer);
end

release(videoPlayer);

Input Arguments

collapse all

`detector` — ACF object detector configured for monocular camera
`acfObjectDetectorMonoCamera` object

ACF object detector configured for a monocular camera, specified as an acfObjectDetectorMonoCamera object. To create this object, use the configureDetectorMonoCamera function with a monoCamera object and trained acfObjectDetector (Computer Vision Toolbox) object as inputs.

`I` — Input image
grayscale image | RGB image

Input image, specified as a real, nonsparse, grayscale or RGB image.

`roi` — Search region of interest
[x y width height] vector

Search region of interest, specified as an [x y width height] vector. The vector specifies the upper left corner and size of a region in pixels.

Name-Value Pair Arguments

Specify optional comma-separated pairs of Name,Value arguments. Name is the argument name and Value is the corresponding value. Name must appear inside quotes. You can specify several name and value pair arguments in any order as Name1,Value1,...,NameN,ValueN.

Example: 'NumScaleLevels',4

`'NumScaleLevels'` — Number of scale levels per octave
`8` (default) | positive integer

Number of scale levels per octave, specified as the comma-separated pair consisting of 'NumScaleLevels' and a positive integer. Each octave is a power-of-two downscaling of the image. To detect people at finer scale increments, increase this number. Recommended values are in the range [4, 8].

`'WindowStride'` — Stride for sliding window
`4` (default) | positive integer

Stride for the sliding window, specified as the comma-separated pair consisting of 'WindowStride' and a positive integer. This value indicates the distance for the function to move the window in both the x and y directions. The sliding window scans the images for object detection.

`'SelectStrongest'` — Select strongest bounding box for each object
`true` (default) | `false`

Select the strongest bounding box for each detected object, specified as the comma-separated pair consisting of 'SelectStrongest' and either true or false.

true — Return the strongest bounding box per object. To select these boxes, detect calls the selectStrongestBbox (Computer Vision Toolbox) function, which uses nonmaximal suppression to eliminate overlapping bounding boxes based on their confidence scores.
false — Return all detected bounding boxes. You can then create your own custom operation to eliminate overlapping bounding boxes.

`'MinSize'` — Minimum region size
[height width] vector

Minimum region size that contains a detected object, specified as the comma-separated pair consisting of 'MinSize' and a [height width] vector. Units are in pixels.

By default, MinSize is the smallest object that the trained detector can detect.

`'MaxSize'` — Maximum region size
`size`(`I`) (default) | [height width] vector

Maximum region size that contains a detected object, specified as the comma-separated pair consisting of 'MaxSize' and a [height width] vector. Units are in pixels.

To reduce computation time, set this value to the known maximum region size for the objects being detected in the image. By default, 'MaxSize' is set to the height and width of the input image, I.

`'Threshold'` — Classification accuracy threshold
`–1` (default) | numeric scalar

Classification accuracy threshold, specified as the comma-separated pair consisting of 'Threshold' and a numeric scalar. Recommended values are in the range [–1, 1]. During multiscale object detection, the threshold value controls the accuracy and speed for classifying image subregions as either objects or nonobjects. To speed up the performance at the risk of missing true detections, increase this threshold.

Output Arguments

collapse all

`bboxes` — Location of objects detected within image
M-by-4 matrix

Location of objects detected within the input image, returned as an M-by-4 matrix, where M is the number of bounding boxes. Each row of bboxes contains a four-element vector of the form [x y width height]. This vector specifies the upper left corner and size of that corresponding bounding box in pixels.

`scores` — Detection confidence scores
M-by-1 vector

Detection confidence scores, returned as an M-by-1 vector, where M is the number of bounding boxes. A higher score indicates higher confidence in the detection.

Documentation

detect

Syntax

Description

Examples

Detect Vehicles Using Monocular Camera and ACF

Input Arguments

`detector` — ACF object detector configured for monocular camera
`acfObjectDetectorMonoCamera` object

`I` — Input image
grayscale image | RGB image

`roi` — Search region of interest
[x y width height] vector

Name-Value Pair Arguments

`'NumScaleLevels'` — Number of scale levels per octave
`8` (default) | positive integer

`'WindowStride'` — Stride for sliding window
`4` (default) | positive integer

`'SelectStrongest'` — Select strongest bounding box for each object
`true` (default) | `false`

`'MinSize'` — Minimum region size
[height width] vector

`'MaxSize'` — Maximum region size
`size`(`I`) (default) | [height width] vector

`'Threshold'` — Classification accuracy threshold
`–1` (default) | numeric scalar

Output Arguments

`bboxes` — Location of objects detected within image
M-by-4 matrix

`scores` — Detection confidence scores
M-by-1 vector

See Also

Apps

Functions

Objects

Automated Driving Toolbox Documentation

Support

Documentation

detect

Syntax

Description

Examples

Detect Vehicles Using Monocular Camera and ACF

Input Arguments

detector — ACF object detector configured for monocular camera acfObjectDetectorMonoCamera object

I — Input image grayscale image | RGB image

roi — Search region of interest [x y width height] vector

Name-Value Pair Arguments

'NumScaleLevels' — Number of scale levels per octave 8 (default) | positive integer

'WindowStride' — Stride for sliding window 4 (default) | positive integer

'SelectStrongest' — Select strongest bounding box for each object true (default) | false

'MinSize' — Minimum region size [height width] vector

'MaxSize' — Maximum region size size(I) (default) | [height width] vector

'Threshold' — Classification accuracy threshold –1 (default) | numeric scalar

Output Arguments

bboxes — Location of objects detected within image M-by-4 matrix

scores — Detection confidence scores M-by-1 vector

See Also

Apps

Functions

Objects

Automated Driving Toolbox Documentation

Support

`detector` — ACF object detector configured for monocular camera
`acfObjectDetectorMonoCamera` object

`I` — Input image
grayscale image | RGB image

`roi` — Search region of interest
[x y width height] vector

`'NumScaleLevels'` — Number of scale levels per octave
`8` (default) | positive integer

`'WindowStride'` — Stride for sliding window
`4` (default) | positive integer

`'SelectStrongest'` — Select strongest bounding box for each object
`true` (default) | `false`

`'MinSize'` — Minimum region size
[height width] vector

`'MaxSize'` — Maximum region size
`size`(`I`) (default) | [height width] vector

`'Threshold'` — Classification accuracy threshold
`–1` (default) | numeric scalar

`bboxes` — Location of objects detected within image
M-by-4 matrix

`scores` — Detection confidence scores
M-by-1 vector