Object Detection using Deep Learning

Perform classification, object detection, transfer learning using convolutional neural networks (CNNs, or ConvNets)

Convolutional neural networks (CNNs, or ConvNets) are essential tools for deep learning, and are especially useful for image classification, object detection, and recognition tasks. CNNs are implemented as a series of interconnected layers. The layers are made up of repeated blocks of convolutional, ReLU (rectified linear units), and pooling layers. The convolutional layers convolve their input with a set of filters. The filters were automatically learned during network training. The ReLU layer adds nonlinearity to the network, which enables the network to approximate the nonlinear mapping between image pixels and the semantic content of an image. The pooling layers downsample their inputs and help consolidate local image features

Convolutional neural networks require Deep Learning Toolbox™. Training and prediction are supported on a CUDA^®-capable GPU with a compute capability of 3.0 or higher. Use of a GPU is recommended and requires Parallel Computing Toolbox™

You can construct a CNN architecture, train a network using semantic segmentation, and use the trained network to predict class labels or detect objects. You can also extract features from a pretrained network, and use these features to train a classifier. Additionally, you can perform transfer learning which retrains the CNN on new data.You can also use the Image Labeler, Video Labeler, feature extractors, and classifiers to create a custom detector

Functions

expand all

Load Training Data

`boxLabelDatastore`	Datastore for bounding box label data
`imageDatastore`	Datastore for image data
`groundTruth`	Ground truth label data
`objectDetectorTrainingData`	Create training data for an object detector
`combine`	Combine data from multiple datastores

Augment and Preprocess Training Data

`balanceBoxLabels`	Balance image blocks using bounding boxes and big images
`bboxcrop`	Crop bounding boxes
`bboxresize`	Resize bounding boxes
`bboxwarp`	Apply geometric transformation to bounding boxes
`bbox2points`	Convert rectangle to corner points list
`imwarp`	Apply geometric transformation to image
`imcrop`	Crop image
`imresize`	Resize image
`randomAffine2d`	Create randomized 2-D affine transformation
`centerCropWindow2d`	Create rectangular center cropping window
`randomCropWindow2d`	Create randomized rectangular cropping window
`integralImage`	Calculate 2-D integral image

Design Object Detection Deep Learning Networks

`anchorBoxLayer`	Create anchor box layer for object detection
`estimateAnchorBoxes`	Estimate anchor boxes for deep learning object detectors
`fasterRCNNLayers`	Create a faster R-CNN object detection network
`focalLossLayer`	Create focal loss layer using focal loss function
`roiAlignLayer`	Non-quantized ROI pooling layer for Mask-CNN
`roiInputLayer`	ROI input layer for Fast R-CNN
`roiMaxPooling2dLayer`	Neural network layer used to output fixed-size feature maps for rectangular ROIs
`rpnSoftmaxLayer`	Softmax layer for region proposal network (RPN)
`rpnClassificationLayer`	Classification layer for region proposal networks (RPNs)
`rcnnBoxRegressionLayer`	Box regression layer for Fast and Faster R-CNN
`regionProposalLayer`	Region proposal layer for Faster R-CNN
`spaceToDepthLayer`	Space to depth layer
`ssdLayers`	SSD multibox object detection network
`ssdMergeLayer`	Create SSD merge layer for object detection
`yolov2Layers`	Create YOLO v2 object detection network
`yolov2TransformLayer`	Create transform layer for YOLO v2 object detection network
`yolov2OutputLayer`	Create output layer for YOLO v2 object detection network
`yolov2ReorgLayer`	(Not recommended) Create reorganization layer for YOLO v2 object detection network
`focalCrossEntropy`	Compute focal cross-entropy loss

Train Object Detectors

`trainRCNNObjectDetector`	Train an R-CNN deep learning object detector
`trainFastRCNNObjectDetector`	Train a Fast R-CNN deep learning object detector
`trainFasterRCNNObjectDetector`	Train a Faster R-CNN deep learning object detector
`trainSSDObjectDetector`	Train an SSD deep learning object detector
`trainYOLOv2ObjectDetector`	Train YOLO v2 object detector

Evaluate Training Results

`evaluateDetectionAOS`	Evaluate average orientation similarity metric for object detection
`evaluateDetectionMissRate`	Evaluate miss rate metric for object detection
`evaluateDetectionPrecision`	Evaluate precision metric for object detection
`bboxOverlapRatio`	Compute bounding box overlap ratio
`bboxPrecisionRecall`	Compute bounding box precision and recall against ground truth

Detect Objects Using Deep Learning Detectors

`rcnnObjectDetector`	Detect objects using R-CNN deep learning detector
`fastRCNNObjectDetector`	Detect objects using Fast R-CNN deep learning detector
`fasterRCNNObjectDetector`	Detect objects using Faster R-CNN deep learning detector
`ssdObjectDetector`	Detect objects using SSD deep learning detector
`yolov2ObjectDetector`	Detect objects using YOLO v2 object detector
`selectStrongestBbox`	Select strongest bounding boxes from overlapping clusters
`selectStrongestBboxMulticlass`	Select strongest multiclass bounding boxes from overlapping clusters

Visualize Detection Results

`insertObjectAnnotation`	Annotate truecolor or grayscale image or video stream
`insertObjectMask`	Insert masks in image or video stream
`insertShape`	Insert shapes in image or video
`showShape`	Display shapes on image, video, or point cloud

Topics

Get Started

Choose Function to Visualize Detected Objects

Compare visualization functions.

Getting Started with Object Detection Using Deep Learning

Object detection using deep learning neural networks.

Getting Started with R-CNN, Fast R-CNN, and Faster R-CNN

R-CNN, Fast R-CNN, and Faster R-CNN basics

Getting Started with Mask R-CNN for Instance Segmentation

Perform multiclass instance segmentation using Mask R-CNN and deep learning.

Getting Started with YOLO v2

You only look once (YOLO) v2 basics

Getting Started with SSD Multibox Detection

Single shot detection basics.

Anchor Boxes for Object Detection

Basics of anchor boxes that are used in deep learning object detection

Create Training Data for Object Detection

Datastores for Deep Learning (Deep Learning Toolbox)

Learn how to use datastores in deep learning applications.

Training Data for Object Detection and Semantic Segmentation

Create training data for object detection or semantic segmentation using the Image Labeler or Video Labeler.

Object Detection Using Deep Learning

Deep Network Designer (Deep Learning Toolbox)

List of Deep Learning Layers (Deep Learning Toolbox)

Discover all the deep learning layers in MATLAB^®.

Deep Learning in MATLAB (Deep Learning Toolbox)

Discover deep learning capabilities in MATLAB using convolutional neural networks for classification and regression, including pretrained networks and transfer learning, and training on GPUs, CPUs, clusters, and clouds.

Pretrained Deep Neural Networks (Deep Learning Toolbox)

Learn how to download and use pretrained convolutional neural networks for classification, transfer learning and feature extraction.