invertedImageIndex class

Search index that maps visual words to images

Syntax

imageIndex = invertedImageIndex(bag)
imageIndex = invertedImageIndex(bag,'SaveFeatureLocations',tf)
imageIndex = invertedImageIndex(___,Name,Value)

Construction

imageIndex = invertedImageIndex(bag) returns a search index object that you can use with the retrieveImages function to search for an image. The object stores the visual word-to-image mapping based on the input bag, a bagOfFeatures object.

imageIndex = invertedImageIndex(bag,'SaveFeatureLocations',tf) optionally specifies whether or not to save the feature location data in imageIndex.

imageIndex = invertedImageIndex(___,Name,Value) uses additional options specified by one or more Name,Value pair arguments, using any of the preceding syntaxes.

Input Arguments

bag

Bag of visual words, specified as a bagOfFeatures object.

SaveFeatureLocations

Save feature locations, specified as a logical scalar, tf. When you set tf to true, the image feature locations are saved in the imageIndex output object. Use the location data to verify spatial or geometric image search results. If you do not require feature locations, set tf to false to reduce memory consumption.
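
For example, this minimal sketch constructs an index without feature locations to reduce memory use. It assumes bag is an existing bagOfFeatures object.

% Skip feature locations when spatial verification is not needed.
imageIndex = invertedImageIndex(bag,'SaveFeatureLocations',false);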

Properties

ImageLocation

Indexed image locations, stored as a cell array.

ImageWords

Visual words, stored as a 1-by-M vector of visualWords objects, one for each indexed image. Each visualWords object contains the WordIndex, Location, VocabularySize, and Count properties for its image.
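
As a sketch, assuming imageIndex is a populated invertedImageIndex object, you can inspect the words assigned to the first indexed image:

% Visual words assigned to the first indexed image.
firstImageWords = imageIndex.ImageWords(1);
firstImageWords.WordIndex   % indices of the visual words found
firstImageWords.Count       % how often each word occurs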

WordFrequency

Word occurrence, specified as an M-by-1 vector that contains the percentage of images in which each visual word occurs. These percentages are analogous to document frequency in text retrieval applications. It is often helpful to suppress the most common words to reduce the search set when looking for the most relevant images. It is also helpful to suppress rare words, because they probably come from outliers in the image set.

BagOfFeatures

Bag of visual words, specified as the bagOfFeatures object used to create the index.

MatchThreshold

Percentage of similar words required between a query image and a potential image match, specified as a numeric value in the range [0, 1]. To obtain more search results, lower this threshold.
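
For example, assuming imageIndex is an existing invertedImageIndex object, you can tune this property directly; the value below is illustrative, not a recommendation.

% Adjust the required word-overlap percentage (illustrative value).
imageIndex.MatchThreshold = 0.02;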

WordFrequencyRange

Word frequency range, specified as a two-element vector of lower and upper percentages, [lower upper]. Use the word frequency range to ignore the most common words (those above the upper percentage) and the rarest words (those below the lower percentage) in the image index. Common words often come from repeated patterns, and rare words often come from outliers in the image set; both can reduce search accuracy. You can control how much the top and bottom end of the visual word distribution affects the search results by tuning this property. A good way to set the range is to plot the sorted WordFrequency values.
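
As a sketch, assuming imageIndex is a populated invertedImageIndex object, the following plots the sorted word frequencies and then sets a range. The [0.01 0.9] endpoints are illustrative, not recommendations.

% Plot the sorted word frequencies to inspect the distribution.
sortedFreq = sort(imageIndex.WordFrequency);
figure
plot(sortedFreq)
xlabel('Sorted word index')
ylabel('Word frequency')

% Ignore the rarest and the most common visual words in searches.
imageIndex.WordFrequencyRange = [0.01 0.9];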

Methods

addImages - Add new images to image index
removeImages - Remove images from image index
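
For example, assuming imageIndex is a populated invertedImageIndex object, you can remove an indexed image by its index:

% Remove the image with index 3 from the search index.
removeImages(imageIndex,3);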

Examples

Define a set of images to search.

imageFiles = ...
  {'elephant.jpg', 'cameraman.tif', ...
   'peppers.png',  'saturn.png',...
   'pears.png',    'stapleRemover.jpg', ...
   'football.jpg', 'mandi.tif',...
   'kids.tif',     'liftingbody.png', ...
   'office_5.jpg', 'gantrycrane.png',...
   'moon.tif',     'circuit.tif', ...
   'tape.png',     'coins.png'};

imgSet = imageSet(imageFiles);

Learn the visual vocabulary.

bag = bagOfFeatures(imgSet,'PointSelection','Detector',...
  'VocabularySize',1000);
Creating Bag-Of-Features.
-------------------------
* Image category 1: <undefined>
* Selecting feature point locations using the Detector method.
* Extracting SURF features from the selected feature point locations.
** detectSURFFeatures is used to detect key points for feature extraction.

* Extracting features from 16 images in image set 1...done. Extracted 3680 features.

* Keeping 80 percent of the strongest features from each category.

* Balancing the number of features across all image categories to improve clustering.
** Image category 1 has the least number of strongest features: 2944.
** Using the strongest 2944 features from each of the other image categories.

* Using K-Means clustering to create a 1000 word visual vocabulary.
* Number of features          : 2944
* Number of clusters (K)      : 1000

* Initializing cluster centers...100.00%.
* Clustering...completed 17/100 iterations (~0.05 seconds/iteration)...converged in 17 iterations.

* Finished creating Bag-Of-Features

Create an image search index and add images.

imageIndex = invertedImageIndex(bag);

addImages(imageIndex, imgSet);
Encoding images using Bag-Of-Features.
--------------------------------------
* Image category 1: <undefined>
* Encoding 16 images from image set 1...done.

* Finished encoding images.

Specify a query image and an ROI to search for the target object, elephant.

queryImage = imread('clutteredDesk.jpg');
queryROI = [130 175 330 365]; 

figure
imshow(queryImage)
rectangle('Position',queryROI,'EdgeColor','yellow')

You can also use the imrect function to select an ROI interactively. For example, queryROI = getPosition(imrect).

Find images that contain the object.

imageIDs = retrieveImages(queryImage,imageIndex,'ROI',queryROI)
imageIDs = 15×1

     1
    11
     2
     6
    12
     8
    16
     3
    13
    14
      ⋮

bestMatch = imageIDs(1);

figure
imshow(imageIndex.ImageLocation{bestMatch})
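
To see how strongly each retrieved image matched the query, you can also request the similarity scores output from retrieveImages. This repeats the search above with a second output.

% Retrieve ranked image IDs together with their similarity scores.
[imageIDs,scores] = retrieveImages(queryImage,imageIndex,'ROI',queryROI);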

Introduced in R2015a