What is the most unique signature of an image?

Ehsan Mirsadeghi Popular answer

Ok Parul, You have two choice:

1- You can use a shift invariant but rotation variant feature of images.

2- You can utilize a shift and rotation invariant feature but also extract the rotation value of your feature to distinct between rotated features.

if keypoints are suitable for your project you can utilize U-SURF (Upright-Speeded Up Robust Feature) that is not invariant to image rotation.

an other keypoint which is rotation invariant but you can extract rotation angle is SIFT (Scale Invariant Feature Transform).

U can find more information about SIFT and SURF on

http://www.cs.ubc.ca/~lowe/keypoints/

http://www.vision.ee.ethz.ch/~surf/eccv06.pdf‎

Ehsan Mirsadeghi

The most signature of an images is strongly depend on your application and what do you want from that feature. in some applications, histogram of a photometric color space can be suitable however in other applications more complicated computation require for extracting distinctive signature/feature.

If you give us more information about your application we can suggest you better tool for your project.

Parul Nilesh Shah

I want a signature which should change even if the image is rotated, so histogram will not work in my case.

David Cornforth

Parul Shah, I do not understand why you say that the histogram will change when the image is rotated. How can a histogram change when an image is rotated?

David Cornforth

The most unique signature of an image is the value of every single pixel in the correct order. But I do not think you want this. You want a compressed signature, smaller is size than the image, rotation-invariant and perhaps shift-invariant too? I suppose you have looked at Fourier and Wavelet transforms?

Lambert Zijp

You ask for MOST unique... Something is unique or not. I tend to agree with David: the uniqueness is the value and order of pixels.

I guess the number of bytes you want to use to store the 'uniqueness' is crucial. If you use only one integer, I would go for a CRC, which can be used for any data, not specifically images.

Martin Georg Ljungqvist

Do you really mean that the signature should change even if the image is rotated? Or do you mean that the signature should NOT change if the image is rotated? If the later, I agree with Mr Cornforth that it seems as you are looking for rotation-invariant features of some sorts.

Parul Nilesh Shah

I want the signature to CHANGE if the image is rotated or flipped (mirror image) i.e. a flipped image or rotated image is a different image for me. A small amount of translation or scaling can be considered as the same image and hence same signature. So it might be shift invariant (small shift) but not rotation-invariant.

Let me rephrase it as unique (may be not most unique) signature.

Yes, I need a smaller signature, easy to store and match.

Histogram is not a signature of an image and so cannot be used here (it will not only change if the image is rotated but two completely different image can also have the same histogram).

Ehsan Mirsadeghi

Ok Parul, You have two choice:

1- You can use a shift invariant but rotation variant feature of images.

2- You can utilize a shift and rotation invariant feature but also extract the rotation value of your feature to distinct between rotated features.

if keypoints are suitable for your project you can utilize U-SURF (Upright-Speeded Up Robust Feature) that is not invariant to image rotation.

an other keypoint which is rotation invariant but you can extract rotation angle is SIFT (Scale Invariant Feature Transform).

U can find more information about SIFT and SURF on

http://www.cs.ubc.ca/~lowe/keypoints/

http://www.vision.ee.ethz.ch/~surf/eccv06.pdf‎

Francesco Banterle

I suggest SIFT, but if you have problem with licenses there are other alternatives such as SURF. For example, daisy is not bad:

http://cvlab.epfl.ch/software/daisy

A very fast descriptor is ORB the rotational invariant version of BRIEF:

http://www.vision.cs.chubu.ac.jp/CV-R/pdf/Rublee_iccv2011.pdf

Xiaomeng Wu

There have been great answers here. Just in case, please note the following issues:

1. SIFT, SURF, and ORB are NOT image features.

They are local features, i.e. features of a very local region in an image. To generate a signature for an image, one usually uses a bag-of-visual-word (BOVW) model based on local features. Please refer to the following links for more details.

http://en.wikipedia.org/wiki/Bag-of-words_model_in_computer_vision

http://www.robots.ox.ac.uk/~vgg/publications/papers/philbin07.pdf

There are two ways to obtain the local regions, from which SIFT etc. will be extracted. One is to use keypoint detector, e.g. Hessian Affine or MSER etc. Another is to densely sampling local regions from the image. Please google using keywords like "keypoint detection BOVW" and "Dense Sampling BOVW".

2. SIFT, SURF, and ORB are rotation INVARIANT.

But you can make them rotation variant by fixing the orientation of the local region. Please refer to the following link for what the orientation of a local region is.

http://www.vlfeat.org/overview/sift.html

You can use the tool in the VLFeat library for dense sampling based SIFT extraction.

3. BOVW is expensive in terms of offline indexing.

The online speed of BOVW-based image matching is OK. But the offline processing includes visual vocabulary construction and visual word assignment. The former can be achieved by approximated k-means, and the latter by approximated nearest neighbor search. Depending on the size of your dataset, these two steps can be very slow.

If you only care about online speed, then you can ignore this issue.

Lambert Zijp

A question out of curiosity to the SIFT, SURF, ORB and BOVW guru's : how many bytes are needed to store those unique signatures of e.g. a 1 MB color portrait?

And also: how UNIQUE are those signatures?

Xiaomeng Wu

To Dr. Lambert Zijp,

I am not a guru of this field, but maybe I can share some of my experience.

The signature size depends on the image resolution and how we sample the local regions.

Given a 1024x768 image and a Hessian Affine detector, the number of keypoints range from 1,000 to 10,000, averagely 3,000. If we use standard TF-IDF weighting, the signature will contain around 3,000 doubles, i.e. 3,000x8=24,000 bytes.

Just in case, the BOVW representation is usually a real feature vector, NOT a signature. If you need a signature, you may have to binarize it using PCA hashing, spectral hashing, or etc. I am not familiar with this issue, so there maybe better solutions.

The second question is hard.

The uniqueness of BOVW depends on the application, the dataset, the definition of "same" images, and the type of variations between "same" images.

Let's just consider standard BOVW with TF-IDF and forget signature. Suppose that the task is object retrieval. Given a small dataset with 5,000 images of say buildings or logos, the mean average precision (MAP) of retrieving this dataset ranges from 50% to 70%. Adding a number of non-trivial pre- and post-processings can boost the MAP to 80% to 90%.

Which is the most efficient approach to extract 'text' region from a given scene/image?

How to learn more about SPSS and its Application?

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

Baseline drift in HPLC? What causes this?

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

How are iso-frequency contours plotted?

How to prepare the nanoparticle treated fungal sample for Environmental SEM analysis?

How to normalize and take the significance of the MTT OD values with 3 replicates for the same cell-line?

Why does my protein refolded to beta sheet during thermal denaturation analysis?