Computer Vision Metrics: Survey, Taxonomy, and Analysis

Computer Vision Metrics: Survey, Taxonomy, and Analysis, 9781430259299 (1430259299), Apress, 2014

Computer Vision Metrics provides an extensive survey and analysis of over 100 current and historical feature description and machine vision methods, with a detailed taxonomy for local, regional and global features. This book provides necessary background to develop intuition about why interest point detectors and feature descriptors actually work, how they are designed, with observations about tuning the methods for achieving robustness and invariance targets for specific applications. The survey is broader than it is deep, with over 540 references provided to dig deeper. The taxonomy includes search methods, spectra components, descriptor representation, shape, distance functions, accuracy, efficiency, robustness and invariance attributes, and more. Rather than providing ‘how-to’ source code examples and shortcuts, this book provides a counterpoint discussion to the many fine opencv community source code resources available for hands-on practitioners.

What youÂll learn

Interest point & descriptor concepts (interest points, corners, ridges, blobs, contours, edges, maxima), interest point tuning and culling, interest point methods (Laplacian, LOG, Moravic, Harris, Harris-Stephens, Shi-Tomasi, Hessian, difference of Gaussians, salient regions, MSER, SUSAN, FAST, FASTER, AGHAST, local curvature, morphological regions, and more), descriptor concepts (shape, sampling pattern, spectra, gradients, binary patterns, basis features), feature descriptor families.
Local binary descriptors (LBP, LTP, FREAK, ORB, BRISK, BRIEF, CENSUS, and more).
Gradient descriptors (SIFT, SIFT-PCA, SIFT-SIFER, SIFT-GLOH, Root SIFT, CensureE, STAR, HOG, PHOG, DAISY, O-DAISY, CARD, RFM, RIFF-CHOG, LGP, and more).
Shape descriptors (Image moments, area, perimeter, centroid, D-NETS, chain codes, Fourier descriptors, wavelets, and more) texture descriptors, structural and statistical (Harallick, SDM, extended SDM, edge metrics, Laws metrics, RILBP, and more).
3D descriptors for depth-based, volumetric, and activity recognition spatio-temporal data sets (3D HOG, HON 4D, 3D SIFT, LBP-TOP, VLBP, and more).
Basis space descriptors (Zernike moments, KL, SLANT, steerable filter basis sets, sparse coding, codebooks, descriptor vocabularies, and more), HAAR methods (SURF, USURF, MUSURF, GSURF, Viola Jones, and more), descriptor-based image reconstruction.
Distance functions (Euclidean, SAD, SSD, correlation, Hellinger, Manhattan, Chebyshev, EMD, Wasserstein, Mahalanobis, Bray-Curtis, Canberra, L0, Hamming, Jaccard), coordinate spaces, robustness and invariance criteria.
Image formation, includes CCD and CMOS sensors for 2D and 3D imaging, sensor processing topics, with a survey identifying over fourteen (14) 3D depth sensing methods, with emphasis on stereo, MVS, and structured light.
Image pre-processing methods, examples are provided targeting specific feature descriptor families (point, line and area methods, basis space methods), colorimetry (CIE, HSV, RGB, CAM02, gamut mapping, and more).
Ground truth data, some best-practices and examples are provided, with a survey of real and synthetic datasets.
Vision pipeline optimizations, mapping algorithms to compute resources (CPU, GPU, DSP, and more), hypothetical high-level vision pipeline examples (face recognition, object recognition, image classification, augmented reality), optimization alternatives with consideration for performance and power to make effective use of SIMD, VLIW, kernels, threads, parallel languages, memory, and more.
Synthetic interest point alphabet analysis against 10 common opencv detectors to develop intuition about how different classes of detectors actually work (SIFT, SURF, BRISK, FAST, HARRIS, GFFT, MSER, ORB, STAR, SIMPLEBLOB). Source code provided online.
Visual learning concepts, although not the focus of this book, a light introduction is provided to machine learning and statistical learning topics, such as convolutional networks, neural networks, classification and training, clustering and error minimization methods (SVM,’s, kernel machines, KNN, RANSAC, HMM, GMM, LM, and more). Ample references are provided to dig deeper.

Who this book is for

Engineers, scientists, and academic researchers in areas including media processing, computational photography, video analytics, scene understanding, machine vision, face recognition, gesture recognition, pattern recognition and general object analysis.

Chapter 1. Image Capture and Representation

Chapter 2. Image Pre-Processing

Chapter 3. Global and Regional Features

Chapter 4. Local Feature Design Concepts, Classification, and Learning

Chapter 5. Taxonomy Of Feature Description Attributes

Chapter 6. Interest Point Detector and Feature Descriptor Survey

Chapter 7. Ground Truth Data, Data, Metrics, and Analysis

Chapter 8. Vision Pipelines and Optimizations

Appendix A. Synthetic Feature Analysis

Appendix B. Survey of Ground Truth Datasets

Appendix C. Imaging and Computer Vision Resources

Appendix D. Extended SDM Metrics

Comments

Amazing Books

Signal Analysis: Wavelets, Filter Banks, Time-Frequency Transforms and Applications

John Wiley & Sons, 1999

"...excellent and interesting reading for digital signal processing engineers and designers and for postgraduate students in electrical and computer faculties." (Mathematical Reviews, 2002d)

Signal Analysis explores methods that offer an insight into the properties of signals and stochastic processes. This...

Reconfigurable Computing: Accelerating Computation with Field-Programmable Gate Arrays

Springer, 2005

This volume is unique: the first comprehensive exposition of the exciting new field of Reconfigurable Computing with FPGAs. By mapping algorithms directly into programmable logic, FPGA accelerators offer and deliver 10X-100X performance increases over microprocessors for a large range of application domains. Reconfigurable computing is found in...

The iMovie '11 Project Book

Peachpit Press, 2011

I love that I’ve been able to write a book about projects in iMovie. Editing video, after all, is a project in many ways. It often requires a good deal of time and attention to detail. It’s something you elect to do, probably in your spare time, because you want to document what happened at an event, or preserve memories,...

Parkinson's Disease and Nonmotor Dysfunction (Current Clinical Neurology)

Humana Press, 2005

This collection of review articles provides detailed clinical descriptions and treatment recommendations for the important, but often unrecognized, nonmotor dysfunctions of Parkinson's disease. Topics range from behavioral abnormalities and autonomic dysfunction to sleep-related and sensory dysfunction, and include depression and anxiety,...

Fuzzy Models and Algorithms for Pattern Recognition and Image Processing

Springer, 2005

Fuzzy Models and Algorithms for Pattern Recognition and Image Processing presents a comprehensive introduction of the use of fuzzy models in pattern recognition and selected topics in image processing and computer vision. Unique to this volume in the Kluwer Handbooks of Fuzzy Sets Series is the fact that...

Object-Oriented Programming Languages: Interpretation (Undergraduate Topics in Computer Science)

Springer, 2007

This comprehensive examination of the main approaches to object-oriented language explains the key features of the languages in use today. Class-based, prototypes and Actor languages are all looked at and compared in terms of their semantic concepts. In providing such a wide-ranging comparison, this book provides a unique overview of the main...

What youÂll learn

Who this book is for

Table of Contents

What youÂll learn