It is my pleasure and privilege to write the foreword for this book, whose results I
have been following and awaiting for the last few years. This monograph represents
the outcome of an ambitious project oriented towards advancing our knowledge of
the way the human visual system processes images and of the way it combines high-level hypotheses with low-level inputs during pattern recognition. The model proposed by Sven Behnke, carefully presented in the following pages, can now be applied by other researchers to practical problems in the field of computer vision, and it also provides clues for reaching a deeper understanding of the human visual system.
This book arose out of dissatisfaction with an earlier project: back in 1996, Sven
wrote one of the handwritten digit recognizers for the mail sorting machines of
the Deutsche Post AG. The project was successful: the machines could indeed recognize the handwritten ZIP codes at a rate of several thousand letters per
hour. However, Sven was not satisfied with the amount of expert knowledge that
was needed to develop the feature extraction and classification algorithms. He wondered
whether the computer could extract meaningful features by itself, and use
these for classification. His experience in the project told him that forward computation
alone would be incapable of improving the results already obtained. From his
knowledge of the human visual system, he postulated that only a two-way system
could work, one that could advance a hypothesis by focusing the attention of the
lower layers of a neural network on it. He spent the next few years developing a new
model for tackling precisely this problem.
The main result of this book is the proposal of a generic architecture for pattern
recognition problems, called the Neural Abstraction Pyramid (NAP). The architecture
is layered, pyramidal, competitive, and recurrent. It is layered because images are
represented at multiple levels of abstraction. It is recurrent because backward projections
connect the upper to the lower layers. It is pyramidal because the resolution
of the representations is reduced from one layer to the next. It is competitive because
units in each layer compete against each other, each trying to classify the input best. The main idea behind this architecture is to let the lower layers interact with
the higher layers. The lower layers send simple features to the upper layers; the upper layers recognize more complex features and bias the computation in the lower layers. This in turn improves the input to the upper layers, which can refine their hypotheses, and so on. After a few iterations the network settles on the best interpretation. The architecture can be trained in both supervised and unsupervised modes.
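To give the reader a flavor of this iterative refinement, the following is a minimal sketch in Python. It is not Behnke's implementation: the class name, the layer sizes, the pooling used as forward projection, the random backward weights, and the update rule are all illustrative assumptions, and the within-layer competition is omitted for brevity.

import numpy as np

def relu(x):
    return np.maximum(0.0, x)

class TinyAbstractionPyramid:
    """Toy two-layer recurrent pyramid (illustrative only).

    Layer 0 holds a fine-grained feature map, layer 1 a coarser,
    more abstract one. Each iteration combines a bottom-up pass
    (fine -> coarse) with a top-down pass (coarse -> fine), so that
    higher-level hypotheses can bias the lower-level computation.
    """

    def __init__(self, size=8, rng=None):
        rng = rng or np.random.default_rng(0)
        self.size = size
        # Backward projection weights (coarse -> fine); random here,
        # learned in a real system.
        self.w_down = rng.normal(0.0, 0.1, (size, size // 2))

    def step(self, image, fine, coarse):
        # Bottom-up: pool 2x2 blocks of the fine layer to form the
        # coarser, more abstract representation.
        pooled = fine.reshape(self.size // 2, 2,
                              self.size // 2, 2).mean(axis=(1, 3))
        coarse = relu(pooled)
        # Top-down: expand the coarse hypothesis back to fine resolution
        # and let it bias the interpretation of the raw input.
        feedback = relu(self.w_down @ coarse @ self.w_down.T)
        fine = relu(image + 0.5 * feedback)
        return fine, coarse

    def interpret(self, image, iterations=5):
        # Start from empty representations and iterate until the
        # interpretation settles.
        fine = np.zeros_like(image)
        coarse = np.zeros((self.size // 2, self.size // 2))
        for _ in range(iterations):
            fine, coarse = self.step(image, fine, coarse)
        return fine, coarse

# Usage: iteratively interpret a noisy 8x8 "image".
pyramid = TinyAbstractionPyramid(size=8)
image = np.random.default_rng(1).random((8, 8))
fine, coarse = pyramid.interpret(image, iterations=5)
print(coarse.round(2))

Even in this toy form, the loop exhibits the key property described above: the coarse hypothesis feeds back to bias the fine layer, which in turn sharpens the coarse hypothesis on the next pass.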