To help you navigate the large number of new data tools available, this guide describes 60 of the most recent innovations, from NoSQL databases and MapReduce approaches to machine learning and visualization tools. Descriptions are based on first-hand experience with these tools in a production environment.
This handy glossary also includes a chapter of key terms that help define many of these tool categories:
NoSQL Databases—Document-oriented databases using a key/value interface rather than SQL
MapReduce—Tools that support distributed computing on large datasets
Storage—Technologies for storing data in a distributed way
Servers—Ways to rent computing power on remote machines
Processing—Tools for extracting valuable information from large datasets
Natural Language Processing—Methods for extracting information from human-created text
Machine Learning—Tools that automatically perform data analyses, based on results of a one-off analysis
Visualization—Applications that present meaningful data graphically
Acquisition—Techniques for cleaning up messy public data sources
Serialization—Methods to convert data structure or object state into a storable format
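As a rough illustration of the Serialization entry above, the following minimal sketch round-trips a small data structure through JSON using Python's standard `json` module. The `record` contents are hypothetical and invented for this example; the guide itself may cover different serialization formats.

```python
import json

# Hypothetical record, invented purely for illustration.
record = {"tool": "example-db", "category": "NoSQL", "tags": ["key/value", "document"]}

# Serialize: convert the in-memory structure into a storable string.
stored = json.dumps(record, sort_keys=True)

# Deserialize: rebuild an equivalent structure from the stored form.
restored = json.loads(stored)
```

The stored form is an ordinary string, so it can be written to a file or sent over a network and later turned back into an equal structure.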
Packet Forwarding Technologies
As Internet traffic continues to grow exponentially, there is a great need to build Internet protocol (IP) routers with high-speed and high-capacity packet networking capabilities. The first book to explore this subject, Packet Forwarding Technologies explains in depth packet forwarding concepts and implementation technologies. It...
Windows Communication Foundation Unleashed (WCF)
Windows Communication Foundation (WCF) is a new Microsoft technology for allowing software to communicate. Superseding earlier technologies, such as COM/DCOM, .NET Remoting, ASP.NET Web Services, and the Web Services Enhancements for .NET, WCF provides a single solution that is designed to always be the best way to exchange data among software...
Visual Basic for Applications Unleashed
Other Office products were slowly brought into the VBA fold, but it’s only with the release of Office 97 that Microsoft has finally achieved its goal. Now all of the Big Four—Word, Excel, Access, and PowerPoint—have a VBA development environment at their core. Not only that, but just about ...
The Theory of Computation
"This is the best text on complexity theory I have seen, and could easily become the standard text on the subject... This is the first modern text on the theory of computing." - William Ward Jr, Ph.D., University of South Alabama
Taking a practical approach, this modern introduction to the theory of computation focuses on the study of...
SketchUp 7.1 for Architectural Visualization: Beginner's Guide
Google SketchUp, the most popular architectural software package, is used by millions of architects and visualizers throughout the world. But what you may not know is that it's also the most powerful 3D design software on the market. With this book in hand and patented technology such as the PhotoMatch, Push-Pull, and Face-me components...
LINQ: The Future of Data Access in C# 3.0
Language Integrated Query (LINQ) is Microsoft's new technology for powerful, general purpose data access. This technology provides a fully-integrated query language, available in both C# 3.0 and VB 9.0, for high-level data access against objects, relational databases, and XML documents. In this Short Cut you'll learn about LINQ and the proposed C#...