Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
Instant Nokogiri

Instant Nokogiri, 9781783289974 (178328997X), Packt Publishing, 2013

Learning data scraping and parsing in Ruby using the Nokogiri gem


  • Learn something new in an Instant! A short, fast, focused guide delivering immediate results
  • Master Nokogiri with the use of clear, step-by-step instructions and real world examples
  • Learn how to identify sources, parse documents, and extract information from them
  • Use the interactive Ruby shell and the features of Nokogiri to test and refine your theories in real-time

In Detail

A wealth of information sits waiting on the Internet. Instant Nokogiri helps you access this information today with Nokogiri, a slick and fast HTML and XML parsing engine. Bundled in an easy-to-use Ruby gem, Nokogiri empowers you to combine disparate data sources and gain an unprecedented insight into your Ruby applications.

"Instant Nokogiri" is a hands-on guide to extracting information from the sources available on the Internet, sources that are not traditionally accessible to developers. You will learn the secrets of identifying content, extracting just the right parts, and incorporating the new data in your Ruby applications.

"Instant Nokogiri" provides step-by-step instructions on how to incorporate the power of the Nokogiri gem and data parsing into your Ruby projects. You will learn all the basics of designing a project around data parsing, exploring disparate data sources, and refining strategies and theories. You will also combine your thoughts in a real-world, real-data sample application. This book will examine common Nokogiri and Ruby methods useful in scraping and parsing complete with practical code samples. You will also learn the secrets behind effective caching, rate limiting, and masking your identity. Instant Nokogiri will teach you how to get targeted data out of HTML and into Ruby, as well as tons of tips, tricks, code snippets, and expert advice.

What you will learn from this book

  • Set up a development environment for Nokogiri
  • Know when to use a parsing engine
  • Identify ideal sources from which to extract content and devise optimal strategies for selecting content
  • Use CSS and XPath selectors to target content
  • Test your theories in an interactive Ruby shell
  • Work with live web data
  • Avoid detection and be a good netizen
  • Incorporate your finished snippets in a full Sinatra application


Get to grips with a new technology, understand what it is and what it can do for you, and then get to work with the most important features and tasks. A concise, illustrated guide to extracting information available on the Internet using Nokogiri.

Who this book is written for

"Instant Nokogiri" is the perfect choice for the aspiring Ruby developer looking to incorporate screen scraping and parsing technology in their applications. Beginner level Ruby, basic HTML, and CSS experience is suggested.

(HTML tags aren't allowed.)

Beginning PL/SQL: From Novice to Professional
Beginning PL/SQL: From Novice to Professional
This is a book about writing stored procedures for an Oracle database. A stored procedure in this context is a generic term for a program written in the Procedure Language extension for SQL (PL/SQL) that is stored inside the database when it is compiled. This means that you can then execute the program from inside the database. Why would you want...
CompTIA Network+ Certification Study Guide
CompTIA Network+ Certification Study Guide

CompTIA's Network+ certification is a globally-recognized, vendor neutral exam that has helped over 235,000 IT professionals reach further and higher in their careers. The 2009 Network+ exam (N10-004) is a major update with more focus on security and wireless aspects of networking. Our new study guide has been updated accordingly with...

Concepts of Database Management
Concepts of Database Management
CONCEPTS OF DATABASE MANAGEMENT fits perfectly into any introductory database course for information systems, business or CIS programs. This concise text teaches SQL in a database-neutral environment with all major topics being covered, including E-R diagrams, normalization, and database design. Now in its seventh edition, CONCEPTS OF DATABASE...

Applied Statistical Genetics with R: For Population-based Association Studies (Use R!)
Applied Statistical Genetics with R: For Population-based Association Studies (Use R!)

Statistical genetics has become a core course in many graduate programs in public health and medicine. This book presents fundamental concepts and principles in this emerging field at a level that is accessible to students and researchers with a first course in biostatistics. Extensive examples are provided using publicly available data and...

Drawing Graphs: Methods and Models (Lecture Notes in Computer Science)
Drawing Graphs: Methods and Models (Lecture Notes in Computer Science)
Graph drawing comprises all aspects of visualizing structural relations between objects. The range of topics dealt with extends from graph theory, graph algorithms, geometry, and topology to visual languages, visual perception, and information visualization, and to computer-human interaction and graphics design. This monograph gives a systematic...
Visual Basic 2005 Express: Now Playing
Visual Basic 2005 Express: Now Playing
Dot Net Expertise, May 7, 2006
"It is very well illustrated; a big plus for a beginner's book."

DesktopEngineer.com, April 14, 2006
"Good layout which takes the learner through practical examples
©2019 LearnIT (support@pdfchm.net) - Privacy Policy