Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
Practical Data Science Cookbook - Real-World Data Science Projects to Help You Get Your Hands On Your Data


Key Features

  • Learn how to tackle every step in the data science pipeline and use it to acquire, clean, analyze, and visualize data
  • Get beyond the theory with real-world projects
  • Expand your numerical programming skills through step-by-step code examples and learn more about the robust features of R and Python

Book Description

Data's value has grown exponentially in the past decade, with 'Big Data' today being one of the biggest buzzwords in business and IT, and data scientist hailed as 'the sexiest job of the 21st century'. Practical Data Science Cookbook helps you see beyond the hype and get past the theory by providing you with a hands-on exploration of data science. With a comprehensive range of recipes designed to help you learn fundamental data science tasks, you'll uncover practical steps to help you produce powerful insights into Big Data using R and Python.

Use this valuable data science book to discover tricks and techniques to get to grips with your data. Learn effective data visualization with an automobile fuel efficiency data project, analyze football statistics, learn how to create data simulations, and get to grips with stock market data to learn data modelling. Find out how to produce sharp insights into social media data by following data science tutorials that demonstrate the best ways to tackle Twitter data, and uncover recipes that will help you dive in and explore Big Data through movie recommendation databases.

Practical Data Science Cookbook is your essential companion to the real-world challenges of working with data, created to give you a deeper insight into a world of Big Data that promises to keep growing.

What you will learn

  • Follow the recipes in this essential data science cookbook to learn the fundamentals of data science and data analysis
  • Put theory into practice with a selection of real-world Big Data projects
  • Learn the data science pipeline and successfully structure your data science project
  • Find out how to clean, munge, and manipulate data
  • Learn different approaches to data modelling and how to determine the most appropriate for your data
  • Learn numerical computing with NumPy and SciPy

About the Authors

Tony Ojeda is the founder of District Data Labs, a cofounder of Data Community DC, and is actively involved in promoting data science education through both organizations.

Sean Patrick Murphy spent 15 years as a senior scientist at The Johns Hopkins University Applied Physics Laboratory, where he focused on machine learning, modeling and simulation, signal processing, and high performance computing in the Cloud. Now, he acts as an advisor and data consultant for companies in SF, NY, and DC.

Benjamin Bengfort has worked in military, industry, and academia for the past 8 years. He is currently pursuing his PhD in Computer Science at the University of Maryland, College Park, researching Metacognition and Natural Language Processing.

Abhijit Dasgupta is a data consultant working in the greater DC-Maryland-Virginia area, with several years of experience in biomedical consulting, business analytics, bioinformatics, and bioengineering consulting.

Table of Contents

  1. Preparing Your Data Science Environment
  2. Driving Visual Analysis with Automobile Data (R)
  3. Simulating American Football Data (R)
  4. Modeling Stock Market Data (R)
  5. Visually Exploring Employment Data (R)
  6. Creating Application-oriented Analyses Using Tax Data (Python)
  7. Driving Visual Analyses with Automobile Data (Python)
  8. Working with Social Graphs (Python)
  9. Recommending Movies at Scale (Python)
  10. Harvesting and Geolocating Twitter Data (Python)
  11. Optimizing Numerical Code w
(HTML tags aren't allowed.)

The Moral Advantage : How to Succeed in Business by Doing the Right Thing
The Moral Advantage : How to Succeed in Business by Doing the Right Thing
All too many people view business as a ruthless, dog-eat-dog world where only the pitiless survive. But here Bill Damon tells the compelling stories of real-life business leaders who have achieved great success by adhering to moral conviction. Based on interviews with 48 executives in a variety of industries, The Moral Advantage...
Domain-Specific Model-Driven Testing
Domain-Specific Model-Driven Testing

Software reuse and software quality are increasingly important topics in today's software engineering both for researchers and for practitioners. The design and implementation of tests is especially challenging when tests are conceptualized for different variants and versions of an application. Stefan Baerisch applies a combination of...

Artificial Intelligence for Games, Second Edition
Artificial Intelligence for Games, Second Edition

Creating robust artificial intelligence is one of the greatest challenges for game developers, yet the commercial success of a game is often dependent upon the quality of the AI. In this book, Ian Millington brings extensive professional experience to the problem of improving the quality of AI in games. He describes numerous examples from...

Global Energy Policy and Security (Lecture Notes in Energy)
Global Energy Policy and Security (Lecture Notes in Energy)

Despite efforts to increase renewables, the global energy mix is still likely to be dominated by fossil-fuels in the foreseeable future, particularly gas for electricity and oil for land, air and sea transport. The reliance on depleting conventional oil and natural gas resources  and the geographic distribution of these reserves can have...

Swift in the Cloud
Swift in the Cloud

Write and run Swift language programs in the Cloud

Written by the team of developers that has helped bring the Swift language to Cloud computing, this is the definitive guide to writing and running Swift language programs for cloud environment. In Swift in the Cloud, you'll find full coverage of all aspects...

Computer Architecture And Organization
Computer Architecture And Organization
One does not undertake the task of composing a new textbook lightly. This one has taken more than a year to produce. Furthermore, it does not make the author rich. (It pays better working in a bar!) So why bother?

Having moved to a computer science department from an applied physics environment, I was somewhat shocked at just how little
©2018 LearnIT (support@pdfchm.net) - Privacy Policy