- Paperback: 522 pages
- Publisher: O′Reilly; 1 edition (20 January 2017)
- Language: English
- ISBN-10: 1491910399
- ISBN-13: 978-1491910399
- Product Dimensions: 15.2 x 2.7 x 22.9 cm
- Average Customer Review: 15 customer reviews
- Amazon Bestsellers Rank: #93,900 in Books (See Top 100 in Books)
Other Sellers on Amazon
+ ₹ 66.00 Delivery charge
R for data science : Import, Tidy, Transform, Visualize, And Model Data Paperback – 20 Jan 2017
Save Extra with 3 offers
- Cashback (3): Get 10% cashback up to Rs. 100 using Visa Signature or Visa Infinite cards. Shop during the Visa Shopping Days starting 20th to end of every month. Cashback within 3 days from order. Here's how
- 50% up to Rs.150 back using Amazon Pay UPI. For Android App customers only. Valid once during offer period. Cashback within 5 days. Set up Amazon Pay UPI Here's how
- Get GST invoice and save up to 28% on business purchases. Sign up for free Here's how
- No Cost EMI: No cost EMI available on Amazon Pay ICICI credit card on orders above Rs.3000 Here's how
- Partner Offers (1): Avail EMI on Debit Cards. Get credit up to ₹1,00,000. Check your eligibility here! Here's how
Frequently bought together
Customers who bought this item also bought
Visualize, Model, Transform, Tidy, and Import Data
About the Author
Garrett Grolemund is a statistician, teacher and R developer who currently works for RStudio. He sees data analysis as a largely untapped fountain of value for both industry and science. Garrett received his Ph.D at Rice University in Hadley Wickham's lab, where his research traced the origins of data analysis as a cognitive process and identified how attentional and epistemological concerns guide every data analysis.
Garrett is passionate about helping people avoid the frustration and unnecessary learning he went through while mastering data analysis. Even before he finished his dissertation, he started teaching corporate training in R and data analysis for Revolutions Analytics. He's taught at Google, eBay, Axciom and many other companies, and is currently developing a training curriculum for RStudio that will make useful know-how even more accessible.
Outside of teaching, Garrett spends time doing clinical trials research, legal research, and financial analysis. He also develops R software, he's co-authored the lubridate R package--which provides methods to parse, manipulate, and do arithmetic with date-times--and wrote the ggsubplot package, which extends the ggplot2 package.
Hadley Wickham is an Assistant Professor and the Dobelman FamilyJunior Chair in Statistics at Rice University. He is an active memberof the R community, has written and contributed to over 30 R packages, and won the John Chambers Award for Statistical Computing for his work developing tools for data reshaping and visualization. His research focuses on how to make data analysis better, faster and easier, with a particular emphasis on the use of visualization to better understand data and models.
From the Publisher
|Data Science for Business||Data Science from Scratch||Doing Data Science||R for Data Science||Data Science at the Command Line||Python Data Science Handbook|
|What you need to know about data mining and data-analytic thinking||First principles with python||Straight talk from the frontline||Visualize, model, transform, tidy, and import data||Facing the future with time-tested tools||Tools and techniques for developers|
Enter your mobile number or email address below and we'll send you a link to download the free Kindle App. Then you can start reading Kindle books on your smartphone, tablet, or computer - no Kindle device required.
To get the free app, enter mobile phone number.
Customers who viewed this item also viewed
Showing 1-8 of 15 reviews
There was a problem filtering reviews right now. Please try again later.
In my personal experience it takes more than 90% time to get the data in a shape where you think it is now ready for you to run your algorithms. This is far cry from the sterile,cleaned up data sets used in the data science courses. The real life data is very messy. This book is clearly written for the practitioners who deal with real life situations. This book helped me a lot to get my data in proper form. which includes joining two disparate data-sets based on keys, creating new derivative columns based on existing columns, remove unwanted feature set, take care of all those NAs, filter the rows on criteria, perform different aggregate functions (mean, sum, median etc) date wise or factor wise. It also covers bit of ggplot basics so you can start plotting the data from the word go. This book uses the tidyverse family packages (especially dplyr) written by the same author. ggplot is also written by Wikham. Both packages contain functions that can improve your productivity 10x. Both have now become de facto packages used by most R data scientists.
This book immensely helped me perform data analysis on my messy data. It taught me what and how to perform necessary operations on my data using very useful functions in dplyr package which is part of tidyverse family.
R is very powerful environment for data analysis. I would call it Ferrari of data science world. But although very powerful, it has its own quirks and learning curve, even for experienced programmers. The packages like dplyr, caret, ggplot make your life easier and allow you to fully harness the horse power of R.