Best Books For Machine Learning in R

R is a powerful platform for data analysis and machine learning. It is my main workhorse for things like competitions and consulting work. The reason is the large number of powerful algorithms available, all on the one platform.

In this post I want to point out some resources you can use to get started in R for machine learning.

Introduction to R

You might want to familiarize yourself with the platform and language before you start addressing your problems with machine learning.

I think the best way to familiarize yourself is to start addressing problems. The trial of real work will force you to learn what you must learn to solve your problem. A good reference can help you answer your “how do I…” questions.

R in a Nutshell

The book I read cover-to-cover when first starting out with R was R in a Nutshell. It walks you through the platform from installation, basic operations, data analysis and even some machine learning algorithms. I highly recommend it.

I chose it because it was a broad reference. I wanted to know a little bit of everything in the platform, so I would know where to look when I had a specific question.

Machine Learning

There is a wealth of machine learning algorithms implemented in R, many by the academics and their teams that actually developed them in the first place. This alone is a compelling reason to get started in R. Additionally, the data handling/manipulation and graphing tools are very powerful (although Python’s SciPy stack is catching up).

CRAN: Machine Learning and Statistical Learning

Not a book, but a great place you can start out is the Machine Learning and Statistical Learning view on CRAN maintained by Torsten Hothorn. It lists most of the R packages you can use for machine learning, grouped by algorithm and algorithm types.

It is a great place to start, but I think it could do a better job of pointing out canonical packages and elaborating on some of the wrapper packages available, like caret.
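To make the idea of a wrapper package concrete, here is a minimal sketch of fitting two different algorithms through caret's single train() interface. It assumes the caret package is installed; the iris dataset, the two method names ("knn" and "rpart") and the 5-fold cross-validation setting are my choices for illustration, not something prescribed by the task view.

```r
# Minimal sketch: two algorithms, one interface.
# Assumes caret is installed, e.g. via install.packages("caret").
library(caret)

set.seed(7)  # make the resampling reproducible

# The same resampling settings (5-fold cross-validation) for both models
control <- trainControl(method = "cv", number = 5)

# Same call shape; only the `method` string changes the algorithm
fit_knn  <- train(Species ~ ., data = iris, method = "knn",   trControl = control)
fit_tree <- train(Species ~ ., data = iris, method = "rpart", trControl = control)

# Cross-validated accuracy of the best tuning setting for each model
acc_knn  <- max(fit_knn$results$Accuracy)
acc_tree <- max(fit_tree$results$Accuracy)
print(c(knn = acc_knn, rpart = acc_tree))
```

The point is not these particular models but that swapping algorithms only means changing one argument, which is why a wrapper like caret is handy when you are working through a long list of packages.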

Applied Predictive Modeling

Max Kuhn, one of the authors of this book, is the creator of the famous caret package. Applied Predictive Modeling is very practical and opens in the first part with a description of the predictive analytics process and case studies. Parts 2 and 3 look at regression and classification algorithms, and the final part covers more advanced topics like feature selection.

It’s a thick book and an excellent reference, and I’m a fan. Also check out the companion website for relevant resources.

An Introduction to Statistical Learning: with Applications in R

An Introduction to Statistical Learning opens with an introduction to statistical learning and concerns such as model accuracy and the bias-variance tradeoff. Chapters 3 and 4 look at linear regression and some simpler classification algorithms. The following chapters look at cross-validation and model selection before moving into non-linear regression, decision trees and SVM, finishing up with unsupervised methods.

The book is also available online for free from the authors’ webpage.

Practical Data Science with R

Practical Data Science with R has more of a data science spin than a machine learning one. Part 1 is introductory, looking at loading data into R. Part 2 starts off with model evaluation and works through models of increasing complexity: k-NN, Naive Bayes, Linear Regression, clustering, association rules and SVM. Part 3 works through advanced issues like self-documenting scripts and presenting results.

It provides a good introduction with solid practical advice.

Machine Learning with R

Machine Learning with R provides an overview of machine learning in R without going into detail or theory. It also makes heavy use of case studies to demonstrate each algorithm. It opens with a brief introduction to machine learning and R, and to data management in R. Subsequent chapters cover k-NN, Naive Bayes, Decision Trees, Regression, Neural Networks, Apriori, and Clustering.

It finishes up with chapters on model evaluation, algorithm tuning and other advanced topics. A good feature of this text is the step-by-step sequence provided in each chapter, which gives an actionable framework around the case studies.

Data Mining with R: Learning with Case Studies

After a quick introduction to R in the first chapter, Data Mining with R presents case study after case study. These include predicting algae blooms, predicting stock market returns, detecting fraudulent transactions and classifying microarray samples. Each study explores a variety of data preparation, model building and model evaluation methods.

It’s a dense but valuable book if you’re looking to get a feel for working through real problems.

Data Mining and Business Analytics with R

Data Mining and Business Analytics with R provides worked examples using R, but the examples are more business focused than scientifically focused, as in some other books. The chapters work through the key machine learning methods using R with smaller case studies throughout. The book finishes with some larger case studies on sentiment analysis in text and modeling network data.

Data Mining with Rattle and R: The Art of Excavating Data for Knowledge Discovery (Use R!)

Data Mining with Rattle and R provides an introduction to machine learning algorithms, although the twist is that it uses the Rattle graphical environment. After the introductory material on loading and handling data in Part 1, the standard machine learning algorithms are covered in Part 2.

What I do like about the presentation of the algorithms is the standardized description that includes a tutorial, parameter tuning and a command summary. I’m a big fan of the consistent, structured presentation of algorithms.

Summary

We have covered 7 popular machine learning books that focus on using the R platform.

The best advice I can give is to pick one and read it. Read it cover to cover, take notes and do the exercises. Like programming, using R is a practical skill that you can only build by practicing. Practice machine learning in R.

Have I missed a Machine Learning book on R? Leave a comment and let me know.



