R Programming for Data Scientists and Analysts
R Programming for Data Scientists and Analysts Course Details:
R is a functional programming environment for business analysts and data scientists. It's a language that many non-programmers can easily work with, naturally extending a skill set that is common to high-end Excel users. It's the perfect tool for when you have a statistical, numerical, or probabilities problems based on real data and you’ve pushed Excel past its limits.
This comprehensive hands-on course presents common scenarios encountered in analysis and shares practical solutions. Special attention is paid to data science theory including AI grouping theory. A discussion of using R with AI libraries like MADlib is included.
Call (919) 283-1674 to get a class scheduled online or in your area!
From Excel or SAS to R
- Common challenges with Excel/SAS
- The R Environment
- Hello, R
Working with R Studio
- Rshiny
- Rpresentations
- Rmarkdown
R Basics
- Simple Math with R
- Working with Vectors
- Functions
- Comments and Code Structure
- Using Packages
Vectors
- Vector Properties
- Creating, Combining, and Iterating
- Passing and Returning Vectors in Functions
- Logical Vectors
Reading and Writing
- Text Manipulation
- Factors
Dates
- Working with Dates
- Date Formats and formatting
- Time Manipulation and Operations
Multiple Dimensions
- Adding a second dimension
- Indices and named rows and columns in a Matrix
- Matrix calculation
- n-Dimensional Arrays
- Data Frames
- Lists
R in Data Science
- AI Grouping Theory
- K-means
- Linear Regression
- Logistic Regression
- Elastic Net
R with MADLib
Importing and Exporting static Data (CSV and Excel)
Using Libraries with CRAN
K-means with MADlib
Regression with MADlib
Other libraries
Data Visualization
- Powerful Data through Visualization: Communicating the Message
- Techniques in Data Visualization
- Data Visualization Tools
- Examples
Databases, Data lakes, and additional Topics
- Building connections to Databases and Data lakes, for both Python and R (using Hive server)
- Methods to “query” data from database and data lakes, for both Python and R
- Creating and passing macro variables.
R with Hadoop
- Overview of Hadoop
- Overview of Distributed Databases
- Overview of Pig
- Overview of Mahout
- Exploiting Hadoop clusters with R
- Hadoop, Mahout, and R
Business Rule Systems
- Rule Systems in the Enterprise
- Enterprise Service Busses
- Drools
- Using R with Drools
*Please Note: Course Outline is subject to change without notice. Exact course outline will be provided at time of registration.
Join an engaging hands-on learning environment, where you’ll learn:
- R Language and Mathematics
- How to work with R Vectors
- How to read and write data from files, and how to categorize data in factors
- How to work with Dates and perform Date math
- How to work with multiple dimensions and DataFrame essentials
- Essential Data Science and how to use R with it
- Visualization in R
- How R can be used in Spark
This course has a 50% hands-on labs to 50% lecture ratio with engaging instruction, demos, group discussions, labs, and project work.
Before attending this course, you should have:
- Experience working with Excel or SAS
- Understand SQL basics
Data Scientist, Data Analyst, Data Architect, Statistician, Data Engineer, Developer, and Database Administrators who need to leverage R for analytics.