Python Statistical Methods: Machine Learning & Data Science
- Description
- Curriculum
- FAQ
- Reviews
This course is ideal for you if you want to gain knowledge in statistical methods required for Data Science and machine learning!
Learning Statistics is an essential part of becoming a professional data scientist. Most data science learners study python for data science and ignore or postpone studying statistics. One reason for that is the lack of resources and courses that teach statistics for data science and machine learning.
Statistics is a huge field of science, but the good news for data science learners is that not all statistics are required for data science and machine learning. However, this fact makes it more difficult for learners to study statistics because they are not sure where to start and what are the most relevant topics of statistics for data science.
This course comes to close this gap.
This course is designed for both beginners with no background in statistics for data science or for those looking to extend their knowledge in the field of statistics for data science.
I have organized this course to be used as a video library for you so that you can use it in the future as a reference. Every lecture in this comprehensive course covers a single topic.
In this comprehensive course, I will guide you to learn the most common and essential methods of statistics for data analysis and data modeling.
My course is equivalent to a college-level course in statistics for data science and machine learning that usually cost thousands of dollars. Here, I give you the opportunity to learn all that information at a fraction of the cost! With 77 HD video lectures, many exercises, and two projects with solutions.
All materials presented in this course are provided in detailed downloadable notebooks for every lecture.
Most students focus on learning python codes for data science, however, this is not enough to be a proficient data scientist. You also need to understand the statistical foundation of python methods. Models and data analysis can be easily created in python, but to be able to choose the correct method or select the best model you need to understand the statistical methods that are used in these models. Here are a few of the topics that you will be learning in this comprehensive course:
· Data Types and Structures
· Exploratory Data Analysis
· Central Tendency Measures
· Dispersion Measures
· Visualizing Data Distributions
· Correlation, Scatterplots, and Heat Maps
· Data Distribution and Data Sampling
· Data Scaling and Transformation
· Data Scaling and Transformation
· Confidence Intervals
· Evaluation Metrics for Machine Learning
· Model Validation Techniques in Machine Learning
Enroll in the course and gain the essential knowledge of statistical methods for data science today!
-
1Overview of Course CurriculumVideo lesson
In this Introductory lecture, I will show an overview of the course curriculum and the contents of this course.
-
2Installing Jupyter Notebook EnvironmentVideo lesson
In this lecture, you will learn how to install anaconda on your local computer to access its integrated Jupyter notebooks.
-
3How to Download Exercises & Course NotebooksVideo lesson
In this lecture, you will learn how to download the course materials including course notebooks, exercises and projects of this course.
-
4Built-in Data Structures - Tuple and ListVideo lesson
Students will learn how to create tuples and lists in python. Additionally, the most common methods and functions for manipulating tuples and lists will be explained.
-
5Built-in Data Structures - Dictionary and SetVideo lesson
Two built-in data structures will be explained, which are dictionary and set. Students will learn how to create and how to use python functions to manipulate data structured as dictionaries or sets.
-
6Numpy ArraysVideo lesson
Students will learn about NumPy arrays. They will learn how to create and manipulate NumPy arrays.
-
7Pandas Series and DataframesVideo lesson
Students will learn how to create pandas series and pandas dataframe. In addition to the common functions and methods used by data scientists to manipulate pandas dataframes.
-
8Data Types (Numeric or Categorical)Video lesson
Students will learn the common data types in data science. They will learn numerical and categorical data types.
-
9Exercise: Create Data Structures in PythonVideo lesson
This is a hands-on exercise for students to apply the skills in this section.
-
10Mean (Average)Video lesson
Students will learn what is the mean in statistics and how it can be calculated in python.
-
11Weighted AverageVideo lesson
You will learn what is the weighted average in statistics and how it can be calculated in python.
-
12MedianVideo lesson
Students will learn about the median and how it can be calculated in python.
-
13Population vs. SampleVideo lesson
You will learn two important concepts in statistics which are population and sample. And how they are related to t data analysis.
-
14Application in Data ScienceVideo lesson
In this lecture, you will learn how to apply the concepts explained in this module in data science using real-world data.
-
15Exercise: Calculate Central Tendency MeasuresVideo lesson
This is a hands-on exercise to apply the knowledge that you learned in this section.
-
16RangeVideo lesson
You will learn what is the range and how to calculate the range of data using python.
-
17Variance and Standard DeviationVideo lesson
You will learn what are the variance and the standard deviation. You will also learn how to calculate variance and standard deviation with python.
-
18Percentile & QuartileVideo lesson
Students will learn about percentiles and quartiles in data analysis and how to calculate the percentiles in python.
-
19Outlier – part 1Video lesson
In this lecture, you will learn about outliers and how they can be detected in data.
-
20Outlier – part 2Video lesson
In this lecture, we will continue the discussion of the outliers where you will learn how to detect outliers in real-world data.
-
21Sampling ErrorVideo lesson
-
22Application in Data ScienceVideo lesson
In this lecture, we will apply what we discussed in this section to real-world data.
-
23Exercise: Calculate Variability MeasuresVideo lesson
This is an exercise for you to put your knowledge into practical skills by solving tasks in this exercise.
-
24Box PlotVideo lesson
In this lecture, you will learn how to create box plots in python. You will learn also how to read box plots in terms of data distribution.
-
25Violin PlotVideo lesson
In this lecture you will learn how to create and use violin plots to explore the distribution of data.
-
26Histogram and Density PlotVideo lesson
In this lecture, we will discuss histograms and density plots. You will learn how to create and read histograms and density plots in terms of data distribution.
-
27Bar Plot for Categorical DataVideo lesson
Students will learn how to create bar plots in python to describe categorical data.
-
28Pie Chart for Categorical DataVideo lesson
In this lecture, another visualization for categorical data will be explored which is pie charts. students will learn how to create pie charts with python.
-
29Application in Data ScienceVideo lesson
In this lecture, we will have more practical examples of using visualizations to describe data distribution.
-
30Exercise: Exploring Data DistributionVideo lesson
This lecture is an exercise for you to practice the skills explained in this section.
-
31Correlation and Covariance CoefficientsVideo lesson
In this lecture, you will learn how to create correlations and covariance between variables in data sets. Also, you will learn how to create a correlation matrix.
-
32Correlation Using Scatter plotVideo lesson
In this lecture, you will learn how to visualize the correlation between variables using scatter plots.
-
33Mapping with Scatter plotsVideo lesson
In this lecture, you will learn how to explore the relationship between more than two variables using mapping and scatter plots.
-
34Heat MapsVideo lesson
In this lecture, you will learn how to create heat maps in python to explore the relationship between three variables and to create heatmaps for correlation matrix.
-
35Application in Data ScienceVideo lesson
In this lecture, we will discuss more practical examples of exploring the relationships between variables in datasets using correlation, scatter plots, and heat maps.
-
36Exercise: Create Mapped Scatterplots and Heat MapsVideo lesson
This is an exercise for you to put the skills that you learned in this section into practice.
-
39Random Sampling and BiasVideo lesson
In this lecture, you will learn through practical examples what is random sampling in statistics, and you will learn about bias and how it can be calculated in python.
-
40Central Limit TheoremVideo lesson
Students will learn through practical examples about an important theorem in statistics and data analysis which is the central limit theorem.
-
41Normal distributionVideo lesson
Students will learn a very important probability distribution in statistical modeling which is the normal distribution.
-
42Normality Tests for Real-World DataVideo lesson
Students will learn how to perform the normality test to check if the data is normally distributed.
-
43Skewed Data: Real-life DistributionsVideo lesson
Students will learn about skewed data, and how to measure skewness in real-world data.
-
44Probability: A Practical IntroductionVideo lesson
This lecture is a practical introduction to probability in statistics.
-
45Common Probability DistributionsVideo lesson
In this lecture, you will learn the common probability distributions in statistics such as the normal distribution and the binomial distribution.
-
46Exercise: Normal Distribution and SkewnessVideo lesson
This lecture is an exercise for the students to apply the skills learned in this section.
-
47Data Scaling: StandardizationVideo lesson
You will learn how to standardize data to have the same scale, which is a pre-requisite for some machine learning models.
-
48Data Scaling: NormalizationVideo lesson
Students will learn how to normalize the data to have the same scale.
-
49Log and Square Root TransformationsVideo lesson
Students will learn how to fix skewed data to have a normal distribution using log and square root transformations.
-
50Power Transformation (PowerTransformer)Video lesson
Students will learn how to use the power transformer to transform the data to a more normal distribution and to fix skewed data.
-
51Application in Data ScienceVideo lesson
In this lecture, we will apply the skills in this section to solve real-world examples in data science and machine learning.
-
52Exercise: Data Scaling and TransformationVideo lesson
This lecture is an exercise for you to practice what you learned in this section about data scaling and data transformation.
-
53C.I for Continuous DataVideo lesson
In this lecture, you will learn how to calculate confidence intervals for continuous data in python.
-
54C.I for Classification DataVideo lesson
In this lecture, you will learn how to calculate confidence intervals for classification data in python.
-
55Bootstrapping For Unknown DistributionsVideo lesson
Students will learn the bootstrapping method in statistics and how it can be used in machine learning and python.
-
56Nonparametric Confidence Interval with BootstrappingVideo lesson
You will learn how to use bootstrapping to create non-parametric confidence intervals in python.
-
57Exercise: Create Confidence IntervalVideo lesson
This lecture is an exercise for the students to apply what they learned about confidence intervals.
-
58Bias vs. VarianceVideo lesson
Students will learn very important terms in statistical modeling which are bias and variance.
-
59Overfitting and UnderfittingVideo lesson
Students will learn what is overfitting and underfitting in statistical and data modeling.
-
60Information Criteria for Model SelectionVideo lesson
In this lecture, you will learn the common information criteria that are used in statistical modeling and machine learning for model selection.
-
61Evaluation Metrics for Regression ModelsVideo lesson
Students will learn how to calculate the information criteria for regression models in machine learning.
-
62Evaluation Metrics for Classification Models _Part OneVideo lesson
Students will learn how to calculate the information criteria for classification models in machine learning.
-
63Evaluation Metrics for Classification Models – Part TwoVideo lesson
In this lecture, you will learn how to calculate confusion matric and evaluation metrics for classification models such as accuracy, precision, recall, and f1-score metrics.
-
64Application in Data ScienceVideo lesson
In this lecture, we will apply the skills in this section to solve real-world examples in data science and machine learning.
-
65Exercise: Evaluating Machine Learning ModelsVideo lesson
This lecture is an exercise for the students to apply what they learned about confidence intervals.
-
66Hold Out Validation - Train/Test SplitVideo lesson
Students will learn how to perform the K-fold cross-validation technique to evaluate machine learning models.
-
67K-Fold Cross-ValidationVideo lesson
Students will learn how to perform the K-fold cross-validation technique to evaluate machine learning models.
-
68Leave-One-Out Cross-Validation (LOOCV)Video lesson
Students will learn how to perform the Leave-One Out validation technique to evaluate machine learning models.
-
69Application in Data ScienceVideo lesson
In this lecture, we will apply the skills in this section to solve real-world examples in data science and machine learning.
-
70Exercise: Validation Techniques in Machine LearningVideo lesson
This lecture is an exercise for the students to apply what they learned about confidence intervals.
External Links May Contain Affiliate Links read more