Join here:
This course serves as an introduction to the field of machine learning with a focus on implementation using Python programming language. Machine learning is a branch of artificial intelligence that enables systems to learn from data and make predictions or decisions without being explicitly programmed. Python has emerged as one of the most popular programming languages for machine learning due to its simplicity, versatility, and a rich ecosystem of libraries such as scikit-learn, Mlxtend, Pandas, Seaborn, SciPy etc.
Throughout this course, students will explore fundamental machine learning concepts, algorithms, and techniques, and gain hands-on experience in implementing them using Python. The course will cover topics including:
1. Introduction to Machine Learning
2. Data Cleaning using Python
· Creating a Data Frame
· Describing the Data
· Navigating Data frames
· Selecting Row Based Conditionals
· Replacing Values
· Renaming Columns
· Finding The Minimum, Maximum. Sum, Average, and Count
· Finding Unique Values
· Handling Missing Values
· Deleting a Column
· Deleting a Row
· Dropping Duplicate rows
· Group Rows by Values and Time
· Looping over a Column
· Applying a Function Over All Elements in a Column
· Applying a Function to Groups
· Concatenating Data Frames
· Merging Data Frames
Handling Numerical Data
· Rescaling a Feature
· Standardizing a Feature
· Transforming Features
· Detecting Outliers
· Handling Outliers
· Deleting Observations with Missing Values
Handling Categorical Data
· Encoding Ordinal Categorical Features
· Encoding Dictionaries of Features
3. Plotting and exploring Numerical Data and Categorical Data
· Box Plot
· Histogram
· Scatterplot
· Cross Tabulations
4. Training and modelling the data
· Splitting a dataset into training and validation sets
· K-fold cross-validation
· Bootstrap Sampling
5. Dimensionality Reduction using Feature Extraction
· Reducing Features using PCA
· Reducing Features using LDA
· Reducing Features using NMF
6. Supervised Algorithms for Classification
· KNN
· Decision Tree
· Random forest
· Support Vector Machine
· Naive Bayes
· Logistic Regression
7. Improving Performance of the Model with Ensembling Methods
· Ada Boost
· XG Boost
8. Evaluating Performance of the Model for Classification
· Confusion Matrix
· Kappa Score
· F – measure
· Accuracy
· Precision
· Recall
· ROC Curve
9. Regression
· Linear Regression
· Logistic Regression
· Evaluation with R2 score
10. Unsupervised Algorithms
Clustering
· K-means
· K-Medoids
· Hierarchical
No comments:
Post a Comment