Syllabus of Data Science
Data Science with Python
Module 1: Introduction to Data Science
- What is Data Science?
- What is Machine Learning?
- What is Deep Learning?
- What is AI?
- Data Analytics & its types
Module 2: Introduction to Python
- What is Python?
- Why Python?
- Installing Python
- Python IDEs
Hands-on-Exercise:
- Installing Python idle for windows,Linux and
- Creating “Hello World” code
Module 3: Python Basics
- Introduction to Python
- Basic Concepts
- Conditional Statements
- Control Statements
- List, Tuples, Dictionary, Set
- String Functions
- Mathematical Functions
- User defined Functions
- Class and Objects
- Constructor
- Inheritance
- Package
- Exception handling
- Lambda Functions
- File Concepts
- Date and Regex Function
- Tkinter Entry
- Tkinter Checkbox
- Tkinter Dropdown list
- Tkinter Radio Button
- Tkinter Label
- Tkinter Button
- Tkinter Menu
- Tkinter Frame
- Tkinter MessageBox
- Tkinter Scroll bar
- MySql and Sqlite 3 Connection
Hands-on-Exercise-Constructing Operators
- Practice and Quickly learn Python necessary skills by solving simple questions and problems.
- how Python uses indentation to structure a program, and how to avoid some common indentation errors.
- You executed to make simple numerical lists, as well as a few operations you can perform on numerical lists, tuple, dictionary and set
Module 4: Python Packages, Database, Operating System & Tools
- Pandas
- Numpy
- Sci-kit Learn
- Mat-plot library
- Seaborn
- Pillow
- OpenCV
Hands-on-Exercise:
- Installing IDLE for windows, Linux and Mac
- Installing numpy, pandas and matplotlib
Database
- MySQL
- MongoDB
Framework
- Django
Operating System
- Linux
Tools
- Power BI
Module 5: Importing Data
- Reading CSV files
- Saving in Python data
- Loading Python data objects
- Writing data to CSV file
Hands-on-Exercise:
- To generate data sets and create visualizations of that data. You learned to create simple plots with matplotlib, and you saw how to use a scatter plot to explore random
- You learned to create a histogram with Pygal and how to use a histogram to explore the results of rolling dice of different
- Generating your own data sets with code is an interesting and powerful way to model and explore a wide variety of real-world
- As you continue to work through the data visualization projects that follow, keep an eye out for situations you might be able to model with
Module 6: Manipulating Data
- Selecting rows/observations
- Rounding Number
- Selecting columns/fields
- Merging data
- Data aggregation
- Data munging techniques
Hands-on-Exercise:
- As you gain experience with CSV and JSON files, you’ll be able to process almost any data you want to analyze.
- Most online data sets can be downloaded in either or both of these From working with these formats, you’ll be able to learn other data formats as well.
Module 7: Statistics Basics
- Central Tendency
- Mean
- Median
- Mode
- Skewness
- Normal Distribution
- Probability Basics
- What does it mean by probability?
- Types of Probability
- ODDS Ratio?
- Standard Deviation
- Data deviation & distribution
- Variance
- Bias variance Tradeoff
- Underfitting
- Overfitting
- Distance metrics
- Euclidean Distance
- Manhattan Distance
- Outlier analysis
- What is an Outlier?
- Inter Quartile Range
- Box & whisker plot
- Upper Whisker
- Lower Whisker
- Scatter plot
- Cook’s Distance
- Missing Value treatment
- What is NA?
- Central Imputation
- KNN imputation
- Dummification
- Correlation
- Pearson correlation
- positive & Negative correlation
Hands-on-Exercise:
- Compute probability in a situation where there are equally-likely outcomes
- Apply concepts to cards and dice
- Compute the probability of two independent events both occurring
- Compute the probability of either of two independent events occurring
- Do problems that involve conditional probabilities
- Calculate the probability of two independent events occurring
- List all permutations and combinations
- Apply formulas for permutations and combinations
Module 8: Error Metrics
- Classification
- Confusion Matrix
- Precision
- Recall
- Specificity
- F1 Score
- Regression
- MSE
- RMSE
- MAPE
Hands-on-Exercise:
- State why the z’ transformation is necessary
- Compute the standard error of z
- Compute a confidence interval on ρ The computation of a confidence interval
- Estimate the population proportion from sample proportions
- Apply the correction for continuity
Machine Learning
Supervised Learning
- Linear Regression
- Linear Equation
- Slope
- Intercept
- R square value
- Logistic regression
- ODDS ratio
- Probability of success
- Probability of failure Bias Variance Tradeoff
- ROC curve
- Bias Variance Tradeoff
Hands-on-Exercise:
- we’ve reviewed the main ways to approach the problem of modeling data using simple and definite
Unsupervised Learning
- K-Means
- K-Means ++
- Hierarchical Clustering
SVM
- Support Vectors
- Hyperplanes
- 2-D Case
- Linear Hyperplane
SVM Kernal
- Linear
- Radial
- polynomial
Other Machine Learning algorithms
- K – Nearest Neighbour
- Naïve Bayes Classifier
- Decision Tree – CART
- Decision Tree – C50
- Random Forest
Hands-on-Exercise:
- We have covered the simplest but still very practical machine learning models in an eminently practical way to get us started on the complexity
- where we will cover several regression techniques, it will be time to go and solve a new type of problem that we have not worked on, even if it’s possible to solve the problem with clustering methods (regression), using new mathematical tools for approximating unknown values.
- In it, we will model past data using mathematical functions, and try to model new output based on those modeling
Artificial Intelligence
Module 1: AI Introduction
- Perceptron
- Multi-Layer perceptron
- Markov Decision Process
- Logical Agent & First Order Logic
- AL Applications
Deep Learning
Module 1: Deep Learning Algorithms
- CNN – Convolutional Neural Network
- RNN – Recurrent Neural Network
- ANN – Artificial Neural Network
Hands-on-Exercise:
- We took a very important step towards solving complex problems together by means of implementing our first neural
- Now, the following architectures will have familiar elements, and we will be able to extrapolate the knowledge acquired on this chapter, to novel
Introduction to NLP
- Text Pre-processing
- Noise Removal
- Lexicon Normalization
- Lemmatization
- Stemming
- Object Standardization
Text to Features (Feature Engineering)
- Syntactical Parsing
- Dependency Grammar
- Part of Speech Tagging
- Entity Parsing
- Named Entity Recognition
- Topic Modelling
- N-Grams
- TF – IDF
- Frequency / Density Features
- Word Embedding’s
Tasks of NLP
- Text Classification
- Text Matching
- Levenshtein Distance
- Phonetic Matching
- Flexible String Matching
Hands-on-Exercise:
- provided, you will even be able to create new customized
- As our models won’t be enough to solve very complex problems, in the following chapter, our scope will expand even more, adding the important dimension of time to the set of elements included in our generalization.
Project Works
Project 1: Board Game Review Prediction
- To perform a Linear regression
- Analysis by predicting the average reviews in a board game
Project 2 :Credit Card Fraud Detection
- TO focus on Anomaly Detection by using probability densities to detect credit card fraud
Project 3: Stock Market Clustering
- Learn how to use the K-means clustering
- To find related companies by finding correlations among stock market movements over a given time span
Project 4: Getting Started with Natural Language Processing
- will focus on Natural Language Processing (NLP) methodology, such as tokenizing words
- and sentences, part of speech identification and tagging, and phrase
Project 5: Obtaining Near State-of-the-Art Performance on Object Recognition
- Using Deep Learning – In this project, will use the CIFAR-10 object recognition dataset as a
- benchmark to implement a recently published deep neural
Project 6: Image Super Resolution with the SRCNN – Learn how to implement & use
- Tensorflow version of the Super Resolution Convolutional Neural Network (SRCNN) for
- improving image
Project 7: Natural Language Processing: Text Classification
- an advanced approach to Natural Language Processing by solving a text classification task
- using multiple classification
Project 8: K-Means Clustering For Image Analysis
- use K-Means clustering in an unsupervised learning method to analyze and classify 28 x 28 pixel images from the MNIST
Project 9:Data Compression & Visualization Using Principal Component Analysis
- This project will show you how to compress our Iris dataset into a 2D feature set and how to visualize it through a normal x-y plot using k-means clustering
Tableau
Module 1: Tableau Course Material
- Start Page
- Show Me
- Connecting to Excel Files
- Connecting to Text Files
- Connect to Microsoft SQL Server
- Connecting to Microsoft Analysis Services
- Creating and Removing Hierarchies
- Bins
- Joining Tables
- Data Blending
Module 2: Learn Tableau Basic Reports
- arameters
- Grouping Example 1
- Grouping Example 2
- Edit Groups
- Set
- Combined Sets
- Creating a First Report
- Data Labels
- Create Folders
- Sorting Data
- Add Totals, Subtotals and Grand Totals to Report
Hands-on-Exercise:
- Install Tableau Desktop
- Connect Tableau to various Datasets: Excel and CSV files
Module 3: Learn Tableau Charts
- Area Chart
- Bar Chart
- Box Plot
- Bubble Chart
- Bump Chart
- Bullet Graph
- Circle Views
- Dual Combination Chart
- Dual Lines Chart
- Funnel Chart
- Traditional Funnel Charts
- Gantt Chart
- Grouped Bar or Side by Side Bars Chart
- Heatmap
- Highlight Table
- Histogram
- Cumulative Histogram
- Line Chart
- Lollipop Chart
- Pareto Chart
- Pie Chart
- Scatter Plot
- Stacked Bar Chart
- Text Label
- Tree Map
- Word Cloud
- Waterfall Chart
Hands-on-Exercise:
- Create and use Static Sets
- Create and use Dynamic Sets
- Combine Sets into more Sets
- Use Sets as filters
- Create Sets via Formulas
- Control Sets with Parameters
- Control Reference Lines with Parameters
Module 4: Learn Tableau Advanced Reports
- Dual Axis Reports
- Blended Axis
- Individual Axis
- Add Reference Lines
- Reference Bands
- Reference Distributions
- Basic Maps
- Symbol Map
- Use Google Maps
- Mapbox Maps as a Background Map
- WMS Server Map as a Background Map
Hands-on-Exercise:
- Create Barcharts
- Create Area Charts
- Create Maps
- Create Interactive Dashboards
- Create Storylines
- Understand Types of Joins and how they work
- Work with Data Blending in Tableau
- Create Table Calculations
- Work with Parameters
- Create Dual Axis Charts
- Create Calculated Fields
Module 5: Learn Tableau Calculations & Filters
- Calculated Fields
- Basic Approach to Calculate Rank
- Advanced Approach to Calculate Rank
- Calculating Running Total
- Filters Introduction
- Quick Filters
- Filters on Dimensions
- Conditional Filters
- Top and Bottom Filters
- Filters on Measures
- Context Filters
- Slicing Fliters
- Data Source Filters
- Extract Filters
Hands-on-Exercise:
- Creating Data Extracts in Tableau
- Understand Aggregation, Granularity, and Level of Detail
- Adding Filters and Quick Filters
Module 6: Learn Tableau Dashboards
- Create a Dashboard
- Format Dashboard Layout
- Create a Device Preview of a Dashboard
- Create Filters on Dashboard
- Dashboard Objects
- Create a Story
Module 7: Server
- Tableau online.
- Overview of Tableau
- Publishing Tableau objects and scheduling/subscription.
Hands-on-Exercise:
- Create Data Hierarchies
- Adding Actions to Dashboards (filters & highlighting)
- Assigning Geographical Roles to Data Elements
- Advanced Data Preparation