Consulting & Analytics Club, IIT Guwahati's


A primer course on Data Science

About the Course

Data Science is all about extracting meaningful insights from large amounts of data. This online data science certification course revolves around Python, Machine Learning, Data Cleaning and Data Analysis. It will help you understand core concepts and recent advancements, including Supervised and Unsupervised Learning, and introduce you to tools and algorithms used in the industry.

With so many paid and free courses available online, getting started with Data Science can be overwhelming for students. To demystify the process, we curate a list of courses from 20+ websites and lay out a simplified path that our students can follow within 2 months.

The course is conducted every summer by the Consulting and Analytics Club of IIT Guwahati and trains more than 500 students each year. It was started to train students on the IIT Guwahati campus and has since spread to universities across India.

Duration: 6 weeks
Pre-requisite: None
Projects: 5+

What Our Students Say

"Pffff! A sigh of relief after completing the final assignment after juggling with so many things. Ultimately we were able to learn great skills. I always wanted to do something in this field and I would like to thank the CnA club for making the course."

Abdullah Jamil Ahmad, 2019

"Brilliant course, really enjoyed it. I felt the difference when I did the final assignment. Thanks, cheers."

Subodh Sondkar, 2019

"The course was structured properly, pretty good for the beginners."

Nagurtha Sudheer, 2019


  • Data Analysis with Python

    • Basics of Python
    • Numpy and Pandas
    • Data Analysis in Python
    • Python Web Scraping
  • Advanced Exploratory Analysis

    • Basic Statistics
    • Charts and Visualization
    • Outlier Analysis
    • Handling Missing Values
    • Effective Visuals
  • ML Algorithms

    • Linear Regression
    • Logistic Regression
    • Cost function & Gradient Descent
    • Handling Missing Values
    • Overfitting & Underfitting
  • Model Tuning

    • SVM and Tree Based models
    • Evaluation Metrics
    • Hyperparameter tuning
    • Handling Missing Values
    • Feature Selection & Engineering
  • Unsupervised Learning

    • Neural Networks
    • PCA
    • K-Means Clustering
  • Capstone Project

    • Topic: HR Analytics
    • Hackathon Based
    • Buffer period


Chicago Crime Detection
Diabetes Prediction
Handwritten Digit Recognition
HR Analytics



Sabhareesh M
Consulting Head


Rishabh Agrawal


Krish Tikmani
Analytics Head


Starts 10th April, 2020