STATISTICS FOR DATA SCIENCE

Types of Statistics:

  1. Descriptive statistics - Understand the sample
  2. Inferential statistics - Understand the population

Descriptive Statistics:

1.Measure of central tendency:

Comparison of Mean , Median ,Mode

2. Measures of Spread/Dispersion:

  • 25% of the data points lie below Q1 and 75% lie above it.
  • 50% of the data points lie below Q2 and 50% lie above it. Q2 is nothing but Median.
  • 75% of the data points lie below Q3 and 25% lie above it.
  • Mesokurtic — This is the case when the kurtosis is zero, similar to the normal distributions.
  • Leptokurtic — This is when the tail of the distribution is heavy (outlier present) and kurtosis is higher than that of the normal distribution.
  • Platykurtic — This is when the tail of the distribution is light( no outlier) and kurtosis is lesser than that of the normal distribution.
COVARIANCE -TOGETHER SPREAD OF X AND Y
CORRELATION: COV(x)/SIGMA (x) IS ZSCALED FORMULA SO IT IS DOING SCALING

INFERENTIAL STATISTICS:

  1. Starts with data
  2. Arriving the insights y=f(x) and finding hypothesis.

--

--

--

Data scientist Aspirant passionate in learning new technologies and sharing my thoughts to others .

Love podcasts or audiobooks? Learn on the go with our new app.

Recommended from Medium

Log errors in SQL Server that would otherwise go unnoticed (+ examples)

GSOC 2020 mzTab-M format support and cliqueMS algorithm implementation for MZmine

D4S Sunday Briefing #76

Machine Learning Model as a Serverless App using Google App Engine

Basics of a SQL Query

Exploration Formulas in Ms. Excel (SUM, AVERAGE, IF, COUNT, MAX, MIN, SUMIF, COUNTIF, and RANK)

How One Article Has Paid My Rent For Nearly A Year

Cybercrime & Confusion Matrix

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Vignesh S

Vignesh S

Data scientist Aspirant passionate in learning new technologies and sharing my thoughts to others .

More from Medium

Python vs R in Data Science

Statistics For DataScience PART-1

Data Science: Skills Required in becoming a Data Scientist

Exploratory Data Analysis