# Data Scientist Statistics Cheat Sheet

This sheet is a training sheet.

This is a draft cheat sheet. It is a work in progress and is not finished yet.

### Simple statistics

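| Term | Description |
|---|---|
| Mean | Compute the mean of a list of values |
| Sum | Compute the sum of a list of values |
| Standard deviation | Compute the standard deviation of a list of values |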
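As a minimal sketch, the three operations above can be computed with Python's built-in `sum` and the standard-library `statistics` module. The `values` list is made-up sample data, not taken from this sheet.

```python
import statistics

# Hypothetical sample data, chosen only for illustration.
values = [2, 4, 4, 4, 5, 5, 7, 9]

total = sum(values)                     # sum of the list
mean = statistics.mean(values)          # arithmetic mean
pop_sd = statistics.pstdev(values)      # population standard deviation (divides by n)
sample_sd = statistics.stdev(values)    # sample standard deviation (divides by n - 1)

print(total, mean, pop_sd, sample_sd)
```

Use `pstdev` when the list is the whole population and `stdev` when it is a sample drawn from a larger population.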

### Probability

| Rule | Formula |
|---|---|
| Complement | P(A) + P(A’) = 1 |
| Intersection | P(A∩B) = P(A)P(B), valid only when A and B are independent |
| Union | P(A∪B) = P(A) + P(B) − P(A∩B) |
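The rules above can be checked by brute-force enumeration. The sketch below assumes two fair six-sided dice and two illustrative events (`event_a`: first die is even, `event_b`: second die is greater than 4); these events are hypothetical choices, picked because they are independent.

```python
from fractions import Fraction
from itertools import product

# Sample space: all 36 equally likely outcomes of two fair dice (an assumption).
outcomes = list(product(range(1, 7), repeat=2))

def prob(event):
    """Probability of an event, given as a predicate over outcomes."""
    favourable = sum(1 for o in outcomes if event(o))
    return Fraction(favourable, len(outcomes))

def event_a(o):
    return o[0] % 2 == 0   # first die is even

def event_b(o):
    return o[1] > 4        # second die shows 5 or 6

p_a = prob(event_a)
p_b = prob(event_b)
p_not_a = prob(lambda o: not event_a(o))
p_a_and_b = prob(lambda o: event_a(o) and event_b(o))
p_a_or_b = prob(lambda o: event_a(o) or event_b(o))

print(p_a + p_not_a == 1)                  # complement rule
print(p_a_and_b == p_a * p_b)              # product rule (events are independent)
print(p_a_or_b == p_a + p_b - p_a_and_b)   # union rule
```

Using `Fraction` keeps the arithmetic exact, so the equalities can be compared directly without floating-point tolerance.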

### Regression

| Type | Description |
|---|---|
| Linear | Linear regression |
| Logistic | Logistic regression |
| Logarithmic | Logarithmic regression |
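As a rough illustration of two of these regression types, the sketch below fits a straight line by ordinary least squares, then reuses the same fit on ln(x) to obtain a logarithmic model y = a + b·ln(x). The helper `fit_line` and the data are hypothetical. Logistic regression needs an iterative solver and is not shown here.

```python
import math

def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical data generated from y = 2x + 1.
xs = [1, 2, 3, 4, 5]
ys_linear = [3, 5, 7, 9, 11]
slope, intercept = fit_line(xs, ys_linear)

# Logarithmic regression: fit y = a + b * ln(x) by regressing y on ln(x).
# Hypothetical data generated from y = 1 + 2 * ln(x).
ys_log = [1 + 2 * math.log(x) for x in xs]
b, a = fit_line([math.log(x) for x in xs], ys_log)

print(slope, intercept)
print(a, b)
```

Transforming x with a logarithm turns the logarithmic model into a linear one, which is why the same least-squares helper can be reused.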

### Variability

| Measure | Definition |
|---|---|
| Percentiles | A measure that indicates the value below which a given percentage of observations in a group of observations falls. |
| Quartiles | Values that divide the data points into four more or less equal parts, or quarters. |
| Interquartile Range (IQR) | A measure of statistical dispersion and variability based on dividing a data set into quartiles. IQR = Q3 − Q1 |
| Variance | The average squared difference of the values from the mean, which measures how spread out a data set is relative to its mean. |
| Standard Deviation | The typical distance between each data point and the mean, computed as the square root of the variance. |
| Standard Error | An estimate of the standard deviation of the sampling distribution of a statistic, such as the mean. |
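The measures above can be sketched with the standard-library `statistics` module (Python 3.8 or later for `statistics.quantiles`). The `data` list is made-up sample data, and the standard error shown is the standard error of the mean, assumed here to be the sample standard deviation divided by √n.

```python
import math
import statistics

# Hypothetical sample data, chosen only for illustration.
data = [2, 4, 4, 4, 5, 5, 7, 9]

# Quartiles: three cut points that split the data into four parts.
q1, q2, q3 = statistics.quantiles(data, n=4)
iqr = q3 - q1                               # interquartile range

# Percentiles: 99 cut points; index 89 is the 90th percentile.
percentiles = statistics.quantiles(data, n=100)
p90 = percentiles[89]

variance = statistics.variance(data)        # sample variance (divides by n - 1)
std_dev = statistics.stdev(data)            # square root of the sample variance
std_err = std_dev / math.sqrt(len(data))    # standard error of the mean

print(q1, q2, q3, iqr, p90)
print(variance, std_dev, std_err)
```

Note that `statistics.quantiles` defaults to the "exclusive" interpolation method, so its cut points may differ slightly from those of other tools on small samples.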