This is a draft cheat sheet. It is a work in progress and is not finished yet.
Basic Terms
parameter |
fixed value describing population; usually unknown |
statistic |
value calculated from sample; used to estimate parameter |
descriptive stats |
- collecting, summarizing, describing data - graphical/numerical |
inferential stats |
- drawing conclusions/making predictions about pop based on sample |
** |
data types
name |
type |
data |
discrete |
num |
whole number |
continuous |
num |
decimals |
nominal |
cat |
no order |
ordinal |
cat |
has order |
numerical summary
percentile |
quartile |
standard deviation |
IQR |
outliers |
|
symmetric |
skewed |
measure of center |
mean |
median |
measure of spread |
SD |
IQR |
conditional probability
cont prob distribution properties
sampling distribution - sample mean
standard error and bias of X̅
confidence level & z-score
Confidence Interval - 3 cases
1. pop not normal; σ KNOWN ⇒ central limit theorem
the approx confidence interval for pop mean μ is
x̅±z*(σ/√n)
z*=zα/2 is upper critical value
2. pop normal; σ UNknown ⇒ t-distribution
T≡ (X̅-μ)/(S/√n)
S²=1/n-1∑(Xi-X̅)²
S=√S² |
estimator and MOE from CI
sampling dist. - sample proportion
|
|
hyp test for one population proportion
hyp test - one population mean μ
normal pop, known σ |
one sample z-test |
normal pop, UNknown σ |
one sample t-test |
decision errors
type 1 |
-reject a true Ho -false positive |
type 2 |
-fail to reject false Ho -false negative |
relationship |
α |
prob of type 1 error (same as sig level) |
β |
prob of type 2 |
hypothesis test steps
1. check validity of assumptions
a. randomness
b. sample size
c. population distribution
2. set up hypotheses
- identify parameter of interest
3. test statistic and its distribution
4. compute p-value
- confirm level of sig given in advance
5. conclusion interpretation |
|
|
hypothesis test
or significance testing |
test an assumption regarding pop. parameter |
method used depends on kind of data and reason |
asses plausibility of hypothesis using sample data |
hypothesis testing terms
hypothesis |
a claim or statement about a characteristic of a population of interest |
null hypothesis |
statement about the value of a population parameter, such as the population mean (µ) or the population proportion (p) |
alt hypothesis |
claim to be tested, the opposite of the null hypothesis |
test statistic |
value computed from the sample data that is used in making a decision about the rejection of the null hypothesis; converts the sample mean (x̄) or sample proportion (p̂) to a Z- or t-score under the assumption that the null hypothesis is true; |
p-value |
area under the curve to the left or right of test statistic; compared to level of significance (α) |
critical value |
significance level |
statistical significance |
practical significance |
effect size |
degree of a relationship between two given variables |
standardized effect size |
one sided |
two sided |
tests whether the population parameter is equal to, versus not equal to, some specific value |
|