Show Menu
Cheatography

STAT 100 Chapter 3 and 4 Cheat Sheet (DRAFT) by

This is a draft cheat sheet. It is a work in progress and is not finished yet.

Ch 4 Sampling errors

sampling errors: caused by the act of taking a sample. They cause sample results to be different from the results of a census
Random Sampling error: the deviation between the sample statistic and the population parameter caused by change in selecting a random sample.
 
The margin of error in a confidence statement includes only random sampling error.
Non-sa­mpling Error: errors not related to the act of selecting a sample from the popula­tion. They can be present even in a census.
Underc­ove­rage: occurs when some groups in the population are left out of the process of choosing a sample.

Questions to Ask Before Believing a Poll.

Who carried out the survey? Any personal gain from outcome?
What was the popula­tion? Whose opinions were being asked?
How was the sample selected? Look for mention of random sampling.
How large was the sample? Even better, find out both the sample size and margin of error within which results of 95% of all samples drawn as this one was would fall.
What was the response rate? what percent of the original subjects actually provided inform­ation
How were the subjects contacted? Phone, mail, face to face
When was the survey conducted? Was it just after some event that might have influenced opinion?
What were the exact questions asked?
 

Nonsam­pling Errors

Nonsam­pling errors can be present in a census
Processing error: mistakes in mechanical tasks such as imput.
Response error: when a subject gives an incorrect response (lies/­doesn't unders­tand)
Nonres­ponse: failure to obtain data from an individual selected for a sample. Most nonres­ponse happens because subjects can't be contacted or because some subjects contacted refuse to cooperate
What the Margin of error doesn't say:
 
covers only random sampling error.
 
nonres­ponse, under coverage and other practical diffic­ulties can cause large bias that is not covered by margin of error.

How to Live with Non sampling Errors

First: substitute other households for non respon­ders.
 
Replacing nonres­ponders with other households in the same neighb­ourhood may reduce bias.
Once all the data is in, weight the responses.
 
Weighting is an attempt to correct sources of bias. For example, if there are too many households that did not respond, the survey gives weight to the houses that did respond.
 

Wording Questions

question wording may influence the results of a sample survey
If the questions are slanted to favor one response over others, we have another source of non sampling error.
Taking the sample in several stages with clusters at the final stage saves time for interv­iewers grouping the sample households first in PSUs and then in clusters.
Stratified Sample:
 
step 1: divide the sampling frame into distinct groups of indivi­duals, called strata.
 
step 1.2: choose the strata according to any special interest you have in certain groups within the population or because the indivi­duals in each stratum resemble each other.
 
step 2: take a separate SRS in each stratum and combine these to make up the complete sample.
Probab­ility Sample: chosen by chance. Know what samples are possible and what chance­/pr­oba­bility each possible sample has.
 
Some probab­ility samples, such as stratified samples, don't allow all possible samples from the population and may not give an equal chance to all the samples they do allow.