\documentclass[10pt,a4paper]{article} % Packages \usepackage{fancyhdr} % For header and footer \usepackage{multicol} % Allows multicols in tables \usepackage{tabularx} % Intelligent column widths \usepackage{tabulary} % Used in header and footer \usepackage{hhline} % Border under tables \usepackage{graphicx} % For images \usepackage{xcolor} % For hex colours %\usepackage[utf8x]{inputenc} % For unicode character support \usepackage[T1]{fontenc} % Without this we get weird character replacements \usepackage{colortbl} % For coloured tables \usepackage{setspace} % For line height \usepackage{lastpage} % Needed for total page number \usepackage{seqsplit} % Splits long words. %\usepackage{opensans} % Can't make this work so far. Shame. Would be lovely. \usepackage[normalem]{ulem} % For underlining links % Most of the following are not required for the majority % of cheat sheets but are needed for some symbol support. \usepackage{amsmath} % Symbols \usepackage{MnSymbol} % Symbols \usepackage{wasysym} % Symbols %\usepackage[english,german,french,spanish,italian]{babel} % Languages % Document Info \author{jisidro} \pdfinfo{ /Title (chapter-4-cheat-sheet.pdf) /Creator (Cheatography) /Author (jisidro) /Subject (Chapter 4 Cheat Sheet) } % Lengths and widths \addtolength{\textwidth}{6cm} \addtolength{\textheight}{-1cm} \addtolength{\hoffset}{-3cm} \addtolength{\voffset}{-2cm} \setlength{\tabcolsep}{0.2cm} % Space between columns \setlength{\headsep}{-12pt} % Reduce space between header and content \setlength{\headheight}{85pt} % If less, LaTeX automatically increases it \renewcommand{\footrulewidth}{0pt} % Remove footer line \renewcommand{\headrulewidth}{0pt} % Remove header line \renewcommand{\seqinsert}{\ifmmode\allowbreak\else\-\fi} % Hyphens in seqsplit % This two commands together give roughly % the right line height in the tables \renewcommand{\arraystretch}{1.3} \onehalfspacing % Commands \newcommand{\SetRowColor}[1]{\noalign{\gdef\RowColorName{#1}}\rowcolor{\RowColorName}} % Shortcut for row colour \newcommand{\mymulticolumn}[3]{\multicolumn{#1}{>{\columncolor{\RowColorName}}#2}{#3}} % For coloured multi-cols \newcolumntype{x}[1]{>{\raggedright}p{#1}} % New column types for ragged-right paragraph columns \newcommand{\tn}{\tabularnewline} % Required as custom column type in use % Font and Colours \definecolor{HeadBackground}{HTML}{333333} \definecolor{FootBackground}{HTML}{666666} \definecolor{TextColor}{HTML}{333333} \definecolor{DarkBackground}{HTML}{8FA32A} \definecolor{LightBackground}{HTML}{F8F9F1} \renewcommand{\familydefault}{\sfdefault} \color{TextColor} % Header and Footer \pagestyle{fancy} \fancyhead{} % Set header to blank \fancyfoot{} % Set footer to blank \fancyhead[L]{ \noindent \begin{multicols}{3} \begin{tabulary}{5.8cm}{C} \SetRowColor{DarkBackground} \vspace{-7pt} {\parbox{\dimexpr\textwidth-2\fboxsep\relax}{\noindent \hspace*{-6pt}\includegraphics[width=5.8cm]{/web/www.cheatography.com/public/images/cheatography_logo.pdf}} } \end{tabulary} \columnbreak \begin{tabulary}{11cm}{L} \vspace{-2pt}\large{\bf{\textcolor{DarkBackground}{\textrm{Chapter 4 Cheat Sheet}}}} \\ \normalsize{by \textcolor{DarkBackground}{jisidro} via \textcolor{DarkBackground}{\uline{cheatography.com/164495/cs/34469/}}} \end{tabulary} \end{multicols}} \fancyfoot[L]{ \footnotesize \noindent \begin{multicols}{3} \begin{tabulary}{5.8cm}{LL} \SetRowColor{FootBackground} \mymulticolumn{2}{p{5.377cm}}{\bf\textcolor{white}{Cheatographer}} \\ \vspace{-2pt}jisidro \\ \uline{cheatography.com/jisidro} \\ \end{tabulary} \vfill \columnbreak \begin{tabulary}{5.8cm}{L} \SetRowColor{FootBackground} \mymulticolumn{1}{p{5.377cm}}{\bf\textcolor{white}{Cheat Sheet}} \\ \vspace{-2pt}Published 3rd October, 2022.\\ Updated 3rd October, 2022.\\ Page {\thepage} of \pageref{LastPage}. \end{tabulary} \vfill \columnbreak \begin{tabulary}{5.8cm}{L} \SetRowColor{FootBackground} \mymulticolumn{1}{p{5.377cm}}{\bf\textcolor{white}{Sponsor}} \\ \SetRowColor{white} \vspace{-5pt} %\includegraphics[width=48px,height=48px]{dave.jpeg} Measure your website readability!\\ www.readability-score.com \end{tabulary} \end{multicols}} \begin{document} \raggedright \raggedcolumns % Set font size to small. Switch to any value % from this page to resize cheat sheet text: % www.emerson.emory.edu/services/latex/latex_169.html \footnotesize % Small font. \begin{tabularx}{17.67cm}{x{6.748 cm} x{8.435 cm} p{1.687 cm} } \SetRowColor{DarkBackground} \mymulticolumn{3}{x{17.67cm}}{\bf\textcolor{white}{Descriptive epidemiology}} \tn % Row 0 \SetRowColor{LightBackground} Involves observation, definitions, measurements, and interpretations & Dissemination of health-related states or events by a person, place, and time. & \tn % Row Count 5 (+ 5) % Row 1 \SetRowColor{white} 1. Providing information about a disease or condition. & 2. Providing clues to identify a new disease or adverse health effect. & \tn % Row Count 9 (+ 4) % Row 2 \SetRowColor{LightBackground} 3. Identifying the extent of the public health problem. & 4. Obtaining a description of the public health problem that can be easily communicated. & \tn % Row Count 14 (+ 5) % Row 3 \SetRowColor{white} 5. Identifying the population at greatest risk. & 6. Assisting in planning and resource allocation. & \tn % Row Count 17 (+ 3) % Row 4 \SetRowColor{LightBackground} \mymulticolumn{3}{x{17.67cm}}{7. Identifying avenues for future research that can provide insights about an etiologic relationship between an exposure and health outcome.} \tn % Row Count 20 (+ 3) \hhline{>{\arrayrulecolor{DarkBackground}}---} \SetRowColor{LightBackground} \mymulticolumn{3}{x{17.67cm}}{The research problem, question, and hypotheses are supported by descriptive epidemiology. \newline \newline Hypotheses are tested using appropriate study designs and statistical methods. \newline \newline Describing data by person allows identification of the frequency of disease and who is at greatest risk. \newline \newline Describing data by place (residence, birthplace, place of employment, country, state, county, census tract, etc.) \newline \newline Descriptive statistics are a means of organizing and summarizing data.} \tn \hhline{>{\arrayrulecolor{DarkBackground}}---} \end{tabularx} \par\addvspace{1.3em} \begin{tabularx}{17.67cm}{x{1.8117 cm} x{4.7763 cm} x{4.941 cm} x{4.941 cm} } \SetRowColor{DarkBackground} \mymulticolumn{4}{x{17.67cm}}{\bf\textcolor{white}{Descriptive Study Designs}} \tn % Row 0 \SetRowColor{LightBackground} & \seqsplit{Description} & Strengths & Weakness \tn % Row Count 1 (+ 1) % Row 1 \SetRowColor{white} \seqsplit{Ecologic} & Aggregate data involved (no \seqsplit{information} is available for specific \seqsplit{individuals} & Takes advantages of preexisting data. Relatively quick and \seqsplit{inexpensive.} Can be used to evaluate programs, policies, or regulations implemented at the ecologic level. Allows estimation of effects not easily measurable for individuals & Susceptible to confounding, exposures and disease or injury outcomes not measured on the same individuals \tn % Row Count 21 (+ 20) % Row 2 \SetRowColor{LightBackground} Case Study & A snapshot \seqsplit{description} of a problem or situation for an individual or group; \seqsplit{qualitative} \seqsplit{descriptive} research of the facts in \seqsplit{chronological} order & In depth description provides cues to identify a new disease or adverse health effect resulting from an exposure or experience. Identifies potential areas of research & Conclusions limited to the individual, group, and/or context under study, cannot be used to establish a \seqsplit{cause-effect} \seqsplit{relationship} \tn % Row Count 35 (+ 14) \end{tabularx} \par\addvspace{1.3em} \begin{tabularx}{17.67cm}{x{1.8117 cm} x{4.7763 cm} x{4.941 cm} x{4.941 cm} } \SetRowColor{DarkBackground} \mymulticolumn{4}{x{17.67cm}}{\bf\textcolor{white}{Descriptive Study Designs (cont)}} \tn % Row 3 \SetRowColor{LightBackground} \seqsplit{Cross-Sectional} & All variables measured at a point in the time. No \seqsplit{distinction} between potential risk factors and outsomes & Control over study population. Control over \seqsplit{measurements}. Several \seqsplit{associations} between variables can be studied at the same time. Short time period required. Complete data collection. Exposure and \seqsplit{injury/disease} data collected from same \seqsplit{individuals.} Questions can be asked to obtain prevalence data. & No data in the time \seqsplit{relationship} between exposure and \seqsplit{injury/disease} development, no follow up, potential bias from low response rate, potential \seqsplit{measurements} bias, higher proportion of long term survivors, not feasible with rare exposures or outcomes, does not yield incidence or relative risk \tn % Row Count 25 (+ 25) \hhline{>{\arrayrulecolor{DarkBackground}}----} \SetRowColor{LightBackground} \mymulticolumn{4}{x{17.67cm}}{Descriptive study designs include case reports and case series, cross-sectional surveys, and exploratory ecologic designs. \newline These designs provide a means for obtaining descriptive statistics without typically attempting to test particular hypotheses. \newline In an ecologic study, the unit of analysis is the population. In a case report, case series, or cross-sectional survey, the unit of analysis is the individual.} \tn \hhline{>{\arrayrulecolor{DarkBackground}}----} \end{tabularx} \par\addvspace{1.3em} \begin{tabularx}{17.67cm}{x{3.7994 cm} x{13.4706 cm} } \SetRowColor{DarkBackground} \mymulticolumn{2}{x{17.67cm}}{\bf\textcolor{white}{Ratios, Proportions, and Rates I}} \tn % Row 0 \SetRowColor{LightBackground} Ration & In a ratio, the values of x and y are distinct, such that the values of x are not contained in y. The rate base for a ratio is 100 = 1. The rate base for a ratio is 100 = 1. For example, in 2017 in the United States, the leading causes of death for the age group 15–24 were unintentional injury (9,746 in males and 3,695 in females), suicide (5,027 in males and 1,225 in females), and homicide (4,234 in males and 671 in females). Corresponding ratios indicate that males are 2.64 times more likely die from unintentional injury, 4.10 times more likely to commit suicide, and 6.31 times more likely to die from homicide. \tn % Row Count 21 (+ 21) % Row 1 \SetRowColor{white} \seqsplit{Proportions} & A proportion is typically expressed as a percentage, such that the rate base is 102 = 100. Thus, for the preceding data, we can say that of deaths involving unintentional injury, 72.5\% were male; of suicides, 80.4\% were male; and of deaths due to homicide, 86.3\% were male. \tn % Row Count 30 (+ 9) \end{tabularx} \par\addvspace{1.3em} \begin{tabularx}{17.67cm}{x{3.7994 cm} x{13.4706 cm} } \SetRowColor{DarkBackground} \mymulticolumn{2}{x{17.67cm}}{\bf\textcolor{white}{Ratios, Proportions, and Rates I (cont)}} \tn % Row 2 \SetRowColor{LightBackground} Rates & Is a type of frequency measure where the numerator involves nominal data that represent the presence or absence of a health-related state or event. It also incorporates the added dimension of time; it may be thought of as a proportion with the addition that it represents the number of disease states, events, behaviors, or conditions in a population over a specified time period. An incidence rate is the number of new cases of a specified health-related state or event reported during a given time interval divided by the estimated population at risk of becoming a case. \tn % Row Count 19 (+ 19) \hhline{>{\arrayrulecolor{DarkBackground}}--} \SetRowColor{LightBackground} \mymulticolumn{2}{x{17.67cm}}{In epidemiology, it is common to deal with data that indicate whether an individual was exposed to an illness, has an illness, has experienced an injury, is disabled, or is dead. Ratios, proportions, and rates are commonly used measures for describing dichotomous data. The general formula for a ratio, proportion, or rate is: X/Y *10\textasciicircum{}Z. \newline \newline A mortality rate is the total number of deaths reported during a given time interval divided by the population from which the deaths occurred.} \tn \hhline{>{\arrayrulecolor{DarkBackground}}--} \end{tabularx} \par\addvspace{1.3em} \begin{tabularx}{17.67cm}{x{8.635 cm} x{8.635 cm} } \SetRowColor{DarkBackground} \mymulticolumn{2}{x{17.67cm}}{\bf\textcolor{white}{Ratios, Proportions, and Rates II}} \tn % Row 0 \SetRowColor{LightBackground} A mortality rate is the total number of deaths reported during a given time interval divided by the population from which the deaths occurred. & The attack rate is also called the cumulative incidence rate. It tends to describe diseases or events that affect a larger proportion of the population than the conventional incidence rate \tn % Row Count 10 (+ 10) % Row 1 \SetRowColor{white} \mymulticolumn{2}{x{17.67cm}}{Outbreak refers to more localized situations, whereas epidemic refers to more widespread disease and, possibly, over a longer period. The investigation of the outbreak involved first constructing a line listing of those at the picnic. Each line represented an individual, with measurements taken on age, gender, time the meal was eaten, whether illness resulted, date of onset, time of onset, and whether selected foods were eaten.} \tn % Row Count 19 (+ 9) % Row 2 \SetRowColor{LightBackground} Another common measure for describing disease and health-related events is prevalence, which is the frequency of existing cases of a health-related state or event in a given population at a certain time or period. & Period prevalence is the frequency of an existing health-related state or event during a time period. For example, the period prevalence of arthritis in a given year includes existing cases the first day of the year, along with new (incident) cases diagnosed during the year. Period prevalence is less commonly used than point prevalence. \tn % Row Count 36 (+ 17) \end{tabularx} \par\addvspace{1.3em} \begin{tabularx}{17.67cm}{x{8.635 cm} x{8.635 cm} } \SetRowColor{DarkBackground} \mymulticolumn{2}{x{17.67cm}}{\bf\textcolor{white}{Ratios, Proportions, and Rates II (cont)}} \tn % Row 3 \SetRowColor{LightBackground} \mymulticolumn{2}{x{17.67cm}}{The crude rate of an outcome is calculated without any restrictions, such as by age or gender or weighted adjustment of group-specific rates; however, these rates are limited if the epidemiologist is trying to compare them between subgroups of the population or over time because of potential confounding influences, such as differences in the age distribution between groups.} \tn % Row Count 8 (+ 8) % Row 4 \SetRowColor{white} \mymulticolumn{2}{x{17.67cm}}{A confidence interval is the range of values in which the population rate is likely to fall.} \tn % Row Count 10 (+ 2) \hhline{>{\arrayrulecolor{DarkBackground}}--} \SetRowColor{LightBackground} \mymulticolumn{2}{x{17.67cm}}{Interpretation \newline $\filledsquare{}$ SMR = 1: The health-related states or events observed were the same as expected from the age-specific rates in the standard population. \newline \newline $\filledsquare{}$ SMR \textgreater{} 1: More health-related states or events were observed than expected from the age-specific rates in the standard population. \newline \newline $\filledsquare{}$ SMR \textless{} 1: Fewer health-related states or events were observed than expected from the age-specific rates in the standard population.} \tn \hhline{>{\arrayrulecolor{DarkBackground}}--} \end{tabularx} \par\addvspace{1.3em} \begin{tabularx}{17.67cm}{X} \SetRowColor{DarkBackground} \mymulticolumn{1}{x{17.67cm}}{\bf\textcolor{white}{Tables, Graphs, and Numerical Measures}} \tn % Row 0 \SetRowColor{LightBackground} \mymulticolumn{1}{x{17.67cm}}{A frequency distribution is a complete summary of the frequencies, or number of times each value appears.} \tn % Row Count 3 (+ 3) % Row 1 \SetRowColor{white} \mymulticolumn{1}{x{17.67cm}}{A box plot has a single axis and presents a summary of the data.} \tn % Row Count 5 (+ 2) % Row 2 \SetRowColor{LightBackground} \mymulticolumn{1}{x{17.67cm}}{Median is the number or value that divides a list of numbers in half; it is the middle observation in the data set. It is less sensitive to outliers than the mean.} \tn % Row Count 9 (+ 4) \hhline{>{\arrayrulecolor{DarkBackground}}-} \end{tabularx} \par\addvspace{1.3em} \begin{tabularx}{17.67cm}{x{8.635 cm} x{8.635 cm} } \SetRowColor{DarkBackground} \mymulticolumn{2}{x{17.67cm}}{\bf\textcolor{white}{Measures of Association}} \tn % Row 0 \SetRowColor{LightBackground} Correlation coefficient & Represents the proportion of the total variation in the dependent variable that is determined by the independent variable. If a perfect positive or negative association exists, then all the variation in the dependent variable would be explained by the independent variable. Generally, however, only part of the variation in the dependent variable can be explained by a single independent variable. \tn % Row Count 20 (+ 20) % Row 1 \SetRowColor{white} Spearman's rank correlation coefficient (denoted by rs) & An alternative to the Pearson correlation coefficient when outlying data exist, such that one or both of the distributions are skewed. This method is robust to outliers. \tn % Row Count 29 (+ 9) % Row 2 \SetRowColor{LightBackground} Simple regression model y = b0 + b1x1 & A statistical analysis that provides an equation that estimates the change in the dependent variable (y) per unit change in an independent variable (x). This method assumes that for each value of x, y is normally distributed; that the standard deviation of the outcomes y do not change over x; that the outcomes y are independent; and that a linear relationship exists between x and y. \tn % Row Count 49 (+ 20) \end{tabularx} \par\addvspace{1.3em} \begin{tabularx}{17.67cm}{x{8.635 cm} x{8.635 cm} } \SetRowColor{DarkBackground} \mymulticolumn{2}{x{17.67cm}}{\bf\textcolor{white}{Measures of Association (cont)}} \tn % Row 3 \SetRowColor{LightBackground} Multiple regression y = b0 + b1x1 + … + bkxk & An extension of simple regression analysis in which there are two or more independent variables. The effects of multiple independent variables on the dependent variable can be simultaneously assessed. This type of model is useful for adjusting for potential confounders. \tn % Row Count 14 (+ 14) % Row 4 \SetRowColor{white} Logistic regression Log(odds) = b0 + b1x1 & A type of regression in which the dependent variable is a dichotomous variable. Logistic regression is commonly used in epidemiology because many of the outcome measures considered involve nominal data. \tn % Row Count 25 (+ 11) % Row 5 \SetRowColor{LightBackground} Multiple logistic regression Log(odds) = b0 + b1x1 + … + bkxk & An extension of logistic regression in which two or more independent variables are included in the model. It allows the researcher to look at the simultaneous effect of multiple independent variables on the dependent variable. As in the case of multiple regression, this method is effective in controlling for confounding factors. \tn % Row Count 42 (+ 17) \hhline{>{\arrayrulecolor{DarkBackground}}--} \SetRowColor{LightBackground} \mymulticolumn{2}{x{17.67cm}}{A contingency table is where all entries are classified by each of the variables in the table. For example, suppose we were interested in assessing whether exposure to a dietary intervention (yes vs. no) is associated with a decrease in low-density lipoprotein (yes vs. no). A 2 × 2 contingency table could represent the data.} \tn \hhline{>{\arrayrulecolor{DarkBackground}}--} \end{tabularx} \par\addvspace{1.3em} \begin{tabularx}{17.67cm}{x{8.635 cm} x{8.635 cm} } \SetRowColor{DarkBackground} \mymulticolumn{2}{x{17.67cm}}{\bf\textcolor{white}{Conclusion}} \tn % Row 0 \SetRowColor{LightBackground} Descriptive epidemiology is used to assess and monitor the health of communities and to identify health problems and priorities according to person (who?), place (where?), and time (when?) factors. It also involves characterizing the nature of the health problem (what?). Selected descriptive study designs, statistical measures, and graphs and charts were presented for describing the frequency and pattern of health-related states or events. & Descriptive analysis is the first step in epidemiology to understanding the presence, extent, and nature of a public health problem and is useful for formulating research hypotheses. Descriptive studies are hypothesis generating; they provide the rationale for testing specific hypotheses. The analytic study design, which is the focus of a later chapter, involves evaluating directional hypotheses about associations between variables. Some of the same measures and statistical tests used in exploratory and descriptive studies are also used in analytic studies. After a hypothesis is statistically evaluated for significance and an association between variables is deemed to not be explained by chance, bias, or confounding, then an investigator can use this information as part of the evidence for establishing a cause–effect relationship. Other criteria to consider in making a judgment about causality must also be considered, including temporality, dose–response relationship, biologic credibility, and consistency among studies. \tn % Row Count 52 (+ 52) \hhline{>{\arrayrulecolor{DarkBackground}}--} \end{tabularx} \par\addvspace{1.3em} \end{document}