CS412: Final Exam Cheat Sheet

Types of Data

Record: data matrix (crosstabs), document data (term-frequency vectors of text documents)

Graph/Network: WWW, Facebook, molecular structures

Ordered: video data (sequence of images), temporal data (time series), genetic sequence data

Spatial/Image/Multimedia: Maps, Photos, Videos

Median interval

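The exact median is expensive to compute for large amounts of data, so for grouped data it is approximated by interpolating within the median interval (the interval that contains the median):

median ≈ L1 + ((N/2 − sum_freq_lower) / freq_median) × width

L1 is the lower boundary of the median interval, N is the number of values in the entire dataset, sum_freq_lower is the sum of the frequencies of all intervals lower than the median interval, freq_median is the frequency of the median interval, and width is the width of the median interval.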
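The interpolation described above can be sketched in Python. This is a minimal illustration, not a reference implementation: the function name `grouped_median`, the `(lower, upper, freq)` tuple layout, and the sample intervals are all assumptions made for the example.

```python
def grouped_median(intervals):
    """Approximate the median of grouped data by interpolation.

    intervals: list of (lower, upper, freq) tuples, sorted by lower boundary.
    (This tuple layout is an assumption made for this sketch.)
    """
    n = sum(freq for _, _, freq in intervals)  # N: total number of values
    cumulative = 0  # sum of frequencies of all intervals below the current one
    for lower, upper, freq in intervals:
        # The median interval is the first one whose cumulative frequency reaches N/2.
        if freq > 0 and cumulative + freq >= n / 2:
            width = upper - lower
            return lower + ((n / 2 - cumulative) / freq) * width
        cumulative += freq
    raise ValueError("no data in any interval")


# Hypothetical grouped data: three intervals of width 10.
groups = [(0, 10, 5), (10, 20, 10), (20, 30, 5)]
print(grouped_median(groups))
```

Here N = 20, so the median interval is 10 to 20 (the cumulative frequency reaches N/2 = 10 inside it), and the estimate falls in the middle of that interval.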

Attribute Type: Just important info

Binary attribute type

A special case of the nominal attribute type with only two categories (states); also discrete

Symmetric binary vs asymmetric binary

Both outcomes equally important vs outcomes not equally important

Numeric: interval-scaled vs ratio-scaled

Interval-scaled: no true zero point, e.g. temperature not measured in kelvin. Ratio-scaled: true zero point, so ratios are meaningful, e.g. temperature in kelvin, length, count

Measures of central tendency: Mode/Midrange

Unimodal, multimodal, bimodal, trimodal, no mode

Datasets with one mode vs more than one mode vs two modes vs three modes vs every value occurring only once
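A small Python sketch of these categories, assuming a non-empty list of values. The helper names `modes` and `modality` are hypothetical, and the choice to label four or more modes simply as "multimodal" is an assumption of this example (bimodal and trimodal are themselves special cases of multimodal).

```python
from collections import Counter


def modes(values):
    """Return all most-frequent values, or [] if every value occurs only once."""
    counts = Counter(values)
    top = max(counts.values())
    if top == 1:
        return []  # each value occurs once: no mode
    return sorted(v for v, c in counts.items() if c == top)


def modality(values):
    """Name the dataset by its number of modes."""
    names = {0: "no mode", 1: "unimodal", 2: "bimodal", 3: "trimodal"}
    return names.get(len(modes(values)), "multimodal")


print(modality([1, 2, 2, 3]))     # one mode (2)
print(modality([1, 1, 2, 2, 3]))  # two modes (1 and 2)
print(modality([1, 2, 3]))        # every value occurs once
```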

unimodal data formula

For moderately skewed (asymmetrical) unimodal data, the empirical relation is: mean − mode ≈ 3 × (mean − median)

symmetric vs positively vs negatively skewed data

mean = median = mode at the same center (symmetric) vs mode < median < mean (positively skewed, right-skewed) vs mean < median < mode (negatively skewed, left-skewed)
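The orderings above and the unimodal relation can be tried out with Python's standard `statistics` module. This is a rough sketch: the function names and sample data are hypothetical, comparing only the mean and the median is a heuristic rather than a formal skewness test, and the mode estimate is only reasonable for moderately skewed unimodal data.

```python
from statistics import mean, median


def skew_direction(values):
    """Heuristic: compare the mean with the median."""
    m, md = mean(values), median(values)
    if m > md:
        return "positively skewed"  # median < mean
    if m < md:
        return "negatively skewed"  # mean < median
    return "symmetric"


def empirical_mode(values):
    """Estimate the mode from mean - mode = 3 * (mean - median)."""
    m, md = mean(values), median(values)
    return m - 3 * (m - md)


right_tailed = [1, 2, 2, 3, 10]  # hypothetical data with one large value
print(skew_direction(right_tailed))
print(empirical_mode(right_tailed))
```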

midrange

(highest value + lowest value) / 2
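As a quick illustration in Python (the function name and sample values are hypothetical):

```python
def midrange(values):
    """Average of the largest and smallest values."""
    return (max(values) + min(values)) / 2


print(midrange([3, 7, 10, 25]))  # (25 + 3) / 2
```

Because it depends only on the two extreme values, a single outlier shifts the midrange directly.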

Measures of central tendency: Mean

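Sample mean: x̄ = (1/n) Σ x_i, over the n values in the sample.
Population mean: μ = (Σ x) / N, over all N values in the population.
Weighted mean: x̄ = (Σ w_i x_i) / (Σ w_i), where w_i is the weight of value x_i.
The mean is the most useful measure of center, but it is a poor choice for skewed data or data with outliers.
Solution: trimmed mean, the mean computed after trimming the extreme values. Valuable information is lost if too much is trimmed.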
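A minimal Python sketch of the weighted and trimmed mean, under stated assumptions: the function names, the `proportion` parameter (the fraction cut from each end after sorting), and the sample values are all hypothetical choices for this example.

```python
def weighted_mean(values, weights):
    """Sum of weight * value, divided by the sum of the weights."""
    return sum(w * x for x, w in zip(values, weights)) / sum(weights)


def trimmed_mean(values, proportion):
    """Mean after dropping `proportion` of the values from each end."""
    ordered = sorted(values)
    k = int(len(ordered) * proportion)  # number of values cut from each end
    kept = ordered[k:len(ordered) - k]
    if not kept:
        raise ValueError("trimmed away every value")
    return sum(kept) / len(kept)


# Hypothetical values with one high outlier (110).
salaries = [30, 36, 47, 50, 52, 52, 56, 60, 63, 70, 70, 110]
print(sum(salaries) / len(salaries))  # plain mean, pulled up by the outlier
print(trimmed_mean(salaries, 0.1))    # drops the lowest and highest value
print(weighted_mean([80, 90], [1, 3]))
```

With `proportion=0.1` and 12 values, one value is removed from each end, so the outlier no longer affects the result.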


CS412: Final Exam Cheat Sheet (DRAFT) by ntp
