Show Menu
Cheatography

Data Analytics Test Cheat Sheet (DRAFT) by [deleted]

MIS 3300

This is a draft cheat sheet. It is a work in progress and is not finished yet.

Data Science

•Bu­siness Analyt­ics - the skills, techno­logies, applic­ations, and practices for contin­uous, iterative explor­ation and invest­igation of past business perfor­mance to gain insight and drive business planning
•Data Mining - the comput­ational process of discov­ering patterns in large data sets ("big data")
•Big Data - the term for a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or tradit­ional data processing applic­ations
•Bu­siness Intell­ige­nce - a set of theories, method­olo­gies, archit­ect­ures, and techno­logies that transform raw data into meaningful and useful inform­ation for business purposes
•De­scr­iptive Analyt­ics - to gain insight from historical data with reporting, scorec­ards, cluste­ring, etc.
•Pr­edi­ctive Analyt­ics - a variety of techniques from statis­tics, modeling, machine learning, and data mining that analyze current and historical facts to make predic­tions about future, or otherwise unknown, events
•Pr­esc­riptive Analyt­ics - automa­tically synthe­sizes big data, multiple discip­lines of mathem­atical sciences and comput­ational sciences, and business rules, to make predic­tions and then suggests decision options to take advantage of the predic­tions

CRISP-DM

Cross Industry Standard Process for Data Mining (CRISP­-DM):
1. Business unders­tan­ding
2. Data unders­tan­ding
3. Data prepar­ation
4. Modeling
5. Evalua­tion
6. Deploy­ment
   

Cluster Analysis

Defini­tions
Partit­ional vs Hierar­chical
ret
•Type of analysis: –Unsup­ervised (looking for natural relati­onships - not trying to predict a target variable) •Type of data: –Quant­itative (inter­val­/ratio) and/or qualit­ative (ordin­al/­nom­inal) data may be used in cluster analysis •Types of business questions answered: –Do cases (e.g., customers, employees, etc.) tend to cluster into natural groups that we can use to take some action?