Tools of Data Science (G7, W7, M7) Cheat Sheet (DRAFT) by [deleted]

This is a draft cheat sheet. It is a work in progress and is not finished yet.

G7 is "Graphs of Numbers"

Stratified Sampling is also useful in G7.
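
As a rough illustration of stratified sampling, the sketch below groups records by stratum and draws the same fraction from each group. The function name `stratified_sample`, the `fraction` and `seed` parameters, and the sample population are all hypothetical, chosen only for this example.

```python
import random
from collections import defaultdict

def stratified_sample(records, key, fraction, seed=0):
    """Draw roughly `fraction` of the records from every stratum."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    strata = defaultdict(list)
    for rec in records:
        strata[key(rec)].append(rec)  # group records by stratum label
    sample = []
    for members in strata.values():
        # At least one record per stratum, never more than the stratum holds.
        k = min(len(members), max(1, round(fraction * len(members))))
        sample.extend(rng.sample(members, k))
    return sample

# Hypothetical population: 80 records in stratum "A", 20 in stratum "B".
population = [("A", i) for i in range(80)] + [("B", i) for i in range(20)]
sample = stratified_sample(population, key=lambda rec: rec[0], fraction=0.1)
counts = {label: sum(1 for rec in sample if rec[0] == label) for label in ("A", "B")}
print(counts)
```

Each stratum keeps its share of the population in the sample, so a graph drawn per stratum (for example a stratified histogram or box plot) compares like with like.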

Bar Graph: Analysis of size. (A "Pareto Chart" is a bar graph with the bars sorted by size.)
Line Chart: Time series analysis. (A "Control Chart" is one kind of line chart.)
Circle Graph: Analysis of ratios.
Histogram: Analysis of the distribution of one variable. (A kind of bar graph.)
Box Plot: Analysis of the distribution of one variable. (Similar to a stratified histogram.)
Scatter Diagram: Analysis of the relationship between two variables, i.e. of their joint distribution. (Used to find outliers or to study a small data set.)
Heat Map: Analysis of the relationship between two variables, i.e. of their joint distribution. (Used to study a large data set.)
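
The Pareto Chart mentioned above is a bar graph whose bars are sorted from largest to smallest, usually read together with a cumulative percentage. A minimal sketch of the underlying table follows; the function name `pareto_table` and the defect log are invented for illustration.

```python
from collections import Counter

def pareto_table(observations):
    """Return (category, count, cumulative_percent) rows, largest count first."""
    counts = Counter(observations)
    total = sum(counts.values())
    rows = []
    running = 0
    # most_common() sorts by descending count, which is the Pareto ordering.
    for category, count in counts.most_common():
        running += count
        rows.append((category, count, round(100 * running / total, 1)))
    return rows

# Hypothetical defect log, not taken from any real data set.
defects = ["scratch"] * 5 + ["dent"] * 3 + ["stain"] * 2
for row in pareto_table(defects):
    print(row)
# ('scratch', 5, 50.0)
# ('dent', 3, 80.0)
# ('stain', 2, 100.0)
```

Plotting the counts as bars and the third column as a line gives the usual Pareto Chart.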

W7 is "Analysis of Words"

Most W7 tools are forms of Concept Analysis.

Affinity Diagram: Classification of ideas. A method for organizing the results of Brainstorming.
Cause-and-Effect Diagram: Collects causes and their effects.
Tree Diagram: Similar to FMEA. The next step after a Cause-and-Effect Diagram.
Relation Diagram: Main part of Systems Thinking.
Matrix Diagram: Applications include QFD, Multidimensional Scaling, and AHP.
Arrow Diagram: Planning method.
Flow Chart: Analysis of a process.
Why-why Analysis: A basis of Concept Analysis.
W7 analyzes the structure of phenomena as levels or networks. Thinking in terms of structure is also useful in G7 and M7.
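
One way to picture "structure as levels" is a nested dictionary, where each level answers the "why" of the level above it. The sketch below is only an assumption about how such a tree might be stored; the tree contents and the function names `walk` and `cause_chains` are hypothetical.

```python
# Hypothetical why-why tree: each key is a finding, each value holds its causes.
why_tree = {
    "late delivery": {
        "machine stopped": {
            "bearing seized": {"no lubrication schedule": {}},
        },
        "parts shortage": {"order placed late": {}},
    }
}

def walk(tree, depth=0):
    """Yield (depth, label) pairs in depth-first order."""
    for label, subtree in tree.items():
        yield depth, label
        yield from walk(subtree, depth + 1)

def cause_chains(tree, path=()):
    """Yield the full chain of whys that ends at each leaf of the tree."""
    for label, subtree in tree.items():
        chain = path + (label,)
        if subtree:
            yield from cause_chains(subtree, chain)
        else:
            yield chain  # a leaf: nothing deeper has been recorded

for depth, label in walk(why_tree):
    print("  " * depth + label)  # indentation shows the level

for chain in cause_chains(why_tree):
    print(" <- ".join(chain))
```

A network (as in a Relation Diagram) would need a graph rather than a tree, for example a dictionary mapping each node to a list of neighbours.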

M7 is "Mathematical Analysis"

Error Analysis: Analysis of the quality of the data.
Average & Standard Deviation: Analysis of summary statistics.
Test of the Difference Between Averages: A basic tool of Hypothesis Testing.
Regression Analysis: Includes Correlation analysis.
Principal Component Analysis: In N7, called "Matrix Data Analysis".
Decision Tree: An application of Stratified Sampling.
Linear Programming: Finds the best solution under constraints.
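
To show how Regression Analysis includes Correlation, the sketch below fits a least-squares line with one explanatory variable and returns the correlation coefficient from the same sums of squares. The function name `simple_regression` and the data points are made up for this example.

```python
from math import sqrt
from statistics import mean

def simple_regression(xs, ys):
    """Return (slope, intercept, r) for the least-squares line y = slope * x + intercept."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)                     # spread of x
    syy = sum((y - my) ** 2 for y in ys)                     # spread of y
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))   # co-variation
    slope = sxy / sxx
    intercept = my - slope * mx
    r = sxy / sqrt(sxx * syy)  # Pearson correlation coefficient
    return slope, intercept, r

# Hypothetical measurements, invented for illustration.
xs = [1, 2, 3, 4, 5]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]
slope, intercept, r = simple_regression(xs, ys)
print(round(slope, 2), round(intercept, 2), round(r, 3))
```

The slope and the correlation coefficient share the same numerator, which is why the two analyses are usually done together.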