Cheatography

# Tools of Data Science (G7, W7, M7) Cheat Sheet (DRAFT) by [deleted]

This is a draft cheat sheet. It is a work in progress and is not finished yet.

### G7 is "­Graphs of Number­s"

 Stratified Sampling is also useful in G7 Bar Graph: Analysis of size. ("Pareto Chart" is arranged bar garaph.) Line Chart: Time series analysis. ("Co­ntrol Chart" is one of the line chart.) Circle Graph: Analysis of ratio Histogram: Analysis of distri­bution of 1 variable. (One of the Bar Graph) Box Plot: Analysis of distri­bution of 1 variable. (similar to stratified histogram) Scatter Diagram: Analysis of the relati­onship of two variables. Analysis of distri­bution of composed 2 variables. (To find outlier or to study small data set) Heat Map: Analysis of the relati­onship of two variables. Analysis of distri­bution of composed 2 variables. (To study big data set)

### W7 is "­Ana­lysis of Words"

 Most of W7 are Concept Analysis . Affinity Diagram: Classi­fic­ation of idea. The method to collect Brains­torming Cause-­and­-Effect Diagram: To collect reasons and results. Tree Diagram: Similar to FMEA . The next step of Cause-­and­-Effect Diagram. Relation Diagram: Main part of Systems Thinking . Matrix Diagram: Applic­ations are QFD , Multi Dimens­ional Scaling and AHP. Arrow Diagram: Planning method. Flow Chart: Analysis of the process
Why-why Analysis is a basis of concept analysis
W7 analyze the structure of phenomena as levels or networks. The idea to think the structure is also useful in G7 and M7.

### M7 is "­Mat­hem­atical Analys­is"

 Error Analysis: Analysis of the quality of the data Average & Standard Deviation: Analysis of statis­tical value. Testing of Difference of the Average: Basic tool of Hypothesis Testing. Regression Analysis: Include of the analysis Correl­ation . Principal Component Analysis: In N7, called "­Matrix Data Analys­is". Decision Tree: Applic­ation of Stratified Sampling Linear Progra­mming: To find the best in constr­aints.