+------------------------+
| Data Analysis |
+------------------------+
|
|
+----------------------------------------------------+
| |
+-------------------+ +---------------------+
| Data Collection | | Data Cleaning |
+-------------------+ +---------------------+
| |
+---------------------------+ +--------------------------+
| Sources of Data           | | Cleaning Steps           |
| - Databases               | | - Removing Null Values   |
| - APIs                    | | - Fixing Inconsistencies |
| - Web Scraping            | | - Data Transformation    |
|                           | | - Normalization          |
+---------------------------+ +--------------------------+
|
+------------------------------+
| Collection Methods |
| - Surveys |
| - Experiments |
| - Observations |
+------------------------------+
|
+---------------------------------+
| Exploratory Data Analysis (EDA) |
+---------------------------------+
|
+-----------------------------+ +-------------------------------+
| Data Visualization | | Descriptive Statistics |
| - Graphs (bar, pie) | | - Mean, Median, Mode |
| - Histograms, Boxplots | | - Standard Deviation |
+-----------------------------+ +-------------------------------+
|
|
+---------------------------------------+
| Data Patterns and Trends              |
| - Anomalies                           |
| - Correlations                        |
+---------------------------------------+
|
+----------------------------+
| Data Modeling |
+----------------------------+
|
+---------------------------------+ +------------------------------+
| Predictive Algorithms           | | Evaluation of Models         |
| - Regression (Linear, Logistic) | | - Accuracy, Recall, F1-Score |
| - Classification (KNN, SVM)     | | - Confusion Matrix           |
+---------------------------------+ +------------------------------+
|
+-----------------------------+
| Result Interpretation & |
| Presentation |
+-----------------------------+
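
A few of the stages in the diagram above can be sketched in code. The example below is a minimal illustration, not a reference pipeline: it uses only the Python standard library, and the function names (clean, min_max_normalize, describe, evaluate) are hypothetical names chosen for this sketch. It assumes nulls are represented as None and that model labels are binary, with 1 as the positive class.

```python
import statistics


def clean(values):
    """Data Cleaning: drop null (None) entries."""
    return [v for v in values if v is not None]


def min_max_normalize(values):
    """Normalization: rescale values linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values are identical, so there is no spread to rescale.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def describe(values):
    """Descriptive Statistics: mean, median, mode, sample standard deviation."""
    return {
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "mode": statistics.mode(values),
        "stdev": statistics.stdev(values),  # needs at least two values
    }


def evaluate(y_true, y_pred):
    """Evaluation of Models: confusion-matrix counts and derived metrics."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives

    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    return {
        "confusion_matrix": {"tp": tp, "tn": tn, "fp": fp, "fn": fn},
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }
```

For instance, clean([2, None, 4, 4, None, 6]) keeps only the four numeric values, and describe() on that result gives a mean, median, and mode of 4. Precision is computed even though the diagram does not list it, because the F1-Score is defined from precision and recall.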