Data Visualizer Tool

Overview

Data Visualizer Tool is a Python application designed to assist users in performing exploratory data analysis (EDA). The application provides a user-friendly interface built with Tkinter, allowing users to easily load datasets, visualize data, and apply various transformations.

This project was created during my early years at university, and while it serves its purpose, Since Tkinter is considered an outdated tool for building GUIs, I plan to redo it using more modern frameworks like Flask, Django, or FastAPI as well as include Machine Learning capabilities. (If I have the time that is).

Features

Load CSV Files: Users can select and load CSV files into the application for analysis.
Exploratory Data Analysis (EDA): The application provides detailed EDA capabilities, including:
- Summary statistics of the dataset.
- Visualization of missing values.
- Detailed analysis of individual columns, including histograms and common values.
Data Transformation: Users can perform various data transformations, including:
- Handling missing values (mean, median, or removal).
- Encoding categorical columns using label encoding.
- Renaming and removing columns.
- Removing duplicates from the dataset.
Data Visualization: The application includes several visualization options:
- Histograms for numerical data.
- Stacked bar charts for categorical data.
- Scatter plots to visualize interactions between two columns.
Correlation Analysis: Users can visualize correlations between different columns in the dataset using heatmaps.
Interaction Analysis: Users can explore interactions between different features in the dataset through scatter plots.
User-Friendly Interface: The application is designed with a simple and intuitive interface, making it accessible for users with varying levels of expertise in data science.

Getting Started

Prerequisites

Python 3.x
Required libraries:
- NumPy
- Pandas
- Matplotlib
- Seaborn
- Tkinter
- scikit-learn
- Pillow

Installation

Clone the repository:

git clone https://github.com/yourusername/Supervised_ML_Helper.git

Navigate to the project directory:
```
cd Supervised_ML_Helper
```
Install the required libraries:
```
pip install -r requirements.txt
```

Usage

You can run the application using the provided Python file for the GUI:
```
python Supervised_ML_Classifier_python.py
```
Alternatively, there is a Jupyter Notebook available for additional analysis and exploration of the dataset.
Follow the on-screen instructions to load your dataset and explore the various features of the application.

Screenshots

Here are some screenshots of the application in action:

Histogram EDA:
Detailed EDA:
Correlation Heatmap:

Name		Name	Last commit message	Last commit date
Latest commit History 58 Commits
CSVs		CSVs
Results Pictures		Results Pictures
Data_Visualizer.py		Data_Visualizer.py
Data_Visualizer_Notebook.ipynb		Data_Visualizer_Notebook.ipynb
README.md		README.md
requirments.txt		requirments.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Data Visualizer Tool

Overview

Features

Getting Started

Prerequisites

Installation

Usage

Screenshots

About

Uh oh!

Releases

Packages

Languages

Gallillio/Data_Science-Data_Visualizer_Tool

Folders and files

Latest commit

History

Repository files navigation

Data Visualizer Tool

Overview

Features

Getting Started

Prerequisites

Installation

Usage

Screenshots

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages