krototype/The-Preprocessor

📝 The-Preprocessor

The Preprocessor is a CLI tool that helps data scientists preprocess their data without writing any code 😃

📒 Features

  • Data description, to help you learn more about your data.
  • Univariate analysis with graph plotting.
  • Bivariate analysis with graph plotting, to show the relationships between variables.
  • Data imputation, and removal of rows where needed.
  • Data wrangling: one-hot encoding, normalization, and column dropping.
  • Download of the new dataset after preprocessing.
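For readers curious what these steps look like in code, here is a minimal pandas sketch of description, imputation, column dropping, one-hot encoding, and normalization. It is not taken from this repository: the tiny data frame, its column names, mean imputation, and min-max scaling are all assumptions chosen for illustration, and the tool itself may implement each step differently.

```python
import pandas as pd

# Tiny made-up frame standing in for a real dataset (illustrative values only).
df = pd.DataFrame({
    "age": [22.0, None, 30.0, 40.0],
    "fare": [10.0, 20.0, 30.0, 50.0],
    "sex": ["male", "female", "female", "male"],
    "ticket": ["A1", "B2", "C3", "D4"],
})

# Data description: summary statistics and per-column missing-value counts.
summary = df.describe()
missing = df.isna().sum()

# Imputation (assumed strategy): fill missing ages with the column mean.
df["age"] = df["age"].fillna(df["age"].mean())

# Column dropping: remove a column that carries no useful signal here.
df = df.drop(columns=["ticket"])

# One-hot encoding: replace the categorical column with indicator columns.
df = pd.get_dummies(df, columns=["sex"])

# Normalization (assumed to be min-max scaling to the range [0, 1]).
for col in ["age", "fare"]:
    df[col] = (df[col] - df[col].min()) / (df[col].max() - df[col].min())

# Export: with no path, to_csv returns the CSV text; pass a path to write a file.
csv_text = df.to_csv(index=False)
```

Passing a file name such as `df.to_csv("preprocessed.csv", index=False)` would write the result to disk instead of returning it as a string.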

⬇️ Installation

  1. Clone the repository.
  2. Move to the folder where the repository is stored.
  3. Run the command
$ python data_intake.py
  4. The tool then guides you through the remaining steps.

📂 Modules Used

  • pandas
  • numpy
  • sys
  • seaborn
  • matplotlib
  • scipy
  • sklearn (installed as scikit-learn)
Note: One dataset (the Titanic dataset) is included with the repo for testing purposes 😉
Note: When providing a dataset, either enter its path or store the dataset in the same folder as the script 😅
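The second note can be pictured with a small sketch of how a dataset name might be resolved: first as a path as given, then as a file in a base folder. The helper `resolve_dataset` and its `base_dir` parameter are hypothetical names introduced here for illustration; this is not the lookup code used by `data_intake.py`.

```python
from pathlib import Path


def resolve_dataset(name, base_dir="."):
    """Hypothetical helper: locate a dataset by path or by name in base_dir."""
    candidate = Path(name).expanduser()
    # Case 1: the user entered a path that points at an existing file.
    if candidate.is_file():
        return candidate
    # Case 2: fall back to a file with the same name inside base_dir.
    fallback = Path(base_dir) / candidate.name
    if fallback.is_file():
        return fallback
    raise FileNotFoundError(f"Dataset not found: {name}")
```

Under these assumptions, calling `resolve_dataset` with a full path returns that path, while calling it with a bare file name looks in `base_dir`, which defaults to the current working directory.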

About

A CLI toolkit for easing the pre-processing steps before applying any machine learning model
