The Preprocessor is a CLI tool that helps data scientists preprocess their data without writing any code 😃
- Describes the data so you can learn more about it.
- Univariate analysis with graph plotting.
- Bivariate analysis with graph plotting to show the relationships between variables.
- Data imputation, and row removal where needed.
- Data wrangling: one-hot encoding, normalization, and column dropping.
- Download the new dataset after preprocessing.
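For readers curious about what these steps look like when written by hand, here is a minimal sketch in pandas. It is not the Preprocessor's own code: the toy data, the column names (`age`, `city`, `id`), the choice of mean imputation, and the choice of min-max scaling are all assumptions made for illustration.

```python
import pandas as pd

# Hypothetical toy data; the column names are made up for this example.
df = pd.DataFrame({
    "age": [25.0, None, 35.0, 45.0],
    "city": ["Pune", "Delhi", "Pune", "Mumbai"],
    "id": [101, 102, 103, 104],
})

# Data description: summary statistics for every column.
summary = df.describe(include="all")

# Imputation (assumed strategy): fill missing ages with the column mean.
df["age"] = df["age"].fillna(df["age"].mean())

# Column dropping: remove a column that carries no signal.
df = df.drop(columns=["id"])

# One-hot encoding: turn the categorical column into 0/1 indicator columns.
df = pd.get_dummies(df, columns=["city"], dtype=int)

# Normalization (assumed min-max scaling): rescale age to the range [0, 1].
age_min, age_max = df["age"].min(), df["age"].max()
df["age"] = (df["age"] - age_min) / (age_max - age_min)

# Export: without a path, to_csv returns the CSV text;
# passing a file path instead would write the dataset to disk.
csv_text = df.to_csv(index=False)
```

Each of these steps is something the tool offers interactively, so the sketch is only meant to show what kind of transformation each feature name refers to.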
- Clone the repository.
- Move to the folder where the repository is stored.
- Run the command
$ python data_intake.py
- The tool then guides you through the remaining steps.
- pandas
- numpy
- sys (part of the Python standard library, no installation needed)
- seaborn
- matplotlib
- scipy
- sklearn (installed as the `scikit-learn` package)
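Before running the tool, it can help to confirm that the modules listed above are importable. The following is a small sketch, not part of the repository: the helper name `missing_modules` and the `REQUIRED` list are hypothetical, and the check uses only the standard library.

```python
import importlib.util

# Import names taken from the list above; `sys` ships with Python itself,
# so it is left out of the check.
REQUIRED = ["pandas", "numpy", "seaborn", "matplotlib", "scipy", "sklearn"]


def missing_modules(names):
    """Return the names that cannot be found in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All listed modules are available.")
```

If `sklearn` is reported as missing, note that the package to install is named `scikit-learn`.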