Simple Ml regression models applied on Volve Dataset
- Performed EDA
- Selected features based on Linear relationship between variables
(Some variables are non linearly related)
Split Data into 70:30::training:testing (60::40 & 80::20 perform worse on this Dataset)
Used ML models to forecast Oil Production
- Multi Linear Regression
- KNN Regressor
- Decision Tree
- Random Forest
- Gradient Boosting
- XGB
- Support Vector Machine (with 3 different Kernels = rbf, poly & sigmoid
Evaluated Model using different errors
Have to test for more metrics in future, need to do PCA to reduce data dimension
Use Deep Learning methods on Time series Data
- Multi Linear Regression