- Log Parser - tokenizes log files of the format described here
- StackOverflow Scraper - an API that generates a database of posts from Stack Overflow
- Mongo Matcher - a matching algorithm for comparing errors to data scraped by the StackOverflow Scraper
- Mongo DB API - an API for the MongoDB database containing information from the StackOverflow Scraper
git clone https://github.com/cloudhubs/log-errors.git
These steps use PyCharm as the IDE and assume you have already cloned the repo.
- Open log-errors/scraper in PyCharm.
- Indicate the Python interpreter if prompted at the top.
- Open a new terminal at the bottom of the window.
- If the terminal prompt does not start with the name of your virtual environment, follow this.
For example, note the (venv) indicating the virtual environment, followed by the path and the branch (master):
(venv) mark@MarksComputer ~/Documents/scraper/log-errors (master) $
- Run the following command in the scraper/ root:
pip install -r requirements.txt
- To run the project, right-click api/main.py and select Run.
- Alternatively, using bash:
$ python3 -m venv venv
$ source venv/bin/activate
$ pip3 install -r requirements.txt
$ python3 main.py
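Before installing the requirements, it can help to confirm that the interpreter is really running inside the virtual environment. The following is a minimal sketch using only the standard library; the helper names `in_virtualenv` and `describe_environment` are illustrative and not part of this repo, and the check assumes an environment created with `python3 -m venv`.

```python
import sys


def in_virtualenv() -> bool:
    """Return True when the interpreter runs inside a venv-style environment."""
    # In an environment created with `python3 -m venv`, sys.prefix points at
    # the environment while sys.base_prefix points at the base installation.
    return sys.prefix != sys.base_prefix


def describe_environment() -> str:
    """Build a one-line, human-readable status message."""
    if in_virtualenv():
        return f"virtual environment active: {sys.prefix}"
    return "no virtual environment active; run `source venv/bin/activate` first"


print(describe_environment())
```

If the message reports no active environment, activate it before running `pip3 install -r requirements.txt` so the packages do not end up in the system interpreter.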
Start the parser here
- Parse File:
POST /errors/
The Mongo and scraper APIs run simultaneously. Start them from this file.
- Start Scraper:
POST /scrape/{language}
- Scrape Meta-data:
POST /scrape-meta/{language}
- Stop Scraper:
POST /scrape/stop
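The scraper routes above take the language as a path segment, so a client has to build the URL carefully. The sketch below shows one way to do that with the standard library. The base URL `http://localhost:8000` and the helper names `scrape_url` and `stop_url` are assumptions for illustration; the host and port the API actually listens on are not stated here, and neither is the set of language names the scraper accepts.

```python
from urllib.parse import quote

# Assumption: adjust to wherever the API is actually served.
BASE_URL = "http://localhost:8000"


def scrape_url(language: str, meta: bool = False, base_url: str = BASE_URL) -> str:
    """Return the URL for POST /scrape/{language} or POST /scrape-meta/{language}."""
    prefix = "scrape-meta" if meta else "scrape"
    # Percent-encode the language so names such as "c++" or "c#" remain a
    # single, valid path segment.
    return f"{base_url.rstrip('/')}/{prefix}/{quote(language, safe='')}"


def stop_url(base_url: str = BASE_URL) -> str:
    """Return the URL for POST /scrape/stop."""
    return f"{base_url.rstrip('/')}/scrape/stop"


print(scrape_url("python"))
print(scrape_url("c++", meta=True))
print(stop_url())
```

Each of these URLs is meant to be sent with the POST method, matching the routes listed above.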
The Mongo and scraper APIs run simultaneously. Start them from this file.
Start the parser here
- Find Matches:
GET /matcher
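As a rough illustration of calling the matcher route from Python, the sketch below prepares a GET request with the standard library. The base URL `http://localhost:8000` and the function names are assumptions, not part of this repo. Whether the route expects query parameters, and what the response looks like, is not documented here, so the body is returned as raw bytes.

```python
import urllib.request

# Assumption: adjust to wherever the API is actually served.
BASE_URL = "http://localhost:8000"


def build_matcher_request(base_url: str = BASE_URL) -> urllib.request.Request:
    """Prepare (but do not send) a GET request for the /matcher route."""
    url = f"{base_url.rstrip('/')}/matcher"
    return urllib.request.Request(url, method="GET")


def fetch_matches(base_url: str = BASE_URL, timeout: float = 10.0) -> bytes:
    """Send the request and return the undecoded response body."""
    request = build_matcher_request(base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read()


# Building the request does not touch the network; only fetch_matches() does.
request = build_matcher_request()
print(request.get_method(), request.full_url)
```

Calling `fetch_matches()` requires the API to be running; it raises `urllib.error.URLError` if the server cannot be reached.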