This will be a 3-hour workshop that would give you an overview of how to do Web Scraping using Python. We will be using the following libraries:
- Jupyter (for Instructional Purposes only)
- Request
- LXML
- Selenium
For this activity, we will be Scraping Job Postings from MyNimo. And just to be sure that we're clear to Scrape data from that website, here's the Terms and Conditions of MyNimo.
- Download and Install **Anaconda **here.
- Download the Files here. Save it somewhere on your Documents folder and unzip the compressed folder.
- Open Anaconda -> Jupyter Notebook.
- On Jupyter Notebook, navigate to where you saved the Files, and open the web_scraping_basics_student.ipynb file.
- Follow the instructions on the Jupyter Notebook.
[1] A. Sweigart and Recorded Books, Inc, Automate the boring stuff with Python : practical programming for total beginners . San Francisco: No Starch Press, 2020.
[2] S. Masango, “Web Scraping using Python (and Beautiful Soup),” DataCamp, Inc. , Jul. 26, 2018. https://www.datacamp.com/tutorial/web-scraping-using-python (accessed Mar. 10, 2024).
[3] “What is a URL?,” What is a URL? - Learn web development | MDN , Aug. 03, 2023. https://developer.mozilla.org/en-US/docs/Learn/Common_questions/Web_mechanics/What_is_a_URL (accessed Mar. 11, 2024).