Python for Humanists: Parsing Data
Python for Humanists: Parsing Data
Develop techniques for wrangling messy data with Python. Data retrieved online often needs to be transformed or otherwise parsed before it can become usable for your research. In this workshop, we’ll walk through using Beautiful Soup, a Python library for extracting data from HTML and XML files.
This workshop is designed for participants who have taken the “First Steps with Python” workshop or who otherwise have a general understanding of Python’s syntax and data types. If you missed or want to review what was covered in “First Steps with Python,” you can find the tutorial on the DHLab’s GitHub repository.
Instructors: Catherine DeRose (DHLab) and Doug Duhaime (DHLab)
Registration
This workshop is open to all Yale students, faculty, and staff, but space is limited. To register, visit the YUL Instruction Calendar. If you have registered, you will be sent a Zoom link the day before the workshop. If you don’t receive the email or lose the link, please contact the Digital Humanities Lab.
Participants are asked to come to the workshop with Anaconda Python (version 3.7 or higher) already installed. If you have trouble with the installation, stop by the Digital Humanities Lab’s virtual Office Hours for help.
Be among the first to know when future workshops are announced by signing up for the DHLab’s newsletter.