This application has a number of test data sources for Python for Informatics: Exploring Information written by @DrChuck / www.dr-chuck.com.
This data is set up to be served by a Content Data Network (CDN) product like CloudFlare to conserve bandwidth and provide quicker access to a worldwide learner population. There is a cloud-hosted copy of this data at py4e-data.dr-chuck.net that you may be able to use.