Automated collection of publicly accessible data (or data ‘scraping’) is widely practiced, but operates under a cloud as the use of automated tools to scrape is prohibited by the site terms of many on-line environments and various technical and legal means have been used to block the practice.