Finally in this AoIR 2015 session, we move on to Greg Elmer, one of the editors of Compromised Data: From Social Media to Big Data. His contribution is focussed on the practice of collecting data from social media sites, some of which is done using some very simple Web scraping tools (as Edward Snowden did at the NSA, apparently).
Scraping is now a common practice in a number of contexts; some sites scrape from mainstream news sites in order to gain better search rankings, for example. Google briefly introduced a tool to identify where site content had been scraped …