The final speaker on this first day of "Compromised Data" is Sidneyeve Matrix, who shifts our focus towards geosocial information as generated by smartphones and other mobile devices. Only 12% of US users as surveyed by the Pew Centre posted Foursquare check-ins in 2013, for example, down from 18% in 2011 - but this may mask a greater take-up of other location-based services, not least the Frequent Locations functionality in iOS7.
There is a continuing trend towards the consumerisation of geodata. Geosocial cultural arrangements are explored through the use of mobile communication patterns, but such analysis is notoriously …
The next speaker at "Compromised Data" this afternoon is Asta Zelenkauskaite, who notes the increasing interweaving of social and mainstream media; based on the properties of 'big data' it therefore becomes important to explore how users engage with mass media and cross-media contexts. How relevant are 'big data' to the mass communication field?
Traditional media outlets have been mainly focussing on a quasi-passive engagement with media content, while social media now offer a two-way interaction by providing back channel functionality. Mass media content, user-generated content, and user interactions' digital imprints are coming together to shape this cross-media environment. …
The next speaker at "Compromised Data" is Joanna Redden, whose interest is in government uses of 'big data', especially in Canada. There's a great deal of hype surrounding 'big data' in government at the moment, which needs to be explored from a critical perspective; the data rush has been compared to the gold rush, with similarly utopian claims - here especially around the ability for 'big data' to support decision-making and democratic engagement, and the contribution 'big data'-enabled industries can make to the GDP.
But how are 'big data' actually being used in government contexts? New tools and …
The final presentation in this "Compromised Data" session is by Mary Francoli and Dan Paré, who focus on the question of engagement and mobilisation in a time of rapidly evolving social media use. One initial observation is that these terms lack definitional clarity - there are some very high-level definitions (e.g. building on UN definitions), but these remain vague; political and civic engagement are conflated, and specific forms of engagement are not necessarily defined in detail.
Simply voting is a form of engagement, for example, but is clearly different from other, more complex forms of political engagement. The …
The next session at "Compromised Data" starts with Frauke Zeller, who begins by noting the multimodality of communication, including through social media: many texts are using more than one semiotic mode, combining text, images, audio and video. How can the existing methods for studying multimodality be transferred to online environments, and to research building on 'big data', however?
Some such work begins with exploring the networks between users, and between texts, but this is not enough - how do we move from the macro to the meso and micro levels of communication? How do we move to the …
The next paper at the "Compromised Data" symposium is by Jean Burgess and me, and explores the more difficult forms of 'big data' research we're rarely conducting at present because the political economy of data access is weighted against specific approaches - in the specific context of Twitter research. I'll upload the slides and audio for it as soon as possible - for now, consider this a placeholder! Slides and audio below:
The next speaker at "Compromised Data" today is Carolin Gerlitz, who begins by suggesting that social media data are both standardised and vague at the same time. She notes the German Twitter community which is focussed around favouriting on Twitter: the Favstar sphere sees favourites as a sign of importance and validation, and taking away favourites is therefore a serious affront.
This is an example of how the communicative affordances of social media platforms are being utilised by their users; these standardised activities mark the grammar of action on such platforms, and are specific to the particular …
From the always fabulous Association of Internet Researchers conference I've made it to the research colloquium "Compromised Data", organised by Greg Elmer and Ganaele Langlois in Toronto. We're starting with a presentation from Taina Bucher, on the enactive power of APIs. Beyond merely helping to collect data, APIs have a number of other functions. We must not get too seduced by the opportunities for data access and visualisation which they provide, but take APIs seriously as a mechanism for making meaning from data that privileges certain approaches over others.
This means a kind of "digital humanities" in reverse …
The final presenter in this AoIR 2013 session (and thus, the final presenter this year!) is Hazel Kwon, whose aim is to better understand the flows of communication on social media during protests. Her frame of research in this is Emergent Norm Theory, whose emphasis is on the rapid and transformative potential of word of mouth on collective behaviours. This is a process of diffusion for a collective identity.
Protests can be understood as collective behaviours. They may be prompted by the circulation of rumours, which are characterised by the informal and improvised circulation of situational information; gradually, key themes …