You are here

'Big Data'

Presenting Our Social Media Work at the 2013 IBM Research Colloquium

Now that I’m back in Australia from my extended conference trip, I immediately got back on a plane to travel to a freezing Melbourne, to present our social media research in crisis communication and beyond at the 2013 IBM Research Colloquium. Below are my slides and audio – many thanks again to Jennifer Lai and her team at IBM Research Australia for the invitation!

Social Media Issue Publics in Australia (IBM Research Colloquium 2013)

IBM Research Colloquium 2013

Social Media Issue Publics in Australia

Axel Bruns

When important news breaks, social media facilitate the rapid formation of issue publics which come together to 'work the story' of the unfolding event. This is especially evident in the context of natural disasters and other crises. The close study of social media feeds during such crisis provides a valuable insight into the dynamics of the event, with participants acting as human sensors for new information and current trends. This paper outlines the crisis communication research conducted at the ARC Centre of Excellence for Creative Industries and Innovation at Queensland University of Technology, and outlines the need for further background research into the longer-term development of social media platforms.

How Julia Gillard's Misogyny Speech Went Viral

The next panel at the Digital Methods conference begins with a panel by Theresa Sauter and me, on the viral distribution of links to the video of Julia Gillard's "misogyny" speech in 2012 as it was posted in full on the ABC News site. Unfortunately the audio recording didn't work out, so below are the slides only - do make sure you click on the links to see the video and the animations of the emerging retweet network.

The Challenges of Understanding Content Dissemination on Facebook

The final speakers in this Digital Methods plenary are Axel Maireder and Katrin Jungnickel, whose interest is in the uncertainties of the Facebook timeline. Facebook has continued to tinker with how the timeline is selected and presented for several years now, and this affects the flow of communication on the platform; what, then, are the factors which determine that flow?

This study combined content analysis and user surveys, but both these approaches have their drawbacks - it is impossible from the outside to track the content of users' timelines, for example, but surveys of users also suffer from self-reporting biases. In the end, the researchers asked users to copy the links they received through their timelines into an online survey, and to discuss the content of the URLs and the Facebook friends they received them from. Issues with privacy as well as the tedious nature of this approach also affect the results, however. Some 550 users participated in the study.

Generating Representative Samples from Search Engine Results?

The next plenary speaker at Digital Methods is Martin Emmer, whose focus is on sampling methods in digital contexts. Online media are now important public fora, and conventional media are increasingly using digital channels to transmit their content as well; this also leads to a shift in media usage, of course, and some of that shift is also driven by generational change.

If we need to examine the digital space to understand current debates in the public sphere, then, how do we generate representative samples of online content and activities? With traditional mass media, it was possible to draw on comprehensive lists of media providers, with a small handful of alternative media; in the digital environment, channels and platforms have multiplied massively, and it is no longer trivial to select a small number of sites and spaces which represent all online activity.

The Impact of Social Sharing on Google Search Results

The next session at Digital Methods is a plenary panel which begins with Christina Schumann, whose focus is on Google and other search engines as technological actors on the Internet. Search engines are especially important as they now serve as a kind of gatekeeper on the Net - but the criteria they use for ranking and structuring information are often far from transparent.

The basic approach of search engines is to crawl or otherwise gather Internet data which are then indexed and processed into a database; this database is queried as a search query is entered into the search engine. Factors in returning search results include on-page information (content, programming, and design of Web pages) as well as off-page metadata (especially the link networks surrounding each page, relative to the theme of the query).

The Opportunities and Challenges of 'Big Data' Research

At the end of an extended trip to a range of conferences and symposia I've made my way to Vienna, where I'm attending the DGPuK Digital Methods conference at the University of Vienna. The conference is in German, but I'll try to blog the presentations in English nonetheless - wish me luck... We begin with keynote by Jürgen Pfeffer, addressing - not surprisingly - the question of 'big data' in communications research.

Jürgen begins by asking what's different about 'big data' research. In our field, we're using 'big data' on communication and interaction to work towards a real-time analysis of large-scale, dynamic sociocultural systems, necessarily especially through computational approaches - this draws on the data available from major social networks and other participative sites, but it aims not to research "the Internet", but society by examining communication patterns on the Internet (and elsewhere).

Distinguishing Chain and Name Networks in Social Network Analysis

The final speaker in this "Compromised Data" session is Anatoliy Gruzd, whose interest is in the automated discovery and visualisation of communication networks from social media data. (He's also just launched a new journal in this field, Big Data and Society.) How can such networks be discovered and visualised, and how can we evaluate the sense of community which may exist in them?

Social network analysis enables us to investigate the connections between users in social networks. It reduces large quantities of messages to a smaller number of nodes exchanging communication; it can track longitudinal developments over time; it can show the social dynamics of interaction around specific topics and events; and it can differentiate between different types of network formation in social interaction.

Bottom-Up Measurements of Network Performance

The next session at "Compromised Data" starts with Fenwick McKelvey, who begins with a reference to the emergence of digitised methods for the study of the Web during the mid-2000s. This was the time around which the latest generation of social media emerged, enabling us to begin thinking about society through the study of the Internet, requiring the development of new research methods by repurposing computer science methods for social science research.

In Toronto, Infoscape Labs developed a number of tools for the exploration of political discourse in Web 2.0, including the Blogometer. This is the emergence of platform studies, paying attention to the platform itself - but this also introduces challenges about how to study the platform, as the core object of research itself intervenes in its study, e.g. through the politics of APIs. This work also required compromises around data access and utilisation, and a growing bifurcation between scholarly and commercial research activities emerged.

Archiving Our Personal Digital Milieux

The final presenter in this morning session at "Compromised Data" is Yuk Hui, who will present a social media self-archiving project. He has worked for years on audiovisual archives, but much of the work on this field has focussed on institutional rather than personal archives, with the latter often concerned mainly with privacy issues.

But another set of problems relates to data management instead: we are working with multiple cloud-based systems, but rarely archive our digital objects effectively - archiving is not just about storing, but about preserving the context of digital objects as well: the digital milieu.

Pages

Subscribe to RSS - 'Big Data'