Skip to main content
Home
Snurblog — Axel Bruns

Main navigation

  • Home
  • Information
  • Blog
  • Research
  • Publications
  • Presentations
  • Press
  • Creative
  • Search Site

'Big Data'

Snurb — Saturday 2 November 2024 22:34

LLMs in Content Coding: The 'Expertise Paradox' and Other Challenges

Elections | Polarisation | Journalism | Industrial Journalism | Internet Technologies | 'Big Data' | Artificial Intelligence | AoIR 2024 |

And the final speaker in this final AoIR 2024 conference session is the excellent Fabio Giglietto, whose focus is on coding Italian news data using Large Language Models. This worked with some 85,000 news articles shared on Facebook during the 2018 and 2022 Italian elections, and first classified such URLs as political or non-political; it then produced and clustered text embeddings for these articles, and used GPT-4-turbo to classify the dominant topics in these clusters.

This required considerable prompt crafting, especially also to ensure that prompts remained within the LLM’s token limits. Key challenges here included the choice of LLM …

» continue reading...
Snurb — Saturday 2 November 2024 22:30

LLMs and Transformer Models in News Content Coding

Politics | Polarisation | Journalism | Industrial Journalism | Internet Technologies | 'Big Data' | Artificial Intelligence | AoIR 2024 |

The next speaker in this final AoIR 2024 conference session is the great Hendrik Meyer, whose interest is in detecting stances in climate change coverage. This focusses especially on climate change debates in German news media, focussing on climate protests, discussions about speed limits, and discussions about heating and heat pump regulations.

Here stances might be better understood as evaluations related to a given issue or policy, and Large Language Models can be useful tools in assessing this, but this also requires considerable prompt crafting in order to generate consistent results. Computational costs for doing so (especially with complex prompts) …

» continue reading...
Snurb — Saturday 2 November 2024 22:28

Towards an LLM-Enhanced Pipeline for Better Stance Detection in News Content

Politics | Polarisation | Journalism | Industrial Journalism | Internet Technologies | 'Big Data' | Artificial Intelligence | Dynamics of Partisanship and Polarisation in Online Public Debate (ARC Laureate Fellowship) | AoIR 2024 |

The next speaker in this session at the AoIR 2024 conference is my QUT colleague Tariq Choucair, whose focus is especially on the use of LLMs in stance detection in news content. A stance is a public act by a social actors, achieved dialogically through communication, which evaluates objects, positions the self and other subjects, and aligns with other subjects within a sociocultural field.

Here, the focus is broadly on stances towards issues, persons, groups, and organisations. There are some tools for doing so, but they mainly focus on English-language content, are designed for specific types of data, and tend …

» continue reading...
Snurb — Saturday 2 November 2024 22:25

Using LLMs to Code Problematic Content in the Brazilian Manosphere

Internet Technologies | 'Big Data' | Artificial Intelligence | Social Media | AoIR 2024 |

The second speaker in this final session at the AoIR 2024 conference is Bruna Silveira de Oliveira, whose focus is on using LLMs to study content in the Brazilian manosphere. Extremist groups in this space seek legitimisation, and the question here is whether LLMs can be used productively to analyse their posts.

This analysis focusses on some 2,500 episodes of Brazilian masculinist podcasts across ten streaming platforms. It engaged in an assisted content analysis using OpenAI’s GPT-4 model, and explored whether this could identify detailed variables in the content. The podcast episodes were transcribed using automated tools, and 52 episodes …

» continue reading...
Snurb — Saturday 2 November 2024 22:24

Paying Attention to Marginalised Groups in Human and Computational Content Coding

Internet Technologies | 'Big Data' | Artificial Intelligence | AoIR 2024 |

The final (!) session at this wonderful AoIR 2024 conference is on content analysis, and starts with Ahrabhi Kathirgamalingam. Her interest is especially on questions of agreement and disagreement between content codings; the gold standard here has for a long time been intercoder reliability, but this tends to presume a single ground truth which may not exist in all coding contexts.

The concept of ‘constructs of marginalisation’ might be useful here: marginalised people are underrepresented; existing structural power defines who defines such constructs; they are historically and culturally shaped; and explicit as well as ambiguous and evasive language that discriminates …

» continue reading...
Snurb — Saturday 2 November 2024 21:37

Assessing Partisanship and Polarisation at Various Stages of News Production and Engagement

Politics | Polarisation | Journalism | Industrial Journalism | Internet Technologies | 'Big Data' | Social Media | Facebook | Social Media Network Mapping | Twitter | ARC Centre of Excellence for Automated Decision-Making and Society | Dynamics of Partisanship and Polarisation in Online Public Debate (ARC Laureate Fellowship) | AoIR 2024 |

I presented in and chaired the Saturday morning session at the AoIR 2024 conference, which was on polarisation in news publishing and engagement, so no liveblogging this time. However, here are the slides from the three presentations that our various teams and I were involved in.

We started with my QUT DMRC colleague Laura Vodden, who discussed our plans for manual and automated content coding of news content for indicators of polarisation, and especially highlighted the surprising difficulties in getting access to quality and comprehensive news content data:

CHALLENGES IN ACQUIRING AND ANALYSING NEWS DATA AT SCALE.pptx from tastysiltstone

I …

» continue reading...
Snurb — Thursday 31 October 2024 22:41

How Meta’s Third-Party Fact-Checkers Are Learning to Think Like the Machine

Journalism | Industrial Journalism | 'Big Data' | Artificial Intelligence | AoIR 2024 |

The final presenters in this session at the AoIR 2024 conference are Yarden Skop and Anna Schjøtt Hansen; their interests are in the third-party fact-checking network employed by Meta. This operates on the basis of a Meta-provided online dashboard that highlights potentially problematic content, and the dashboard’s operation directs fact-checking away from political content spread by major political figures, and towards other forms of content.

Many fact-checking organisations around the world now substantially rely on income from Meta through their engagement in its fact-checking programme; this is part of a global post-publication debunking turn, but also creates a dependency on …

» continue reading...
Snurb — Thursday 31 October 2024 22:40

The Platformisation of Newsroom Data Intermediaries in India

Journalism | Industrial Journalism | 'Big Data' | Artificial Intelligence | AoIR 2024 |

The next speaker in this AoIR 2024 conference session is Simran Agarwal, whose interest is in platformisation intermediaries in the Indian news industry. Her interest here is especially in the meso-layer of intermediaries, where AI-driven machine learning tools provide strategic counsel to newsrooms, broker interactions between platforms and publishers with the aim to ‘help’, ‘assist’, or ‘free’ journalists, and appear as certified partners.

Such intermediaries may be understood as cultural intermediaries, algorithmic experts, metricians, or content recommendation platforms; they may complement platforms or assist content production, and AI systems in particular retool, reshape, and rationalise the news. To explore this …

» continue reading...
Snurb — Thursday 31 October 2024 20:09

The Platformisation of Digital Platforms’ Climate Pledges

Politics | Government | Internet Technologies | 'Big Data' | Artificial Intelligence | AoIR 2024 |

The first full day at the AoIR 2024 conference starts with a panel on climate change, and the first speaker is Emily West, whose interest is in the climate policies of the large digital platform companies – such as Amazon’s ‘Climate Pledge’ initiative. This is supposed to provide an opportunity for involvement by other stakeholders, and some energy transparency measures. There are also the Carbon Free Energy initiative; Frontier, an initiative of the online payment company Stripe, which provides carbon removal and sequestration credits; and some emerging approaches to make generative AI platforms more carbon-neutral.

Even before the rise of …

» continue reading...
Snurb — Friday 27 September 2024 23:17

Towards Better Uses of News Engagement Analytics in Nordic Newsrooms

Journalism | Industrial Journalism | 'Big Data' | ECREA 2024 |

I am presenting our research on the in the Australian Facebook news ban in the post-lunch session at ECREA 2024 this Friday, but we start with a paper by Visa Noronen which examines news organisations’ attempts to understand their audiences in the current media context. This is important for determining editorial direction, and the present study examines such processes for the Nordic countries.

There has been a significant shift towards online news media use in these countries, and audiences are even often prepared to pay for online news subscriptions. Visa interviewed some 16 staff from news organisations that are not …

» continue reading...

Pagination

  • Previous page
  • 2
  • Next page
'Big Data'
INFORMATION
BLOG
RESEARCH
PUBLICATIONS
PRESENTATIONS
PRESS
CREATIVE

Recent Work

Presentations and Talks

Beyond Interaction Networks: An Introduction to Practice Mapping (ACSPRI 2024)

» more

Books, Papers, Articles

Destructive Polarization in Digital Communication Contexts: A Critical Review and Conceptual Framework (Information, Communication & Society)

» more

Opinion and Press

Inside the Moral Panic at Australia's 'First of Its Kind' Summit about Kids on Social Media (Crikey)

» more

Creative Work

Brightest before Dawn (CD, 2011)

» more

Lecture Series


Gatewatching and News Curation: The Lecture Series

Bluesky profile

Mastodon profile

Queensland University of Technology (QUT) profile

Google Scholar profile

Mixcloud profile

[Creative Commons Attribution-NonCommercial-ShareAlike 4.0 Licence]

Except where otherwise noted, this work is licensed under a Creative Commons BY-NC-SA 4.0 Licence.