You are here

'Big Data'

Prominent Themes in Data Sovereignty Debates Online

The final speaker in this ACSPRI 2024 conference session is Sidiq Madya, whose interest is in the discussion of the idea of data sovereignty by civil society organisations. Data sovereignty is a spectrum of approaches by nation states to subject data flows to national jurisdictions, and/or the ability or right of individuals to control their personal data and information.

Human vs. LLM Coding of Australian Charities’ Civic Activities

The final speaker in this ACSPRI 2024 conference session is Aaron Willcox, presenting work with the Scanlon Research Institute to explore local government-level civic opportunities. For organisations, such opportunities include hosting events, offering memberships, involving individuals through volunteering, and taking action through advocacy and campaigns.

Using Large Language Models to Code Policy Feedback Submissions

The first session at the ACSPRI 2024 conference is on generative AI, and starts with Lachlan Watson. He is interested in the use of AI assistance to analyse public policy submissions, here in the context of Animal Welfare Victoria’s draft cat management strategy. Feedback could be in the form of written submissions, surveys, or both, and needed to be analysed using quantitative approaches given the substantial volume of submission.

Fundamental Principles for Indigenous Data Sovereignty

From the AANZCA conference in Melbourne of the last few days I’ve moved on to the ACSPRI 2024 conference in Sydney for the rest of the week, which starts with a keynote by Maggie Walter, on methodologies for Indigenous statistics and quantitative research. Maggie is a Palawa woman from Tasmania. Data and population statistics have changed dramatically over the past decade or more; conventionally, Australian Indigenous people have been presented merely as average statistics that show what Maggie calls the Statistical Indigene: documenting prolonged disadvantage and inequality.

This is the case because these are the things we have data about: unemployment, imprisonment, health issues, etc. But these data are political: they are political artefacts that reflect a specific purpose, and position Indigenous people as hapless, helpless, and hopeless. This is a pejorative portrayal which is simplistic and undemanding of its audience; their presentation never advances beyond frequency tables and simplistic breakdowns (e.g. by gender or age). They define Indigenous people by the race they are not.

Maggie calls this ‘5D’ data: deficit, difference, disparity, disadvantage, and dysfunction. The aim may be to close the socioeconomic gap, but the aim is simply to bring Indigenous populations ‘up’ to a non-Indigenous level. And this pattern is not unique to Australia: the same is true for other (Anglo-)colonised nations, with many of the same deficits and dysfunctions identified – yet without ever acknowledging the underlying source of these patterns, which is Anglo-colonisation itself. This may be well-intentioned, but is nonetheless damaging.

The Complicated Influences Affecting Contemporary Internet Governance

The next session at the AANZCA 2024 conference starts with a paper by Terry Flew, Agata Stepnik, and Tim Koskie, who begin by noting the changing contours of Internet governance. There is increasing nation-state regulation in liberal democracies as well as authoritarian states, as well as renewed debate about the treatment of digital and social media platforms and a populist push towards greater regulation.

LLMs in Content Coding: The 'Expertise Paradox' and Other Challenges

And the final speaker in this final AoIR 2024 conference session is the excellent Fabio Giglietto, whose focus is on coding Italian news data using Large Language Models. This worked with some 85,000 news articles shared on Facebook during the 2018 and 2022 Italian elections, and first classified such URLs as political or non-political; it then produced and clustered text embeddings for these articles, and used GPT-4-turbo to classify the dominant topics in these clusters.

LLMs and Transformer Models in News Content Coding

The next speaker in this final AoIR 2024 conference session is the great Hendrik Meyer, whose interest is in detecting stances in climate change coverage. This focusses especially on climate change debates in German news media, focussing on climate protests, discussions about speed limits, and discussions about heating and heat pump regulations.

Pages

Subscribe to RSS - 'Big Data'