You are here

'Big Data'

A Final Round-Up of Publications and Other Updates from 2024

I disappeared on summer holidays pretty much immediately after my keynote on practice mapping at the ACSPRI conference in Sydney in late November, so I haven’t yet had a chance to round up my and our last few publications for the year (as well as a handful of early arrivals from 2025). And what a year it’s been – although it’s felt as if I’ve taken a more supportive than leading role these past few months, there have still been quite a few new developments, and a good lot more to come. I’ll group these thematically here:

 

Polarisation, Destructive or Otherwise

Central to the work of my current Australian Laureate Fellowship has been the development of our concept of destructive polarisation, and exploration of the five key symptoms we’ve identified for it: (a) breakdown of communication; (b) discrediting and dismissing of information; (c) erasure of complexities; (d) exacerbated attention to and space for extreme voices; and (e) exclusion through emotions. The point here is to distinguish such clearly problematic dynamics from other forms of polarisation that are more quotidian and benign, and may even be beneficial as they enable different sides of an argument to better define what they stand for. Where polarisation becomes destructive, on the other hand, mainstream political and societal cohesion declines and fails (and aren’t we seeing a lot of that at the moment…). I’ve got to pay tribute here to my Laureate Fellowship team, and especially the four Postdoctoral Research Associates Katharina Esau, Tariq dos Santos Choucair, Sebastian Svegaard, and Samantha Vilkins – Katharina in particular drove the development of this concept from its first presentation at the 2023 ICA conference in Toronto to the comprehensive journal article which has now been published in Information, Communication & Society:

Katharina Esau, Tariq Choucair, Samantha Vilkins, Sebastian F.K. Svegaard, Axel Bruns, Kate O'Connor-Farfan, and Carly Lubicz-Zaorski. “Destructive Polarization in Digital Communication Contexts: A Critical Review and Conceptual Framework.Information, Communication & Society, 2024. DOI: 10.1080/1369118X.2024.2413127.

Meanwhile, I’ve led the writing on a second article that also outlines this concept and provides some further examples for its symptoms. This has now been published in the new Routledge Handbook of Political Campaigning, and counts as our first publication in 2025:

Prominent Themes in Data Sovereignty Debates Online

The final speaker in this ACSPRI 2024 conference session is Sidiq Madya, whose interest is in the discussion of the idea of data sovereignty by civil society organisations. Data sovereignty is a spectrum of approaches by nation states to subject data flows to national jurisdictions, and/or the ability or right of individuals to control their personal data and information.

Human vs. LLM Coding of Australian Charities’ Civic Activities

The final speaker in this ACSPRI 2024 conference session is Aaron Willcox, presenting work with the Scanlon Research Institute to explore local government-level civic opportunities. For organisations, such opportunities include hosting events, offering memberships, involving individuals through volunteering, and taking action through advocacy and campaigns.

Using Large Language Models to Code Policy Feedback Submissions

The first session at the ACSPRI 2024 conference is on generative AI, and starts with Lachlan Watson. He is interested in the use of AI assistance to analyse public policy submissions, here in the context of Animal Welfare Victoria’s draft cat management strategy. Feedback could be in the form of written submissions, surveys, or both, and needed to be analysed using quantitative approaches given the substantial volume of submission.

Fundamental Principles for Indigenous Data Sovereignty

From the AANZCA conference in Melbourne of the last few days I’ve moved on to the ACSPRI 2024 conference in Sydney for the rest of the week, which starts with a keynote by Maggie Walter, on methodologies for Indigenous statistics and quantitative research. Maggie is a Palawa woman from Tasmania. Data and population statistics have changed dramatically over the past decade or more; conventionally, Australian Indigenous people have been presented merely as average statistics that show what Maggie calls the Statistical Indigene: documenting prolonged disadvantage and inequality.

This is the case because these are the things we have data about: unemployment, imprisonment, health issues, etc. But these data are political: they are political artefacts that reflect a specific purpose, and position Indigenous people as hapless, helpless, and hopeless. This is a pejorative portrayal which is simplistic and undemanding of its audience; their presentation never advances beyond frequency tables and simplistic breakdowns (e.g. by gender or age). They define Indigenous people by the race they are not.

Maggie calls this ‘5D’ data: deficit, difference, disparity, disadvantage, and dysfunction. The aim may be to close the socioeconomic gap, but the aim is simply to bring Indigenous populations ‘up’ to a non-Indigenous level. And this pattern is not unique to Australia: the same is true for other (Anglo-)colonised nations, with many of the same deficits and dysfunctions identified – yet without ever acknowledging the underlying source of these patterns, which is Anglo-colonisation itself. This may be well-intentioned, but is nonetheless damaging.

The Complicated Influences Affecting Contemporary Internet Governance

The next session at the AANZCA 2024 conference starts with a paper by Terry Flew, Agata Stepnik, and Tim Koskie, who begin by noting the changing contours of Internet governance. There is increasing nation-state regulation in liberal democracies as well as authoritarian states, as well as renewed debate about the treatment of digital and social media platforms and a populist push towards greater regulation.

LLMs in Content Coding: The 'Expertise Paradox' and Other Challenges

And the final speaker in this final AoIR 2024 conference session is the excellent Fabio Giglietto, whose focus is on coding Italian news data using Large Language Models. This worked with some 85,000 news articles shared on Facebook during the 2018 and 2022 Italian elections, and first classified such URLs as political or non-political; it then produced and clustered text embeddings for these articles, and used GPT-4-turbo to classify the dominant topics in these clusters.

Pages

Subscribe to RSS - 'Big Data'