Into the Crossfire

Evaluating the Use of a Language Model to Crowdsource Gun Violence Reports

Adriano Belisario, Scott A. Hale, Luc Rocher

The data gap on gun violence

Gun violence is a severe human rights issue that affects nearly every dimension of the social fabric, such as healthcare, education, psychology, and the economy.

The United States and Brazil account for a large share of global firearm-related violence, reaching epidemic and public health crisis levels.

Reliable data is crucial to develop effective policies and emergency responses. In Brazil, however, there are no official records of gun violence events.

Fogo Cruzado (“Crossfire”) monitors events of gun violence in four Brazilian cities.

Analysts track social media posts and on-the-ground sources 24/7.

They have been interacting with users who report gun violence on Twitter/X since 2018.

Keyword-based search with geographical filters on Tweetdeck.

Mobile app shows real-time alerts of gun violence events.

The needle in the haystack

Social media is a valuable source for crowdsourcing evidence in human rights monitoring and investigations, but…

  • Keyword-based search leads to a high proportion of unrelated text

  • Small teams can’t process high volumes of data

What’s been tried

Previous works show that machine learning models can help human rights organizations to filter large volumes of data.

However, we found several gaps in previous research:

  • No systematic evaluations of adopting these model in real-world settings

  • No previous work with Portuguese texts

Our Work

We built an open-source language model to help crowdsource gun violence reports from social media.

With Fogo Cruzado, we tested its real-world use in Brazil (2023).

We asked whether Transformer-based models can detect gun violence reports in Portuguese and how they support analysts in daily monitoring.

To answer these questions, we fine-tuned a BERT model on past interactions and built a web prototype to visualize results, evaluating its impact through surveys, interviews, and interaction metrics (diff-in-diff).

Text classification

Positive Examples Negative Examples
Gunshots started going off right when I ordered a milkshake. I hope this man is a brave warrior. Sometimes, certain words are like a shot, especially when you’re feeling a little insecure.
People here randomly fire off shots out of nowhere. Oh so many distracted friends’ photos that I took Jesus I deserved to get shot lol.
I already wake up startled, hearing gunshots. I’m trying to let my nails grow, but when anxiety attacks, I tear them all off.

Text classification

A BERT-based model in Portuguese achieved good performance (87% of recall for positive cases).

The prototype