Because Data Can’t Speak for Itself


A Practical Guide to Telling Persuasive Policy Stories” by David Chrisinger and Lauren Brodsky: “People with important evidence-based ideas often struggle to translate data into stories their readers can relate to and understand. And if leaders can’t communicate well to their audience, they will not be able to make important changes in the world.

Why do some evidence-based ideas thrive while others die? And how do we improve the chances of worthy ideas? In Because Data Can’t Speak for Itself, accomplished educators and writers David Chrisinger and Lauren Brodsky tackle these questions head-on. They reveal the parts and functions of effective data-driven stories and explain myriad ways to turn your data dump into a narrative that can inform, persuade, and inspire action.

Chrisinger and Brodsky show that convincing data-driven stories draw their power from the same three traits, which they call peoplepurpose, and persistence. Writers need to find the real people behind the numbers and share their stories. At the same time, they need to remember their own purpose and be honest about what data says—and, just as importantly, what it does not.

Compelling and concise, this fast-paced tour of success stories—and several failures—includes examples on topics such as COVID-19, public diplomacy, and criminal justice…(More)”

Data Free Disney


Essay by Janet Vertesy: “…Once upon a time, you could just go to Disneyland. You could get tickets at the gates, stand in line for rides, buy food and tchotchkes, even pick up copies of your favorite Disney movies at a local store. It wasn’t even that long ago. The last time I visited, in 2010, the company didn’t record what I ate for dinner or detect that I went on Pirates of the Caribbean five times. It was none of their business.

But sometime in the last few years, tracking and tracing became their business. Like many corporations out there, Walt Disney Studios spent the last decade transforming into a data company.

The theme parks alone are a data scientist’s dream. Just imagine: 50,000 visitors a day, most equipped with cell phones and a specialized app. Millions of location traces, along with rides statistics, lineup times, and food-order preferences. Thousands and thousands of credit card swipes, each populating a database with names and addresses, each one linking purchases across the park grounds.1 A QR-code scavenger hunt that records the path people took through Star Wars: Galaxy’s Edge. Hotel keycards with entrance times, purchases, snack orders, and more. Millions of photos snapped on rides and security cameras throughout the park, feeding facial-recognition systems. Tickets with names, birthdates, and portraits attached. At Florida’s Disney World, MagicBands—bracelets using RFID (radio-frequency identification) technology—around visitors’ wrists gather all that information plus fingerprints in one place, while sensors ambiently detect their every move. What couldn’t you do with all that data?…(More)”.

Secondary data for global health digitalisation


Paper by Anatol-Fiete Näher, et al: “Substantial opportunities for global health intelligence and research arise from the combined and optimised use of secondary data within data ecosystems. Secondary data are information being used for purposes other than those intended when they were collected. These data can be gathered from sources on the verge of widespread use such as the internet, wearables, mobile phone apps, electronic health records, or genome sequencing. To utilise their full potential, we offer guidance by outlining available sources and approaches for the processing of secondary data. Furthermore, in addition to indicators for the regulatory and ethical evaluation of strategies for the best use of secondary data, we also propose criteria for assessing reusability. This overview supports more precise and effective policy decision making leading to earlier detection and better prevention of emerging health threats than is currently the case…(More)”.

How can health data be used for public benefit? 3 uses that people agree on


Article by Alison Papricia et al: “Health data can include information about health-care services, health status and behaviours, medications and genetic data, in addition to demographic information like age, education and neighbourhood.

These facts and statistics are valuable because they offer insights and information about population health and well-being. However, they can also be sensitive, and there are legitimate public concerns about how these data are used, and by whom. The term “social licence” describes uses of health data that have public support.

Studies performed in Canada, the United Kingdom and internationally have all found public support and social licence for uses of health data that produce public benefits.

However, this support is conditional. Public concerns related to privacy, commercial motives, equity and fairness must be addressed.

Our team of health policy researchers set out to build upon prior studies with actionable advice from a group of 20 experienced public and patient advisers. Studies have shown that health data use, sharing and reuse is a complex topic. So we recruited people who already had some knowledge of potential uses of health data through their roles advising research institutions, hospitals, community organizations and governments.

We asked these experienced advisers to exchange views about uses of health data that they supported or opposed. We also gathered participants’ views about requirements for social licence, such as privacy, security and transparency.

Consensus views: After hours of facilitated discussion and weeks of reflection, all 20 participants agreed on some applications and uses of health data that are within social licence, and some that are not.

Participants agreed it is within social licence for health data to be used by:

  • health-care practitioners — to directly improve the health-care decisions and services provided to a patient.
  • governments, health-care facilities and health-system administrators — to understand and improve health care and the health-care system.
  • university-based researchers — to understand the drivers of disease and well-being.

Participants agreed that it is not within social licence for:

  • an individual or organization to sell (or re-sell) another person’s identified health data.
  • health data to be used for a purpose that has no patient, public or societal benefit.

Points of disagreement: Among other topics, the participants discussed uses of health data about systemically marginalized populations and companies using health data. Though some participants saw benefits from both practices, there was not consensus support for either.

For example, participants were concerned that vulnerable populations could be exploited, and that companies would put profit ahead of public benefits. Participants also worried that if harms were done by companies or to marginalized populations, they could not be “undone.” Several participants expressed skepticism about whether risks could be managed, even if additional safeguards are in place.

The participants also had different views about what constitutes an essential requirement for social licence. This included discussions about benefits, governance, patient consent and involvement, equity, privacy and transparency.

Collectively, they generated a list of 85 essential requirements, but 38 of those requirements were only seen as essential by one person. There were also cases where some participants actively opposed a requirement that another participant thought was essential…(More)”

Social media is too important to be so opaque with its data


Article by Alex González Ormerod: “Over 50 people were killed by the police during demonstrations in Peru. Brazil is reeling from a coup attempt in its capital city. The residents of Culiacán, a city in northern Mexico, still cower in their houses after the army swooped in to arrest a cartel kingpin. Countries across Latin America have kicked off the year with turmoil. 

It is almost a truism to say that the common factor in these events has been the role of social media. Far-right radicals in Brazil were seen to be openly organizing and spreading fake news about electoral fraud on Twitter. Peruvians used TikTok to bear witness to police brutality, preserving it for posterity.

Dealing with the aftermath of the crises, in Culiacán, Sinaloans shared crucial info as to where roadblocks continued to burn, and warned about shootouts in certain neighborhoods. Brazilians opened up Instagram and other social channels to compile photos and other evidence that might help the police bring the Brasília rioters to justice.

These events could be said to have happened online as much as they did offline, yet we know next to nothing about the inner workings of the platforms they occurred on.

People covering these platforms face a common refrain: After reaching out for basic social media data, they will often get a reply saying, “Unfortunately we do not have the information you need at this time.” (This particular quote came from Alberto de Golin, a PR agency representative for TikTok Mexico)…(More)”

Civic Switchboard: Connecting Libraries and Community Information Networks


Civic Switchboard is an Institute of Museum and Library Services supported effort that aims to develop the capacity of academic and public libraries in civic data ecosystems.

We encourage partnerships between libraries and local data intermediaries that will better serve data users, further democratize data, and support equitable access to information. Our project is created an online guide and toolkit for libraries interested in expanding (or beginning) their role around civic information…(More)”.

Governing Smart Cities as Knowledge Commons


Book edited by Brett M. Frischmann, Michael J. Madison, and Madelyn Rose Sanfilippo: “The rise of ‘smart’ – or technologically advanced – cities has been well documented, while governance of such technology has remained unresolved. Integrating surveillance, AI, automation, and smart tech within basic infrastructure as well as public and private services and spaces raises a complex set of ethical, economic, political, social, and technological questions. The Governing Knowledge Commons (GKC) framework provides a descriptive lens through which to structure case studies examining smart tech deployment and commons governance in different cities. This volume deepens our understanding of community governance institutions, the social dilemmas communities face, and the dynamic relationships between data, technology, and human lives. For students, professors, and practitioners of law and policy dealing with a wide variety of planning, design, and regulatory issues relating to cities, these case studies illustrate options to develop best practice. Available through Open Access, the volume provides detailed guidance for communities deploying smart tech…(More)”

A Comparative Study of Citizen Crowdsourcing Platforms and the Use of Natural Language Processing (NLP) for Effective Participatory Democracy


Paper by Carina Antonia Hallin: ‘The use of crowdsourcing platforms to harness citizen insights for policymaking has gained increasing importance in regional and national policy planning. Participatory democracy using crowdsourcing platforms includes various initiatives, such as generating ideas for new law reforms (Aitamurto and Landemore 2015], economic development, and solving challenges related to how to create inclusive social actions and interventions for better, healthier, and more prosperous local communities (Bentley and Pugalis, 2014). Such case observations, coupled with the increasing prevalence of internet-based communication, point to the real benefits of implementing participatory democracies on a mass scale in which citizens are invited to contribute their ideas, opinions, and deliberations (Salganik and Levy 2015). By adopting collective intelligence platforms, public authorities can harness local knowledge from citizens to find the right ‘policy mix’ and collaborate with citizens and relevant actors in the policymaking processes. This comparative study aims to validate the adoption of collective intelligence and artificial intelligence/natural language processing (NLP) on crowdsourcing platforms for effective participatory democracy and policymaking in local governments. The study compares 15 citizen crowdsourcing platforms, including Natural language Processing (NLP), for policymaking across Europe and the United States. The study offers a framework for working with citizen crowdsourcing platforms and exploring the usefulness of NLP on the platforms for effective participatory democracy…(More)”.

Human-AI Teaming


Report by the National Academies of Sciences, Engineering, and Medicine: “Although artificial intelligence (AI) has many potential benefits, it has also been shown to suffer from a number of challenges for successful performance in complex real-world environments such as military operations, including brittleness, perceptual limitations, hidden biases, and lack of a model of causation important for understanding and predicting future events. These limitations mean that AI will remain inadequate for operating on its own in many complex and novel situations for the foreseeable future, and that AI will need to be carefully managed by humans to achieve their desired utility.

Human-AI Teaming: State-of-the-Art and Research Needs examines the factors that are relevant to the design and implementation of AI systems with respect to human operations. This report provides an overview of the state of research on human-AI teaming to determine gaps and future research priorities and explores critical human-systems integration issues for achieving optimal performance…(More)”

Filling Public Data Gaps


Report by Judah Axelrod, Karolina Ramos, and Rebecca Bullied: “Data are central to understanding the lived experiences of different people and communities and can serve as a powerful force for promoting racial equity. Although public data, including foundational sources for policymaking such as the US Census Bureau’s American Community Survey (ACS), offer accessible information on a range of topics, challenges of timeliness, granularity, representativeness, and degrees of disaggregation can limit those data’s utility for real-time analysis. Private data—data produced by private-sector organizations either through standard business or to market as an asset for purchase—can serve as a richer, more granular, and higher-frequency supplement or alternative to public data sources. This raises questions about how well private data assets can offer race-disaggregated insights that can inform policymaking.

In this report, we explore the current landscape of public-private data sharing partnerships that address topic areas where racial equity research faces data gaps: wealth and assets, financial well-being and income, and employment and job quality. We held 20 semistructured interviews with current producers and users of private-sector data and subject matter experts in the areas of data-sharing models and ethical data usage. Our findings are divided into five key themes:

  • Incentives and disincentives, benefits, and risks to public-private data sharing
    Agreements with prestigious public partners can bolster credibility for private firms and broaden their customer base, while public partners benefit from access to real-time, granular, rich data sources. But data sharing is often time and labor intensive, and firms can be concerned with conflicting business interests or diluting the value of proprietary data assets.
  • Availability of race-disaggregated data sources
    We found no examples in our interviews of race-disaggregated data sources related to our thematic focus areas that are available externally. However, there are promising methods for data imputation, linkage, and augmentation through internal surveys.
  • Data collaboratives in practice
    Most public-private data sharing agreements we learned about are between two parties and entail free or “freemium” access. However, we found promising examples of multilateral agreements that diversify the data-sharing landscape.
  • From data champions to data stewards
    We found many examples of informal data champions who bear responsibility for relationship-building and securing data partnerships. This role has yet to mature to an institutionalized data steward within private firms we interviewed, which can make data sharing a fickle process.
  • Considerations for ethical data usage
    Data privacy and transparency about how data are accessed and used are prominent concerns among prospective data users. Interviewees also stressed the importance of not privileging existing quantitative data above qualitative insights in cases where communities have offered long-standing feedback and narratives about their own experiences facing racial inequities, and that policymakers should not use a need to collect more data as an excuse for delaying policy action.

Our research yielded several recommendations for data producers and users that engage in data sharing, and for funders seeking to advance data-sharing efforts and promote racial equity…(More)”