Balancing Privacy With Data Sharing for the Public Good


David Deming at the New York Times: “Governments and technology companies are increasingly collecting vast amounts of personal data, prompting new laws, myriad investigations and calls for stricter regulation to protect individual privacy.

Yet despite these issues, economics tells us that society needs more data sharing rather than less, because the benefits of publicly available data often outweigh the costs. Public access to sensitive health records sped up the development of lifesaving medical treatments like the messenger-RNA coronavirus vaccines produced by Moderna and Pfizer. Better economic data could vastly improve policy responses to the next crisis.

Data increasingly powers innovation, and it needs to be used for the public good, while individual privacy is protected. This is new and unfamiliar terrain for policymaking, and it requires a careful approach.

The pandemic has brought the increasing dominance of big, data-gobbling tech companies into sharp focus. From online retail to home entertainment, digitally savvy businesses are collecting data and deploying it to anticipate product demand and set prices, lowering costs and outwitting more traditional competitors.

Data provides a record of what has already happened, but its main value comes from improving predictions. Companies like Amazon choose products and prices based on what you — and others like you — bought in the past. Your data improves their decision-making, boosting corporate profits.

Private companies also depend on public data to power their businesses. Redfin and Zillow disrupted the real estate industry thanks to access to public property databases. Investment banks and consulting firms make economic forecasts and sell insights to clients using unemployment and earnings data collected by the Department of Labor. By 2013, one study estimated, public data contributed at least $3 trillion per year to seven sectors of the economy worldwide.

The buzzy refrain of the digital age is that “data is the new oil,” but this metaphor is inaccurate. Data is indeed the fuel of the information economy, but it is more like solar energy than oil — a renewable resource that can benefit everyone at once, without being diminished….(More)”.

Regulation of Algorithmic Tools in the United States


Paper by Christopher S. Yoo and Alicia Lai: “Policymakers in the United States have just begun to address regulation of artificial intelligence technologies in recent years, gaining momentum through calls for additional research funding, piece-meal guidance, proposals, and legislation at all levels of government. This Article provides an overview of high-level federal initiatives for general artificial intelligence (AI) applications set forth by the U.S. president and responding agencies, early indications from the incoming Biden Administration, targeted federal initiatives for sector-specific AI applications, pending federal legislative proposals, and state and local initiatives. The regulation of the algorithmic ecosystem will continue to evolve as the United States continues to search for the right balance between ensuring public safety and transparency and promoting innovation and competitiveness on the global stage….(More)”.

Reddit Is America’s Unofficial Unemployment Hotline


Ella Koeze at The New York Times: “In early December, Alex Branch’s car broke down. A 23-year-old former arcade employee in southern Virginia, Mr. Branch had been receiving unemployment benefits since he was laid off in March, and figured he would have no problem paying for the repairs. But when he checked his bank account, he was troubled to find that the payments had stopped.

He had failed to get useful information from his state’s unemployment office before, so he turned to the one place he figured he could get an explanation: Reddit.

“I’m very confused and have no idea what to do,” Mr. Branch wrote on r/Unemployment, a Reddit forum whose popularity has skyrocketed during the pandemic.

The next day, another user commented on Mr. Branch’s post, using a common abbreviation for Extended Benefits, an emergency unemployment program. “Were you on EB? If so, EB was cut off Nov 21.”

Mr. Branch hadn’t realized he had been on Extended Benefits, which kicked in after he exhausted 26 weeks of regular unemployment plus 13 additional weeks granted in the March pandemic stimulus bill. Virginia stopped payments because the state’s unemployment rate had fallen under 5 percent, triggering an end to federal funding for the Extended Benefits program.

“I didn’t know about it,” he said in an interview. “That’s the biggest frustration that I had about it was the fact that I never received the email that it was going to be shut off.”

For many of the millions of Americans like Mr. Branch who lost jobs because of the coronavirus, the stress of being unemployed in a pandemic has been compounded by the difficulty of navigating disorganized and often antiquated state and federal unemployment systems. Information from overwhelmed state offices and websites is often confusing, and reaching an official who can answer questions nearly impossible….

Post after post on r/Unemployment conveys bureaucratic problems with endless variations: how to file a claim depending on your circumstances, what to do if you made a mistake on your claim, what different statuses on your claim might mean, how to navigate confusing and glitch-prone online portals and even how to speak to an actual person to get issues resolved….

Many people come to r/Unemployment to offer answers, not just find them.

Albert Peers, who had been working in a call center in San Diego until the pandemic, spends time every day trying to answer questions about California’s system. He lives alone and can’t easily return to work because he has a lowered immune system. After first visiting the site when he encountered a hitch in his own unemployment benefits, Mr. Peers, 56, was shocked by the number of people who had no idea what to do.

The thought that someone might go hungry or miss rent because they were simply stymied by the system was unacceptable to him. “At that point I just made a decision,” he said. “You know what, like a couple hours every day, because I just can’t turn away.”…(More)”.

Atlas of AI: Power, Politics, and the Planetary Costs of Artificial Intelligence


Book by Kate Crawford: “What happens when artificial intelligence saturates political life and depletes the planet? How is AI shaping our understanding of ourselves and our societies? In this book Kate Crawford reveals how this planetary network is fueling a shift toward undemocratic governance and increased inequality. Drawing on more than a decade of research, award-winning science, and technology, Crawford reveals how AI is a technology of extraction: from the energy and minerals needed to build and sustain its infrastructure, to the exploited workers behind “automated” services, to the data AI collects from us. 

Rather than taking a narrow focus on code and algorithms, Crawford offers us a political and a material perspective on what it takes to make artificial intelligence and where it goes wrong. While technical systems present a veneer of objectivity, they are always systems of power. This is an urgent account of what is at stake as technology companies use artificial intelligence to reshape the world…(More)”.

Governance for Innovation and Privacy: The Promise of Data Trusts and Regulatory Sandboxes


Essay by Chantal Bernier: “Innovation feeds on data, both personal, identified data and de-identified data. To protect the data from increasing privacy risks, governance structures emerge to allow the use and sharing of data as necessary for innovation while addressing privacy risks. Two frameworks proposed to fulfill this purpose are data trusts and regulatory sandboxes.

The Government of Canada introduced the concept of “data trust” into the Canadian privacy law modernization discussion through Canada’s Digital Charter in Action: A Plan by Canadians, for Canadians, to “enable responsible innovation.” At a high level, a data trust may be defined, according to the Open Data Institute, as a legal structure that is appropriate to the data sharing it is meant to govern and that provides independent stewardship of data.

Bill C-11, known as the Digital Charter Implementation Act, 2020, and tabled on November 17, 2020, lays the groundwork for the possibility of creating data trusts for private organizations to disclose de-identified data to specific public institutions for “socially beneficial purposes.” In her recent article “Replacing Canada’s 20-Year-Old Data Protection Law,” Teresa Scassa provides a superb overview and analysis of the bill.

Another instrument for privacy protective innovation is referred to as the “regulatory sandbox.” The United Kingdom’s Information Commissioner’s Office (ICO) provides a regulatory sandbox service that encourages organizations to submit innovative initiatives without fear of enforcement action. From there, the ICO sandbox team provides advice related to privacy risks and how to embed privacy protection.

Both governance measures may hold the future of privacy and innovation, provided that we accept this equation: De-identified data may no longer be considered irrevocably anonymous and therefore should not be released unconditionally, but the risk of re-identification is so remote that the data may be released under a governance structure that mitigates the residual privacy risk.  

Innovation Needs Identified Personal Data and De-identified Data   

The role of data in innovation does not need to be explained. Innovation requires a full understanding of what is, to project toward what could be. The need for personal data, however, calls for far more than an explanation. Its use must be justified. Applications abound, and they may not be obvious to the layperson. Researchers and statisticians, however, underline the critical role of personal data with one word: reliability.

Processing data that can be traced, either through identifiers or through pseudonyms, allows superior machine learning, longitudinal studies and essential correlations, which provide, in turn, better data in which to ground innovation. Statistics Canada has developed a “Continuum of Microdata Access” to its databases on the premise that “researchers require access to microdata at the individual business, household or person level for research purposes. To preserve the privacy and confidentiality of respondents, and to encourage the use of microdata, Statistics Canada offers a wide range of options through a series of online channels, facilities and programs.”

Since the first national census in 1871, Canada has put data — derived from personal data collected through the census and surveys — to good use in the public and private sectors alike. Now, new privacy risks emerge, as the unprecedented volume of data collection and the power of analytics bring into question the notion that the de-identification of data — and therefore its anonymization — is irreversible.

And yet, data to inform innovation for the good of humanity cannot exclude data about humans. So, we must look to governance measures to release de-identified data for innovation in a privacy-protective manner. …(More)”.

N.Y.’s Vaccine Websites Weren’t Working. He Built a New One for $50.


Sharon Otterman at New York Times: “Huge Ma, a 31-year-old software engineer for Airbnb, was stunned when he tried to make a coronavirus vaccine appointment for his mother in early January and saw that there were dozens of websites to check, each with its own sign-up protocol. The city and state appointment systems were completely distinct.

“There has to be a better way,” he said he remembered thinking.

So, he developed one. In less than two weeks, he launched TurboVax, a free website that compiles availability from the three main city and state New York vaccine systems and sends the information in real time to Twitter. It cost Mr. Ma less than $50 to build, yet it offers an easier way to spot appointments than the city and state’s official systems do.

“It’s sort of become a challenge to myself, to prove what one person with time and a little motivation can do,” he said last week. “This wasn’t a priority for governments, which was unfortunate. But everyone has a role to play in the pandemic, and I’m just doing the very little that I can to make it a little bit easier.”

Supply shortages and problems with access to vaccination appointments have been some of the barriers to the equitable distribution of the vaccine in New York City and across the United States, officials have acknowledged….(More)”.

Public Policy Analytics: Code & Context for Data Science in Government


Open Access Book by Ken Steif: “… teaches readers how to address complex public policy problems with data and analytics using reproducible methods in R. Each of the eight chapters provides a detailed case study, showing readers: how to develop exploratory indicators; understand ‘spatial process’ and develop spatial analytics; how to develop ‘useful’ predictive analytics; how to convey these outputs to non-technical decision-makers through the medium of data visualization; and why, ultimately, data science and ‘Planning’ are one and the same. A graduate-level introduction to data science, this book will appeal to researchers and data scientists at the intersection of data analytics and public policy, as well as readers who wish to understand how algorithms will affect the future of government….(More)”.

Solving Public Problems


“Today the Governance Lab (The GovLab) at the NYU Tandon School of Engineering launched a free, online course on Solving Public Problems. The 12-part program, presented by Beth Simone Noveck, and over two-dozen global changemakers, trains participants in the skills needed to move from demanding change to making it. 

Taking a practical approach to addressing entrenched problems, from systemic racism to climate change, the course combines the teaching of quantitative and qualitative methods with participatory and equitable techniques for tapping the collective wisdom of communities to design and deliver powerful solutions to contemporary problems. 

“We cannot expect to tackle tomorrow’s problems with yesterday’s toolkit,” said Noveck, a former advisor on open government to President Barack Obama. “In the 21st century, we must equip ourselves with the skills to solve public problems. But those skills are not innate, and this program is designed to help people learn how to implement workable solutions to our hardest but most important challenges.”  

Based on Professor Noveck’s new book, Solving Public Problems: A Practical Guide to Fix Government and Change the World (Yale University Press 2021), this online program is intended to democratize access to public problem-solving education, providing citizens with  innovative tools to tap the collective wisdom of communities to take effective, organized action for change. …(More)”.

Robot census: Gathering data to improve policymaking on new technologies


Essay by Robert Seamans: There is understandable excitement about the impact that new technologies like artificial intelligence (AI) and robotics will have on our economy. In our everyday lives, we already see the benefits of these technologies: when we use our smartphones to navigate from one location to another using the fastest available route or when a predictive typing algorithm helps us finish a sentence in our email. At the same time, there are concerns about possible negative effects of these new technologies on labor. The Council of Economic Advisers of the past two Administrations have addressed these issues in the annual Economic Report of the President (ERP). For example, the 2016 ERP included a chapter on technology and innovation that linked robotics to productivity and growth, and the 2019 ERP included a chapter on artificial intelligence that discussed the uneven effects of technological change. Both these chapters used data at highly aggregated levels, in part because that is the data that is available. As I’ve noted elsewhere, AI and robots are everywhere, except, as it turns out, in the data.

To date, there have been no large scale, systematic studies in the U.S. on how robots and AI affect productivity and labor in individual firms or establishments (a firm could own one or more establishments, which for example could be a plant in a manufacturing setting or a storefront in a retail setting). This is because the data are scarce. Academic researchers interested in the effects of AI and robotics on economic outcomes have mostly used aggregate country and industry-level data. Very recently, some have studied these issues at the firm level using data on robot imports to France, Spain, and other countries. I review a few of these academic papers in both categories below, which provide early findings on the nuanced role these new technologies have on labor. Thanks to some excellent work being done by the U.S. Census Bureau, however, we may soon have more data to work with. This includes new questions on robot purchases in the Annual Survey of Manufacturers and Annual Capital Expenditures Survey and new questions on other technologies including cloud computing and machine learning in the Annual Business Survey….(More)”.

Profiling Insurrection: Characterizing Collective Action Using Mobile Device Data


Paper by David Van Dijcke and Austin L. Wright: “We develop a novel approach for estimating spatially dispersed community-level participation in mass protest. This methodology is used to investigate factors associated with participation in the ‘March to Save America’ event in Washington, D.C. on January 6, 2021. This study combines granular location data from more than 40 million mobile devices with novel measures of community-level voting patterns, the location of organized hate groups, and the entire georeferenced digital archive of the social media platform Parler. We find evidence that partisanship, socio-political isolation, proximity to chapters of the Proud Boys organization, and the local activity on Parler are robustly associated with protest participation. Our research fills a prominent gap in the study of collective action: identifying and studying communities involved in mass-scale events that escalate into violent insurrection….(More)”.