Big Data and AI – A transformational shift for government: So, what next for research?


Irina Pencheva, Marc Esteve and Slava Jenkin Mikhaylov in Public Policy and Administration: “Big Data and artificial intelligence will have a profound transformational impact on governments around the world. Thus, it is important for scholars to provide a useful analysis on the topic to public managers and policymakers. This study offers an in-depth review of the Policy and Administration literature on the role of Big Data and advanced analytics in the public sector. It provides an overview of the key themes in the research field, namely the application and benefits of Big Data throughout the policy process, and challenges to its adoption and the resulting implications for the public sector. It is argued that research on the subject is still nascent and more should be done to ensure that the theory adds real value to practitioners. A critical assessment of the strengths and limitations of the existing literature is developed, and a future research agenda to address these gaps and enrich our understanding of the topic is proposed…(More)”.

Our Infant Information Revolution


Joseph Nye at Project Syndicate: “…When people are overwhelmed by the volume of information confronting them, it is hard to know what to focus on. Attention, not information, becomes the scarce resource. The soft power of attraction becomes an even more vital power resource than in the past, but so does the hard, sharp power of information warfare. And as reputation becomes more vital, political struggles over the creation and destruction of credibility multiply. Information that appears to be propaganda may not only be scorned, but may also prove counterproductive if it undermines a country’s reputation for credibility.

During the Iraq War, for example, the treatment of prisoners at Abu Ghraib and Guantanamo Bay in a manner inconsistent with America’s declared values led to perceptions of hypocrisy that could not be reversed by broadcasting images of Muslims living well in America. Similarly, President Donald Trump’s tweets that prove to be demonstrably false undercut American credibility and reduce its soft power.

The effectiveness of public diplomacy is judged by the number of minds changed (as measured by interviews or polls), not dollars spent. It is interesting to note that polls and the Portland index of the Soft Power 30show a decline in American soft power since the beginning of the Trump administration. Tweets can help to set the global agenda, but they do not produce soft power if they are not credible.

Now the rapidly advancing technology of artificial intelligence or machine learning is accelerating all of these processes. Robotic messages are often difficult to detect. But it remains to be seen whether credibility and a compelling narrative can be fully automated….(More)”.

Data Protection and e-Privacy: From Spam and Cookies to Big Data, Machine Learning and Profiling


Chapter by Lilian Edwards in L Edwards ed Law, Policy and the Internet (Hart , 2018): “In this chapter, I examine in detail how data subjects are tracked, profiled and targeted by their activities on line and, increasingly, in the “offline” world as well. Tracking is part of both commercial and state surveillance, but in this chapter I concentrate on the former. The European law relating to spam, cookies, online behavioural advertising (OBA), machine learning (ML) and the Internet of Things (IoT) is examined in detail, using both the GDPR and the forthcoming draft ePrivacy Regulation. The chapter concludes by examining both code and law solutions which might find a way forward to protect user privacy and still enable innovation, by looking to paradigms not based around consent, and less likely to rely on a “transparency fallacy”. Particular attention is drawn to the new work around Personal Data Containers (PDCs) and distributed ML analytics….(More)”.

Why Do We Care So Much About Privacy?


Louis Menand in The New Yorker: “…Possibly the discussion is using the wrong vocabulary. “Privacy” is an odd name for the good that is being threatened by commercial exploitation and state surveillance. Privacy implies “It’s nobody’s business,” and that is not really what Roe v. Wade is about, or what the E.U. regulations are about, or even what Katz and Carpenter are about. The real issue is the one that Pollak and Martin, in their suit against the District of Columbia in the Muzak case, said it was: liberty. This means the freedom to choose what to do with your body, or who can see your personal information, or who can monitor your movements and record your calls—who gets to surveil your life and on what grounds.

As we are learning, the danger of data collection by online companies is not that they will use it to try to sell you stuff. The danger is that that information can so easily fall into the hands of parties whose motives are much less benign. A government, for example. A typical reaction to worries about the police listening to your phone conversations is the one Gary Hart had when it was suggested that reporters might tail him to see if he was having affairs: “You’d be bored.” They were not, as it turned out. We all may underestimate our susceptibility to persecution. “We were just talking about hardwood floors!” we say. But authorities who feel emboldened by the promise of a Presidential pardon or by a Justice Department that looks the other way may feel less inhibited about invading the spaces of people who belong to groups that the government has singled out as unpatriotic or undesirable. And we now have a government that does that….(More)”.

Data Stewards: Data Leadership to Address 21st Century Challenges


Post by Stefaan Verhulst: “…Over the last two years, we have focused on the opportunities (and challenges) surrounding what we call “data collaboratives.” Data collaboratives are an emerging form of public-private partnership, in which information held by companies (or other entities) is shared with the public sector, civil society groups, research institutes and international organizations. …

For all its promise, the practice of data collaboratives remains ad hoc and limited. In part, this is a result of the lack of a well-defined, professionalized concept of data stewardship within corporations that has a mandate to explore ways to harness the potential of their data towards positive public ends.

Today, each attempt to establish a cross-sector partnership built on the analysis of private-sector data requires significant and time-consuming efforts, and businesses rarely have personnel tasked with undertaking such efforts and making relevant decisions.

As a consequence, the process of establishing data collaboratives and leveraging privately held data for evidence-based policy making and service delivery is onerous, generally one-off, not informed by best practices or any shared knowledge base, and prone to dissolution when the champions involved move on to other functions.

By establishing data stewardship as a corporate function, recognized and trusted within corporations as a valued responsibility, and by creating the methods and tools needed for responsible data-sharing, the practice of data collaboratives can become regularized, predictable, and de-risked….

To take stock of current practice and scope needs and opportunities we held a small yet in-depth kick-off event at the offices of the Cloudera Foundation in San Francisco on May 8th 2018 that was attended by representatives from Linkedin, Facebook, Uber, Mastercard, DigitalGlobe, Cognizant, Streetlight Data, the World Economic Forum, and Nethope — among others.

Four Key Take Aways

The discussions were varied and wide-ranging.

Several reflected on the risks involved — including the risks of NOT sharing or collaborating on privately held data that could improve people’s lives (and in some occasions save lives).

Others warned that the window of opportunity to increase the practice of data collaboratives may be closing — given new regulatory requirements and other barriers that may disincentivize corporations from engaging with third parties around their data.

Ultimately four key take aways emerged. These areas — at the nexus of opportunities and challenges — are worth considering further, because they help us better understand both the potential and limitations of data collaboratives….(More)”

Blockchain in Cities


Report by Brooks Rainwater at the National League of Cities: “Public trust in American lawmakers (particularly at the national level), elections and democratic institutions has plummeted in recent years. While there are many contributing factors, the explosion of digital information, digital misinformation and outright abuse has played a major role in this downward trend.

To restore confidence in the core tenets of our society, leaders need solutions tailored to an increasingly digital world. Additionally, blockchain presents direct opportunities for cities — voting, real estate, transportation, energy, water management and more. The potential exists for local governments to utilize blockchain to lower costs, improve efficiency and create a framework to accelerate innovation, access and accountability in public management.

Blockchain is a shared database or distributed ledger, located permanently online for anything represented digitally, such as rights, goods and property. At its core, it is a secure, inalterable electronic register. Through enhanced trust, consensus and autonomy, blockchain brings widespread decentralization. This is a departure from the traditional role that centralized intermediaries or entities — such as banks — played to manage our valuable transfers. Its inherent transparency promotes relationships and builds confidence.

In the early days of the internet, few people could have predicted the magnitude of the disruption it would cause and the pivotal role it would play in globalization. Some experts say blockchain will potentially change the nature and security of all interactions of value. Because blockchain has large implications for individuals, it will have even larger ramifications for cities.

Here are seven key ways that cities can explore blockchain now:

  • Use blockchain to expand digital inclusion initiatives and help support the un- and under-banked.
  • Explore options for using blockchain in governance, procurement processes and business licensing.
  • Consider blockchain to increase civic engagement and offer additional pathways for voting.
  • Investigate how blockchain can help strengthen local alternative energy initiatives.
  • Prepare for the utilization of blockchain for digital transportation infrastructure needs as autonomous vehicles are more broadly deployed in cities.
  • While the benefits could be manifold, be cognizant of the potential for negative externalities that will need to be addressed and make sure that cities give themselves time to absorb each impact of introducing this technology.
  • Pay attention to what other cities have experienced and learned when it comes to blockchain. And above all, keep an open mind and be open to change. This new technology might just bring some unexpected yet very welcome benefits to your city and its residents….(More)”.

Latin America is fighting corruption by opening up government data


Anoush Darabi in apolitical: “Hardly a country in Latin America has been untouched by corruption scandals; this was just one of the more bizarre episodes. In response, using a variety of open online platforms, both city and national governments are working to lift the lid on government activity, finding new ways to tackle corruption with technology….

In Buenos Aires, government is dealing with the problem by making the details of all its public works projects completely transparent. With BA Obras, an online platform, the city maps projects across the city, and lists detailed information on their cost, progress towards completion and the names of the contractors.

“We allocate an enormous amount of money,” said Alvaro Herrero, Under Secretary for Strategic Management and Institutional Quality for the government of Buenos Aires, who helped to build the tool. “We need to be accountable to citizens in terms of what are we doing with that money.”

The portal is designed to be accessible to the average user. Citizens can filter the map to focus on their neighbourhood, revealing information on existing projects with the click of a mouse.

“A journalist called our communications team a couple of weeks ago,” said Herrero. “He said: ‘I want all the information on all the infrastructure projects that the government has, and I want the documentation.’ Our guy’s answer was, ‘OK, I will send you all the information in ten seconds.’ All he had to do was send a link to the platform.”

Since launching in October 2017 with 80 public works projects, the platform now features over 850. It has had 75,000 unique views, the majority coming in the month after launching.

Making people aware and encouraging them to use it is key. “The main challenge is not the platform itself, but getting residents to use it,” said Herrero. “We’re still in that process.”

Brazil’s public spending checkers

Brazil is using big data analysis to scrutinise its spending via its Public Expenditure Observatory (ODP).

The ODP was founded in 2008 to help monitor spending across government departments systematically. In such a large country, spending data is difficult to pull together, and its volume makes it difficult to analyse. The ODP pulls together disparate information from government databases across the country into a central location, puts it into a consistent format and analyses it for inconsistency. Alongside analysis, the ODP also makes the data public.

For example, in 2010 the ODP analysed expenses made on credit cards by federal government officers. They discovered that 11% of all transactions that year were suspicious, requiring further investigation. After the data was published, credit card expenditure dropped by 25%….(More)”.

Using Satellite Imagery to Revolutionize Creation of Tax Maps and Local Revenue Collection


World Bank Policy Research Paper by Daniel Ayalew Ali, Klaus Deininger and Michael Wild: “The technical complexity of ensuring that tax rolls are complete and valuations current is often perceived as a major barrier to bringing in more property tax revenues in developing countries.

This paper shows how high-resolution satellite imagery makes it possible to assess the completeness of existing tax maps by estimating built-up areas based on building heights and footprints. Together with information on sales prices from the land registry, targeted surveys, and routine statistical data, this makes it possible to use mass valuation procedures to generate tax maps. The example of Kigali illustrates the reliability of the method and the potentially far-reaching revenue impacts. Estimates show that heightened compliance and a move to a 1 percent ad valorem tax would yield a tenfold increase in revenue from public land….(More)”.

I want your (anonymized) social media data


Anthony Sanford at The Conversation: “Social media sites’ responses to the Facebook-Cambridge Analytica scandal and new European privacy regulations have given users much more control over who can access their data, and for what purposes. To me, as a social media user, these are positive developments: It’s scary to think what these platforms could do with the troves of data available about me. But as a researcher, increased restrictions on data sharing worry me.

I am among the many scholars who depend on data from social media to gain insights into people’s actions. In a rush to protect individuals’ privacy, I worry that an unintended casualty could be knowledge about human nature. My most recent work, for example, analyzes feelings people express on Twitter to explain why the stock market fluctuates so much over the course of a single day. There are applications well beyond finance. Other scholars have studied mass transit rider satisfactionemergency alert systems’ function during natural disasters and how online interactions influence people’s desire to lead healthy lifestyles.

This poses a dilemma – not just for me personally, but for society as a whole. Most people don’t want social media platforms to share or sell their personal information, unless specifically authorized by the individual user. But as members of a collective society, it’s useful to understand the social forces at work influencing everyday life and long-term trends. Before the recent crises, Facebook and other companies had already been making it hard for legitimate researchers to use their data, including by making it more difficult and more expensive to download and access data for analysis. The renewed public pressure for privacy means it’s likely to get even tougher….

It’s true – and concerning – that some presumably unethical people have tried to use social media data for their own benefit. But the data are not the actual problem, and cutting researchers’ access to data is not the solution. Doing so would also deprive society of the benefits of social media analysis.

Fortunately, there is a way to resolve this dilemma. Anonymization of data can keep people’s individual privacy intact, while giving researchers access to collective data that can yield important insights.

There’s even a strong model for how to strike that balance efficiently: the U.S. Census Bureau. For decades, that government agency has collected extremely personal data from households all across the country: ages, employment status, income levels, Social Security numbers and political affiliations. The results it publishes are very rich, but also not traceable to any individual.

It often is technically possible to reverse anonymity protections on data, using multiple pieces of anonymized information to identify the person they all relate to. The Census Bureau takes steps to prevent this.

For instance, when members of the public access census data, the Census Bureau restricts information that is likely to identify specific individuals, such as reporting there is just one person in a community with a particularly high- or low-income level.

For researchers the process is somewhat different, but provides significant protections both in law and in practice. Scholars have to pass the Census Bureau’s vetting process to make sure they are legitimate, and must undergo training about what they can and cannot do with the data. The penalties for violating the rules include not only being barred from using census data in the future, but also civil fines and even criminal prosecution.

Even then, what researchers get comes without a name or Social Security number. Instead, the Census Bureau uses what it calls “protected identification keys,” a random number that replaces data that would allow researchers to identify individuals.

Each person’s data is labeled with his or her own identification key, allowing researchers to link information of different types. For instance, a researcher wanting to track how long it takes people to complete a college degree could follow individuals’ education levels over time, thanks to the identification keys.

Social media platforms could implement a similar anonymization process instead of increasing hurdles – and cost – to access their data…(More)” .

User Perceptions of Privacy in Smart Homes


Paper by Serena Zheng, Marshini Chetty, and Nick Feamster: “Despite the increasing presence of Internet of Things (IoT) devices inside the home, we know little about how users feel about their privacy living with Internet-connected devices that continuously monitor and collect data in their homes. To gain insight into this state of affairs, we conducted eleven semi-structured interviews with owners of smart homes, investigating privacy values and expectations.

In this paper, we present the findings that emerged from our study: First, users prioritize the convenience and connectedness of their smart homes, and these values dictate their privacy opinions and behaviors. Second, user opinions about who should have access to their smart home data depend on the perceived benefit. Third, users assume their privacy is protected because they trust the manufacturers of their IoT devices. Our findings bring up several implications for IoT privacy, which include the need for design for privacy and evaluation standards….(More)”.