The final Global Open Data Index is now live


Open Knowledge International: “The updated Global Open Data Index has been published today, along with our report on the state of Open Data this year. The report includes a broad overview of the problems we found around data publication and how we can improve government open data. You can download the full report here.

Also, after the Public Dialogue phase, we have updated the Index. You can see the updated edition here.

We will also keep our forum open for discussions about open data quality and publication. You can see the conversation here.”

Inside the Algorithm That Tries to Predict Gun Violence in Chicago


Gun violence in Chicago has surged since late 2015, and much of the news media attention on how the city plans to address this problem has focused on the Strategic Subject List, or S.S.L.

The list is made by an algorithm that tries to predict who is most likely to be involved in a shooting, either as perpetrator or victim. The algorithm is not public, but the city has now placed a version of the list — without names — online through its open data portal, making it possible for the first time to see how Chicago evaluates risk.

We analyzed that information and found that the assigned risk scores — and what characteristics go into them — are sometimes at odds with the Chicago Police Department’s public statements and cut against some common perceptions.

■ Violence in the city is less concentrated at the top — among a group of about 1,400 people with the highest risk scores — than some public comments from the Chicago police have suggested.

■ Gangs are often blamed for the devastating increase in gun violence in Chicago, but gang membership had a small predictive effect and is being dropped from the most recent version of the algorithm.

■ Being a victim of a shooting or an assault is far more predictive of future gun violence than being arrested on charges of domestic violence or weapons possession.

■ The algorithm has been used in Chicago for several years, and its effectiveness is far from clear. Chicago accounted for a large share of the increase in urban murders last year….(More)”.

Everybody Lies: Big Data, New Data, and What the Internet Can Tell Us About Who We Really Are


Book by Seth Stephens-Davidowitz: “Blending the informed analysis of The Signal and the Noise with the instructive iconoclasm of Think Like a Freak, a fascinating, illuminating, and witty look at what the vast amounts of information now instantly available to us reveal about ourselves and our world, provided we ask the right questions.

By the end of an average day in the early twenty-first century, human beings searching the internet will amass eight trillion gigabytes of data. This staggering amount of information, unprecedented in history, can tell us a great deal about who we are: the fears, desires, and behaviors that drive us, and the conscious and unconscious decisions we make. From the profound to the mundane, we can gain astonishing knowledge about the human psyche that, less than twenty years ago, seemed unfathomable.

Everybody Lies offers fascinating, surprising, and sometimes laugh-out-loud insights into everything from economics to ethics to sports to race to sex, gender and more, all drawn from the world of big data. What percentage of white voters didn’t vote for Barack Obama because he’s black? Does where you go to school affect how successful you are in life? Do parents secretly favor boy children over girls? Do violent films affect the crime rate? Can you beat the stock market? How regularly do we lie about our sex lives, and who’s more self-conscious about sex, men or women?

Investigating these questions and a host of others, Seth Stephens-Davidowitz offers revelations that can help us understand ourselves and our lives better. Drawing on studies and experiments on how we really live and think, he demonstrates in fascinating and often funny ways the extent to which all the world is indeed a lab. With conclusions ranging from strange-but-true to thought-provoking to disturbing, he explores the power of this digital truth serum and its deeper potential: revealing biases deeply embedded within us, information we can use to change our culture, and the questions we’re afraid to ask that might be essential to our health, both emotional and physical. All of us are touched by big data every day, and its influence is multiplying. Everybody Lies challenges us to think differently about how we see it and the world…(More)”.

Smart Cities: Foundations, Principles and Applications


Book by Houbing Song, Ravi Srinivasan, Tamim Sookoor, Sabina Jeschke: “Smart cities are emerging as a priority for research and development across the world. They open up significant opportunities in several areas, such as economic growth, health, wellness, energy efficiency, and transportation, to promote the sustainable development of cities. This book provides the basics of smart cities, and it examines the possible future trends of this technology. Smart Cities: Foundations, Principles, and Applications provides a systems science perspective in presenting the foundations and principles that span multiple disciplines for the development of smart cities.

Divided into three parts—foundations, principles, and applications—Smart Cities addresses the various challenges and opportunities of creating smart cities and all that they have to offer. It also covers smart city theory modeling and simulation, and examines case studies of existing smart cities from all around the world. In addition, the book:

  • Addresses how to develop a smart city and presents the state of the art and practice of smart cities around the world
  • Focuses on the foundations and principles needed for advancing the science, engineering, and technology of smart cities—including system design, system verification, real-time control and adaptation, Internet of Things, and test beds
  • Covers applications of smart cities as they relate to smart transportation/connected vehicle (CV) and Intelligent Transportation Systems (ITS) for improved mobility, safety, and environmental protection…(More)”

Slave to the Algorithm? Why a ‘Right to Explanation’ is Probably Not the Remedy You are Looking for


Paper by Lilian Edwards and Michael Veale: “Algorithms, particularly of the machine learning (ML) variety, are increasingly consequential to individuals’ lives but have caused a range of concerns revolving mainly around unfairness, discrimination and opacity. Transparency in the form of a “right to an explanation” has emerged as a compellingly attractive remedy since it intuitively presents as a means to “open the black box”, hence allowing individual challenge and redress, as well as possibilities to foster accountability of ML systems. In the general furore over algorithmic bias and other issues laid out in section 2, any remedy in a storm has looked attractive.

However, we argue that a right to an explanation in the GDPR is unlikely to be a complete remedy to algorithmic harms, particularly in some of the core “algorithmic war stories” that have shaped recent attitudes in this domain. We present several reasons for this conclusion. First (section 3), the law is restrictive on when any explanation-related right can be triggered, and in many places is unclear, or even seems paradoxical. Second (section 4), even were some of these restrictions to be navigated, the way that explanations are conceived of legally — as “meaningful information about the logic of processing” — is unlikely to be provided by the kind of ML “explanations” computer scientists have been developing. ML explanations are restricted both by the type of explanation sought, the multi-dimensionality of the domain and the type of user seeking an explanation. However (section 5) “subject-centric” explanations (SCEs), which restrict explanations to particular regions of a model around a query, show promise for interactive exploration, as do pedagogical rather than decompositional explanations in dodging developers’ worries of IP or trade secrets disclosure.

As an interim conclusion then, while convinced that recent research in ML explanations shows promise, we fear that the search for a “right to an explanation” in the GDPR may be at best distracting, and at worst nurture a new kind of “transparency fallacy”. However, in our final section, we argue that other parts of the GDPR related (i) to other individual rights including the right to erasure (“right to be forgotten”) and the right to data portability and (ii) to privacy by design, Data Protection Impact Assessments and certification and privacy seals, may have the seeds of building a better, more respectful and more user-friendly algorithmic society….(More)”

Facebook Disaster Maps


Molly Jackman et al. at Facebook: “After a natural disaster, humanitarian organizations need to know where affected people are located, what resources are needed, and who is safe. This information is extremely difficult and often impossible to capture through conventional data collection methods in a timely manner. As more people connect and share on Facebook, our data is able to provide insights in near-real time to help humanitarian organizations coordinate their work and fill crucial gaps in information during disasters. This morning we announced a Facebook disaster map initiative to help organizations address the critical gap in information they often face when responding to natural disasters.

Facebook disaster maps provide information about where populations are located, how they are moving, and where they are checking in safe during a natural disaster. All data is de-identified and aggregated to a 360 square meter tile or local administrative boundaries (e.g. census boundaries).

This blog describes the disaster maps datasets, how insights are calculated, and the steps taken to ensure that we’re preserving privacy….(More)”.
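As a rough illustration of what "de-identified and aggregated to a tile" can mean in practice, here is a minimal Python sketch. It is not Facebook's method: the function names, the assumption that locations are already projected to meters, and the tile side length are all hypothetical. The excerpt's "360 square meter tile" does not pin down a side length, so the sketch takes it as a parameter. Only per-tile counts are kept; no user identifiers are carried through.

```python
import math
from collections import Counter


def tile_key(x_m, y_m, tile_side_m):
    """Return the (column, row) index of the square tile containing a point.

    Coordinates are assumed to be in meters in some projected system
    (an assumption for this sketch, not something the excerpt specifies).
    """
    return (math.floor(x_m / tile_side_m), math.floor(y_m / tile_side_m))


def aggregate_to_tiles(points, tile_side_m):
    """Count points per tile, discarding everything except the tile index.

    `points` is an iterable of (x_m, y_m) pairs with no identifiers attached,
    so the output holds only aggregate counts.
    """
    return Counter(tile_key(x, y, tile_side_m) for x, y in points)


# Hypothetical sample locations, in meters.
sample_points = [(10.0, 10.0), (350.0, 20.0), (400.0, 10.0), (-5.0, 5.0)]
tile_counts = aggregate_to_tiles(sample_points, tile_side_m=360.0)
```

With the sample above, two points share tile `(0, 0)` and the other two each fall in their own tile. A real pipeline would likely add further safeguards, such as suppressing tiles with very small counts, but the excerpt does not describe those steps.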

Design and Implementation of Behavioral Informatics Interventions


Chapter by Liliana Laranjo, Annie Lau and Enrico Coiera in Cognitive Informatics in Health and Biomedicine: “The growing burden of chronic disease is drawing unprecedented attention to the importance of optimizing lifestyle behaviors. Interventions to promote behavior change seem promising, but their full potential can be missed when they are not easily disseminated or accessible to a larger audience. The ability of technology to address these issues, as well as to facilitate the tailoring of interventions, has led to the growing popularity of the field of behavioral informatics (BI).

Behavioral informatics interventions are designed to support patients and healthy consumers in modifying behaviors to improve health, with the help of computers, the Internet, mobile phones, wireless devices, or social media, among other technologies. To date, BI interventions have been applied in several health domains, from the promotion of healthy lifestyle behaviors to mental health and chronic disease self-management.

The effectiveness and impact of BI interventions are largely dependent on their meaningful design, development, evaluation, and implementation. Key elements for success include: performing a comprehensive observation and framing of the particular behavioral challenge within context; recognizing the relevant behavior change theories, models and techniques; having a deep understanding of user characteristics and needs; involving users throughout design and development; and refining the design through user-centred evaluation.

Due to the rapid pace of technology development, the evaluation of interventions and translation of research to practice are met with particular challenges. Innovative methodologies and implementation strategies are increasingly required to bring to fruition the potential of BI interventions in delivering cost-effective, personalized interventions, with broad scalability….(More)”.

Facebook Features Connect Lawmakers With Constituents


Griffin Connolly at RollCall: “Facebook users now have the option to pin “constituent badges” to their profiles, letting friends — and members of Congress — know which district they live in. And users can now search for articles, links, and posts that other residents in their districts engage with most frequently.

“When we think about civic engagement, we think about building communities of people,” said Erin Egan, Facebook’s vice president of U.S. public policy. “And this is about making sure that people engage with government.”

The new features can also help identify the issues that voters care about most. That could be a valuable tool for lawmakers — and their opponents — during election season.

“I’ve always been fascinated by how the internet helps citizens have a voice like never before,” Samidh Chakrabarti, Facebook’s head of civic engagement, told reporters and congressional staffers at the unveiling Wednesday. “But at the same time, it makes it more complicated for decision-makers to actually make sense of it all.”

Facebook’s new technology can help, he said.

The constituent badge unlocks a number of possibilities for lawmakers looking to engage voters more directly.

They can now choose to make their posts available exclusively to voters in their district, which Facebook calls “district targeting.” In the past, policymakers could only post publicly from their pages….

The badge tool also enables lawmakers to host virtual town halls with an exclusive audience of their own constituents via the Facebook Live streaming medium and to tailor their messages to a narrower band of local media sources and citizens.

One unintended consequence of these more private Facebook Live sessions is that reporters who don’t live in a lawmaker’s district may not be able to view them. Facebook users can only provide one address, and that determines their constituent badge. …

A number of tech-savvy lawmakers have led the charge in leveraging new social media features and platforms to promote their image and policies.

In March, two Texas House members, Democrat Beto O’Rourke and Republican Will Hurd, struck out on a multiday “bipartisan road trip” from the Lone Star State back to Washington, using Facebook Live and the livestreaming app Periscope to update viewers and answer questions on policy.

Democrat Rick Nolan and Republican Jason Lewis copied that approach in April when the two Minnesota congressmen traveled back to their home state.

Louisiana Republican Rep. Garret Graves, who spoke at the Facebook unveiling on Wednesday, has gone full bore with his use of Facebook Live, hosting town halls every Friday around lunchtime and periodically on evenings throughout the week….(More)”

The Citizen Marketer


Book by Joel Penney: “From hashtag activism to the flood of political memes on social media, the landscape of political communication is being transformed by the grassroots circulation of opinion on digital platforms and beyond. By exploring how everyday people assist in the promotion of political media messages to persuade their peers and shape the public mind, Joel Penney offers a new framework for understanding the phenomenon of viral political communication: the citizen marketer. Like the citizen consumer, the citizen marketer is guided by the logics of marketing practice, but, rather than being passive, actively circulates persuasive media to advance political interests. Such practices include using protest symbols in social media profile pictures, strategically tweeting links to news articles to raise awareness about select issues, sharing politically-charged internet memes and viral videos, and displaying mass-produced T-shirts, buttons, and bumper stickers that promote a favored electoral candidate or cause. Citizens view their participation in such activities not only in terms of how it may shape or influence outcomes, but as a statement of their own identity. As the book argues, these practices signal an important shift in how political participation is conceptualized and performed in advanced capitalist democratic societies, as they casually inject political ideas into the everyday spaces and places of popular culture.

While marketing is considered a dirty word in certain critical circles — particularly among segments of the left that have identified neoliberal market logics and consumer capitalist structures as a major focus of political struggle — some of these very critics have determined that the most effective way to push back against the forces of neoliberal capitalism is to co-opt its own marketing and advertising techniques to spread counter-hegemonic ideas to the public. Accordingly, this book argues that the citizen marketer approach to political action is much broader than any one ideological constituency or bloc. Rather, it is a means of promoting a wide range of political ideas, including those that are broadly critical of elite uses of marketing in consumer capitalist societies. The book includes an extensive historical treatment of citizen-level political promotion in modern democratic societies, connecting contemporary digital practices to both the 19th century tradition of mass political spectacle as well as more informal, culturally-situated forms of political expression that emerge from postwar countercultures. By investigating the logics and motivations behind the citizen marketer approach, as well as how it has developed in response to key social, cultural, and technological changes, Penney charts the evolution of activism in an age of mediatized politics, promotional culture, and viral circulation….(More)”.

The law and big data


Article by Teppo Felin, Caryn Devins, Stuart Kauffman and Roger Koppl: “In this article we critically examine the use of Big Data in the legal system. Big Data is driving a trend towards behavioral optimization and “personalized law,” in which legal decisions and rules are optimized for best outcomes and where law is tailored to individual consumers based on analysis of past data. Big Data, however, has serious limitations and dangers when applied in the legal context. Advocates of Big Data make theoretically problematic assumptions about the objectivity of data and scientific observation. Law is always theory-laden. Although Big Data strives to be objective, law and data have multiple possible meanings and uses and thus require theory and interpretation in order to be applied. Further, the meanings and uses of law and data are indefinite and continually evolving in ways that cannot be captured or predicted by Big Data.

Due to these limitations, the use of Big Data will likely generate unintended consequences in the legal system. Large-scale use of Big Data will create distortions that adversely influence legal decision-making, causing irrational herding behaviors in the law. The centralized nature of the collection and application of Big Data also poses serious threats to legal evolution and democratic accountability. Furthermore, its focus on behavioral optimization necessarily restricts and even eliminates the local variation and heterogeneity that makes the legal system adaptive. In all, though Big Data has legitimate uses, this article cautions against using Big Data to replace independent legal judgment….(More)”