A Guide to Data Innovation for Development – From idea to proof-of-concept


Press Release: “UNDP and UN Global Pulse today released a comprehensive guide on how to integrate new sources of data into development and humanitarian work.

New and emerging data sources such as mobile phone data, social media, remote sensors and satellites have the potential to improve the work of governments and development organizations across the globe.

Entitled ‘A Guide to Data Innovation for Development – From idea to proof-of-concept,’ this publication was developed by practitioners for practitioners. It provides step-by-step guidance for working with new sources of data to staff of UN agencies and international Non-Governmental Organizations.

The guide is a result of a collaboration of UNDP and UN Global Pulse with support from UN Volunteers. Led by UNDP innovation teams in Europe and Central Asia and Arab States, six UNDP offices in Armenia, Egypt, Kosovo, fYR Macedonia, Sudan and Tunisia each completed data innovation projects applicable to development challenges on the ground.

The publication builds on these successful case trials and on the expertise of data innovators from UNDP and UN Global Pulse who managed the design and development of those projects.

It provides practical guidance for jump-starting a data innovation project, from the design phase through the creation of a proof-of-concept.

The guide is structured into three sections – (I) Explore the Problem & System, (II) Assemble the Team and (III) Create the Workplan. Each of the sections comprises a series of tools for completing the steps needed to initiate and design a data innovation project, to engage the right partners and to make sure that adequate privacy and protection mechanisms are applied.

…Download ‘A Guide to Data Innovation for Development – From idea to proof-of-concept’ here.”

Just good enough data: Figuring data citizenships through air pollution sensing and data stories


Jennifer Gabrys, Helen Pritchard, and Benjamin Barratt in Big Data & Society: “Citizen sensing, or the use of low-cost and accessible digital technologies to monitor environments, has contributed to new types of environmental data and data practices. Through a discussion of participatory research into air pollution sensing with residents of northeastern Pennsylvania concerned about the effects of hydraulic fracturing, we examine how new technologies for generating environmental data also give rise to new problems for analysing and making sense of citizen-gathered data. After first outlining the citizen data practices we collaboratively developed with residents for monitoring air quality, we then describe the data stories that we created along with citizens as a method and technique for composing data. We further mobilise the concept of ‘just good enough data’ to discuss the ways in which citizen data gives rise to alternative ways of creating, valuing and interpreting datasets. We specifically consider how environmental data raises different concerns and possibilities in relation to Big Data, which can be distinct from security or social media studies. We then suggest ways in which citizen datasets could generate different practices and interpretive insights that go beyond the usual uses of environmental data for regulation, compliance and modelling to generate expanded data citizenships….(More)”

Towards Scalable Governance: Sensemaking and Cooperation in the Age of Social Media


Iyad Rahwan in Philosophy & Technology: “Cybernetics, or self-governance of animal and machine, requires the ability to sense the world and to act on it in an appropriate manner. Likewise, self-governance of a human society requires groups of people to collectively sense and act on their environment. I argue that the evolution of political systems is characterized by a series of innovations that attempt to solve (among others) two ‘scalability’ problems: scaling up a group’s ability to make sense of an increasingly complex world, and to cooperate in increasingly larger groups. I then explore some recent efforts toward using the Internet and social media to provide alternative means for addressing these scalability challenges, under the banners of crowdsourcing and computer-supported argumentation. I present some lessons from those efforts about the limits of technology, and the research directions more likely to bear fruit….(More)”

From policing to news, how algorithms are changing our lives


Carl Miller at The National: “First, write out the numbers one to 100 in 10 rows. Cross out the one. Then circle the two, and cross out all of the multiples of two. Circle the three, and do likewise. Follow those instructions, and you’ve just completed the first three steps of an algorithm, and an incredibly ancient one. Twenty-three centuries ago, Eratosthenes was sat in the great library of Alexandria, using this process (it is called Eratosthenes’ Sieve) to find and separate prime numbers. Algorithms are nothing new, indeed even the word itself is old. Fifteen centuries after Eratosthenes, Algoritmi de numero Indorum appeared on the bookshelves of European monks, and with it, the word to describe something very simple in essence: follow a series of fixed steps, in order, to achieve a given answer to a given problem. That’s it, that’s an algorithm. Simple.

Apart from, of course, the story of algorithms is not so simple, nor so humble. In the shocked wake of Donald Trump’s victory in the United States presidential election, a culprit needed to be found to explain what had happened. What had, against the odds, and in the face of thousands of polls, caused this tectonic shift in US political opinion? Soon the finger was pointed. On social media, and especially on Facebook, it was alleged that pro-Trump stories, based on inaccurate information, had spread like wildfire, often eclipsing real news and honestly-checked facts.

But no human editor was thrust into the spotlight. What took centre stage was an algorithm; Facebook’s news algorithm. It was this, critics said, that was responsible for allowing the “fake news” to circulate. This algorithm wasn’t humbly finding prime numbers; it was responsible for the news that you saw (and of course didn’t see) on the largest source of news in the world. This algorithm had somehow risen to become more powerful than any newspaper editor in the world, powerful enough to possibly throw an election.

So why all the fuss? Something is now happening in society that is throwing algorithms into the spotlight. They have taken on a new significance, even an allure and mystique. Algorithms are simply tools but a web of new technologies is vastly increasing the power that these tools have over our lives. The startling leaps forward in artificial intelligence have meant that algorithms have learned how to learn, and to become capable of accomplishing tasks and tackling problems that they were never able to achieve before. Their learning is fuelled with more data than ever before, collected, stored and connected with the constellations of sensors, data farms and services that have ushered in the age of big data.

Algorithms are also doing more things; whether welding, driving or cooking, thanks to robotics. Wherever there is some kind of exciting innovation happening, algorithms are rarely far away. They are being used in more fields, for more things, than ever before and are incomparably, incomprehensibly more capable than the algorithms recognisable to Eratosthenes….(More)”
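The procedure Miller describes at the start of this excerpt (cross out one, circle two, cross out its multiples, then do likewise for three and onward) can be written down in a few lines. What follows is only a minimal sketch in Python, not anything taken from the article; the function name `sieve` and the choice to return a list are assumptions made for illustration.

```python
def sieve(limit):
    """Return the primes up to and including `limit` using Eratosthenes' Sieve.

    Illustrative sketch only: the name and return type are assumptions,
    not part of the article being excerpted.
    """
    if limit < 2:
        return []

    # is_prime[n] stays True while n has not been crossed out.
    is_prime = [True] * (limit + 1)
    is_prime[0] = False
    is_prime[1] = False  # "Cross out the one."

    for n in range(2, limit + 1):
        if not is_prime[n]:
            continue  # already crossed out as a multiple of a smaller number
        # n is "circled"; cross out every larger multiple of it.
        for multiple in range(2 * n, limit + 1, n):
            is_prime[multiple] = False

    return [n for n, flag in enumerate(is_prime) if flag]


if __name__ == "__main__":
    # The article's example: the numbers one to 100.
    print(sieve(100))
```

Starting the crossing-out at `2 * n` mirrors the wording of the excerpt; a common refinement is to start at `n * n`, since smaller multiples have already been crossed out by earlier steps.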

Big Data Coming In Faster Than Biomedical Researchers Can Process It


Richard Harris at NPR: “Biomedical research is going big-time: Megaprojects that collect vast stores of data are proliferating rapidly. But scientists’ ability to make sense of all that information isn’t keeping up.

This conundrum took center stage at a meeting of patient advocates, called Partnering For Cures, in New York City on Nov. 15.

On the one hand, there’s an embarrassment of riches, as billions of dollars are spent on these megaprojects.

There’s the White House’s Cancer Moonshot (which seeks to make 10 years of progress in cancer research over the next five years), the Precision Medicine Initiative (which is trying to recruit a million Americans to glean hints about health and disease from their data), The BRAIN Initiative (to map the neural circuits and understand the mechanics of thought and memory) and the International Human Cell Atlas Initiative (to identify and describe all human cell types).

“It’s not just that any one data repository is growing exponentially, the number of data repositories is growing exponentially,” said Dr. Atul Butte, who leads the Institute for Computational Health Sciences at the University of California, San Francisco.

One of the most remarkable efforts is the federal government’s push to get doctors and hospitals to put medical records in digital form. That shift to electronic records is costing billions of dollars — including more than $28 billion alone in federal incentives to hospitals, doctors and others to adopt them. The investment is creating a vast data repository that could potentially be mined for clues about health and disease, the way websites and merchants gather data about you to personalize the online ads you see and for other commercial purposes.

But, unlike the data scientists at Google and Facebook, medical researchers have done almost nothing as yet to systematically analyze the information in these records, Butte said. “As a country, I think we’re investing close to zero analyzing any of that data,” he said.

Prospecting for hints about health and disease isn’t going to be easy. The raw data aren’t very robust and reliable. Electronic medical records are often kept in databases that aren’t compatible with one another, at least without a struggle. Some of the potentially revealing details are also kept as free-form notes, which can be hard to extract and interpret. Errors commonly creep into these records….(More)”

How Should a Society Be?


Brian Christian: “This is another example where AI—in this case, machine-learning methods—intersects with these ethical and civic questions in an ultimately promising and potentially productive way. As a society we have these values in maxim form, like equal opportunity, justice, fairness, and in many ways they’re deliberately vague. This deliberate flexibility and ambiguity are what allows things to be a living document that stays relevant. But here we are in this world where we have to say of some machine-learning model, is this racially fair? We have to define these terms, computationally or numerically.

It’s problematic in the short term because we have no idea what we’re doing; we don’t have a way to approach that problem yet. In the slightly longer term—five or ten years—there’s a profound opportunity to come together as a polis and get precise about what we mean by justice or fairness with respect to certain protected classes. Does that mean it’s got an equal false positive rate? Does that mean it has an equal false negative rate? What is the tradeoff that we’re willing to make? What are the constraints that we want to put on this model-building process? That’s a profound question, and we haven’t needed to address it until now. There’s going to be a civic conversation in the next few years about how to make these concepts explicit….(More) (Video)”
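To make the two questions above concrete, here is a hedged sketch of how a false positive rate and a false negative rate might be computed separately for each group. It is not drawn from Christian's talk; the function name `error_rates_by_group`, the 0/1 label encoding, and the toy data are all assumptions for illustration.

```python
from collections import defaultdict


def error_rates_by_group(y_true, y_pred, groups):
    """Compute false positive and false negative rates per group.

    Hypothetical helper: assumes labels and predictions are 0 or 1,
    and that `groups` gives each record's group membership.
    """
    counts = defaultdict(lambda: {"tp": 0, "fp": 0, "tn": 0, "fn": 0})
    for truth, pred, group in zip(y_true, y_pred, groups):
        if truth == 1 and pred == 1:
            counts[group]["tp"] += 1
        elif truth == 0 and pred == 1:
            counts[group]["fp"] += 1
        elif truth == 0 and pred == 0:
            counts[group]["tn"] += 1
        else:
            counts[group]["fn"] += 1

    rates = {}
    for group, c in counts.items():
        negatives = c["fp"] + c["tn"]  # records whose true label is 0
        positives = c["fn"] + c["tp"]  # records whose true label is 1
        rates[group] = {
            # None signals that the rate is undefined for this group.
            "fpr": c["fp"] / negatives if negatives else None,
            "fnr": c["fn"] / positives if positives else None,
        }
    return rates


if __name__ == "__main__":
    # Made-up data, purely to show the shape of the comparison.
    y_true = [0, 0, 1, 1, 0, 0, 1, 1]
    y_pred = [1, 0, 1, 1, 0, 0, 0, 1]
    groups = ["a", "a", "a", "a", "b", "b", "b", "b"]
    print(error_rates_by_group(y_true, y_pred, groups))
```

In this toy data the two groups have the same number of errors but of different kinds: one group sees only false positives and the other only false negatives, which is the sort of tradeoff the passage asks a society to weigh explicitly.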

OpenStreetMap in Israel and Palestine – ‘Game changer’ or reproducer of contested cartographies?


Christian Bittner in Political Geography: “In Israel and Palestine, map-making practices were always entangled with contradictive spatial identities and imbalanced power resources. Although an Israeli narrative has largely dominated the ‘cartographic battlefield’, the latest chapter of this story has not been written yet: collaborative forms of web 2.0 cartographies have restructured power relations in mapping practices and challenged traditional monopolies on map and spatial data production. Thus, we can expect web 2.0 cartographies to be a ‘game changer’ for cartography in Palestine and Israel.

In this paper, I review this assumption with the popular example of OpenStreetMap (OSM). Following a mixed methods approach, I comparatively analyze the genesis of OSM in Israel and Palestine. Although nationalist motives do not play a significant role on either side, it turns out that the project is dominated by Israeli and international mappers, whereas Palestinians have hardly contributed to OSM. As a result, social fragmentations and imbalances between Israel and Palestine are largely reproduced through OSM data. Discussing the low involvement of Palestinians, I argue that OSM’s ground truth paradigm might be a watershed for participation. Presumably, the project’s data are less meaningful in some local contexts than in others. Moreover, the seemingly apolitical approach to map only ‘facts on the ground’ reaffirms present spatio-social order and thus the power relations behind it. Within a Palestinian narrative, however, many aspects of the factual material space might appear not as neutral physical objects but as results of suppression, in which case, any ‘accurate’ spatial representation, such as OSM, becomes objectionable….(More)”

Esri, Waze Partnership: A Growing Trend in Sharing Data for the Benefit of All?


Justine Brown at GovTech: “Esri and Waze announced in mid-October that they’re partnering to help local governments alleviate traffic congestion and analyze congestion patterns. Called the Waze Connected Citizens Program, the program — which enables local governments that use the Esri ArcGIS platform to exchange publicly available traffic data with Waze — may represent a growing trend in which citizens and government share data for the benefit of all.

Connecting Esri and Waze data will allow cities to easily share information about the conditions of their roads with drivers, while drivers anonymously report accidents, potholes and other road condition information back to the cities. Local governments can then merge that data into their existing emergency dispatch and street maintenance systems….

Through the Connected Citizens Program, Waze shares two main data sets with its government partners: Jams and Alerts….If there’s a major traffic jam in an unusual area, a traffic management center operator might be triggered to examine that area further. For example, Boston recently used Waze jam data to identify a couple of traffic-prone intersections in the Seaport district….Similarly, if a Waze user reports a crash, that information shows up on the city’s existing ArcGIS map. City personnel can assess the crash and combine the Waze data with its existing data sets, if desired. The city can then notify emergency response, for example, to address the accident and send out emergency vehicles if necessary….

The Connected Citizens Program could also provide local governments an alternative to IoT investments, because a city can utilize real-time reports from the road rather than investing in sensors and IoT infrastructure. The Kentucky Transportation Cabinet, for instance, uses data from the Connected Citizens Program in several ways, including to monitor and detect automobile accidents on its roadways….(More)”

Data Literacy – What is it and how can we make it happen?


Introduction by Mark Frank, Johanna Walker, Judie Attard, Alan Tygel of Special Issue on Data Literacy of The Journal of Community Informatics: “With the advent of the Internet and particularly Open Data, data literacy (the ability of non-specialists to make use of data) is rapidly becoming an essential life skill comparable to other types of literacy. However, it is still poorly defined and there is much to learn about how best to increase data literacy both amongst children and adults. This issue addresses both the definition of data literacy and current efforts on increasing and sustaining it. A feature of the issue is the range of contributors. While there are important contributions from the UK, Canada and other Western countries, these are complemented by several papers from the Global South where there is an emphasis on grounding data literacy in context and relating it to the issues and concerns of communities.” (Full Text: PDF)

See also:

Creating an Understanding of Data Literacy for a Data-driven Society by Annika Wolff, Daniel Gooch, Jose J. Cavero Montaner, Umar Rashid, Gerd Kortuem

Data Literacy defined pro populo: To read this article, please provide a little information by David Crusoe

Data literacy conceptions, community capabilities by Paul Matthews

Urban Data in the primary classroom: bringing data literacy to the UK curriculum by Annika Wolff, Jose J Cavero Montaner, Gerd Kortuem

Contributions of Paulo Freire for a Critical Data Literacy: a Popular Education Approach by Alan Freihof Tygel, Rosana Kirsch

DataBasic: Design Principles, Tools and Activities for Data Literacy Learners by Catherine D’Ignazio, Rahul Bhargava

Perceptions of ICT use in rural Brazil: Factors that impact appropriation among marginalized communities by Paola Prado, J. Alejandro Tirado-Alcaraz, Mauro Araújo Câmara

Graphical Perception of Value Distributions: An Evaluation of Non-Expert Viewers’ Data Literacy by Arkaitz Zubiaga, Brian Mac Namee

Using open government for climate action


Elizabeth Moses at Eco-Business: “Countries made many national climate commitments as part of the Paris Agreement on climate change, which entered into force earlier this month. Now comes the hard part of implementing those commitments. The public can serve an invaluable watchdog role, holding governments accountable for following through on their targets and making sure climate action happens in a way that’s fair and inclusive.

But first, the climate and open government communities will need to join forces….

Here are four areas where these communities can lean in together to ensure governments follow through on effective climate action:

1) Expand access to climate data and information.

Open government and climate NGOs and local communities can expand the use of traditional transparency tools and processes such as Freedom of Information (FOI) laws, transparent budgeting, open data policies and public procurement to enhance open information on climate mitigation, adaptation and finance.

For example, Transparencia Mexicana used Mexico’s Freedom of Information Law to collect data to map climate finance actors and the flow of finance in the country. This allows them to make specific recommendations on how to safeguard climate funds against corruption and ensure the money translates into real action on the ground….

2) Promote inclusive and participatory climate policy development.

Civil society and community groups already play a crucial role in advocating for climate action and improving climate governance at the national and local levels, especially when it comes to safeguarding poor and vulnerable people, who often lack political voice….

3) Take legal action for stronger accountability.

Accountability at a national level can only be achieved if grievance mechanisms are in place to address a lack of transparency or public participation, or address the impact of projects and policies on individuals and communities.

Civil society groups and individuals can use legal actions like climate litigation, petitions, administrative policy challenges and court cases at the national, regional or international levels to hold governments and businesses accountable for failing to effectively act on climate change….

4) Create new spaces for advocacy.

Bringing the climate and open government movements together allows civil society to tap new forums for securing momentum around climate policy implementation. For example, many civil society NGOs are highlighting the important connections between a strong Governance Goal 16 under the 2030 Agenda for Sustainable Development, and strong water quality and climate change policies….(More)”