Five Headlines from a Big Month for the Data Revolution


Sarah T. Lucas at Post2015.org: “If the history of the data revolution were written today, it would include three major dates. May 2013, when theHigh Level Panel on the Post-2015 Development Agenda first coined the phrase “data revolution.” November 2014, when the UN Secretary-General’s Independent Expert Advisory Group (IEAG) set a vision for it. And April 2015, when five headliner stories pushed the data revolution from great idea to a concrete roadmap for action.

The April 2015 Data Revolution Headlines

1. The African Data Consensus puts Africa in the lead on bringing the data revolution to the regional level. TheAfrica Data Consensus (ADC) envisions “a profound shift in the way that data is harnessed to impact on development decision-making, with a particular emphasis on building a culture of usage.” The ADC finds consensus across 15 “data communities”—ranging from open data to official statistics to geospatial data, and is endorsed by Africa’s ministers of finance. The ADC gets top billing in my book, as the first contribution that truly reflects a large diversity of voices and creates a political hook for action. (Stay tuned for a blog from my colleague Rachel Quint on the ADC).

2. The Sustainable Development Solutions Network (SDSN) gets our minds (and wallets) around the data needed to measure the SDGs. The SDSN Needs Assessment for SDG Monitoring and Statistical Capacity Development maps the investments needed to improve official statistics. My favorite parts are the clear typology of data (see pg. 12), and that the authors are very open about the methods, assumptions, and leaps of faith they had to take in the costing exercise. They also start an important discussion about how advances in information and communications technology, satellite imagery, and other new technologies have the potential to expand coverage, increase analytic capacity, and reduce the cost of data systems.

3. The Overseas Development Institute (ODI) calls on us to find the “missing millions.” ODI’s The Data Revolution: Finding the Missing Millions presents the stark reality of data gaps and what they mean for understanding and addressing development challenges. The authors highlight that even that most fundamental of measures—of poverty levels—could be understated by as much as a quarter. And that’s just the beginning. The report also pushes us to think beyond the costs of data, and focus on how much good data can save. With examples of data lowering the cost of doing government business, the authors remind us to think about data as an investment with real economic and social returns.

4. Paris21 offers a roadmap for putting national statistic offices (NSOs) at the heart of the data revolution.Paris21’s Roadmap for a Country-Led Data Revolution does not mince words. It calls on the data revolution to “turn a vicious cycle of [NSO] underperformance and inadequate resources into a virtuous one where increased demand leads to improved performance and an increase in resources and capacity.” It makes the case for why NSOs are central and need more support, while also pushing them to modernize, innovate, and open up. The roadmap gets my vote for best design. This ain’t your grandfather’s statistics report!

5. The Cartagena Data Festival features real-live data heroes and fosters new partnerships. The Festival featured data innovators (such as terra-i using satellite data to track deforestation), NSOs on the leading edge of modernization and reform (such as Colombia and the Philippines), traditional actors using old data in new ways (such as the Inter-American Development Bank’s fantastic energy database), groups focused on citizen-generated data (such as The Data Shift and UN My World), private firms working with big data for social good (such asTelefónica), and many others—all reminding us that the data revolution is well underway and will not be stopped. Most importantly, it brought these actors together in one place. You could see the sparks flying as folks learned from each other and hatched plans together. The Festival gets my vote for best conference of a lifetime, with the perfect blend of substantive sessions, intense debate, learning, inspiration, new connections, and a lot of fun. (Stay tuned for a post from my colleague Kristen Stelljes and me for more on Cartagena).

This month full of headlines leaves no room for doubt—momentum is building fast on the data revolution. And just in time.

With the Financing for Development (FFD) conference in Addis Ababa in July, the agreement of Sustainable Development Goals in New York in September, and the Climate Summit in Paris in December, this is a big political year for global development. Data revolutionaries must seize this moment to push past vision, past roadmaps, to actual action and results…..(More)”

Monithon


“Moni-thon” comes from “monitor” and “marathon”, and this is precisely what this platform seeks to help with: anintensive activity of observing and reporting of public policies in Italy.

What’s there to monitor?  Monithon was born as an independently developed initiative to promote the citizen monitoring of development projects funded both by the Italian government and the EU through the Cohesion (aka. Regional) Policy. Projects include a wide range of interventions such as large transport, digital, research or environmental infrastructures (railroads, highways, broadband networks, waste management systems…), aids to enterprises to support innovation and competitiveness, and other funding for energy efficiency, social inclusion, education and training, occupation and workers mobility, tourism, etc.

Citizen monitoring of these projects is possible thanks to a combination of open government data and citizens’ collaboration, joined by the goal of controlling how the projects are progressing, and whether they deliver actual results.

The Italian government releases the information on all the 800k+ projects funded (worth almost 100 billion Euros), the beneficiaries of the subsidies and all the actors involved as open data, including the location and the timing of the intervention. All the data is integrated with interactive visualizations on the national portal OpenCoesione, where people can play with the data and find the most interesting projects to follow.

The Monithon initiative takes this transparency further: it asks citizens to actively engage with open government data and to produce valuable information through it.

How does it work? Monithon means active involvement of communities and a shared methodology. Citizens, journalist, experts, researchers, students – or all combined – collect information on a specific project chosen from the OpenCoesione database. Then this information can be uploaded on the Monithon platform (based on Ushahidi) by selecting the projects from a list and it can be geo-referenced and enriched with interviews, quantitative data, pictures, videos. The result is a form of civic, bottom-down, collective data storytelling. All the “wannabe monithoners” can download this simple toolkit, a 10-page document that describes the initiative and explains how to pick a project to monitor and get things started.  ….

How to achieve actual impact? The Monithon platform is method and a model whereby citizen monitoring may be initiated and a tool for civic partners to press forward, to report on malpractice, but also to collaborate in making all these projects work, in accelerating their completion and understanding whether they actually respond to local demand. ….

Monithon has rapidly evolved from being an innovative new platform into a transferable civic engagement format.  Since its launch in September 2013, Monithon has drawn dozens of national and local communities (some formed on purpose, other based on existing associations) and around 500 people into civic monitoring activities, mostly in Southern Italy, where cohesion funds are more concentrated. Specific activities are carried out by established citizen groups, like Libera, a national anti-Mafia association, which became Monithon partner, focusing their monitoring on the rehabilitation of Mafia-seized properties. Action Aid is now partnering with Monithon to promote citizen empowerment. Existing, local groups of activists are using the Monithon methodology to test local transportation systems that benefited from EU funding, while new groups have formed to begin monitoring social innovation and cultural heritage projects.

Now more than 50 “citizen monitoring reports”, which take the form of collective investigations on project development and results, are publicly available on the Monithon website, many of which spurred further dialogue with public administrations….(More)

Data Fusion Heralds City Attractiveness Ranking


Emerging Technology From the arXiv: “The ability of any city to attract visitors is an important metric for town planners, businesses based on tourism, traffic planners, residents, and so on. And there are increasingly varied ways of measuring this thanks to the growing volumes of city-related data generated by with social media, and location-based data.

So it’s only natural that researchers would like to draw these data sets together to see what kind of insight they can get from this form of data fusion.

And so it has turned out thanks to the work of Stanislav Sobolevsky at MIT and a few buddies. These guys have fused three wildly different data sets related to the attractiveness of a city that allows them to rank these places and to understand why people visit them and what they do when they get there.

The work focuses exclusively on cities in Spain using data that is relatively straightforward to gather. The first data set consists of the number of credit and debit card transactions carried out by visitors to cities throughout Spain during 2011. This includes each card’s country of origin, which allows Sobolevsky and co to count only those transactions made by foreign visitors—a total of 17 million anonymized transactions from 8.6 million foreign visitors from 175 different countries.

The second data set consists of over 3.5 million photos and videos taken in Spain and posted to Flickr by people living in other countries. These pictures were taken between 2005 and 2014 by 16,000 visitors from 112 countries.

The last data set consists of around 700,000 geotagged tweets posted in Spain during 2012. These were posted by 16,000 foreign visitors from 112 countries.

Finally, the team defined a city’s attractiveness, at least for the purposes of this study, as the total number of pictures, tweets and card transactions that took place within it……

That’s interesting work that shows how the fusion of big data sets can provide insights into the way people use cities.   It has its limitations of course. The study does not address the reasons why people find cities attractive and what draws them there in the first place. For example, are they there for tourism, for business, or for some other reason. That would require more specialized data.

But it does provide a general picture of attractiveness that could be a start for more detailed analyses. As such, this work is just a small part of a new science of cities based on big data, but one that shows how much is becoming possible with just a little number crunching.

Ref: arxiv.org/abs/1504.06003 : Scaling of city attractiveness for foreign visitors through big data of human economic and social media activity”

How Not to Drown in Numbers


Seth Stephens-Davidowitz in the New York Times: “BIG data will save the world. How often have we heard that over the past couple of years? We’re pretty sure both of us have said something similar dozens of times in the past few months.

If you’re trying to build a self-driving car or detect whether a picture has a cat in it, big data is amazing. But here’s a secret: If you’re trying to make important decisions about your health, wealth or happiness, big data is not enough.

The problem is this: The things we can measure are never exactly what we care about. Just trying to get a single, easy-to-measure number higher and higher (or lower and lower) doesn’t actually help us make the right choice. For this reason, the key question isn’t “What did I measure?” but “What did I miss?”

So what can big data do to help us make big decisions? One of us, Alex, is a data scientist at Facebook. The other, Seth, is a former data scientist at Google. There is a special sauce necessary to making big data work: surveys and the judgment of humans — two seemingly old-fashioned approaches that we will call small data….(More)”

Urban Data Games: creating smart citizens for smart cities


Paper by Wolff, Annika; Kortuem, Gerd and Cavero, Jose: “A bottom-up approach to smart cities places citizens in an active role of contributing, analysing and interpreting data in pursuit of tackling local urban challenges and building a more sustainable future city. This vision can only be realised if citizens have sufficient data literacy skills and experience of large, complex, messy, ever expanding data sets. Schools typically focus on teaching data handling skills using small, personally collected data sets obtained through scientific experimentation, leading to a gap between what is being taught and what will be needed as big data and analytics become more prevalent. This paper proposes an approach to teaching data literacy in the context of urban innovation tasks, using an idea of Urban Data Games. These are supported by a set of training data and resources that will be used in school trials for exploring the problems people have when dealing with large data and trialling novel approaches for teaching data literacy….(More)”

A sentiment analysis of U.S. local government tweets: The connection between tone and citizen involvement


Paper by Staci M. ZavattaroP. Edward French, and Somya D. Mohanty: “As social media tools become more popular at all levels of government, more research is needed to determine how the platforms can be used to create meaningful citizen–government collaboration. Many entities use the tools in one-way, push manners. The aim of this research is to determine if sentiment (tone) can positively influence citizen participation with government via social media. Using a systematic random sample of 125 U.S. cities, we found that positive sentiment is more likely to engender digital participation but this was not a perfect one-to-one relationship. Some cities that had an overall positive sentiment score and displayed a participatory style of social media use did not have positive citizen sentiment scores. We argue that positive tone is only one part of a successful social media interaction plan, and encourage social media managers to actively manage platforms to use activities that spur participation….(More)”

How crowdsourcing could help simplify America’s tax code


 in Fortune: “By soliciting ideas from large groups, the U.S. can come up with the right policies to reform the nation’s tax code — similar to the way complex computer systems are managed around the world….

Addressing complexity in the tax code requires analogizing to other complex systems and drawing on the research that demonstrates how to manage that complexity. Indeed, there is a well-developed literature on how to manage complex systems that can provide the foundation for simplifying the tax code. In particular, we know a lot about how to manage the evolution of software codes. This analogy yields two primary lessons.

First, “over the wall” engineering is highly problematic and “concurrent” engineering is preferred. Throwing completed ideas “over the wall” to the next part of the production process limits learning and engenders complexity relative to a concurrent and iterative production process. Currently, policy ideas are often developed without a clear vision of the associated language and with even less attention to the perspective of administrators. As with software, this yields bulky and contradictory language that could be avoided if the practice of policy formulation and drafting were a collaborative activity with the administrative agency in charge of enforcement. While the 1998 IRS Reform Act calls for such an approach, the reality does not live up to the law’s aspiration.

Second, and more radically, we should embark on an effort to crowd source the code. Much as the development of software capitalizes on a distributed talent pool, our legislative and regulatory processes on taxes could be opened up radically during comment and drafting.

Currently, the code is managed much as it was 50 years ago – in a fundamentally closed manner. Laws and regulations are drafted by small groups in a non-transparent way that pays little attention to the overall architecture of the tax system. As a consequence, vested interests can influence the management of complexity toward their advantage and complexity grows by ignoring interrelationships.

Research shows that effective management of complex codes – be it Linux or the tax code – requires three things. First, the code must be mapped so that the interrelationships, technically and conceptually, of different parts of the code and associated regulations and rulings become manifest. Second, this mapping enables modularization whereby the code is reorganized into pieces that reflect these relationships. Finally, this modularization provides the foundation for opening up the code to experts throughout society – so-called crowdsourcing – who contribute suggestions for rationalization and simplification.

By mapping, modularizing and opening the code and associated regulations, we could draw upon widespread expertise, provide transparency on a critical process, address the imbalance in resources between the taxing authority and sophisticated taxpayers and begin the process of simplifying the code and its administrative guidance. In the limit, one could imagine a detailed mapping of the tax code and associated regulations hosted by the IRS much as software code is mapped. This mapping would then serve as a guide to reorganizing laws and regulations over time. While decision making rights would remain with Congress and the IRS, opinions on policies would then be solicited widely and the drafting of laws and regulations could be aided by experts around the country through an open platform.

The commentary and drafting process that is so critical to policy formulation and administration would be completely open in real time. …(More)”

Serious Gaming Takes Flight


Dennis Glenn at “Chief Learning Officer” Media: “Gamification is one of the hottest topics in corporate learning today, yet we don’t entirely trust it. So before delving into how leaders can take a reasoned, serious approach to use games in learning environments, let’s get one thing straight: Gamification is different from serious gaming.

Gamification places nongame experiences into a gamelike environment. Serious games are educational experiences specifically designed to deliver formative or summative assessments based on predetermined learning objectives. Gamification creates an experience; serious games promote task or concept mastery. The underlying aim of serious games concentrates the user’s effort on mastery of a specific task, with a feedback loop to inform users of their progress toward that goal….

In addition to simulations and gamification, many corporate learning leaders are turning to serious games, which demand social engagement. For instance, consider the World of Warcraft wiki, which has more than 101,000 players and contributors helping others master the online game.

Some of the most important benefits to gaming:

  • Accepting failure, which is seen as a benefit to mastery.
  • Rewarding players with appropriate and timely feedback.
  • Making social connections and feeling part of something bigger.

In serious games, frequent feedback — when accompanied by specific instruction — can dramatically reduce the time to mastery. Because the computer will record all data during the assessment, learning leaders can identify specific pathways to mastery and offer them to learners.

This feedback loop leads to self-reflection and that can be translated into learning, according the 2014 paper titled “Working Paper: Learning by Thinking: How Reflection Aids Performance.” Authors Giada Di Stefano, Francesca Gino, Gary Pisano and Bradley Staats found that individuals performed significantly better on subsequent tasks when thinking about what they learned from the previously completed task.

Social learning is the final link to understanding mastery learning. In a recent massive open online course, titled “Design and Development of Educational Technology MITx: 11.132x,” instructor Scot Osterweil said our understanding of literacy is rooted in a social environment and in interactions with other people and the world. But again, engagement is key. Gaming provides the structure needed to engage with peers, often irrespective of cultural and language differences….(More)”.

An Unexpected Value Coming From the Happy Meal


Gregory Ferenstein in Pacific Standard: “Improbable as it sounds, McDonald’s may hold the key to getting America’s youth to eat healthier.

A team of medical researchers led by Dr. Robert Siegel took a strategy from the Happy Meal playbook, pairing healthy lunch options at public schools with smiley faces and a toy. The result were extraordinary: voluntary healthy meal purchases quadrupled.

“A two-tiered approach of Emoticons followed by small prizes as an incentive for healthful food selections is very effective in increasing plain white milk, fruit and vegetable selection,” the researchers write in a study presented this week at the annual Pediatric Academic Societies meeting, in San Diego.

Indeed, the popularization of emoticons has been co-opted by researchers lately to see if the colorful balls of happiness can be utilized for socially beneficial ends. One 2014 study found that “emolabeling” could be a major factor in health choice selection by both pre-literate and young children….

This latest research delves further into the power of emoticons in two significant ways. First, the study was tested in some of the most troubled school neighborhoods. (Siegel estimates that a significant portion of the families were either poor or homeless.) Second, knowing that smiling faces alone may not be enough to change behavior, after three months of using emoticons, the team added in a toy to further entice healthy meals.

With emoticons alone, the team found that chocolate milk sales at the school took a noticeable dip, from 87 percent of total milk sales to 78 percent. Later, entire meals known as “Power Plates” were added and paired with a toy. These healthy lunches (with whole grains and vegetables) spiked from less than 10 percent to 42 percent with the introduction of emoticons and a toy.

Interestingly enough, after the toys were taken away, the children continued to select healthy meals. That’s because external rewards can have odd effects on behavior….(More)

EU DataLandscape


DataLandscape.eu: “The European Data Market study aims to define, assess and measure the European data economy, supporting the achievement of the Data Value Chain policy of the European Commission. This strategy is focused on developing a vibrant and innovative data ecosystem of stakeholders driving the growth of this innovative market in Europe. The main results of this study will feed into the annual reviews of the Digital Agenda Scoreboard providing valuable data and information.

Additionally, supporting above-mentioned aims the study also plans to support the development of the community of relevant stakeholders in the EU. Since the “data community” in Europe is already strong albeit very diverse, dispersed in several communities, the study wants to move from disparate communities to a genuine stakeholders’ ecosystem….(More)”