Selected Readings on Crowdsourcing Data


The Living Library’s Selected Readings series seeks to build a knowledge base on innovative approaches for improving the effectiveness and legitimacy of governance. This curated and annotated collection of recommended works on the topic of crowdsourcing data was originally published in 2013.

As institutions seek to improve decision-making through data and put public data to use to improve the lives of citizens, new tools and projects are allowing citizens to play a role in both the collection and utilization of data. Participatory sensing and other citizen data collection initiatives, notably in the realm of disaster response, are allowing citizens to crowdsource important data, often using smartphones, that would be either impossible or burdensomely time-consuming for institutions to collect themselves. Civic hacking, often performed in hackathon events, on the other hand, is a growing trend in which governments encourage citizens to transform data from government and other sources into useful tools to benefit the public good.

Selected Reading List (in alphabetical order)

Annotated Selected Reading List (in alphabetical order)

Baraniuk, Chris. “Power Politechs.” New Scientist 218, no. 2923 (June 29, 2013): 36–39. http://bit.ly/167ul3J.

  • In this article, Baraniuk discusses civic hackers, “an army of volunteer coders who are challenging preconceptions about hacking and changing the way your government operates. In a time of plummeting budgets and efficiency drives, those in power have realised they needn’t always rely on slow-moving, expensive outsourcing and development to improve public services. Instead, they can consider running a hackathon, at which tech-savvy members of the public come together to create apps and other digital tools that promise to enhace the provision of healthcare, schools or policing.”
  • While recognizing that “civic hacking has established a pedigree that demonstrates its potential for positive impact,” Baraniuk argues that a “more rigorous debate over how this activity should evolve, or how authorities ought to engage in it” is needed.

Barnett, Brandon, Muki Hansteen Izora, and Jose Sia. “Civic Hackathon Challenges Design Principles: Making Data Relevant and Useful for Individuals and Communities.” Hack for Change, https://bit.ly/2Ge6z09.

  • In this paper, researchers from Intel Labs offer “guiding principles to support the efforts of local civic hackathon organizers and participants as they seek to design actionable challenges and build useful solutions that will positively benefit their communities.”
  • The authors proposed design principles are:
    • Focus on the specific needs and concerns of people or institutions in the local community. Solve their problems and challenges by combining different kinds of data.
    • Seek out data far and wide (local, municipal, state, institutional, non-profits, companies) that is relevant to the concern or problem you are trying to solve.
    • Keep it simple! This can’t be overstated. Focus [on] making data easily understood and useful to those who will use your application or service.
    • Enable users to collaborate and form new communities and alliances around data.

Buhrmester, Michael, Tracy Kwang, and Samuel D. Gosling. “Amazon’s Mechanical Turk A New Source of Inexpensive, Yet High-Quality, Data?” Perspectives on Psychological Science 6, no. 1 (January 1, 2011): 3–5. http://bit.ly/H56lER.

  • This article examines the capability of Amazon’s Mechanical Turk to act a source of data for researchers, in addition to its traditional role as a microtasking platform.
  • The authors examine the demographics of MTurkers and find that “MTurk participants are slightly more demographically diverse than are standard Internet samples and are significantly more diverse than typical American college samples; (b) participation is affected by compensation rate and task length, but participants can still be recruited rapidly and inexpensively; (c) realistic compensation rates do not affect data quality; and (d) the data obtained are at least as reliable as those obtained via traditional methods.”
  • The paper concludes that, just as MTurk can be a strong tool for crowdsourcing tasks, data derived from MTurk can be high quality while also being inexpensive and obtained rapidly.

Goodchild, Michael F., and J. Alan Glennon. “Crowdsourcing Geographic Information for Disaster Response: a Research Frontier.” International Journal of Digital Earth 3, no. 3 (2010): 231–241. http://bit.ly/17MBFPs.

  • This article examines issues of data quality in the face of the new phenomenon of geographic information being generated by citizens, in order to examine whether this data can play a role in emergency management.
  • The authors argue that “[d]ata quality is a major concern, since volunteered information is asserted and carries none of the assurances that lead to trust in officially created data.”
  • Due to the fact that time is crucial during emergencies, the authors argue that, “the risks associated with volunteered information are often outweighed by the benefits of its use.”
  • The paper examines four wildfires in Santa Barbara in 2007-2009 to discuss current challenges with volunteered geographical data, and concludes that further research is required to answer how volunteer citizens can be used to provide effective assistance to emergency managers and responders.

Hudson-Smith, Andrew, Michael Batty, Andrew Crooks, and Richard Milton. “Mapping for the Masses Accessing Web 2.0 Through Crowdsourcing.” Social Science Computer Review 27, no. 4 (November 1, 2009): 524–538. http://bit.ly/1c1eFQb.

  • This article describes the way in which “we are harnessing the power of web 2.0 technologies to create new approaches to collecting, mapping, and sharing geocoded data.”
  • The authors examine GMapCreator and MapTube, which allow users to do a range of map-related functions such as create new maps, archive existing maps, and share or produce bottom-up maps through crowdsourcing.
  • They conclude that “these tools are helping to define a neogeography that is essentially ‘mapping for the masses,’ while noting that there are many issues of quality, accuracy, copyright, and trust that will influence the impact of these tools on map-based communication.”

Kanhere, Salil S. “Participatory Sensing: Crowdsourcing Data from Mobile Smartphones in Urban Spaces.” In Distributed Computing and Internet Technology, edited by Chittaranjan Hota and Pradip K. Srimani, 19–26. Lecture Notes in Computer Science 7753. Springer Berlin Heidelberg. 2013. https://bit.ly/2zX8Szj.

  • This paper provides a comprehensive overview of participatory sensing — a “new paradigm for monitoring the urban landscape” in which “ordinary citizens can collect multi-modal data streams from the surrounding environment using their mobile devices and share the same using existing communications infrastructure.”
  • In addition to examining a number of innovative applications of participatory sensing, Kanhere outlines the following key research challenges:
    • Dealing with incomplete samples
    •  Inferring user context
    • Protecting user privacy
    • Evaluating data trustworthiness
    • Conserving energy

Government's Crowdsourcing Revolution


John M. Kamensky  in Governing: “In a recent report for the IBM Center for the Business of Government, Brabham says that an important distinction between crowdsourcing and other forms of online participation is that crowdsourcing “entails a mix of top-down, traditional, hierarchical process and a bottom-up, open process involving an online community.”
Crowdsourcing in the public sector can be done within government, among employees as a way to surface ideas — such as the New York City government’s “Simplicity” initiative — or it can be done by nonprofit groups in ways that influence government operations. For example, a transportation advocacy group in New York City has created a site where citizens can report “near miss” accidents, which are then mapped to determine patterns. The idea is that, while the city government already maps accidents that have happened, hazardous traffic zones can be detected and resolved faster by mapping near-misses without waiting for a large number of actual accidents.
Brabham offers a strategic view of crowdsourcing and when it is useful to address public problems. His report also identifies four specific approaches, describing which is most useful for a given category of problem:
Knowledge discovery and management. This approach is best for information-gathering and cataloguing problems through an online community, such as the reporting of earth tremors or potholes to a central source. This approach could also be used to report conditions of parks or hiking trails or for cataloging public art projects as have been done in several cities across the country.
Distributed human-intelligence tasking: This approach is most useful when human intelligence is more effective than computer analysis. It involves distributing “micro-tasks” that require human intelligence to solve, such as transcribing handwritten historical documents into electronic files. For example, when the handwritten 1940 census records were publicly released in 2012, the National Archives catalyzed the electronic tagging of more than 130 million records so they could be searchable online. More than 150,000 people volunteered.
Broadcast search: This approach is most useful when an agency is attempting to find creative solutions to problems. It involves broadcasting a problem-solving challenge widely on the Internet and offering an award for the best solution. NASA, for example, offered a prize for an algorithm to predict solar flares. The federal government sponsors a contest and awards Web platform, Challenge.gov, that various federal agencies can use to post their challenges. To date, hundreds of diverse challenges have been posted, with thousands of people proposing solutions.
Peer-vetted creative production: This approach is most useful when an agency is looking for innovative ideas that must meet a test of taste or market support. It involves an online community that both proposes possible solutions and is empowered to collectively choose among them. For example, the Utah Transit Authority sponsored the Next Stop Design project, allowing citizens to design and vote on an ideal bus-stop shelter. Nearly 3,200 people participated, submitting 260 high-quality architectural renderings, and there were more than 10,000 votes leading to a final selection….”

The Decay of American Political Institutions


in the American Interest: “Many political institutions in the United States are decaying. This is not the same thing as the broader phenomenon of societal or civilization decline, which has become a highly politicized topic in the discourse about America. Political decay in this instance simply means that a specific political process—sometimes an individual government agency—has become dysfunctional. This is the result of intellectual rigidity and the growing power of entrenched political actors that prevent reform and rebalancing. This doesn’t mean that America is set on a permanent course of decline, or that its power relative to other countries will necessarily diminish. Institutional reform is, however, an extremely difficult thing to bring about, and there is no guarantee that it can be accomplished without a major disruption of the political order. So while decay is not the same as decline, neither are the two discussions unrelated.
There are many diagnoses of America’s current woes. In my view, there is no single “silver bullet” cause of institutional decay, or of the more expansive notion of decline. In general, however, the historical context of American political development is all too often given short shrift in much analysis. If we look more closely at American history as compared to that of other liberal democracies, we notice three key structural characteristics of American political culture that, however they developed and however effective they have been in the past, have become problematic in the present.
The first is that, relative to other liberal democracies, the judiciary and the legislature (including the roles played by the two major political parties) continue to play outsized roles in American government at the expense of Executive Branch bureaucracies. Americans’ traditional distrust of government thus leads to judicial solutions for administrative problems. Over time this has become a very expensive and inefficient way to manage administrative requirements.
The second is that the accretion of interest group and lobbying influences has distorted democratic processes and eroded the ability of the government to operate effectively. What biologists label kin selection and reciprocal altruism (the favoring of family and friends with whom one has exchanged favors) are the two natural modes of human sociability. It is to these types of relationships that people revert when modern, impersonal government breaks down.
The third is that under conditions of ideological polarization in a federal governance structure, the American system of checks and balances, originally designed to prevent the emergence of too strong an executive authority, has become a vetocracy. The decision system has become too porous—too democratic—for its own good, giving too many actors the means to stifle adjustments in public policy. We need stronger mechanisms to force collective decisions but, because of the judicialization of government and the outsized role of interest groups, we are unlikely to acquire such mechanisms short of a systemic crisis. In that sense these three structural characteristics have become intertwined….
In short, the problems of American government flow from a structural imbalance between the strength and competence of the state, on the one hand, and the institutions that were originally designed to constrain the state, on the other. There is too much law and too much “democracy”, in the form of legislative intervention, relative to American state capacity. Some history can make this assertion clearer….
In well-functioning governance systems, moreover, a great deal of deliberation occurs not just in legislatures but within bureaucracies. This is not a matter of bureaucrats simply talking to one another, but rather a complex series of consultations between government officials and businesses, outside implementers and service providers, civil society groups, the media and other sources of information about societal interests and opinions. The Congress wisely mandated consultation in the landmark 1946 Administrative Procedures Act, which requires regulatory agencies to publicly post proposed rule changes and to solicit comment about them. But these consultative procedures have become highly routinized and pro forma, with actual decisions being the outcome not of genuine deliberation, but of political confrontations between well organized interest groups….”

Power to the people: how open data is improving health service delivery


The Guardian: “…What’s really interesting is how this data can be utilised by citizens to enable them to make more informed choices and demand improved services in sectors such as health. A growing community of technologists and social activists is emerging across Africa, supported by a burgeoning network of technology innovation hubs. They’re beginning to explore the ways in which data can be utilised to improve health outcomes.
In Northern Uganda, the brutal Lord’s Resistance Army conflict displaced two million people, leaving the social infrastructure in tatters. In 2008, the government launched a Peace, Recovery and Development Plan, but progress has been limited. There are insufficient health centres to serve the population, a severe shortage of staff, drugs and equipment, and corruption is widespread.
Cipesa – an organisation that uses communication technologies to support poverty reduction and development – and Numec, a local media organisation, have launched the iParticipate project. A multimedia platform is being populated with baseline data outlining the current status of the health service across three districts….
In the same region, Wougnet is training women to use information technologies to tackle social challenges. Local officials and community members have formed voluntary social accountability committees and been trained in the use of an online platform to capture and store information relating to poor governance and corruption in the health sector, often via mobile phones.
The platform strengthened campaign efforts which resulted in the construction of a health centre in Aloni Parish. In Amuru district, five health workers were arrested following reports highlighting negligence.
In the village of Bagega in Nigeria, 400 children died and thousands suffered significant health problems as the result of lead poisoning caused by poor mining practices. The government pledged $5.3m (£3.23m) for remediation, but the funds never reached the affected region.
A local organisation, Follow the Money, created an infographic highlighting the government’s commitments and combined this with real life testimonies and photographs showing the actual situation on the ground. Within 48 hours of a targeted Twitter campaign, the president committed to releasing funds to the village and, in February this year, children started receiving long overdue medical attention.
All these initiatives depend on access to critical government data and an active citizens who feel empowered to effect change in their own lives and communities. At present, it’s often hard to access data which is sufficiently granular, particularly at district or local level. For citizens to be engaged with information from government, it also needs to be accessible in ways that are simple to understand and linked to campaigns that impact their daily lives.
Tracking expenditure can also operate across borders. Donors are beginning to open up aid data by publishing to the IATI registry. This transparency by donor governments should improve the effectiveness of aid spending and contribute towards improved health outcomes.
It’s hard to draw general conclusions about how technology can contribute towards improving health outcomes, particularly when context is so critical and the field is so new. Nonetheless, some themes are emerging which can maximise the chances of an intervention’s success.
It can at times be challenging to encourage citizens to report for an array of reasons, including a lack of belief in their ability to effect change, cultural norms, a lack of time and both perceived and real risks. Still, participation seems to increase when citizens receive feedback from reports submitted and when mechanisms are in place that enable citizens to take collective action. On-the-ground testimonies and evidence can also help shift public opinion and amplify critical messages.
Interventions are dramatically strengthened when integrated into wider programmes, implemented by organisations that have established a strong relationship with the communities in which they work. They need to be backed by at least one strong civil society organisation that can follow up on any reports, queries or challenges which may arise. Where possible, engagement from government and local leaders can make a real difference. Identifying champions within government can also significantly improve responsiveness.”

The value and challenges of public sector information


A paper by M. Henninger in Cosmopolitan Civil Societies: An Interdisciplinary Journal: “The aim of this paper is to explore the concept of public sector information (PSI), what it is, its history and evolution, what constitutes its corpus of documents and the issues and challenges it presents to society, its institutions and to those who use and manage it. The paper, by examining the literatures of the law, political science, civil society, economics and information and library science explores the inherent tensions of access to and use of PSI—pragmatism vs. idealism; openness vs. secrecy; commerce vs. altruism; property vs. commons; public good vs. private good. It focusses on open government data (OGD)—a subset of what is popularly referred to as ‘big data’—its background and development since much of the current debate of its use concerns its commercial value for both the private sector and the public sector itself. In particular it looks at the information itself which, driven by technologies of networks, data mining and visualisation gives value in industrial and economic terms, and in its ability to enable new ideas and knowledge.”

Making Europe's cities smarter


Press Release: “At a conference today hosted by the European Commission, city leaders, CEOs and civil society leaders discussed the actions outlined in the “Smart Cities Strategic Implementation Plan” and how to put them into practice. The Commission announced that it will launch an ‘Invitation for Smart City and Community Commitments’ in spring 2014 to mobilise work on the action plan’s priorities. The plan is part of Europe’s fifth “Innovation Partnership”.
Commission Vice-President Siim Kallas, in charge of transport, said: “I am very pleased to see transport operators, telecoms companies, vehicle manufacturers, city planners, energy companies and researchers all gathered in one room to discuss the future of our cities. The Smart Cities initiative is a great opportunity to make changes happen for less congestion and better business opportunities in our cities. We need to keep up the momentum and move from plan to action now.”
Commission Vice-President Neelie Kroes, responsible for the Digital Agenda, said: “The future of infrastructure and city planning will be based on integrating ICT systems and using big data to make our cities better places to live and work. We need to base those new systems on open standards for hardware, software, data and services which this European Innovation Partnership will develop.”
Günther H. Oettinger, EU Commissioner for energy, said: “The European Innovation Partnership for Smart Cities and Communities is about making investments in sustainable development in as many cities as possible. Creating equal partnerships between cities and companies based on synergies between ICT, energy and mobility will lead to projects that make a real difference in our everyday lives.”
The Commission intends to make available approximately EUR 200 million for Smart Cities and communities in the 2014-2015 budgets of the Horizon 2020 research and innovation programme, to accelerate progress and enlarge the scale of roll-out of smart cities solutions. There will also be possibilities to access the European Structural and Investment Funds.
For more information: http://ec.europa.eu/eip/smartcities/”

The Flow of Technology Talent into Government and Civil Society


Report for the Ford Foundation and the MacArthur Foundation: “As information technology further suffuses every aspect of our lives, government will inevitably have a role to play in ensuring that technology serves the public interest. The ability for government to improve operations and provide services to citizens more efficiently through the effective use of technology is among the greatest contemporary opportunities for the public sector…
Among the key findings of this report:

  • The Current Pipeline Is Insufficient: the vast majority of interviewees indicated that there is a severe paucity of individuals with technical skills in computer science, data science, and the Internet or other information technology expertise in civil society and government. In particular, many of those interviewed noted that existing talent levels fail to meet current needs to develop, leverage, or understand technology.
  • Barriers to Recruitment and Retention Are Acute: many of those interviewed said that substantial barriers thwart the effective recruitment and retention of individuals with the requisite skills in government and civil society. Among the most common barriers mentioned were those of compensation, an inability to pursue groundbreaking work, and a culture that is averse to hiring and utilizing potentially disruptive innovators
  • A Major Gap between the Public-Interest and For-Profit Sectors Persists: as a related matter, interviewees discussed superior for-profit recruitment and retention models. Specifically, the for-profit sector was perceived as providing both more attractive compensation (especially to young talent) and fostering a culture of innovation, openness, and creativity that was seen as more appealing to technologists and innovators.
  • A Need to Examine Models from Other Fields: interviewees noted significant space to develop new models to improve the robustness of the talent pipeline; in part, many existing models were regarded as unsustainable or incomplete. Interviewees did, however, highlight approaches from other fields that could provide relevant lessons to help guide investments in improving this pipeline.
  • Significant Opportunity for Connection and Training: despite consonance among those interviewed that the pipeline was incomplete, many individuals indicated the possibility for improved and more systematic efforts to expose young technologists to public interest issues and connect them to government and civil society careers through internships, fellowships, and other training and recruitment tools.
  • Culture Change Necessary: the culture of government and civil society –and its effects on recruitment and other bureaucratic processes –was seen as a vital challenge that would need to be addressed to improve the pipeline. This view manifested through comments that government and civil society organizations needed to become more open to utilizing technology and adopting a mindset of experimentation….”

6 Projects That Make Data More Accessible Win $100,000 Each From Gates


Chronicle of Philanthropy: “Six nonprofit projects that aim to combine multiple sets of data to help solve social problems have each won $100,000 grants from the Bill & Melinda Gates Foundation…The winners:
• Pushpa Aman Singh, who founded GuideStar India as an effort of the Civil Society Information Services India. GuideStar India is the most comprehensive database of India’s registered charities. It has profiles of more than 4,000 organizations, and Ms. Singh plans to expand that number and the types of information included.
• Development Initiatives, an international aid organization, to support its partnership with the Ugandan nonprofit Development Research and Training. Together, they are trying to help residents of two districts in Uganda identify a key problem the communities face and use existing data sets to build both online and offline tools to help tackle that challenge…
• H.V. Jagadish, at the University of Michigan, to develop a prototype that will merge sets of incompatible geographic data to make them comparable. Mr. Jagadish, a professor of electrical engineering and computer science, points to crime precincts and school districts as an example. “We want to understand the impact of education on crime, but the districts don’t quite overlap with the precincts,” he says. “This tool will address the lack of overlap.”
• Vijay Modi, at Columbia University, to work with government agencies and charities in Nigeria on a tool similar to Foursquare, the social network that allows people to share their location with friends. Mr. Modi, a mechanical-engineering professor and faculty member of the university’s Earth Institute, envisions a tool that will help people find important resources more easily…
• Gisli Olafsson and his team at NetHope, a network of aid organizations. The group is building a tool to help humanitarian charities share their data more widely and in real time—potentially saving more lives during disasters…
• Development Gateway, a nonprofit that assists international development charities with technology, and GroundTruth Initiative, a nonprofit that helps residents of communities learn mapping and media skills. The two groups want to give people living in the slums of Nairobi, Kenya, more detailed information about local schools…”

The Open Government Illusion


Katherine Barrett and Richard Greene in GOVERNING: “The easier it is for us to find important information about cities, counties and states, the better we’re able to report on topics of interest to our readers. But transparency isn’t just about us. It can help citizen organizations, good government bodies, advocacy groups, the press at large and even the general public. What’s more, accessible information makes it easier for legislators and city council members to drill down to the facts, creating more capacity for informed decision-making.
To be sure, progress has been made on a number of transparency fronts, and we certainly appreciate the additional data we’re able to find easily each year. That said, from our personal experience and conversations with experts in the field, much of the talk about heightened transparency in government is more rhetoric than reality.
Take so-called “online spending transparency,” or Web-based checkbooks that offer a clear and simple way to see where tax dollars are going. All 50 states have them. Optimally users would get, according to the nonprofit U.S. Public Interest Research Group (PIRG), a host of “checkbook-level information about expenditures including those made through contracts, grants, tax credits and other discretionary spending.”
Sounds swell, and in fact, PIRG’s studies of the 50 states have revealed consistent improvement. Each year, the organization has raised the bar on its criteria for grading the states. Still, in its most recent work, five states were given an F: California, Hawaii, North Dakota, Wisconsin and Wyoming. According to Phineas Baxandall, senior analyst for U.S. PIRG, in lagging and failing states—the dozen that got D’s and F’s—“you’ll find PDFs instead of searchable, sortable databases; you’ll find much more partial information about departments, and they generally don’t integrate economic subsidies.”
When it comes to disclosures of any kind there’s a huge chunk of information that’s as transparent as a window with the blinds closed. This includes a host of entities that generally don’t get their cash through the general fund. Starting the list are affiliated not-for-profits set up to provide government services and often funded through so-called “corporate funds” or grants, as well as public-private partnerships, authorities and a variety of other quasi-governmental bodies.”