Data Science for Social Good


Data Science for Social Good: “By analyzing data from police reports to website clicks to sensor signals, governments are starting to spot problems in real-time and design programs to maximize impact. More nonprofits are measuring whether or not they’re helping people, and experimenting to find interventions that work.
None of this is inevitable, however.
We’re just realizing the potential of using data for social impact and face several hurdles to it’s widespread adoption:

  • Most governments and nonprofits simply don’t know what’s possible yet. They have data – but often not enough and maybe not the right kind.
  • There are too few data scientists out there – and too many spending their days optimizing ads instead of bettering lives.

To make an impact, we need to show social good organizations the power of data and analytics. We need to work on analytics projects that have high social impact. And we need to expose data scientists to the problems that really matter.

The fellowship

That’s exactly why we’re doing the Eric and Wendy Schmidt Data Science for Social Good summer fellowship at the University of Chicago.
We want to bring three dozen aspiring data scientists to Chicago, and have them work on data science projects with social impact.
Working closely with governments and nonprofits, fellows will take on real-world problems in education, health, energy, transportation, and more.
Over the next three months, they’ll apply their coding, machine learning, and quantitative skills, collaborate in a fast-paced atmosphere, and learn from mentors in industry, academia, and the Obama campaign.
The program is led by a strong interdisciplinary team from the Computation institute and the Harris School of Public Policy at the University of Chicago.”

‘Medical Instagram’ helps build a library of reference photos for doctors


Springwise: “The power of the visual sharing that makes platforms such as Instagram so popular has been harnessed by retailers like Ask CT Food to share knowledge about cooking, but could the same be done for the medical world? Figure1 enables health professionals to upload and share photos of conditions, creating online discussion as well as crowdsourcing a database of reference images.
Developed by healthcare tech startup Movable Science, the platform is designed in a similar vein to Instagram and enables medical professionals to create their own feed of images from the cases they deal with. In order to protect patients’ identities, the app uses facial recognition to block out faces, while users can add their own marks to cover up other indentifiable marks. They can also add pointers and annotations, as well as choosing who sees it, before uploading the image. Photos can be tagged with relevant terms to allow the community to easily find them through search and others can comment on the images, fostering discussion among users. Images can also be starred, which acts simultaneously as an indication of quality as well as enabling users to save useful images for later reference. …
Although Instagram was developed with the broad purpose of entertainment and social sharing, Figure1 has tweaked the platform’s functions to provide a tool that could help doctors and students share their knowledge and learn from others in an engaging way…”

Facebook Is Being Redefined by Its Developing World Users


Tom Simonite in MIT Technology Review: “…as Facebook’s user base continues to expand, a growing proportion of its users think of it quite differently, as a luxury brand, badge of status, and or even a place to make a little extra money. That’s due to the rapid growth in the number of Facebook users signing on from developing countries, a trend underscored by news from the company today that more than 100 million people use a mobile app the company makes for feature phones
Little research has been done on Facebook’s growth in developing countries (and a lot would be needed to capture even some of the diversity included under the blanket term “developing world”). Two small, recent studies of Kenyan Facebook users in poor areas by Susan Wyche of Michigan State University are among the first to be published, and they provide some interesting insights.
One of Wyche’s ethnographic studies took place in rural Internet cafes, where the researchers were told that “Facebook is a luxury,” only to be indulged if someone had money to spare (here’s a PDF of Wyche’s paper). When study participants thought about social networking, the challenges of low bandwidth and sometimes unreliable electricity supplies were foremost in their minds.
The barriers of cost and infrastructure associated with Facebook led people in another community Wyche and colleagues visited, a slum of Nairobi, to see the service as for more than just socializing. They used it—with mixed success—as a way to make a little money, look for jobs, market themselves, and seek remittances from friends and family overseas. (This reminded me of a recent report on people in Kuwait using Instagram to sell things and run retail businesses.)…
Should it want to, Facebook could even become a powerful tool for efforts to improve the lives of people in poor areas, where the site is gaining traction. The company has already dabbled with using social engineering to boost organ donations in the U.S. (see “Thank God for Facebook: When Platforms Proselytize”). There’s no shortage of similar experiments that could be run in places with more fundamental health problems, where Facebook’s status as a luxury could make it very influential.”

Accountability.Org: Online Disclosure by Nonprofits


Paper by Joannie Tremblay-Boire and Aseem Prakash: “Why do some nonprofits signal their accountability via unilateral website disclosures? We develop an Accountability Index to examine the websites of 200 U.S. nonprofits ranked by the Chronicle of Philanthropy. We expect nonprofits’ incentives for website disclosures will be shaped by their organizational and sectoral characteristics. Our analysis suggests that nonprofits appearing frequently in the media disclose more accountability information while nonprofits larger in size disclose less. Religion-related nonprofits tend to disclose less information, suggesting that religious bonding enhances trust and reduce incentives for self-disclosure. Health nonprofits disclose less information, arguably because government-mandated disclosures reduce marginal benefits from voluntary disclosures. Education nonprofits, on the other hand, tend to disclose more accountability information perhaps because they supply credence goods. This research contributes to the emerging literature on websites as accountability mechanisms by developing a new index for scholars to use and proposing new hypotheses based on the corporate social responsibility literature.”

Next.Data.gov


Nick Sinai at the White House Blog: “Today, we’re excited to share a sneak preview of a new design for Data.gov, called Next.Data.gov. The upgrade builds on the President’s May 2013 Open Data Executive Order that aims to fuse open-data practices into the Federal Government’s DNA. Next.Data.gov is far from complete (think of it as a very early beta), but we couldn’t wait to share our design approach and the technical details behind it – knowing that we need your help to make it even better.  Here are some key features of the new design:
 

OSTP_nextdata_1 

Leading with Data: The Data.gov team at General Services Administration (GSA), a handful of Presidential Innovation Fellows, and OSTP staff designed Next.Data.Gov to put data first. The team studied the usage patterns on Data.gov and found that visitors were hungry for examples of how data are used. The team also noticed many sources, such as tweets and articles outside of Data.gov featuring Federal datasets in action. So Next.Data.gov includes a rich stream that enables each data community to communicate how its datasets are impacting companies and the public.

OSTP_nextdata_2 

In this dynamic stream, you’ll find blog posts, tweets, quotes, and other features that more fully showcase the wide range of information assets that exist within the vaults of government.
Powerful Search: The backend of Next.Data.gov is CKAN and is powered by Solr—a powerful search engine that will make it even easier to find relevant datasets online. Suggested search terms have been added to help users find (and type) things faster. Next.Data.gov will start to index datasets from agencies that publish their catalogs publicly, in line with the President’s Open Data Executive Order. The early preview launching today features datasets from the Department of Health and Human Services—one of the first Federal agencies to publish a machine-readable version of its data catalog.
Rotating Data Visualizations: Building on the theme of leading with data, even the  masthead-design for Next.Data.gov is an open-data-powered visualization—for now, it’s a cool U.S. Geological Survey earthquake plot showing the magnitude of earthquake measurements collected over the past week, around the globe.

OSTP_nextdata_3 

This particular visualization was built using D3.js. The visualization will be updated periodically to spotlight different ways open data is used and illustrated….
We encourage you to collaborate in the design process by creating pull requests or providing feedback via Quora or Twitter.”

Metrics for Government Reform


Geoff Mulgan: “How do you measure a programme of government reform? What counts as evidence that it’s working or not? I’ve been asked this question many times, so this very brief note suggests some simple answers – mainly prompted by seeing a few writings on this question which I thought confused some basic points.”
Any type of reform programme will combine elements at very different levels. These may include:

  • A new device – for example, adjusting the wording in an official letter or a call centre script to see what impact this has on such things as tax compliance.
  • A new kind of action – for example a new way of teaching maths in schools, treating patients with diabetes, handling prison leavers.
  • A new kind of policy – for example opening up planning processes to more local input; making welfare payments more conditional.
  • A new strategy – for example a scheme to cut carbon in cities, combining retrofitting of housing with promoting bicycle use; or a strategy for public health.
  • A new approach to strategy – for example making more use of foresight, scenarios or big data.
  • A new approach to governance – for example bringing hitherto excluded groups into political debate and decision-making.

This rough list hopefully shows just how different these levels are in their nature. Generally as we go down the list the following things rise:

  • The number of variables and the complexity of the processes involved
  • The timescales over which any judgements can be made
  • The difficultness involved in making judgements about causation
  • The importance of qualitative relative to quantitative assessment”

Crowdsourcing—Harnessing the Masses to Advance Health and Medicine


A Systematic Review of the literature in the Journal of General Internal Medicine: “Crowdsourcing research allows investigators to engage thousands of people to provide either data or data analysis. However, prior work has not documented the use of crowdsourcing in health and medical research. We sought to systematically review the literature to describe the scope of crowdsourcing in health research and to create a taxonomy to characterize past uses of this methodology for health and medical research..
Twenty-one health-related studies utilizing crowdsourcing met eligibility criteria. Four distinct types of crowdsourcing tasks were identified: problem solving, data processing, surveillance/monitoring, and surveying. …
Utilizing crowdsourcing can improve the quality, cost, and speed of a research project while engaging large segments of the public and creating novel science. Standardized guidelines are needed on crowdsourcing metrics that should be collected and reported to provide clarity and comparability in methods.”

The Durkheim Project


Co.Labs: “A new project, newly launched by DARPA and Dartmouth University, is trying something new: Data-mining social networks to spot patterns indicating suicidal behavior.
Called The Durkheim Project, named for the Victorian-era psychologist, it is asking veterans to offer their Twitter and Facebook authorization keys for an ambitious effort to match social media behavior with indications of suicidal thought. Veterans’ online behavior is then fed into a real-time analytics dashboard which predicts suicide risks and psychological episodes… The Durkheim Project is led by New Hampshire-based Patterns and Predictions, a Dartmouth University spin-off with close ties to academics there…
The Durkheim Project is part of DARPA’s Detection and Computational Analysis of Psychological Signals (DCAPS) project. DCAPS is a larger effort designed to harness predictive analytics for veteran mental health–and not just from social media. According to DARPA’s Russell Shilling’s program introduction, DCAPS is also developing algorithms that can data mine voice communications, daily eating and sleeping patterns, in-person social interactions, facial expressions, and emotional states for signs of suicidal thought. While participants in Durkheim won’t receive mental health assistance directly from the project, their contributions will go a long way toward treating suicidal veterans in the future….
The project launched on July 1; the number of veterans participating is not currently known but the finished number is expected to hover around 100,000.”

Microsensors help map crowdsourced pollution data


air-quality-egg-mapElena Craft in GreenBiz: Michael Heimbinder, a Brooklyn entrepreneur, hopes to empower individuals with his small-scale air quality monitoring system, AirCasting. The AirCasting system uses a mobile, Bluetooth-enabled air monitor not much larger than a smartphone to measure carbon dioxide, carbon monoxide, nitrogen dioxide, particulate matter and other pollutants. An accompanying Android app records and formats the information to an emissions map.
Alternatively, another instrument, the Air Quality Egg, comes pre-assembled ready to use. Innovative air monitoring systems, such as AirCasting or the Air Quality Egg, empower ordinary citizens to monitor the pollution they encounter daily and proactively address problematic sources of pollution.
This technology is part of a growing movement to enable the use of small sensors. In response to inquiries about small-sensor data, the EPA is researching the next generation of air measuring technologies. EPA experts are working with sensor developers to evaluate data quality and understand useful sensor applications. Through this ongoing collaboration, the EPA hopes to bolster measurements from conventional, stationary air-monitoring systems with data collected from individuals’ air quality microsensors….
Like many technologies emerging from the big data revolution and innovations in the energy sector, microsensing technology provides a wealth of high-quality data at a relatively low cost. It allows us to track previously undetected air pollution from traditional sources of urban smog, such as highways, and unconventional sources of pollution. Microsensing technology not only educates the public, but also helps to enlighten regulators so that policymakers can work from the facts to protect citizens’ health and welfare.

Can Silicon Valley Save the World?


Charles Kenny and Justin Sandefur in Foreign Policy: “Not content with dominating IPOs on Wall Street, Silicon Valley entrepreneurs are taking their can-do, failure-conquering, technology-enabled tactics to the challenge of global poverty. And why not? If we can look up free Khan Academy math lectures using the cheap, kid-friendly computers handed out by the folks at One Laptop per Child, who needs to worry about the complexities of education reform? With a lamp lit up by an electricity-generating soccer ball in every hut, who needs coal-fired power stations and transmission lines? And if even people in refugee camps can make money transcribing outsourced first-world dental records, who needs manufacturing or the roads and port systems required to export physical goods? No wonder the trendiest subject these days for TED talks is cracking the code on digital-era do-gooding, with 100 recent talks and counting just on the subjects of Africa and development…
But entrepreneurial spirit and even the fanciest of gadgets will only get you so far. All the technological transformation of the last 200 years hasn’t come close to wiping out global poverty. More than half the planet still lives on less than $4 a day, and 2.4 billion people live on less than $2 a day. And that’s after a decade that saw the biggest drop in extreme poverty ever. What’s more, millions and millions of people still die annually from easily and cheaply preventable or treatable diseases like diarrhea and pneumonia. None of this is for a lack of science; often it isn’t even for lack of money. It is because parents don’t follow simple health practices like washing their hands, government bureaucrats can’t or won’t provide basic water and sanitation programs, and arbitrary immigration restrictions prevent the poor from moving to places with better opportunities.
Sorry, but no iPhone, even one loaded with the coolest apps, is going to change all that….
SO WHAT CAN BE DONE to harness technological innovation, filter the good ideas from the bad, and spread a little of Silicon Valley’s fairy dust on the world’s poorer regions? The answer, according to Harvard economist Michael Kremer, is market discipline and rigorous testing. Kremer is a MacArthur “genius” grant winner whose name pops up in speculation about future Nobel Prize contenders. He thinks that technological fixes can dramatically improve the lives of the global poor, but markets won’t provide the right innovations without support.”