Infection forecasts powered by big data


Michael Eisenstein at Nature: “…The good news is that the present era of widespread access to the Internet and digital health has created a rich reservoir of valuable data for researchers to dive into….By harvesting and combining these streams of big data with conventional ways of monitoring infectious diseases, the public-health community could gain fresh powers to catch and curb emerging outbreaks before they rage out of control.

Going viral

Data scientists at Google were the first to make a major splash using data gathered online to track infectious diseases. The Google Flu Trends algorithm, launched in November 2008, combed through hundreds of billions of users’ queries on the popular search engine to look for small increases in flu-related terms such as symptoms or vaccine availability. Initial data suggested that Google Flu Trends could accurately map the incidence of flu with a lag of roughly one day. “It was a very exciting use of these data for the purpose of public health,” says Brownstein. “It really did start a whole revolution and new field of work in query data.”

Unfortunately, Google Flu Trends faltered when it mattered the most, completely missing the onset in April 2009 of the H1N1 pandemic. The algorithm also ran into trouble later on in the pandemic. It had been trained against seasonal fluctuations of flu, says Viboud, but people’s behaviour changed in the wake of panic fuelled by media reports — and that threw off Google’s data. …

Nevertheless, its work with Internet usage data was inspirational for infectious-disease researchers. A subsequent study from a team led by Cecilia Marques-Toledo at the Federal University of Minas Gerais in Belo Horizonte, Brazil, used Twitter to get high-resolution data on the spread of dengue fever in the country. The researchers could quickly map new cases to specific cities and even predict where the disease might spread to next (C. A. Marques-Toledo et al. PLoS Negl. Trop. Dis. 11, e0005729; 2017). Similarly, Brownstein and his colleagues were able to use search data from Google and Twitter to project the spread of Zika virus in Latin America several weeks before formal outbreak declarations were made by public-health officials. Both Internet services are used widely, which makes them data-rich resources. But they are also proprietary systems for which access to data is controlled by a third party; for that reason, Generous and his colleagues have opted instead to make use of search data from Wikipedia, which is open source. “You can get the access logs, and how many people are viewing articles, which serves as a pretty good proxy for search interest,” he says.

However, the problems that sank Google Flu Trends still exist….Additionally, online activity differs for infectious conditions with a social stigma such as syphilis or AIDS, because people who are or might be affected are more likely to be concerned about privacy. Appropriate search-term selection is essential: Generous notes that initial attempts to track flu on Twitter were confounded by irrelevant tweets about ‘Bieber fever’ — a decidedly non-fatal condition affecting fans of Canadian pop star Justin Bieber.

Alternatively, researchers can go straight to the source — by using smartphone apps to ask people directly about their health. Brownstein’s team has partnered with the Skoll Global Threats Fund to develop an app called Flu Near You, through which users can voluntarily report symptoms of infection and other information. “You get more detailed demographics about age and gender and vaccination status — things that you can’t get from other sources,” says Brownstein. Ten European Union member states are involved in a similar surveillance programme known as Influenzanet, which has generally maintained 30,000–40,000 active users for seven consecutive flu seasons. These voluntary reporting systems are particularly useful for diseases such as flu, for which many people do not bother going to the doctor — although it can be hard to persuade people to participate for no immediate benefit, says Brownstein. “But we still get a good signal from the people that are willing to be a part of this.”…(More)”.
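The search-term problem noted in the excerpt (flu tracking confounded by "Bieber fever" tweets) can be illustrated with a minimal sketch. The term lists and the function names `is_flu_related` and `count_flu_posts` are hypothetical, chosen only for illustration; they are not the filters used by any of the researchers quoted above.

```python
import re

# Hypothetical term lists, for illustration only.
FLU_TERMS = re.compile(r"\b(flu|influenza|fever|chills)\b", re.IGNORECASE)
NOISE_PHRASES = re.compile(r"\bbieber\s+fever\b", re.IGNORECASE)


def is_flu_related(text: str) -> bool:
    """Return True if the text mentions a flu term outside a known noise phrase."""
    # Strip noise phrases first so the "fever" in "Bieber fever" cannot match.
    cleaned = NOISE_PHRASES.sub(" ", text)
    return FLU_TERMS.search(cleaned) is not None


def count_flu_posts(posts) -> int:
    """Count the posts that pass the keyword filter."""
    return sum(1 for post in posts if is_flu_related(post))
```

A filter like this only removes noise that someone has already thought to list; it would not, on its own, correct for the shifts in behaviour that the excerpt says threw off Google Flu Trends.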

Launching the Data Culture Project


New project by MIT Center for Civic Media and the Engagement Lab@Emerson College: “Learning to work with data is like learning a new language — immersing yourself in the culture is the best way to do it. For some individuals, this means jumping into tools like Excel, Tableau, programming, or R Studio. But what does this mean for a group of people that work together? We often talk about data literacy as if it’s an individual capacity, but what about data literacy for a community? How does an organization learn how to work with data?

About a year ago we (Rahul Bhargava and Catherine D’Ignazio) found that more and more users of our DataBasic.io suite of tools and activities were asking this question — online and in workshops. In response, with support from the Stanford Center on Philanthropy and Civil Society, we’ve worked together with 25 organizations to create the Data Culture Project. We’re happy to launch it publicly today! Visit datacultureproject.org to learn more.

The Data Culture Project is a hands-on learning program to kickstart a data culture within your organization. We provide facilitation videos to help you run creative introductions to get people across your organization talking to each other — from IT to marketing to programs to evaluation. These are not boring spreadsheet trainings! Try running our fun activities — one per month works as a brown bag lunch to focus people on a common learning goal. For example, “Sketch a Story” brings people together around basic concepts of quantitative text analysis and visual storytelling. “Asking Good Questions” introduces principles of exploratory data analysis in a fun environment. What’s more, you can use the sample data that we provide, or you can integrate your organization’s data as the topic of conversation and learning….(More)”.

Your Data Is Crucial to a Robotic Age. Shouldn’t You Be Paid for It?


The New York Times: “The idea has been around for a bit. Jaron Lanier, the tech philosopher and virtual-reality pioneer who now works for Microsoft Research, proposed it in his 2013 book, “Who Owns the Future?,” as a needed corrective to an online economy mostly financed by advertisers’ covert manipulation of users’ consumer choices.

It is being picked up in “Radical Markets,” a book due out shortly from Eric A. Posner of the University of Chicago Law School and E. Glen Weyl, principal researcher at Microsoft. And it is playing into European efforts to collect tax revenue from American internet giants.

In a report obtained last month by Politico, the European Commission proposes to impose a tax on the revenue of digital companies based on their users’ location, on the grounds that “a significant part of the value of a business is created where the users are based and data is collected and processed.”

Users’ data is a valuable commodity. Facebook offers advertisers precisely targeted audiences based on user profiles. YouTube, too, uses users’ preferences to tailor its feed. Still, this pales in comparison with how valuable data is about to become, as the footprint of artificial intelligence extends across the economy.

Data is the crucial ingredient of the A.I. revolution. Training systems to perform even relatively straightforward tasks like voice translation, voice transcription or image recognition requires vast amounts of data — like tagged photos, to identify their content, or recordings with transcriptions.

“Among leading A.I. teams, many can likely replicate others’ software in, at most, one to two years,” notes the technologist Andrew Ng. “But it is exceedingly difficult to get access to someone else’s data. Thus data, rather than software, is the defensible barrier for many businesses.”

We may think we get a fair deal, offering our data as the price of sharing puppy pictures. By other metrics, we are being victimized: In the largest technology companies, the share of income going to labor is only about 5 to 15 percent, Mr. Posner and Mr. Weyl write. That’s way below Walmart’s 80 percent. Consumer data amounts to work they get free….

The big question, of course, is how we get there from here. My guess is that it would be naïve to expect Google and Facebook to start paying for user data of their own accord, even if that improved the quality of the information. Could policymakers step in, somewhat the way the European Commission did, demanding that technology companies compute the value of consumer data?…(More)”.

New game aims to inoculate people against fake news


Springwise: “The term ‘fake news’ has become all too common in media coverage. However, a news item doesn’t have to be entirely made up to be misleading. Many fake news stories intend to deceive, often with a political agenda. Disinformation works because many people fail to recognise false information. A recent study, conducted by Britain’s Channel 4, found that only four percent of those surveyed could tell fake news from real. So how to inoculate people against fake news? Dutch organisation DROG, which works against the spread of disinformation, has teamed up with researchers at Cambridge University in the United Kingdom to develop a game that they claim can help confer resistance against false or misleading information.

The game, titled The Bad News Game, works by putting players in the position of creating fake news, so that they gain insight into the tactics and methods used by ‘real’ fake news-mongers to spread their message. This, in turn, builds up resistance to fake news. In the game, players are shown short texts or images and can react to them in a variety of ways. Choosing an option similar to that followed by a ‘real’ producer of disinformation earns the player more followers and credibility. A player who lies too blatantly, chooses an option that is obviously ridiculous, or acts in line with journalistic best practices will lose followers and credibility. The aim of the game is to gather as many followers as possible without losing too much credibility.

The Bad News Game is suitable for use in schools and takes around 20 minutes to complete. It joins other recent socially conscious educational innovations such as a cooking app that encourages healthy eating and a board game that eases discussions about arranged marriages….(More)”.

Building Democratic Infrastructure


Hollie Russon Gilman, K. Sabeel Rahman, & Elena Souris in Stanford Social Innovation Review: “How can civic engagement be effective in fostering an accountable, inclusive, and responsive American democracy? This question has gained new relevance under the Trump administration, where a sense of escalating democratic crises risks obscuring any nascent grassroots activism. Since the 2016 election, the twin problems of authoritarianism and insufficient political accountability have attracted much attention, as has the need to mobilize for near-future elections. These things are critical for the long-term health of American democracy, but at the same time, it’s not enough to focus solely on Washington or to rely on electoral campaigns to salvage our democracy.

Conventional civic-engagement activities such as canvassing, registering voters, signing petitions, and voting are largely transient experiences, offering little opportunity for civic participation once the election is over. And such tactics often do little to address the background conditions that make participation more difficult for marginalized communities.

To address these issues, civil society organizations and local governments should build more long-term and durable democratic infrastructure, with the aim of empowering constituencies to participate in meaningful and concrete ways, overcoming division within our societies, and addressing a general distrust of government by enhancing accountability.

In our work with groups like the Center for Rural Strategies in Appalachia and the Chicago-based Inner-City Muslim Action Network, as well as with local government officials in Eau Claire, Wis. and Boston, Mass., we identify two areas where we can help build a broader democratic infrastructure for the long haul. First, we need to support and radically expand efforts by local-level government officials to innovate more participatory and accountable forms of policymaking. And then we need to continue developing new methods of diverse, cross-constituency organizing that can help build more inclusive identities and narratives. Achieving this more-robust form of democracy will require that many different communities—including organizers and advocacy groups, policymakers and public officials, technologists, and funders—combine their efforts….(More)”.

Trustworthy data will transform the world


The Financial Times: “The internet’s original sin was identified as early as 1993 in a New Yorker cartoon. “On the internet, nobody knows you’re a dog,” the caption ran beneath an illustration of a pooch at a keyboard. That anonymity has brought some benefits. But it has also created myriad problems, injecting distrust into the digital world. If you do not know the provenance and integrity of information and data, how can you trust their veracity?

That has led to many of the scourges of our times, such as cyber crime, identity theft and fake news. In his Alan Turing Institute lecture in London last week, the American computer scientist Sandy Pentland outlined the massive gains that could result from trusted data.

The MIT professor argued that the explosion of such information would give us the capability to understand our world in far more detail than ever before. Most of what we know in the fields of sociology, psychology, political science and medicine is derived from tiny experiments in controlled environments. But the data revolution enables us to observe behaviour as it happens at mass scale in the real world. That feedback could provide invaluable evidence about which theories are most valid and which policies and products work best.

The promise is that we make soft social science harder and more predictive. That, in turn, could lead to better organisations, fairer government, and more effective monitoring of our progress towards achieving collective ambitions, such as the UN’s sustainable development goals. To take one small example, Mr Pentland illustrated the strong correlation between connectivity and wealth. By studying the telephone records of 100,000 users in south-east Asia, researchers have plotted social connectivity against income. The conclusion: “The more diverse your connections, the more money you have.” This is not necessarily a causal relationship but it does have a strong causal element, he suggested.

Similar studies of European cities have shown an almost total segregation between groups of different socio-economic status. That lack of connectivity has to be addressed if our politics is not to descend further into a meaningless dialogue.

Data give us a new way to measure progress.

For years, the Open Data movement has been working to create public data sets that can better inform decision making. This worldwide movement is prising open anonymised public data sets, such as transport records, so that they can be used by academics, entrepreneurs and civil society groups. However, much of the most valuable data is held by private entities, notably the consumer tech companies, telecoms operators, retailers and banks. “The big win would be to include private data as a public good,” Mr Pentland said….(More)”.
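The "diversity of connections" mentioned in the excerpt can be made concrete with a small sketch. The excerpt does not say how the researchers measured diversity; the normalised Shannon entropy below is an assumption, one common way to score how evenly a person's calls are spread across their contacts, and `connection_diversity` is a hypothetical name.

```python
import math
from collections import Counter


def connection_diversity(contacts) -> float:
    """Normalised Shannon entropy of calls per contact, between 0 and 1.

    `contacts` is a sequence with one contact label per call. A score near 0
    means almost all calls go to one contact; a score of 1 means calls are
    spread evenly over all of the user's contacts. (Assumed measure, not
    taken from the study cited in the excerpt.)
    """
    counts = Counter(contacts)
    if len(counts) < 2:
        # With zero or one distinct contact there is nothing to spread over.
        return 0.0
    total = sum(counts.values())
    entropy = -sum((n / total) * math.log(n / total) for n in counts.values())
    # Dividing by log(k) rescales the entropy to the 0..1 range for k contacts.
    return entropy / math.log(len(counts))
```

Scores computed this way could then be compared with income figures, which is the kind of plot the excerpt describes; the sketch says nothing about causation.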

Mobile Data Collection Toolkit


Guide for the use of MDC in the humanitarian and development field: “This webpage aims at sharing documentation produced jointly by Terre des hommes (Tdh) and CartONG to help humanitarians and development actors use Mobile Data Collection (MDC) more efficiently in the field.

You will find tutorials and training material concerning all the phases of MDC, from thinking through the prerequisites of using MDC to the preparation of your forms and tools and the analysis of your data.

In addition to the MDC documentation, you can also find a “Starter Kit” for data protection in humanitarian and development operations, as well as “Data Visualization” material on the Analysis page, produced to help organizations better visualize the results of their data analyses.

These were made for Terre des hommes staff but are shared “as-is” as they could be useful for other NGOs. …(More)”.

Using Open Data for Public Services


New report by the Open Data Institute: “…Today we’re publishing our initial findings based on examining 8 examples where open data supports the delivery of a public service. We have defined 3 high-level ‘patterns’ for how open data is used in public services. We think these could be helpful for others looking to redesign and deliver better services.

The patterns are summarised in the table below:

The first pattern is perhaps the model which everyone is most familiar with as it’s used by the likes of Citymapper, who use open transport data from Transport for London to inform passengers about routes and timings, and other citizen-focused apps. Data is released by a public sector organisation about a public service and a third organisation uses this data to provide a complementary service, online or face-to-face, to help citizens use the public service.

The second pattern involves the release of open data in the service delivery chain. Open data is used to plan public service delivery and make service delivery chains more efficient. Examples provided in the report include local authorities’ release of open spending, contract and tender data, which is used by Spend Network to support better value for money in public expenditure.

In the third pattern, public sector organisations commissioning services and external organisations involved in service delivery make strategic decisions based on insights and patterns revealed by open data. Visualisations of open data can inform policies on job seeker allowance, as shown in the example from the Department for Work and Pensions in the report.

As well as identifying these patterns, we have created ecosystem maps of the public services we have examined to help understand the relationships and the mechanisms by which open data supports each of them….

Having compared the ecosystems of the examples we have considered so far, the report sets out practical recommendations for those involved in the delivery of public services and for Central Government for the better use of open data in the delivery of public services.

The recommendations are focused on organisational collaboration; technology infrastructure, digital skills and literacy; open standards for data; senior level championing; peer networks; intermediaries; and problem focus….(More)”.

When Fighting Fake News Aids Censorship


Courtney C. Radsch at Project Syndicate: “Many media analysts have rightly identified the dangers posed by “fake news,” but often overlook what the phenomenon means for journalists themselves. Not only has the term become a shorthand way to malign an entire industry; autocrats are invoking it as an excuse to jail reporters and justify censorship, often on trumped-up charges of supporting terrorism.

Around the world, the number of honest journalists jailed for publishing fake or fictitious news is at an all-time high of at least 21. As non-democratic leaders increasingly use the “fake news” backlash to clamp down on independent media, that number is likely to climb.

The United States, once a world leader in defending free speech, has retreated from this role. President Donald Trump’s Twitter tirades about “fake news” have given autocratic regimes an example by which to justify their own media crackdowns. In December, China’s state-run People’s Daily newspaper posted tweets and a Facebook post welcoming Trump’s fake news mantra, noting that it “speaks to a larger truth about Western media.” This followed the Egyptian government’s praise for the Trump administration in February 2017, when the country’s foreign ministry criticized Western journalists for their coverage of global terrorism.

And in January 2017, Turkish President Recep Tayyip Erdoğan praised Trump for berating a CNN reporter during a live news conference. Erdoğan, who criticized the network for its coverage of pro-democracy protests in Turkey in 2013, said that Trump had put the journalist “in his place.” Trump returned the compliment when he met Erdoğan a few months later. Praising his counterpart for being an ally in the fight against terrorism, Trump made no mention of Erdoğan’s own dismal record on press freedom.

It is no accident that these three countries have been quickest to embrace Trump’s “fake news” trope. China, Egypt, and Turkey accounted for more than half of the world’s jailed journalists in 2017, continuing a trend from the previous year. The international community’s silence in the face of these governments’ attacks on independent media seems to have been interpreted as consent….(More)”.

Why the web has challenged scientists’ authority – and why they need to adapt


Andrew J. Hoffman at The Conversation: “Academia is in the midst of a crisis of relevance. Many Americans are ignoring the conclusions of scientists on a variety of issues including climate change and natural selection. Some state governments are cutting funding for higher education; the federal government is threatening to cut funding for research. Resentful students face ever increasing costs for tuition.

And distrustful segments of society fear what academia does; one survey found that 58 percent of Republicans and Republican-leaning independents say colleges and universities have a negative effect on the way things are going in the country.

There are multiple causes for this existential crisis, but one in particular deserves special attention. The web is fundamentally changing the channels through which science is communicated – who can create it, who can access it and ultimately what it is. Society now has instant access to more news and information than ever before; knowledge is being democratized. And as a result, the role of the scientist in society is in flux.

But rather than facing this changing landscape head on, research shows that many in academia are resisting its inevitability. In many ways, this response has parallels to that of the Catholic Church in the wake of the invention of the printing press and its role in hastening the Protestant Reformation. I hope this comparison offers a compelling provocation for the scientific community to come to grips with the cataclysmic changes we are now living through and ignore at our peril….(More)”.