We use big data to sentence criminals. But can the algorithms really tell us what we need to know?


 at the Conversation: “In 2013, a man named Eric L. Loomis was sentenced for eluding police and driving a car without the owner’s consent.

When the judge weighed Loomis’ sentence, he considered an array of evidence, including the results of an automated risk assessment tool called COMPAS. Loomis’ COMPAS score indicated he was at a “high risk” of committing new crimes. Considering this prediction, the judge sentenced him to seven years.

Loomis challenged his sentence, arguing it was unfair to use the data-driven score against him. The U.S. Supreme Court now must consider whether to hear his case – and perhaps settle a nationwide debate over whether it’s appropriate for any court to use these tools when sentencing criminals.

Today, judges across the U.S. use risk assessment tools like COMPAS in sentencing decisions. In at least 10 states, these tools are a formal part of the sentencing process. Elsewhere, judges informally refer to them for guidance.

I have studied the legal and scientific bases for risk assessments. The more I investigate the tools, the more my caution about them grows.

The scientific reality is that these risk assessment tools cannot do what advocates claim. The algorithms cannot actually make predictions about future risk for the individual defendants being sentenced….

Algorithms such as COMPAS cannot make predictions about individual defendants, because data-driven risk tools are based on group statistics. This creates an issue that academics sometimes call the “group-to-individual” or G2i problem.

Scientists study groups. But the law sentences the individual. Consider the disconnect between science and the law here.

The algorithms in risk assessment tools commonly assign specific points to different factors. The points are totaled. The total is then often translated to a risk bin, such as low or high risk. Typically, more points means a higher risk of recidivism.

Say a score of 6 points out of 10 on a certain tool is considered “high risk.” In the historical groups studied, perhaps 50 percent of people with a score of 6 points did reoffend.

Thus, one might be inclined to think that a new offender who also scores 6 points is at a 50 percent risk of reoffending. But that would be incorrect.

It may be the case that half of those with a score of 6 in the historical groups studied would later reoffend. However, the tool is unable to select which of the offenders with 6 points will reoffend and which will go on to lead productive lives.

The studies of factors associated with reoffending are not causation studies. They can tell only which factors are correlated with new crimes. Individuals retain some measure of free will to decide to break the law again, or not.

These issues may explain why risk tools often have significant false positive rates. The predictions made by the most popular risk tools for violence and sex offending have been shown to get it wrong for some groups over 50 percent of the time.

A ProPublica investigation found that COMPAS, the tool used in Loomis’ case, is burdened by large error rates. For example, COMPAS failed to predict reoffending in one study at a 37 percent rate. The company that makes COMPAS has disputed the study’s methodology….

There are also a host of thorny issues with risk assessment tools incorporating, either directly or indirectly, sociodemographic variables, such as gender, race and social class. Law professor Anupam Chander has named it the problem of the “racist algorithm.”

Big data may have its allure. But, data-driven tools cannot make the individual predictions that sentencing decisions require. The Supreme Court might helpfully opine on these legal and scientific issues by deciding to hear the Loomis case…(More)”.

Europol introduce crowdsourcing to catch child abusers


LeakofNations: “The criminal intelligence branch of the European Union, known as Europol, have started a campaign called #TraceAnObject which uses social media crowdsourcing to detect potentially-identifying objects in material that depicts child abuse….

Investigative crowdsourcing has gained traction in academic and journalistic circles in recent years, but this represents the first case of government bureaus relying on social media people-power to conduct more effective analysis.

Journalists are increasingly relying on a combination of high-end computing to organise terabytes of data and internet cloud hubs that allow a consortium of journalists from around the world to share their analysis of the material. In the Panama Papers scoop the Australian software Nuix was used to analyse, extract, and index documents into an encrypted central hub in which thousands of journalists from 80 countries were able to post their workings and assist others in a forum-type setting. This model was remarkably efficient; over 11.5 million documents, dating back to the 1970’s, were analysed in less than a year.

The website Zooinverse has achieved huge success in creating public participation on academic projects, producing the pioneering game Foldit, where participants play with digital models of proteins. The Oxford University-based organisation has now engaged over 1 million volunteers, and has has significant successes in astronomy, ecology, cell biology, humanities, and climate science.

The most complex investigations still require thousands of hours of straightforward tasks that cannot be computerised. The citizen science website Planet Four studies conditions on Mars, and needs volunteers to compare photographs and detect blotches on Mars’ surface – enabling anyone to feel like Elon Musk, regardless of their educational background.

Child abuse is something that incites anger in most people. Crowdsourcing is an opportunity to take the donkey-work away from slow bureaucratic offices and allow ordinary citizens, many of whom felt powerless to protect children from these vile crimes, to genuinely progress cases that will make children safer.

Zooinverse proves that the public are hungry for this kind of work; the ICIJ project model of a central cloud forum shows that crowdsourcing across international borders allows data to be interpreted more efficiently. Europol’s latest idea could well be a huge success.

Even the most basic object could potentially provide vital clues to the culprit’s identity. The most significant items released so far include a school uniform complete with ID card necktie, and a group of snow-covered lodges….(More) (see also #TraceAnObject).

The Way Ahead


Transcript of lecture delivered by Stephen Fry on the 28th May  2017 • Hay Festival, Hay-on-Wye: “Peter Florence, the supremo of this great literary festival, asked me some months ago if I might, as part of Hay’s celebration of the five hundredth anniversary of Martin Luther’s kickstarting of the reformation, suggest a reform of the internet…

You will be relieved to know, that unlike Martin Luther, I do not have a full 95 theses to nail to the door, or in Hay’s case, to the tent flap. It might be worth reminding ourselves perhaps, however, of the great excitements of the early 16th century. I do not think it is a coincidence that Luther grew up as one of the very first generation to have access to printed books, much as some of you may have children who were the first to grow up with access to e-books, to iPads and to the internet….

The next big step for AI is the inevitable achievement of Artificial General Intelligence, or AGI, sometimes called ‘full artificial intelligence’ the point at which machines really do think like humans. In 2013, hundreds of experts were asked when they thought AGI may arise and the median prediction was they year 2040. After that the probability, most would say certain, is artificial super-intelligence and the possibility of reaching what is called the Technological Singularity – what computer pioneer John van Neumann described as the point “…beyond which humans affairs, as we know them, could not continue.” I don’t think I have to worry about that. Plenty of you in this tent have cause to, and your children beyond question will certainly know all about it. Unless of course the climate causes such havoc that we reach a Meteorological Singularity. Or the nuclear codes are penetrated by a self-teaching algorithm whose only purpose is to find a way to launch…

It’s clear that, while it is hard to calculate the cascade upon cascade of new developments and their positive effects, we already know the dire consequences and frightening scenarios that threaten to engulf us. We know them because science fiction writers and dystopians in all media have got there before us and laid the nightmare visions out. Their imaginations have seen it all coming. So whether you believe Ray Bradbury, George Orwell, Aldous Huxley, Isaac Asimov, Margaret Atwood, Ridley Scott, Anthony Burgess, H. G. Wells, Stanley Kubrick, Kazuo Ishiguro, Philip K. Dick, William Gibson, John Wyndham, James Cameron, the Wachowski’s or the scores and scores of other authors and film-makers who have painted scenarios of chaos and doom, you can certainly believe that a great transformation of human society is under way, greater than Gutenberg’s revolution – greater I would submit than the Industrial Revolution (though clearly dependent on it) – the greatest change to our ways of living since we moved from hunting and gathering to settling down in farms, villages and seaports and started to trade and form civilisations. Whether it will alter the behaviour, cognition and identity of the individual in the same way it is certain to alter the behaviour, cognition and identity of the group, well that is a hard question to answer.

But believe me when I say that it is happening. To be frank it has happened. The unimaginably colossal sums of money that have flowed to the first two generations of Silicon Valley pioneers have filled their coffers, their war chests, and they are all investing in autonomous cars, biotech, the IoT, robotics Artificial Intelligence and their convergence. None more so than the outlier, the front-runner Mr Elon Musk whose neural link system is well worth your reading about online on the great waitbutwhy.com website. Its author Tim Urban is a paid consultant of Elon Musk’s so he has the advantage of knowing what he is writing about but the potential disadvantage of being parti pri and lacking in objectivity. Elon Musk made enough money from his part in the founding and running of PayPal to fund his manifold exploits. The Neuralink project joins his Tesla automobile company and subsidiary battery and solar power businesses, his Space X reusable spacecraft group, his OpenAI initiative and Hyperloop transport system. The 1950s and 60s Space Race was funded by sovereign governments, this race is funded by private equity, by the original investors in Google, Apple, Facebook and so on. Nation states and their agencies are not major players in this game, least of all poor old Britain. Even if our politicians were across this issue, and they absolutely are not, our votes would still be an irrelevance….

So one thesis I would have to nail up to the tent is to clamour for government to bring all this deeper into schools and colleges. The subject of the next technological wave, I mean, not pornography and prostitution. Get people working at the leading edge of AI and robotics to come into the classrooms. But more importantly listen to them – even if what they say is unpalatable, our masters must have the intellectual courage and honesty to say if they don’t understand and ask for repetition and clarification. This time, in other words, we mustn’t let the wave engulf us, we must ride its crest. It’s not quite too late to re-gear governmental and educational planning and thinking….

The witlessness of our leaders and of ourselves is indeed a problem. The real danger surely is not technology but technophobic Canute-ism, a belief that we can control, change or stem the technological tide instead of understanding that we need to learn how to harness it. Driving cars is dangerous, but we developed driving lesson requirements, traffic controls, seat-belts, maintenance protocols, proximity sensors, emission standards – all kinds of ways of mitigating the danger so as not to deny ourselves the life-changing benefits of motoring.

We understand why angry Ned Ludd destroyed the weaving machines that were threatening his occupation (Luddites were prophetic in their way, it was weaving machines that first used the punched cards on which computers relied right up to the 1970s). We understand too why French workers took their clogs, their sabots as they were called, and threw them into the machinery to jam it up, giving us the word sabotage. But we know that they were in the end, if you’ll pardon the phrase, pissing into the wind. No technology has ever been stopped.

So what is the thesis I am nailing up? Well, there is no authority for me to protest to, no equivalent of Pope Leo X for it to be delivered to, and I am certainly no Martin Luther. The only thesis I can think worth nailing up is absurdly simple. It is a cry as much from the heart as from the head and it is just one word – Prepare. We have an advantage over our hunter gatherer and farming ancestors, for whether it is Winter that is coming, or a new Spring, is entirely in our hands, so long as we prepare….(More)”.

Our path to better science in less time using open data science tools


Julia S. Stewart Lowndes et al in Nature: “Reproducibility has long been a tenet of science but has been challenging to achieve—we learned this the hard way when our old approaches proved inadequate to efficiently reproduce our own work. Here we describe how several free software tools have fundamentally upgraded our approach to collaborative research, making our entire workflow more transparent and streamlined. By describing specific tools and how we incrementally began using them for the Ocean Health Index project, we hope to encourage others in the scientific community to do the same—so we can all produce better science in less time.

Figure 1: Better science in less time, illustrated by the Ocean Health Index project.
Figure 1

Every year since 2012 we have repeated Ocean Health Index (OHI) methods to track change in global ocean health36,37. Increased reproducibility and collaboration has reduced the amount of time required to repeat methods (size of bubbles) with updated data annually, allowing us to focus on improving methods each year (text labels show the biggest innovations). The original assessment in 2012 focused solely on scientific methods (for example, obtaining and analysing data, developing models, calculating, and presenting results; dark shading). In 2013, by necessity we gave more focus to data science (for example, data organization and wrangling, coding, versioning, and documentation; light shading), using open data science tools. We established R as the main language for all data preparation and modelling (using RStudio), which drastically decreased the time involved to complete the assessment. In 2014, we adopted Git and GitHub for version control, project management, and collaboration. This further decreased the time required to repeat the assessment. We also created the OHI Toolbox, which includes our R package ohicore for core analytical operations used in all OHI assessments. In subsequent years we have continued (and plan to continue) this trajectory towards better science in less time by improving code with principles of tidy data33; standardizing file and data structure; and focusing more on communication, in part by creating websites with the same open data science tools and workflow. See text and Table 1 for more details….(More)”

Citizen Science and Alien Species in Europe


European Commission: “Citizen Science programs aim at creating a bridge between science and the general public, actively involving citizens in research projects. In this way, citizen scientists can work side by side with experts, contributing to the increase of scientific knowledge, addressing local, national and international issues that need scientific support and having the potential to influence policy-making.
The EU Regulation 1143/2014 on Invasive Alien Species (IAS) acknowledges the important role public awareness and active involvement of the citizens have for the successful implementation of the Regulation. Thus, Citizen Science could bring an important contribution to the early detection and monitoring of invasive alien species, as, in order to adopt efficient control measures, it is necessary to know the presence and distribution of these species as soon as possible.
With the new website section we want to disseminate information about how citizens can be involved in activities aimed at protecting European biodiversity, awareness raising, sharing news, examples and developments from the emerging field of Citizen Science.
If you are interested in becoming a citizen scientist and want to help monitor invasive alien species (IAS) in your region, you can use our App “Invasive Alien Species Europe” to report the 37 IAS of Union Concern.
Furthermore, we have compiled a list of European Citizen Science projects dealing with alien species. The list is not exhaustive and is open for improvement …(More)”.

A framework for analyzing digital volunteer contributions in emergent crisis response efforts


 and  in New Media and Society: “Advances in information, communication, and computational technologies allow digital volunteer networks formed by concerned publics across the globe to contribute to an effective response to disasters and crises. Digital volunteer networks are event-centric and emergent networks. Currently, the literature is sharply growing in the fields of communication, computer science, emergency management, and geography. This article aims to assess the current status of the literature and suggest a comprehensive conceptual framework of digital volunteer networks in response to disasters and crises. This framework is based on a traditional input–process–output model consisting of three dimensions: the disaster and crisis context, a voluntary response process, and outputs and outcomes. We also discuss challenges of digital volunteer networks for crisis response. This article is expected to contribute to the development of related theories and hypotheses and practical strategies for managing digital volunteer networks…(More)”,

Policymakers around the world are embracing behavioural science


The Economist: “In 2013 thousands of school pupils in England received a letter from a student named Ben at the University of Bristol. The recipients had just gained good marks in their GCSEs, exams normally taken at age 16. But they attended schools where few pupils progressed to university at age 18, and those that did were likely to go to their nearest one. That suggested the schools were poor at nurturing aspiration. In his letter Ben explained that employers cared about the reputation of the university a job applicant has attended. He pointed out that top universities can be a cheaper option for poorer pupils, because they give more financial aid. He added that he had not known these facts at the recipient’s age.

The letters had the effect that was hoped for. A study published in March found that after leaving school, the students who received both Ben’s letter and another, similar one some months later were more likely to be at a prestigious university than those who received just one of the letters, and more likely again than those who received none. For each extra student in a better university, the initiative cost just £45 ($58), much less than universities’ own attempts to broaden their intake. And the approach was less heavy-handed than imposing quotas for poorer pupils, an option previous governments had considered. The education department is considering rolling out the scheme….

Some critics feared that nudges would do little good, and that their effects would fade over time. Others warned that governments were straying perilously close to mass manipulation. More recently, some of the findings on which the behavioural sciences rest have been questioned, as researchers in many fields have sought to replicate famous results, and failed.

By and large those doubts have been allayed. Even if specific results turn out to be mistaken, an experimental, iterative, data-driven approach to policymaking is gaining ground in many places, not just in dedicated units, but throughout government.

Nudging is hardly new. “In Genesis, Satan nudged, and Eve did too,” writes Cass Sunstein of Harvard University. From the middle of the 20th century psychologists such as Stanley Milgram and Philip Zimbardo showed how sensitive humans are to social pressure. Daniel Kahneman and Amos Tversky described the mental shortcuts and biases that influence decision-making. Dale Carnegie and Robert Cialdini wrote popular books on persuasion. Firms, especially in technology, retail and advertising, used behavioural science to shape brand perception and customer behaviour—and, ultimately, to sell more stuff.

But governments’ use of psychological insights to achieve policy goals was occasional and unsystematic. According to David Halpern, the boss of BIT, as far as policymakers were concerned, psychology was “the sickly sibling to economics”. That began to change after Mr Sunstein and Richard Thaler, an economist, published “Nudge”, in 2008. The book attacked the assumption of rational decision-making inherent in most economic models and showed how “choice architecture”, or context, could be changed to “nudge” people to make better choices…..

Now many governments are turning to nudges to save money and do better. In 2014 the White House opened the Social and Behavioural Sciences Team. A report that year by Mark Whitehead of Aberystwyth University counted 51 countries in which “centrally directed policy initiatives” were influenced by behavioural sciences. Non-profit organisations such as Ideas42, set up in 2008 at Harvard University, help run dozens of nudge-style trials and programmes around the world. In 2015 the World Bank set up a group that is now applying behavioural sciences in 52 poor countries. The UN is turning to nudging to help hit the “sustainable development goals”, a list of targets it has set for 2030….

Among the most effective nudges are “social” ones: those that communicate norms or draw on people’s networks. A scheme tested in Guatemala with help from the World Bank and BIT tweaked the wording of letters sent to people and firms who had failed to submit tax returns the previous year. The letters that framed non-payment as an active choice, or noted that paying up is more common than evasion, cut the number of non-payers in the following year and increased the average sum paid. And a trial involving diabetes shows that it matters to nudge at the right moment. In 2014 Hamad Medical Corporation, a health-care provider in Qatar, raised take-up rates for diabetes screening by offering it during Ramadan. That meant most Qataris were fasting, so the need to do so before the test imposed no extra burden….(More)”.

A Framework for Assessing Technology Hubs in Africa


Paper by Jeremy de BeerPaula MillarJacquelen MwangiVictor B. Nzomo, and Isaac Rutenberg: “This article explains the importance of technology hubs as drivers of innovation, social change, and economic opportunity within and beyond the African continent. The article is the first to thoroughly review and synthesize findings from multi-disciplinary literature, and integrate insights from qualitative data gathered via interviews and fieldwork. It identifies three archetypes of hubs — clusters, companies, and countries — and discusses examples of each archetype using Kenya as a case study. The article discusses potential collaboration, conflicts, and competition among these archetypes of hubs, and concludes with recommendations for future researchers….(More)”

CityDash: Visualising a Changing City Using Open Data


Chapter by Christopher Pettit, Scott N. Lieske and Murad Jamal in Planning Support Science for Smarter Urban Futures: “In an increasingly urbanised world, there are pressures being placed on our cities, which planners, decision-makers, and communities need to be able to respond to. Data driven responses and tools that can support the communication of information, and indicators on a city’s performance are becoming increasingly available and have the potential to play a critical role in understanding and managing complex urban systems . In this research, we will review international efforts in the creation of city dashboards and introduce the City of Sydney Dashboard, known as CityDash. This chapter culminates in a number of recommendations for city dashboards’ implementation. The recommendations for city dashboards include: consolidated information on a single web page, live data feeds relevant to planners and decision-makers as well as citizens’ daily lives, and site analytics as a way of evaluating user interactions and preferences….(More)”.

Harnessing Science, Technology and Innovation to achieve the Sustainable Development Goals


Featured innovations for the second STI Forum: “…The theme of the 2017 High-level Political Forum on Sustainable Development (HLPF) is “Eradicating poverty and promoting prosperity in a changing world“, and the Member States have decided that the HLPF 2017 shall focus on six SDGs (1, 2, 3, 5, 9 and 14) in addition to SDG 17 that will be considered at each HLPF. In this context, the following topic may be considered for the STI Forum 2017: “Science, Technology and Innovation for a Changing World – Focus on SDGs 1, 2, 3, 5, 9, and 14“….

The second Call for Innovations was launched for the sharing of innovations that provide solutions targeted to these six SDGs. Innovators from around the world were invited to submit their scientific and technological solutions to the challenges posed by the six SDGs.The Call for Innovations is now closed. More than 110 inspiring innovations from all the globe were submitted through the Global Innovations Exchange platform.The following outstanding innovators were selected to attend the STI Forum 2017 at UNHQ and showcase their solutions: