Crowdsourced Science: Sociotechnical Epistemology in the e-Research Paradigm
Paper by David Watson and Luciano Floridi: “Recent years have seen a surge in online collaboration between experts and amateurs on scientific research. In this article, we analyse the epistemological implications of these crowdsourced projects, with a focus on Zooniverse, the world’s largest citizen science web portal. We use quantitative methods to evaluate the platform’s success in producing large volumes of observation statements and high impact scientific discoveries relative to more conventional means of data processing. Through empirical evidence, Bayesian reasoning, and conceptual analysis, we show how information and communication technologies enhance the reliability, scalability, and connectivity of crowdsourced e-research, giving online citizen science projects powerful epistemic advantages over more traditional modes of scientific investigation. These results highlight the essential role played by technologically mediated social interaction in contemporary knowledge production. We conclude by calling for an explicitly sociotechnical turn in the philosophy of science that combines insights from statistics and logic to analyse the latest developments in scientific research….(More)”
Big data may be reinforcing racial bias in the criminal justice system
Laurel Eckhouse at the Washington Post: “Big data has expanded to the criminal justice system. In Los Angeles, police use computerized “predictive policing” to anticipate crimes and allocate officers. In Fort Lauderdale, Fla., machine-learning algorithms are used to set bond amounts. In states across the country, data-driven estimates of the risk of recidivism are being used to set jail sentences.
Advocates say these data-driven tools remove human bias from the system, making it more fair as well as more effective. But even as they have become widespread, we have little information about exactly how they work. Few of the organizations producing them have released the data and algorithms they use to determine risk.
To make matters worse, risk-assessment tools typically evaluate their success in predicting a defendant’s dangerousness on rearrests — not on defendants’ overall behavior after release. If our two defendants return to the same neighborhood and continue their identical lives, the black defendant is more likely to be arrested. Thus, the tool will falsely appear to predict dangerousness effectively, because the entire process is circular: Racial disparities in arrests bias both the predictions and the justification for those predictions.
We know that a black person and a white person are not equally likely to be stopped by police: Evidence on New York’s stop-and-frisk policy, investigatory stops, vehicle searches and drug arrests show that black and Latino civilians are more likely to be stopped, searched and arrested than whites. In 2012, a white attorney spent days trying to get himself arrested in Brooklyn for carrying graffiti stencils and spray paint, a Class B misdemeanor. Even when police saw him tagging the City Hall gateposts, they sped past him, ignoring a crime for which 3,598 people were arrested by the New York Police Department the following year.
Before adopting risk-assessment tools in the judicial decision-making process, jurisdictions should demand that any tool being implemented undergo a thorough and independent peer-review process. We need more transparencyand better data to learn whether these risk assessments have disparate impacts on defendants of different races. Foundations and organizations developing risk-assessment tools should be willing to release the data used to build these tools to researchers to evaluate their techniques for internal racial bias and problems of statistical interpretation. Even better, with multiple sources of data, researchers could identify biases in data generated by the criminal justice system before the data is used to make decisions about liberty. Unfortunately, producers of risk-assessment tools — even nonprofit organizations — have not voluntarily released anonymized data and computational details to other researchers, as is now standard in quantitative social science research….(More)”.
How to Do Social Science Without Data
Neil Gross in the New York Times: With the death last month of the sociologist Zygmunt Bauman at age 91, the intellectual world lost a thinker of rare insight and range. Because his style of work was radically different from that of most social scientists in the United States today, his passing is an occasion to consider what might be gained if more members of our profession were to follow his example….
Weber saw bureaucracies as powerful, but dispiritingly impersonal. Mr. Bauman amended this: Bureaucracy can be inhuman. Bureaucratic structures had deadened the moral sense of ordinary German soldiers, he contended, which made the Holocaust possible. They could tell themselves they were just doing their job and following orders.
Later, Mr. Bauman turned his scholarly attention to the postwar and late-20th-century worlds, where the nature and role of all-encompassing institutions were again his focal point. Craving stability after the war, he argued, people had set up such institutions to direct their lives — more benign versions of Weber’s bureaucracy. You could go to work for a company at a young age and know that it would be a sheltering umbrella for you until you retired. Governments kept the peace and helped those who couldn’t help themselves. Marriages were formed through community ties and were expected to last.
But by the end of the century, under pressure from various sources, those institutions were withering. Economically, global trade had expanded, while in Europe and North America manufacturing went into decline; job security vanished. Politically, too, changes were afoot: The Cold War drew to an end, Europe integrated and politicians trimmed back the welfare state. Culturally, consumerism seemed to pervade everything. Mr. Bauman noted major shifts in love and intimacy as well, including a growing belief in the contingency of marriage and — eventually — the popularity of online dating.
In Mr. Bauman’s view, it all connected. He argued we were witnessing a transition from the “solid modernity” of the mid-20th century to the “liquid modernity” of today. Life had become freer, more fluid and a lot more risky. In principle, contemporary workers could change jobs whenever they got bored. They could relocate abroad or reinvent themselves through shopping. They could find new sexual partners with the push of a button. But there was little continuity.
Mr. Bauman considered the implications. Some thrived in this new atmosphere; the institutions and norms previously in place could be stultifying, oppressive. But could a transient work force come together to fight for a more equitable distribution of resources? Could shopping-obsessed consumers return to the task of being responsible, engaged citizens? Could intimate partners motivated by short-term desire ever learn the value of commitment?…(More)”
Beyond prediction: Using big data for policy problems
Susan Athey at Science: “Machine-learning prediction methods have been extremely productive in applications ranging from medicine to allocating fire and health inspectors in cities. However, there are a number of gaps between making a prediction and making a decision, and underlying assumptions need to be understood in order to optimize data-driven decision-making…(More)”
Crowdsourcing to Be the Future for Medical Research
“The benefits of crowdsourcing are substantial,” said Rose Marie Robertson, MD, Chief Science Officer of the AHA, who took part in writing the guide. “You can get information from new perspectives and highly innovative ideas that might well not have occurred to you.”
Crowdsourcing Medical Research Priorities: A Guide for Funding Agencies is the work of Precision Medicine Advances using Nationally Crowdsourced Comparative Effectiveness Research (PRANCCER), a joint initiative launched in 2015 by the AHA and PCORI.
“Acknowledging the power of open, multidisciplinary research to drive medical progress, AHA and PCORI turned to the rapidly evolving methodology of crowdsourcing to find out what patients, clinicians, and researchers consider the most urgent priorities in cardiovascular medicine and to shape the direction and design of research targeting those priorities,” according to the guide.
“Engaging patients and other healthcare decision makers in identifying research needs and guiding studies is a hallmark of our patient-centered approach to research, and crowdsourcing offers great potential to catalyze such engagement,” said PCORI Executive Director Joe V. Selby, MD. “We hope the input we’ve received will help us develop new research funding opportunities that will lead to improved care for people with cardiovascular conditions.”
The playbook offers more than a dozen recommendations on the ins and outs of medical crowdsourcing. It stresses the need to have crystal clear objectives and questions, whether you’re dealing with patients, researchers, or clinicians. … (More)”
Mass Observation: The amazing 80-year experiment to record our daily lives
William Cook at BBC Arts: “Eighty years ago, on 30th January 1937, the New Statesman published a letter which launched the largest (and strangest) writers’ group in British literary history.
An anthropologist called Tom Harrisson, a journalist called Charles Madge and a filmmaker called Humphrey Jennings wrote to the magazine asking for volunteers to take part in a new project called Mass Observation. Over a thousand readers responded, offering their services. Remarkably, this ‘scientific study of human social behaviour’ is still going strong today.
Mass Observation was the product of a growing interest in the social sciences, and a growing belief that the mass media wasn’t accurately reflecting the lives of so-called ordinary people. Instead of entrusting news gathering to jobbing journalists, who were under pressure to provide the stories their editors and proprietors wanted, Mass Observation recruited a secret army of amateur reporters, to track the habits and opinions of ‘the man in the street.’
Ironically, the three founders of this egalitarian movement were all extremely well-to-do. They’d all been to public schools and Oxbridge, but this was the ‘Age of Anxiety’, when capitalism was in chaos and dangerous demagogues were on the rise (plus ça change…).
For these idealistic public schoolboys, socialism was the answer, and Mass Observation was the future. By finding out what ‘ordinary’ folk were really doing, and really thinking, they would forge a new society, more attuned to the needs of the common man.
Mass Observation selected 500 citizen journalists, and gave them regular ‘directives’ to report back on virtually every aspect of their daily lives. They were guaranteed anonymity, which gave them enormous freedom. People opened up about themselves (and their peers) to an unprecedented degree.
Even though they were all unpaid, correspondents devoted a great deal of time to this endeavour – writing at great length, in great detail, over many years. As well as its academic value, Mass Observation proved that autobiography is not the sole preserve of the professional writer. For all of us, the urge to record and reflect upon our lives is a basic human need.
The Second World War was the perfect forum for this vast collective enterprise. Mass Observation became a national diary of life on the home front. For historians, the value of such uncensored revelations is enormous. These intimate accounts of air raids and rationing are far more revealing and evocative than the jolly state-sanctioned reportage of the war years.
After the war, Mass Observation became more commercial, supplying data for market research, and during the 1960s this extraordinary experiment gradually wound down. It was rescued from extinction by the historian Asa Briggs….
The founders of Mass Observation were horrified by what they called “the revival of racial superstition.” Hitler, Franco and Mussolini were in the forefront of their minds. “We are all in danger of extinction from such outbursts of atavism,” they wrote, in 1937. “We look to science to help us, only to find that science is too busy forging new weapons of mass destruction.”
For its founders, Mass Observation was a new science which would build a better future. For its countless correspondents, however, it became something more than that – not merely a social science, but a communal work of art….(More)”.
Using data and design to support people to stay in work
Cat Drew at Civil Service Quarterly: “…Data and digital are fairly understandable concepts in policy-making. But design? Why is it one of the three Ds?
Policy Lab believes that design approaches are particularly suited to complex issues that have multiple causes and for which there is no one, simple answer. Design encourages people to think about the user’s needs (not just the organisation’s needs), brings in different perspectives to innovate new ideas, and then prototypes (mocks them up and tries them out) to iteratively improve ideas until they find one that can be scaled up.

Policy Lab also recognises that data alone cannot solve policy problems, and has been experimenting with how to combine numerical and more human practices. Data can explain what is happening, while design research methods – such as ethnography, observing people’s behaviours – can explain why things are happening. Data can be used to automate and tailor public services; while design means frontline delivery staff and citizens will actually know about and use them. Data-rich evidence is highly valued by policy-makers; and design can make it understandable and accessible to a wider group of people, opening up policy-making in the process.
The Lab is also experimenting with new data methods.
Data science can be used to look at complex, unstructured data (social media data, for example), in real time. Digital data, such as social media data or internet searches, can reveal how people behave (rather than how they say they behave). It can also look at huge amounts of data far quicker than humans, and find unexpected patterns hidden in the data. Powerful computers can identify trends from historical data and use these to predict what might happen in the future.
Supporting people in work project
The project took a DDD approach to generating insight and then creating ideas. The team (including the data science organisation Mastodon C and design agency Uscreates) used data science techniques together with ethnography to create a rich picture about what was happening. Then it used design methods to create ideas for digital services with the user in mind, and these were prototyped and tested with users.
The data science confirmed many of the known risk factors, but also revealed some new insights. It told us what was happening at scale, and the ethnography explained why.
- The data science showed that people were more likely to go onto sickness benefits if they had been in the job a shorter time. The ethnography explained that the relationship with the line manager and a sense of loyalty were key factors in whether someone stayed in work or went onto benefits.
- The data science showed that women with clinical depression were less likely to go onto sickness benefits than men with the same condition. The ethnography revealed how this played out in real life:
- For example, Ella [not her real name], a teacher from London who had been battling with depression at work for a long time but felt unable to go to her boss about it. She said she was “relieved” when she got cancer, because she could talk to her boss about a physical condition and got time off to deal with both illnesses.
- The data science also allowed the segmentation of groups of people who said they were on health-related benefits. Firstly, the clustering revealed that two groups had average health ratings, indicating that other non-health-related issues might be driving this. Secondly, it showed that these two groups were very different (one older group of men with previously high pay and working hours; the other of much younger men with previously low pay and working hours). The conclusion was that their motivations and needs to stay in work – and policy interventions – would be different.
- The ethnography highlighted other issues that were not captured in the data but would be important in designing solutions, such as: a lack of shared information across the system; the need of the general practitioner (GP) to refer patients to other non-health services as well as providing a fit note; and the importance of coaching, confidence-building and planning….(More)”
GSK and MIT Flumoji app tracks influenza outbreaks with crowdsourcing
Beth Snyder Bulik at FiercePharma: “It’s like Waze for the flu. A new GlaxoSmithKline-sponsored app called Flumoji uses crowdsourced data to track influenza movement in real time.
Developed with MIT’s Connection Science, the Flumoji app gathers data passively and identifies fluctuations in users’ activity and social interactions to try to identify when a person gets the flu. The activity data is combined with traditional flu tracking data from the Centers for Disease Control to help determine outbreaks. The Flumoji study runs through April, when it will be taken down from the Android app store and no more data will be collected from users.
To make the app more engaging for users, Flumoji uses emojis to help users identify how they’re feeling. If it’s a flu day, symptom faces with thermometers, runny noses and coughs can be chosen, while on other days, users can show how they’re feeling with more traditional mood emojis.
The app has been installed on 500-1,000 Android phones, according to Google Play data.
“Mobile phones are a widely available and efficient way to monitor patient health. GSK has been using them in its studies to monitor activity and vital signs in study patients, and collect patient feedback to improve decision making in the development of new medicines. Tracking the flu is just the latest test of this technology,” Mary Anne Rhyne, a GSK director of external communications for R&D in the U.S., told FiercePharma in an email interview…(More)”
Quantifying scenic areas using crowdsourced data
Chanuki Illushka Seresinhe, Helen Susannah Moat and Tobias Preis in Environment and Planning B: Urban Analytics and City Science: “For centuries, philosophers, policy-makers and urban planners have debated whether aesthetically pleasing surroundings can improve our wellbeing. To date, quantifying how scenic an area is has proved challenging, due to the difficulty of gathering large-scale measurements of scenicness. In this study we ask whether images uploaded to the website Flickr, combined with crowdsourced geographic data from OpenStreetMap, can help us estimate how scenic people consider an area to be. We validate our findings using crowdsourced data from Scenic-Or-Not, a website where users rate the scenicness of photos from all around Great Britain. We find that models including crowdsourced data from Flickr and OpenStreetMap can generate more accurate estimates of scenicness than models that consider only basic census measurements such as population density or whether an area is urban or rural. Our results provide evidence that by exploiting the vast quantity of data generated on the Internet, scientists and policy-makers may be able to develop a better understanding of people’s subjective experience of the environment in which they live….(More)”