Big Data Is Not a Monolith


Book edited by Cassidy R. Sugimoto, Hamid R. Ekbia and Michael Mattioli: “Big data is ubiquitous but heterogeneous. Big data can be used to tally clicks and traffic on web pages, find patterns in stock trades, track consumer preferences, identify linguistic correlations in large corpuses of texts. This book examines big data not as an undifferentiated whole but contextually, investigating the varied challenges posed by big data for health, science, law, commerce, and politics. Taken together, the chapters reveal a complex set of problems, practices, and policies.

The advent of big data methodologies has challenged the theory-driven approach to scientific knowledge in favor of a data-driven one. Social media platforms and self-tracking tools change the way we see ourselves and others. The collection of data by corporations and government threatens privacy while promoting transparency. Meanwhile, politicians, policy makers, and ethicists are ill-prepared to deal with big data’s ramifications. The contributors look at big data’s effect on individuals as it exerts social control through monitoring, mining, and manipulation; big data and society, examining both its empowering and its constraining effects; big data and science, considering issues of data governance, provenance, reuse, and trust; and big data and organizations, discussing data responsibility, “data harm,” and decision making….(More)”

Obama Brought Silicon Valley to Washington


Jenna Wortham at The New York Times: “…“Fixing” problems with technology often just creates more problems, largely because technology is never developed in a neutral way: It embodies the values and biases of the people who create it. Crime-predicting software, celebrated when it was introduced in police departments around the country, turned out to reinforce discriminatory policing. Facebook was recently accused of suppressing conservative news from its trending topics. (The company denied a bias, but announced plans to train employees to neutralize political, racial, gender and age biases that could influence what it shows its user base.) Several studies have found that Airbnb has worsened the housing crises in some cities where it operates. In January, a report from the World Bank declared that tech companies were widening income inequality and wealth disparities, not improving them….

None of this was mentioned at South by South Lawn. Instead, speakers heralded the power of the tech community. John Lewis, the congressman and civil rights leader, gave a rousing talk that implored listeners to “get in trouble. Good trouble. Get in the way and make some noise.” Clay Dumas, chief of staff for the Office of Digital Strategy at the White House, told me in an email that the event could be considered part of a legacy to inspire social change and activism through technology. “In his final months in office,” he wrote, “President Obama wants to empower the generation of people that helped launch his candidacy and whose efforts carried him into office.”

…But a few days later, during a speech at Carnegie Mellon, Obama seemed to reckon with his feelings about the potential — and limits — of the tech world. The White House can’t be as freewheeling as a start-up, he said, because “by definition, democracy is messy. And part of government’s job is dealing with problems that nobody else wants to deal with.” But he added that he didn’t want people to become “discouraged and say, ‘I’m just not going to deal with government.’ ” Obama was the first American president to see technology as an engine to improve lives and accelerate society more quickly than any government body could. That lesson was apparent on the lawn. While I still don’t believe that technology is a panacea for society’s problems, I will always appreciate the first president who tried to bring what’s best about Silicon Valley to Washington, even if some of the bad came with it….(More)”

Kenyans have launched a campaign on Twitter to fix their roads


Lily Kuo in Quartz: “Traffic is a problem in Nairobi. A short commute can last for hours during morning or evening rush hour. Buses and motorbikes cut in and out of traffic, worsening congestion. It’s estimated that road congestion costs Kenya’s capital as much as $570,000 a day in lost productivity.

One of the reasons for the city’s bad traffic is the state of the roads: drivers swerve to avoid potholes, bumps, or breaks in the roads, causing a buildup of traffic. To help, an online campaign called “What is a Road” is crowdsourcing the location and condition of potholes around the city in an effort to push local officials to fix them.

Nairobians tweet a photo and location of a pothole under the hashtag #whatisaroad. Those reports are uploaded to a map, used to analyze where the city’s potholes are located and track which ones have been fixed. “We decided to take a more data driven approach to track progress, promises made and projects delivered,” says Muthuri Kinyamu, one of the organizers.

A map showing crowdsourced reports of potholes across Nairobi. (What Is a Road)

The campaign is also about addressing some of the fundamental problems that hold cities like Nairobi back. In Nairobi, branded the center of “Silicon Savannah” in recent years, there’s often more focus on entrepreneurship and innovation than resolving simpler problems like the state of the roads. …The

The campaign, started in August, will continue until January. Chris Orwa, a data analyst helping with the project, says that they can’t take credit for all the repairs they have been documented around the city, but they have noticed that roads are being fixed within days of a #Whatisaroad report. The average response time for fixing a road reported by a What is a Road user is three days, according to Orwa….(More)”

Improving Services—At What Cost? Examining the Ethics of Twitter Research


Case study by Sara Mannheimer, Scott W. H. Young and Doralyn Rossmann: “As social media use has become widespread, academic and corporate researchers have identified social networking services as sources of detailed information about people’s viewpoints and behaviors. Social media users share thoughts, have conversations, and build communities in open, online spaces, and researchers analyze social media data for a variety of purposes—from tracking the spread of disease (Lampos & Cristianini, 2010) to conducting market research (Patino, Pitta, & Quinones, 2012; Hornikx & Hendriks, 2015) to forecasting elections (Tumasjan et al., 2010). Twitter in particular has emerged as a leading platform for social media research, partly because user data from non-private Twitter accounts is openly accessible via an application programming interface (API). This case study describes research conducted by Montana State University (MSU) librarians to analyze the MSU Library’s Twitter community, and the ethical questions that we encountered over the course of the research. The case study will walk through our Twitter research at the MSU Library, and then suggest discussion questions to frame an ethical conversation surrounding social media research. We offer a number of areas of ethical inquiry that we recommend be engaged with as a cohesive whole….(More)”.

Could online democracy lead to governance by Trumps and trolls?


in The Guardian: “The first two user tutorials are pretty stock standard but, from there, things escalate dramatically. After mastering How to Sign Up and How to RecoverYour Password, users are apparently ready to advance to lesson number three: How to Create a Democracy.

As it turns out, on DemocracyOS, this is a relatively straightforward matter – not overthrowing the previous regime nor exterminating the last traces of the royal lineage in order to pave the way for a new world order. Instead Argentinian developers Democracia en Red have made it a simple matter of clicking a button to form a group and thrash out the policies voters wish to see enacted.

It is one of a range of digital platforms for direct democracy created by developers and activists to redefine the relationship between citizens and their governments,with the powers that be in Latin American city councils through to European anti-austerity parties making the upgrade to democracy 2.0.

Reshaping how government works is a difficult enough pitch by itself but,beyond that, there’s another challenge facing developers – the online trolls are ready and waiting.

Britain alone this year offered up two examples of what impact trolls could have on online direct democracy – there was the case of “BoatyMcBoatface” famously winning a Natural Environment ResearchCouncil poll to determine the name of a multimillion-pound arctic research vessel, and then there was the more serious case of trolls adding the signatures of thousands of residents of countries such as the Cayman Islands and Vatican City to a formal petition calling for a second Brexit referendum, in order to have the entire document disregarded as an online prank.

In the US presidential election even the politicians are getting in on it,with a pro-Hillary Clinton super PAC (political action committee) hiring an army of online commenters to defend the candidate in arguments on social media, while the Republican contender, Donald Trump, is himself engaging in textbook trolling behaviour – whether that’s urging the hacking of Clinton’s emails, revealing the phone number of a Republican rival during the primaries, or unleashing a constant stream of controversial statements as a means of derailing conversations, attracting attention and humiliating his targets.

So what does this mean for digital platforms for direct democracy? By merging the world of the internet with that of politics, will we all end up governed by some fusion of trolls and Trumps promising to build Wally McWallfaces on our borders? And will the technologies of the fourth industrial revolution also usher in a revolution in how democracy functions?…(More)”

Artificial Intelligence can streamline public comment for federal agencies


John Davis at the Hill: “…What became immediately clear to me was that — although not impossible to overcome — the lack of consistency and shared best practices across all federal agencies in accepting and reviewing public comments was a serious impediment. The promise of Natural Language Processing and cognitive computing to make the public comment process light years faster and more transparent becomes that much more difficult without a consensus among federal agencies on what type of data is collected – and how.

“There is a whole bunch of work we have to do around getting government to be more customer friendly and making it at least as easy to file your taxes as it is to order a pizza or buy an airline ticket,” President Obama recently said in an interview with WIRED. “Whether it’s encouraging people to vote or dislodging Big Data so that people can use it more easily, or getting their forms processed online more simply — there’s a huge amount of work to drag the federal government and state governments and local governments into the 21st century.”

…expanding the discussion around Artificial Intelligence and regulatory processes to include how the technology should be leveraged to ensure fairness and responsiveness in the very basic processes of rulemaking – in particular public notices and comments. These technologies could also enable us to consider not just public comments formally submitted to an agency, but the entire universe of statements made through social media posts, blogs, chat boards — and conceivably every other electronic channel of public communication.

Obviously, an anonymous comment on the Internet should not carry the same credibility as a formally submitted, personally signed statement, just as sworn testimony in court holds far greater weight than a grapevine rumor. But so much public discussion today occurs on Facebook pages, in Tweets, on news website comment sections, etc. Anonymous speech enjoys explicit protection under the Constitution, based on a justified expectation that certain sincere statements of sentiment might result in unfair retribution from the government.

Should we simply ignore the valuable insights about actual public sentiment on specific issues made possible through the power of Artificial Intelligence, which can ascertain meaning from an otherwise unfathomable ocean of relevant public conversations? With certain qualifications, I believe Artificial Intelligence, or AI, should absolutely be employed in the critical effort to gain insights from public comments – signed or anonymous.

“In the criminal justice system, some of the biggest concerns with Big Data are the lack of data and the lack of quality data,” the NSTC report authors state. “AI needs good data. If the data is incomplete or biased, AI can exacerbate problems of bias.” As a former federal criminal prosecutor and defense attorney, I am well familiar with the absolute necessity to weigh the relative value of various forms of evidence – or in this case, data…(More)

When the Algorithm Itself is a Racist: Diagnosing Ethical Harm in the Basic Components of Software


Paper by Christian Sandvig et al in Special Issue of the International Journal of Communication on Automation, Algorithms, and Politics: “Computer algorithms organize and select information across a wide range of applications and industries, from search results to social media. Abuses of power by Internet platforms have led to calls for algorithm transparency and regulation. Algorithms have a particularly problematic history of processing information about race. Yet some analysts have warned that foundational computer algorithms are not useful subjects for ethical or normative analysis due to complexity, secrecy, technical character, or generality. We respond by investigating what it is an analyst needs to know to determine whether the algorithm in a computer system is improper, unethical, or illegal in itself. We argue that an “algorithmic ethics” can analyze a particular published algorithm. We explain the importance of developing a practical algorithmic ethics that addresses virtues, consequences, and norms: We increasingly delegate authority to algorithms, and they are fast becoming obscure but important elements of social structure…. (More)”

A decentralized web would give power back to the people online


 at TechCrunch: “…The original purpose of the web and internet, if you recall, was to build a common neural network which everyone can participate in equally for the betterment of humanity.Fortunately, there is an emerging movement to bring the web back to this vision and it even involves some of the key figures from the birth of the web. It’s called the Decentralised Web or Web 3.0, and it describes an emerging trend to build services on the internet which do not depend on any single “central” organisation to function.

So what happened to the initial dream of the web? Much of the altruism faded during the first dot-com bubble, as people realised that an easy way to create value on top of this neutral fabric was to build centralised services which gather, trap and monetise information.

Search Engines (e.g. Google), Social Networks (e.g. Facebook), Chat Apps (e.g. WhatsApp )have grown huge by providing centralised services on the internet. For example, Facebook’s future vision of the internet is to provide access only to the subset of centralised services endorses (Internet.org and Free Basics).

Meanwhile, it disables fundamental internet freedoms such as the ability to link to content via a URL (forcing you to share content only within Facebook) or the ability for search engines to index its contents (other than the Facebook search function).

The Decentralised Web envisions a future world where services such as communication,currency, publishing, social networking, search, archiving etc are provided not by centralised services owned by single organisations, but by technologies which are powered by the people: their own community. Their users.

The core idea of decentralisation is that the operation of a service is not blindly trusted toany single omnipotent company. Instead, responsibility for the service is shared: perhaps by running across multiple federated servers, or perhaps running across client side apps in an entirely “distributed” peer-to-peer model.

Even though the community may be “byzantine” and not have any reason to trust or depend on each other, the rules that describe the decentralised service’s behaviour are designed to force participants to act fairly in order to participate at all, relying heavily on cryptographic techniques such as Merkle trees and digital signatures to allow participants to hold each other accountable.

There are three fundamental areas that the Decentralised Web necessarily champions:privacy, data portability and security.

  • Privacy: Decentralisation forces an increased focus on data privacy. Data is distributed across the network and end-to-end encryption technologies are critical for ensuring that only authorized users can read and write. Access to the data itself is entirely controlled algorithmically by the network as opposed to more centralized networks where typically the owner of that network has full access to data, facilitating  customer profiling and ad targeting.
  • Data Portability: In a decentralized environment, users own their data and choose with whom they share this data. Moreover they retain control of it when they leave a given service provider (assuming the service even has the concept of service providers). This is important. If I want to move from General Motors to BMW today, why should I not be able to take my driving records with me? The same applies to chat platform history or health records.
  • Security: Finally, we live in a world of increased security threats. In a centralized environment, the bigger the silo, the bigger the honeypot is to attract bad actors.Decentralized environments are safer by their general nature against being hacked,infiltrated, acquired, bankrupted or otherwise compromised as they have been built to exist under public scrutiny from the outset….(More)”

Facebook, World Bank and OECD Link Up to Gather Data


Paul Hannon in the Wall Street Journal: “Social media potentially offers cheaper and more timely way to survey firms and gauge the economy…Facebook has teamed up with the World Bank and the OECD to launch a new measure of business sentiment based on questioning companies that use their Facebook pages to connect with customers.

The three partners on Wednesday launched a new measure of business sentiment based on questioning companies that use their Facebook pages to connect with customers. Known as the Future of Business Survey, the report has been in testing since February and received responses to 15 queries from a total of 90,000 small and midsize firms across 22 countries.

Its first public release shows that those businesses are more optimistic about their prospects than other companies surveyed by more traditional means.

But the real interest for the three partners is the potential to drill down into the factors that affect the growth of small businesses, a process that until now has involved great expense and time, since it involves face-to-face interviews by polling professionals that are carried out over many months and are infrequently updated. “What I feel is appealing about this particular survey is that it’s potentially a more powerful tool for getting information more quickly and at a fraction of the cost,” said Augusto Lopez-Claros, director of the Global Indicators Group at the World Bank.

Even in developed countries with well funded and equipped statistics offices, timely information on very small businesses is hard to come by. In developing countries, that scarcity can be more acute. The ability to connect with business owners via Facebook or other social-media platforms could make it possible to gather such information, and acquire a more complete picture of what is happening in those economies.

The new approach to data gathering could even enable some smaller developing countries to skip the process of enlarging their statistics agencies. That is an opportunity Mr. Lopez Claros compares to the advent of mobile telephones, which enabled many African countries to skip the construction of expensive fixed-line infrastructure and improve communications at a fraction of that cost….(More)

See also Entrepreneurship at a Glance 2016 (OECD).

Social Machines: The Coming Collision of Artificial Intelligence, Social Networking, and Humanity


Book by James Hendler and Alice Mulvehill: “Will your next doctor be a human being—or a machine? Will you have a choice? If you do, what should you know before making it?

This book introduces the reader to the pitfalls and promises of artificial intelligence in its modern incarnation and the growing trend of systems to “reach off the Web” into the real world. The convergence of AI, social networking, and modern computing is creating an historic inflection point in the partnership between human beings and machines with potentially profound impacts on the future not only of computing but of our world.

AI experts and researchers James Hendler and Alice Mulvehill explore the social implications of AI systems in the context of a close examination of the technologies that make them possible. The authors critically evaluate the utopian claims and dystopian counterclaims of prognosticators. Social Machines: The Coming Collision of Artificial Intelligence, Social Networking, and Humanity is your richly illustrated field guide to the future of your machine-mediated relationships with other human beings and with increasingly intelligent machines.

What you’ll learn

• What the concept of a social machine is and how the activities of non-programmers are contributing to machine intelligence• How modern artificial intelligence technologies, such as Watson, are evolving and how they process knowledge from both carefully produced information (such as Wikipedia or journal articles) and from big data collections

• The fundamentals of neuromorphic computing

• The fundamentals of knowledge graph search and linked data as well as the basic technology concepts that underlie networking applications such as Facebook and Twitter

• How the change in attitudes towards cooperative work on the Web, especially in the younger demographic, is critical to the future of Web applications…(More)”