Why you should never trust a data visualisation


in The Guardian: “An excellent blogpost has been receiving a lot of attention over the last week. Pete Warden, an experienced data scientist and author for O’Reilly on all things data, writes:

The wonderful thing about being a data scientist is that I get all of the credibility of genuine science, with none of the irritating peer review or reproducibility worries … I thought I was publishing an entertaining view of some data I’d extracted, but it was treated like a scientific study.

This is an important acknowledgement of a very real problem, but in my view Warden has the wrong target in his crosshairs. Data presented in any medium is a powerful tool and must be used responsibly, but it is when information is expressed visually that the risks are highest.
The central example Warden uses is his visualisation of Facebook friend networks across the United States, which proved extremely popular and was even cited in the New York Times as evidence for growing social division.
As he explains in his post, the methodology behind his underlying network graph is perfectly defensible, but the subsequent clustering process was “produced by me squinting at all the lines, coloring in some areas that seemed more connected in a paint program, and picking silly names for the areas”. The exercise was only ever intended as a bit of fun with a large and interesting dataset, so there really shouldn’t be any problem here.
But there is: humans are visual creatures. Peer-reviewed studies have shown that we can consume information more quickly when it is expressed in diagrams than when it is presented as text.
Even something as simple as colour scheme can have a marked impact on the perceived credibility of information presented visually – often a considerably more marked impact than the actual authority of the data source.
Another great example of this phenomenon was the Washington Post’s ‘map of the world’s most and least racially tolerant countries‘, which went viral back in May of this year. It was widely accepted as an objective, scientific piece of work, despite a number of social scientists identifying flaws in the methodology and the underlying data itself.”

Sitegeist


“Sitegeist is a mobile application that helps you to learn more about your surroundings in seconds. Drawing on publicly available information, the app presents solid data in a simple at-a-glance format to help you tap into the pulse of your location. From demographics about people and housing to the latest popular spots or weather, Sitegeist presents localized information visually so you can get back to enjoying the neighborhood. The application draws on free APIs such as the U.S. Census, Yelp! and others to showcase what’s possible with access to data. Sitegeist was created by the Sunlight Foundation in consultation with design firm IDEO and with support from the John S. and James L. Knight Foundation. It is the third in a series of National Data Apps.”

New Book: Breakpoint, Why the Web will Implode, Search will be Obsolete, and Everything Else you Need to Know about Technology is in Your Brain


breakpointbook.com: “We are living in a world in which cows send texts to farmers when they’re in heat, where the most valuable real estate in New York City houses computers, not people, and some of humanity’s greatest works are created by crowds, not individuals.

We are in the midst of a networking revolution–set to transform the way we access the world’s information and the way we connect with one another. Studying biological systems is perhaps the best way to understand such networks, and nature has a lesson for us if we care to listen: bigger is rarely better in the long run. The deadliest creature is the mosquito, not the lion. It is the quality of a network that is important for survival, not the size, and all networks–the human brain, Facebook, Google, even the internet itself–eventually reach a breakpoint and collapse. That’s the bad news. The good news is that reaching a breakpoint can be a step forward, allowing a network to substitute quality for quantity.
In Breakpoint, brain scientist and entrepreneur Jeff Stibel takes readers to the intersection of the brain, biology, and technology. He shows how exceptional companies are using their understanding of the internet’s brain-like powers to create a competitive advantage by building more effective websites, utilizing cloud computing, engaging social media, monetizing effectively, and leveraging a collective consciousness. Indeed, the result of these technologies is a more tightly connected world with capabilities far beyond the sum of our individual minds. Breakpoint offers a fresh and exciting perspective about the future of technology and its effects on all of us.”

Taking Games for Good to a New Level


Idit Harel Caperton (@idit) in SSIR: “Last month’s Games for Change Festival (G4C) celebrated the promising power of video games to yield social change. The event, now in its tenth year, brings game developers, educators, NGOs, and government agencies to New York City to discuss and promote the creation of social-issue games in an industry with a global market of $67 billion, projected to reach $82 billion by 2017. Big numbers like this prove that the gaming industry has engaged the masses, and G4C wants to push this engagement toward social learning and positive action.
It’s already happening on a small scale. The Games for Change Awards, announced annually at the festival, recognizes effective mission-driven games. This year’s winning games included “Data Dealer,” which raises awareness around personal data and online privacy, and “Quandary,” where players are social pioneers facing decisions that challenge their moral compass. These and other games endorsed at G4C achieve a blend of social influence and technical innovation through engaging gameplay.
G4C has also aligned with larger social impact movements, proving that video games can be vehicles for positive global action through game mechanics. Half the Sky Movement is a transmedia campaign working against the oppression of women worldwide; it includes a book, film, and game. The game, produced by G4C and available for free on Facebook, features game tasks that transfer to real-world donations and social action opportunities. Since launching in March, “Half the Sky Movement: The Game” has raised nearly $350,000 to empower women worldwide. Yet, social issue games production still resides on the edge of the gaming industry. …”

Code for America: Announcing the 2013 Accelerator Class


Press Release: “Code for America opened applications for the 2013 Accelerator knowing that the competition would be fierce. This year we received over 190 applications from amazing candidates. Today, we’re pleased to announce the five teams chosen to participate in the 2013 Accelerator.

The teams are articulate, knowledgeable, and passionate about their businesses. They come from all over the country — Texas, North Carolina, Florida, and California  — and we’re excited to get started with them. Teams include:

ArchiveSocial enables organizations to embrace social media by minimizing risk and eliminating compliance barriers. Specifically, it solves the challenge of retaining Gov 2.0 communications for compliance with FOIA and other public records laws. It currently automates business-grade record keeping of communications on networks such as Facebook, Twitter, and YouTube. Moving forward, ArchiveSocial will help further enforce social media policy and protect the organizational brand.

The Family Assessment Form (FAF) Web is a tool designed by social workers, researchers, and technology experts to help family support practitioners improve family functioning, service planning for families, and organizational performance. The FAF is ideal for use in organizations performing home visitation services for families that address comprehensive concerns about family well-being and child welfare. FAF Web enables all stakeholders to access essential data remotely from any internet-enabled device.

OpenCounter helps entrepreneurs to register their businesses with the local government. It does so through an online check-out experience that adapts to the applicant’s answers and asks for pertinent information only once. OpenCounter estimates licensing time and costs so entrepreneurs can understand what it will take to get their business off the ground. It’s the TurboTax of business permitting.

SmartProcure is an online information service that provides access to local, state, and federal government procurement data, with two public-interest goals: 1. Enable government agencies to make more efficient procurement decisions and save taxpayer dollars. 2. Empower businesses to sell more effectively and competitively to government agencies. The proprietary system provides access to data from more than 50 million purchase orders issued by 1,700 government agencies.

StreetCred Software helps police agencies manage their arrest warrants, eliminate warrant backlogs, and radically improve efficiency while increasing officer safety. It helps agencies understand their fugitive population, measure effectiveness, and make improvements. StreetCred Software, Inc., was founded by two Texas police officers. One is an 18-year veteran investigator and fugitive hunter, the other a technology industry veteran who became an cop in 2010.”

5 Big Data Projects That Could Impact Your Life


Mashable: “We reached out to a few organizations using information, both hand- and algorithm-collected, to create helpful tools for their communities. This is only a small sample of what’s out there — plenty more pop up each day, and as more information becomes public, the trend will only grow….
1. Transit Time NYC
Transit Time NYC, an interactive map developed by WNYC, lets New Yorkers click a spot in any of the city’s five boroughs for an estimate of subway or train travel times. To create it, WNYC lead developer Steve Melendez broke the city into 2,930 hexagons, then pulled data from open source itinerary platform OpenTripPlanner — the Wikipedia of mapping software — and coupled it with the MTA’s publicly downloadable subway schedule….
2. Twitter’s ‘Topography of Tweets
In a blog post, Twitter unveiled a new data visualization map that displays billions of geotagged tweets in a 3D landscape format. The purpose is to display, topographically, which parts of certain cities most people are tweeting from…
3. Homicide Watch D.C.
Homicide Watch D.C. is a community-driven data site that aims to cover every murder in the District of Columbia. It’s sorted by “suspect” and “victim” profiles, where it breaks down each person’s name, age, gender and race, as well as original articles reported by Homicide Watch staff…
4. Falling Fruit
Can you find a hidden apple tree along your daily bike commute? Falling Fruit can.
The website highlights overlooked or hidden edibles in urban areas across the world. By collecting public information from the U.S. Department of Agriculture, municipal tree inventories, foraging maps and street tree databases, the site has created a network of 615 types of edibles in more than 570,000 locations. The purpose is to remind urban dwellers that agriculture does exist within city boundaries — it’s just more difficult to find….
5. AIDSvu
AIDSVu is an interactive map that illustrates the prevalence of HIV in the United States. The data is pulled from the U.S. Center for Disease Control’s national HIV surveillance reports, which are collected at both state and county levels each year…”

‘Medical Instagram’ helps build a library of reference photos for doctors


Springwise: “The power of the visual sharing that makes platforms such as Instagram so popular has been harnessed by retailers like Ask CT Food to share knowledge about cooking, but could the same be done for the medical world? Figure1 enables health professionals to upload and share photos of conditions, creating online discussion as well as crowdsourcing a database of reference images.
Developed by healthcare tech startup Movable Science, the platform is designed in a similar vein to Instagram and enables medical professionals to create their own feed of images from the cases they deal with. In order to protect patients’ identities, the app uses facial recognition to block out faces, while users can add their own marks to cover up other indentifiable marks. They can also add pointers and annotations, as well as choosing who sees it, before uploading the image. Photos can be tagged with relevant terms to allow the community to easily find them through search and others can comment on the images, fostering discussion among users. Images can also be starred, which acts simultaneously as an indication of quality as well as enabling users to save useful images for later reference. …
Although Instagram was developed with the broad purpose of entertainment and social sharing, Figure1 has tweaked the platform’s functions to provide a tool that could help doctors and students share their knowledge and learn from others in an engaging way…”

Eduardo Paes on Open Government


Mayor, Rio de Janeiro, Brazil in the Huffington Post: “The Internet revolution has transformed the way knowledge is disseminated and how people unite over causes. Social networks are playing a key role in this movement, just as books and the press have done over the last six centuries. During the recent demonstrations in Brazil, approximately 62 percent of the people were informed of the event via Facebook, a much higher rate than TV, which was first source of information to 14 percent of attendees, according to Ibope Institute. Three out of four agitators used social networks to round up support. As generations succeed and the digital gap narrows, these statistics could possibly rise.
This revolution is also accentuating the imperfections of the representative democracy, the only plausible alternative, as Churchill famously said. We live in an era of “Liquid Modernity” as defined by sociologist Zygmunt Bauman, which describes the ephemeral nature of contemporary social interactions. Bauman says that these days society, in a similar manner to liquid, adopts various unstable forms under small amounts of pressure. They are incapable of stabilizing in a consistent form, which results in consequences to social relationships and politics. Meanwhile, political parties, bureaucracy and institutions seem to remain firmly in the 17th Century.
Democracy has to reinvent itself in accordance with this new “liquid society” where collaboration happens between many millions of people directly. Leadership is not vertical, as in the past, but horizontal. Nowadays some say following is more important than leading. Cyber culture understands open code as a principle, something the music industry has reluctantly had to learn. There is no time and space limitation for public accountability on the Internet. Creative commonality is standard and does not resemble the authoritarian style of the dead communist experience. It seems that it is no longer society’s obligation to understand legislation, it is a duty for governments to be understood by their people.”

Facebook Is Being Redefined by Its Developing World Users


Tom Simonite in MIT Technology Review: “…as Facebook’s user base continues to expand, a growing proportion of its users think of it quite differently, as a luxury brand, badge of status, and or even a place to make a little extra money. That’s due to the rapid growth in the number of Facebook users signing on from developing countries, a trend underscored by news from the company today that more than 100 million people use a mobile app the company makes for feature phones
Little research has been done on Facebook’s growth in developing countries (and a lot would be needed to capture even some of the diversity included under the blanket term “developing world”). Two small, recent studies of Kenyan Facebook users in poor areas by Susan Wyche of Michigan State University are among the first to be published, and they provide some interesting insights.
One of Wyche’s ethnographic studies took place in rural Internet cafes, where the researchers were told that “Facebook is a luxury,” only to be indulged if someone had money to spare (here’s a PDF of Wyche’s paper). When study participants thought about social networking, the challenges of low bandwidth and sometimes unreliable electricity supplies were foremost in their minds.
The barriers of cost and infrastructure associated with Facebook led people in another community Wyche and colleagues visited, a slum of Nairobi, to see the service as for more than just socializing. They used it—with mixed success—as a way to make a little money, look for jobs, market themselves, and seek remittances from friends and family overseas. (This reminded me of a recent report on people in Kuwait using Instagram to sell things and run retail businesses.)…
Should it want to, Facebook could even become a powerful tool for efforts to improve the lives of people in poor areas, where the site is gaining traction. The company has already dabbled with using social engineering to boost organ donations in the U.S. (see “Thank God for Facebook: When Platforms Proselytize”). There’s no shortage of similar experiments that could be run in places with more fundamental health problems, where Facebook’s status as a luxury could make it very influential.”

Index: Participation and Civic Engagement


The Living Library Index – inspired by the Harper’s Index – provides important statistics and highlights global trends in governance innovation. This installment focuses on participation and civic engagement and was originally published in 2013.

  • Percent turnout of voting age population in 2012 U.S. Presidential election: 57.5
  • Percent turnout in 2008, 2004, 2000 elections: 62.3, 60.4, 54.2
  • Change in voting rate in U.S. from 1980 to most recent election: –29
  • Change in voting rate in Slovak Republic from 1980 to most recent election: –42, the lowest rate among democratic countries surveyed
  • Change in voting rate in Russian Federation from 1980 to most recent election: +14, the highest rate among democratic countries surveyed
  • Percent turnout in Australia as of 2011: 95, the highest rate among democratic countries surveyed
  • Percentage point difference in voting rates between high and low educated people in Australia as of 2011: 1
  • Percentage point difference in voting rates between high and low educated people in the U.S. as of 2011:  33
  • Number of Black and Hispanic U.S. voters in comparison to 2008 election: 1.7 million and 1.4 million increase
  • Number of non-Hispanic White U.S. voters in comparison to 2008 election: 2 million decrease, the only example of a race group showing a decrease in net voting from one presidential election to the next
  • Percent of Americans that contact their elected officials between elections: 10
  • Margin of victory in May 2013 Los Angeles mayoral election: 54-46
  • Percent turnout among Los Angeles citizens in May 2013 Los Angeles mayoral election: 19
  • Percent of U.S. adults that used social networking sites in 2012: 60
  • How many of which participated in a political or civic activity online: 2/3
  • Percent of U.S. social media users in 2012 that used social tools to encourage other people to take action on an issue that is important to them: 31
  • Percent of U.S. adults that belonged to a group on a social networking site involved in advancing a political or social issue in 2012: 12
  • Increase in the number of adults who took part in these behaviors in 2008: four-fold
  • Number of U.S. adults that signed up to receive alerts about local issues via email or text messaging in 2010: 1 in 5
  • Percent of U.S. adults that used digital tools digital tools to talk to their neighbors and keep informed about community issues in 2010: 20
  • Number of Americans that talked face-to-face with neighbors about community issues in 2010: almost half
  • How many online adults that have used social tools as blogs, social networking sites, and online video as well as email and text alerts to keep informed about government activities: 1/3
  • Percent of U.S. adult internet users that have gone online for raw data about government spending and activities in 2010: 40
  • Of which how many look online to see how federal stimulus money is being spent: 1 in 5
  • Read or download the text of legislation: 22%
  • How many Americans volunteered through or for an organization at least once between September 2011 and September 2012: 64.5 million
  • Median hours spent on volunteer activities during this time: 50
  • Change in volunteer rate compared to the year before: 0.3 decline

Sources