The Power of Hackathons


Woodrow Wilson International Center for Scholars: “The Commons Lab of the Science and Technology Innovation Program is proud to announce the release of The Power of Hackathons: A Roadmap for Sustainable Open Innovation. Hackathons are collaborative events that have long been part of programmer culture, where people gather in person, online or both to work together on a problem. This could involve creating an application, improving an existing one or testing a platform.
In recent years, government agencies at multiple levels have started holding hackathon events of their own. For this brief, author Zachary Bastian interviewed agency staff, hackathon planners and hackathon participants to better understand how these events can be structured. The fundamental lesson was that a hackathon is not a panacea, but instead should be part of a broader open data and innovation centric strategy.
The full brief can be found here”

Why you should never trust a data visualisation


in The Guardian: “An excellent blogpost has been receiving a lot of attention over the last week. Pete Warden, an experienced data scientist and author for O’Reilly on all things data, writes:

The wonderful thing about being a data scientist is that I get all of the credibility of genuine science, with none of the irritating peer review or reproducibility worries … I thought I was publishing an entertaining view of some data I’d extracted, but it was treated like a scientific study.

This is an important acknowledgement of a very real problem, but in my view Warden has the wrong target in his crosshairs. Data presented in any medium is a powerful tool and must be used responsibly, but it is when information is expressed visually that the risks are highest.
The central example Warden uses is his visualisation of Facebook friend networks across the United States, which proved extremely popular and was even cited in the New York Times as evidence for growing social division.
As he explains in his post, the methodology behind his underlying network graph is perfectly defensible, but the subsequent clustering process was “produced by me squinting at all the lines, coloring in some areas that seemed more connected in a paint program, and picking silly names for the areas”. The exercise was only ever intended as a bit of fun with a large and interesting dataset, so there really shouldn’t be any problem here.
But there is: humans are visual creatures. Peer-reviewed studies have shown that we can consume information more quickly when it is expressed in diagrams than when it is presented as text.
Even something as simple as colour scheme can have a marked impact on the perceived credibility of information presented visually – often a considerably more marked impact than the actual authority of the data source.
Another great example of this phenomenon was the Washington Post’s ‘map of the world’s most and least racially tolerant countries‘, which went viral back in May of this year. It was widely accepted as an objective, scientific piece of work, despite a number of social scientists identifying flaws in the methodology and the underlying data itself.”

The Internet generation will learn to let go


Julian B. Gewirtz and Adam B. Kern in The Washington Post: “Ours is the first generation to have grown up with the Internet. The first generation that got suspended from school because of a photo of underage drinking posted online. The first generation that could talk in chat rooms to anyone, anywhere, without our parents knowing. The first generation that has been “tracked” and “followed” and “shared” since childhood.
All this data will remain available forever — both to the big players (tech companies, governments) and to our friends, our sort-of friends and the rest of civil society. This fact is not really new, but our generation will confront the latter on a scale beyond that experienced by previous generations…
Certainly there will be many uses for information, such as health data, that will wind up governed by law. But so many other uses cannot be predicted or legislated, and laws themselves have to be informed by values. It is therefore critical that people establish, with their actions and expectations, cultural norms that prevent their digital selves from imprisoning their real selves.
We see three possible paths: One, people become increasingly restrained about what they share and do online. Two, people become increasingly restrained about what they do, period. Three, we learn to care less about what people did when they were younger, less mature or otherwise different.
The first outcome seems unproductive. There is no longer much of an Internet without sharing, and one of the great benefits of the Internet has been its ability to nurture relationships and connections that previously had been impossible. Withdrawal is unacceptable. Fear of the digital future should not drive us apart.
The second option seems more deeply unsettling. Childhood, adolescence, college — the whole process of growing up — is, as thinkers from John Locke to Dr. Spock have written, a necessarily experimental time. Everyone makes at least one mistake, and we’d like to think that process continues into adulthood. Creativity should not be overwhelmed by the fear of what people might one day find unpalatable.
This leaves the third outcome: the idea that we must learn to care less about what people did when they were younger or otherwise different. In an area where regulations, privacy policies and treaties may take decades to catch up to reality, our generation needs to take the lead in negotiating a “cultural treaty” endorsing a new value, related to privacy, that secures our ability to have a past captured in data that is not held to be the last word but seen in light of our having grown up in a way that no one ever has before.
Growing up, that is, on the record.”

Big data  + politics = open data: The case of health care data in England


New Paper in Policy & Internet: “There is a great deal of enthusiasm about the prospects for Big Data held in health care systems around the world. Health care appears to offer the ideal combination of circumstances for its exploitation, with a need to improve productivity on the one hand and the availability of data that can be used to identify opportunities for improvement on the other. The enthusiasm rests on two assumptions. First, that the data sets held by hospitals and other organizations, and the technological infrastructure needed for their acquisition, storage, and manipulation, are up to the task. Second, that organizations outside health care systems will be able to access detailed datasets. We argue that both assumptions can be challenged. The article uses the example of the National Health Service in England to identify data, technology, and information governance challenges. The public acceptability of third party access to detailed health care datasets is, at best, unclear.”

Sitegeist


“Sitegeist is a mobile application that helps you to learn more about your surroundings in seconds. Drawing on publicly available information, the app presents solid data in a simple at-a-glance format to help you tap into the pulse of your location. From demographics about people and housing to the latest popular spots or weather, Sitegeist presents localized information visually so you can get back to enjoying the neighborhood. The application draws on free APIs such as the U.S. Census, Yelp! and others to showcase what’s possible with access to data. Sitegeist was created by the Sunlight Foundation in consultation with design firm IDEO and with support from the John S. and James L. Knight Foundation. It is the third in a series of National Data Apps.”

New Book: Breakpoint, Why the Web will Implode, Search will be Obsolete, and Everything Else you Need to Know about Technology is in Your Brain


breakpointbook.com: “We are living in a world in which cows send texts to farmers when they’re in heat, where the most valuable real estate in New York City houses computers, not people, and some of humanity’s greatest works are created by crowds, not individuals.

We are in the midst of a networking revolution–set to transform the way we access the world’s information and the way we connect with one another. Studying biological systems is perhaps the best way to understand such networks, and nature has a lesson for us if we care to listen: bigger is rarely better in the long run. The deadliest creature is the mosquito, not the lion. It is the quality of a network that is important for survival, not the size, and all networks–the human brain, Facebook, Google, even the internet itself–eventually reach a breakpoint and collapse. That’s the bad news. The good news is that reaching a breakpoint can be a step forward, allowing a network to substitute quality for quantity.
In Breakpoint, brain scientist and entrepreneur Jeff Stibel takes readers to the intersection of the brain, biology, and technology. He shows how exceptional companies are using their understanding of the internet’s brain-like powers to create a competitive advantage by building more effective websites, utilizing cloud computing, engaging social media, monetizing effectively, and leveraging a collective consciousness. Indeed, the result of these technologies is a more tightly connected world with capabilities far beyond the sum of our individual minds. Breakpoint offers a fresh and exciting perspective about the future of technology and its effects on all of us.”

Taking Games for Good to a New Level


Idit Harel Caperton (@idit) in SSIR: “Last month’s Games for Change Festival (G4C) celebrated the promising power of video games to yield social change. The event, now in its tenth year, brings game developers, educators, NGOs, and government agencies to New York City to discuss and promote the creation of social-issue games in an industry with a global market of $67 billion, projected to reach $82 billion by 2017. Big numbers like this prove that the gaming industry has engaged the masses, and G4C wants to push this engagement toward social learning and positive action.
It’s already happening on a small scale. The Games for Change Awards, announced annually at the festival, recognizes effective mission-driven games. This year’s winning games included “Data Dealer,” which raises awareness around personal data and online privacy, and “Quandary,” where players are social pioneers facing decisions that challenge their moral compass. These and other games endorsed at G4C achieve a blend of social influence and technical innovation through engaging gameplay.
G4C has also aligned with larger social impact movements, proving that video games can be vehicles for positive global action through game mechanics. Half the Sky Movement is a transmedia campaign working against the oppression of women worldwide; it includes a book, film, and game. The game, produced by G4C and available for free on Facebook, features game tasks that transfer to real-world donations and social action opportunities. Since launching in March, “Half the Sky Movement: The Game” has raised nearly $350,000 to empower women worldwide. Yet, social issue games production still resides on the edge of the gaming industry. …”

Code for America: Announcing the 2013 Accelerator Class


Press Release: “Code for America opened applications for the 2013 Accelerator knowing that the competition would be fierce. This year we received over 190 applications from amazing candidates. Today, we’re pleased to announce the five teams chosen to participate in the 2013 Accelerator.

The teams are articulate, knowledgeable, and passionate about their businesses. They come from all over the country — Texas, North Carolina, Florida, and California  — and we’re excited to get started with them. Teams include:

ArchiveSocial enables organizations to embrace social media by minimizing risk and eliminating compliance barriers. Specifically, it solves the challenge of retaining Gov 2.0 communications for compliance with FOIA and other public records laws. It currently automates business-grade record keeping of communications on networks such as Facebook, Twitter, and YouTube. Moving forward, ArchiveSocial will help further enforce social media policy and protect the organizational brand.

The Family Assessment Form (FAF) Web is a tool designed by social workers, researchers, and technology experts to help family support practitioners improve family functioning, service planning for families, and organizational performance. The FAF is ideal for use in organizations performing home visitation services for families that address comprehensive concerns about family well-being and child welfare. FAF Web enables all stakeholders to access essential data remotely from any internet-enabled device.

OpenCounter helps entrepreneurs to register their businesses with the local government. It does so through an online check-out experience that adapts to the applicant’s answers and asks for pertinent information only once. OpenCounter estimates licensing time and costs so entrepreneurs can understand what it will take to get their business off the ground. It’s the TurboTax of business permitting.

SmartProcure is an online information service that provides access to local, state, and federal government procurement data, with two public-interest goals: 1. Enable government agencies to make more efficient procurement decisions and save taxpayer dollars. 2. Empower businesses to sell more effectively and competitively to government agencies. The proprietary system provides access to data from more than 50 million purchase orders issued by 1,700 government agencies.

StreetCred Software helps police agencies manage their arrest warrants, eliminate warrant backlogs, and radically improve efficiency while increasing officer safety. It helps agencies understand their fugitive population, measure effectiveness, and make improvements. StreetCred Software, Inc., was founded by two Texas police officers. One is an 18-year veteran investigator and fugitive hunter, the other a technology industry veteran who became an cop in 2010.”

Data Science for Social Good


Data Science for Social Good: “By analyzing data from police reports to website clicks to sensor signals, governments are starting to spot problems in real-time and design programs to maximize impact. More nonprofits are measuring whether or not they’re helping people, and experimenting to find interventions that work.
None of this is inevitable, however.
We’re just realizing the potential of using data for social impact and face several hurdles to it’s widespread adoption:

  • Most governments and nonprofits simply don’t know what’s possible yet. They have data – but often not enough and maybe not the right kind.
  • There are too few data scientists out there – and too many spending their days optimizing ads instead of bettering lives.

To make an impact, we need to show social good organizations the power of data and analytics. We need to work on analytics projects that have high social impact. And we need to expose data scientists to the problems that really matter.

The fellowship

That’s exactly why we’re doing the Eric and Wendy Schmidt Data Science for Social Good summer fellowship at the University of Chicago.
We want to bring three dozen aspiring data scientists to Chicago, and have them work on data science projects with social impact.
Working closely with governments and nonprofits, fellows will take on real-world problems in education, health, energy, transportation, and more.
Over the next three months, they’ll apply their coding, machine learning, and quantitative skills, collaborate in a fast-paced atmosphere, and learn from mentors in industry, academia, and the Obama campaign.
The program is led by a strong interdisciplinary team from the Computation institute and the Harris School of Public Policy at the University of Chicago.”

5 Big Data Projects That Could Impact Your Life


Mashable: “We reached out to a few organizations using information, both hand- and algorithm-collected, to create helpful tools for their communities. This is only a small sample of what’s out there — plenty more pop up each day, and as more information becomes public, the trend will only grow….
1. Transit Time NYC
Transit Time NYC, an interactive map developed by WNYC, lets New Yorkers click a spot in any of the city’s five boroughs for an estimate of subway or train travel times. To create it, WNYC lead developer Steve Melendez broke the city into 2,930 hexagons, then pulled data from open source itinerary platform OpenTripPlanner — the Wikipedia of mapping software — and coupled it with the MTA’s publicly downloadable subway schedule….
2. Twitter’s ‘Topography of Tweets
In a blog post, Twitter unveiled a new data visualization map that displays billions of geotagged tweets in a 3D landscape format. The purpose is to display, topographically, which parts of certain cities most people are tweeting from…
3. Homicide Watch D.C.
Homicide Watch D.C. is a community-driven data site that aims to cover every murder in the District of Columbia. It’s sorted by “suspect” and “victim” profiles, where it breaks down each person’s name, age, gender and race, as well as original articles reported by Homicide Watch staff…
4. Falling Fruit
Can you find a hidden apple tree along your daily bike commute? Falling Fruit can.
The website highlights overlooked or hidden edibles in urban areas across the world. By collecting public information from the U.S. Department of Agriculture, municipal tree inventories, foraging maps and street tree databases, the site has created a network of 615 types of edibles in more than 570,000 locations. The purpose is to remind urban dwellers that agriculture does exist within city boundaries — it’s just more difficult to find….
5. AIDSvu
AIDSVu is an interactive map that illustrates the prevalence of HIV in the United States. The data is pulled from the U.S. Center for Disease Control’s national HIV surveillance reports, which are collected at both state and county levels each year…”