Paper by Andrew Schrock: “We are awash in predictions about our data-driven future. Enthusiasts believe it will offer new ways to research behavior. Critics worry it will enable powerful regimes of institutional control. Both visions, although polar opposites, tend to downplay the importance of communication. As a result, the role of communication in human-centered data science has rarely been considered. This article fills this gap by outlining three perspectives on data that foreground communication. First, I briefly review the common social scientific perspective: “communication as data.” Next, I elaborate on two less explored perspectives. A “data as communication” perspective captures how data imperfectly carry meanings and guide action. “Communication around data” describes communication in organizational and institutional data cultures. I conclude that communication offers nuanced perspectives to inform human-centered data science. Researchers should embrace a robust agenda, particularly when researching the relationship between data and power…(More)”
Three and a half degrees of separation
Sergey Edunov, Carlos Diuk, Ismail Onur Filiz, Smriti Bhagat and Moira Burke at Facebook Research: “…How connected is the world? Playwrights, poets, and scientists have proposed that everyone on the planet is connected to everyone else by six other people. In honor of Friends Day, we’ve crunched the Facebook friend graph and determined that the number is 3.57. Each person in the world (at least among the 1.59 billion people active on Facebook) is connected to every other person by an average of three and a half other people. The average distance we observe is 4.57, corresponding to 3.57 intermediaries or “degrees of separation.” Within the US, people are connected to each other by an average of 3.46 degrees.
Our collective “degrees of separation” have shrunk over the past five years. In 2011, researchers at Cornell, the Università degli Studi di Milano, and Facebook computed the average across the 721 million people using the site then, and found that it was 3.74 [4,5]. Now, with twice as many people using the site, we’ve grown more interconnected, thus shortening the distance between any two people in the world.
Calculating this number across billions of people and hundreds of billions of friendship connections is challenging; we use statistical techniques described below to precisely estimate distance based on de-identified, aggregate data.
….Calculating degrees of separation in a network with hundreds of billions of edges is a monumental task, because the number of people reached grows very quickly with the degree of separation.
Imagine a person with 100 friends. If each of his friends also has 100 friends, then the number of friends-of-friends will be 10,000. If each of those friends-of-friends also has 100 friends then the number of friends-of-friends-of-friends will be 1,000,000. Some of those friends may overlap, so we need to filter down to the unique connections. We’re only two hops away and the number is already big. In reality this number grows even faster since most people on Facebook have more than 100 friends. We also need to do this computation 1.6 billion times; that is, for every person on Facebook.
Rather than calculate it exactly, we relied on statistical algorithms developed by Kang and others [6-8] to estimate distances with great accuracy, basically finding the approximate number of people within 1, 2, 3 (and so on) hops away from a source….(More)
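To make the excerpt's idea concrete, here is a minimal sketch of estimating how many people lie within 1, 2, 3 (and so on) hops of each person, then turning those counts into an average distance. It is not the method from the cited work: the sketch type (k-minimum values), the sketch size `K`, and every function name below are assumptions chosen for illustration. Each person carries a small summary of the hashed IDs reachable so far, and each round merges in the neighbors' summaries, so no full friend-of-friend list is ever built and overlapping friends are counted once.

```python
import hashlib
from typing import Dict, List, Set

K = 64  # sketch size (assumed); sets with fewer than K members are counted exactly


def node_hash(node: int) -> int:
    """Map a node id to a pseudo-random 64-bit integer."""
    digest = hashlib.sha256(str(node).encode()).digest()
    return int.from_bytes(digest[:8], "big")


def merge(a: List[int], b: List[int]) -> List[int]:
    """Union two sketches, keeping only the K smallest distinct hashes."""
    return sorted(set(a) | set(b))[:K]


def estimate(sketch: List[int]) -> float:
    """Estimate how many distinct nodes a sketch summarizes."""
    if len(sketch) < K:
        return float(len(sketch))       # small set: the sketch holds every member
    return (K - 1) * 2**64 / sketch[-1]  # k-minimum-values estimator


def neighborhood_function(graph: Dict[int, Set[int]], max_hops: int) -> List[float]:
    """totals[h] = estimated number of ordered pairs (u, v), u != v, within h hops."""
    sketches = {u: [node_hash(u)] for u in graph}  # hop 0: each node reaches itself
    totals = [0.0]
    for _ in range(max_hops):
        updated = {}
        for u, friends in graph.items():
            s = sketches[u]
            for v in friends:
                s = merge(s, sketches[v])  # overlapping friends collapse here
            updated[u] = s
        sketches = updated
        totals.append(sum(estimate(s) - 1 for s in sketches.values()))
    return totals


def average_distance(totals: List[float]) -> float:
    """Average hop count over all pairs reached within the hops computed."""
    newly_reached = [totals[h] - totals[h - 1] for h in range(1, len(totals))]
    weighted = sum(h * n for h, n in enumerate(newly_reached, start=1))
    return weighted / totals[-1] if totals[-1] else 0.0


# Toy graph (hypothetical): a chain of four people, 0 - 1 - 2 - 3.
chain = {0: {1}, 1: {0, 2}, 2: {1, 3}, 3: {2}}
totals = neighborhood_function(chain, max_hops=3)
avg = average_distance(totals)
print(f"average distance {avg:.2f}, intermediaries {avg - 1:.2f}")
```

On the four-person chain the average distance comes out at about 1.67, or about 0.67 intermediaries, following the excerpt's convention that degrees of separation equal the average distance minus one. The toy graph is far smaller than `K`, so the counts there are exact; on a large graph the same code would return estimates whose accuracy depends on `K`.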
A Government of the Future
White House Fact Sheet on The President’s Fiscal Year 2017 Budget: “…The President is committed to driving lasting change in how Government works – change that makes a significant, tangible, and positive difference in the economy and the lives of the American people. Over the past seven years, the Administration has launched successful efforts to modernize and improve citizen-facing services, eliminate wasteful spending, reduce the Federal real property footprint, improve the use of evidence to improve program performance, and spur innovation in the private sector by opening to the public tens of thousands of Federal data sets and innovation assets at the national labs.
Supporting the President’s Management Agenda. The Budget includes investments to continue driving the President’s Management Agenda by improving the service we provide to the American public; leveraging the Federal Government’s buying power to bring more value and efficiency to how we use taxpayer dollars; opening Government data and research to the private sector to drive innovation and economic growth; promoting smarter information technology; modernizing permitting and environmental review processes; creating new Idea Labs to support employees with promising ideas; and, attracting and retaining the best talent in the Federal workforce.
Supporting Digital Service Delivery for Citizens. In 2014 the Administration piloted the U.S. Digital Service, a unit of innovators, entrepreneurs, and engineers. This team of America’s best digital experts has worked in collaboration with Federal agencies to implement streamlined and effective digital technology practices on the Nation’s highest priority programs. This work includes collaborating with the Department of Education to launch the new College Scorecard to give students, parents, and their advisors the most reliable national data to help with college choice, and supporting the U.S. Citizenship and Immigration Services (USCIS) transition to launch the new myUSCIS, which makes it easier for users to access information about the immigration process and immigration services. To institutionalize the dramatic improvements that this approach has demonstrated, the Budget supports the Administration’s aggressive goal of hiring and placing 500 top technology and design experts to serve in the Government by January 2017.
Strengthening Federal Cybersecurity. As outlined above, the Budget provides $19 billion in resources for cybersecurity. This includes the creation of a new $3.1 billion revolving fund, the Information Technology Modernization Fund (ITMF), to retire the Government’s antiquated IT systems and transition to more secure and efficient modern IT systems, funding to streamline governance and secure Federal networks, and investments to strengthen the cybersecurity workforce and cybersecurity education across society.
Building Evidence and Encouraging Innovation. The President has made it clear that policy decisions should be driven by evidence so that the Federal government can do more of what works and less of what does not. The Administration’s evidence-based approaches have resulted in important gains in areas ranging from reducing veteran homelessness, to improving educational outcomes, to enhancing the effectiveness of international development programs. The Budget invests in expanding evidence-based approaches, developing and testing effective practices, and enhancing government’s capacity to build and use evidence, in particular by expanding access to administrative data and further developing Federal, State, local, and tribal data infrastructure.
Reorganizing Government to Succeed in the Global Economy. The Budget also includes proposals to consolidate and reorganize Government agencies to make them leaner and more efficient, and it increases the use of evidence and evaluation to ensure that taxpayer dollars are spent wisely on programs that work….(More). See also President Barack Obama’s FY 2017 Budget for the U.S. Government
Private Provision of Public Goods via Crowdfunding
Paper by Robert Chovanculiak and Marek Hudík: “Private provision of public goods is typically associated with three main problems: (1) high organization costs, (2) the assurance problem, and (3) the free-rider problem. We argue that technologies which enable crowdfunding (the method of funding projects by raising small amounts of money from a large number of people via the internet), have made the overall conditions for private provision of public goods more favorable: these technologies lowered the organization costs and enabled to employ more efficient mechanisms which reduce the assurance and free-rider problems. It follows that if the reason for government provision of public goods is higher efficiency as suggested by the standard theory, then with the emergence of crowdfunding we should observe a decline of the government role in this area….(More)”
Designing for Cities: Technology and the Urban Experience
eBook by Paul McConnell and Michael Clare: “How can today’s growing cities use technology and design to improve their infrastructure, management, and quality of life? In this O’Reilly report, Paul McConnell and Mike Clare from Intersection review how connected services and platforms are redefining how cities function, and how people interact within them.
As the world becomes more urbanized and connected, design methods can be applied to some of the most critical challenges among three major groups: citizens, civic stakeholders, and commercial interests.
This report will provide you with background, examples, and approaches for citizen-centered experiences and civic innovation projects. The authors provide examples from projects including the MTA Subway System and LinkNYC—an ambitious program to replace New York’s aging pay phone infrastructure with the world’s largest and fastest free municipal Wi-Fi network….(More)”
Open government data and why it matters
Dive Against Debris: Employing 25,600 scuba divers to collect data
DataDrivenJournalism: “In 2011, the team at Project AWARE launched the Dive Against Debris program with the objective of better documenting the amount of marine debris found in the world’s oceans. This global citizen science program trains volunteer scuba divers from across the globe to conduct underwater surveys, generating quantitative data on the debris they see. After cleaning this data for quality assurance, it is then published on their interactive Dive Against Debris Map. This data and visualization informs the team’s advocacy work, ultimately seeking to generate changes in policy.
The impact of marine debris is devastating, killing marine life and changing their habitats and ecosystems. Animals are extremely vulnerable to ingestion or entanglement which leads to death, as they are unable to distinguish between what is trash and what is not.
Beyond this, as microscopic pieces of plastic enter the food chain, most seafood ingested by humans also likely contains marine debris.
Project AWARE is a growing movement of scuba divers protecting the ocean, with a long history of working on the marine debris issue. Through its work, the Project AWARE team found that there was a significant lack of data available regarding underwater marine debris.
To remedy this, the Dive Against Debris program was launched in 2011. The program seeks to collect and visualise data generated by their volunteers, then use this data to influence policy changes and raise social awareness around the world. This data collection is unique in that it focuses exclusively on yielding data about the types and quantities of marine debris items found beneath the ocean’s surface, an issue Hannah Pragnell-Raasch, a Program Specialist with Project AWARE, told us “has previously been disregarded as out of sight, out of mind, as the everyday person is not exposed to the harmful impacts.”
To date, Dive Against Debris surveys have been conducted in over 50 countries, with the top reporting countries being the United States, Thailand and Greece. As more divers get involved with Dive Against Debris, Project AWARE continues to bring visibility to the problem of marine debris and helps to identify target areas for waste prevention efforts.
….
Anyone can take part in a Dive Against Debris survey, as long as they are a certified diver. As described in their “Action Zone”, scuba divers can either “join” or “create” an action. To further support the program, Project AWARE launched the Dive Against Debris Distinctive Specialty, a course for divers, which “aims to equip students (scuba divers) with the skills and knowledge necessary to conduct their own Dive Against Debris Surveys.”
Before the data appears on the interactive Dive Against Debris Map, it goes through a quality review in order to ensure data integrity. The survey leader at Project AWARE corrects any data inconsistencies. Then, as the focus is exclusively on what is found underwater, all land data is removed. Project AWARE aims to create “an accurate perspective about underwater marine debris, that policy-makers simply cannot ignore”…. Explore the Dive Against Debris project here…. (More)
Open data dusts off the art world
Suzette Lohmeyer at GCN: “Open data is not just for spreadsheets. Museums are finding ways to convert even the provenance of artwork into open data, offering an out-of-the-box lesson in accessibility to public sector agencies. The specific use case could be of interest to government as well — many cities and states have sizeable art collections, and the General Services Administration owns more than 26,000 pieces.
Most art pieces have a few skeletons in their closet, or at least a backstory worthy of The History Channel. That provenance, or ownership information, has traditionally been stored in manila folders, only occasionally dusted off by art historians for academic papers or auction houses to verify authenticity. Many museums have some provenance data in collection management systems, but the narratives that tell the history of the work are often stored as semi-structured data, formatted according to the needs of individual institutions, making the information both hard to search and share across systems.
Enter Art Tracks from Pittsburgh’s Carnegie Museum of Art (CMOA) — a new open source, open data initiative that aims to turn provenance into structured data by building a suite of open source software tools so an artwork’s past can be available to museum goers, curators, researchers and software developers.
….The Art Tracks software is all open source. The code libraries and the user-facing provenance entry tool called Elysa (E-lie-za) are all “available on GitHub for use, modification and tinkering,” Berg-Fulton explained. “That’s a newer way of working for our museum, but that openness gives others a chance to lean on our technical expertise and improve their own records and hopefully contribute back to the software to improve that as well.”
Using an open data format, Berg-Fulton said, also creates opportunities for ongoing partnerships with other experts across the museum community so that provenance becomes a constant conversation.
This is a move Berg-Fulton said CMOA has been “dying to make,” because the more people that have access to data, the more ways it can be interpreted. “When you give people data, they do cool things with it, like help you make your own records better, or interpret it in a way you’ve never thought of,” she said. “It feels like the right thing to do in light of our duty to public trust.”….(More)”
Give Up Your Data to Cure Disease
David B. Agus in The New York Times: “How far would you go to protect your health records? Your privacy matters, of course, but consider this: Mass data can inform medicine like nothing else and save countless lives, including, perhaps, your own.
Over the past several years, using some $30 billion in federal stimulus money, doctors and hospitals have been installing electronic health record systems. ….Yet neither doctors nor patients are happy. Doctors complain about the time it takes to update digital records, while patients worry about confidentiality…
We need to get over it. These digital databases offer an incredible opportunity to examine trends that will fundamentally change how doctors treat patients. They will help develop cures, discover new uses for drugs and better track the spread of scary new illnesses like the Zika virus….
Case in point: Last year, a team led by researchers at the MD Anderson Cancer Center and Washington University found that a common class of heart drugs called beta blockers, which block the effects of adrenaline, may prolong ovarian cancer patients’ survival. This discovery came after the researchers reviewed more than 1,400 patient records, and identified an obvious pattern among those with ovarian cancer who were using beta blockers, most often to control their blood pressure. Women taking earlier versions of this class of drug typically lived for almost eight years after their cancer diagnosis, compared with just three and a half years for the women not taking any beta blocker….
We need to move past that. For one thing, more debate over data sharing is already leading to more data security. Last month a bill was signed into law calling for the Department of Health and Human Services to create a health care industry cybersecurity task force, whose members would hammer out new voluntary standards.
New technologies — and opportunities — come with unprecedented risks and the need for new policies and strategies. We must continue to improve our encryption capabilities and other methods of data security and, most important, mandate that they are used. The hack of the Anthem database last year, for instance, which allowed 80 million personal records to be accessed, was shocking not only for the break-in, but for the lack of encryption….
Medical research is making progress every day, but the next step depends less on scientists and doctors than it does on the public. Each of us has the potential to be part of tomorrow’s cures. (More)”
New Tools for Collaboration: The Experience of the U.S. Intelligence Community
IBM Center for Business of Government: “This report is intended for an audience beyond the U.S. Intelligence Community—senior managers in government, their advisors and students of government performance who are interested in the progress of collaboration in a difficult environment. …
The purpose of this report is to learn lessons by looking at the use of internal collaborative tools across the Intelligence Community. The initial rubric was tools, but the real focus is collaboration, for while the tools can enable, what ultimately matters are policies and practices interacting with organizational culture. It looks for good practices to emulate. The ultimate question is how and how much could, and should, collaborative tools foster integration across the Community. The focus is analysis and the analytic process, but collaborative tools can and do serve many other functions in the Intelligence Community—from improving logistics or human resources, to better connecting collection and analysis, to assisting administration and development, to facilitating, as one interlocutor put it, operational “go” decisions. Yet it is in the analytic realm that collaboration is both most visible and most rubs against traditional work processes that are not widely collaborative.
The report defines terms and discusses concepts, first exploring collaboration and coordination, then defining collaborative tools and social media, then surveying the experience of the private sector. The second section of the report uses those distinctions to sort out the blizzard of collaborative tools that have been created in the various intelligence agencies and across them. The third section outlines the state of collaboration, again both within agencies and across them. The report concludes with findings and recommendations for the Community. The recommendations amount to a continuum of possible actions in making more strategic what is and will continue to be more a bottom-up process of creating and adopting collaborative tools and practices….(More)”