DATA – Page 502 – The Living Library

Open data for competitive advantage: insights from open data use by companies

Curated on June 3, 2015August 3, 2018 by Stefaan Verhulst

Anneke Zuiderwijk et al in the Proceedings of the 16th Annual International Conference on Digital Government Research: “Politicians have high expectations for commercial open data use. Yet, companies appear to challenge the assumption that open data can be used to create competitive advantage, since any company can access open data and since open data use requires scarce resources. In this paper we examine commercial open data use for creating competitive advantage from the perspective of Resource Based Theory (RBT) and Resource Dependency Theory (RDT). Based on insights from a scenario, interviews and a survey and from RBT and RDT as a reference theory, we derive seven propositions. Our study suggests that the generation of competitive advantage with open data requires a company to have in-house capabilities and resources for open data use. The actual creation of competitive advantage might not be simple. The propositions also draw attention to the accomplishment of unique benefits for a company through the combination of internal and external resources. Recommendations for further research include testing the propositions….(More)”

Data (v.)

Curated on June 1, 2015October 24, 2018 by Stefaan Verhulst

Jer Thorp in Journal 001 of The Office for Creative Research and Medium: “I data you, you data me. They data us, we data them.

As your Concise Oxford sails toward me from across the room, let’s take some time to consider the arguments:

The word data has been in a pronounced flux over the last ten years, as its role and function has been redefined by technology and culture. A decade ago, data was firmly a plural noun. Specifically, it was the plural of datum– one datum, two data. Back then, you could point and laugh at the data amateurs because they would say ‘data is’ rather than ‘data are’. Of course, those data newbies went on to form companies, make software, build databases, write books and give TED talks. And slowly, data did turn into a particular kind of singular: it has become, commonly, a mass noun…..

Data is not inert, yet its perceived passivity is one of its most dangerous properties. When we are warned that a government is collecting data about its citizens, we may be underwhelmed specifically because this act of collection seems to be so harmless, so indifferent. But of course data is not collected and then left alone: it is used as a substrate for decision making; and as an instrument for differentiation, discrimination and damage. Putting an active form of the word data into common parlance could serve as a reminder that the systems of data collection and uses are humming with capacity for influence, action and violence.

Making data a verb also exposes to us the power imbalances that have kept our collective endeavours drastically off-kilter. Grammatically speaking, data-as-verb would present a number of possibilities for subject/object combinations:

I data you. You data me. We data you. You data us. They data me. They data us. We data them.

Exposed to this rich possibility of cause and effect, the common usages of data today become strikingly narrow: in our lived data experiences we are objects, rather than subjects. Google reads our every e-mail, placing us ingloriously in marketing buckets based on what we write to our friends, colleagues and lovers. Uber’s algorithms note our late night voyages asrecords of romantic trysts. They data us, then they data us again.

Even the innocent fitness tracker, on paper an embodiment of ‘I data myself’ isn’t so much about quantified self as it is about quantified selves, less a tool for individuals to track their own beating hearts than a system to find an aggregated 24 year old Bay Area resident that can be marketed against. These devices are exciting toys for runners and walkers but also for lawyers, who have found in them a new way to argue against claims of personal injury.

Yet there is plenty of potential for us to data. Last year we built Floodwatch, a browser based tool that allows users to track the web advertising profiles that are being authored about them— empowering individuals to track the trackers. Mapping Police Violence, a project by Ferguson activists@samsway @Nettaaaaaaaa and @deray, keeps a record of every black American killed by police in the USA. In doing so, the project reminds us how powerful the simple act of data collection can be, particularly when that data is something that the powerful don’t want us to see.

These projects give us a glimpse of what can happen if we abandon our idea of data as an innocent, passive noun. By embracing the new verbal form of data, we might better understand its potential for action, and in turn move beyond our own prescribed role as the objects in data sentences.

In doing so, perhaps we can imagine a future perfect for data, where not only will they have dataed us, we will have dataed them. A future, perhaps, where we all data together….(More)”

Open data could save the NHS hundreds of millions, says top UK scientist

Curated on May 31, 2015July 18, 2019 by Stefaan Verhulst

The Guardian: “The UK government must open up and highlight the power of more basic data sets to improve patient care in the NHS and save hundreds of millions of pounds a year, Nigel Shadbolt, chairman of the Open Data Institute (ODI) has urged.

The UK government topped the first league table for open data (paywall)produced by the ODI last year but Shadbolt warns that ministers’ open data responsibilities have not yet been satisfied.

Basic data on prescription administration is now published on a monthly basis but Shadbolt said medical practitioners must be educated about the power of this data to change prescribing habits across the country.

Other data sets, such as trusts’ opening times, consultant lists and details of services, that are promised to make the NHS more accessible are not currently available in a form that is machine-readable.

“These basic sets of information about the processes, the people and places in the health system are all fragmented and fractured and many of them are not available as registers that you can go to,” Shadbolt said.

“Whenever you talk about health data people think you must be talking about personal data and patient data and there are issues, obviously, of absolutely protecting privacy there. But there’s lots of data in the health service that is not about personal patient data at all that would be hugely useful to just have available as machine-readable data for apps to use.”

The UK government has led the way in recent years in encouraging transparency and accountability within the NHS by opening league tables. The publication of league tables on MRSA was followed by a 76-79% drop in infections.

Shadbolt said: “Those hospitals that were worst in their league table don’t like to be there and there was a very rapid diffusion of understanding of best practice across them that you can quantify. It’s many millions of pounds being saved.”

The artificial intelligence and open data expert said the next big area for open data improvement in the NHS is around prescriptions.

Shadbolt pointed to the publication of data about the prescription of statins,which has helped identify savings worth hundreds of millions of pounds: “There is little doubt that this pattern is likely to exist across the whole of the prescribing space.”…(More)”

The Open Seventeen

Curated on May 29, 2015May 29, 2019 by Stefaan Verhulst

“Crowdsourcing the Verification of the Sustainable Development Goals with Open Data : In 2015, the United Nations is announcing seventeen Sustainable Development Goals (SDGs) for the world. Success at implementing the SDGs by 2030 could put the planet on the right course for the rest of the century. Failure could result in a breakdown of trust in global initiatives and cynical pursuit of self-interest by nations and corporations.

One way to ensure SDGs are achieved is to establish an independent means for verifying that all stakeholders – governments, corporations, NGOs and international organisations – live up to their promises. This requires harnessing the grassroots efforts of concerned citizens on a global scale.

To ignite this effort, ONE– in collaboration with the Citizen Cyberscience Centre and the Crowdcrafting platform for open research – is launching The Open Seventeen, a challenge to develop crowdsourcing projects that tackle SDGs using open data.

How does this challenge work?

You’ll find a big blue button further down this page. Use this to pitch a crowdsourcing project that tackles any of the 17 SDGs, at either a local, regional or global level, and tell us what open data set could be analysed for this purpose.

To inspire you, we’ve provided below some >examples of crowdsourcing projects that have already been tackling different aspects of the SDGs, from deforestation to corruption, and from drought to disease. Projects proposed for the challenge should have clear and realistic goals, and build on existing open data sets.

ONE and its partners will select three proposals and create crowdsourcing projects based on these. The winners and their projects will be profiled by ONE in upcoming international events related to the launch of the SDGs. Your project could inspire the world….

What can you do with open data to help verify SDGs? Have a look at what citizens have already created using the open source technology PyBossa that powers the Crowdcrafting platform and other crowdsourcing projects….(More)”

Aligning Supply and Demand for Better Governance

Curated on May 28, 2015July 19, 2019 by Stefaan Verhulst

Findings regarding Open Data in the Open Government Partnership: “Many have predicted that open government data will lead to major gains in political accountability, generate economic value, and improve the quality of government services. Yet, there is a growing consensus among practitioners and experts that, for open data reforms to have strong governance, economic, and social impacts, reforms must do more than make data available and reusable. Government reforms ultimately must aim to provide data that is useful and used. There may be a high opportunity cost to investing in open data in the place of other useful governance reforms….

This paper identifies strong performances and gaps in aligning open data supply and demand. Findings from action plans and IRM reporting reveal the following trends:

OGP countries are making more open data commitments in their national action plans, both in absolute numbers and in percentage. This could be good for open data advocates, but may come at the expense of other open government approaches that may be more effective at countering excessive secrecy and corruption.
Open data commitments emphasize government supply of data and government coordination mechanisms over identifying and stimulating public demand for data.
Among a smaller group of countries, a growing number of commitments aim to align supply and demand by reforming the regulatory framework and by setting up mechanisms to ensure greater demand, such as participatory prioritization processes in which government solicits public input on which data sets to release. However, typical OGP action plans do not show a distinct move toward establishing or implementing the right to request data.
There is some evidence that sector-specific approaches to open data see higher rates of implementation than crosscutting and whole-of-government approaches to open data. Commitments emphasize data on budgets, health, natural resources, and aid…. (More)”

The Age of Every Building in Los Angeles, Mapped

Curated on May 28, 2015August 3, 2018 by Stefaan Verhulst

Laura Bliss at CityLab: “A fascinating resource for lovers of city planning, made possible by open data.

Construction in Los Angeles may have exploded during the postwar era, but as a new interactive map shows, the wide age range of its buildings might surprise you.

Using open data from local governments, built: LA visualizes the age of roughly 3 million buildings across L.A. County constructed between 1890 and 2008. Drag your mouse to explore the vast web of communities and neighborhoods, hover over individual properties to discover birth years, and double click to zoom in further.

Perhaps best of all, hit the rainbow stopwatch to view a decade-by-decade timelapse of development across the county. The city’s core, in particular, clusters together buildings of century-spanning generations, while suburbs and communities to the east and west tend to represent just one or two decades of development….(More).”

Tracking Employment Shocks Using Mobile Phone Data

Curated on May 28, 2015October 24, 2018 by Stefaan Verhulst

Paper by Jameson L. Toole et al.: “Can data from mobile phones be used to observe economic shocks and their consequences at multiple scales? Here we present novel methods to detect mass layoffs, identify individuals affected by them, and predict changes in aggregate unemployment rates using call detail records (CDRs) from mobile phones. Using the closure of a large manufacturing plant as a case study, we first describe a structural break model to correctly detect the date of a mass layoff and estimate its size. We then use a Bayesian classification model to identify affected individuals by observing changes in calling behavior following the plant’s closure. For these affected individuals, we observe significant declines in social behavior and mobility following job loss. Using the features identified at the micro level, we show that the same changes in these calling behaviors, aggregated at the regional level, can improve forecasts of macro unemployment rates. These methods and results highlight promise of new data resources to measure micro economic behavior and improve estimates of critical economic indicators….(More)”

A Repository of Open Data Repositories: Open Data Impact Case Studies and Examples

Curated on May 28, 2015August 3, 2018 by Stefaan Verhulst

“As part of its core mission, the GovLab has been engaged in a series of ongoing efforts to build awareness and gather evidence about the value, use, and impact of open data around the world – including the GovLab’s Open Data 500.

The GovLab is currently scoping a project with Omidyar Network to build a repository of in-depth, global case studies on existing examples of open data demand, use and impact. The goal of the project is to develop a more nuanced understanding of the various processes and factors underlying the value chain of open data.

As a part of our literature review in undertaking this scoping project, and in time for the 3rd International Open Data Conference, we first mapped several repositories of open data cases and examples that may serve as an empirical foundation for further case-studies.

Below is a non-exhaustive list of organizations that have compiled open data case study repositories in a complementary fashion.

LET US KNOW if you are aware of other compilations of open data examples and case studies we should include as to complete the below overview… by emailing Stefaan Verhulst (stefaan at thegovlab.org).

1. Open Data Case Study Repositories
2. Open Data Portal Repositories
3. Open Data Intermediary Repositories“

Protecting Privacy in Data Release

Curated on May 27, 2015August 3, 2018 by Stefaan Verhulst

Book by Giovanni Livraga: “This book presents a comprehensive approach to protecting sensitive information when large data collections are released by their owners. It addresses three key requirements of data privacy: the protection of data explicitly released, the protection of information not explicitly released but potentially vulnerable due to a release of other data, and the enforcement of owner-defined access restrictions to the released data. It is also the first book with a complete examination of how to enforce dynamic read and write access authorizations on released data, applicable to the emerging data outsourcing and cloud computing situations. Private companies, public organizations and final users are releasing, sharing, and disseminating their data to take reciprocal advantage of the great benefits of making their data available to others. This book weighs these benefits against the potential privacy risks. A detailed analysis of recent techniques for privacy protection in data release and case studies illustrate crucial scenarios. Protecting Privacy in Data Release targets researchers, professionals and government employees working in security and privacy. Advanced-level students in computer science and electrical engineering will also find this book useful as a secondary text or reference….(More)”