“Data on the Web” Best Practices


W3C First Public Working Draft: “…The best practices described below have been developed to encourage and enable the continued expansion of the Web as a medium for the exchange of data. The growth of open data by governments across the world [OKFN-INDEX], the increasing publication of research data encouraged by organizations like the Research Data Alliance [RDA], the harvesting and analysis of social media, crowd-sourcing of information, the provision of important cultural heritage collections such as at the Bibliothèque nationale de France [BNF] and the sustained growth in the Linked Open Data Cloud [LODC], provide some examples of this phenomenon.

In broad terms, data publishers aim to share data either openly or with controlled access. Data consumers (who may also be producers themselves) want to be able to find and use data, especially if it is accurate, regularly updated and guaranteed to be available at all times. This creates a fundamental need for a common understanding between data publishers and data consumers. Without this agreement, data publishers’ efforts may be incompatible with data consumers’ desires.

Publishing data on the Web creates new challenges, such as how to represent, describe and make data available in a way that it will be easy to find and to understand. In this context, it becomes crucial to provide guidance to publishers that will improve consistency in the way data is managed, thus promoting the re-use of data and also to foster trust in the data among developers, whatever technology they choose to use, increasing the potential for genuine innovation.

This document sets out a series of best practices that will help publishers and consumers face the new challenges and opportunities posed by data on the Web.

Best practices cover different aspects related to data publishing and consumption, like data formats, data access, data identification and metadata. In order to delimit the scope and elicit the required features for Data on the Web Best Practices, the DWBP working group compiled a set of use cases [UCR] that represent scenarios of how data is commonly published on the Web and how it is used. The set of requirements derived from these use cases was used to guide the development of the best practices.

The Best Practices proposed in this document are intended to serve a more general purpose than the practices suggested in Best Practices for Publishing Linked Data [LD-BP] since it is domain-independent and whilst it recommends the use of Linked Data, it also promotes best practices for data on the web in formats such as CSV and JSON. The Best Practices related to the use of vocabularies incorporate practices that stem from Best Practices for Publishing Linked Data where appropriate….(More)”

Why Information Grows: The Evolution of Order, from Atoms to Economies


Forthcoming book: “In Why Information Grows, rising star César Hidalgo offers a radical interpretation of global economics. While economists often turn to measures like GDP or per-capita income, César Hidalgo turns to information theory to explain the success or failure of a country’s economic performance. Through a radical rethinking of what the economy is, Hidalgo shows that natural constraints in our ability to accumulate knowledge, knowhow and information explain the evolution of social and economic complexity. This is a rare tour de force, linking economics, sociology, physics, biology and information theory, to explain the evolution of social and economic systems as a consequence of the physical embodiment of information in a world where knowledge is quite literally power.
César Hidalgo leads the Macro Connections group at the MIT Media Lab. A trained statistical physicist and an expert on Networks and Complex Systems, he also has extensive experience in the field of economic development and has pioneered research on how big data impacts economic decision-making….(More)”

The Metrics Myth


Jed Emerson at BlendedValue: “…Simply because our present, dominant approaches to assessing metrics fall short of our task—How can one measure the full value of a life saved or possible future changed? What, ultimately, is the real impact and value created through the allocation of our capital?—we persist because we know two things:
First, we know we are on a Hero’s Journey of inquiry and innovation. Too often we forget the present system of tracking financial performance (the basis upon which trillions of dollars flow through global capital markets and the foundation upon which too many of us build our lives) is the outcome of over sixty years of development, refinement and debate. In the U.S., GAAP and FASB (the fundamental building blocks of mainstream business and finance) were not created until after World War II; and it was not until the creation of the Environmental Protection Agency in 1970 that business and many nonprofits began tracking and assessing environmental metrics on a consistent basis. And while social metrics have always been a part of the parlance of government and philanthropic funding, many foundations and social investors have not sought to weave performance assessment into their process of allocating funds until recent decades. It is for these reasons I am quite comfortable with the reality that those creating the metrics and evaluation frameworks of tomorrow will need another twenty years to build what is not yet ours, for I know it will come in good time.
Second, we are creating Total Portfolio Reporting frameworks to track the returns of unified investing strategies (capable of reflecting the aggregate performance of philanthropic, social and environmental value creation) because we know it can be done—and indeed, we see the metrics mist clearing by the year.
As initiatives such as

  • The Principles for Responsible Investing’s Integrated Reporting work,
  • the recently re-organized SROI Network,
  • the Sustainable Accounting Standards Board,
  • B-Lab’s B-Analytics framework,
  • CapRock’s iPar system,
  • the ANDE Metrics Working Group

and a variety of grassroots initiatives coming together around various sets of common reporting for assessing community impact, we find one can create a balance between our aspirations for a better world and the challenges of demarcating our progress toward that goal.
In the end, I hate the whole metrics debate.
It is repetitive, mind numbing and distracting from the critical task of fighting the forces presently destroying our societies and planet. Each time some ignorant (not stupid, mind you, and yet, not fully aware of what they do not know; they are quite rightly, ignorant) newcomer enters the discussion, we’re all expected to re-group and re-define concepts and issues well documented and explored in the past. The continual, mindless reminders that not everything that counts can be counted leave me frustrated and even angry at some who for reasons beyond me don’t seem to understand that such now trite insights were the very starting place of this journey well more than 25 years ago and that, indeed, as newcomers they are as far behind the current exploration as we are from our goal.
Yet, we make progress despite our doubts and complications.
We advance the practice of both impact investing and performance measurement one step forward and two steps back as the current “knowledge” of the crowd actually pulls us backward to previous thinking and practice. And we know the appropriate application of metrics bring meaning and insight just as they demonstrate the limitations of such efforts….(More)”

Data-Driven Development Pathways for Progress


Report from the World Economic Forum: “Data is the lifeblood of sustainable development and holds tremendous potential for transformative positive change particularly for lower- and middle-income countries. Yet despite the promise of a “Data Revolution”, progress is not a certainty. Lack of clarity on privacy and ethical issues, asymmetric power dynamics and an array of entangled societal and commercial risks threaten to hinder progress.
Written by the World Economic Forum Global Agenda Council on Data-Driven Development, this report serves to clarify how big data can be leveraged to address the challenges of sustainable development. Providing a blueprint for balancing competing tensions, areas of focus include: addressing the data deficit of the Global South, establishing resilient governance and strengthening capacities at the community and individual level. (PDF)”

Making emotive games from open data


Katie Collins at WIRED: “Microsoft researcher Kati London’s aim is “to try to get people to think of data in terms of personalities, relationships and emotions”, she tells the audience at the Story Festival in London. Through Project Sentient Data, she uses her background in games development to create fun but meaningful experiences that bridge online interactions and things that are happening in the real world.
One such experience invited children to play against the real-time flow of London traffic through an online game called the Code of Everand. The aim was to test the road safety knowledge of 9-11 year olds and “make alertness something that kids valued”.
The core mechanic of the game was that of a normal world populated by little people, containing spirit channels that only kids could see and go through. Within these spirit channels, everything from lorries and cars from the streets became monsters. The children had to assess what kind of dangers the monsters posed and use their tools to dispel them.
“Games are great ways to blur and observe the ways people interact with real-world data,” says London.
In one of her earlier projects back in 2005, London used her knowledge of horticulture to bring artificial intelligence to plants. “Almost every workspace I go into has a half dead plant in it, so we gave plants the ability to tell us what they need.” It was, she says, an exercise in “humanising data” that led to further projects that saw her create self aware street signs and a dynamic city map that expressed shame neighbourhood by neighbourhood depending on the open dataset of public complaints in New York.
A further project turned complaint data into cartoons on Instagram every week. London praised the open data initiative in New York, but added that for people to access it, they had to know it existed and know where to find it. The cartoons were a “lightweight” form of “civic engagement” that helped to integrate hyperlocal issues into everyday conversation.
London also gamified community engagement through a project commissioned by the Knight Foundation called Macon Money….(More)”.

Beyond Transparency


Hildy Gottlieb on “How ‘opening up’ can help organizations achieve their missions” in Stanford Social Innovation Review: “…For the past two years, Creating the Future, a social change research and development laboratory, has been experimenting to find the answer to that question. In the process, we have learned that when organizations are more open in their work, it can improve both the work itself and the results in the communities they serve.
In December 2012, Creating the Future’s board voted to open all its board and strategy meetings (including meetings for branding, resource development, and programming) to anyone who wished to attend and participate.
Since our organization is global, we hold our meetings via Google Hangout, and community members participate via a dedicated Twitter hashtag. Everyone is encouraged to participate—through asking questions and sharing observations—as if they are board members, whether or not they are.
This online openness mirrors the kind of inclusive, participatory culture that many grassroots neighborhood groups have fostered in the “real world” for decades. As we’ve studied those groups and experienced open engagement for ourselves, here are some of the things we’ve learned that can apply to any organization, whether they are working at a distance or in person.

What Being Open Makes Possible

1. Being open adds new thinking to the mix. We can’t overstate this obvious practical benefit for every strategic issue an organization considers. During a recent discussion of employee “paid time off” policies, a participant with no formal relationship to the organization powerfully shifted the board’s conversation and perspectives away from the rigidity of a policy, focusing instead on the values of relationships, outcomes, buy-in, and adaptability. That input helped the board clarify its intent. It ultimately chose to scrap the idea of a certain amount of “paid time off,” in favor of an outcomes-based approach that provides flexibility for both employees and their supervisors.
2. Being open flattens internal communications. Opening all our meetings has led to cross-pollination across every aspect of our organization, providing an ongoing opportunity for sharing information and resources, and for developing everyone’s potential as leaders….
3. Being open walks the talk of the engaged communities we want to see. From the moment we opened the doors to our meetings, people have walked in and found meaningful ways to become part of our work. …
It seems so simple: If we want to engage the community, we just need to open the doors and invite people in!
4. Being open creates meaningful inclusion. Board diversity initiatives are intended to ensure that an organization’s decision-making reflects the experience of the community it serves. In reality, though, there can never be enough seats on a board to accomplish inclusion beyond what often feels like tokenism. Creating the Future’s board doesn’t have to worry about representing the community, because our community members represent themselves. And while this is powerful in an online setting, it is even more powerful when on-the-ground community members are part of a community-based organization’s decision-making fabric.
5. Being open creates more inclusive accountability. During a discussion of cash flow for our young organization, one concerned board member wondered aloud whether adhering to our values might be at cross-purposes with our survival. Our community members went wild via Twitter, expressing that it was that very code of values that drew them to the work in the first place. That reminder helped board members remove scarcity and fear from the conversation so that they could base their decision on what would align with our values and help accomplish the mission.
The needs of our community directly impacted that decision—not because of a bylaws requirement for “voting members” but simply because we encouraged community members to actively take part in the conversation….(More)”

UNDP Eyes Ukraine’s Damaged Buildings With Crowdsourcing, Mobile App


Aida Akl at VOA TECHtonics: “The crisis that plunged east Ukraine into war in November 2013 has damaged or destroyed critical infrastructure and limited access to areas caught up in fighting between Ukraine’s government forces and pro-Russian rebels. In order to assess damage, the United Nations Development Program (UNDP) turned to crowdsourcing to help restore social infrastructure as part of a United Nations, European Union and World Bank Recovery and Peacebuilding Assessment for Eastern Ukraine….
Using an interactive map, ReDonbass, and a mobile app (Android and iOS), people of Donetsk and Lugansk regions can report damaged homes, hospitals, schools, kindergartens or libraries.
A screenshot of UNDP's app and crowdsourcing map for east Ukraine damage assessment. (UNDP)
The easy-to-use interactive tool allows any person with a mobile phone and access to the Internet to download the most accurate data about the building in its location, photographs of the damage, and the status of the recovery phase. After that, the Ukrainian government and international donors will use the data to better plan reconstruction.
Information from the map will contribute to an ongoing Recovery and Peacebuilding Assessment for Eastern Ukraine. UNDP is part of the assessment that brings the United Nations, the European Union and the World Bank Group together to analyze the impact of the conflict and offer recommendations for short-term recovery and peacebuilding over the next two years….The map has also proven to be very useful for the experts from the Ukrainian Government and a recently launched UNDP-Government of Japan project aiming to restore critical infrastructure for social care and services. They [are] using it to identify schools, orphanages, elderly homes, and social services centers that need to be restored and rebuilt first….(More)”.

Amid Open Data Push, Agencies Feel Urge for Analytics


Jack Moore at NextGov: “Federal agencies, thanks to their unique missions, have long been collectors of valuable, vital and, no doubt, arcane data. Under a nearly two-year-old executive order from President Barack Obama, agencies are releasing more of this data in machine-readable formats to the public and entrepreneurs than ever before.
But agencies still need a little help parsing through this data for their own purposes. They are turning to industry, academia and outside researchers for cutting-edge analytics tools to parse through their data to derive insights and to use those insights to drive decision-making.
Take the U.S. Agency for International Development, for example. The agency administers U.S. foreign aid programs aimed at ending extreme poverty and helping support democratic societies around the globe.
Under the agency’s own recent open data policy, it’s started collecting reams of data from its overseas missions. Starting Oct. 1, organizations doing development work on the ground – including through grants and contracts – have been directed to also collect data generated by their work and submit it back to agency headquarters. Teams go through the data, scrub it to remove sensitive material and then publish it.
The data spans the gamut from information on land ownership in South Sudan to livestock demographics in Senegal and HIV prevention activities in Zambia….The agency took the first step in solving that problem with a Jan. 20 request for information from outside groups for cutting-edge data analytics tools.
“Operating units within USAID are sometimes constrained by existing capacity to transform data into insights that could inform development programming,” the RFI stated.
The RFI queries industry on their capabilities in data mining and social media analytics and forecasting and systems modeling.
USAID is far from alone in its quest for data-driven decision-making.
A Jan. 26 RFI from the Transportation Department’s Federal Highway Administration also seeks innovative ideas from industry for “advanced analytical capabilities.”…(More)”

Open data: how mobile phones saved bananas from bacterial wilt in Uganda


Anna Scott in The Guardian: “Bananas are a staple food in Uganda. Ugandans eat more of the fruit than any other country in the world. Each person eats on average 700g (about seven small bananas) a day, according to the International Food Policy Research Institute, and they provide up to 27% of the population’s calorie intake.
But since 2002 a disease known as banana bacterial wilt (BBW) has wiped out crops across the country. When plants are infected, they cannot absorb water so their leaves start to shrivel and they eventually die….
The Ugandan government drew upon open data – data that is licensed and made available for anyone to access and share – about the disease made available by Unicef’s community polling project Ureport to deal with the problem.
Ureport mobilises a network of nearly 300,000 volunteers across Uganda, who use their mobiles to report on issues that affect them, from polio immunisation to malaria treatment, child marriage, to crop failure. It gathers data via SMS polls and publishes the results as open sourced, open datasets.
The results are sent back to community members via SMS along with treatment options and advice on how best to protect their crops. Within five days of the first SMS being sent out, 190,000 Ugandans had learned about the disease and knew how to save bananas on their farms.
Via the Ureport platform, the datasets can also be accessed in real-time by community members, NGOs and the Ugandan government, allowing them to target treatments to where they were needed most. They are also broadcast on radio shows and analysed in articles produced by Ureport, informing wider audiences of the scope and nature of the disease and how best to avoid it….
A report published this week by the Open Data Institute (ODI) features stories from around the world which reflect how people are using open data in development. Examples range from accessing school results in Tanzania to building smart cities in Latin America….(More).”

Agile Government Handbook


The Agile Government Handbook is a community project to help government learn and adopt Agile development practices….

The U.S. federal TechFAR Handbook highlights six key reasons why government should adopt Agile for IT project management and development.

  • Improvement in investment manageability and budgetary feasibility
  • Reduction of overall risk
  • Frequent delivery of usable capabilities that provide value to customers more rapidly
  • Increased flexibility
  • Creation of new opportunities for small businesses
  • Greater visibility into contractor performance

(Source: The TechFAR Handbook for Procuring Digital Services Using Agile Processes)…

18F Consulting Co-Managing Director Robert L. Read presents on “How to Run an Agile Project in Government.”