Open Data Research Announced


WWW Foundation Press Release: “Speaking at an Open Government Partnership reception last night in London, Sir Tim Berners-Lee, founder of the World Wide Web Foundation (Web Foundation) and inventor of the Web, unveiled details of the first ever in-depth study into how the power of open data could be harnessed to tackle social challenges in the developing world. The 14-country study is funded by Canada’s International Development Research Centre (IDRC) and will be overseen by the Web Foundation’s world-leading open data experts. An interim progress update will be made at an October 2013 meeting of the Open Government Partnership, with in-depth results expected in 2014…

Sir Tim Berners-Lee, founder of the World Wide Web Foundation and inventor of the Web said:

“Open Data, accessed via a free and open Web, has the potential to create a better world. However, best practice in London or New York is not necessarily best practice in Lima or Nairobi.  The Web Foundation’s research will help to ensure that Open Data initiatives in the developing world will unlock real improvements in citizens’ day-to-day lives.”

José M. Alonso, program manager at the World Wide Web Foundation, added:

“Through this study, the Web Foundation hopes not only to contribute to global understanding of open data, but also to cultivate the ability of developing world researchers and development workers to understand and apply open data for themselves.”

Further details on the project, including case study outlines, are available here: http://oddc.opendataresearch.org/

From Open Data to Information Justice


Paper by Jeffrey Johnson for the Annual Conference of the Midwest Political Science Association: “This paper argues for subsuming the question of open data within a larger question of information justice. I show that there are several problems of justice that emerge as a consequence of opening data to full public accessibility, and are generally a consequence of the failure of the open data movement to understand the constructed nature of data. I examine three such problems: the embedding of social privilege in datasets as the data is constructed, the differential capabilities of data users (especially differences between citizens and “enterprise” users), and the norms that data systems impose through their function as disciplinary systems.
In each case I show that open data has the quite real potential to exacerbate rather than alleviate injustices. This necessitates a theory of information justice. I briefly suggest two complementary directions in which such a theory might be developed: one leading toward moral principles that can be used to evaluate the justness of data practices, and another exploring the practices and structures that a social movement promoting information justice might pursue.”

Procurement needs better data now


Howard Rolfe, procurement director for East of England NHS Collaborative Procurement Hub, in The Guardian: “Knowledge management is fundamental to any organisation and procurement in the NHS is no exception. Current systems are not joined up and don’t give the level of information that should be expected. Management in many NHS trusts cannot say how effective procurement is within their organisation because they don’t have a dashboard of information that tells them, for example, the biggest spend areas, who is placing the order, what price is paid and how that price compares.
Systems now exist that could help answer these questions and increase board and senior management focus on this area of huge spend….The time for better data is now, the opportunity is at the top of political and management agendas and the need is overwhelming. What is the solution? The provision of effective knowledge management systems is key and will facilitate improvements in information, procurement and collaborative aggregation by providing greater visibility of spend and reduction of administrative activity.”

The Rise of Big Data


Kenneth Neil Cukier and Viktor Mayer-Schoenberger in Foreign Affairs: “Everyone knows that the Internet has changed how businesses operate, governments function, and people live. But a new, less visible technological trend is just as transformative: “big data.” Big data starts with the fact that there is a lot more information floating around these days than ever before, and it is being put to extraordinary new uses. Big data is distinct from the Internet, although the Web makes it much easier to collect and share data. Big data is about more than just communication: the idea is that we can learn from a large body of information things that we could not comprehend when we used only smaller amounts.”
Video: Gideon Rose, editor of Foreign Affairs, sits down with Kenneth Cukier, data editor of The Economist.

China identified as main source behind cyber-espionage


Washington Post: “Analyses of hundreds of documented data breaches found that hackers affiliated with the Chinese government were by far the most energetic and successful cyberspies in the world last year, according to a report to be issued Tuesday by government and industry investigators.
Although hackers with financial motives are the most common source of data breaches worldwide, China dominated the category of state-affiliated cyber-espionage of intellectual property, said the 2013 Data Breach Investigations Report. The report was issued by Verizon’s RISK Team and 18 partners, including officials from the United States and several foreign governments.
Of 120 incidents of government cyber-espionage detailed in the report, 96 percent came from China; the source of the other 4 percent was unknown, it said.”

Reinvent Regulation


Reinvent Roundtable, taking place on April 23, 2013 at 11:00 am PT: “Tim O’Reilly has some big ideas about how to dramatically modernize the entire notion of government regulation, particularly “algorithmic regulation” that harnesses computer power, much like top tech companies in Silicon Valley, to oversee the financial industry, which is using those same tools. This roundtable features some top talent from the Valley to apply their brains to figuring out how we could reinvent much more iterative regulation that constantly gets refined through analyzing data and processing feedback loops – much like Google refines its search techniques. In fact, we’ll have a top person from Google Search as well as someone from the US Treasury Department to work on these ideas.”
http://reinventors.net/roundtables/reinvent-regulation/

Kickstarting People


Dave Girouard, co-founder and CEO of Upstart in Wired: “A total $2.7 billion was pledged by individual donors through crowdfunding last year, according to reports by research firm Massolution — up 81% from the year before. This space is only going to heat up further when SEC rules for the JOBS Act are released this year, paving the way for equity crowdfunding….
Crowdfunders and angel investors, while not purely philanthropic, share the common desire to participate and be involved in the creation of something new. Put another way, it’s more about cause than cash (a phrase I picked up from Kiva co-founder Jessica Jackley). And that desire is the disruptive “feature” of crowdfunding….
Predicting success for a newbie startup is notoriously difficult. But investing in people is one of the only ways to get a risk/return/volatility investment profile that actually works. It’s a model that could also appeal to quant (not just cause) investors as well…companies like Upstart (which I founded) and Pave make it easy for people to invest in other people.
Why is investing in people a safer bet? Because there are clear — and measurable — signals reflecting their accomplishments and hinting at their potential. It’s not unlike the logic used by big companies or universities faced with countless candidates, by recruiting firms and talent agents, and others. By using data and algorithms — in this case, a sophisticated regression model that considers variables like school, area of study, standardized test scores, internships, job offers — we can statistically predict a person’s future income.
Such a model allows a person to “borrow” from his or her future self.”

White House: Unleashing the Power of Big Data


Tom Kalil, Deputy Director for Technology and Innovation at OSTP: “As we enter the second year of the Big Data Initiative, the Obama Administration is encouraging multiple stakeholders, including federal agencies, private industry, academia, state and local government, non-profits, and foundations to develop and participate in Big Data initiatives across the country. Of particular interest are partnerships designed to advance core Big Data technologies; harness the power of Big Data to advance national goals such as economic growth, education, health, and clean energy; use competitions and challenges; and foster regional innovation.
The National Science Foundation has issued a request for information encouraging stakeholders to identify Big Data projects they would be willing to support to achieve these goals.  And, later this year, OSTP, NSF, and other partner agencies in the Networking and Information Technology R&D (NITRD) program plan to convene an event that highlights high-impact collaborations and identifies areas for expanded collaboration between the public and private sectors.”

Work-force Science and Big Data


Steve Lohr from the New York Times: “Work-force science, in short, is what happens when Big Data meets H.R….Today, every e-mail, instant message, phone call, line of written code and mouse-click leaves a digital signal. These patterns can now be inexpensively collected and mined for insights into how people work and communicate, potentially opening doors to more efficiency and innovation within companies.

Digital technology also makes it possible to conduct and aggregate personality-based assessments, often using online quizzes or games, in far greater detail and numbers than ever before. In the past, studies of worker behavior were typically based on observing a few hundred people at most. Today, studies can include thousands or hundreds of thousands of workers, an exponential leap ahead.

“The heart of science is measurement,” says Erik Brynjolfsson, director of the Center for Digital Business at the Sloan School of Management at M.I.T. “We’re seeing a revolution in measurement, and it will revolutionize organizational economics and personnel economics.”

The data-gathering technology, to be sure, raises questions about the limits of worker surveillance. “The larger problem here is that all these workplace metrics are being collected when you as a worker are essentially behind a one-way mirror,” says Marc Rotenberg, executive director of the Electronic Privacy Information Center, an advocacy group. “You don’t know what data is being collected and how it is used.”

Taking Open Government to the Next Level


Carl Fillichio, who heads the Labor Department’s Office of Public Affairs, at (Work in Progress): “Since we published a department-wide API two years ago, developers across the country have used it to create apps that educate users about workplace safety and health, employers’ compliance with wage and hour laws, and improving employment opportunities for disabled workers, just to name a few!
Releasing data through an API was a big step forward, but it was not exactly groundbreaking.  However, since then, my team has been working hard to develop software development kits that are truly innovative because they make using our API even easier.
These kits (also known as SDKs) contain application code for six different platforms − iOS, Android, Blackberry, .Net, PHP and Ruby − that anyone creating a mobile or Web-based app using our data could incorporate. By using the kits, experienced developers will save time and novice developers will be able to work with DOL data in just a few minutes…. All of these kits can be downloaded from our developer site. Additionally, in keeping with the federal digital government strategy, each has been published as an open source project on github, a popular code-sharing site. For a list of federal APIs that are supported by our kits, check the github repository’s wiki page. This list will be updated as the kits are tested with additional federal APIs.”