DataViva: a Big Data Engine for the Brazilian Economy


Piece by André Victor dos Santos Barrence and Cesar A. Hidalgo: “The current Internet paradigm in which one can search about anything and retrieve information is absolutely empowering. We can browse files, websites and indexes and effortlessly reach a good amount of information. Google, for instance, was explicitly built on a library analogy available to everyone. However, it is a world where information that should be easily accessible is still hidden in unfriendly databases, and where the best-case scenario is finding a few snippets of information embedded within the paragraphs of a report. But is this the way it should be? Or is this just the world we are presently stuck with?
The last decade has been particularly marked by an increasing hype on big data and analytics, mainly fueled by those who are interested in writing narratives on the topic but not necessarily coding about it, even when data itself is not the problem.
Let’s take the case of governments. Governments have plenty of data and in many cases it is actually public (at least in principle). Governments “know” how many people work in every occupation, in every industry and in every location; they know their salaries, previous employers and education history. From a pure data perspective all that is embedded in tax, social security records or annual registrations. From a more pragmatic perspective, it is still inaccessible and hidden even when it is legally open to the public. We live in a world where the data is there, but where the statistics and information are not.
The state government of Minas Gerais in Brazil (the third-largest economy in the country, with a territory larger than France and 20 million inhabitants) took an important step in that direction by releasing DataViva.info, a platform that opens data for exports and occupations for the entire formal sector of the Brazilian economy through more than 700 million interactive visualizations. Instead of poorly designed tables and interfaces, it guides users to answer questions or freely discover locations, industries and occupations in Brazil that are of interest to them. DataViva allows users to explore simple questions such as the evolution of exports in the last decade for each of the 5,567 municipalities in the country, or highly specific queries, for instance, the average salaries paid to computer scientists working in the software development industry in Belo Horizonte, the state capital of Minas.
DataViva’s visualizations are built on the idea that the industrial and economic activity development of locations is highly path dependent. This means that locations are more likely to be successful at developing industries and activities that are related to the ones already existing, since it indicates the existence of labor inputs, and other capabilities, that are specific and that can often be redeployed to a few related industries and activities. Thus, it informs the processes by which opportunities can be explored and prospective pathways for greater prosperity.
The idea that information is key for the functioning of economies is at least as old as Friedrich Hayek’s seminal paper The Use of Knowledge in Society from 1945. According to Hayek, prices help coordinate economic activities by providing information about the wants and needs of goods and services. Yet, the price information can only serve as a signal as long as people know those prices. Maybe the salaries for engineers in the municipality of Betim (Minas Gerais) are excellent and indicate a strong need for them? But who would have known how many engineers there are in Betim and what their average salaries are?
But the remaining question is: why is Minas Gerais making all of this public data easily available? More than resorting to the contemporary argument of open government, Minas understands this is extremely valuable information for investors searching for business opportunities, entrepreneurs pursuing new ventures or workers looking for better career prospects. Lastly, the ultimate goal of DataViva is to provide a common ground for open discussions, moving away from the information-deprived idea of central planning and into a future where collaborative planning might become the norm. It is a highly creative attempt to renew public governance for the 21st century.
Despite being a relatively unknown state outside of Brazil, by releasing a platform such as DataViva, Minas is providing a strong signal about where in the world governments are really pushing forward innovation rather than simply admiring and copying solutions that used to come from trendsetters in the developed world. It seems like real innovation isn’t necessarily taking place in Washington, Paris or London anymore.”
 

Making Europe's cities smarter


Press Release: “At a conference today hosted by the European Commission, city leaders, CEOs and civil society leaders discussed the actions outlined in the “Smart Cities Strategic Implementation Plan” and how to put them into practice. The Commission announced that it will launch an ‘Invitation for Smart City and Community Commitments’ in spring 2014 to mobilise work on the action plan’s priorities. The plan is part of Europe’s fifth “Innovation Partnership”.
Commission Vice-President Siim Kallas, in charge of transport, said: “I am very pleased to see transport operators, telecoms companies, vehicle manufacturers, city planners, energy companies and researchers all gathered in one room to discuss the future of our cities. The Smart Cities initiative is a great opportunity to make changes happen for less congestion and better business opportunities in our cities. We need to keep up the momentum and move from plan to action now.”
Commission Vice-President Neelie Kroes, responsible for the Digital Agenda, said: “The future of infrastructure and city planning will be based on integrating ICT systems and using big data to make our cities better places to live and work. We need to base those new systems on open standards for hardware, software, data and services which this European Innovation Partnership will develop.”
Günther H. Oettinger, EU Commissioner for energy, said: “The European Innovation Partnership for Smart Cities and Communities is about making investments in sustainable development in as many cities as possible. Creating equal partnerships between cities and companies based on synergies between ICT, energy and mobility will lead to projects that make a real difference in our everyday lives.”
The Commission intends to make available approximately EUR 200 million for Smart Cities and communities in the 2014-2015 budgets of the Horizon 2020 research and innovation programme, to accelerate progress and enlarge the scale of roll-out of smart cities solutions. There will also be possibilities to access the European Structural and Investment Funds.
For more information: http://ec.europa.eu/eip/smartcities/”

Big Data needs Big Theory


Geoffrey West, former President of the Santa Fe Institute: “As the world becomes increasingly complex and interconnected, some of our biggest challenges have begun to seem intractable. What should we do about uncertainty in the financial markets? How can we predict energy supply and demand? How will climate change play out? How do we cope with rapid urbanization? Our traditional approaches to these problems are often qualitative and disjointed and lead to unintended consequences. To bring scientific rigor to the challenges of our time, we need to develop a deeper understanding of complexity itself….
The digital revolution is driving much of the increasing complexity and pace of life we are now seeing, but this technology also presents an opportunity. The ubiquity of cell phones and electronic transactions, the increasing use of personal medical probes, and the concept of the electronically wired “smart city” are already providing us with enormous amounts of data. With new computational tools and techniques to digest vast, interrelated databases, researchers and practitioners in science, technology, business and government have begun to bring large-scale simulations and models to bear on questions formerly out of reach of quantitative analysis, such as how cooperation emerges in society, what conditions promote innovation, and how conflicts spread and grow.
The trouble is, we don’t have a unified, conceptual framework for addressing questions of complexity. We don’t know what kind of data we need, nor how much, or what critical questions we should be asking. “Big data” without a “big theory” to go with it loses much of its potency and usefulness, potentially generating new unintended consequences.
When the industrial age focused society’s attention on energy in its many manifestations—steam, chemical, mechanical, and so on—the universal laws of thermodynamics came as a response. We now need to ask if our age can produce universal laws of complexity that integrate energy with information. What are the underlying principles that transcend the extraordinary diversity and historical contingency and interconnectivity of financial markets, populations, ecosystems, war and conflict, pandemics and cancer? An overarching predictive, mathematical framework for complex systems would, in principle, incorporate the dynamics and organization of any complex system in a quantitative, computable framework.
We will probably never make detailed predictions of complex systems, but coarse-grained descriptions that lead to quantitative predictions for essential features are within our grasp. We won’t predict when the next financial crash will occur, but we ought to be able to assign a probability of one occurring in the next few years. The field is in the midst of a broad synthesis of scientific disciplines, helping reverse the trend toward fragmentation and specialization, and is groping toward a more unified, holistic framework for tackling society’s big questions. The future of the human enterprise may well depend on it.”

Peacekeeping 4.0: Harnessing the Potential of Big Data, Social Media, and Cyber Technologies


Chapter by John Karlsrud in “Cyberspace and International Relations: Theory, Prospects and Challenges” (edited by Jan-Frederik Kremer and Benedikt Müller): “Since the Cold War, peacekeeping has evolved from first-generation peacekeeping that focused on monitoring peace agreements, to third-generation multidimensional peacekeeping operations tasked with rebuilding states and their institutions during and after conflict. However, peacekeeping today is lagging behind the changes marking our time. Big Data, including social media, and the many actors in the field may provide peacekeeping and peacebuilding operations with information and tools to enable them to respond better, faster and more effectively, saving lives and building states. These tools are already well known in the areas of humanitarian action, social activism, and development. Also the United Nations, through the Global Pulse initiative, has begun to discover the potential of “Big Data for Development,” which may in time help prevent violent conflict. However, less has been done in the area of peacekeeping. UN member states should push for change so that the world organization and other multilateral actors can get their act together, mounting a fourth generation of peacekeeping operations that can utilize the potentials of Big Data, social media and modern technology—“Peacekeeping 4.0.” The chapter details some of the initiatives that can be harnessed and further developed, and offers policy recommendations for member states, the UN Security Council, and UN peacekeeping at UN headquarters and at field levels.”

You Are Your Data


in Slate: “We are becoming data. Every day, our smartphones, browsers, cars, and even refrigerators generate information about our habits. When we click “I agree” on terms of service, we opt in to systems in which we are known only by our data. So we need to be able to understand ourselves as data, too.
To understand what that might mean for the average person in the future, we should look to the Quantified Self community, which is at the frontier of understanding what our role as individuals in a data-driven society might look like. Quantified Self began as a Meetup community sharing personal stories of self-tracking techniques, and is now a catchall adjective to describe the emerging set of apps and sensors available to consumers to facilitate self-tracking, such as the Fitbit or Nike Fuelband. Some of the self-tracking practices of this group come across as extreme (experimenting with the correlation between butter consumption and brain function). But what is a niche interest today could be widely marketed tomorrow—and accordingly, their frustrations may soon be yours…

Instead, I propose that we should have a “right to use” our personal data: I should be able to access and make use of data that refers to me. At best, a right to use would reconcile both my personal interest in the small-scale insights and the firms’ large-scale interests in big data insights from the larger population. These interests are not in conflict with each other.
Of course, to translate this concept into practice, we need to work out matters of both technology and policy.
What data are we asking for? Are we asking for data that individuals have opted into creating, like self-tracking fitness applications? Should we broaden that definition to describe any data that refers to our person, such as behavioral data collected by cookies and gathered by third-party data brokers? These definitions will be hard to pin down.
Also, what kind of data? Just that which we’ve actively opted in to creating, or does it expand to the more hidden, passive, transactional data? Will firms exercise control over the line between where “raw” data becomes processed and therefore proprietary? If we can’t begin to define the data representation of a “step” in an activity tracker, how will we standardize access to that information?
Access to personal data also suffers from a chicken-and-egg problem right now. We don’t see greater consumer demand for this because we don’t yet have robust enough tools to make use of disparate sets of data as individuals, and yet such tools are not gaining traction without proven demand.”

White House Unveils Big Data Projects, Round Two


Information Week: “The White House Office of Science and Technology Policy (OSTP) and Networking and Information Technology R&D program (NITRD) on Tuesday introduced a slew of new big-data collaboration projects aimed at stimulating private-sector interest in federal data. The initiatives, announced at the White House-sponsored “Data to Knowledge to Action” event, are targeted at fields as varied as medical research, geointelligence, economics, and linguistics.
The new projects are a continuation of the Obama Administration’s Big Data Initiative, announced in March 2012, when the first round of big-data projects was presented.
Thomas Kalil, OSTP’s deputy director for technology and innovation, said that “dozens of new partnerships — more than 90 organizations,” are pursuing these new collaborative projects, including many of the best-known American technology, pharmaceutical, and research companies.
Among the initiatives, Amazon Web Services (AWS) and NASA have set up the NASA Earth eXchange, or NEX, a collaborative network to provide space-based data about our planet to researchers in Earth science. AWS will host much of NASA’s Earth-observation data as an AWS Public Data Set, making it possible, for instance, to crowdsource research projects.
An estimated 4.4 million jobs are being created between now and 2015 to support big-data projects. Employers, educational institutions, and government agencies are working to build the educational infrastructure to provide students with the skills they need to fill those jobs.
To help train new workers, IBM, for instance, has created a new assessment tool that gives university students feedback on their readiness for number-crunching careers in both the public and private sector. Eight universities that have a big data and analytics curriculum — Fordham, George Washington, Illinois Institute of Technology, University of Massachusetts-Boston, Northwestern, Ohio State, Southern Methodist, and the University of Virginia — will receive the assessment tool.
OSTP is organizing an initiative to create a “weather service” for pandemics, Kalil said, a way to use big data to identify and predict pandemics as early as possible in order to plan and prepare for — and hopefully mitigate — their effects.
The National Institutes of Health (NIH), meanwhile, is undertaking its “Big Data to Knowledge” (BD2K) initiative to develop a range of standards, tools, software, and other approaches to make use of massive amounts of data being generated by the health and medical research community….”
See also:
November 12, 2013 – Fact Sheet: Progress by Federal Agencies: Data to Knowledge to Action
November 12, 2013 – Fact Sheet: New Announcements: Data to Knowledge to Action
November 12, 2013 – Press Release: Data to Knowledge to Action Event

What future do you want? Commission invites votes on what Europe could look like in 2050 to help steer future policy and research planning


European Commission – MEMO: “Vice-President Neelie Kroes, responsible for the Digital Agenda, is inviting people to join a voting and ranking process on 11 visions of what the world could look like in 20-40 years. The Commission is seeking views on living and learning, leisure and working in Europe in 2050, to steer long-term policy or research planning.
The visions have been gathered over the past year through the Futurium, an online debate platform that allows policymakers to not only consult citizens, but to collaborate and “co-create” with them, and at events throughout Europe. Thousands of thinkers – from high school students to the Erasmus Students Network; from entrepreneurs and internet pioneers to philosophers and university professors – have engaged in a collective inquiry, a means of crowd-sourcing what our future world could look like.
Eleven over-arching themes have been drawn together from more than 200 ideas for the future. From today, everyone is invited to join the debate and offer their rating and rankings of the various ideas. The results of the feedback will help the European Commission make better decisions about how to fund projects and ideas that both shape the future and get Europe ready for that future….
The Futurium is a foresight project run by DG CONNECT, based on an open source approach. It develops visions of society, technologies, attitudes and trends in 2040-2050 and uses these, for example, as potential blueprints for future policy choices or EU research and innovation funding priorities.
It is an online platform developed to capture emerging trends and enable interested citizens to co-create compelling visions of the futures that matter to them.

This crowd-sourcing approach provides useful insights on:

  1. vision: where people want to go, how desirable and likely are the visions posted on the platform;
  2. policy ideas: what should ideally be done to realise the futures; the possible impacts and plausibility of policy ideas;
  3. evidence: scientific and other evidence to support the visions and policy ideas.

….
Connecting policy making to people: in an increasingly connected society, online outreach and engagement is an essential response to the growing demand for participation, helping to capture new ideas and to broaden the legitimacy of the policy making process (IP/10/1296). The Futurium is an early prototype of a more general policy-making model described in the paper “The Futurium—a Foresight Platform for Evidence-Based and Participatory Policymaking”.

The Futurium was developed to lay the groundwork for future policy proposals which could be considered by the European Parliament and the European Commission under their new mandates as of 2014. But the Futurium’s open, flexible architecture makes it easily adaptable to any policy-making context, where thinking ahead, stakeholder participation and scientific evidence are needed.”

The GovLab Academy: A Community and Platform for Learning and Teaching Governance Innovations


Press Release: “Today the Governance Lab (The GovLab) launches The GovLab Academy at the Open Government Partnership Annual Meeting in London.
Available at www.thegovlabacademy.org, the Academy is a free online community for those wanting to teach and learn how to solve public problems and improve lives using innovations in governance. A partnership between The GovLab at New York University and MIT Media Lab’s Online Learning Initiative, the site launching today offers curated videos, podcasts, readings and activities designed to enable the purpose-driven learner to deepen his or her practical knowledge at his or her own pace.
The GovLab Academy is funded by a grant from the John S. and James L. Knight Foundation. “The GovLab Academy addresses a growing need among policy makers at all levels – city, federal and global – to leverage advances in technology to govern differently,” says Carol Coletta, Vice President of Community and National Initiatives at the Knight Foundation.  “By connecting the latest technological innovations to a community of willing mentors, the Academy has the potential to catalyze more experimentation in a sector that badly needs it.”
Initial topics include using data to improve policymaking and cover the role of big data, urban analytics, smart disclosure and open data in governance. A second track focuses on online engagement and includes practical strategies for using crowdsourcing to solicit ideas, organize distributed work and gather data.  The site features both curated content drawn from a variety of sources and original interviews with innovators from government, civil society, the tech industry, the arts and academia talking about their work around the world implementing innovations in practice, what worked and what didn’t, to improve real people’s lives.
Beth Noveck, Founder and Director of The GovLab, describes its mission: “The Academy is an experiment in peer production where every teacher is a learner and every learner a teacher. Consistent with The GovLab’s commitment to measuring what works, we want to measure our success by the people contributing as well as consuming content. We invite everyone with ideas, stories, insights and practical wisdom to contribute to what we hope will be a thriving and diverse community for social change”.”

Big Data


Special Report on Big Data by Volta – A newsletter on Science, Technology and Society in Europe: “Locating crime spots, or the next outbreak of a contagious disease, Big Data promises benefits for society as well as business. But more means messier. Do policy-makers know how to use this scale of data-driven decision-making in an effective way for their citizens and ensure their privacy? 90% of the world’s data have been created in the last two years. Every minute, more than 100 million new emails are created, 72 hours of new video are uploaded to YouTube and Google processes more than 2 million searches. Nowadays, almost everyone walks around with a small computer in their pocket, uses the internet on a daily basis and shares photos and information with their friends, family and networks. The digital exhaust we leave behind every day contributes to an enormous amount of data produced, and at the same time leaves electronic traces that contain a great deal of personal information….
Until recently, traditional technology and analysis techniques have not been able to handle this quantity and type of data. But recent technological developments have enabled us to collect, store and process data in new ways. There seem to be no limitations, either to the volume of data or to the technology for storing and analyzing them. Big Data can map a driver’s sitting position to identify a car thief, it can use Google searches to predict outbreaks of the H1N1 flu virus, it can data-mine Twitter to predict the price of rice or use mobile phone top-ups to describe unemployment in Asia.
The word ‘data’ means ‘given’ in Latin. It commonly refers to a description of something that can be recorded and analyzed. While there is no clear definition of the concept of ‘Big Data’, it usually refers to the processing of huge amounts and new types of data that have not been possible with traditional tools.

The notion of Big Data is kind of misleading, argues Robindra Prabhu, a project manager at the Norwegian Board of Technology. “The new development is not necessarily that there are so much more data. It’s rather that data is available to us in a new way. The digitalization of society gives us access to both ‘traditional’, structured data – like the content of a database or register – and unstructured data, for example the content in a text, pictures and videos. Information designed to be read by humans is now also readable by machines. And this development makes a whole new world of data gathering and analysis available. Big Data is exciting not just because of the amount and variety of data out there, but because we can process data about so much more than before.”

Smart Citizens


FutureEverything: “This publication aims to shift the debate on the future of cities towards the central place of citizens, and of decentralised, open urban infrastructures. It provides a global perspective on how cities can create the policies, structures and tools to engender a more innovative and participatory society. The publication contains a series of 23 short essays representing some of the key voices developing an emerging discourse around Smart Citizens.  Contributors include:

  • Dan Hill, Smart Citizens pioneer and CEO of communications research centre and transdisciplinary studio Fabrica on why Smart Citizens Make Smart Cities.
  • Anthony Townsend, urban planner, forecaster and author of Smart Cities: Big Data, Civic Hackers, and the Quest for a New Utopia on the tensions between place-making and city-making and on the role of mobile technologies in changing the way that people interact with their surroundings.
  • Paul Maltby, Director of the Government Innovation Group and of the Open Data and Transparency in the UK Cabinet Office on how government can support a smarter society.
  • Aditya Dev Sood, Founder and CEO of the Center for Knowledge Societies, presents polarised hypothetical futures for India in 2025 that argue for the use of technology to bridge gaps in social inequality.
  • Adam Greenfield, New York City-based writer and urbanist, on Recuperating the Smart City.

Editors: Drew Hemment, Anthony Townsend