Cross-posted on: http://theconversation.com/how-social-media-data-can-improve-peoples-lives-if-used-responsibly-75367
By: Stefaan G. Verhulst
In January 2015, heavy rains triggered unprecedented floods in Malawi. Over the next five weeks, the floods displaced more than 230,000 people and damaged over 64,000 hectares of land.
Almost half the country was labelled a “disaster zone” by Malawi’s government. And as the humanitarian crisis unfolded, relief agencies, such as the Red Cross were faced with the daunting task of allocating aid and resources to places that were virtually unrecorded by the country’s mapping data, and thus rendered almost invisible.
Humanitarian workers struggled to navigate in many of the most affected areas, and one result was that aid did not necessarily reach those most in need.
To prevent similar knowledge gaps in the future, researchers, volunteers and humanitarian workers in Malawi and elsewhere, have turned to an unlikely partner: Facebook.
In 2016, as part of its “Missing Maps” project, the Red Cross accessed Facebook’s rich population density data to find and map people who were critically vulnerable to natural disasters and health emergencies, but remained unrecorded in existing maps.
During local Mapping Parties, volunteers in Malawi used Facebook’s satellite and population data, in addition to other satellite imagery, to trace roads, houses, and water points across Malawi’s communities.
Two years later, Missing Maps in collaboration with Facebook has identified more than 2,000,000 people in Malawi, allowing aid and relief organisations to better plan projects in Malawi’s disaster prone areas.
Disasters kill nearly 100,000 and affect or displace 200 million people annually. As climate change is expected to increase the frequency and severity of disasters in the near future, leveraging social media data, crowd-sourcing and other means will only become more important.
The potential of data collaborativesThe Malawi partnership is just one manifestation of the concept of data collaboratives. We have defined this as a new form of collaboration beyond the public-private partnership model, in which participants from different sectors — including private companies, research institutions, and government agencies — can exchange data to help solve public problems.
While such collaboratives are emerging in a number of sectors and areas, the Malawi case is an example of a particular kind of collaborative. It’s what we might call a social media data collaborative.
While much attention has been paid to the impact of social media on politics, much value can be generated from social media data for governing as well, but only when done responsibly.
Users of social media are today disclosing and sharing an unprecedented amount of data. Facebook alone collects 98 unique personal data points from its users, and Twitter processes about 6,000 tweets every second.
With an estimated 2.51 billion social media users across the world, a staggering amount of information is being gleaned about individuals and their interactions from social networking platforms.
There is little doubt that much of the data stored by social media companies could, if made available in a responsible manner, provide groups working for the public interest with new insights and avenues for action. Unfortunately, at present such groups have only limited access to data, and their data science expertise remains similarly limited.
Data collaboratives like the Missing Maps project represent a new, contemporary model of corporate social responsibility.
For instance, LinkedIn has established the Economic Graph Research initiative to leverage their data together with a range of third-party researchers to create collective insights for increasing the “economic opportunity for every member of the global workforce.” This reflects a growing willingness among companies to provide access to their data to pursue social responsibility goals.
Deploying such models, companies such as Facebook, Twitter and Reddit are no longer simply silent merchants of our personal data. They can use it to serve the public good in a variety of ways. They include:
Risks of data collaboratives
Source: The GovLab.
At any point in the data life cycle, there are inherent risks – from the unauthorised collection of social media information to misrepresenting data through poor analysis and the possible re-identification of individuals once data has been shared.
Such risks are real and ought not to be used as a reason to avoid sharing social media data. Rather, they highlight the need to develop and integrate a data responsibility framework into any data collaborative initiative.
Molly Jackman and Lauri Kanerva from Facebook have argued that when using social media for other purposes:
companies should develop principles and practices around research that are appropriate to the environments in which they operate, taking into account the values set out in law and ethics.
The concept of data responsibility has recently gained traction within a number of industries and sectors, including the social media industry. These latter can create and operationalise responsibility frameworks by employing data stewards – people tasked with determining what and when to share, how to protect, and how to act on available data.
A number of social media organisations have already established separate departments to administer data-sharing projects. Facebook’s public policy division, for example, has a review process that focuses on data stewardship.
Other organisations depend on separate, and sometimes independent, intermediaries, such as MIT’s Laboratory for Social Machines, which was founded by Twitter’s chief media scientist Deb Roy.
Social Machines regularly uses social media data, particularly from Twitter, to support its research and analysis. But, by maintaining its independence and aligning itself with an academic institution, it is able to establish strict guidelines to maintain the ethical rigour of its work.
All of these initiatives are promising, but it is not yet clear that they add up to a comprehensive data responsibility framework or decision tree enabling new ways of working. Such a framework could provide data stewards the means to assess the public value of social media data as well as the risks and harms of sharing it. It could also suggest ways to adequately mitigate this risk.
What’s more, it might help achieve the necessary balance between the benefits and risks of sharing, and ensure that the vast amounts of data being generated by the public every second are ultimately used for the greater good.
More specifically, a generally accepted responsibility framework can help accelerate the emergence of new, innovative data collaboratives, and maximise their potential.
Let’s speed up the work initated by bodies such as UN-OCHA, Global Pulse, the International Data Responsibility Group and others, toward building a data responsiblity framework to ensure social media data improves people’s lives in a trusted manner.
The author would like to thank Andrew Young, Knowledge Director at The GovLab, and Prianka Srinivasan, Research Assistance, for their research support in writing this article.
Josje Spierings is head of the Secretariat of the International Data Responsibility Group, a collaboration between the Data & Society Research Institute, Data-Pop Alliance, the GovLab at NYU, UN Global Pulse, Signal Program - Harvard Humanitarian Initiative - Harvard University and Leiden University.