A trove of documents from Facebook whistleblower Francis Haugen claimed in detail how staff complained to Facebook executives about how staff worried about the lack of policing on hate speech and the company’s collective failure to anticipate the January 6 riot.
Last week, the company said it was the victim of an incoming ‘attack’ by the media in what was largely considered an effort to get ahead of the papers’ release.
As the documents emerged on Monday, Haugen told British lawmakers that she is ‘extremely concerned’ about how Facebook ranks content based on ‘engagement’, saying it fuels hate speech and extremism, particularly in non-English-speaking countries.
Some of the most damning comments were posted on January 6, the day of the Capitol riot, when staff told Zuckerberg and other executives on an internal messaging board that they blamed themselves for the violence.
‘One of the darkest days in the history of democracy and self-governance. History will not judge us kindly,’ said one worker while another said: ‘We’ve been fueling this fire for a long time and we shouldn’t be surprised it’s now out of control’.
One of her complaints is how the company had been warned by staff for years that it was not doing enough to police hate speech.
One of the problems is its AI tools do not have the capability to appropriate pick out hateful commentary, and there aren’t enough staff with the language skills to do it manually.
The failures to block hate speech in volatile regions such as Myanmar, the Middle East, Ethiopia and Vietnam could contribute to real-world violence.
In a review posted to Facebook’s internal message board last year regarding ways the company identifies abuses, one employee reported ‘significant gaps’ in certain at-risk countries.
Facebook spokesperson Mavis Jones said in a statement that the company has native speakers worldwide reviewing content in more than 70 languages, as well as experts in humanitarian and human rights issues.
She said these teams are working to stop abuse on Facebook’s platform in places where there is a heightened risk of conflict and violence.
‘We know these challenges are real and we are proud of the work we’ve done to date,’ Jones said.
Still, the cache of internal Facebook documents offers detailed snapshots of how employees in recent years have sounded alarms about problems with the company’s tools – both human and technological – aimed at rooting out or blocking speech that violated its own standards.
The material expands upon Reuters’ previous reporting on Myanmar and other countries where the world’s largest social network has failed repeatedly to protect users from problems on its own platform and has struggled to monitor content across languages.
Among the weaknesses cited were a lack of screening algorithms for languages used in some of the countries Facebook has deemed most ‘at-risk’ for potential real-world harm and violence stemming from abuses on its site.
The company designates countries ‘at-risk’ based on variables including unrest, ethnic violence, the number of users and existing laws, two former staffers told Reuters.
The system aims to steer resources to places where abuses on its site could have the most severe impact, the people said.
Facebook reviews and prioritizes these countries every six months in line with United Nations guidelines aimed at helping companies prevent and remedy human rights abuses in their business operations, spokesperson Jones said.
In 2018, United Nations experts investigating a brutal campaign of killings and expulsions against Myanmar’s Rohingya Muslim minority said Facebook was widely used to spread hate speech toward them.
That prompted the company to increase its staffing in vulnerable countries, a former employee told Reuters.
Facebook has said it should have done more to prevent the platform being used to incite offline violence in the country.
Ashraf Zeitoon, Facebook’s former head of policy for the Middle East and North Africa, who left in 2017, said the company’s approach to global growth has been ‘colonial,’ focused on monetization without safety measures.
More than 90 per cent of Facebook’s monthly active users are outside the United States or Canada.
Facebook has long touted the importance of its artificial-intelligence (AI) systems, in combination with human review, as a way of tackling objectionable and dangerous content on its platforms.
Machine-learning systems can detect such content with varying levels of accuracy.
But languages spoken outside the United States, Canada and Europe have been a stumbling block for Facebook’s automated content moderation, the documents provided to the government by Haugen show.
The company lacks AI systems to detect abusive posts in a number of languages used on its platform.
In 2020, for example, the company did not have screening algorithms known as ‘classifiers’ to find misinformation in Burmese, the language of Myanmar, or hate speech in the Ethiopian languages of Oromo or Amharic, a document showed.
These gaps can allow abusive posts to proliferate in the countries where Facebook itself has determined the risk of real-world harm is high.
Reuters this month found posts in Amharic, one of Ethiopia’s most common languages, referring to different ethnic groups as the enemy and issuing them death threats.
A nearly year-long conflict in the country between the Ethiopian government and rebel forces in the Tigray region has killed thousands of people and displaced more than 2 million.
Facebook spokesperson Jones said the company now has proactive detection technology to detect hate speech in Oromo and Amharic and has hired more people with ‘language, country and topic expertise,’ including people who have worked in Myanmar and Ethiopia.
In an undated document, which a person familiar with the disclosures said was from 2021, Facebook employees also shared examples of ‘fear-mongering, anti-Muslim narratives’ spread on the site in India, including calls to oust the large minority Muslim population there.
‘Our lack of Hindi and Bengali classifiers means much of this content is never flagged or actioned,’ the document said.
Internal posts and comments by employees this year also noted the lack of classifiers in the Urdu and Pashto languages to screen problematic content posted by users in Pakistan, Iran and Afghanistan.
Jones said Facebook added hate speech classifiers for Hindi in 2018 and Bengali in 2020, and classifiers for violence and incitement in Hindi and Bengali this year. She said Facebook also now has hate speech classifiers in Urdu but not Pashto.
Facebook’s human review of posts, which is crucial for nuanced problems like hate speech, also has gaps across key languages, the documents show.
An undated document laid out how its content moderation operation struggled with Arabic-language dialects of multiple ‘at-risk’ countries, leaving it constantly ‘playing catch up.’
The document acknowledged that, even within its Arabic-speaking reviewers, ‘Yemeni, Libyan, Saudi Arabian (really all Gulf nations) are either missing or have very low representation.’
Facebook’s Jones acknowledged that Arabic language content moderation ‘presents an enormous set of challenges.’ She said Facebook has made investments in staff over the last two years but recognizes ‘we still have more work to do.’
Three former Facebook employees who worked for the company´s Asia Pacific and Middle East and North Africa offices in the past five years told Reuters they believed content moderation in their regions had not been a priority for Facebook management.
These people said leadership did not understand the issues and did not devote enough staff and resources.
Facebook’s Jones said the California company cracks down on abuse by users outside the United States with the same intensity applied domestically.
The company said it uses AI proactively to identify hate speech in more than 50 languages.
Facebook said it bases its decisions on where to deploy AI on the size of the market and an assessment of the country’s risks. It declined to say in how many countries it did not have functioning hate speech classifiers.
Facebook also says it has 15,000 content moderators reviewing material from its global users. ‘Adding more language expertise has been a key focus for us,’ Jones said.
In the past two years, it has hired people who can review content in Amharic, Oromo, Tigrinya, Somali, and Burmese, the company said, and this year added moderators in 12 new languages, including Haitian Creole.
Facebook declined to say whether it requires a minimum number of content moderators for any language offered on the platform.
Facebook’s users are a powerful resource to identify content that violates the company’s standards.
The company has built a system for them to do so, but has acknowledged that the process can be time consuming and expensive for users in countries without reliable internet access.
The reporting tool also has had bugs, design flaws and accessibility issues for some languages, according to the documents and digital rights activists who spoke with Reuters.
Next Billion Network, a group of tech civic society groups working mostly across Asia, the Middle East and Africa, said in recent years it had repeatedly flagged problems with the reporting system to Facebook management.
Those included a technical defect that kept Facebook’s content review system from being able to see objectionable text accompanying videos and photos in some posts reported by users.
That issue prevented serious violations, such as death threats in the text of these posts, from being properly assessed, the group and a former Facebook employee told Reuters. They said the issue was fixed in 2020.
Facebook said it continues to work to improve its reporting systems and takes feedback seriously.
Language coverage remains a problem. A Facebook presentation from January, included in the documents, concluded ‘there is a huge gap in the Hate Speech reporting process in local languages’ for users in Afghanistan.
The recent pullout of U.S. troops there after two decades has ignited an internal power struggle in the country. So-called ‘community standards’ – the rules that govern what users can post – are also not available in Afghanistan’s main languages of Pashto and Dari, the author of the presentation said.
A Reuters review this month found that community standards weren’t available in about half the more than 110 languages that Facebook supports with features such as menus and prompts.
Facebook said it aims to have these rules available in 59 languages by the end of the year, and in another 20 languages by the end of 2022.