Home / Posts / Content Moderation Insights: The Problem of Hate Speech in Social Media

Content Moderation Insights: The Problem of Hate Speech in Social Media

Published On:

January 26, 2024

|

Tags: Content Moderation

|

11 min read

Elevate your operations with our expert global solutions!

Customer Experience

Trust & Safety

Introduction

Peak season in customer support presents significant business challenges due to high consumer activity, frequent interactions, and a surge in service requests. Holidays, promotions, important events, and fluctuations in demand drive this phenomenon, placing considerable pressure on entire organisations to manage the chaotic influx of work, maintain quality levels, and ensure operational stability. To effectively navigate peak seasons, companies must focus on several key strategies, including correct forecasting, appropriate resource allocation and scalability of operations.

Pictures of young people exposed at online hate and harassment

Prevalence of Hate Speech in Online Platforms

In the wake of increasing digitalisation touching various aspects of our lives, many online services are spectacularly expanding and attracting new users who seek to be a part of this exciting society, be informed, and share thoughts and emotions with others. In this case, social media has naturally emerged as a central hub for interaction, breaking through geographical, cultural and communication barriers. Their strength lies in enabling users to be noticed, listened to, and appreciated in a manner often less inhibited by fear and shame than traditional offline dialogue, serving as a powerful tool for self-expression and global outreach. Unfortunately, diverse individuals or organisations have joined the game beyond the socialisers, engaging in unfair practices for personal gain, ideological motives, or malicious intent. All in all, incidents of hate speech related to social media are reported globally, affecting nearly every continent, various societies, and demographics.

Below are a few selected statistics from the Anti-Defamation League (ADL) study titled “Online Hate and Harassment: The American Experience 2023”. According to this document, online threats remain persistent and deep-rooted problems on social media platforms, as presented:

52% of the ADL’s survey respondents reported experiencing online harassment or hate, indicating a 12% increase compared to the previous year.

51% of teens ages 13-17 also experienced a sharp rise in online harassment in the past twelve months, compared to 36% in 2022.

Among those who faced harassment, 54% said it happened on Facebook. Twitter increased to 27% from 21% in 2022, and Reddit rose to 15% from 5% in 2022.

The Complexity in Defining Hate Speech

Here, it is worth acknowledging the crucial role and sophisticated shape of hate speech in the risk landscape to understand better how this painful phenomenon has transformed, bringing new challenges, complexities, and dimensions. Over time, the threat has evolved from historical tools of marginalisation and discrimination, adapting to new forms with the rise of communication technologies.

It is usually subjective and context-dependent in the contemporary era, making it difficult to create universal definitions encompassing all instances. In the digital space, hate speech typically involves inappropriate language or expressions, manifesting in various forms of communication and behaviour, including actions, symbols, images, and gestures that can be pervasive across social media posts, comments, articles, movies, or messages. It often targets subjects depending on prevailing societal issues and cultural aspects, encompassing race, ethnicity, religion, gender, sexual orientation, nationality, disability, political affiliation, social identity, immigration status, and health status.

Below are some selected examples illustrating the diverse forms hate speech can take and showcasing the harmful impact it can have:

Icons illustrating the types of hate speech

Racial and Ethnic Insults: Posting derogatory comments or memes targeting specific racial or ethnic groups.
Homophobic Remarks: Sharing content containing offensive language or jokes directed at the LGBTQ+ community.
Gender-based Harassment: Engaging in online behaviour that discriminates against individuals based on their gender.
Religious Discrimination: Posting content that ridicules or insults people based on their beliefs or practices.
Disability-based Intolerance: Sharing memes or comments that mock individuals with disabilities.

In addition, it must be emphasised that online hate speech is especially harmful to children and teenagers who have grown up with the advancement of social media and digital communication. Exposure to the threat can significantly impact their mental health, self-esteem, and overall wellness, posing significant social, psychological, and emotional challenges. It can also sometimes culminate in more severe outcomes such as breakdown or suicide attempts in the worst-case scenario. This is due to the minors’ stage of development, susceptibility to peer influence, the quest for social validation, and the potential lack of digital literacy skills.

Content Moderation: A Need to Go Beyond the Standard

Referring to another ADL report, “Block/Filter/Notify: Support for Targets of Online Hate Report Card” from mid-2023, popular social media services are not yet effectively supporting individuals who are targets of online hate and harassment, despite their stated intentions to ensure user protection. The gap between the aspirations and the actual actions highlights the urgency of implementing more advanced content moderation strategies to address the persistent challenges of online hate. This involves adopting robust and cutting-edge solutions beyond basic security measures, enabling better threat detection, efficient prevention, nuanced understanding, and proactive intervention.

In social media, content moderation is the gatekeeper, diligently helping to monitor, evaluate, and steer user-generated content towards compliance with community guidelines, legal frameworks, and ethical standards. This pivotal process encompasses identifying and removing information and materials that run afoul of established norms, and hate speech is a relevant example of its violation. The responsibility of such an undertaking can be shouldered either by human moderators through manual review or by leveraging automated tools fueled by artificial intelligence and machine learning. The goal is maintaining a secure and inclusive digital space, carefully balancing free expression and platform policies. Among the most popular moderation techniques are:

Visual presentation of the moderation types

Proactive Manual Review: In this case, content is reviewed by humans before publication, preventing the appearance of harmful materials.
Reactive Manual Review: Measures are taken after publication when content undergoes manual review supported by real-time tools, user reporting, and community guidelines.
Advanced Tools and Filters: This involves detecting and handling specific content using advanced tools and filters.
Real-time Automated Screening: The approach entails automated screening with human involvement to address complex issues, offering flexibility for diverse content analysis.
Hybrid Moderation: This approach combines various methods to ensure the highest efficiency, accuracy, and adaptability.
User-Driven Moderation: Considering this, visitors or communities are engaged to participate by actively reporting or flagging inappropriate content. Rating and voting systems are also employed to elevate highly rated content and conceal or remove low-rated material.

However, it is vital to underline that efficient content moderation must also align with the brand’s values, platform characteristics, and user profile to maximise the outcomes. The same content can have different meanings on diverse platforms based on context and guideline specificity. The specificity must be considered highly and critically, necessitating the need to tailor moderation approaches carefully and strategically.

Let’s delve deeper into this scenario: a meme containing potentially offensive language would likely be flagged as inappropriate on a general social media platform catering to a broad audience. In this context, the material may be subject to removal or warnings, aligning with specific regulations and community standards. Contrastingly, the perception could differ if the same message were shared on a website dedicated to dark humour or satire, where users anticipate and engage with more provocative materials. Community guidelines on this specialised platform might permit a broader range of expression, acknowledging the specific nature of the audience and its expectations.

Technological Challenges in Automatic Detection

Addressing technological challenges involves ongoing research, development, and collaboration between experts in artificial intelligence, machine learning, and content moderation. It requires refining algorithms, understanding cultural nuances, and staying vigilant against emerging threats to create a hate-speech-free environment.

AI plays a crucial role in this effort by automating processes, improving scalability, and providing real-time monitoring capabilities beyond human capacity. It excels at handling large volumes of content, making well-informed routine decisions, managing repetitive tasks swiftly, and accurately identifying and blocking inappropriate content, leading to timesaving and the risk of oversight reduction. This type of moderation relies on machine learning models trained on platform-specific data to recognise undesirable materials quickly and precisely. However, the effectiveness of AI depends on the availability of high-quality datasets for proper model training.

Therefore, allocating human agents in many situations where AI cannot address issues efficiently is also invaluable. Moderators bring empathy, sensitivity, a deep understanding of various details, and the ability to interpret highly delicate matters. They can handle content that needs to conform to predefined rules neatly. Despite being time-consuming, resource-intensive, and susceptible to human error, human moderation remains essential for addressing complex and context-specific content.

Additionally, using sentiment analysis helps gauge the emotional tone of user-generated content. Positive sentiment may indicate constructive discussions, while negative sentiment may signal potential issues or harmful content. This analysis is especially helpful in prioritising and addressing content that poses a higher risk, allowing platforms to manage and mitigate potential concerns swiftly and effectively proactively. Keyword strategy can also help establish a targeted approach to identify and handle specific terms associated with hate speech or inappropriate content. Through a well-defined set of keywords, platforms can efficiently and proactively detect and act upon content that violates guidelines.

Ethical and Societal Considerations

Ethical and societal considerations regarding hate speech mitigation in social media encompass a range of principles and concerns that focus on the impact, responsibilities, and values associated with online content. Some key issues include ethical, legal, cultural, and societal dimensions, for example:

Freedom of Speech vs. Harm Avoidance: Balancing the right to freedom of speech with the need to prevent harm and protect individuals or groups from hate speech is paramount. Social media constantly grapples with the challenge of fostering open dialogue while mitigating the potential adverse effects of harmful content. Therefore, it is imperative to establish transparent, ethical guidelines and policies that carefully delineate the boundary between free expression and harm prevention. Indicating what is deemed acceptable and what is forbidden, these rules should be easily accessible to the audience, ensuring a shared understanding and adherence to community standards.
Cultural Sensitivity: It is pivotal to recognise and respect norms, values, traditions, language nuances, varied comic sensibilities, beliefs, and societal settings when moderating content. Such activity is necessary to avoid cultural insensitivity, misunderstanding, or offence. It can be supported by engaging skilled and knowledgeable human agents who understand subtle nuances that automated systems might overlook or misinterpret, potentially leading to inappropriate content being allowed or deleted unnecessarily.
Talent Management & Well-being: Successful moderation also relies on a skilled and resilient workforce characterised by language proficiency, emotional intelligence, cultural awareness, and the ability to handle pressure. This underscores the importance of talent management practices that prioritise the mental health and professional development of each employee, especially given their daily exposure to hate speech and potentially distressing content.
Legal Compliance: Adhering to relevant laws and regulations about hate speech and discrimination in different jurisdictions is key for social media. This is because numerous new regulations have been introduced across regions in recent years, shaping the digital landscape and introducing new responsibilities to the providers, for instance:

Under the EU Digital Services Act (DSA), social media companies must regularly share reports revealing details such as the number of moderators, error rates, and their responsiveness to requests from EU member states. These reports offer a comprehensive view of the resources allocated by platforms like Facebook, Snapchat, TikTok, and others for handling illegal, hateful, or fraudulent content, along with general service statistics like user numbers. Starting in early November 2023, these companies began submitting these reports every six months as mandated by the DSA. (Source: Le Monde)

Summary

As indicated above, it is undeniable that hate speech is a concerning issue in social media, demanding the implementation of groundbreaking content moderation measures, balancing human expertise and technology, growing a customised approach, and swift action to curb its impact. However, a collective responsibility extends beyond online platforms, burdening legislators, non-profit organisations, parents, educators, and society to promote a safer digital environment in the range and scope of their influence, advocacy, and educational initiatives.

TikTok Use Case:

On its website, TikTok outlines its content moderation strategy and commitment to creating a secure and inclusive space for its expanding global community of creators. The company states that, with a team of over 40,000 safety professionals, TikTok employs a mix of guidelines, automated moderation technology, and human moderators.
(Source: TikTok.com)

X Use Case:

X’s guidelines are crafted to safeguard user safety and well-being. They guarantee that everyone can engage in public conversation securely, adding value to a lively and inclusive global discourse. The rules also explicitly prohibit violence, harassment, and similar behaviours, intending to prevent actions that hinder individuals from freely expressing themselves.
(Source: X.com)

Facebook Use Case:

Facebook relies on AI for content moderation, quickly detecting and removing violating content. Human review teams worldwide handle more complex cases, balancing AI efficiency and human decision-making for a safe environment.
(Source: Facebook.com)

A picture of a woman experiencing online hate and harassment

Elevate your operations with our expert global solutions!

Customer Experience

Trust & Safety

FAQ Section

1. Why is hate speech a significant challenge on social media?

Hate speech undermines online safety, fuels discrimination, and impacts the mental health of users and communities. Its pervasive nature on social platforms creates challenges for maintaining inclusive environments and protecting brand reputations, making it a critical issue for digital services to address.

2. How does content moderation tackle hate speech?

Content moderation employs strategies such as proactive and reactive manual reviews, AI-driven detection tools, and hybrid approaches combining technology and human expertise. These methods ensure compliance with community guidelines and promote respectful interactions while addressing harmful content.

3. What are the challenges in moderating hate speech online?

Moderating hate speech is complex due to cultural nuances, subjective interpretations, and rapid content evolution. Automated tools can struggle with context, requiring human moderators to address delicate cases, especially those involving sarcasm, slang, or cultural differences.

4. How can social media platforms enhance hate speech moderation?

Platforms can improve by leveraging advanced AI for real-time detection, incorporating sentiment analysis, and fostering user-driven moderation. Regular training for moderators, transparent guidelines, and tailored approaches that respect cultural and platform-specific contexts are essential for effective moderation.

5. What ethical considerations arise in moderating hate speech?

Balancing freedom of speech with harm prevention is a central ethical dilemma. Platforms must create clear, transparent policies to navigate this balance, respect cultural diversity, and adhere to legal requirements such as the EU Digital Services Act, ensuring a safe yet open online environment.

Might also interest you

link to article

Data Labelling and Annotation: The Human Touch Behind Smarter AI

Introduction Artificial Intelligence is expanding continuously and rapidly into more aspects of our lives, showing no signs of slowing down. While it offers remarkable opportunities for improvement and efficiency, it…

Read here

link to article

Trust and Safety in Transition: Trends, Challenges, and Future Innovations

Introduction Trust and Safety (T&S) is vital to the virtual space, safeguarding platforms’ integrity while creating a secure and inclusive user experience. However, as the environment evolves rapidly, T&S strategies…

Read here

link to article

A Small Island, Big Opportunities: Presenting Taiwan in the BPO World

Introduction Positioned in East Asia between mainland China and the Pacific Ocean, Taiwan is emerging as a leading BPO destination. Despite its compact size, the country offers a compelling blend…

Read here

link to article

Customer Experience in 2025: Key Trends and Insights

Introduction As 2025 approaches, the customer experience landscape is set for another wave of transformation, driven by emerging technologies—particularly the growing influence of artificial intelligence and automation—alongside dynamic behavioural shifts. We…

Read here

link to article

Greece in the Spotlight: Uncover a Fresh Frontier for Global Outsourcing

Introduction Greece is emerging as a promising BPO destination in Southern Europe. With a range of compelling advantages, this unique Mediterranean country is becoming an increasingly attractive choice for global…

Read here

link to article

BPO Collaboration Working Models: Insights and Review

Introduction In today’s fast-paced and competitive business world, how BPOs and brands work together is crucial. The established partnership directly influences the efficiency and impact of outsourcing services, shaping the…

Read here

Content Moderation Insights: The Problem of Hate Speech in Social Media

At a Glance

Table of Content

Introduction

Prevalence of Hate Speech in Online Platforms

The Complexity in Defining Hate Speech

Content Moderation: A Need to Go Beyond the Standard

Technological Challenges in Automatic Detection

Ethical and Societal Considerations

Summary

FAQ Section

1. Why is hate speech a significant challenge on social media?

2. How does content moderation tackle hate speech?

3. What are the challenges in moderating hate speech online?

4. How can social media platforms enhance hate speech moderation?

5. What ethical considerations arise in moderating hate speech?

Data Labelling and Annotation: The Human Touch Behind Smarter AI

Trust and Safety in Transition: Trends, Challenges, and Future Innovations

A Small Island, Big Opportunities: Presenting Taiwan in the BPO World

Customer Experience in 2025: Key Trends and Insights

Greece in the Spotlight: Uncover a Fresh Frontier for Global Outsourcing

BPO Collaboration Working Models: Insights and Review