Rules Management

Overview

Rules Management allows you to define and manage moderation rules that automatically detect and handle inappropriate content. Configure rules once in the Dashboard, and they’re automatically applied to all messages sent through CometChat.

Using UI Kit or SDK? Once you configure rules in the Dashboard, they are automatically applied to all messages. No additional code is required - the UI Kit and SDK handle moderation seamlessly.

Quick Start

Enable moderation in under 2 minutes:

Open Rules Settings

Enable Default Rules

Toggle on the rules you need (e.g., Profanity Filter, AI Image Moderation)

Test It

Send a test message that violates the rule - it should be blocked automatically

Available Rules Summary

Rule	Description	Use Case
Profanity Filter	Blocks offensive language using keyword matching	General chat apps
Contact Details Filter	Blocks phone numbers	Marketplaces, dating apps
Email Filter	Blocks email addresses	Prevent off-platform contact
AI Toxicity Detection	AI-powered detection of toxic language	Community platforms
AI Spam Detection	Detects spam messages	High-traffic apps
AI Scam Detection	Identifies fraudulent messages	Marketplaces, finance apps
AI Platform Circumvention	Detects attempts to bypass rules	All apps

Rule	Description	Use Case
AI Image Moderation	Blocks explicit/unsafe images	All apps with image sharing
OpenAI Explicit Content	Detects nudity and sexual content	Social apps, dating
OpenAI Violence/Gore	Blocks graphic violence	Family-friendly apps
OpenAI Hate Symbols	Detects hate imagery	Community platforms
OpenAI Privacy Data	Blocks images with personal data	Finance, healthcare
Malware & Virus Scanner	Scans image files for malware and viruses	Enterprise, healthcare, compliance apps

Rule	Description	Use Case
AI Video Moderation	Blocks explicit/unsafe video content	Apps with video sharing
Malware & Virus Scanner	Scans video files for malware and viruses	Enterprise, healthcare, compliance apps

Rule	Description	Use Case
Malware & Virus Scanner	Scans audio files for malware and viruses	Enterprise, healthcare, compliance apps

Rule	Description	Use Case
Malware & Virus Scanner	Scans all file attachments (documents, archives, etc.) for malware and viruses	Enterprise, healthcare, compliance apps

Rule	Type	Description
Hate & Harassment	Text/Image	Detects hateful or harassing content
Explicit Content	Text/Image	Blocks sexual or graphic content
Privacy & Sensitive Info	Text/Image	Protects personal data
Spam & Scam	Text	Identifies fraudulent schemes
Violence & Terrorism	Text/Image	Blocks violent extremism
Self-Harm Content	Text/Image	Detects self-harm references
Impersonation & Fraud	Text	Prevents identity fraud
Minor Safety	Image	Protects minors from exploitation

Recommended Rules by Use Case

Marketplace/Classifieds

Essential:

Contact Details Filter
Email Filter
AI Scam Detection
AI Platform Circumvention

Recommended:

Profanity Filter
AI Image Moderation

Dating App

Essential:

Contact Details Filter
Email Filter
AI Image Moderation
OpenAI Explicit Content

Recommended:

AI Scam Detection
OpenAI Privacy Data (Image)

Gaming/Entertainment

Essential:

Profanity Filter
AI Toxicity Detection
AI Spam Detection

Recommended:

OpenAI Hate & Harassment
AI Image Moderation

Healthcare/Finance

Essential:

OpenAI Privacy & Sensitive Info
OpenAI Privacy Data (Image)
AI Scam Detection
Malware & Virus Scanner

Recommended:

Profanity Filter
Contact Details Filter

Enterprise/Compliance

Essential:

Malware & Virus Scanner
OpenAI Privacy & Sensitive Info
AI Scam Detection

Recommended:

Profanity Filter
AI Image Moderation

Best Practices

Practice	Description
Start with Defaults	Enable default rules first, then customize based on your needs. Don’t try to configure everything at once.
Test Before Launch	Test rules in a staging environment. Send test messages to verify rules work as expected.
Monitor Regularly	Check the Blocked Messages dashboard regularly to catch false positives.
Iterate & Refine	Review Rule Revisions to track changes and refine rules over time.

Tips for Effective Moderation

Layer your rules - Use multiple rules together (e.g., Profanity Filter + AI Toxicity) for better coverage
Adjust confidence levels - Lower confidence = more aggressive blocking, higher = fewer false positives
Use Flag action for borderline content - Instead of blocking, flag messages for manual review
Create custom keyword lists - Add industry-specific terms to the Lists Management

Default Rules

Default rules are pre-configured and ready to use. Simply toggle them on in the Dashboard.

Text Moderation
Image Moderation
Video Moderation
Audio Moderation
File Moderation

Profanity Filter

Automatically detects and blocks messages containing offensive language, profanity, or derogatory remarks using a predefined list of offensive keywords.Example: Before enabling, profane messages are delivered (double ticks). After enabling, they’re blocked (single tick).

Contact Details Filter

Detects and blocks messages containing phone numbers to prevent sharing of private contact information.

Email Filter

Detects and blocks messages containing email addresses to prevent off-platform communication.

AI Message Toxicity

AI-powered detection of toxic, harmful, or inappropriate language including threats, harassment, and hate speech. Supports multiple languages.

AI Platform Circumvention

Detects attempts by users to bypass platform rules using sentence similarity analysis.

AI Scam Detection

Identifies scam-related messages including phishing attempts, fake offers, and fraudulent schemes.

AI Spam Detection

Detects and filters spam messages by analyzing content patterns and identifying unwanted communications.

OpenAI Text Rules

OpenAI: Hate and Harassment (All Languages)

Detects hateful or harassing language toward individuals or groups using OpenAI moderation.

OpenAI: Privacy and Sensitive Info (All Languages)

Detects messages sharing personal or sensitive information without consent.

OpenAI: Explicit or Inappropriate Content (All Languages)

Identifies explicit sexual descriptions, graphic violence, or unsuitable text.

OpenAI: Spam and Scam (All Languages)

Detects spam messages, phishing attempts, and fraudulent schemes.

OpenAI: Violent or Terroristic Threats (All Languages)

Identifies content promoting violence or extremism.

OpenAI: Non-Consensual Sexual Content (All Languages)

Detects sexual exploitation, grooming, or non-consensual content.

OpenAI: Impersonation or Fraud (All Languages)

Identifies deceptive attempts to impersonate individuals or organizations.

OpenAI: Self-Harm or Suicidal Content (All Languages)

Detects messages suggesting self-harm or suicidal thoughts.

Rule Configuration

Filters

Filters narrow down which messages are checked by a rule based on sender or receiver attributes:

Filter Type	Properties
Sender	UID, Role, Name, Tags, Created date
Receiver	Name, GUID, Tags, Group type, Created date, Type (user/group)

Conditions

Conditions define what content triggers the rule:

Content Type	Available Conditions
Text/Custom	Keyword list, Word patterns, Toxicity, Sentiment, Sentence similarity
Image/Video	Violence, Gambling, Alcohol, Drugs, Nudity, Hate symbols, Unsafe content

You can set a confidence percentage for AI-based conditions to control sensitivity.

Actions

Actions determine what happens when content matches the conditions:

Action	Description
Block	Message is not delivered (default)
Flag	Message is delivered but flagged for review
Ban User	User is banned from the group
Kick User	User is removed from the group
Block User	User is blocked platform-wide

Managing Rules

Create Rule

Click Add in the Rules tab
Configure:
- Name: Descriptive name for the rule
- Rule ID: Unique identifier
- Description: Purpose of the rule
- Filter: Who the rule applies to
- Condition: What triggers the rule
- Action: What happens when triggered
Click Save
Enable the rule to start moderating

Regex Pattern Restrictions: When using Word patterns (regex) in conditions, avoid greedy patterns like .*, .+, or unbounded quantifiers (\d+, \w+). Use bounded patterns instead (e.g., \d{3,10} instead of \d+) to prevent performance issues. See Lists Management for more details.

List Rules

All configured rules are displayed in the Rules tab with their name, status, and actions.

Update Rule

Click Edit in the action menu
Modify the rule settings
Click Save

Delete Rule

Click Delete in the action menu and confirm.

Rule Revisions

Track the history of changes made to a rule:

Click View in the action menu
Navigate to Rule History

Lists Management

Create custom keyword lists for your rules

Blocked Messages

View and manage blocked content

Flagged Messages

Review flagged content

SDK Integration

Handle moderation in custom UI

Setup

Message Management

Integrations

Reference

Overview

Quick Start

Available Rules Summary

Recommended Rules by Use Case