I’ve created a word-blacklist system that works decently, I remove all spaces from the message content, then use regex to match for blacklisted words, and if there’s a match, delete the message.
const msgContent = message.content.replace(/s/g, '');
let foundBlacklist = false;
for (const word of data.Words) {
const regex = new RegExp(`${word}`, 'gi');
if (regex.test(msgContent)) {
foundBlacklist = true;
}
}
if(foundBlacklist) message.delete().catch(err => console.log(`There was an error trying to delete that message: ${err}`))
The problem is that it’s very easy to use markdown to bypass this, if bad
is a blacklisted word, simply doing b*a*d
to to make it italic, would make the message content b*a*d
, and the regex won’t match. This of course applies to underline, strikethrough … etc, where b__ad__
or **b**ad
etc won’t match either.
How could this be prevented?