bad-words-next

JavaScript/TypeScript filter and checker for bad words aka profanity.

API documentation in GitHub Wiki.

NOTE: we don't supply with raw json data files in version >= 3.0.0. See the example below to update paths in your import or require statements.

Install

yarn add bad-words-next

or

npm install bad-words-next

Basic usage

const BadWordsNext = require('bad-words-next')
const en = require('bad-words-next/lib/en')

const badwords = new BadWordsNext({ data: en })

// Returns true when passed string contains a bad word
console.log(badwords.check('S0me sh!++ is here'))
// will print `true`

// Returns filtered string with masked bad words
console.log(badwords.filter('S0me sh!++ is here'))
// will print `S0me *** is here`

// Returns filtered string and catches bad words
badwords.filter('S0me sh!++ is here', badword => {
  console.log(badword)
})
// will print `sh!++`

// Use exclusions
const badwords = new BadWordsNext({ data: en, exclusions: ['sh+it+' /*works with lookalikes or the actual words*/] })
console.log(badwords.filter('S0me sh!++ is here'))
// will keep the `sh!++` word

Add more dictionaries

const BadWordsNext = require('bad-words-next')

const en = require('bad-words-next/lib/en')
const es = require('bad-words-next/lib/es')
const fr = require('bad-words-next/lib/fr')
const de = require('bad-words-next/lib/de')
const ru = require('bad-words-next/lib/ru')
const rl = require('bad-words-next/lib/ru_lat')
const ua = require('bad-words-next/lib/ua')
const pl = require('bad-words-next/lib/pl')
const ch = require('bad-words-next/lib/ch')

const badwords = new BadWordsNext()
badwords.add(en)
badwords.add(es)
badwords.add(fr)
badwords.add(de)
badwords.add(ru)
badwords.add(rl)
badwords.add(ua)
badwords.add(pl)
badwords.add(ch)

Dictionary data format

interface Data {
  id: string  // Unique dictionary ID
  words: string[] // Words list
  lookalike: Lookalike // Lookalike homoglyphs map
}

type Lookalike = Record<string | number, string> // Simple key-value object

You can use the following pattern characters in a word string:

* indicates any characters, use it only on start and/or end of a word
+ indicates one or more repeating characters
_ indicates special characters

Here is an example of a typical data object:

{
  "id": "en",
  "words": [
    "any",      // just a word
    "ba+d*",    // word `bad` with repeating `a` and anything after `d`
    "*words*",  // word `words` with anything at start and end of it
    "are_here"  // word `are_here` with pseudo space chars between `r` and `h`
  ],
  "lookalike": {
    "@": "a",
    "1": "i"
  }
}

Options

interface Options {
  data?: Data // Dictionary data
  placeholder?: string // Filter placeholder - default '***'
  placeholderMode?: 'repeat' | 'replace' // Placeholder mode to either replace with or repeat the placeholder - default 'replace'
  specialChars?: RegExp // Special chars to allow on start and/or end of a word - default /\d|[!@#$%^&*()[\];:'",.?\-_=+~`|]|a|(?:the)|(?:el)|(?:la)/
  spaceChars?: string[] // Pseudo space chars, a list of values for `_` symbol in a dictionary word string - default ['', '.', '-', ';', '|']
  confusables?: string[] // List of ids to apply transformations from `confusables` npm package - default ['en', 'es', 'de', 'ru_lat']
  maxCacheSize?: number // Max items to store in cache - default 100
  exclusions?: string[] // The list of exclusions
}

See Options API for more details.

Notes

Dictionary words with spaces won't work b/c they do not represent a single word
Dictionaries have to be improved over time

Name		Name	Last commit message	Last commit date
Latest commit History 207 Commits
.github		.github
.husky		.husky
benchmark		benchmark
src		src
test		test
.commitlintrc		.commitlintrc
.eslintrc		.eslintrc
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
generate.sh		generate.sh
package.json		package.json
rollup.config.js		rollup.config.js
tsconfig.build.json		tsconfig.build.json
tsconfig.json		tsconfig.json
typedoc.json		typedoc.json
yarn.lock		yarn.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

bad-words-next

Install

Basic usage

Add more dictionaries

Dictionary data format

Options

Notes

About

Releases 17

Packages

Contributors 5

Languages

License

alexzel/bad-words-next

Folders and files

Latest commit

History

Repository files navigation

bad-words-next

Install

Basic usage

Add more dictionaries

Dictionary data format

Options

Notes

About

Resources

License

Stars

Watchers

Forks

Releases 17

Packages 0

Contributors 5

Languages

Packages