marhta

Fast string similarity algorithms

This library is still in development and may not be stable.

Installation

pip install marhta

Usage

from marhta import levenshtein_similarity, jaro_winkler_similarity

# Calculate string similarities
print(levenshtein_similarity("hello", "helo"))  # 0.8
print(jaro_winkler_similarity("martha", "marhta"))  # 0.961

# Find best matches
from marhta import levenshtein_match
strings = ["apple", "banana", "orange", "pear"]
matches = levenshtein_match("aple", strings)
print(matches)  # [("apple", 0.8), ("pear", 0.5)]

Features

Levenshtein distance and similarity measures
Jaro-Winkler distance and similarity measures
Fuzzy string matching with customizable thresholds

Performance

Written in Rust for improved performance, while maintaining a Pythonic API.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.github/workflows		.github/workflows
src		src
tests		tests
.bumpversion.cfg		.bumpversion.cfg
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
marhta.pyi		marhta.pyi
py.typed		py.typed
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

marhta

Installation

Usage

Features

Performance

About

Releases

Packages

Languages

License

pjwerneck/marhta

Folders and files

Latest commit

History

Repository files navigation

marhta

Installation

Usage

Features

Performance

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages