-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathDESCRIPTION
34 lines (34 loc) · 1.14 KB
/
DESCRIPTION
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
Package: piecemaker
Title: Tools for Preparing Text for Tokenizers
Version: 1.0.2.9000
Authors@R: c(
person("Jon", "Harmon", , "[email protected]", role = c("aut", "cre"),
comment = c(ORCID = "0000-0003-4781-4346")),
person("Jonathan", "Bratt", , "[email protected]", role = "aut",
comment = c(ORCID = "0000-0003-2859-0076")),
person("Bedford Freeman & Worth Pub Grp LLC DBA Macmillan Learning", role = "cph")
)
Description: Tokenizers break text into pieces that are more usable by
machine learning models. Many tokenizers share some preparation steps.
This package provides those shared steps, along with a simple
tokenizer.
License: Apache License (>= 2)
URL: https://github.com/macmillancontentscience/piecemaker,
https://macmillancontentscience.github.io/piecemaker/
BugReports: https://github.com/macmillancontentscience/piecemaker/issues
Depends:
R (>= 2.10)
Imports:
cli,
glue,
rlang (>= 0.4.2),
stringi,
stringr
Suggests:
covr,
testthat (>= 3.0.0)
Config/testthat/edition: 3
Encoding: UTF-8
LazyData: true
Roxygen: list(markdown = TRUE)
RoxygenNote: 7.2.3