Releases · ropensci/robotstxt

31 Aug 14:27

jrdnbradford

v0.7.15

fa7d50e

CRAN v0.7.15 Latest

What's Changed

Guess domain name with hyphen(s) correctly by @gittaca in #56
Repair parsing error due to commented-out field token by @gittaca in #59
Pr59 by @petermeissner in #61
add new maintainer by @pedrobtz in #63
there can be only one maintainer :-) by @maelle in #64
add R CMD CHECK action by @pedrobtz in #67
Remove old CI configs and fix badges by @jrdnbradford in #72
remove dev scripts by @pedrobtz in #73
fix vignette by @pedrobtz in #74
Fix/vignette by @pedrobtz in #75

New Contributors

@gittaca made their first contribution in #56
@pedrobtz made their first contribution in #63
@jrdnbradford made their first contribution in #72

Full Changelog: v0.7.7...v0.7.15

Contributors

petermeissner, pedrobtz, and 3 other contributors

22 Jul 08:23

petermeissner

v0.7.7

d762fd5

v0.7.7 - CRAN

v0.7.7 CRAN release

07 Jun 21:18

petermeissner

v0.7.4

a57b5c9

CRAN v0.7.4

v0.7.4 CRAN release

19 Jul 11:20

petermeissner

v0.6.2

edab064

CRAN v0.6.2

0.6.2 | 2018-07-18

minor : changed from future::future_lapply() to future.apply::future_lapply() to make package compatible with versions of future after 1.8.1
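
The change amounts to swapping the namespace of the call. A minimal sketch, assuming the future and future.apply packages are installed; the squaring function and the sequential plan are illustrative only and are not taken from the robotstxt source:

```r
# Illustrative only: not the robotstxt source code.
# A sequential plan keeps the example from spawning background workers.
future::plan(future::sequential)

# Before (the call named in the note above):
#   res <- future::future_lapply(1:3, function(i) i^2)

# After: future_lapply() is taken from the future.apply package.
res <- future.apply::future_lapply(1:3, function(i) i^2)
```

The result is an ordinary list, just as with lapply(), so surrounding code does not need to change.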

0.6.1 | 2018-05-30

minor : package was moved to other repo location and project status badge was added

0.6.0 | 2018-02-10

change/fix : the check function paths_allowed() would not return the correct result in some edge cases, indicating that the spiderbar/rep-cpp check method is more reliable and shall be the default and only method (see 1, see 2, see 3)

21 Nov 05:22

petermeissner

v0.5.2

e398840

CRAN v0.5.2

0.5.2 | 2017-11-12

fix : rt_get_rtxt() would break on Windows due to trying to readLines() from a folder

0.5.1 | 2017-11-11

change : spiderbar is now non-default second (experimental) check method
fix : there were warnings in case of multiple domain guessing

0.5.0 | 2017-10-07

feature : spiderbar's can_fetch() was added, now one can choose which check method to use for checking access rights
feature : use futures (from package future) to speed up retrieval and parsing
feature : now there is a get_robotstxts() function which is a 'vectorized' version of get_robotstxt()
feature : paths_allowed() now allows checking via either robotstxt parsed robots.txt files or via functionality provided by the spiderbar package (the latter should be faster by a factor of approximately 10)
feature : various functions now have a ssl_verifypeer option (analogous to the CURL option https://curl.haxx.se/libcurl/c/CURLOPT_SSL_VERIFYPEER.html) which might help with robots.txt file retrieval in some cases
change : user_agent for robots.txt file retrieval will now default to: sessionInfo()$R.version$version.string
change : robotstxt now assumes it knows how to parse; if it cannot parse, it assumes that it got no valid robots.txt file, meaning that there are no restrictions
fix : valid_robotstxt would not accept some actually valid robots.txt files

11 Sep 19:23

petermeissner

v0.4.1

afbf71e

CRAN v0.4.1

0.4.1 | 2017-08-20

restructure : put each function in separate file
fix : parsing would go bonkers for robots.txt of cdc.gov (e.g. combining all robots with all permissions) due to erroneous handling of the carriage return character (reported by @hrbrmstr - thanks)
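
The carriage return problem can be illustrated with base R. This is a sketch of one way to normalise line endings before splitting a robots.txt file into lines, not the package's actual fix; the sample text and the function name normalize_lines() are made up for the example:

```r
# Hypothetical helper, not part of robotstxt.
normalize_lines <- function(txt) {
  # Turn \r\n and bare \r line endings into \n before splitting.
  txt <- gsub("\r\n?", "\n", txt)
  strsplit(txt, "\n", fixed = TRUE)[[1]]
}

# Sample robots.txt content with Windows-style line endings.
raw_txt <- "User-agent: *\r\nDisallow: /private/\r\n"
lines <- normalize_lines(raw_txt)
```

Without the normalisation step, splitting on "\n" alone would leave a trailing "\r" on every line, which is the kind of stray character that can confuse field matching.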

0.4.0 | 2017-07-14

user_agent parameter added to robotstxt() and paths_allowed() to allow for a user-defined HTTP user-agent to be sent when retrieving the robots.txt file from a domain

0.3.4 | 2017-07-08

fix : non-robots.txt files (e.g. HTML files returned by the server instead of the requested robots.txt, as with facebook.com) would be handled as if they were non-existent / empty files (reported by @simonmunzert - thanks)
fix : UTF-8 encoded robots.txt with BOM (byte order mark) would break parsing although files were otherwise valid robots.txt files
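
For readers unfamiliar with the BOM issue, the sketch below shows how a byte order mark can be dropped when reading a file in base R. It relies on the "UTF-8-BOM" encoding accepted by file(); whether robotstxt handles the BOM this way is not stated here, and the temporary file and its contents are invented for the example:

```r
# Write a small robots.txt-like file that starts with a UTF-8 BOM.
tmp <- tempfile(fileext = ".txt")
bom <- as.raw(c(0xEF, 0xBB, 0xBF))
body <- charToRaw("User-agent: *\nDisallow: /private/\n")
writeBin(c(bom, body), tmp)

# Reading with encoding "UTF-8-BOM" removes the byte order mark if present.
con <- file(tmp, encoding = "UTF-8-BOM")
lines <- readLines(con)
close(con)
unlink(tmp)
```

With the BOM removed, the first line begins directly with the field name, so a parser matching "User-agent" at the start of a line is not thrown off by the invisible leading character.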

28 Apr 14:41

petermeissner

v0.3.2

f5cd54b

CRAN v0.3.2 (and from now on part of ROpenSci)

This is version 0.3.2 of the robotstxt package after having gone through code review at ROpenSci and after having been published again on CRAN.

