Skip to content

Releases: webrecorder/browsertrix-crawler

Browsertrix Crawler 1.0.3

26 Mar 21:11
Compare
Choose a tag to compare

What's Changed

  • fixes redirected seed (from #475) being counted againt page limit: by @ikreymer in #509
  • sitemap improvements: gz support + application/xml + extraHops fix by @ikreymer in #511

Full Changelog: v1.0.2...v1.0.3

Browsertrix Crawler 1.1.0 Beta 2 (QA Crawl Support Beta)

23 Mar 05:11
Compare
Choose a tag to compare

What's Changed

  • Docs: Minor fixes to edit link & clarifications by @Shrinks99 in #501
  • Improved support for running as non-root by @ikreymer in #503
  • improvements to 'non-graceful' interrupt to ensure WARCs are still closed gracefully by @ikreymer in #504
  • service worker capture fix: disable by default for now by @ikreymer in #506
  • QA Crawl Support (Beta) by @ikreymer in #469

New Contributors

Full Changelog: v1.1.0-beta.1...v1.1.0-beta.2

Browsertrix Crawler 1.0.2

22 Mar 20:38
22a7351
Compare
Choose a tag to compare

What's Changed

  • service worker capture fix: disable service workers by default for now, add cli option by @ikreymer in #506

Full Changelog: v1.0.1...v1.0.2

Browsertrix Crawler 1.0.1

21 Mar 20:58
93c3894
Compare
Choose a tag to compare

What's Changed

  • Docs: Minor fixes to edit link & clarifications by @Shrinks99 in #501
  • Improved support for running as non-root by @ikreymer in #503
  • improvements to 'non-graceful' interrupt to ensure WARCs are still closed gracefully by @ikreymer in #504

New Contributors

Full Changelog: v1.0.0...v1.0.1

Browsertrix Crawl 1.1.0 Beta 1 (QA Support)

20 Mar 05:33
Compare
Choose a tag to compare

What's Changed

  • Merge Browsertrix Crawler 1.0.0 release!

Full Changelog: v1.1.0-beta.0...v1.1.0-beta.1

Browsertrix Crawler 1.0.0

19 Mar 17:58
Compare
Choose a tag to compare

Browsertrix Crawler 1.0.0 Release

  • New capture mechanism via Chrome Debug Protocol, instead of pywb
  • Updated mkdocs (hosted at: https://crawler.docs.browsertrix.com/)
  • Customizable WARC filenames
  • Improved log filtering
  • Conversion to TypeScript
  • Support for pageinfo: records per page.
  • Optimized Sitemap parsing

What's Changed

Full Changelog: v0.12.4...v1.0.0

Browsertrix Crawler 1.0.0 Beta 8

16 Mar 22:32
Compare
Choose a tag to compare
Pre-release

What's Changed

Full Changelog: v1.0.0-beta.7...v1.0.0-beta.8

Browsertrix Crawl 1.1.0 Beta 0 (QA Support)

13 Mar 04:45
Compare
Choose a tag to compare

Initial experimental build of QA / replay crawling support

What's Changed

Full Changelog: v1.0.0-beta.7...v1.1.0-beta.0

Browsertrix Crawler 1.0.0 Beta 7

08 Mar 07:38
9f18a49
Compare
Choose a tag to compare
Pre-release

What's Changed

  • Better tracking of failed requests + logging context exclude by @ikreymer in #485

Full Changelog: v1.0.0-beta.6...v1.0.0-beta.7

Browsertrix Crawler 1.0.0 Beta 6

05 Mar 08:01
65133c9
Compare
Choose a tag to compare
Pre-release

What's Changed

Full Changelog: v1.0.0-beta.5...v1.0.0-beta.6