Skip to content

Commit

Permalink
update CHANGES.md for 0.4.0
Browse files Browse the repository at this point in the history
bump version to 0.4.0
remove extraneous logging
  • Loading branch information
ikreymer committed Jul 21, 2021
1 parent d40cf6c commit 6a65ea7
Show file tree
Hide file tree
Showing 4 changed files with 21 additions and 6 deletions.
13 changes: 13 additions & 0 deletions CHANGES.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,18 @@
## CHANGES

v0.4.0
- YAML based config, specifyable via --config property or via stdin (with '--config stdin')
- Support for different scope types ('page', 'prefix', 'host', 'any', 'none') + crawl depth at crawl level
- Per-Seed scoping, including different scope types, or depth and include/exclude rules configurable per seed in 'seeds' list via YAML config
- Support for 'blockRules' for blocking certain URLs from being stored in WARCs, conditional blocking for iframe based on contents, and iframe URLs (see README for more details)
- Interactive profile creation: creating profiles by interacting with embedded browser loaded in the browser (see README for more details).
- Screencasting: streaming the output of each window via websocket-based streaming, configurable with --screencastPort option
- New 'window' based parallelization: Open each worker in new window in same session
- Simplified custom driver config, default calls 'loadPage'
- Refactor arg parsing, other auxiliary functions into separate utils files
- Image customization: support for customizing browser image, eg. building with Chromium instead of Chrome, support for ARM architecture builds (see README for more details).
- Update to latest pywb (2.5.0b4), browsertrix-behaviors (0.2.3), py-wacz (0.3.1)

v0.3.2
- Added a `--urlFile` option: Allows users to specify a .txt file list of exact URLs to crawl (one URL per line).

Expand Down
2 changes: 1 addition & 1 deletion docker-compose.yml
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@ version: '3.5'

services:
crawler:
image: webrecorder/browsertrix-crawler:0.4.0-beta.3
image: webrecorder/browsertrix-crawler:0.4.0
build:
context: ./

Expand Down
2 changes: 1 addition & 1 deletion package.json
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
{
"name": "browsertrix-crawler",
"version": "0.4.0-beta.3",
"version": "0.4.0",
"main": "browsertrix-crawler",
"repository": "https://github.com/webrecorder/browsertrix-crawler",
"author": "Ilya Kreymer <[email protected]>, Webrecorder Software",
Expand Down
10 changes: 6 additions & 4 deletions util/blockrules.js
Original file line number Diff line number Diff line change
Expand Up @@ -47,9 +47,11 @@ class BlockRules
this.rules.push(new BlockRule(ruleData));
}

console.log("URL Block Rules:\n");
for (const rule of this.rules) {
console.log(rule.toString());
if (this.rules.length) {
console.log("URL Block Rules:\n");
for (const rule of this.rules) {
console.log(rule.toString());
}
}
}

Expand Down Expand Up @@ -150,7 +152,7 @@ class BlockRules
const body = this.blockErrMsg;
const putUrl = new URL(this.blockPutUrl);
putUrl.searchParams.set("url", url);
console.log("put url", putUrl.href);
//console.log("put url", putUrl.href);
await fetch(putUrl.href, {method: "PUT", headers: {"Content-Type": "text/html"}, body});
}
}
Expand Down

0 comments on commit 6a65ea7

Please sign in to comment.