galaxiat.serve.seo Node.js package allows you to easily cron crawl path that you want to have an html version (for dynamic rendering like React) without have to make SSR when request is passed.
We use it at Galaxiat to do our https://galaxiatapp.com/pub/hash/dev rendering.
galaxiat.serve.seo Support both static and dynamic route.
Dynamic route can be delivered by a remote json endpoint and static route can be delivered by the .galaxiat.json
file.
Node.js 16.9.0 or newer is required.
npm install galaxiat.serve.seo
.galaxiat.json
OR .galaxiat.{env}.json
To set env use the GALAXIAT_SERVE_ENV
var
type
: remote
| local
- Remote will use the url provided on the
remote
key to connect a remote chrome instance.- NB : include the
/playwright
at the end of the url - For remote usage we recommend the use of Token see the docs of browserless.io for more infos
- NB : include the
- Local will spawn a chrome headless browser with args on the
args
key.
{
"hostname" : "galaxiatapp.com",
"port" : 3000,
"type" : "remote",
"args" : ["--no-sandbox",
"--disable-setuid-sandbox"],
"remote" : "wss://chrome.shared.svc.galaxiat.fr/playwright?token=MWkH6L4K3knkG3hvsaHrnzA5g6dtfucYk5nD9YVBRRh9ZtdPyDaE",
"target" : "http://localhost:3000",
"public" : "./public",
"crawl" : [
{
"type" : "config",
"url" : "/path",
"file" : "/cache/path.html",
"cron" : "0 * * * * *"
},
{
"type" : "remote",
"json_url" : "https://api.galaxiatapp.com/seo/galaxiat.json",
"cron" : "0 */15 * * * *"
}
],
"crawl_cron" : "* * * * * *",
"crawl_max_num" : 3,
"crawl_queue_num" : 10,
"errors" : {
"https" : false
}
}
https://galaxiatapp.com/seo/galaxiat.json
[
{
"url": "https://galaxiatapp.com/pub/hash/dev",
"file": "/pub/hash/dev.html"
},{
"url": "https://galaxiatapp.com/pub/hash/something",
"file": "/pub/hash/something.html"
}
]
V1.X.X
- Single workload implementation- Per node deployment -> not so good for performance
- Crawling is done on the local node
V2.X.X
- Multiple workload implementation- Multi-node deployment -> better performance
- Crawling is done on remote node
V3.X.X
- Advanced Multiple workload implementation- Multi-node deployment + Cluster cache -> better performance
- Cache is cluster wide instead of a local cache per node
see browserless.io for more infos
Local mode is not recommended for production
Before creating an issue, please ensure that it hasn't already been reported/suggested.
Software is under MIT license