Simple queued & clustered PhantomJS processing. https://www.npmjs.com/package/ghost-town
Now with 20% more creepiness! Check out Ghost Town 2's breaking changes in CHANGELOG.md.
Need highly scalable PhantomJS processing? Ghost Town makes it frighteningly easy! For example, on-demand page rendering, dispatched through Thrift:
var thrift = require("thrift");
var town = require("ghost-town")();

if (town.isMaster) {
    // Master: accept render requests over Thrift and queue them.
    // Renderer is assumed to be your Thrift-generated service.
    thrift.createServer(Renderer, {
        render: function (html, width, height, next) {
            town.queue({
                html: html,
                width: width,
                height: height
            }, function (err, data) {
                // Decode the worker's base64 result into a Buffer.
                next(err, !err && Buffer.from(data, "base64"));
            });
        }
    }).listen(1337);
} else {
    // Worker: process each queued item on a PhantomJS page.
    town.on("queue", function (page, data, next) {
        // sequential page setup
        // page.set("viewportSize", ...)
        // page.set("customHeaders", ...)
        // page.set("onLoadFinished", ...)
        // page.set("content", ...)
        page.renderBase64("jpeg", function (data) {
            next(null, data);
        });
    });
}
Ghost Town uses Node's Cluster API, so the master and worker share their code. On the master side, queue items and handle their results. On the worker side, process items and return their results.
town(options)
phantomBinary: String path to the PhantomJS executable. Default: automatic via $PATH.
phantomFlags: Object of strings to use for the PhantomJS options. Default: {}.
phantomPort: Number to use for the PhantomJS port range. Default: 12300 (plus 200).
workerCount: Number of workers to maintain. One or two per CPU is recommended. Default: 4.
workerDeath: Number of items to process before restarting a worker. Default: 25.
workerShift: Number of milliseconds to wait before restarting a worker. Default: -1 (forever).
pageCount: Number of pages to process at a time. If your processing is mostly asynchronous (vs. e.g. render-blocked), increasing this is recommended. Default: 1.
pageDeath: Number of milliseconds to wait before requeuing an item. If your processing is time-sensitive, decreasing this is recommended. Default: 30000.
pageTries: Number of times to retry items that have timed out. If your processing could fail forever, configuring this is recommended. Default: -1 (unlimited).
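To make the shape of the options object concrete, here is a minimal sketch that restates the documented defaults and overlays a few overrides. The `DEFAULTS` table and the `withDefaults` helper are illustrative names, not part of Ghost Town, which applies its own defaults; the helper only makes the effective configuration visible.

```javascript
// Defaults as documented above. phantomBinary is omitted because it is
// resolved from $PATH automatically.
var DEFAULTS = {
    phantomFlags: {},
    phantomPort: 12300,
    workerCount: 4,
    workerDeath: 25,
    workerShift: -1,
    pageCount: 1,
    pageDeath: 30000,
    pageTries: -1
};

// Hypothetical helper (not part of Ghost Town): overlay overrides on the defaults.
function withDefaults(overrides) {
    var options = {};
    Object.keys(DEFAULTS).forEach(function (key) {
        options[key] = DEFAULTS[key];
    });
    Object.keys(overrides || {}).forEach(function (key) {
        options[key] = overrides[key];
    });
    return options;
}

// Example: a time-sensitive renderer that gives up after three timeouts.
var options = withDefaults({ workerCount: 8, pageDeath: 10000, pageTries: 3 });

// In a real project the overrides alone would be passed to Ghost Town:
// var town = require("ghost-town")({ workerCount: 8, pageDeath: 10000, pageTries: 3 });
```

Lowering pageDeath together with a finite pageTries keeps a permanently failing item from being requeued forever.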
Starts Ghost Town and returns a Master or a Worker instance exposing the following.
Master#isRunning is set by Master#start() and Master#stop().
Master#isMaster and Worker#isMaster can be used to separate master- and worker-specific code.
Worker#phantom is the PhantomJS wrapper object provided by phantom. Only necessary for advanced uses such as listening to the .stdout and .stderr streams.
Master#start() and Master#stop()
Starts or stops processing. These spawn or kill workers and PhantomJS processes, so they're useful for managing resource usage or gracefully shutting down Node.
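One way to use Master#stop() for a graceful shutdown is to call it from a signal handler. The sketch below is an assumption-laden illustration: `stopOnSignal` is a hypothetical helper, and `fakeTown` and `fakeProcess` are stand-ins so that the snippet runs without PhantomJS. Only the `stop()` method and the `isRunning` property come from the documentation above.

```javascript
var EventEmitter = require("events").EventEmitter;

// Hypothetical helper: stop the master the first time `emitter` fires `signal`.
// `town` only needs the documented stop() method and isRunning property.
function stopOnSignal(town, emitter, signal) {
    emitter.once(signal, function () {
        if (town.isRunning) {
            town.stop();
        }
    });
}

// Stand-ins for a real master and for `process`, used only for this sketch.
var fakeTown = {
    isRunning: true,
    stop: function () {
        this.isRunning = false;
    }
};
var fakeProcess = new EventEmitter();

stopOnSignal(fakeTown, fakeProcess, "SIGTERM");
fakeProcess.emit("SIGTERM");

// With a real master this would be: stopOnSignal(town, process, "SIGTERM");
```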
Master#queue(data, [asap], next)
Queues an item for processing by a worker. data is passed to Worker!queue(), and next(err, data) is called when complete. Optionally pass true to asap to prepend to the queue.
Worker!queue(page, data, next)
Fired when a worker receives an item to process. page is the PhantomJS page, data is what was passed to Master#queue(), and next(err, data) passes the result back to the master.
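A handler can be written as a named function so that it can be exercised against a stub page before it is attached to a worker. The sketch below assumes that page.set(name, value, callback) invokes its callback once the value is applied; `renderItem` and `stubPage` are illustrative names. A real handler would typically also wait for onLoadFinished before rendering.

```javascript
// Handler with the documented (page, data, next) signature.
// Assumption: page.set(name, value, callback) calls back once the value is applied.
function renderItem(page, data, next) {
    page.set("viewportSize", { width: data.width, height: data.height }, function () {
        page.set("content", data.html, function () {
            page.renderBase64("jpeg", function (image) {
                // Hand the base64 image back to the master.
                next(null, image);
            });
        });
    });
}

// Stub page that records calls, so the sketch runs without PhantomJS.
var calls = [];
var stubPage = {
    set: function (name, value, done) {
        calls.push(name);
        done();
    },
    renderBase64: function (format, done) {
        calls.push("render:" + format);
        done("aGVsbG8=");
    }
};

var result;
renderItem(stubPage, { html: "<p>hi</p>", width: 800, height: 600 }, function (err, image) {
    result = { err: err, image: image };
});

// In a worker this would be attached with: town.on("queue", renderItem);
```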
© 2015 Buzzvil, shared under the MIT license.