Skip to content

Viewers for statistics and dashboarding of Domain Search Engine data

License

Notifications You must be signed in to change notification settings

pking74/memex-explorer

 
 

Repository files navigation

Build Status Coverage Status

memex-explorer

Memex Explorer is a web application that provides easy-to-use interfaces for gathering, analyzing, and graphing web crawl data.

Local Development

To setup your machine, you will need Anaconda or Miniconda installed. Miniconda is a minimal Anaconda installation that bootstraps conda and Python on any operating system. Install Anaconda from http://continuum.io/downloads or Miniconda from http://conda.pydata.org/miniconda.html

Clone the repository, then:

cd memex-explorer/source

Run the following commands:

$ ./app_setup.sh
$ source activate memex
$ supervisord

This script will set up a conda environment named memex, prepare the application by creating an empty database, then launch all of the necessary services for the application. If there are any problems with any of these commands, please report them as a GitHub issue.

If you have already run the install script, simply run supervisord from the memex-explorer/source directory to restart all of the services.

The supervisord will start supervisord in the foreground, which will in turn ensure that all services associated with the core Memex Explorer environment are running. To stop supervisord and the associated services, send an interrupt to the process with Ctrl-c.

Memex Explorer will now be running locally at http://localhost:8000

Testing

To run memex-explorer tests, use the following command from within an active environment:

$ py.test

Building the Documentation

The project documentation is written in reStructuredText and can be built using Sphinx.

$ cd docs
$ make html

The documentation is then available within build/html/index.html

Administration

To access the administration panel, navigate to http://localhost:8000/admin (or the equivalent deployed URL) after starting Memex Explorer. Here you will be able to view and make manual changes to the database.

Deploying

The current method for deploying to the web is to deploy to ec2 by running a fabric script with a few environment variables set.

$ git clone https://github.com/memex-explorer/memex-explorer
$ cd memex-explorer/deploy
$ conda env create --file deploy_environment.yml
$ source activate memex_deploy
$ cp deploy_ec2.sh nocommit.sh

Now edit the file nocommit.sh. It will contain three environment variables which you must set and which you must not commit to the public repository.

AWS_KEY_ID: The key id for your aws account

AWS SECRET: The key secret for your aws account

HTPASSWD_PATH: The HTTP login password path. This file should have been
given to you.  Place it at a location not tracked by git and enter the absolute
path to this location in the value of this variable.

Additionally, you can choose to deploy a different git branch than the production branch.

Once you have set these variables, you can start a new instance with source nocommit.sh, which will create an ec2 instance, place a login key for it in memex-explorer/deploy/keys and run the deploy script on the new instance.

The login key for the new instance will be given three names:

One based on the IP address of the new server.

One based on the creation time of the new server.

latest.pem, a convenience to logging in to the most-recently-created server.

To connect to a instance given an IP address of 54.167.11.71, log in with the command

ssh -i keys/ec2-54.167.11.71.pem [email protected]

After the setup script is done running, you will be able to access the application by entering the IP address into your browser.

About

Viewers for statistics and dashboarding of Domain Search Engine data

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 44.4%
  • HTML 20.3%
  • JavaScript 15.6%
  • CSS 14.1%
  • SaltStack 2.6%
  • Shell 2.1%
  • Other 0.9%