diff --git a/.gitignore b/.gitignore
index 90bde74..a0e6b3e 100644
--- a/.gitignore
+++ b/.gitignore
@@ -1,3 +1,4 @@
 /venv
 /spiders/pages/**
 /spiders/output/**
+/spiders/config.yaml
diff --git a/README.md b/README.md
index 5fb8c1c..cc0d696 100644
--- a/README.md
+++ b/README.md
@@ -16,5 +16,12 @@ Filter and serialize content to json.
 Filter either by xpath syntax.
 Or Filter with the help of Artificial Neural Networks (work in progress).
+To run this, create a python3 virtualenv, pip install -r requirements,
+and
+```
+apt install xvfb
+apt install chromium
+apt install chromium-webdriver
+```