15 Commits (fbee5d6229c64f6b6ffefaa4705e78b1c5fc679a)

Author SHA1 Message Date
  alpcentaur 953f85ee5b added new lines to chromedriver, to make it work on other systems 9 months ago
  alpcentaur d2324d265a added pdf child text downloading and parse to json exceptions/cases for javascript entry data and normal data 9 months ago
  alpcentaur ec180bed0a added flow for selenium grabbing popup instead of links for entries 9 months ago
  alpcentaur b4fd385c5d did some changes to main.py for using sys.argv 9 months ago
  alpcentaur 89dcca2031 added further handling for javascript links not being urls, made config for giz work 9 months ago
  alpcentaur a0075e429d added further database in config.yaml, added new exception for downloading js generated html pages 9 months ago
  alpcentaur b2cf4b67ce added first config parameters for search on not uniform entries 10 months ago
  alpcentaur ff23c22e3c added working bund.de-bekanntmachungen config with new example of xpath contains 10 months ago
  alpcentaur 06fa81e549 added function find config parameter and changed core spider 10 months ago
  alpcentaur a846ce04cc specifying the links, new exception clause if soupparser does not work 10 months ago
  alpcentaur c078ee4b1b first function works, actuall xml parser has still problems with certain xml types 10 months ago
  alpcentaur 8b20bc178f added multi pages configuration and code 10 months ago
  alpcentaur 7aa903883b update to config.yaml 10 months ago
  alpcentaur 5ac07d151a added first config.yaml template and started creating folder structure 10 months ago