- 26 Mar, 2020 1 commit
-
-
Jorge Ignacio Vicuña Jiménez authored
Dev See merge request !3
-
- 25 Mar, 2020 3 commits
-
-
Jorge Vicuña authored
The new offer_percent field should help with identifying the best offers available.
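A minimal sketch of how an offer_percent value could be derived and used to rank offers. The `price` and `offer_price` field names and the rounding are assumptions for illustration, not taken from the scraper itself.

```python
def offer_percent(regular_price, offer_price):
    """Return the discount as a percentage of the regular price (one decimal)."""
    # Guard against missing or non-positive prices to avoid division by zero.
    if regular_price is None or offer_price is None or regular_price <= 0:
        return 0.0
    discount = max(regular_price - offer_price, 0)
    return round(discount / regular_price * 100, 1)


# Hypothetical scraped items; field names are assumptions.
items = [
    {"name": "Arroz 1 kg", "price": 1000, "offer_price": 750},
    {"name": "Leche 1 L", "price": 2000, "offer_price": 1800},
    {"name": "Pan", "price": 1200, "offer_price": 1200},
]

for item in items:
    item["offer_percent"] = offer_percent(item["price"], item["offer_price"])

# Largest discount first, so the best offers come out on top.
best_first = sorted(items, key=lambda item: item["offer_percent"], reverse=True)
```

Storing the percentage on the item means a query can sort by that one field instead of recomputing the discount each time.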
-
Jorge Vicuña authored
In order to maximize the number of items scraped from Jumbo without ending up in a loop, a page limit has been set per category. Also, the API key and the page number are now passed via headers instead of URL parameters.
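The idea can be sketched with the standard library alone, without sending anything over the network. The URL, the header names (`x-api-key`, `page`) and the limit value are placeholders, not the ones the Jumbo spider actually uses.

```python
from urllib.request import Request

MAX_PAGES_PER_CATEGORY = 50  # hypothetical limit
API_URL = "https://api.example.com/catalog/{category}"  # placeholder URL


def build_requests(category, api_key, max_pages=MAX_PAGES_PER_CATEGORY):
    """Yield one request per page, stopping at the per-category page limit."""
    for page in range(1, max_pages + 1):
        # The key and page number travel in headers, so the URL stays the same.
        yield Request(
            API_URL.format(category=category),
            headers={"x-api-key": api_key, "page": str(page)},
        )


requests_for_category = list(build_requests("lacteos", "SECRET", max_pages=3))
```

Because the loop is bounded by `max_pages`, a site that keeps returning results past its last page cannot keep the crawl running forever. In a Scrapy spider the same headers could be passed through the `headers` argument of `scrapy.Request`.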
-
Jorge Vicuña authored
-
- 22 Mar, 2020 1 commit
-
-
Jorge Vicuña authored
This change should make it easier to link the products and categories tables.
-
- 03 Mar, 2020 2 commits
-
-
Jorge Vicuña authored
Now using a string-based date stamp. Also fixed Tottus' string categories.
-
Jorge Vicuña authored
-
- 18 Feb, 2020 4 commits
-
-
Jorge Ignacio Vicuña Jiménez authored
Scrape date and simultaneous spider scraping. See merge request !2
-
Jorge Vicuña authored
-
Jorge Vicuña authored
With the run.py script, one can execute all crawlers in a single process.
-
Jorge Vicuña authored
Now that the items are dated, it's possible to build a 'price history' to check changes in prices over time.
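One way such a price history could be assembled from dated items is sketched below. The field names (`store`, `sku`, `date`, `price`) and the ISO `YYYY-MM-DD` date format are assumptions; ISO strings are convenient here because they sort chronologically as plain text.

```python
from collections import defaultdict


def price_history(items):
    """Group (date, price) points by product and sort them by date."""
    history = defaultdict(list)
    for item in items:
        key = (item["store"], item["sku"])
        history[key].append((item["date"], item["price"]))
    return {key: sorted(points) for key, points in history.items()}


def price_changes(points):
    """Return the (date, price) points where the price differs from the previous one."""
    return [
        (date, price)
        for (_, prev_price), (date, price) in zip(points, points[1:])
        if price != prev_price
    ]


# Hypothetical items scraped on different days.
items = [
    {"store": "jumbo", "sku": "123", "date": "2020-02-19", "price": 900},
    {"store": "jumbo", "sku": "123", "date": "2020-02-18", "price": 1000},
    {"store": "jumbo", "sku": "123", "date": "2020-02-20", "price": 900},
    {"store": "tottus", "sku": "9", "date": "2020-02-18", "price": 500},
]

history = price_history(items)
changes = price_changes(history[("jumbo", "123")])
```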
-
- 08 Feb, 2020 6 commits
-
-
Jorge Ignacio Vicuña Jiménez authored
Reached stable and operating state. See merge request !1
-
Jorge Vicuña authored
Now, to get a spider running, one must use scrapy crawl <Spider Name>
-
Jorge Vicuña authored
-
Jorge Vicuña authored
The logger replaces the 'debug' print statements lying around. The 'store' item parameter indicates which store the item comes from.
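As a rough illustration, the change might look like the sketch below, which uses the standard `logging` module. The logger name, the `build_item` helper and the item fields are hypothetical.

```python
import logging

logger = logging.getLogger("scraper_tiendas")  # logger name is an assumption


def build_item(raw, store):
    """Build an item dict tagged with the store it was scraped from."""
    # Takes the place of an ad-hoc print(); emitted only at DEBUG level.
    logger.debug("Scraped %r from %s", raw["name"], store)
    return {"name": raw["name"], "price": raw["price"], "store": store}


if __name__ == "__main__":
    logging.basicConfig(level=logging.DEBUG)
    print(build_item({"name": "Leche 1 L", "price": 890}, store="jumbo"))
```

Unlike print statements, the debug messages can be silenced by raising the log level, with no change to the code.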
-
Jorge Vicuña authored
With this, one can check which store a scraped item comes from.
-
Jorge Vicuña authored
Now the scraper can write the crawled items to a MongoDB database, with the connection credentials stored in scraper_tiendas/settings.py. The scraper's user must be created in the database beforehand.
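A minimal sketch of a Scrapy-style item pipeline that stores items in a collection. So that it runs without a database, it uses an in-memory stand-in that exposes the same `insert_one` method a pymongo collection has. The class names and item fields are assumptions, not the project's actual code.

```python
class MongoPipeline:
    """Scrapy-style item pipeline that stores each item in a collection."""

    def __init__(self, collection):
        # `collection` only needs an insert_one(document) method,
        # which pymongo collections provide.
        self.collection = collection

    def process_item(self, item, spider):
        self.collection.insert_one(dict(item))
        return item  # returned so any later pipeline stage still receives it


class InMemoryCollection:
    """Stand-in for a pymongo collection (illustration only)."""

    def __init__(self):
        self.documents = []

    def insert_one(self, document):
        self.documents.append(document)


collection = InMemoryCollection()
pipeline = MongoPipeline(collection)
pipeline.process_item({"name": "Pan", "price": 1200, "store": "lider"}, spider=None)
```

With a real database, the collection could instead be obtained with something like `pymongo.MongoClient(uri)[db_name][collection_name]`, where the URI and the names are read from the project settings.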
-
- 16 Jan, 2020 7 commits
-
-
Jorge Vicuña authored
-
Jorge Vicuña authored
-
Jorge Vicuña authored
To check product availability, an API call is needed. Because scrapy doesn't allow making sub-requests in the parse stage, I was forced to use requests. Also, the next-page section is now fixed: it no longer goes to the marketplace main page after reaching the last page.
-
Jorge Vicuña authored
-
Jorge Vicuña authored
-
Jorge Vicuña authored
Merge branch 'dev' of gitlab.labcomp.cl:speedwagon-iwg101-2019-2/gran-super/scraper-tiendas into dev Recovering changes made with GitLab's Web IDE
-
Jorge Vicuña authored
Basic spider for Lider, completing the 3 main stores. Changed the parser from html.parser to lxml.
-
- 15 Jan, 2020 2 commits
-
-
Jorge Ignacio Vicuña Jiménez authored
-
Jorge Vicuña authored
Added the spiders for the Jumbo and Tottus stores (WIP), plus a placeholder for the Lider store spider.
-
- 11 Jan, 2020 12 commits
-
-
Jorge Ignacio Vicuña Jiménez authored
-
Jorge Ignacio Vicuña Jiménez authored
-
Jorge Ignacio Vicuña Jiménez authored
-
Jorge Ignacio Vicuña Jiménez authored
-
Jorge Ignacio Vicuña Jiménez authored
-
Jorge Ignacio Vicuña Jiménez authored
-
Jorge Ignacio Vicuña Jiménez authored
-
Jorge Ignacio Vicuña Jiménez authored
-
Jorge Ignacio Vicuña Jiménez authored
-
Jorge Ignacio Vicuña Jiménez authored
-
Jorge Ignacio Vicuña Jiménez authored
-
Jorge Ignacio Vicuña Jiménez authored
-
- 19 Dec, 2019 2 commits
-
-
Jorge Ignacio Vicuña Jiménez authored
-
Jorge Ignacio Vicuña Jiménez authored
-