Spiders can become broken due to changes on the target site, which lead to different page layouts (therefore, broken XPath and CSS extractors). Often however, the information content of a page remains roughly similar, just in a different form or layout. This tool that can, in some fortunate cases, automatically infer extraction rules to keep a spider up-to-date with site changes.
Project description
The author of this package has not provided a project description
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distributions
Close
Hashes for scrapy_spider_auto_repair-0.1.4.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8c61db1ad9c631dd16b1e9d7f25af076b7fa34760caf6f2d4a97c498eba58ff0 |
|
MD5 | 23afe8dad9ac093eeed5c3e6898baaeb |
|
BLAKE2b-256 | 0af4155227ed07262aa84f1fb0128a9017b44371227f91a0a25a18bc7aa8aaee |
Close
Hashes for scrapy_spider_auto_repair-0.1.4-py3.6.egg
Algorithm | Hash digest | |
---|---|---|
SHA256 | 1270ba5bc823b52f082adaa9f3ee0f071f4c5b9961ba4909b90152db4e34a811 |
|
MD5 | 8c932ec918ab4cb72c5e098b8fddfbb0 |
|
BLAKE2b-256 | 56097cccd74c5714534a705e677f7738aec8f133f7048deb3f404acf9a2693c1 |
Close
Hashes for scrapy_spider_auto_repair-0.1.4-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | f669d1cd11fd863ee17c1b53a21d421885eb1808093cab9dd09f5397f693b176 |
|
MD5 | 3106148faa8089799791b49428e2fc8b |
|
BLAKE2b-256 | 0cdc0cc385d04c48d4906c348953ac63a843c32d1daa5502c6a4b1367f58a929 |