A typical crawler that returns a list of URLs

Project description

# Copyright (C) 2011 Diego Pardilla Mata
#
# This file is part of SpiderBOY.
#
# SpiderBOY is free software: you can redistribute it and/or modify
# it under the terms of the GNU General Public License as published by
# the Free Software Foundation, either version 3 of the License, or
# (at your option) any later version.
#
# This program is distributed in the hope that it will be useful,
# but WITHOUT ANY WARRANTY; without even the implied warranty of
# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
# GNU General Public License for more details.
#
# You should have received a copy of the GNU General Public License
# along with this program. If not, see <http://www.gnu.org/licenses/>.

This is the SpiderBOY crawler.

It has (among others) the following major features:

Searches for URLs starting from a given URL, down to a specified depth
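
The depth-limited search described above can be sketched roughly as follows. This is not SpiderBOY's actual source, only a minimal illustration of the idea in Python 3 using the standard library. The names `LinkParser`, `fetch_html` and `crawl`, and the `fetch` parameter, are assumptions made for this example.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin
from urllib.request import urlopen


class LinkParser(HTMLParser):
    """Collect the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def fetch_html(url):
    """Download a page and return its text (hypothetical default fetcher)."""
    with urlopen(url, timeout=10) as response:
        return response.read().decode("utf-8", errors="replace")


def crawl(start_url, levels, fetch=fetch_html):
    """Return the URLs found within `levels` link hops of start_url."""
    seen = {start_url}
    found = []
    queue = deque([(start_url, 0)])
    while queue:
        url, depth = queue.popleft()
        if depth >= levels:
            continue  # pages at the depth limit are listed but not expanded
        try:
            html = fetch(url)
        except Exception:
            continue  # skip pages that cannot be fetched
        parser = LinkParser()
        parser.feed(html)
        for href in parser.links:
            # Resolve relative links and drop any #fragment part.
            absolute, _fragment = urldefrag(urljoin(url, href))
            if absolute not in seen:
                seen.add(absolute)
                found.append(absolute)
                queue.append((absolute, depth + 1))
    return found
```

In this sketch, a depth of 1 returns only the links on the starting page, and a depth of 2 also returns the links found on those pages. Passing a different `fetch` function makes it possible to try the logic without network access.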

Installation:

See INSTALL file for information on how to install SpiderBOY

Starting SpiderBOY:

Run from the command line: `spiderboy.py [-h] [-n NUMBER_OF_LEVELS] [url]'
Run `spiderboy.py -h' for more startup options.
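
For readers who want to see how a command line of this shape could be parsed, here is a small sketch with Python's `argparse`. It is an illustration only and is not taken from SpiderBOY itself. The function name `build_parser`, the help texts, and the default of 1 level are assumptions.

```python
import argparse


def build_parser():
    """Build a parser for: spiderboy.py [-h] [-n NUMBER_OF_LEVELS] [url]"""
    parser = argparse.ArgumentParser(
        prog="spiderboy.py",
        description="Crawl a URL and list the URLs found.",
    )
    parser.add_argument(
        "-n",
        dest="number_of_levels",
        type=int,
        default=1,  # assumed default; the real default is not documented here
        help="how many levels of links to follow",
    )
    # nargs="?" makes the positional url optional, as in the usage line.
    parser.add_argument("url", nargs="?", help="URL to start crawling from")
    return parser
```

For example, `build_parser().parse_args(["-n", "3", "http://example.com"])` yields a namespace with `number_of_levels` set to 3 and `url` set to the given address.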

More information:

http://code.sidelab.es/projects/diegopardilla/wiki

Written by:

see AUTHORS for more information

Reporting Bugs:

Please report bugs to our mailing list or our bugtracker:
munikes@members.fsf.org
pardilla@members.fsf.org
http://code.sidelab.es/projects/diegopardilla/issues/new

Contacting us:

http://packages.python.org/SpiderBOY (homepage)
munikes@members.fsf.org (mUniKeS)
pardilla@members.fsf.org (Diego Pardilla)

Download files

Source Distribution

SpiderBOY-1.0.0.tar.gz (4.6 kB)

Uploaded Source

Built Distributions

SpiderBOY-1.0.0.linux-x86_64.exe (70.3 kB)

Uploaded Source

SpiderBOY-1.0.0-py2.7.egg (8.5 kB)

Uploaded Egg

File details

Details for the file SpiderBOY-1.0.0.tar.gz.

File metadata

  • Download URL: SpiderBOY-1.0.0.tar.gz
  • Upload date:
  • Size: 4.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for SpiderBOY-1.0.0.tar.gz
Algorithm    Hash digest
SHA256       54100fc4705c9884e2d5d689a0490f8fc2bba01a2ee1bcb6a570f446685be5f6
MD5          da15a4ce9b2d9ce622fc6bfda37fbba7
BLAKE2b-256  b924aaea12d5d766ecd663941b23ee3dc467236a186cbe63971b3ebed5e64186
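
A downloaded copy of SpiderBOY-1.0.0.tar.gz can be checked against the SHA256 digest listed above. The following is a minimal sketch using Python's `hashlib`. The function names `sha256_of` and `matches_published_hash` are assumptions made for this example, and the file path is whatever location the archive was saved to.

```python
import hashlib

# SHA256 digest listed above for SpiderBOY-1.0.0.tar.gz
EXPECTED_SHA256 = "54100fc4705c9884e2d5d689a0490f8fc2bba01a2ee1bcb6a570f446685be5f6"


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected=EXPECTED_SHA256):
    """Compare a file's digest with the expected value, ignoring case."""
    return sha256_of(path) == expected.lower()
```

Calling `matches_published_hash("SpiderBOY-1.0.0.tar.gz")` returns True only when the file on disk has the digest shown in the table.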

File details

Details for the file SpiderBOY-1.0.0.linux-x86_64.exe.

File hashes

Hashes for SpiderBOY-1.0.0.linux-x86_64.exe
Algorithm    Hash digest
SHA256       ff68834eb4c9cf7a077e2104ccb2c2a9cb6bdc750c073e951f6b32513b6ba66b
MD5          dc0d353160f569530b4af76d58e0903e
BLAKE2b-256  7a279f398ec5f9309033bd6431a4d4c4fc0b68a284faac461946269ba644dab0

File details

Details for the file SpiderBOY-1.0.0-py2.7.egg.

File metadata

  • Download URL: SpiderBOY-1.0.0-py2.7.egg
  • Upload date:
  • Size: 8.5 kB
  • Tags: Egg
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for SpiderBOY-1.0.0-py2.7.egg
Algorithm    Hash digest
SHA256       2c5e19803f52b18dbab6cc0e7a1c1836944f8c003dc8df78aa746988431f2087
MD5          682dd9279f9fe5ccb6f8eb3344557967
BLAKE2b-256  0c7c2f926d3c22827dc23c3f0a76a29d7fa62329bf13abb730cfae6c3fb92c89
