Python parser for Apache/nginx-style HTML directory listing.
Project description
Python parser for Apache/nginx-style HTML directory listing
import htmllistparse
cwd, listing = htmllistparse.fetch_listing(some_url)
# or you can get the url and make a BeautifulSoup yourself, then use
# cwd, listing = htmllistparse.parse(soup)
where cwd is the current directory, listing is a list of FileEntry named tuples:
name: File name, str. Have a trailing / if it’s a directory.
modified: Last modification time, time.struct_time or None. Timezone is not known.
size: File size, int or None. May be estimated from the prefiex, such as “K”, “M”.
description: File description, file type, or any other things found. str as HTML, or None.
Supports:
Vanilla Apache/nginx/lighttpd/darkhttpd autoindex
Most <pre>-style index
Many other <table>-style index
<ul>-style
ReHTTPFS
Reinvented HTTP Filesystem.
Mounts most HTTP file listings with FUSE.
Gets directory tree and file stats with less overHEAD.
Supports Range requests.
Supports Keep-Alive.
usage: rehttpfs.py [-h] [-o OPTIONS] [-t TIMEOUT] [-u USER_AGENT] [-v] [-d] url mountpoint Mount HTML directory listings. positional arguments: url URL to mount mountpoint filesystem mount point optional arguments: -h, --help show this help message and exit -o OPTIONS comma seperated FUSE options -t TIMEOUT, --timeout TIMEOUT HTTP request timeout -u USER_AGENT, --user-agent USER_AGENT HTTP User-Agent -v, --verbose enable debug logging -d, --daemon run in background
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.