A Faster Collectstatic

The fast collectstatic for Django projects with S3 as storage backend.

Features

  • Comparing and caching of md5 checksums before uploading

  • Parallel file uploads using Python’s multiprocessing module

Running Django’s collectstatic command can become really slow as more and more files are added to a project, especially if heavy libraries such as jQuery UI are included in the source. This is a custom management command that compares the md5 sum of files with S3 and completely ignores modified_time. The results of the hash lookups are cached locally using your default Django cache. This can make deploying much faster!
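The comparison described above can be sketched roughly as follows. This is a minimal illustration and not Collectfast's actual code: `should_upload`, `remote_md5_lookup`, and the plain dict standing in for the Django cache are hypothetical names chosen for this example.

```python
import hashlib


def local_md5(path, chunk_size=65536):
    """Return the hex md5 digest of a local file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def should_upload(path, key, remote_md5_lookup, cache):
    """Decide whether a file needs uploading, based only on its md5 sum.

    remote_md5_lookup(key) is a hypothetical callable that asks the storage
    backend for the remote checksum (None if the file does not exist yet).
    cache is any dict-like object standing in for the Django cache.
    """
    if key not in cache:
        # Cache miss: one round trip to the storage backend.
        cache[key] = remote_md5_lookup(key)
    # Modification times are never consulted, only the checksums.
    return cache[key] != local_md5(path)
```

In this sketch a second call for the same key is answered from the cache without contacting the storage backend, which is where most of the time is saved.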

Installation

Install the app using pip:

$ pip install Collectfast

Make sure you have this in your settings file and add 'collectfast' to your INSTALLED_APPS:

STATICFILES_STORAGE = "storages.backends.s3boto.S3BotoStorage"
AWS_PRELOAD_METADATA = True
INSTALLED_APPS = (
    # …
    'collectfast',
)

'collectfast' should come before 'django.contrib.staticfiles'. Otherwise Django will use the collectstatic command from django.contrib.staticfiles instead.

Note: preload_metadata of the storage class will be overwritten even if AWS_PRELOAD_METADATA is not set to True, see #30.

Usage

Collectfast overrides Django’s built-in collectstatic command, so just run python manage.py collectstatic as normal. You can disable Collectfast by using the --disable-collectfast option.

You can also disable collectfast by setting COLLECTFAST_ENABLED = False in your settings file. This is useful when using a local file storage backend for development.
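One way to tie this setting to the environment is sketched below. The DJANGO_ENV variable and its "production" value are assumptions made for this example, not something Collectfast defines.

```python
import os

# Hypothetical convention: DJANGO_ENV is set to "production" on deploy hosts.
DJANGO_ENV = os.environ.get("DJANGO_ENV", "development")

# Only let Collectfast take over collectstatic when deploying to S3;
# local development keeps Django's own collectstatic.
COLLECTFAST_ENABLED = DJANGO_ENV == "production"
```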

Set Up a Dedicated Cache Backend

It’s recommended to set up a dedicated cache backend for Collectfast. Whenever Collectfast does not find a file’s checksum in the cache, it triggers a lookup against the storage backend, so a fairly high TIMEOUT setting is recommended.

Set up your dedicated cache in settings.py with the COLLECTFAST_CACHE setting:

CACHES = {
    'default': {
        # Your default cache
    },
    'collectfast': {
        # Your dedicated Collectfast cache
    }
}

COLLECTFAST_CACHE = 'collectfast'

By default Collectfast will use the default cache.

Note: Collectfast will never clean the cache of obsolete files. To clean out the entire cache, use cache.clear(). Read more about Django’s cache framework.

Note: We recommend setting the MAX_ENTRIES option if you have more than 300 static files, see #47.
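As an illustration, a filled-in configuration might look like the sketch below. The choice of Django's file-based backend, the LOCATION path, and the TIMEOUT and MAX_ENTRIES values are assumptions for this example and should be adapted to your project. Django's default MAX_ENTRIES is 300, which is why larger projects need to raise it.

```python
CACHES = {
    "default": {
        "BACKEND": "django.core.cache.backends.locmem.LocMemCache",
    },
    "collectfast": {
        "BACKEND": "django.core.cache.backends.filebased.FileBasedCache",
        "LOCATION": "/tmp/collectfast_cache",  # assumed path, pick your own
        "TIMEOUT": 60 * 60 * 24 * 30,  # 30 days, in seconds
        "OPTIONS": {"MAX_ENTRIES": 10000},  # above the number of static files
    },
}

COLLECTFAST_CACHE = "collectfast"
```

With a dedicated alias like this, the Collectfast cache can be cleared on its own with caches['collectfast'].clear(), where caches is imported from django.core.cache.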

Enable Parallelization

The parallelization feature enables parallel file uploads using Python’s multiprocessing module. Enable it with the COLLECTFAST_THREADS setting.

To enable parallel file copying, a dedicated cache backend must be set up, and it must use a backend that is thread-safe, i.e. something other than Django’s default LocMemCache.

COLLECTFAST_THREADS = 20
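The general idea of fanning uploads out over a pool of workers can be sketched as follows. This is not Collectfast's implementation: upload is a hypothetical stand-in for copying one file to the storage backend, and the use of the thread-backed pool from multiprocessing.dummy is an assumption made for this example.

```python
from multiprocessing.dummy import Pool  # thread-backed pool with the Pool API

COLLECTFAST_THREADS = 20


def upload(path):
    """Hypothetical stand-in for copying one file to the storage backend."""
    return "uploaded " + path


def upload_all(paths, workers=COLLECTFAST_THREADS):
    """Run upload() for every path using a pool of worker threads."""
    # Pool.map blocks until every upload has finished and keeps input order.
    with Pool(workers) as pool:
        return pool.map(upload, paths)
```

Because the workers run concurrently and all read and write checksum entries, the cache they share has to tolerate concurrent access, which is the reason for the cache requirement above.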

Debug

By default, Collectfast suppresses any exceptions that happen when copying and lets Django’s collectstatic handle the file instead. To debug those suppressed errors, set COLLECTFAST_DEBUG = True in your Django settings file.
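The behaviour described above follows a common suppress-and-fall-back pattern, sketched here under stated assumptions. copy_with_fallback, fast_copy, and default_copy are hypothetical names and do not come from Collectfast's code.

```python
def copy_with_fallback(fast_copy, default_copy, path, debug=False):
    """Try the fast path first and fall back to the default on any error.

    With debug=True the original exception is re-raised instead of being
    swallowed, so the underlying problem becomes visible.
    """
    try:
        return fast_copy(path)
    except Exception:
        if debug:
            raise
        return default_copy(path)
```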

Contribution

Please feel free to contribute by using issues and pull requests. Discussion is open and welcome.

Testing

To run the integration tests, you need to set up an S3 bucket named collectfast and provide your AWS credentials as environment variables. You can do this by adding them to a file named aws-credentials, like this:

export AWS_ACCESS_KEY_ID="XXXX"
export AWS_SECRET_ACCESS_KEY="XXXX"

Then run the tests with . aws-credentials && python runtests.py.

If you don’t feel like setting up an S3 bucket, just skip setting the environment variables. The integration tests will still run, but they will fail.

To run tests with tox, set up a virtualenv, install tox with pip install tox, and then run tox in the project directory. To run tests for a single environment only, run e.g. tox -e py35-django110.

License

Collectfast is licensed under the MIT License; see the LICENSE file for more information. Previous versions of Collectfast were licensed under the Creative Commons Attribution-ShareAlike 3.0 Unported License.
