Skip to main content

A Faster Collectstatic

Project description

Build Status Windows Build Status Coverage Status Join the chat at https://gitter.im/antonagestam/collectfast

The fast collectstatic for Django projects with S3 as storage backend.

Features

  • Comparing and caching of md5 checksums before uploading

  • Parallel file uploads using Python’s multiprocessing module

Running Django’s collectstatic command can become really slow as more and more files are added to a project, especially if heavy libraries such as jQuery UI are included in the source. This is a custom management command that compares the md5 sum of files with S3 and completely ignores modified_time. The results of the hash lookups are cached locally using your default Django cache. This can make deploying much faster!

Installation

Install the app using pip:

$ pip install Collectfast

Make sure you have this in your settings file and add 'collectfast' to your INSTALLED_APPS:

STATICFILES_STORAGE = "storages.backends.s3boto.S3BotoStorage"
AWS_PRELOAD_METADATA = True
INSTALLED_APPS = (
    # …
    'collectfast',
)

'collectfast' should come before 'django.contrib.staticfiles'. Please note, that failure to do so will cause Django to use django.contrib.staticfiles’s collectstatic.

Note: preload_metadata of the storage class will be overwritten even if AWS_PRELOAD_METADATA is not set to True see #30

Usage

Collectfast overrides Django’s builtin collectstatic command so just run python manage.py collectstatic as normal. You can disable Collectfast by using the --disable-collectfast option.

You can also disable collectfast by setting COLLECTFAST_ENABLED = False in your settings file. This is useful when using a local file storage backend for development.

Setup Dedicated Cache Backend

It’s recommended to setup a dedicated cache backend for Collectfast. Every time Collectfast does not find a lookup for a file in the cache it will trigger a lookup to the storage backend, so it’s recommended to have a fairly high TIMEOUT setting.

Set up your dedicated cache in settings.py with the COLLECTFAST_CACHE setting:

CACHES = {
    'default': {
        # Your default cache
    },
    'collectfast': {
        # Your dedicated Collectfast cache
    }
}

COLLECTFAST_CACHE = 'collectfast'

By default Collectfast will use the default cache.

Note: Collectfast will never clean the cache of obsolete files. To clean out the entire cache, use cache.clear(). Read more about Django’s cache framework.

Note: We recommend you to set the MAX_ENTRIES setting if you have more than 300 static files, see #47

Enable Parallelization

The parallelization feature enables parallel file uploads using Python’s multiprocessing module. Enable it by setting the COLLECTFAST_THREADS setting.

To enable parallelization of file copying, a dedicated cache backend must be setup and it must use a backend that is threadsafe, i.e. something other than Django’s default LocMemCache.

COLLECTFAST_THREADS = 20

Debug

By default, Collectfast will suppress any exceptions that happens when copying and let Django’s collectstatic handle it. To debug those suppressed errors you can set COLLECTFAST_DEBUG = True in your Django settings file.

Contribution

Please feel free to contribute by using issues and pull requests. Discussion is open and welcome.

Testing

To run integration tests you need to setup an S3 bucket with the name collectfast and set your AWS credentials as environment variables. You can do this by adding them to a file aws-credentials like this:

export AWS_ACCESS_KEY_ID="XXXX"
export AWS_SECRET_ACCESS_KEY="XXXX"

And then running the tests with . aws-credentials && python runtests.py.

If you don’t feel like setting up an S3 bucket, just skip setting the environment variables. The integration tests will still run but fail.

To run tests with tox, setup a virtualenv and install tox with pip install tox then run tox in the project directory. To only run tests for a certain environment run e.g. tox -e py35-django110.

License

Collectfast is licensed under the MIT License, see LICENSE file for more information. Previous versions of Collectfast was licensed under Creative Commons Attribution-ShareAlike 3.0 Unported License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

Collectfast-0.5.2.tar.gz (8.1 kB view details)

Uploaded Source

Built Distribution

Collectfast-0.5.2-py3-none-any.whl (13.2 kB view details)

Uploaded Python 3

File details

Details for the file Collectfast-0.5.2.tar.gz.

File metadata

File hashes

Hashes for Collectfast-0.5.2.tar.gz
Algorithm Hash digest
SHA256 b50f958d8816d4ef272e893de18e7706c707bb746e4f745b4116617243f34849
MD5 c06626483d374b36fffb6885821bbc23
BLAKE2b-256 84704d83bb4c044527a268094533c39eb3a335dbc1c533a04c60dac6ecf18ba7

See more details on using hashes here.

File details

Details for the file Collectfast-0.5.2-py3-none-any.whl.

File metadata

File hashes

Hashes for Collectfast-0.5.2-py3-none-any.whl
Algorithm Hash digest
SHA256 b044ce9621760c2b6e335e60c02b1a62da9a26ab028bf196b43a88a89a817ee4
MD5 c74ed3530db5377d66d90f0a8d56fb16
BLAKE2b-256 bb456b4bf4948608fcecb63e3f697903a7c1d9d74537d6b610c7ea2028cb1cfc

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page