Skip to main content

A Faster Collectstatic

Project description

Build Status Coverage Status

The fast collectstatic for Django projects with S3 as storage backend.

Features

  • Comparing and caching of md5 checksums before uploading

  • Parallel file uploads using Python’s multiprocessing module

Running Django’s collectstatic command can become really slow as more and more files are added to a project, especially if heavy libraries such as jQuery UI are included in the source. This is a custom management command that compares the md5 sum of files with S3 and completely ignores modified_time. The results of the hash lookups are cached locally using your default Django cache. This can make deploying much faster!

Installation

Install the app using pip:

$ pip install Collectfast

Make sure you have this in your settings file and add 'collectfast' to your INSTALLED_APPS:

STATICFILES_STORAGE = "storages.backends.s3boto.S3BotoStorage"
INSTALLED_APPS = (
    # …
    'collectfast',
)

'collectfast' should come before 'django.contrib.staticfiles'. Please note, that failure to do so will cause Django to use django.contrib.staticfiles’s collectstatic.

Note: preload_metadata of the storage class will be overwritten as True, see #30

Usage

Collectfast overrides Django’s builtin collectstatic command so just run python manage.py collectstatic as normal. You can disable Collectfast by using the --disable-collectfast option.

You can also disable collectfast by setting COLLECTFAST_ENABLED = False in your settings file. This is useful when using a local file storage backend for development.

Setup Dedicated Cache Backend

It’s recommended to setup a dedicated cache backend for Collectfast. Every time Collectfast does not find a lookup for a file in the cache it will trigger a lookup to the storage backend, so it’s recommended to have a fairly high TIMEOUT setting.

Set up your dedicated cache in settings.py with the COLLECTFAST_CACHE setting:

CACHES = {
    'default': {
        # Your default cache
    },
    'collectfast': {
        # Your dedicated Collectfast cache
    }
}

COLLECTFAST_CACHE = 'collectfast'

By default Collectfast will use the default cache.

Note: Collectfast will never clean the cache of obsolete files. To clean out the entire cache, use cache.clear(). Read more about Django’s cache framework.

Note: We recommend you to set the MAX_ENTRIES setting if you have more than 300 static files, see #47

Enable Parallelization

The parallelization feature enables parallel file uploads using Python’s multiprocessing module. Enable it by setting the COLLECTFAST_THREADS setting.

To enable parallelization of file copying, a dedicated cache backend must be setup and it must use a backend that is threadsafe, i.e. something other than Django’s default LocMemCache.

COLLECTFAST_THREADS = 20

Debug

By default, Collectfast will suppress any exceptions that happens when copying and let Django’s collectstatic handle it. To debug those suppressed errors you can set COLLECTFAST_DEBUG = True in your Django settings file.

Contribution

Please feel free to contribute by using issues and pull requests. Discussion is open and welcome.

Testing

The test suite is built to run against an S3 bucket. To be able to test locally you need to provide AWS credentials for a bucket to test against. Add the credentials to a file named aws-credentials in the root of the project directory:

export AWS_ACCESS_KEY_ID=''
export AWS_SECRET_ACCESS_KEY=''

Install test dependencies and target Django version:

pip install -r test-requirements.txt
pip install django==2.2

Run linter and test suite:

flake8
black --check .
make test

License

Collectfast is licensed under the MIT License, see LICENSE file for more information. Previous versions of Collectfast was licensed under Creative Commons Attribution-ShareAlike 3.0 Unported License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

Collectfast-1.0.0.tar.gz (9.6 kB view details)

Uploaded Source

Built Distribution

Collectfast-1.0.0-py3-none-any.whl (11.8 kB view details)

Uploaded Python 3

File details

Details for the file Collectfast-1.0.0.tar.gz.

File metadata

  • Download URL: Collectfast-1.0.0.tar.gz
  • Upload date:
  • Size: 9.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.33.0 CPython/3.7.3

File hashes

Hashes for Collectfast-1.0.0.tar.gz
Algorithm Hash digest
SHA256 f194cc3a7f7d502a487d80e473210173d6832fd4d363a83e6fe567344e077ecd
MD5 c9392874b6f1d1a732fbe5c93e1da39a
BLAKE2b-256 7070c7292a5b86b36336fc70b5bf77ad80d09157f57460f6a342685e1b2e2bdc

See more details on using hashes here.

File details

Details for the file Collectfast-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: Collectfast-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 11.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.33.0 CPython/3.7.3

File hashes

Hashes for Collectfast-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 a0f7a714f58ccd7abd4f2a04279cb9c69c8af023c38f5b2628f7e2bc1ad93a68
MD5 4346ddcb86294bff2f1bcbc5841f90c0
BLAKE2b-256 fac391f3be11978e972eb8afde7edc2d48717bfaea79efb71f5c6fa55c7aefd0

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page