ZOMBIE Squirrel

zombie-squirrel is a set of one-line functions that handle the entire process of caching and retrieving data (and metadata) from AIND data assets.

In the background, the ZOMBIE squirrel repackages data/metadata into dataframes and stores them on S3 in a flat bucket, or in memory for testing.

Installation

pip install zombie-squirrel

Usage

Set backend

export FOREST_TYPE='S3'

Options are 'S3' and 'MEMORY'. The 'MEMORY' backend keeps the cache in memory and is intended for testing.
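The backend can also be selected from Python, which may be convenient in a test session. The following is a minimal sketch, assuming zombie_squirrel reads FOREST_TYPE from the environment when it is imported or first used, so the variable is set beforehand. `select_forest` is a hypothetical helper for illustration and is not part of the package.

```python
import os

# The two backend names documented above.
VALID_FOREST_TYPES = ("S3", "MEMORY")


def select_forest(forest_type: str) -> str:
    """Set FOREST_TYPE for this process (hypothetical helper, not in zombie_squirrel)."""
    if forest_type not in VALID_FOREST_TYPES:
        raise ValueError(
            f"FOREST_TYPE must be one of {VALID_FOREST_TYPES}, got {forest_type!r}"
        )
    os.environ["FOREST_TYPE"] = forest_type
    return forest_type


# Use the in-memory backend, e.g. at the top of a test module.
# Assumption: this has to run before zombie_squirrel reads the variable.
select_forest("MEMORY")
```

Validating the value up front turns a typo such as 'memory' or 'DISK' into an immediate error rather than leaving it to whatever the library does with an unknown value.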

Scurry (fetch) data

from zombie_squirrel import unique_project_names

project_names = unique_project_names()

Acorns

| Acorn | Description | Location | Type | Partitioned | Columns |
| --- | --- | --- | --- | --- | --- |
| unique_project_names | Unique project names across all assets | s3://allen-data-views/data-asset-cache/zs_unique_project_names.pqt | metadata | False | project_name |
| unique_subject_ids | Unique subject_ids across all assets | s3://allen-data-views/data-asset-cache/zs_unique_subject_ids.pqt | metadata | False | subject_id |
| asset_basics | Commonly used asset metadata, one row per data asset | s3://allen-data-views/data-asset-cache/zs_asset_basics.pqt | metadata | False | _id, _last_modified, modalities, project_name, data_level, subject_id, acquisition_start_time, acquisition_end_time, code_ocean, process_date, genotype, location, name |
| source_data | Mapping from derived asset names to their source raw asset names | s3://allen-data-views/data-asset-cache/zs_source_data.pqt | metadata | False | name, source_data, pipeline_name, processing_time |
| raw_to_derived | Mapping from raw asset names to their derived asset names | s3://allen-data-views/data-asset-cache/zs_raw_to_derived.pqt | metadata | False | |
| quality_control | Quality control table with one row per QC metric | s3://allen-data-views/data-asset-cache/zs_qc/ | asset | True (by subject_id) | name, stage, modality, value, status, asset_name |
| assets_smartspim | SmartSPIM assets with processing status and neuroglancer links | s3://allen-data-views/data-asset-cache/zs_assets_smartspim.pqt | metadata | False | subject_id, genotype, institution, acquisition_start_time, processing_end_time, stitched_link, processed, name, channel_1, segmentation_link_1, quantification_link_1, channel_2, segmentation_link_2, quantification_link_2, channel_3, segmentation_link_3, quantification_link_3 |
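Most locations listed above share one naming pattern: the acorn name with a zs_ prefix and a .pqt suffix under s3://allen-data-views/data-asset-cache/. The sketch below encodes that pattern as it appears in the table. `acorn_location` is a hypothetical helper rather than a zombie_squirrel function, and the partitioned quality_control acorn is treated as the one listed exception.

```python
CACHE_PREFIX = "s3://allen-data-views/data-asset-cache/"

# Acorns whose listed location does not follow the zs_<name>.pqt pattern.
SPECIAL_LOCATIONS = {"quality_control": CACHE_PREFIX + "zs_qc/"}


def acorn_location(acorn: str) -> str:
    """Return the S3 location listed for an acorn (hypothetical helper).

    For names that are not in the table this only applies the observed
    pattern; it does not check that such an object exists.
    """
    return SPECIAL_LOCATIONS.get(acorn, f"{CACHE_PREFIX}zs_{acorn}.pqt")


print(acorn_location("asset_basics"))
# s3://allen-data-views/data-asset-cache/zs_asset_basics.pqt
print(acorn_location("quality_control"))
# s3://allen-data-views/data-asset-cache/zs_qc/
```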

Custom acorn

The custom function stores and retrieves your own DataFrames in the cache by name: call it with a name and a DataFrame to store, and with the name alone to retrieve. This requires write authentication to the active backend.

from zombie_squirrel import custom
import pandas as pd

df = pd.DataFrame({"col": [1, 2, 3]})
custom("my_data", df)

retrieved_df = custom("my_data")
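The store-or-retrieve call pattern can be illustrated with a small stand-in. This is not the library's implementation: `custom_sketch` and `_memory_forest` are hypothetical names, a plain dict plays the role of the backend, and the return value on store and the error on a missing name are assumptions made for the sketch.

```python
from typing import Any, Dict, Optional

# Hypothetical stand-in for a backend: a plain dict keyed by acorn name.
_memory_forest: Dict[str, Any] = {}


def custom_sketch(name: str, data: Optional[Any] = None) -> Any:
    """Store `data` under `name` if given, otherwise return what was stored."""
    if data is not None:
        _memory_forest[name] = data  # store (overwrites an existing entry)
        return data
    if name not in _memory_forest:
        raise KeyError(f"no custom acorn named {name!r}")
    return _memory_forest[name]  # retrieve


rows = {"col": [1, 2, 3]}  # stands in for a DataFrame
custom_sketch("my_data", rows)
retrieved = custom_sketch("my_data")
```

Using one function for both directions keeps the API to a single name per acorn, at the cost that a store and a retrieve differ only by the presence of the second argument.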

Hide all the acorns

We run the following code in a nightly capsule on Code Ocean to hide all of the acorns listed above; custom acorns are not included.

from zombie_squirrel.sync import hide_acorns
hide_acorns()
