Creates light curve embeddings using ASTROMER
ASTROMER Python library 🔭
ASTROMER is a transformer-based model pre-trained on millions of light curves. ASTROMER can be fine-tuned on specific datasets to create useful representations that improve the performance of novel deep-learning models.
❗ This version of ASTROMER only works on single-band light curves.
Install
```
pip install ASTROMER
```
How to use it
Currently, there are two pre-trained models: `macho` and `atlas`.
To load weights, use:
```python
from ASTROMER.models import SingleBandEncoder

model = SingleBandEncoder()
model = model.from_pretraining('macho')
```
It will automatically download the weights from this public GitHub repository and load them into the `SingleBandEncoder` instance.
Assume you have a list of variable-length (NumPy) light curves:
```python
import numpy as np

samples_collection = [np.array([[5200, 0.3, 0.2],
                                [5300, 0.5, 0.1],
                                [5400, 0.2, 0.3]]),
                      np.array([[4200, 0.3, 0.1],
                                [4300, 0.6, 0.3]])]
```
Light curves are L×3 matrices whose columns are time, magnitude, and magnitude std.
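If your observations come as separate arrays, a quick way to build this layout (the array names here are illustrative, not part of the library):

```python
import numpy as np

# Hypothetical raw arrays for one light curve
times = np.array([5200., 5300., 5400.])
mags  = np.array([0.3, 0.5, 0.2])
stds  = np.array([0.2, 0.1, 0.3])

# Stack into the L×3 [time, magnitude, std] layout ASTROMER expects
light_curve = np.column_stack([times, mags, stds])  # shape (3, 3)
```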
To encode samples, use:
```python
attention_vectors = model.encode(samples_collection,
                                 oids_list=['1', '2'],
                                 batch_size=1,
                                 concatenate=True)
```
where,
- `samples_collection` is a list of NumPy-array light curves
- `oids_list` is a list of light-curve IDs (needed to concatenate 200-length windows)
- `batch_size` specifies the number of samples per forward pass
- when `concatenate=True`, ASTROMER concatenates every 200-length window belonging to the same object ID; the output is then a list of variable-length attention vectors (one per object; see the pooling sketch below)
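Downstream models often need one fixed-size embedding per object. A minimal sketch, assuming each entry of `attention_vectors` is a (length × d_model) matrix for one object ID; the mean-pooling step is an illustrative choice, not part of the ASTROMER API:

```python
import numpy as np

# Collapse each variable-length attention matrix into a single
# fixed-size embedding by averaging over the time dimension.
embeddings = np.stack([np.asarray(vec).mean(axis=0)
                       for vec in attention_vectors])

print(embeddings.shape)  # (n_objects, d_model), e.g. (2, 256)
```

These pooled embeddings can then be fed to any classifier or regressor as ordinary feature vectors.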
Fine-tuning or training from scratch
ASTROMER can be easily trained using the `fit` method. First, create a `SingleBandEncoder` instance:
```python
from ASTROMER.models import SingleBandEncoder

model = SingleBandEncoder(num_layers=2,
                          d_model=256,
                          num_heads=4,
                          dff=128,
                          base=1000,
                          dropout=0.1,
                          maxlen=200)
model.from_pretraining('macho')
```
where,
- `num_layers`: Number of self-attention blocks
- `d_model`: Self-attention block dimension (must be divisible by `num_heads`)
- `num_heads`: Number of heads within the self-attention block
- `dff`: Number of neurons in the fully-connected layer applied after the attention blocks
- `base`: Positional-encoder base (see the formula below)
- `dropout`: Dropout applied to the output of the fully-connected layer
- `maxlen`: Maximum sequence length the encoder processes

Notice you can skip `model.from_pretraining('macho')` to train from scratch.
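For reference, `base` is the constant in the sinusoidal positional encoding. A sketch, assuming ASTROMER follows the standard transformer form (with the observation time playing the role of the position):

$$
PE_{(t,\,2i)} = \sin\!\left(\frac{t}{\text{base}^{2i/d_{\text{model}}}}\right), \qquad
PE_{(t,\,2i+1)} = \cos\!\left(\frac{t}{\text{base}^{2i/d_{\text{model}}}}\right)
$$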
```python
model.fit(train_data,
          validation_data,
          epochs=2,
          patience=20,
          lr=1e-3,
          project_path='./my_folder',
          verbose=0)
```
where,
- `train_data`: Training data already formatted as tf.data
- `validation_data`: Validation data already formatted as tf.data
- `epochs`: Number of epochs for training
- `patience`: Early-stopping patience
- `lr`: Learning rate
- `project_path`: Path for saving weights and training logs
- `verbose`: (0) display information during training, (1) don't
`train_data` and `validation_data` should be loaded using the `load_numpy` or `pretraining_records` functions; both live in the `ASTROMER.preprocessing` module (see the sketch below).
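A minimal sketch of building these datasets from the NumPy collection above. Beyond the positional samples argument, the keyword names (e.g. `batch_size`) are assumptions here, so check the `ASTROMER.preprocessing` docstrings for the exact signatures:

```python
from ASTROMER.preprocessing import load_numpy

# Hypothetical train/validation split of the light-curve list;
# the keyword below is an assumption, not a confirmed signature.
train_data = load_numpy(samples_collection[:1], batch_size=1)
validation_data = load_numpy(samples_collection[1:], batch_size=1)
```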
For large datasets, it is recommended to use TensorFlow Records (see this tutorial to run our data pipeline).
Resources
Contributing to ASTROMER 🤝
If you train your model from scratch, you can share your pre-trained weights by submitting a Pull Request to the weights repository.