Easy access to the DES DB. Enhanced command line SQL interpreter client for DES

easyaccess

easyaccess: an enhanced command line SQL interpreter client for astronomical surveys.

Description

easyaccess is an enhanced command line interpreter and Python package created to facilitate access to astronomical catalogs stored in SQL databases. It provides a custom interface with custom commands and was specifically designed to access data from the Dark Energy Survey Oracle database. Its features include autocompletion of tables, columns, users and commands; simple ways to upload and download tables in CSV, FITS and HDF5 formats; iterators; and search and description of tables, among others. It can easily be extended to other surveys or SQL databases. The package is written entirely in Python and supports custom commands and functionality.

For a short tutorial check here.

DES DR1/DR2 access quickstart

To access the DES public data releases, you first need an account, which you can register yourself here. Once you have login credentials for the public DES data server, you can start easyaccess with:

easyaccess -s desdr

Features

  • Nice output format (using pandas)
  • Very flexible configuration
  • Smart tab autocompletion for commands, table names, column names, and file paths
  • Write output results to CSV, TAB, FITS, or HDF5 files
  • Load tables from CSV, FITS or HDF5 files directly into the DB (memory friendly: uploads can be limited by number of rows or by memory)
  • Intrinsic DB commands to describe tables, schema, quota, and more
  • easyaccess can be imported as a module from Python with a complete Python API
  • Run commands directly from command line
  • Load SQL queries from a file and/or from the editor
  • Show the execution plan of a query if needed
  • Python functions can be run in an inline query

FAQ

We keep a running list of FAQs, which we update regularly; please check here.

Contributing

Please take a look at our Code of Conduct and our contribution guide.

Citation

If you use easyaccess in your research, we encourage you to use this reference https://arxiv.org/abs/1810.02721 or copy/paste this BibTeX:

@ARTICLE{2018arXiv181002721C,
       author = {{Carrasco Kind}, M. and {Drlica-Wagner}, A. and {Koziol}, A.~M.~G. and
        {Petravick}, D.},
        title = "{easyaccess: Enhanced SQL command line interpreter for astronomical surveys}",
      journal = {arXiv e-prints},
     keywords = {Astrophysics - Instrumentation and Methods for Astrophysics},
         year = 2018,
        month = Oct,
          eid = {arXiv:1810.02721},
        pages = {arXiv:1810.02721},
archivePrefix = {arXiv},
       eprint = {1810.02721},
 primaryClass = {astro-ph.IM},
       adsurl = {https://ui.adsabs.harvard.edu/\#abs/2018arXiv181002721C},
      adsnote = {Provided by the SAO/NASA Astrophysics Data System}
}

See also: Carrasco Kind et al., (2019). easyaccess: Enhanced SQL command line interpreter for astronomical surveys. Journal of Open Source Software, 4(33), 1022, https://doi.org/10.21105/joss.01022

Installation

Installing easyaccess can be a little tricky because of the external libraries it requires, in particular the Oracle libraries, which are free to use. If you are primarily interested in using the easyaccess client, we recommend running the Docker image as described below.

Docker

Running easyaccess in Docker is easy. Execute the command below to download and run our published image.

$ docker run -it --rm \
    registry.gitlab.com/des-labs/kubernetes/easyaccess:latest \
    easyaccess -s desdr

Enter username : 
Enter password : 
Connecting to DB ** desdr ** ...
Loading metadata into cache...
     _______      
     \      \      
  // / .    .\    
 // /   .    _\   
// /  .     / // 
\\ \     . / //  
 \\ \_____/ //   
  \\_______//    DARK ENERGY SURVEY
   `-------`     DATA MANAGEMENT

easyaccess 1.4.8-dev. The DESDM Database shell.
_________
DESDR ~> SELECT RA, DEC, MAG_AUTO_G, TILENAME FROM DR2_MAIN sample(0.001) FETCH FIRST 5 ROWS ONLY ;

         RA        DEC  MAG_AUTO_G      TILENAME
1  8.236249 -24.021460   24.450422  DES0032-2415
2  8.084798 -25.715401   26.279263  DES0033-2541
3  8.142266 -35.854926   26.509785  DES0032-3540
4  8.197418 -48.274010   25.243387  DES0030-4831
5  8.107404 -26.313876   24.758778  DES0032-2623

Alternatively, you may build and run the image yourself using the included Dockerfile with the commands:

docker build -t des-easyaccess .
docker run -it --rm des-easyaccess easyaccess -s desdr

Source Installation

easyaccess is based heavily on the Oracle Python client cx_Oracle. You can follow the installation instructions from here. For cx_Oracle to work, you will need the Oracle Instant Client packages, which can be obtained from here.

Make sure these libraries are installed before installing easyaccess. To check, open a Python interpreter and type:

import cx_Oracle

If you have issues, please check the Troubleshooting page or our FAQ page.
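
The import check above can also be scripted, for example in an installation test. The following is a minimal sketch that only reports whether a module can be located; the helper name `module_available` is hypothetical and not part of easyaccess or cx_Oracle. Finding the module does not guarantee that the Oracle client libraries load correctly, so a real `import cx_Oracle` remains the definitive test.

```python
import importlib.util


def module_available(name: str) -> bool:
    """Return True if a top-level module called `name` can be located."""
    return importlib.util.find_spec(name) is not None


if __name__ == "__main__":
    # cx_Oracle is the dependency named in the text above.
    for name in ("cx_Oracle",):
        status = "found" if module_available(name) else "missing"
        print(f"{name}: {status}")
```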

You can clone this repository and install easyaccess with:

python setup.py install

Requirements

  • Oracle Client > 11g.2 (external library, not Python). Check here for instructions on how to install these libraries.
  • cx_Oracle
    • Note that cx_Oracle needs libaio on some Linux systems
    • Note that cx_Oracle needs libbz2 on some Linux systems
  • See the setup.py file for additional requirements.

Usage

For a short tutorial and documentation see here. Note that not all the features are available for public database use.

Interactive interpreter

Assuming that easyaccess is in your path, you can enter the interactive interpreter by calling easyaccess without any command line arguments:

easyaccess

Command line usage

Much of the functionality provided through the interpreter is also available directly from the command line. To see a list of command-line options, use the --help option:

easyaccess --help

Running SQL commands

Once inside the interpreter, run SQL queries by adding a ";" at the end:

DESDB ~> select ... from ... where ... ;

To save the results to a file, add ">" after the end of the query (after ";"), followed by the file name at the end of the line:

DESDB ~> select ... from ... where ... ; > test.fits

The file types supported so far are: .csv, .tab, .fits, and .h5. Any other extension is ignored.
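
The extension rule above can be pictured as a simple lookup. This is an illustrative sketch only, not the easyaccess implementation: the function name `output_format` and the format labels are assumptions, and the sketch matches extensions case-sensitively because the text does not say how case is handled.

```python
import os

# Extensions listed in the text above; the labels on the right are illustrative.
SUPPORTED = {".csv": "csv", ".tab": "tab", ".fits": "fits", ".h5": "hdf5"}


def output_format(filename):
    """Return a format label for `filename`, or None if the extension is unsupported."""
    ext = os.path.splitext(filename)[1]
    return SUPPORTED.get(ext)


if __name__ == "__main__":
    for name in ("test.fits", "catalog.h5", "notes.txt"):
        print(name, "->", output_format(name))
```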

Load tables

To load a table, the file must be in CSV format with the column names in the first row. The table name is taken from the file name, or can be set with the optional argument --tablename:

DESDB ~> load_table <filename> --tablename <mytable> --chunksize <number of rows to read/upload> --memsize <memory in MB to read at a time>

The --chunksize and --memsize options are optional and make it easier to upload big files.
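
To illustrate why reading by number of rows keeps memory use bounded, here is a minimal sketch of chunked CSV reading using only the standard library. It is not how easyaccess implements load_table; `iter_chunks` is a hypothetical helper, and the upload step is left out entirely.

```python
import csv
import io
from itertools import islice


def iter_chunks(handle, chunksize):
    """Yield (header, rows) pairs with at most `chunksize` data rows each."""
    reader = csv.reader(handle)
    header = next(reader, None)  # first row holds the column names
    if header is None:  # empty input: nothing to yield
        return
    while True:
        rows = list(islice(reader, chunksize))
        if not rows:
            break
        yield header, rows


if __name__ == "__main__":
    sample = io.StringIO("RA,DEC\n1.0,2.0\n3.0,4.0\n5.0,6.0\n")
    for header, rows in iter_chunks(sample, chunksize=2):
        # Each chunk could be uploaded here before the next one is read.
        print(header, len(rows))
```

Only one chunk of rows is held in memory at a time, which is the idea behind limiting an upload by row count.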

Load SQL queries

To load SQL queries just run:

DESDB ~> loadsql <filename.sql>

or

DESDB ~> @filename.sql

The query format is the same as in the interpreter: each SQL statement must end with ";", and to write an output file the query must be followed by a standard output redirect, "> outfile".
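
The file format described above (statements ending in ";", optionally followed by "> outfile") can be sketched with a small parser. This is an assumption-laden illustration, not the parser easyaccess uses: `parse_sql_script` is a hypothetical name, and the regular expression is deliberately naive, so it does not handle semicolons inside string literals or SQL comments.

```python
import re

# A statement is everything up to ';', optionally followed on the same line
# by '> outfile'. A '>' inside the query itself is left untouched.
STATEMENT = re.compile(r"(?P<sql>[^;]+);[ \t]*(?:>[ \t]*(?P<out>\S+))?")


def parse_sql_script(text):
    """Return a list of (sql, outfile_or_None) pairs found in `text`."""
    pairs = []
    for match in STATEMENT.finditer(text):
        sql = " ".join(match.group("sql").split())  # collapse whitespace and newlines
        if sql:
            pairs.append((sql, match.group("out")))
    return pairs


if __name__ == "__main__":
    script = "select a from t;\nselect b\nfrom u where b > 1; > out.fits\n"
    for sql, outfile in parse_sql_script(script):
        print(repr(sql), "->", outfile)
```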

Configuration

The configuration file is located at $HOME/.easyaccess/config.ini, but everything can be configured from inside easyaccess. Type:

DESDB ~> help config

to see the meanings of all the options, and:

DESDB ~> config all show

to see the current values. To modify a value, for example the prefetch value:

DESDB ~> config prefetch set 50000

To see a particular option (e.g., timeout):

DESDB ~> config timeout show
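
Because the configuration lives in an INI file, it can also be inspected from Python with the standard configparser module. The sketch below mirrors the show and set operations above on an in-memory sample; the section name, the option values, and the helper names `show_all` and `set_option` are assumptions for illustration and may not match the real file layout.

```python
import configparser

# Hypothetical contents; section and option values are assumed for illustration.
SAMPLE = """
[easyaccess]
prefetch = 30000
timeout = 900
"""


def show_all(parser):
    """Return 'section.option = value' lines for every option."""
    return [
        f"{section}.{option} = {value}"
        for section in parser.sections()
        for option, value in parser.items(section)
    ]


def set_option(parser, option, value):
    """Set `option` in the first section that defines it; return True on success."""
    for section in parser.sections():
        if parser.has_option(section, option):
            parser.set(section, option, str(value))
            return True
    return False


parser = configparser.ConfigParser()
parser.read_string(SAMPLE)
set_option(parser, "prefetch", 50000)
print("\n".join(show_all(parser)))
```

To inspect an actual installation, `parser.read(os.path.expanduser("~/.easyaccess/config.ini"))` could replace `read_string`; `read` silently skips a file that does not exist.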

Architecture

We have included a simplified UML diagram describing the architecture and dependencies of easyaccess. It shows only the methods of each class and the name of the file hosting that class. The main class, easy_or(), inherits the methods of several component classes, making this model flexible and extendable to other surveys or databases. These methods are then converted to command line commands and functions that can be called inside easyaccess. Because some functions are DES specific, we have moved the DES methods into a separate class, DesActions().

easyaccess architecture diagram
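
The pattern described above, where a main class collects methods from several classes and exposes them as commands, can be sketched with Python's standard cmd module. This is a simplified illustration under stated assumptions, not the easyaccess source: the class names `TableActions`, `SurveyActions` and `Shell`, and the commands they define, are all hypothetical.

```python
import cmd
import io


class TableActions:
    """Hypothetical mixin contributing a generic database command."""

    def do_describe(self, arg):
        """describe <table>: print a placeholder description."""
        self.stdout.write(f"describing {arg}\n")


class SurveyActions:
    """Hypothetical mixin holding survey-specific commands."""

    def do_tiles(self, arg):
        """tiles: print a placeholder message."""
        self.stdout.write("listing tiles\n")


class Shell(TableActions, SurveyActions, cmd.Cmd):
    """Main class: inherits every do_* method, which cmd.Cmd exposes as a command."""

    prompt = "DEMO ~> "

    def do_exit(self, arg):
        """exit: leave the shell."""
        return True


if __name__ == "__main__":
    out = io.StringIO()
    shell = Shell(stdout=out)
    shell.onecmd("describe DR2_MAIN")
    shell.onecmd("tiles")
    print(out.getvalue(), end="")
```

Keeping survey-specific commands in their own mixin means a shell for another survey could swap that one class and reuse the rest.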
