A package to access sciencedata.dk

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

sddk

This is a simple Python package to upload data to- and dowload data from sciencedata.dk. It is especially designed for working with group folders. It relies mainly on Python requests library.

sciencedata.dk is a project managed by DEiC (Danish e-infrastrcture cooperation) aimed to offer a robust data storage, data management and data publication solution for researchers in Denmark and abroad (see docs and dev for more info). The storage is accessible either through (1) the web interface, (2) WebDAV clients or (3) an API relaying on HTTP Protocol (see docs and dev for more info). One of the strength of sciencedata.dk is that it currently supports institutional login from 2976 research and educational institutions around the global (using WAYF). That makes it a perfect tool for international research collaboration.

The main functionality of the package is in uploading any Python object (dict, list, dataframe) as a text or json file to a preselected shared folder and getting it back into a Python environemnt as the original Python object. It uses sciencedata.dk API in combination with Python requests library.

Install and import

So far the package is in a testing PyPi repository here.

To install and import the package within your Python environment (i.e. jupyter notebook) run:

!pip install sddk
import sddk

Configure session and access endpoint for a shared folder

To run the main configuration function below, you have to know the following:

your sciencedata.dk username (e.g. "123456@au.dk" or "kase@zcu.cz"),
your sciencedata.dk password (has to be previously configured in the sciencedata.dk web interface),

In the case you want to access a shared folder, you further need:

name of the shared folder you want to access (e.g. "our_shared_folder"),
username of the owner of the folder (if it is not yours)

(Do not worry, you will be asked to input these values interactively while running the function)

To configure a personal session, run:

s, sddk_url = sddk.configure_session_and_url()

To configure a session pointing to a shared folder, run:

s, sddk_url = sddk.configure_session_and_url("our_shared_folder", "owner_username@au.dk")

Running this function, you configurate two key variables:

s: a request session authorized by your username and password
sddk_url: default url address (endpoint) for your request Below you can inspect how these two are used in typical request commands

Usage

String to TXT

Upload (export) simple text file:

s.put(sddk_url + "testfile.txt", data="textfile content")

Get it back (import) to Python:

string_testfile = ast.literal_eval(s.get(sddk_url + "testfile.txt").text)
print(string_testfile)

Pandas DataFrame to JSON

Upload a dataframe as a json file:

import pandas as pd
df = pd.DataFrame([("a1", "b1", "c1"), ("a2", "b2", "c2")], columns=["a", "b", "c"]) 
s.put(sddk_url + "df.json", data=df.to_json())

Get it back:

df = pd.DataFrame(s.get(sddk_url + "df.json").json())

Pandas DataFrame to CSV

import pandas as pd
df = pd.DataFrame([("a1", "b1", "c1"), ("a2", "b2", "c2")], columns=["a", "b", "c"]) 
df.to_csv("df.csv") ### temporal file
s.put(sddk_url + "df.csv", data = open("df.csv", 'rb'))

Dictionary to JSON

To sciencedata.dk:

dict_object = {"a" : 1, "b" : 2, "c":3 }
s.put(sddk_url + "dict_file.json", data=json.dumps(dict_object))

From sciencedata.dk:

dict_object = json.loads(s.get(sddk_url + "dirgot_data/dict_file.json").content)

Matplotlib figure to PNG

import matplotlib.pyplot as plt
fig = plt.figure()
plt.plot(range(10))
fig.savefig('temp.png', dpi=fig.dpi) ### works even in Google colab
s.put(sddk_url + "temp.png", data = open("temp.png", 'rb'))

Next steps

to develop our own functions for uploading files and getting them back (asking in case of already existing files, etc.:

def file_from_object(file_name_and_loc, python_object):
  if s.get(sciencedata_groupurl + file_name_and_loc).ok: ### if there already is a file with the same name
    new_name = input("file with name \"" + file_name_and_loc.rpartition("/")[2] + "\" already exists in given location. Press Enter to overwrite it or enter a different name (without path)")
    if len(new_name) == 0:
      s.put(sciencedata_groupurl + file_name_and_loc, data=json.dumps(python_object))
    else:
      if "/" in new_name: ### if it is a path
        s.put(sciencedata_groupurl + new_name, data=json.dumps(python_object))
      else: 
        s.put(sciencedata_groupurl + file_name_and_loc.rpartition("/")[0] + new_name, data=json.dumps(python_object))
  else:
    s.put(sciencedata_groupurl + file_name_and_loc, data=json.dumps(python_object))

def object_from_file(file_name_and_loc):
  if s.get(sciencedata_groupurl + file_name_and_loc).ok:
    print("file exists")
    try: 
      return json.loads(s.get(sciencedata_groupurl + file_name_and_loc).content) ### if there already is a file with the same name
    except:
      print("file import failed")
  else:
    print("file does not found; check file name and path.")

The package is built following this tutorial.

Versions history

0.0.6 - first functional configuration
0.0.7 - configuration of individual session by default
0.0.8 - shared folders reading&writing for ordinary users finally functional
0.1.1 - added shared folder owner argument to the main configuration function; migration from test.pypi to real pypi

Project details

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

3.9

Dec 19, 2023

3.8

Oct 31, 2023

3.7

Nov 11, 2022

3.6

Nov 11, 2022

3.5

Nov 11, 2022

3.4

Nov 11, 2022

3.3

Jul 13, 2022

3.2

Apr 27, 2022

3.1

Apr 27, 2022

3.0

May 26, 2021

2.10.1

May 11, 2021

2.10.0

May 11, 2021

2.9.4

Mar 31, 2021

2.9.3

Mar 31, 2021

2.9.2

Mar 31, 2021

2.9.1

Mar 31, 2021

2.9

Mar 31, 2021

2.8.2

Dec 8, 2020

2.8.1

Dec 8, 2020

2.8

Dec 8, 2020

2.7

Sep 18, 2020

2.6

May 21, 2020

2.5

May 21, 2020

2.4

May 20, 2020

2.3

May 20, 2020

2.2

May 12, 2020

2.1

May 12, 2020

2.0

May 12, 2020

1.8

May 6, 2020

1.7

Apr 1, 2020

1.6.1

Feb 27, 2020

1.6

Feb 27, 2020

1.5

Feb 25, 2020

1.4

Feb 4, 2020

1.3.1

Feb 3, 2020

1.3

Feb 3, 2020

1.1

Feb 3, 2020

1.0

Feb 2, 2020

0.1.2

Jan 22, 2020

This version

0.1.1

Jan 18, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sddk-0.1.1.tar.gz (4.4 kB view details)

Uploaded Jan 18, 2020 Source

Built Distribution

sddk-0.1.1-py3-none-any.whl (5.8 kB view details)

Uploaded Jan 18, 2020 Python 3

File details

Details for the file sddk-0.1.1.tar.gz.

File metadata

Download URL: sddk-0.1.1.tar.gz
Upload date: Jan 18, 2020
Size: 4.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/42.0.2 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/3.7.6

File hashes

Hashes for sddk-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`f3f614d626f9ea13114a2e190fd8c0cef3b02c566bce4dcb533b0f1a55b6e6a4`
MD5	`8b40c2a5d3834c0391ff7fd67e592409`
BLAKE2b-256	`c8f22a42f28fa98a17b437b9ea577d5b2a25d3abc2b1d347c2ecd30b21d9fa1e`

See more details on using hashes here.

File details

Details for the file sddk-0.1.1-py3-none-any.whl.

File metadata

Download URL: sddk-0.1.1-py3-none-any.whl
Upload date: Jan 18, 2020
Size: 5.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/42.0.2 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/3.7.6

File hashes

Hashes for sddk-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e4dfe56edba5e36d632a73f990b4fffe89f951d9976a33396c66285f48ffe31b`
MD5	`342a68daa96aabbab3ff0543f64ea1bf`
BLAKE2b-256	`ad1448f4bd3d9d8a6ccd06fc6d325c57ac7f0a88ebc738e2fe8b3fc01d3f184e`

See more details on using hashes here.

sddk 0.1.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

sddk

Install and import

Configure session and access endpoint for a shared folder

Usage

String to TXT

Pandas DataFrame to JSON

Pandas DataFrame to CSV

Dictionary to JSON

Matplotlib figure to PNG

Next steps

Versions history

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes