Download files from the Cencora secure file transfer site
Purpose
This Python package downloads files from the Cencora (formerly AmerisourceBergen) secure file transfer site for ingest into clinical data systems.
Downloads are performed from the web-based secure site located at https://secure.amerisourcebergen.com/. FTP is not supported. (There are many easier ways to automate FTP-based downloads.)
Requirements
- Python 3.10 or newer
Installation
Use pip to install the medberg package.
pip install medberg
Usage
Establishing a connection
Import the SecureSite class from the medberg module.
from medberg import SecureSite
Initialize a connection to the secure site by providing a username and password.
con = SecureSite(username='yourname', password='yourpassword')
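Hard-coded credentials are easy to leak through version control. One option is to read them from the environment instead. The following is a minimal sketch under stated assumptions: the helper load_credentials and the variable names MEDBERG_USERNAME and MEDBERG_PASSWORD are hypothetical and are not part of medberg.

```python
import os


def load_credentials(env=None):
    """Return SecureSite keyword arguments read from environment variables.

    MEDBERG_USERNAME and MEDBERG_PASSWORD are hypothetical names chosen for
    this sketch; medberg itself does not read them.
    """
    env = os.environ if env is None else env
    try:
        return {
            "username": env["MEDBERG_USERNAME"],
            "password": env["MEDBERG_PASSWORD"],
        }
    except KeyError as exc:
        # Report which variable is missing instead of a bare KeyError.
        raise RuntimeError(f"Missing environment variable: {exc.args[0]}") from exc


# Usage (requires medberg and valid credentials):
# from medberg import SecureSite
# con = SecureSite(**load_credentials())
```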
Reviewing files
The list of available files is retrieved automatically at connection time and
stored in the files attribute of the connection. Each file is represented by
an object comprising a name, file size, and upload date.
print(con.files)
# [File(name=340B037AM1234567890330.TXT, filesize=self.filesize='1.3MB', date='03/30/2025'), ...]
print(con.files[0].name)
# 340B037AM1234567890330.TXT
print(con.files[0].filesize)
# 1.3MB
print(con.files[0].date)
# 2025-03-30 08:13:58
The library will attempt to automatically extract additional metadata from the filename describing account type (e.g., 340B, GPO, WAC), file specification (e.g., 037, 039), and account number.
print(con.files[0].account_type)
# 340B
print(con.files[0].specification)
# 037AM
print(con.files[0].account_number)
# 123456789
If the metadata is not present in the filename, the corresponding property will simply evaluate to None.
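To illustrate how metadata of this kind can be pulled out of a filename, here is a minimal sketch using a regular expression. The pattern is an assumption inferred only from the two example filenames shown in this document; it is not the library's actual parsing logic, and parse_filename is a hypothetical helper.

```python
import re

# Hypothetical pattern inferred from the example filenames in this document;
# the library's real parsing rules may differ.
FILENAME_PATTERN = re.compile(
    r"^(?P<account_type>340B|GPO|WAC)?"    # optional account type prefix
    r"(?P<specification>\d{3}[A-Z]{0,2})"  # e.g. 037AM, 039A
    r"_?(?P<account_number>\d{9})"         # nine-digit account number
    r"_?\d{4}\.TXT$"                       # trailing four digits, then extension
)


def parse_filename(name):
    """Return a dict of metadata; every value is None if nothing matches."""
    match = FILENAME_PATTERN.match(name)
    if match is None:
        return {"account_type": None, "specification": None, "account_number": None}
    # Optional groups that did not participate come back as None.
    return match.groupdict()


print(parse_filename("340B037AM1234567890330.TXT"))
print(parse_filename("039A_012345678_0101.TXT"))
```

In this sketch the second filename has no account type prefix, so its account_type comes back as None, mirroring the behavior described above.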
Downloading files
Any individual file can be downloaded using the get method of the File class.
Optional parameters can be specified for the save directory (save_dir) and
local filename (save_name). If these are omitted, the file will be saved in
the current working directory using the original filename by default.
con.files[0].get(
    save_dir='C:\\Users\\yourname\\Downloads\\',
    save_name='new_filename.txt',
)
Files can also be downloaded using the get_file method of the SecureSite
class. In this case, the file to download must be specified in the first
parameter as either an instance of the File class or a string containing the
filename as it appears on the remote site. The optional save_dir and
save_name parameters are again available as with the File.get method.
# Using a File object
file_to_get = con.files[0]
con.get_file(file_to_get)
# Using a string filename
con.get_file('039A_012345678_0101.TXT')
When a file is downloaded using either of the methods above, the return value will be a pathlib Path object pointing to the local file.
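Because the return value is a pathlib Path, it can be passed straight to standard-library file handling. The sketch below shows one possible follow-up step, moving a download into an archive folder. The helper archive_download is hypothetical, and a temporary file stands in for the result of a real download.

```python
import shutil
import tempfile
from pathlib import Path


def archive_download(downloaded, archive_dir):
    """Move a downloaded file into archive_dir and return its new Path."""
    downloaded = Path(downloaded)
    if not downloaded.is_file() or downloaded.stat().st_size == 0:
        raise ValueError(f"Download looks incomplete: {downloaded}")
    archive_dir = Path(archive_dir)
    archive_dir.mkdir(parents=True, exist_ok=True)
    return Path(shutil.move(str(downloaded), str(archive_dir / downloaded.name)))


# Stand-in for the Path returned by a download: create a small local file.
workdir = Path(tempfile.mkdtemp())
local_path = workdir / "039A_012345678_0101.TXT"
local_path.write_text("example contents\n")

archived = archive_download(local_path, workdir / "archive")
print(archived.name)
```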
Filtering files
The list of files obtained from the server can be filtered using the
match_files() method, which can take any number of arguments in the format
file_property=filter_value. For example, to retrieve all files with account
number 123456789, you can call match_files(account_number="123456789"). The
result will be a list of File objects matching the specified arguments.
con.match_files(account_number="123456789")
# [File(name=340B037AM1234567890330.TXT, filesize=self.filesize='1.3MB', date='03/30/2025'), ...]
Files can be matched on any attribute. When the type of a filter value differs from the type of the file property, the filter value is automatically converted to the property's type. For example, the account number above was filtered using a string (the type in which account_number is stored on the File class), but it can equally be filtered using an integer:
con.match_files(account_number=123456789)
# [File(name=340B037AM1234567890330.TXT, filesize=self.filesize='1.3MB', date='03/30/2025'), ...]
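The conversion described above can be pictured as casting the filter value to the type of the stored attribute before comparing. This is a minimal sketch of that idea, not the library's implementation; coerce_filter_value is a hypothetical helper.

```python
def coerce_filter_value(filter_value, attribute_value):
    """Convert filter_value to the type of attribute_value when they differ."""
    if attribute_value is None or isinstance(filter_value, type(attribute_value)):
        return filter_value
    return type(attribute_value)(filter_value)


# The stored account number is a string, so an integer filter becomes a string.
print(coerce_filter_value(123456789, "123456789") == "123456789")
```

Under this sketch, an integer filter cannot match an account number with a leading zero such as 012345678, because converting the integer to a string drops the zero. Whether medberg behaves the same way is not stated here, so a string filter is the safer choice for such accounts.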
String filter values can contain a wildcard (*) at the beginning or end of
the filter. For example, match_files(specification="039*") will match
"039", "039A", "039AM", etc.
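The wildcard rule can be sketched with plain string methods. This is an illustration of the described behavior only; wildcard_match is a hypothetical helper and the library may implement matching differently.

```python
def wildcard_match(pattern, value):
    """Match value against pattern, allowing * at the start and/or end."""
    leading = pattern.startswith("*")
    trailing = pattern.endswith("*")
    core = pattern.strip("*")
    if leading and trailing:
        return core in value           # *text* matches anywhere
    if leading:
        return value.endswith(core)    # *text matches a suffix
    if trailing:
        return value.startswith(core)  # text* matches a prefix
    return value == pattern            # no wildcard: exact match


for candidate in ("039", "039A", "039AM", "037AM"):
    print(candidate, wildcard_match("039*", candidate))
```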
List and tuple filters will cause a match if any one of the inner values matches. Effectively, this acts as a nested OR filter.
Callables can also be passed to allow for more complex filtering. For example, we can get all files from the current month as follows:
from datetime import datetime
# Capture the current time once so the year and month always agree.
now = datetime.now()
first_of_month = datetime(now.year, now.month, 1)
con.match_files(date=lambda x: x >= first_of_month)
Multiple filter arguments can be passed together to create a more specific filter.
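The list, callable, and multi-argument rules described above combine as OR within a single list or tuple and AND across keyword arguments. The sketch below models that logic on stand-in objects; FakeFile and matches are hypothetical names used for illustration and are not the library's code.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class FakeFile:
    """Stand-in for a medberg File, used only for this sketch."""
    name: str
    account_type: str
    date: datetime


def matches(file, **filters):
    """Return True only if every keyword filter accepts the file (AND)."""
    for attribute, wanted in filters.items():
        value = getattr(file, attribute)
        if callable(wanted):
            ok = wanted(value)                          # custom predicate
        elif isinstance(wanted, (list, tuple)):
            ok = any(value == item for item in wanted)  # nested OR
        else:
            ok = value == wanted                        # plain equality
        if not ok:
            return False
    return True


files = [
    FakeFile("a.TXT", "340B", datetime(2025, 3, 30)),
    FakeFile("b.TXT", "GPO", datetime(2025, 2, 1)),
    FakeFile("c.TXT", "WAC", datetime(2025, 3, 2)),
]
selected = [
    f.name
    for f in files
    if matches(f, account_type=["340B", "GPO"], date=lambda d: d >= datetime(2025, 3, 1))
]
print(selected)  # ['a.TXT']
```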
To get a single file with the most recent upload time that matches a filter or
series of filters, use match_latest_file(). This method takes the same
arguments as the match_files() method.
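Conceptually, picking the latest file is a maximum over the upload date of the filtered results. The sketch below shows that step on stand-in objects. The helper latest is hypothetical, and returning None for an empty result is an assumption of this sketch rather than documented behavior of match_latest_file().

```python
from datetime import datetime
from types import SimpleNamespace


def latest(files):
    """Return the file with the newest date, or None for an empty list."""
    return max(files, key=lambda f: f.date, default=None)


# Stand-ins for File objects; only the attributes used here are modeled.
candidates = [
    SimpleNamespace(name="old.TXT", date=datetime(2025, 3, 1, 8, 0)),
    SimpleNamespace(name="new.TXT", date=datetime(2025, 3, 30, 8, 13, 58)),
]
print(latest(candidates).name)  # new.TXT
```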
Contributing
Pull requests are welcome. Please ensure all code submitted is formatted with Black and tested with pytest. For major changes, please open an issue first to discuss what you would like to change.
When editing the codebase locally, you may install medberg in development mode to use it in REPLs:
pip install -e '.[dev]'
License
This software is licensed under the MIT License.
Disclaimer
This package and its authors are not affiliated with, associated with, authorized by, or endorsed by Cencora, Inc. All names and brands are the property of their respective owners.