No project description provided
Project description
Giza Datasets
Welcome to the Giza Datasets repository. Here you can find a collection of datasets ready to be used for blockchain ML use cases. Familiarize yourself with the ease of using dataframes through our DatasetsLoader
class.
Before discovering how our library works, if you want to find detailed information about each dataset provided by Giza, access our documentation! You will find usage examples for each dataset, the schema of each one with descriptions of every field, the relationship between the datasets, potential use cases for them, and much more!
Enhanced Features
Explore the robust capabilities of the Giza Datasets repository:
- Streamlined Dataset Access: Instantly connect to a curated collection of blockchain datasets, ready for machine learning applications, with no configuration needed.
- Effortless Data Loading: Utilize the
DatasetsLoader
class to easily load Parquet files, streamlining your data workflow. - Optimized Data Handling: Leverage the integration with the polars library, designed for efficient manipulation of large datasets. For detailed guidance on using polars for dataset operations, refer to the polars documentation.
Quick Start
To get started with Giza Datasets, follow the steps below:
-
Install the
giza-datasets
package if you haven't already:pip install giza-datasets
-
Import the
DatasetsLoader
class and initialize it:from giza.datasets import DatasetsLoader loader = DatasetsLoader()
-
Optional: Depending on your device's configuration, it may be necessary to provide SSL certificates to verify the authenticity of HTTPS connections. You can ensure that all these certifications are correct by executing the following line of code:
import certifi import os os.environ['SSL_CERT_FILE'] = certifi.where()
-
Load a dataset using the
load
method. For example, to loadtvl-fee-per-protocol
:df = loader.load('tvl-fee-per-protocol')
-
To view the loaded dataset, simply print the dataframe:
print(df)
Start exploring the datasets and building your machine learning models with ease!
Datasets Hub
The DatasetsHub
class provides methods to manage and access datasets. Here are some of its methods:
show()
: Prints a table of all datasets in the hub.list()
: Returns a list of all datasets in the hub.get(dataset_name)
: Returns a Dataset object with the given name.describe(dataset_name)
: Prints a table of details for the given dataset.
To get started with the DatasetsHub
class, follow the steps below:
- Import the
DatasetsHub
class and initialize it:from giza.datasets import DatasetsHub hub = DatasetsHub()
- Use the
show
method to print a table of all datasets in the hub:hub.show()
- Use the
list
method to get a list of all datasets in the hub:datasets = hub.list() print(datasets)
- Use the
get
method to get a Dataset object with a given name:dataset = hub.get('tvl-fee-per-protocol') print(dataset)
- Use the
describe
method to print a table of details for a given dataset:hub.describe('tvl-fee-per-protocol')
- Use the
list_tags
method to print a list of all tags in the hub.hub.list_tags()
- Use the
get_by_tag
method to a list of Dataset objects with the given tag.hub.get_by_tag('Liquidity')
Contributing
We welcome contributions to the Giza Datasets repository. If you have suggestions for improvements or new features, feel free to open an issue or submit a pull request.
License
This project is licensed under the MIT License - see the LICENSE file for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file giza_datasets-0.3.1.tar.gz
.
File metadata
- Download URL: giza_datasets-0.3.1.tar.gz
- Upload date:
- Size: 18.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.12.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 22f25570c93d9dd474e2bace446a3ae5fbf540f55de8ee3faa075637a5c393bf |
|
MD5 | 82719469cedc414bd887c9cca4fc1941 |
|
BLAKE2b-256 | 569c1b7f30289ee578b899fae478ff03bae10b532330c89fb3c2be4e9370c6d3 |
File details
Details for the file giza_datasets-0.3.1-py3-none-any.whl
.
File metadata
- Download URL: giza_datasets-0.3.1-py3-none-any.whl
- Upload date:
- Size: 18.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.12.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5f8ade407e63d9e16f8f41606bd0b6720b787fd2c0ed85d205b58d8f59afec4f |
|
MD5 | 4aa45ec56938ee718d89bdc259d8fb2f |
|
BLAKE2b-256 | 3939f53250450729aa0313862aa9024eec71bc3325a77bdae8cf3ad7f4f0c6e9 |