faker_pyspark is a PySpark provider for the Faker Python package.
PySpark provider for Faker
faker_pyspark is a provider for the Faker Python package.
Description
faker_pyspark provides PySpark-based fake data for testing purposes. "Fake" in this context really means "random," even though the data may look real. I make no claims about accuracy, so do not use this as real data!
Installation
Install with pip:
pip install faker_pyspark
Add as a provider to your Faker instance:
from faker import Faker
from faker_pyspark import PySparkProvider
fake = Faker()
fake.add_provider(PySparkProvider)
If you already use Faker, you probably know that the conventional setup is:
from faker import Faker
fake = Faker()
PySpark DataFrame and Schema (StructType)
>>> df = fake.pyspark_dataframe()
>>> schema = fake.pyspark_schema()
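As a rough sketch of how the two calls above might be used in a test, the helper below lists the fields of a generated schema. The names `describe_schema` and `demo` are hypothetical and not part of faker_pyspark. The helper relies only on standard PySpark attributes (`StructType.fields`, `StructField.name`, `StructField.nullable`, `DataType.simpleString()`). Whether `pyspark_dataframe()` needs an already running SparkSession is not stated in this description, so the demo creates a local one as a precaution.

```python
def describe_schema(schema):
    """Return a (name, type, nullable) tuple for each field of a StructType."""
    return [
        (field.name, field.dataType.simpleString(), field.nullable)
        for field in schema.fields
    ]


def demo():
    """Generate a fake schema and DataFrame, then print what they contain."""
    # Imports are local so describe_schema can be imported without Spark.
    from faker import Faker
    from faker_pyspark import PySparkProvider
    from pyspark.sql import SparkSession

    # Assumption: a local SparkSession is started first as a precaution.
    spark = (
        SparkSession.builder.master("local[1]")
        .appName("faker-pyspark-demo")
        .getOrCreate()
    )

    fake = Faker()
    fake.add_provider(PySparkProvider)

    # The field names and types are random, so no fixed output is shown here.
    schema = fake.pyspark_schema()
    for name, type_name, nullable in describe_schema(schema):
        print(name, type_name, nullable)

    df = fake.pyspark_dataframe()
    df.printSchema()
    df.show(5)

    spark.stop()
```

Calling `demo()` requires faker, faker_pyspark, and pyspark to be installed, along with a working Java runtime for Spark. `describe_schema` itself has no dependencies and can be reused in assertions about any StructType.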