Skip to main content

Community Version of the B2B Antigravity PySpark Framework. Essential utilities for AWS FinOps and Cloud cost optimization.

Project description

🚀 Antigravity Lite (FinOps & AWS Glue Tools)

AWS Financial Auditor and Smart S3 Manager for PySpark Ecosystems


🛑 The Silent AWS Glue Killer: Spark's Catalyst Optimizer

Have you ever wondered why your massive PySpark cluster just hangs for hours, consuming 100% CPU without writing a single byte of data when processing a Wide Dataframe?

Many Data Engineers blame data skew or bad partitioning, panicking and upscaling AWS Glue Worker instances to expensive G.4X or G.8X tiers. But throwing money at RAM is not the solution. The architectural solution is not buying more RAM; it's isolating the math.

📦 Installation

pip install antigravity-lite

🛠 Included Open-Source Tools

1. Smart S3 Renamer (S3Finalizer - Universal API)

Tired of PySpark polluting your Datalake with part-00000... strings and empty _SUCCESS files? S3Finalizer is a native Boto3 utility that scans raw outputs and renames them sequentially and cleanly without breaking cluster concurrency. It works with Apache Spark, AWS Glue DynamicFrames, and standard S3 files seamlessly.

from antigravity_lite.io.s3_finalizer import S3Finalizer

finalizer = S3Finalizer(bucket_name="my-corporate-datalake")

# Automatically re-sequence and format any outputs natively
finalizer.sequence_files(
    s3_prefix="raw_zone/sales/",
    pattern="ENTERPRISE_REPORT_{seq:04d}.parquet",
    starts_with="",      # Optional: Target specific outputs (e.g. "0000_part")
    ends_with=".parquet",# Optional: Ignore non-parquet files
    contains="part"      # Optional: Filter
)
# Magic Output: ENTERPRISE_REPORT_0001.parquet

2. AWS Glue FinOps Auditor (AgAuditor)

Inject this standalone tool to scan your AWS CloudWatch telemetry and compute exactly how many thousands of dollars you are wasting each month on inflated AWS Worker instances just to keep Spark's Catalyst Optimizer from crashing.

from antigravity_lite.auditor.finops import AgAuditor

# Scan the cluster using CloudWatch and AWS APIs
AgAuditor.run_aws_audit(region="us-east-1", dias_analisis=7)

Console Output: It will accurately map your allocated Workers (G.1X, G.4X) against real compressed JVM Heap usage to reveal your exact financial capital leak.

3. S3 Directory Explorer (AgS3DirectoryLister)

¿Cansado de descubrir que AWS S3 es un espacio plano y no posee "carpetas" reales? Listar jerarquías en Boto3 usando la propiedad CommonPrefixes resulta frustrante. AgS3DirectoryLister abstrae todo el dolor de la paginación nativa y te entrega un árbol de "carpetas" lógicas de forma limpia.

from antigravity_lite.io import AgS3DirectoryLister

explorer = AgS3DirectoryLister()
carpetas_hijas = explorer.list_folders("s3://tu-bucket/datalake/bronze/")

# Imprime un cómodo árbol en tu terminal emulando un `ls`
explorer.print_tree("s3://tu-bucket/datalake/bronze/")

4. Multithreaded S3 Smart Copier (AgS3SmartCopier)

Clonar o agrupar enormes Datalakes en S3 usando scripts tradicionales iterativos asfixia tu red y te toma toda la tarde. Además, prender un clúster de Spark solo para "mover datos" es un desperdicio grosero de facturación de AWS. AgS3SmartCopier levanta un enjambre asíncrono en Python puro ThreadPoolExecutor para transferir miles de archivos aplicando filtros matemáticos, a una fracción del tiempo de un script convencional de Boto3.

from antigravity_lite.io import AgS3SmartCopier

copier = AgS3SmartCopier()

# Copia ultra-rápida y masiva sin arrancar Spark
copier.copy_path(
    origin_path="s3://lago-datos/crudo/",
    dest_path="s3://lago-datos/historico/",
    starts_with="SALES_2026",
    ends_with=".parquet",    # Filtros para ignorar basura oculta
    max_workers=10           # Hilos de la CPU disparados en simultáneo
)

💎 Commercial Licensing (Antigravity PRO)

The Lite version can tell you you're burning thousands of dollars... Purchasing the Antigravity PRO Enterprise License actually fixes it.

If your AgAuditor report flags an "⚠️ AST/OOM RISK" or your Heap spikes past 85%, you need the DataFrameChunker mathematical engine (Exclusive to the Pro B2B Edition). The enterprise version intercepts Spark's low-level planner and vertically slices the execution plan using Logarithmic Binary Trees (Tree Reduce) to forcibly truncate the AST Lineage. This drops your memory footprint so drastically that you can process half-a-billion operations on tiny G.1X clusters at zero OutOfMemory risk.

💻 Request a Proof-of-Concept or Live Architecture Demo for B2B deployment by connecting via LinkedIn.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

antigravity_lite-0.1.8.tar.gz (15.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

antigravity_lite-0.1.8-py3-none-any.whl (12.9 kB view details)

Uploaded Python 3

File details

Details for the file antigravity_lite-0.1.8.tar.gz.

File metadata

  • Download URL: antigravity_lite-0.1.8.tar.gz
  • Upload date:
  • Size: 15.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for antigravity_lite-0.1.8.tar.gz
Algorithm Hash digest
SHA256 efaea2b3d60812eabc2c497aae04c494e78b6b1871585df1e4d87b9b24de84c4
MD5 dde7cb00cba03744e8ce8f1b263de6aa
BLAKE2b-256 5d34353d79809fdfb21f9cb782dd87257b2f963fe8616ca5afd1d649f4a7cddc

See more details on using hashes here.

Provenance

The following attestation bundles were made for antigravity_lite-0.1.8.tar.gz:

Publisher: publish-lite.yml on andresvega925/AntigravityFW

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file antigravity_lite-0.1.8-py3-none-any.whl.

File metadata

File hashes

Hashes for antigravity_lite-0.1.8-py3-none-any.whl
Algorithm Hash digest
SHA256 80f0132fb0715193ddbd83d3541e67649c06039b539bbfa1b843343c7a904f08
MD5 f57946df998880de71e01a0cf5e3a828
BLAKE2b-256 375ebd91935c76d54c9191d84cb726e964c58e1785c4ef42404bd2a14cbc8ab5

See more details on using hashes here.

Provenance

The following attestation bundles were made for antigravity_lite-0.1.8-py3-none-any.whl:

Publisher: publish-lite.yml on andresvega925/AntigravityFW

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page