Skip to main content

Transform Unstructured Data into Usable Datasets

Project description

alt text

Why

World has plenty of data, but most of it is trapped in formats that are difficult to utilize. We’re talking about messy relational databases, unstructured text, audio, video, even the latent space of LLMs. It's not a goldmine; it's a landfill. And we're spending millions trying to clean it up.

Introducting Cyyrus

DataOps today looks a mix of ClickOps, CryOps and PrayOps. What if it didn't had to? Cyyrus aims to do for datasets, what Terraform did for Infrastructure.

Cyyrus does't make assumptions about your data because we don't have to. It handles it all, in all its messy, unstructured glory.

Sure but doesn't X do this already? The market is saturated with products but these tools don't solve data silos; they create new ones. Cyyrus doesn't bundle an analytics product, charging for transformations and checkpoints, not data storage so it benefits with data movement.

Components

Current tooling around running evaluation, performing finetuning are broken. They are built by optimists, dreamers, and in many cases, brilliant engineers. But they're building tools for a world that doesn't exist - a world where data comes pre-cleaned, perfectly labeled, and ready for AI consumption.

Data is unstructured and messy. These $10/month tools? They're useless for 80% of your data. Sure, the tool costs $10/month. But what about the army of data scientist you need to make your data "tool-ready”.

Cyyrus plans to introduce components which makes existing tools "data-ready". Think react.email for last mile data transformation.

Feedback

We're here to give developers what they really need, not what looks good in a TechCrunch headline. We've been there. We've felt the pain, and yes, we've even built some of those well-intentioned but ultimately inadequate tools ourselves. Now, we're channeling that into building Cyyrus.

Footnote

The current Cyyrus package is experimental and built over the weekend to understand if terraforming data ops is viable. Does our approach resonate with you? ? Or do you think we're completely off base?

Don't hold back - we love to talk, and more importantly, we love to listen. Lessgo.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cyyrus-0.11.0.tar.gz (176.2 kB view details)

Uploaded Source

Built Distribution

cyyrus-0.11.0-py3-none-any.whl (49.9 kB view details)

Uploaded Python 3

File details

Details for the file cyyrus-0.11.0.tar.gz.

File metadata

  • Download URL: cyyrus-0.11.0.tar.gz
  • Upload date:
  • Size: 176.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.5

File hashes

Hashes for cyyrus-0.11.0.tar.gz
Algorithm Hash digest
SHA256 610525cc3a9d485b869cdaf758323b2d9ae7e523bcdf23a1d014a31c42db6d20
MD5 5408401e44ecb765632b916f48ea3676
BLAKE2b-256 778b495a2cc9eb9d6d6f6c7757736f67a4f6630b11c964efdf0ca930c9a3cdd1

See more details on using hashes here.

File details

Details for the file cyyrus-0.11.0-py3-none-any.whl.

File metadata

  • Download URL: cyyrus-0.11.0-py3-none-any.whl
  • Upload date:
  • Size: 49.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.5

File hashes

Hashes for cyyrus-0.11.0-py3-none-any.whl
Algorithm Hash digest
SHA256 7a406c1f6b0d00b55788301abfb9762669d98c4298d887f42c1eb129d11894fd
MD5 796971c24b565d208aec1d870354cec2
BLAKE2b-256 6d31efa4ebfd1ee5dee10abecefdf115bbed8995416fd3f293eee8f18d35896e

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page