
Transform Unstructured Data into Usable Datasets

Project description


Why

The world has plenty of data, but most of it is trapped in formats that are difficult to use. We’re talking about messy relational databases, unstructured text, audio, video, even the latent space of LLMs. It's not a goldmine; it's a landfill. And we're spending millions trying to clean it up.

Introducing Cyyrus

DataOps today looks like a mix of ClickOps, CryOps and PrayOps. What if it didn't have to? Cyyrus aims to do for datasets what Terraform did for infrastructure.

Cyyrus doesn't make assumptions about your data because we don't have to. It handles it all, in all its messy, unstructured glory.

Sure, but doesn't X do this already? The market is saturated with products, but these tools don't solve data silos; they create new ones. Cyyrus doesn't bundle an analytics product. It charges for transformations and checkpoints, not data storage, so it benefits from data movement.

Components

Current tooling for running evaluations and performing fine-tuning is broken. These tools are built by optimists, dreamers, and in many cases, brilliant engineers. But they're building tools for a world that doesn't exist - a world where data comes pre-cleaned, perfectly labeled, and ready for AI consumption.

Data is unstructured and messy. These $10/month tools? They're useless for 80% of your data. Sure, the tool costs $10/month. But what about the army of data scientists you need to make your data "tool-ready"?

Cyyrus plans to introduce components that make existing tools "data-ready". Think react.email for last-mile data transformation.

Feedback

We're here to give developers what they really need, not what looks good in a TechCrunch headline. We've been there. We've felt the pain, and yes, we've even built some of those well-intentioned but ultimately inadequate tools ourselves. Now, we're channeling that into building Cyyrus.

Footnote

The current Cyyrus package is experimental and was built over the weekend to understand whether terraforming data ops is viable. Does our approach resonate with you? Or do you think we're completely off base?

Don't hold back - we love to talk, and more importantly, we love to listen. Lessgo.



Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cyyrus-0.10.0.tar.gz (176.2 kB)

Uploaded Source

Built Distribution

cyyrus-0.10.0-py3-none-any.whl (49.9 kB)

Uploaded Python 3

File details

Details for the file cyyrus-0.10.0.tar.gz.

File metadata

  • Download URL: cyyrus-0.10.0.tar.gz
  • Upload date:
  • Size: 176.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.5

File hashes

Hashes for cyyrus-0.10.0.tar.gz
Algorithm Hash digest
SHA256 16e329a4209f3543b0a9168a7d51f60181d542f4b0ab796832e11d5d7e9cda84
MD5 a5bcf4316f99ad9eddef934168c87a82
BLAKE2b-256 c27a8124e64e4cf13116ccc30b0c743e1b0127ce12193ce069b43790a5905a17

See more details on using hashes here.

File details

Details for the file cyyrus-0.10.0-py3-none-any.whl.

File metadata

  • Download URL: cyyrus-0.10.0-py3-none-any.whl
  • Upload date:
  • Size: 49.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.5

File hashes

Hashes for cyyrus-0.10.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b0efaa3587e9fd22f39db4e732edc7f2c74d2bdcc810e32a1ae75ef92b4371c4
MD5 51837805678302a487129b6de6b5a1ef
BLAKE2b-256 535cc6a78253d39739109abec012dcbc30c1da9d581ac83b7106f57f87057b37
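
The SHA256 digests listed above can be checked against a downloaded file before installing. Below is a minimal sketch using Python's standard hashlib module, assuming the files were downloaded into the current directory. The names `EXPECTED`, `sha256_of`, and `verify` are illustrative helpers, not part of Cyyrus.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the file details above.
EXPECTED = {
    "cyyrus-0.10.0.tar.gz": "16e329a4209f3543b0a9168a7d51f60181d542f4b0ab796832e11d5d7e9cda84",
    "cyyrus-0.10.0-py3-none-any.whl": "b0efaa3587e9fd22f39db4e732edc7f2c74d2bdcc810e32a1ae75ef92b4371c4",
}


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path):
    """Hypothetical helper: compare a downloaded file with its published digest."""
    name = Path(path).name
    if name not in EXPECTED:
        raise KeyError(f"no published digest for {name}")
    return sha256_of(path) == EXPECTED[name]


if __name__ == "__main__":
    for name in EXPECTED:
        if Path(name).exists():
            print(name, "OK" if verify(name) else "MISMATCH")
        else:
            print(name, "not found, skipping")
```

A mismatch means the local file differs from the one described on this page, so it is safer to download it again than to install it.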

