Transform Unstructured Data into Usable Datasets
Project description
Why
World has plenty of data, but most of it is trapped in formats that are difficult to utilize. We’re talking about messy relational databases, unstructured text, audio, video, even the latent space of LLMs. It's not a goldmine; it's a landfill. And we're spending millions trying to clean it up.
Introducting Cyyrus
DataOps today looks a mix of ClickOps, CryOps and PrayOps. What if it didn't had to? Cyyrus aims to do for datasets, what Terraform did for Infrastructure.
Cyyrus does't make assumptions about your data because we don't have to. It handles it all, in all its messy, unstructured glory.
Sure but doesn't X do this already? The market is saturated with products but these tools don't solve data silos; they create new ones. Cyyrus doesn't bundle an analytics product, charging for transformations and checkpoints, not data storage so it benefits with data movement.
Components
Current tooling around running evaluation, performing finetuning are broken. They are built by optimists, dreamers, and in many cases, brilliant engineers. But they're building tools for a world that doesn't exist - a world where data comes pre-cleaned, perfectly labeled, and ready for AI consumption.
Data is unstructured and messy. These $10/month tools? They're useless for 80% of your data. Sure, the tool costs $10/month. But what about the army of data scientist you need to make your data "tool-ready”.
Cyyrus plans to introduce components which makes existing tools "data-ready". Think react.email
for last mile data transformation.
Feedback
We're here to give developers what they really need, not what looks good in a TechCrunch headline. We've been there. We've felt the pain, and yes, we've even built some of those well-intentioned but ultimately inadequate tools ourselves. Now, we're channeling that into building Cyyrus.
Footnote
The current Cyyrus package is experimental and built over the weekend to understand if terraforming data ops is viable. Does our approach resonate with you? ? Or do you think we're completely off base?
Don't hold back - we love to talk, and more importantly, we love to listen. Lessgo.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file cyyrus-0.10.0.tar.gz
.
File metadata
- Download URL: cyyrus-0.10.0.tar.gz
- Upload date:
- Size: 176.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.0 CPython/3.12.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 16e329a4209f3543b0a9168a7d51f60181d542f4b0ab796832e11d5d7e9cda84 |
|
MD5 | a5bcf4316f99ad9eddef934168c87a82 |
|
BLAKE2b-256 | c27a8124e64e4cf13116ccc30b0c743e1b0127ce12193ce069b43790a5905a17 |
File details
Details for the file cyyrus-0.10.0-py3-none-any.whl
.
File metadata
- Download URL: cyyrus-0.10.0-py3-none-any.whl
- Upload date:
- Size: 49.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.0 CPython/3.12.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b0efaa3587e9fd22f39db4e732edc7f2c74d2bdcc810e32a1ae75ef92b4371c4 |
|
MD5 | 51837805678302a487129b6de6b5a1ef |
|
BLAKE2b-256 | 535cc6a78253d39739109abec012dcbc30c1da9d581ac83b7106f57f87057b37 |