
Arkadia AI Data Format (AID) - A versatile data serialization format optimized for AI applications.

This project has been archived.

The maintainers of this project have marked this project as archived. No new releases are expected.


ARKADIA AI.DATA-FORMAT

                                   ; i  :J                                      
                               U, .j..fraaM.  nl                                
                            b h.obWMkkWWMMWMCdkvz,k                             
                         ! .mQWM:o hiMoMW v.uaXMdohbi                           
                        hI,MMmaIao.Wo .IMkoh FCMwqoXa                           
                      ,.c.aWdM. d,aToW  .    Mb!. MopfQ.L                       
                       jhj.xoM :k    aCu F: w MpmqMvMMI,I                       
                      bzMhz:W    .Mw . o lYh ai M iMa pM.j                      
                     hzqWWM;    M;o.WMWWMkMX f.a aa bModpo.                     
                     ;tMbbv   xp oJMMWWWWMMMM iv  dLMXakM:T                     
                       mdh        MMWWWWWWWbQLCzurjktvMor                       
                      ,QFw ;M,b .MWWWWWWWMWMWd  xz   M,kd X                     
                      qjMIo IMTW.WWWWWMWWWM.o.I   rpULaMdi.                     
                       .mMM  uoWWWMWWWWWWp qM,,M l M;mMbrI                      
                        f nm  MMW MWWjMuMj  I  o   LbMac                        
                              WWdMWWWW Mv a.b..aauMhMwQf                        
                              MoWWW,WWtjonJMWtoMdoaoMI                          
                              MMMM Mi    xd:Mm tMwo Cr,                         
                             xMMc .otqokWMMMao:oio.                             
                             MW    .   C..MkTIo                                 
                            WW                                                  
                           QWM                                                  
                           WW                                                   
                          uMW                                                   
                          WW                                                    
                          MW

The High-Density, Token-Efficient Data Protocol for Large Language Models. Stop wasting context window on JSON braces. AI.DATA is a unified, schema-first data format designed specifically for AI understanding. It offers up to 25% token savings, faster parsing, and human-readable structure that LLMs love.


📦 Installation

Get started immediately with pip:

pip install arkadia-ai-data-format

🚀 Fast Example

Encoding to AI.DATA:

echo '{ "data": 2}' | aid enc - -c
# Output: <data:number>(2)

Decoding back to JSON:

echo '<data:number>(2)' | aid dec - -f json
# Output: { "data": 2 }
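
To make the mapping concrete, here is a minimal Python sketch that reproduces the output above for flat objects. It is not the package's implementation: encode_flat is a hypothetical helper, it handles only numbers and strings, and the comma-separated layout for multiple fields is an assumption based on the syntax section of this README.

```python
import json


def encode_flat(obj: dict) -> str:
    """Encode a flat JSON object as '<name:type,...>(value,...)'.

    Hypothetical helper for illustration only; not the package's API.
    """
    fields, values = [], []
    for key, value in obj.items():
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, (int, float, str)):
            raise TypeError(f"unsupported value for {key!r}: {value!r}")
        type_name = "string" if isinstance(value, str) else "number"
        fields.append(f"{key}:{type_name}")
        # json.dumps quotes and escapes strings; numbers print unchanged.
        values.append(json.dumps(value))
    return f"<{','.join(fields)}>({','.join(values)})"


print(encode_flat(json.loads('{ "data": 2}')))  # <data:number>(2)
```

The header carries the field names and types once, so the record itself only needs the values.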

⚡ Performance & Token Savings

Why switch? Because every token counts. In the benchmark below, AICD (Arkadia Compressed Data) uses fewer tokens than JSON, AID, and TOON, though JSON remains the fastest to process.

BENCHMARK SUMMARY:


   JSON  █████████████████████░░░░     6921 tok   ░░░░░░░░░░░░░░░░░░░░░░░░░     0.15 ms
   AICD  ████████████████░░░░░░░░░     5416 tok   █████████████████████████     4.40 ms
   AID   ███████████████████░░░░░░     6488 tok   ████████████████████████░     4.29 ms
   TOON  █████████████████████████     8198 tok   █████████████░░░░░░░░░░░░     2.36 ms


   FORMAT     TOKENS       TIME (Total)    AVG TIME/FILE   VS JSON
   ----------------------------------------------------------------------
   AICD       5416             4.40 ms        0.37 ms    -21.7%
   AID        6488             4.29 ms        0.36 ms    -6.3%
   JSON       6921             0.15 ms        0.01 ms    +0.0%
   TOON       8198             2.36 ms        0.20 ms    +18.5%


CONCLUSION: Switching to AICD saves 1505 tokens (21.7%) compared to JSON.
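
The VS JSON column is the relative change in token count against the JSON baseline. The short check below recomputes it from the token counts in the table; vs_json is an illustrative helper name, not part of the package.

```python
def vs_json(tokens: int, json_tokens: int) -> float:
    """Percent change in token count relative to JSON (negative = fewer tokens)."""
    return round((tokens - json_tokens) / json_tokens * 100, 1)


# Token counts copied from the benchmark table above.
counts = {"AICD": 5416, "AID": 6488, "JSON": 6921, "TOON": 8198}

for name, tokens in counts.items():
    print(f"{name:<5} {vs_json(tokens, counts['JSON']):+.1f}%")

# Absolute saving quoted in the conclusion line.
print(counts["JSON"] - counts["AICD"])  # 1505
```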

🛠 CLI Usage

The package ships with a CLI tool, aid, for encoding, decoding, and benchmarking.

   Arkadia AI DATA TOOL
   --------------------------------------------------
   Unified interface for AI Data Format operations.

USAGE:
   aid <command> [flags]

COMMANDS:
   enc             [ENCODE] Convert JSON/YAML/TOON to AI.Data format
   dec             [DECODE] Parse AI.Data format back to JSON
   benchmark       [BENCHMARK] Run performance and token usage tests
   ai-benchmark    [AI] Run AI understanding tests (not implemented yet)

GLOBAL OPTIONS:
   -h, --help       Show this help message
   -v, --version    Show version info

📖 Syntax Specification (Current Version)

This section describes the actual, currently implemented syntax of AI.DATA-FORMAT.

1. Type Definition

A type defines a name and an ordered list of fields. Comments are allowed within the definition to assist the LLM.

User</comment/ ={(23,"A",3) #tag1 #tag2} %[{ id: 4, b: "a", c: 43}]: id:number,
b: string , c:number, >
@Users
<
 @list 
 a: number,
 b: string
>
[
  @size=5
  /example list of values/

  (1,`text`,5)
  (2,`Text can be

multiline
`,5)
  {
    id:3,
    b: "text"
  }
]

Key Rules:

  • The type name (@Name) is optional but recommended.
  • The header <...> defines field names and their order.
  • Comments (/ ... /) are allowed in the header.
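
As a rough illustration of these rules, the sketch below reads a simple header into an ordered list of (name, type) pairs and drops / ... / comments. It is not the package's parser: parse_header is a hypothetical helper, and it assumes comma-separated fields with no nested headers and no attributes such as @list.

```python
import re


def parse_header(header: str) -> list[tuple[str, str]]:
    """Return the ordered (name, type) pairs declared in a '<...>' header.

    Illustrative only: assumes comma-separated 'name:type' fields.
    """
    body = header.strip()
    if not (body.startswith("<") and body.endswith(">")):
        raise ValueError("header must be wrapped in <...>")
    # Remove / ... / comments before splitting into fields.
    body = re.sub(r"/[^/]*/", "", body[1:-1])
    fields = []
    for part in body.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate a trailing comma
        name, _, type_name = part.partition(":")
        fields.append((name.strip(), type_name.strip()))
    return fields


print(parse_header("< id: number, /display name/ name: string >"))
# [('id', 'number'), ('name', 'string')]
```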

2. Data Structures

The format supports compact positional records and explicit named records.

   Structure           Syntax        Description
   ----------------------------------------------------------------------
   Positional Record   (a,b,c)       Must follow the exact order of fields in the type header.
   Named Record        {key:value}   Keys must match field names. No spaces allowed in keys/values.
   List                [ ... ]       Contains positional or named records.
   Multiline Text      `text`        Ends with a line containing only a backtick.
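
The difference between the two record forms is easiest to see by rendering the same row both ways. This is a sketch under assumptions: positional and named are hypothetical helper names, and strings are written with double quotes via json.dumps, which may differ from the package's own quoting.

```python
import json


def positional(fields: list[str], row: dict) -> str:
    """Render values in header order: (v1,v2,...)."""
    return "(" + ",".join(json.dumps(row[name]) for name in fields) + ")"


def named(fields: list[str], row: dict) -> str:
    """Render explicit key:value pairs: {k1:v1,k2:v2,...}."""
    return "{" + ",".join(f"{name}:{json.dumps(row[name])}" for name in fields) + "}"


fields = ["id", "b"]          # order taken from the type header
row = {"b": "text", "id": 3}  # dict order does not matter here

print(positional(fields, row))  # (3,"text")
print(named(fields, row))       # {id:3,b:"text"}
```

The positional form is shorter because the field names live only in the header, which is why it depends on the declared order.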

3. Comments

/ this is a comment /
  • Allowed only inside type definitions.
  • Forbidden in raw data blocks to save space.

4. General Rules

  1. Data must contain no spaces (compactness is the priority).
  2. Schema/Type definitions may contain spaces and comments.
  3. Named fields always use key:value without spaces.
  4. Positional order must exactly match the declared order.

5. Inline Type Usage

You can declare a type and immediately use it:

@User<id:number name:string desc:string>

value:@User(2,"Alice","Hello")
value2:@User(3,"Bob","World")

6. Nested Types

Currently, nested types are allowed as structural definitions:

@User<
  id:string
  name:string
  profile: < level:number, score:number >
>
[
  ("u1","Aga",{level:5,score:82})
  ("u2","Marek",{level:7,score:91})
]
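
One way to read the records above is to pair each positional value with its field name, treating the nested profile value as a named record whose keys must match the nested header. The sketch below does this on plain Python values; the schema representation and the bind helper are assumptions for illustration, not the package's data model.

```python
# Hypothetical in-memory form of the @User header: (name, type) pairs,
# where a nested type is itself a list of (name, type) pairs.
USER = [
    ("id", "string"),
    ("name", "string"),
    ("profile", [("level", "number"), ("score", "number")]),
]


def bind(schema: list, record: tuple) -> dict:
    """Pair a positional record with the field names declared in schema."""
    if len(record) != len(schema):
        raise ValueError(f"expected {len(schema)} values, got {len(record)}")
    out = {}
    for (name, ftype), value in zip(schema, record):
        if isinstance(ftype, list):
            # Nested named record: keys must match the nested field names.
            expected = [n for n, _ in ftype]
            if sorted(value) != sorted(expected):
                raise ValueError(f"{name}: expected keys {expected}")
            value = {n: value[n] for n in expected}
        out[name] = value
    return out


print(bind(USER, ("u1", "Aga", {"level": 5, "score": 82})))
# {'id': 'u1', 'name': 'Aga', 'profile': {'level': 5, 'score': 82}}
```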

🔮 Future / Roadmap

The following features are planned for future releases and are not yet implemented.

  • Modifiers:
      • !required - field must be included.
      • ?empty - field must not be empty.
      • =value - default value.
      • N..M - numeric range validation.

  • Binary Data Types:
      • Hex: ~[hex]1A0F4F~
      • Base64: ~[b64]ADFKDXKZK...~

  • Pointers/References:
      • Reference existing objects by ID: (1, "Alex", *User[2])

📄 License

This project is licensed under the MIT License.


Built by Arkadia AI. Engineering the kernel of distributed intelligence.
