Skip to main content

learning common data structure

Project description

DataPro algorithm

The task is learning the common structure of data. The DataPro algorithm is the algorithm that finds statistically significant patterns in a set of token sequences.

This repository is Implemented from the paper "Learning the Common Structure of Data" by Kristina Lerman and Steven Minton.

installation


pip install datapro-learning

Get started

How to use model with this lib:

from datapro import DataPro

import pandas as pd



# Read data file

df = pd.read_excel("street_road.xlsx")

df = df.dropna()



# Choose a column.

data_sample = df["หน่วยรับผิดชอบ"]



# Create datapro object.

s = DataPro(alpha=0.05, k_percentage=10)



# Train with data

s.fit(data_sample)



# show result

print(s.evaluate_score())

Reference

https://www.aaai.org/Papers/AAAI/2000/AAAI00-093.pdf

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datapro-learning-1.0.tar.gz (6.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

datapro_learning-1.0-py3-none-any.whl (7.9 kB view details)

Uploaded Python 3

File details

Details for the file datapro-learning-1.0.tar.gz.

File metadata

  • Download URL: datapro-learning-1.0.tar.gz
  • Upload date:
  • Size: 6.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.8.6

File hashes

Hashes for datapro-learning-1.0.tar.gz
Algorithm Hash digest
SHA256 ed5cca9f1e0dcc756e6d1975b4ce4c5d0e500f55f7ee63135fc29fb96ec139f5
MD5 0c436162dd027b020f475f142e8a5aca
BLAKE2b-256 793bc2349f1f0429340e769d56fbd4284b592e92ef620699d365ef2745258bbb

See more details on using hashes here.

File details

Details for the file datapro_learning-1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for datapro_learning-1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 d04effd3cd62c2c1513ff6eff4ac09aeb8f13293c6db42544b2ac12965286ef3
MD5 5e7d744a7eb55e08a7f5df65d2634c41
BLAKE2b-256 985c77cd15bc11e1c46b6fa44d8f0f2b914a0b1f9d0e8762e7feed31b6399b24

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page