Python DataMatrix
An intuitive, Pythonic way to work with tabular data.
Sebastiaan Mathôt
Copyright 2015-2022
https://pydatamatrix.eu/
About
DataMatrix is an intuitive Python library for working with column-based and continuous data. It's a light-weight and easy-to-use alternative to pandas.
datamatrix is also one of the core libraries of OpenSesame, a graphical experiment builder for the social sciences, and Rapunzel, a modern code editor for numerical computing with Python and R.
Ultra-short cheat sheet
    from datamatrix import DataMatrix, io
    # Read a DataMatrix from file
    dm = io.readtxt('data.csv')
    # Create a new DataMatrix
    dm = DataMatrix(length=5)
    # The first two rows
    print(dm[:2])
    # Create a new column and initialize it with the Fibonacci series
    dm.fibonacci = 0, 1, 1, 2, 3
    # Remove 0 and 3 with a simple selection
    dm = (dm.fibonacci > 0) & (dm.fibonacci < 3)
    # Get a list of indices that match certain criteria
    print(dm[(dm.fibonacci > 0) & (dm.fibonacci < 3)])
    # Select 1, 1, and 2 by matching any of the values in a set
    dm = dm.fibonacci == {1, 2}
    # Select all odd numbers with a lambda expression
    dm = dm.fibonacci == (lambda x: x % 2)
    # Change all 1s to -1
    dm.fibonacci[dm.fibonacci == 1] = -1
    # The first two cells from the fibonacci column
    print(dm.fibonacci[:2])
    # Column mean
    print('Mean: %s' % dm.fibonacci.mean)
    # Multiply all fibonacci cells by 2
    dm.fibonacci_times_two = dm.fibonacci * 2
    # Loop through all rows
    for row in dm:
        print(row.fibonacci)  # get the fibonacci cell from the row
    # Loop through all columns
    for colname, col in dm.columns:
        for cell in col:  # Loop through all cells in the column
            print(cell)  # do something with the cell
    # Or just see which columns exist
    print(dm.column_names)
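The cheat sheet reads from data.csv, which is not part of the package. To try that first step, a small comma-separated file can be written with Python's standard csv module. This is only a sketch: the helper name, column names, and values are made up for illustration, and it assumes that io.readtxt treats the first row as column names.

```python
import csv

# Hypothetical example data; the first row holds the column names.
ROWS = [
    ("n", "fibonacci"),
    (0, 0),
    (1, 1),
    (2, 1),
    (3, 2),
    (4, 3),
]


def write_example_csv(path="data.csv"):
    """Write the example rows to a comma-separated text file."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        csv.writer(f).writerows(ROWS)
    return path
```

Calling write_example_csv() creates data.csv in the current directory, after which the io.readtxt('data.csv') line from the cheat sheet has a file to read.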
Dependencies
- Python >= 3.7
Optional:
- numpy and scipy for using the FloatColumn, IntColumn, and SeriesColumn objects
- prettytable for creating a text representation of a DataMatrix (e.g. to print it out)
- openpyxl for reading and writing .xlsx files
- fastnumbers for improved performance
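Because these packages are optional, it can be useful to check which of them are available in the current environment. The following sketch uses only the standard library. The helper name is hypothetical, and it assumes that each package's import name matches the name listed above.

```python
from importlib.util import find_spec

# Optional packages from the list above, mapped to what they enable.
OPTIONAL = {
    "numpy": "FloatColumn, IntColumn, and SeriesColumn objects",
    "scipy": "FloatColumn, IntColumn, and SeriesColumn objects",
    "prettytable": "text representation of a DataMatrix",
    "openpyxl": "reading and writing .xlsx files",
    "fastnumbers": "improved performance",
}


def missing_optional_packages():
    """Return the names of optional packages that cannot be found."""
    return [name for name in OPTIONAL if find_spec(name) is None]


for name in missing_optional_packages():
    print(f"{name} is not installed (used for: {OPTIONAL[name]})")
```

find_spec only looks a top-level module up without importing it, so the check stays cheap even when large packages such as scipy are installed.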
Installation
PyPI
pip install python-datamatrix
Anaconda
conda install datamatrix -c conda-forge
Ubuntu
sudo add-apt-repository ppa:smathot/cogscinl
sudo apt-get update
sudo apt install python3-datamatrix
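After installing, the installed version can be checked from Python. This sketch assumes Python 3.8 or later (for importlib.metadata) and assumes that the distribution is registered as python-datamatrix, as in the pip command above. Packages installed through conda or apt may register a different name. The helper function is hypothetical.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name="python-datamatrix"):
    """Return the installed version of a distribution, or None if absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


# Prints a version string, or None if the distribution is not found.
print(installed_version())
```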
Documentation
See https://pydatamatrix.eu/
License
python-datamatrix is licensed under the GNU General Public License v3.