A packages that helps you clean and recover columns of Excel data converted from PDF files.
Project description
excelcleaner
What is it?
A package that helps you clean and recover columns of Excel data converted from PDF files.
Where to get it
pip install excelcleaner
Dependencies
- openpyxl for opening/saving Excel file in Python.
- tksheet for displaying the content of the Excel file.
Changes
- Version 0.0.18
- Added function-key features
- F1: delete rows or columns
- F2: delete rows having a particular cell
- F3: combine rows or columns
- F4: combine selected rows
- F5: insert a cell to row and repeat the process to other rows in the same pattern
- F7: delete a cell to row and repeat the process to other rows in the same pattern
- F10: split a column to two columns with a particular string
- Added an option to save processes to a file
- The processes saved in the file can be executed to speed the process until wanted cleaning is completed.
- Clicking a row provides the rows having the same pattern of the cells in row
- Restoring just the data before the last action is extended to data of all actions since the first action.
- Changed function keys for actions.
- Screen displays the selected row where the next action is needed.
- A tiny bug of renaming a cleaned xlsx file was fixed.
- Added some help:
- When a column is all None, it will be deleted without any notice.
- Rows in a pattern that takes place at the least number of rows will be highlighted.
- The first row of the rows will be selected and is ready for next action. This selection is only a hint and should not be followed unless otherwise.
- Disabled some bindings that belong to tksheet package.
import excelcleaner as xl
xlpath='(excel file name with extension)'
xl.sheet(xlpath)
- A tksheet window will be opened with the contents of excel file.
- Here are recommending orders of actions to try:
- Remove unwanted rows (F1) first
- Remove unwanted columns (F1) or combine columns (F3)
- Click a cell to insert (F5) or delete the cell (F7)
- If the same pattern or kind of selected rows/columns is found at other locations, the same action is repeated for the found rows/columns.
- Thus, it would better try one action at the top
- To remove rows, click and drag the row names.
- To remove or combine columns, click and drag the column names.
- When rows or columns are to be combined, the cells of the same rows or columns are combined.
- After every action, the contents will be saved to a file that is the original name with a string "_cleaned".
- When a cell is inserted, all values to the right are moved to the right, and an empty cell is added at the end of all other unchanged rows.
- When a cell is deleted, all the values on the right are moved to the left and the last column is empty.
- If one action is done by mistake, press comination key (control-z) to restore the the data before he last action
- After each action is completed,
- the number of columns and rows and the number of patterns of rows will be displayed
- either a few columns or muliple rows are highlighted, which are indicative of ok to be deleted.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
excelcleaner-0.0.18.tar.gz
(28.0 kB
view details)
Built Distribution
File details
Details for the file excelcleaner-0.0.18.tar.gz
.
File metadata
- Download URL: excelcleaner-0.0.18.tar.gz
- Upload date:
- Size: 28.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ea81b3aeb3203530798163e7e190d2cf626406f4c52e6e0c2a92e602299fb5d7 |
|
MD5 | 0179ae1119297f049153c8f076202f5f |
|
BLAKE2b-256 | dabbc1d1f8fd6774143cc9b532cf23955677e0b8c76ce6951a30d285754d78ee |
File details
Details for the file excelcleaner-0.0.18-py3-none-any.whl
.
File metadata
- Download URL: excelcleaner-0.0.18-py3-none-any.whl
- Upload date:
- Size: 9.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 44c62f90a5c43d760cf7972f203025b9f3eaf0bab5cbfb1b9158417363e6044c |
|
MD5 | 44927ff42500619be863a264055e5769 |
|
BLAKE2b-256 | 65f5ff2722927d7502aba725fd7b4daf36590c1b12132741cdb5f6021df55e7d |