Skip to main content

A packages that helps you clean and recover columns of Excel data converted from PDF files.

Project description

excelcleaner

What is it?

A package that helps you clean and recover columns of Excel data converted from PDF files.

Where to get it

pip install excelcleaner

Dependencies

  • openpyxl for opening/saving Excel file in Python.
  • tksheet for displaying the content of the Excel file.

Changes

  • Version 0.0.18
    • Added function-key features
      1. F1: delete rows or columns
      2. F2: delete rows having a particular cell
      3. F3: combine rows or columns
      4. F4: combine selected rows
      5. F5: insert a cell to row and repeat the process to other rows in the same pattern
      6. F7: delete a cell to row and repeat the process to other rows in the same pattern
      7. F10: split a column to two columns with a particular string
    • Added an option to save processes to a file
      • The processes saved in the file can be executed to speed the process until wanted cleaning is completed.
    • Clicking a row provides the rows having the same pattern of the cells in row

  • Version 0.0.10
    • Restoring just the data before the last action is extended to data of all actions since the first action.
    • Changed function keys for actions.
    • Screen displays the selected row where the next action is needed.

  • Version 0.0.9
    • A tiny bug of renaming a cleaned xlsx file was fixed.

  • Version 0.0.7
    • Added some help:
      1. When a column is all None, it will be deleted without any notice.
      2. Rows in a pattern that takes place at the least number of rows will be highlighted.
      3. The first row of the rows will be selected and is ready for next action. This selection is only a hint and should not be followed unless otherwise.

  • Version 0.0.5
    • Disabled some bindings that belong to tksheet package.

  • Version 0.0.4
  • import excelcleaner as xl
    
    xlpath='(excel file name with extension)'
    xl.sheet(xlpath)
    
    • A tksheet window will be opened with the contents of excel file.
    • Here are recommending orders of actions to try:
      1. Remove unwanted rows (F1) first
      2. Remove unwanted columns (F1) or combine columns (F3)
      3. Click a cell to insert (F5) or delete the cell (F7)

    • If the same pattern or kind of selected rows/columns is found at other locations, the same action is repeated for the found rows/columns.
      • Thus, it would better try one action at the top
      • To remove rows, click and drag the row names.
      • To remove or combine columns, click and drag the column names.

    • When rows or columns are to be combined, the cells of the same rows or columns are combined.

    • After every action, the contents will be saved to a file that is the original name with a string "_cleaned".
    • When a cell is inserted, all values to the right are moved to the right, and an empty cell is added at the end of all other unchanged rows.
    • When a cell is deleted, all the values ​​on the right are moved to the left and the last column is empty.

    • If one action is done by mistake, press comination key (control-z) to restore the the data before he last action

    • After each action is completed,
      • the number of columns and rows and the number of patterns of rows will be displayed
      • either a few columns or muliple rows are highlighted, which are indicative of ok to be deleted.

    Project details


    Download files

    Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

    Source Distribution

    excelcleaner-0.0.18.tar.gz (28.0 kB view details)

    Uploaded Source

    Built Distribution

    excelcleaner-0.0.18-py3-none-any.whl (9.9 kB view details)

    Uploaded Python 3

    File details

    Details for the file excelcleaner-0.0.18.tar.gz.

    File metadata

    • Download URL: excelcleaner-0.0.18.tar.gz
    • Upload date:
    • Size: 28.0 kB
    • Tags: Source
    • Uploaded using Trusted Publishing? No
    • Uploaded via: twine/4.0.2 CPython/3.11.0

    File hashes

    Hashes for excelcleaner-0.0.18.tar.gz
    Algorithm Hash digest
    SHA256 ea81b3aeb3203530798163e7e190d2cf626406f4c52e6e0c2a92e602299fb5d7
    MD5 0179ae1119297f049153c8f076202f5f
    BLAKE2b-256 dabbc1d1f8fd6774143cc9b532cf23955677e0b8c76ce6951a30d285754d78ee

    See more details on using hashes here.

    File details

    Details for the file excelcleaner-0.0.18-py3-none-any.whl.

    File metadata

    File hashes

    Hashes for excelcleaner-0.0.18-py3-none-any.whl
    Algorithm Hash digest
    SHA256 44c62f90a5c43d760cf7972f203025b9f3eaf0bab5cbfb1b9158417363e6044c
    MD5 44927ff42500619be863a264055e5769
    BLAKE2b-256 65f5ff2722927d7502aba725fd7b4daf36590c1b12132741cdb5f6021df55e7d

    See more details on using hashes here.

    Supported by

    AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page