Skip to main content

A packages that helps you clean and recover columns of Excel data converted from PDF files.

Project description

excelcleaner

What is it?

A package that helps you clean and recover columns of Excel data converted from PDF files.

Where to get it

pip install excelcleaner

Dependencies

  • openpyxl for opening/saving Excel file in Python.
  • tksheet for displaying the content of the Excel file.

Changes

  • Version 0.0.18
    • Added function-key features
      1. F1: delete rows or columns
      2. F2: delete rows having a particular cell
      3. F3: combine rows or columns
      4. F4: combine selected rows
      5. F5: insert a cell to row and repeat the process to other rows in the same pattern
      6. F7: delete a cell to row and repeat the process to other rows in the same pattern
      7. F10: split a column to two columns with a particular string
    • Added an option to save processes to a file
      • The processes saved in the file can be executed to speed the process until wanted cleaning is completed.
    • Clicking a row provides the rows having the same pattern of the cells in row

  • Version 0.0.10
    • Restoring just the data before the last action is extended to data of all actions since the first action.
    • Changed function keys for actions.
    • Screen displays the selected row where the next action is needed.

  • Version 0.0.9
    • A tiny bug of renaming a cleaned xlsx file was fixed.

  • Version 0.0.7
    • Added some help:
      1. When a column is all None, it will be deleted without any notice.
      2. Rows in a pattern that takes place at the least number of rows will be highlighted.
      3. The first row of the rows will be selected and is ready for next action. This selection is only a hint and should not be followed unless otherwise.

  • Version 0.0.5
    • Disabled some bindings that belong to tksheet package.

  • Version 0.0.4
  • import excelcleaner as xl
    
    xlpath='(excel file name with extension)'
    xl.sheet(xlpath)
    
    • A tksheet window will be opened with the contents of excel file.
    • Here are recommending orders of actions to try:
      1. Remove unwanted rows (F1) first
      2. Remove unwanted columns (F1) or combine columns (F3)
      3. Click a cell to insert (F5) or delete the cell (F7)

    • If the same pattern or kind of selected rows/columns is found at other locations, the same action is repeated for the found rows/columns.
      • Thus, it would better try one action at the top
      • To remove rows, click and drag the row names.
      • To remove or combine columns, click and drag the column names.

    • When rows or columns are to be combined, the cells of the same rows or columns are combined.

    • After every action, the contents will be saved to a file that is the original name with a string "_cleaned".
    • When a cell is inserted, all values to the right are moved to the right, and an empty cell is added at the end of all other unchanged rows.
    • When a cell is deleted, all the values ​​on the right are moved to the left and the last column is empty.

    • If one action is done by mistake, press comination key (control-z) to restore the the data before he last action

    • After each action is completed,
      • the number of columns and rows and the number of patterns of rows will be displayed
      • either a few columns or muliple rows are highlighted, which are indicative of ok to be deleted.

    Project details


    Download files

    Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

    Source Distribution

    excelcleaner-0.0.18.tar.gz (28.0 kB view hashes)

    Uploaded Source

    Built Distribution

    excelcleaner-0.0.18-py3-none-any.whl (9.9 kB view hashes)

    Uploaded Python 3

    Supported by

    AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page