Library for obtaining and organizing text data from your Project Folders.

Project description

📝 TextTableScoop

Welcome to TextTableScoop 🌟, a versatile tool for extracting text from files and CSV tables, with a focus on Office files such as Excel and PowerPoint. This project is part of the 'ProjectText' suite, which also includes ProjectTextAgent and ProjectDataBaseQnA.

🚀 Features

  • Specializes in extracting text from various file formats, including Office files.
  • Designed to work on both Windows (via COM) and Linux (via LibreOffice + PyUNO).
  • The current implementation supports only Linux with LibreOffice + PyUNO.
  • Windows support through COM is planned.

📥 Installation

To install TextTableScoop, use the following pip command:

pip3 install git+https://github.com/Flagro/TextTableScoop.git

🛠️ Usage

Run texttablescoop from the bin folder with these arguments:

  1. path: Path to the file or directory to process.
  2. -t or --temp: (Optional) Path to a custom temporary folder.
  3. -p or --project: (Optional) Path to the project folder the file belongs to.
  4. --ignore: (Optional) Comma-separated list of patterns to ignore.
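
The arguments listed above map naturally onto a standard argparse definition. The following is only a minimal sketch of how such a command line could be parsed, not the project's actual implementation: the function names `build_parser` and `split_patterns`, the default values, and the example paths and patterns are all assumptions made for illustration.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the arguments described in the README."""
    parser = argparse.ArgumentParser(prog="texttablescoop")
    parser.add_argument("path", help="Path to the file or directory to process.")
    parser.add_argument("-t", "--temp", default=None,
                        help="Path to a custom temporary folder.")
    parser.add_argument("-p", "--project", default=None,
                        help="Path to the project folder the file belongs to.")
    parser.add_argument("--ignore", default="",
                        help="Comma-separated list of patterns to ignore.")
    return parser


def split_patterns(raw: str) -> list:
    """Split a comma-separated string into patterns, dropping empty entries."""
    return [part.strip() for part in raw.split(",") if part.strip()]


# Example: parse a made-up command line and show the resulting values.
args = build_parser().parse_args(["docs/report.xlsx", "--ignore", "*.tmp, drafts"])
print(args.path, args.temp, args.project, split_patterns(args.ignore))
```

Keeping `--ignore` as a single comma-separated string matches the usage described above; splitting it into a list afterwards is one simple way to make the patterns easy to iterate over.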

🖥️ Example Command

texttablescoop 'path/to/file' --temp 'path/to/temp' --project 'path/to/project' --ignore 'pattern1,pattern2'
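
If the tool needs to be driven from a Python script rather than a shell, the same command can be assembled as an argument list and passed to `subprocess.run`. This is a hedged sketch: the helpers `build_command` and `run_texttablescoop` are hypothetical names, and the code assumes the `texttablescoop` executable is available on your PATH. Only the flags documented above are used.

```python
import shlex
import subprocess


def build_command(path, temp=None, project=None, ignore=None):
    """Assemble the texttablescoop argument list (hypothetical helper)."""
    cmd = ["texttablescoop", path]
    if temp is not None:
        cmd += ["--temp", temp]
    if project is not None:
        cmd += ["--project", project]
    if ignore:
        # The CLI expects the ignore patterns as one comma-separated value.
        cmd += ["--ignore", ",".join(ignore)]
    return cmd


def run_texttablescoop(path, **options):
    """Run the command; assumes the executable is installed and on PATH."""
    return subprocess.run(build_command(path, **options), check=True)


# Show the command that would be executed, without actually running it.
command = build_command(
    "path/to/file",
    temp="path/to/temp",
    project="path/to/project",
    ignore=["pattern1", "pattern2"],
)
print(shlex.join(command))
```

Passing the arguments as a list avoids shell quoting problems with paths that contain spaces.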

🤝 Collaboration & Issues

The project is open for collaboration; check the issues page for ongoing discussions.

Here's how you can contribute:

  1. Fork the Project.
  2. Create your Feature Branch (git checkout -b feature/AmazingFeature).
  3. Commit your Changes (git commit -m 'Add some AmazingFeature').
  4. Push to the Branch (git push origin feature/AmazingFeature).
  5. Open a Pull Request.

Download files

Source Distribution: texttablescoop-0.0.1.tar.gz (7.3 kB)

Built Distribution: texttablescoop-0.0.1-py3-none-any.whl (9.4 kB, Python 3)
