A library for obtaining and organizing text data from your project folders.

📝 TextTableScoop

Welcome to TextTableScoop 🌟, a tool for extracting text from files and CSV tables, with a particular focus on Office files such as Excel and PowerPoint. It is part of the 'ProjectText' suite, which also includes ProjectTextAgent and ProjectDataBaseQnA.

🚀 Features

  • Extracts text from a variety of file formats, including Office files.
  • Designed to run on both Windows (via COM) and Linux (via LibreOffice + PyUNO).
  • The current implementation supports Linux with LibreOffice + PyUNO.
  • Windows support through a COM environment is planned, for more robust file handling.
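
Because the backend depends on the platform, it can help to check what a machine offers before running an extraction. The sketch below is not part of TextTableScoop. The function name `detect_backend` and the backend labels are hypothetical, and it assumes that PyUNO is importable as the `uno` module and that LibreOffice is on PATH as `soffice` or `libreoffice`.

```python
import importlib.util
import platform
import shutil


def detect_backend():
    """Report which backend this machine could plausibly use (illustrative only)."""
    system = platform.system()
    # PyUNO ships with LibreOffice and is normally importable as "uno".
    has_uno = importlib.util.find_spec("uno") is not None
    # Assumption: the LibreOffice launcher is on PATH under one of these names.
    has_soffice = any(shutil.which(name) for name in ("soffice", "libreoffice"))

    if system == "Linux" and has_uno and has_soffice:
        backend = "libreoffice-pyuno"  # the currently supported combination
    elif system == "Windows":
        backend = "com (planned)"  # planned according to the feature list
    else:
        backend = None
    return {"system": system, "uno": has_uno, "soffice": has_soffice, "backend": backend}


print(detect_backend())
```

A `backend` of `None` means the Linux prerequisites were not found under these assumptions; it does not prove that the tool cannot run.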

📥 Installation

To install TextTableScoop, use the following pip command:

pip3 install git+https://github.com/Flagro/TextTableScoop.git

🛠️ Usage

Run texttablescoop from the bin folder with these arguments:

  1. path: Path to the file or directory to process.
  2. -t or --temp: (Optional) Path to a custom temporary folder.
  3. -p or --project: (Optional) Path to the project folder the file belongs to.
  4. --ignore: (Optional) Comma-separated list of patterns to ignore.
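
For readers who want to see how these arguments fit together, here is a minimal `argparse` sketch that mirrors the list above. It is an illustration, not the project's actual parser: the helper names `build_parser` and `split_patterns`, the defaults, and the whitespace trimming of patterns are all assumptions.

```python
import argparse


def build_parser():
    """Build a parser with the four documented arguments (illustrative only)."""
    parser = argparse.ArgumentParser(prog="texttablescoop")
    parser.add_argument("path", help="Path to the file or directory to process.")
    parser.add_argument("-t", "--temp", default=None,
                        help="Path to a custom temporary folder.")
    parser.add_argument("-p", "--project", default=None,
                        help="Path to the project folder the file belongs to.")
    parser.add_argument("--ignore", default="",
                        help="Comma-separated list of patterns to ignore.")
    return parser


def split_patterns(raw):
    """Split a comma-separated string into patterns, dropping empty entries."""
    return [part.strip() for part in raw.split(",") if part.strip()]


# Hypothetical invocation; the file name and patterns are placeholders.
args = build_parser().parse_args(
    ["docs/report.xlsx", "-t", "/tmp/scoop", "--ignore", "*.tmp, drafts"]
)
print(args.path, args.temp, args.project, split_patterns(args.ignore))
```

How the real tool interprets each pattern (glob, regular expression, or plain substring) is not stated here, so the sketch only splits the list.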

🖥️ Example Command

texttablescoop 'path/to/file' --temp 'path/to/temp' --project 'path/to/project' --ignore 'pattern1,pattern2'
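
The same command can be driven from a Python script. The sketch below only assembles and optionally runs the documented command line; `build_command` and `run_scoop` are hypothetical helpers, and nothing is assumed about what the tool prints or returns.

```python
import shlex
import shutil
import subprocess


def build_command(path, temp=None, project=None, ignore=()):
    """Assemble the documented texttablescoop arguments as an argv list."""
    cmd = ["texttablescoop", str(path)]
    if temp is not None:
        cmd += ["--temp", str(temp)]
    if project is not None:
        cmd += ["--project", str(project)]
    if ignore:
        cmd += ["--ignore", ",".join(ignore)]
    return cmd


def run_scoop(cmd):
    """Run the command if the executable is on PATH; return None otherwise."""
    if shutil.which(cmd[0]) is None:
        return None
    return subprocess.run(cmd, capture_output=True, text=True, check=False)


cmd = build_command(
    "path/to/file",
    temp="path/to/temp",
    project="path/to/project",
    ignore=["pattern1", "pattern2"],
)
print(shlex.join(cmd))  # shell-quoted form of the command, for logging
```

Passing the arguments as a list avoids shell quoting problems with paths that contain spaces. Call `run_scoop(cmd)` once the placeholder paths are replaced with real ones.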

🤝 Collaboration & Issues

Open for collaboration; check the issues page for discussions.

Here's how you can contribute:

  1. Fork the Project.
  2. Create your Feature Branch (git checkout -b feature/AmazingFeature).
  3. Commit your Changes (git commit -m 'Add some AmazingFeature').
  4. Push to the Branch (git push origin feature/AmazingFeature).
  5. Open a Pull Request.

Download files

Source Distribution

texttablescoop-0.0.1.tar.gz (7.3 kB)

Uploaded Source

Built Distribution

texttablescoop-0.0.1-py3-none-any.whl (9.4 kB)

Uploaded Python 3

File details

Details for the file texttablescoop-0.0.1.tar.gz.

File metadata

  • Download URL: texttablescoop-0.0.1.tar.gz
  • Upload date:
  • Size: 7.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.9.18

File hashes

Hashes for texttablescoop-0.0.1.tar.gz
Algorithm    Hash digest
SHA256       7b6b1d11a71f3c27357e5d74c397a6145ec9e7b4b378d7eb1a5d32b57aaa243f
MD5          b6400c628a2ac0d38fb1a0db647744db
BLAKE2b-256  6f1dca30f5bcbdce715d813985a09212c7a077f27fbe52542c06334d44fc1a9a
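
A downloaded archive can be checked against the published SHA256 digest with Python's standard `hashlib` module. This is a minimal sketch: the helper names `sha256_of` and `verify` are hypothetical, and it assumes the archive sits in the current directory under its published file name.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed above for texttablescoop-0.0.1.tar.gz.
EXPECTED_SHA256 = "7b6b1d11a71f3c27357e5d74c397a6145ec9e7b4b378d7eb1a5d32b57aaa243f"


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected=EXPECTED_SHA256):
    """Return True if the file's digest matches the expected hex string."""
    return sha256_of(path) == expected.lower()


archive = Path("texttablescoop-0.0.1.tar.gz")  # assumed download location
if archive.exists():
    print("OK" if verify(archive) else "MISMATCH")
```

The wheel can be checked the same way by passing its own SHA256 digest as `expected`.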

File details

Details for the file texttablescoop-0.0.1-py3-none-any.whl.

File hashes

Hashes for texttablescoop-0.0.1-py3-none-any.whl
Algorithm    Hash digest
SHA256       e4b9911fd813f1d95205f7b22b056443f902b08795af10e51a9326135bc232a8
MD5          1c6388078fccbc192161efe4b08d8581
BLAKE2b-256  4d24c258cd692a4403d5abf1aba0864817a08e82c07432aaf4961da0a86665d7
