Library for obtaining and organizing text data from your Project Folders.
📝 TextTableScoop
Welcome to TextTableScoop 🌟, a versatile tool designed for extracting text from files and CSV tables, with a particular focus on Office files such as Excel and PowerPoint. This project is part of the 'ProjectText' suite, which also includes ProjectTextAgent and ProjectDataBaseQnA.
🚀 Features
- Specializes in extracting text from various file formats, including Office files.
- Designed to work on both Windows (via COM) and Linux (via LibreOffice + PyUNO).
- The current implementation supports Linux with LibreOffice + PyUNO.
- Windows support via a COM environment is planned, for robust file handling.
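Because the current implementation relies on LibreOffice and PyUNO, it can help to confirm that both are present before running the tool. The following is a minimal sketch and not part of TextTableScoop: the helper name check_libreoffice_env is hypothetical, and it assumes that LibreOffice exposes a soffice or libreoffice executable on PATH and that the PyUNO bridge is importable as the uno module.

```python
import importlib.util
import shutil


def check_libreoffice_env():
    """Report whether LibreOffice and PyUNO appear to be available.

    Hypothetical helper for illustration; not part of TextTableScoop.
    """
    # Assumption: LibreOffice is launched via `soffice` or `libreoffice` on PATH.
    office_binary = shutil.which("soffice") or shutil.which("libreoffice")
    # Assumption: the PyUNO bridge is importable as the top-level module `uno`.
    has_pyuno = importlib.util.find_spec("uno") is not None
    return {"office_binary": office_binary, "has_pyuno": has_pyuno}


status = check_libreoffice_env()
print(status)
```

If office_binary is None or has_pyuno is False, the Linux setup is probably incomplete. On some distributions PyUNO is only importable from the Python interpreter that ships with LibreOffice.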
📥 Installation
To install TextTableScoop, use the following pip command:
pip3 install git+https://github.com/Flagro/TextTableScoop.git
🛠️ Usage
Run texttablescoop from the bin folder with these arguments:
- path: Path to the file or directory to process.
- -t or --temp: (Optional) Path to a custom temporary folder.
- -p or --project: (Optional) Path to the project folder the file belongs to.
- --ignore: (Optional) Comma-separated list of patterns to ignore.
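The pattern syntax accepted by --ignore is not specified here. As an illustration only, the sketch below assumes shell-style wildcards and shows one way a comma-separated value could be split and matched using Python's fnmatch module. The names parse_ignore and is_ignored are hypothetical and are not TextTableScoop functions.

```python
from fnmatch import fnmatch
from pathlib import PurePosixPath


def parse_ignore(value):
    """Split a comma-separated string into a list of non-empty patterns."""
    return [part.strip() for part in value.split(",") if part.strip()]


def is_ignored(path, patterns):
    """Return True if any component of the path matches any pattern.

    Assumption: patterns are shell-style wildcards applied to each path
    component (directory names and the file name) individually.
    """
    parts = PurePosixPath(path).parts
    return any(fnmatch(part, pattern) for part in parts for pattern in patterns)


patterns = parse_ignore("*.tmp, build")
print(is_ignored("project/build/report.xlsx", patterns))
print(is_ignored("project/docs/report.xlsx", patterns))
```

Matching each path component separately means that a directory name such as build excludes everything beneath it. The actual tool may match differently.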
🖥️ Example Command
texttablescoop 'path/to/file' --temp 'path/to/temp' --project 'path/to/project' --ignore 'pattern1,pattern2'
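The same command can be driven from a Python script. This is a minimal sketch under two assumptions that the text above does not confirm: that the texttablescoop executable is reachable on PATH (otherwise substitute the path to the script in the bin folder), and that it writes its result to standard output. The helper names build_command and run_texttablescoop are hypothetical.

```python
import subprocess


def build_command(path, temp=None, project=None, ignore=None):
    """Assemble the texttablescoop argument list from the documented options."""
    command = ["texttablescoop", str(path)]
    if temp is not None:
        command += ["--temp", str(temp)]
    if project is not None:
        command += ["--project", str(project)]
    if ignore:
        # --ignore takes a single comma-separated string of patterns.
        command += ["--ignore", ",".join(ignore)]
    return command


def run_texttablescoop(path, **options):
    """Run the CLI and return its standard output as text.

    Assumption: the executable is on PATH and prints its result to stdout.
    """
    result = subprocess.run(
        build_command(path, **options),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit code
    )
    return result.stdout


print(build_command("path/to/file", temp="path/to/temp", ignore=["pattern1", "pattern2"]))
```

Passing the arguments as a list rather than a single shell string avoids quoting problems with paths that contain spaces.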
🤝 Collaboration & Issues
Open for collaboration; check the issues page for discussions.
Here's how you can contribute:
- Fork the Project.
- Create your Feature Branch (git checkout -b feature/AmazingFeature).
- Commit your Changes (git commit -m 'Add some AmazingFeature').
- Push to the Branch (git push origin feature/AmazingFeature).
- Open a Pull Request.
Source Distribution
texttablescoop-0.0.1.tar.gz (7.3 kB)
Built Distribution
Hashes for texttablescoop-0.0.1-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | e4b9911fd813f1d95205f7b22b056443f902b08795af10e51a9326135bc232a8
MD5 | 1c6388078fccbc192161efe4b08d8581
BLAKE2b-256 | 4d24c258cd692a4403d5abf1aba0864817a08e82c07432aaf4961da0a86665d7
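A downloaded wheel can be checked against the SHA256 digest listed above. The sketch below uses only Python's standard hashlib module. The function names sha256_of and verify are illustrative, and the local file path is whatever location the wheel was saved to.

```python
import hashlib

# SHA256 digest listed for texttablescoop-0.0.1-py3-none-any.whl.
EXPECTED_SHA256 = "e4b9911fd813f1d95205f7b22b056443f902b08795af10e51a9326135bc232a8"


def sha256_of(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest matches the expected value."""
    return sha256_of(path) == expected.lower()
```

For example, verify("texttablescoop-0.0.1-py3-none-any.whl") returns True only if the file in the current directory matches the listed digest.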