Sherlock-lit helps you get everything an Abstract hides with a fast technical description card (Research questions, contribution, possible future works) of an NLP paper before reading it.
Project description
Sherlock-lit
Sherlock-lit is a Python package (part of the tools used in haut platform) designed to process NLP research papers (Later, on more general papers) and generate precise descriptions of the papers in markdown (MD) card format. It streamlines understanding and documentation of research papers, making it an invaluable tool for researchers, students, and developers in NLP field to discover easily the contributions and research questions researchers tackle without being lost in papers or being doubtful after reading the abstract.
Features
- Markdown Card Generation: Automatically generates concise and precise descriptions of NLP papers in markdown format.
- Efficient Processing: Really fast on CPUs and also can be further improved with GPU.
- Small Size: less than 70 MB.
- First-Time Setup: The first run may be slower due to setup, but subsequent runs are significantly faster.
- Simple Command-Line Interface: Process papers by simply providing the path to a PDF file or a folder containing PDFs files.
Installation
Sherlock is currently available for installation directly from GitHub. To install, run:
pip install sherlock-lit
Usage
Once installed, you can process a research paper by running:
sherlock_lit file_path.pdf
Replace file_path.pdf with the path to the research paper you want to process. The markdown card will be generated and saved in the output directory.
Example Workflow
-
Install sherlock-lit from PypI.
-
Run
sherlock_liton a PDF file:sherlock_lit example_paper.pdf -
Retrieve the generated markdown card from the output directory (card-papers folder located in the current directory).
Contributions
Contributions are welcome! I will be writing the contribution guidelines very soon.
Future Plans
- Optimize processing speed further.
- Improve parsing of papers.
- Add support for more advanced GPU and multi-threaded processing.
- Expand compatibility for additional research paper formats.
License
Sherlock-lit is licensed under the MIT License.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file sherlock_lit-0.1.4.tar.gz.
File metadata
- Download URL: sherlock_lit-0.1.4.tar.gz
- Upload date:
- Size: 13.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.11.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
abad6c225c74737fffefe0ef07c8437752e3d780ec940f5667e85bb909132fb1
|
|
| MD5 |
10d8e8a5409f51e94c1d939358fa829c
|
|
| BLAKE2b-256 |
fd220a8e72bd23194b1f1c0395528b53d5cf091533e6204dd6e0fda08204da7a
|
File details
Details for the file sherlock_lit-0.1.4-py3-none-any.whl.
File metadata
- Download URL: sherlock_lit-0.1.4-py3-none-any.whl
- Upload date:
- Size: 13.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.11.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c6d836fab772936348671cd8344d9a7946453439c9206f5289266e984fd08e3a
|
|
| MD5 |
a654189621d7a1852b71d1af58bc0c42
|
|
| BLAKE2b-256 |
28d99beba939e672be45aa558bc9c40502074fd41327b0b4bc3d89acd886fede
|