Skip to main content

Sherlock-lit helps you get everything an Abstract hides with a fast technical description card (Research questions, contribution, possible future works) of an NLP paper before reading it.

Project description

Sherlock-lit

Sherlock-lit is a Python package (part of the tools used in haut platform) designed to process NLP research papers (Later, on more general papers) and generate precise descriptions of the papers in markdown (MD) card format. It streamlines understanding and documentation of research papers, making it an invaluable tool for researchers, students, and developers in NLP field to discover easily the contributions and research questions researchers tackle without being lost in papers or being doubtful after reading the abstract.


Features

  • Markdown Card Generation: Automatically generates concise and precise descriptions of NLP papers in markdown format.
  • Efficient Processing: Really fast on CPUs and also can be further improved with GPU.
  • Small Size: less than 70 MB.
  • First-Time Setup: The first run may be slower due to setup, but subsequent runs are significantly faster.
  • Simple Command-Line Interface: Process papers by simply providing the path to a PDF file or a folder containing PDFs files.

Installation

Sherlock is currently available for installation directly from GitHub. To install, run:

pip install sherlock-lit 

Usage

Once installed, you can process a research paper by running:

sherlock_lit file_path.pdf

Replace file_path.pdf with the path to the research paper you want to process. The markdown card will be generated and saved in the output directory.


Example Workflow

  1. Install sherlock-lit from PypI.

  2. Run sherlock_lit on a PDF file:

    sherlock_lit example_paper.pdf
    
  3. Retrieve the generated markdown card from the output directory (card-papers folder located in the current directory).


Contributions

Contributions are welcome! I will be writing the contribution guidelines very soon.


Future Plans

  • Optimize processing speed further.
  • Improve parsing of papers.
  • Add support for more advanced GPU and multi-threaded processing.
  • Expand compatibility for additional research paper formats.

License

Sherlock-lit is licensed under the MIT License.


Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sherlock_lit-0.1.4.tar.gz (13.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

sherlock_lit-0.1.4-py3-none-any.whl (13.0 kB view details)

Uploaded Python 3

File details

Details for the file sherlock_lit-0.1.4.tar.gz.

File metadata

  • Download URL: sherlock_lit-0.1.4.tar.gz
  • Upload date:
  • Size: 13.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.7

File hashes

Hashes for sherlock_lit-0.1.4.tar.gz
Algorithm Hash digest
SHA256 abad6c225c74737fffefe0ef07c8437752e3d780ec940f5667e85bb909132fb1
MD5 10d8e8a5409f51e94c1d939358fa829c
BLAKE2b-256 fd220a8e72bd23194b1f1c0395528b53d5cf091533e6204dd6e0fda08204da7a

See more details on using hashes here.

File details

Details for the file sherlock_lit-0.1.4-py3-none-any.whl.

File metadata

  • Download URL: sherlock_lit-0.1.4-py3-none-any.whl
  • Upload date:
  • Size: 13.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.7

File hashes

Hashes for sherlock_lit-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 c6d836fab772936348671cd8344d9a7946453439c9206f5289266e984fd08e3a
MD5 a654189621d7a1852b71d1af58bc0c42
BLAKE2b-256 28d99beba939e672be45aa558bc9c40502074fd41327b0b4bc3d89acd886fede

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page