Information parsing assistant
Installation
Install ffmpeg (brew install ffmpeg on macOS)
Install packages from requirements.txt (pip install -r requirements.txt)
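To confirm that ffmpeg is visible to Python after installation, a quick check like the following may help. This is a minimal sketch using only the standard library; the helper name ffmpeg_available is hypothetical and not part of this package.

```python
import shutil


def ffmpeg_available(executable: str = "ffmpeg") -> bool:
    """Return True if the named executable can be found on PATH."""
    # shutil.which returns the full path to the executable, or None if absent.
    return shutil.which(executable) is not None


if __name__ == "__main__":
    if ffmpeg_available():
        print("ffmpeg found on PATH")
    else:
        print("ffmpeg not found; install it before using the package")
```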
Configuration
Set parameters as appropriate in config.json
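The parameters in config.json are not listed here, so the sketch below only shows one way to load the file and inspect which keys it defines before editing them. The helper name load_config is hypothetical, and no particular key names are assumed.

```python
import json
from pathlib import Path


def load_config(path="config.json") -> dict:
    """Read a JSON config file and return its contents as a dict."""
    with Path(path).open(encoding="utf-8") as f:
        config = json.load(f)
    # Assumption: the top level of the file is a JSON object.
    if not isinstance(config, dict):
        raise ValueError("expected a JSON object at the top level of the config")
    return config


if __name__ == "__main__":
    # List the parameter names present in the local config.json, if any.
    if Path("config.json").exists():
        print(sorted(load_config()))
```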
Usage
See demo.ipynb for an example.
- download_and_parse_pdf(pdf_url) -> pdf_text
- get_arxiv_paper(arxiv_url) -> {"title": title, "url": arxiv_url, "text": pdf_text}
- get_youtube_transcript(youtube_url) -> {"title": title, "date": date, "yt_url": youtube_url, "uploader": uploader, "channel": channel, "transcripts": transcripts}
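The return shapes listed above can be checked before further processing. The following is a minimal sketch, not part of the package: the key sets are copied from the list above, the helper missing_keys is hypothetical, and the sample dict holds placeholder values rather than real output. The type of each value (for example, whether transcripts is a string or a list) is not specified here, so only key presence is checked.

```python
# Keys documented for each result dict in the function list above.
ARXIV_KEYS = {"title", "url", "text"}
YOUTUBE_KEYS = {"title", "date", "yt_url", "uploader", "channel", "transcripts"}


def missing_keys(result: dict, expected: set) -> set:
    """Return the documented keys that are absent from a result dict."""
    return expected - set(result)


# Placeholder standing in for a get_arxiv_paper(arxiv_url) result.
paper = {
    "title": "Example title",
    "url": "https://arxiv.org/abs/0000.00000",
    "text": "Example text",
}

if __name__ == "__main__":
    absent = missing_keys(paper, ARXIV_KEYS)
    print("missing keys:", sorted(absent))
```

In real use, the placeholder dict would be replaced by the value returned from get_arxiv_paper or get_youtube_transcript; see demo.ipynb for how those functions are imported.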
What happened to the old version?
Sorry! I'm working on making this package more streamlined. For the old version, please see the distyll_old branch.
Download files
Source Distribution
distyll_info-0.1.0.tar.gz (7.0 kB)
Built Distribution
distyll_info-0.1.0-py3-none-any.whl (8.2 kB)
File details
Details for the file distyll_info-0.1.0.tar.gz.
File metadata
- Download URL: distyll_info-0.1.0.tar.gz
- Upload date:
- Size: 7.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.2 CPython/3.12.2 Darwin/23.3.0
File hashes
Algorithm | Hash digest
---|---
SHA256 | 079a786bd572708e1e3bde61a9b1fed7e8582fa75ed6b2e779075fb804ba505f
MD5 | f2fe68a13a6d1a0d43f3810d5d9d431b
BLAKE2b-256 | e8b253b832311f222aac4b0f056784c49e0b7eddc91698a8babbf04f7c7074d4
File details
Details for the file distyll_info-0.1.0-py3-none-any.whl.
File metadata
- Download URL: distyll_info-0.1.0-py3-none-any.whl
- Upload date:
- Size: 8.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.2 CPython/3.12.2 Darwin/23.3.0
File hashes
Algorithm | Hash digest
---|---
SHA256 | 9d72f9453fff6d6b07a19b419053fc0d4696f5388f2034fdb0129c83a9f6b408
MD5 | 5d2f2cda6dd92d602fef7e3ab02de748
BLAKE2b-256 | bf8f67d0227425a21505627d2395ea75098cc267bd5788618666c524058daaa8