llama-index readers wordpress integration
Project description
Wordpress Loader
pip install llama-index-readers-wordpress
This loader fetches the text from Wordpress blog posts using the Wordpress API. It also uses the BeautifulSoup library to parse the HTML and extract the text from the articles.
Usage
To use this loader, you need to pass base url of the Wordpress installation
(e.g. https://www.mysite.com
) and optionally a username, and an application
password for the user (more about application passwords
here)
from llama_index.readers.wordpress import WordpressReader
loader = WordpressReader(
url="https://www.mysite.com",
username="my_username",
password="my_password",
)
documents = loader.load_data()
This loader is designed to be used as a way to load data into LlamaIndex.
Pages and Posts
Be default, the loader retrieves both Wordpress pages (static content) and
posts (blog entries) from the target site. This behavior can be configured
by setting get_pages=False
or get_posts=False
when initializing the
WordpressReader
object.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for llama_index_readers_wordpress-0.2.2.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 18753154994f317557e154c50be58863fbf4b7c86f548a9afb401374dd0262e0 |
|
MD5 | 2099ef5ca2e5552176ecfc516b7e45ba |
|
BLAKE2b-256 | 57ad593eb7705e5d6bbcdf4a811972bc6c726f0f3fec77759188b85881fbb3c3 |
Hashes for llama_index_readers_wordpress-0.2.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 38b91aceacf526eb37a0b0d99008fd159e65c0d122f167a1fe458e2ca98b79d2 |
|
MD5 | bdaa047a24c6cab8df67bbd0ae6401ba |
|
BLAKE2b-256 | 97fadaf662882e18f61ce07c60627ad05112dcf673b8c4f8a9a60a484d81436a |