vitae-parser extracts and structures biographical details from unstructured text using pattern matching.
Project description
Vitae-Parse
A Python package for extracting and structuring biographical information from unstructured text about notable individuals.
Overview
This package uses a combination of natural language processing (NLP) and machine learning to extract key details such as name, profession, notable facts, and dates from text sources. It's designed for researchers, biographers, and historians who need to quickly parse and organize information from text sources.
Installation
You can install Vitae-Parse using pip:
pip install vitae_parser
Usage
You can use the vitae_parser function to extract information from a text input. Here's an example:
from vitae_parser import vitae_parser
user_input = """
John Doe is a renowned scientist who has published numerous papers on AI. He received his PhD in Computer Science from Stanford University in 2010.
"""
response = vitae_parser(user_input)
print(response)
The function takes in three parameters:
user_input: The text input to process, which should describe a person's life, achievements, or significant events.api_key: The API key for LLM7, which is used by default if not provided. You can get a free API key by registering at https://token.llm7.io/llm: The LangChain LL&M instance to use. If not provided, the defaultChatLLM7will be used. You can safely pass your own LLM instance if you want to use another LLM. For example, to use the OpenAI LLM:
from langchain_openai import ChatOpenAI
from vitae_parser import vitae_parser
llm = ChatOpenAI()
response = vitae_parser(user_input, llm=llm)
Development
This package uses the LangChain library to integrate with LLM7. You can find the documentation for LangChain at https://docs.langchain.com/. If you want to use a different LLM, you can pass your own instance of the BaseChatModel class.
The default rate limits for LLM7 free tier are sufficient for most use cases of this package. If you want higher rate limits for LLM7, you can pass your own API key via environment variable LLM7_API_KEY or via passing it directly.
Contributing
Contributions are welcome! Please submit issues or pull requests through the GitHub repository: https://github.com/chigwell/vitae-parser.
Author
Eugene Evstafev (eugene.evstafev@eugene.plus)
License
This package is released under the MIT license.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file vitae_parser-2025.12.22074807.tar.gz.
File metadata
- Download URL: vitae_parser-2025.12.22074807.tar.gz
- Upload date:
- Size: 6.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.1
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e3225c0edc6eaf5675d4137d8204ac7bd27d74a635a5d5d7a87489acd791262e
|
|
| MD5 |
7ebb4d8c20c63302de5f80f32cb1775f
|
|
| BLAKE2b-256 |
44077ee26f00e111848fa4448b14c698dc4bae6bbb8cd7434cb11a57c068cd66
|
File details
Details for the file vitae_parser-2025.12.22074807-py3-none-any.whl.
File metadata
- Download URL: vitae_parser-2025.12.22074807-py3-none-any.whl
- Upload date:
- Size: 7.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.1
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
97ba7c49e50770f57eb97ceaaa114e89d364f2dfb24530847a031e020133733c
|
|
| MD5 |
71b0fafec9aa6010dfbc8897717bba33
|
|
| BLAKE2b-256 |
ba109325f2995890f299fb3880d231d3639bb5c3ac841e02b3b61e547f07c1cf
|