Skip to main content

vitae-parser extracts and structures biographical details from unstructured text using pattern matching.

Project description

Vitae-Parse

PyPI version License: MIT Downloads LinkedIn

A Python package for extracting and structuring biographical information from unstructured text about notable individuals.

Overview

This package uses a combination of natural language processing (NLP) and machine learning to extract key details such as name, profession, notable facts, and dates from text sources. It's designed for researchers, biographers, and historians who need to quickly parse and organize information from text sources.

Installation

You can install Vitae-Parse using pip:

pip install vitae_parser

Usage

You can use the vitae_parser function to extract information from a text input. Here's an example:

from vitae_parser import vitae_parser

user_input = """
John Doe is a renowned scientist who has published numerous papers on AI. He received his PhD in Computer Science from Stanford University in 2010.
"""

response = vitae_parser(user_input)
print(response)

The function takes in three parameters:

  • user_input: The text input to process, which should describe a person's life, achievements, or significant events.
  • api_key: The API key for LLM7, which is used by default if not provided. You can get a free API key by registering at https://token.llm7.io/
  • llm: The LangChain LL&M instance to use. If not provided, the default ChatLLM7 will be used. You can safely pass your own LLM instance if you want to use another LLM. For example, to use the OpenAI LLM:
from langchain_openai import ChatOpenAI
from vitae_parser import vitae_parser

llm = ChatOpenAI()
response = vitae_parser(user_input, llm=llm)

Development

This package uses the LangChain library to integrate with LLM7. You can find the documentation for LangChain at https://docs.langchain.com/. If you want to use a different LLM, you can pass your own instance of the BaseChatModel class.

The default rate limits for LLM7 free tier are sufficient for most use cases of this package. If you want higher rate limits for LLM7, you can pass your own API key via environment variable LLM7_API_KEY or via passing it directly.

Contributing

Contributions are welcome! Please submit issues or pull requests through the GitHub repository: https://github.com/chigwell/vitae-parser.

Author

Eugene Evstafev (eugene.evstafev@eugene.plus)

License

This package is released under the MIT license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vitae_parser-2025.12.22074807.tar.gz (6.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vitae_parser-2025.12.22074807-py3-none-any.whl (7.1 kB view details)

Uploaded Python 3

File details

Details for the file vitae_parser-2025.12.22074807.tar.gz.

File metadata

  • Download URL: vitae_parser-2025.12.22074807.tar.gz
  • Upload date:
  • Size: 6.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.1

File hashes

Hashes for vitae_parser-2025.12.22074807.tar.gz
Algorithm Hash digest
SHA256 e3225c0edc6eaf5675d4137d8204ac7bd27d74a635a5d5d7a87489acd791262e
MD5 7ebb4d8c20c63302de5f80f32cb1775f
BLAKE2b-256 44077ee26f00e111848fa4448b14c698dc4bae6bbb8cd7434cb11a57c068cd66

See more details on using hashes here.

File details

Details for the file vitae_parser-2025.12.22074807-py3-none-any.whl.

File metadata

File hashes

Hashes for vitae_parser-2025.12.22074807-py3-none-any.whl
Algorithm Hash digest
SHA256 97ba7c49e50770f57eb97ceaaa114e89d364f2dfb24530847a031e020133733c
MD5 71b0fafec9aa6010dfbc8897717bba33
BLAKE2b-256 ba109325f2995890f299fb3880d231d3639bb5c3ac841e02b3b61e547f07c1cf

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page