Skip to main content

A new package is designed to analyze user-submitted text about Hacker News discussions to identify and extract a structured list of books mentioned in 2025. It leverages language models to recognize b

Project description

HN Book Extractor

PyPI version License: MIT Downloads LinkedIn

HN Book Extractor is a Python package designed to analyze user-submitted text from Hacker News discussions and extract a structured list of books mentioned in the year 2025. It utilizes language models to intelligently recognize book titles, authors, and publication years within unstructured comments or posts, providing a clear, formatted summary of all referenced books.

This tool is ideal for content creators, researchers, and enthusiasts who need to quickly compile relevant book lists from large discussion datasets without the need for manual effort.

Features

  • Extracts book titles, authors, and publication years from unstructured text.
  • Specifically targets books mentioned in the year 2025.
  • Leverages language models for accurate recognition of book information.
  • Outputs a structured list of extracted books.

Installation

pip install hnbookextractor

Usage

The hnbookextractor function takes user input text and returns a list of extracted book information.

from hnbookextractor import hnbookextractor

user_text = """
I recently read 'Project Hail Mary' by Andy Weir, published in 2021, and it was fantastic.
Someone in another thread mentioned 'The Three-Body Problem' by Cixin Liu, which I plan to read next.
Looking forward to 'Dune' by Frank Herbert, though that's an older classic.
Has anyone read any new sci-fi books released in 2025? I heard 'Hyperion' by Dan Simmons is getting a sequel.
"""

books = hnbookextractor(user_input=user_text)
print(books)

Parameters

The hnbookextractor function accepts the following parameters:

  • user_input (str): The raw text input from Hacker News discussions to analyze.
  • api_key (Optional[str]): Your LLM7 API key. If not provided, the package will attempt to use the LLM7_API_KEY environment variable.
  • llm (Optional[BaseChatModel]): An optional Langchain-compatible chat model instance. If not provided, a default ChatLLM7 instance will be used.

LLM Configuration

By default, the package uses ChatLLM7 from langchain_llm7.

Using a Different LLM

You can provide your own Langchain LLM instance to the hnbookextractor function if you prefer to use other models.

Example with OpenAI:

from langchain_openai import ChatOpenAI
from hnbookextractor import hnbookextractor

llm = ChatOpenAI(api_key="YOUR_OPENAI_API_KEY")
response = hnbookextractor(user_input="...", llm=llm)

Example with Anthropic:

from langchain_anthropic import ChatAnthropic
from hnbookextractor import hnbookextractor

llm = ChatAnthropic(api_key="YOUR_ANTHROPIC_API_KEY")
response = hnbookextractor(user_input="...", llm=llm)

Example with Google Generative AI:

from langchain_google_genai import ChatGoogleGenerativeAI
from hnbookextractor import hnbookextractor

llm = ChatGoogleGenerativeAI(api_key="YOUR_GOOGLE_API_KEY")
response = hnbookextractor(user_input="...", llm=llm)

LLM7 API Key

The default rate limits for the LLM7 free tier are generally sufficient for most users. If you require higher rate limits for LLM7, you can provide your own API key either by setting the LLM7_API_KEY environment variable or by passing it directly to the hnbookextractor function.

You can obtain a free API key by registering at https://token.llm7.io/.

Contributing

Please report any issues or suggest improvements on the GitHub repository.

License

[MIT License] (Specify your license here if different)

Author

GitHub Repository

https://github.com/chigwell/hnbookextractor

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hnbookextractor-2025.12.21171700.tar.gz (4.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

hnbookextractor-2025.12.21171700-py3-none-any.whl (5.2 kB view details)

Uploaded Python 3

File details

Details for the file hnbookextractor-2025.12.21171700.tar.gz.

File metadata

File hashes

Hashes for hnbookextractor-2025.12.21171700.tar.gz
Algorithm Hash digest
SHA256 13b9d7de2e0fc7570527d9cad18e2c7c959997d2eee04bda6e4acf801e200fd5
MD5 03947f15e063a6578a5cac67bc47e425
BLAKE2b-256 6502229fb1d842b84d5093fcee8d72da68936594dd786ecf36abeaea7608a231

See more details on using hashes here.

File details

Details for the file hnbookextractor-2025.12.21171700-py3-none-any.whl.

File metadata

File hashes

Hashes for hnbookextractor-2025.12.21171700-py3-none-any.whl
Algorithm Hash digest
SHA256 b25439257d361cdadcb43af5e59bdc95df1f0be73bea40a2974b893b644cbfbc
MD5 069142c71c09bf40a02f108f55836727
BLAKE2b-256 d7d1e441844ef0aef63dba18ef1f33dfdda2a3a9acccf96d5fea3c40584e908c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page