A package to parse raw HTML and return structured information.
Project description
html2info
html2info
is a Python package that allows you to parse LinkedIn profiles from raw HTML and return structured information in JSON format.
Features
- Extracts profile information such as name, title, location, profile photo, about, experience, and education.
- Returns a JSON object containing the parsed data.
Installation
Install html2info
using pip:
pip install html2info
Usage
Here's an example of how to use html2info:
from html2info.linkedin import Person
url = "https://www.linkedin.com/in/iglovikov/" raw_data = "..." # Raw HTML content of the LinkedIn page
person = Person(url, raw_data) person.parse() print(person.to_dict())
{
"linkedin_url": "https://www.linkedin.com/in/iglovikov/",
...
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
html2info-0.1.0.tar.gz
(4.1 kB
view hashes)
Built Distribution
Close
Hashes for html2info-0.1.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 10222d4877034a0f5b3883ce976ecff63525581fe1c6c1292818d783c776a99d |
|
MD5 | 7af8ea2daa3be21c2fe8af3106947dc4 |
|
BLAKE2b-256 | b941b123f8a027a1ef12f44cc9db94956d0c94761722f5a4fb7879fedb23d348 |