Skip to main content

AI-powered resume parser with parallel processing for multiple file formats (PDF, DOCX, images, etc.)

Project description

ResumeParser Pro 🚀

PyPI version Python Support License: MIT

Production-ready AI-powered resume parser with parallel processing capabilities. Extract structured data from resumes in PDF, DOCX, TXT, images, and more, using state-of-the-art language models.

🌟 Features

  • 🤖 AI-Powered: Uses advanced language models (GPT, Gemini, Claude, etc.).
  • ⚡ Parallel Processing: Process multiple resumes simultaneously.
  • 📊 Structured Output: Returns clean, validated JSON data.
  • 🎯 High Accuracy: Extracts 20+ fields with intelligent categorization.
  • 📁 Multi-Format Support: Parses PDFs, DOCX, TXT, images (PNG, JPG), HTML, and ODT files.
  • 🔌 Easy Integration: Simple API with just a few lines of code.

🚀 Quick Start

Installation

Core installation (for PDF, DOCX, TXT) pip install ai-resume-parser

To include support for all file types pip install ai-resume-parser[full]

See the "Supported File Formats" section for installing specific file handlers.

Basic Usage

from resumeparser_pro import ResumeParserPro

Initialize parser parser = ResumeParserPro( provider="google_genai", model_name="gemini-2.0-flash", api_key="your-api-key" )

Parse single resume (supports .pdf, .docx, .png, etc.) result = parser.parse_resume("path/to/your/resume.pdf")

if result.success: print(f"Name: {result.resume_data.contact_info.full_name}") print(f"Experience: {result.resume_data.total_experience_months} months")

📁 Supported File Formats

ResumeParser Pro supports a wide range of file formats. Core dependencies handle PDF, DOCX, and TXT. For other formats, install the optional extras.

Format Extensions Required Installation Command
Core Formats .pdf, .docx, .txt pip install ai-resume-parser
Images (OCR) .png, .jpg, .jpeg pip install ai-resume-parser[ocr]
HTML .html, .htm pip install ai-resume-parser[html]
OpenDocument .odt pip install ai-resume-parser[odt]

❗️ Important Note for Image Parsing: To parse images, you must have the Google Tesseract OCR engine installed on your system. This is a separate step from the pip installation.

🎯 Supported AI Providers

Since ai-resume-parser uses LangChain's init_chat_model, it supports all LangChain-compatible providers:

Provider Example Models Setup
Google Gemini 2.0 Flash, Gemini Pro, Gemini 1.5 provider="google_genai"
OpenAI GPT-4o, GPT-4o-mini, GPT-4 Turbo provider="openai"
Anthropic Claude 3.5 Sonnet, Claude 3 Opus provider="anthropic"

... (and others like Azure, Bedrock, Ollama, etc.)

Full list: See LangChain Model Providers for complete provider support.

📈 Performance

  • Speed: ~3-5 seconds per resume (depending on the LLM).
  • Parallel Processing: 5-10x faster for batch operations.
  • Accuracy: 95%+ field extraction accuracy.

🛠️ Advanced Features

Custom Configuration

parser = ResumeParserPro( provider="openai", model_name="gpt-4o-mini", api_key="your-api-key", max_workers=10, # Parallel processing workers temperature=0.1 # Model consistency )

🤝 Contributing

Contributions are welcome! Please read our contributing guidelines.

📄 License

MIT License - see LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ai_resume_parser-1.0.4.tar.gz (12.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ai_resume_parser-1.0.4-py3-none-any.whl (14.3 kB view details)

Uploaded Python 3

File details

Details for the file ai_resume_parser-1.0.4.tar.gz.

File metadata

  • Download URL: ai_resume_parser-1.0.4.tar.gz
  • Upload date:
  • Size: 12.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for ai_resume_parser-1.0.4.tar.gz
Algorithm Hash digest
SHA256 f6ef849ca22ae507f04a1c41a297b3e4eb4567880ad4187f1136a0eb7fd1a94c
MD5 4657542ea430ba7f56ddba1c3161bb1a
BLAKE2b-256 7f34ffd7b1b592ccb512feb3de1098d8b153d3e2dd35a5d15bef9cb05185fa50

See more details on using hashes here.

File details

Details for the file ai_resume_parser-1.0.4-py3-none-any.whl.

File metadata

File hashes

Hashes for ai_resume_parser-1.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 346ee1eecd2a4c5c5586d7cd96ef7ae8689abab61466f5be30c85e4aa84b1d7f
MD5 67578b3291141f78407884c3c0cd394f
BLAKE2b-256 2a44ce23aa35d907e1fa4e657aa746e5fde7e3335ffe2432b598f62c4ae71f3b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page