A library for extracting information contained in resumes

These details have not been verified by PyPI

Project links

Project description

Resume Extract

이력서 URL을 입력하면 정보를 추출하는 파이썬 라이브러리입니다. Google의 LangExtract를 활용하여 구조화된 정보 추출을 제공합니다.

요구사항: Python 3.10+

설치

uv 사용 (권장)

uv add resume_extract

pip 사용

pip install resume_extract

3. 환경 설정

LangExtract API 키가 필요합니다:

# 환경변수 설정
export LANGEXTRACT_API_KEY="your-api-key-here"

# 또는 .env 파일에 추가
echo "LANGEXTRACT_API_KEY=your-api-key-here" > .env

사용법

기본 사용법

from resume_extract import ResumeExtractor

# 추출기 초기화
extractor = ResumeExtractor()

# URL에서 이력서 정보 추출
result = extractor.extract_from_url("https://example.com/resume.pdf")

print(result.name)
print(result.contact.email)
print(result.skills)
print(result.experience)

편의 함수 사용

from resume_extract import extract_from_url, extract_from_file

# 간단한 한 줄 사용
result = extract_from_url("https://example.com/resume.pdf")
result = extract_from_file("./resume.pdf")

컨텍스트 매니저 사용

with ResumeExtractor() as extractor:
    result = extractor.extract_from_url("https://example.com/resume.pdf")
    # 자동으로 리소스 정리됨

지원 형식

PDF 파일
Word 문서 (.docx, .doc)
HTML 웹페이지
일반 텍스트 파일

추출되는 정보

이름
연락처 (이메일, 전화번호, 주소, LinkedIn, GitHub)
기술/스킬
경력 사항
학력
프로젝트 경험
자격증

개발 명령어

의존성 관리

# 의존성 설치
make install

# 개발 의존성 포함 설치
make install-dev

# 의존성 추가
make add PACKAGE=package-name

# 개발 의존성 추가
make add-dev PACKAGE=package-name

# 락파일 업데이트
make lock

테스트

# 기본 테스트
make test

# 상세 테스트
make test-verbose

# 커버리지 포함 테스트
make test-cov

코드 품질

# 코드 포맷팅
make format

# 포맷 확인
make format-check

# 린팅
make lint

빌드 및 배포

# 빌드
make build

# 배포
make publish

예제 실행

# 기본 예제 실행
make example

# 또는 직접 실행
uv run python examples/basic_usage.py

설정 옵션

extractor = ResumeExtractor(
    langextract_api_key="your-key",        # API 키
    model_id="gemini-2.0-flash",           # 모델 선택
    max_file_size_mb=10,                   # 최대 파일 크기
    timeout=30,                            # 네트워크 타임아웃
    max_retries=3                          # 재시도 횟수
)

데이터 모델

추출된 정보는 구조화된 Pydantic 모델로 반환됩니다:

ResumeInfo: 전체 이력서 정보
ContactInfo: 연락처 정보
ExperienceInfo: 경력 정보
EducationInfo: 학력 정보
ProjectInfo: 프로젝트 정보
CertificationInfo: 자격증 정보

JSON 출력 예제

result = extractor.extract_from_url("https://example.com/resume.pdf")

# 딕셔너리로 변환
data = result.to_dict()

# JSON 문자열로 변환
json_str = result.to_json()
print(json_str)

오류 처리

from resume_extract import (
    ResumeExtractor,
    InvalidURLError,
    UnsupportedFileTypeError,
    DownloadError
)

try:
    result = extractor.extract_from_url("https://example.com/resume.pdf")
except InvalidURLError as e:
    print(f"잘못된 URL: {e}")
except UnsupportedFileTypeError as e:
    print(f"지원하지 않는 파일 형식: {e}")
except DownloadError as e:
    print(f"다운로드 실패: {e}")

성능 최적화

배치 처리: 여러 이력서를 처리할 때는 하나의 ResumeExtractor 인스턴스를 재사용
캐싱: 동일한 URL의 경우 결과를 캐싱하여 재사용
병렬 처리: 독립적인 이력서들은 병렬로 처리 가능

# 배치 처리 예제
urls = ["url1.pdf", "url2.pdf", "url3.pdf"]

with ResumeExtractor() as extractor:
    results = []
    for url in urls:
        try:
            result = extractor.extract_from_url(url)
            results.append(result)
        except Exception as e:
            print(f"처리 실패 {url}: {e}")

라이선스

MIT License

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.0.1

Oct 3, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

resume_extract-0.0.1.tar.gz (158.6 kB view details)

Uploaded Oct 3, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

resume_extract-0.0.1-py3-none-any.whl (15.8 kB view details)

Uploaded Oct 3, 2025 Python 3

File details

Details for the file resume_extract-0.0.1.tar.gz.

File metadata

Download URL: resume_extract-0.0.1.tar.gz
Upload date: Oct 3, 2025
Size: 158.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.11

File hashes

Hashes for resume_extract-0.0.1.tar.gz
Algorithm	Hash digest
SHA256	`83db13678cebcb22e84725ec8582651c43fd039ef193226e14e76d2e651083f6`
MD5	`8a95f475483ecddd55afd60020b24d96`
BLAKE2b-256	`50d35c925a82915b80cc5696596def229d0dc3ae0c1d07fa62157a2184895966`

See more details on using hashes here.

File details

Details for the file resume_extract-0.0.1-py3-none-any.whl.

File metadata

Download URL: resume_extract-0.0.1-py3-none-any.whl
Upload date: Oct 3, 2025
Size: 15.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.11

File hashes

Hashes for resume_extract-0.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c2aa6b265e7f57da4ccb3781f585c4a2aa492dedba2066e6eec5ef8161b5ceb9`
MD5	`ba52fc4e3936d4468843a9569d18ea33`
BLAKE2b-256	`33ecc450fcff164215da286ab3870da9c237e213f41a9ac3a50163270ef27f36`

See more details on using hashes here.

resume-extract 0.0.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Resume Extract

설치

uv 사용 (권장)

pip 사용

3. 환경 설정

사용법

기본 사용법

편의 함수 사용

컨텍스트 매니저 사용

지원 형식

추출되는 정보

개발 명령어

의존성 관리

테스트

코드 품질

빌드 및 배포

예제 실행

설정 옵션

데이터 모델

JSON 출력 예제

오류 처리

성능 최적화

라이선스

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes