
Date parsing library designed to parse dates from HTML pages

Project description


dateparser provides modules to easily parse localized dates in almost any string format commonly found on web pages.

Documentation

Documentation can be found here.

Features

  • Generic parsing of dates in English, Spanish, Dutch, Russian and several other languages and formats.

  • Generic parsing of relative dates like: '1 min ago', '2 weeks ago', '3 months, 1 week and 1 day ago'.

  • Generic parsing of dates with time zone abbreviations or UTC offsets like: 'August 14, 2015 EST', 'July 4, 2013 PST', '21 July 2013 10:15 pm +0500'.

  • Extensive test coverage.
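
To make the UTC offset case concrete, the sketch below shows what the last example string in the list denotes. It deliberately uses only the Python standard library with a fixed format string, so it is an illustration of the expected result and not a picture of how dateparser works internally.

```python
from datetime import datetime, timedelta, timezone

text = "21 July 2013 10:15 pm +0500"

# %z reads the trailing "+0500" as a fixed UTC offset of five hours.
parsed = datetime.strptime(text, "%d %B %Y %I:%M %p %z")
print(parsed)   # 2013-07-21 22:15:00+05:00

# The same instant expressed in UTC.
as_utc = parsed.astimezone(timezone.utc)
print(as_utc)   # 2013-07-21 17:15:00+00:00
```

Unlike this fixed-format call, dateparser is meant to handle such strings without being told the format in advance.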

Usage

The most straightforward way is to use the dateparser.parse function, which wraps most of the functionality in the module.

Relative Dates

>>> from dateparser import parse
>>> parse('1 hour ago')
datetime.datetime(2015, 5, 31, 23, 0)
>>> parse(u'Il ya 2 heures')  # French (2 hours ago)
datetime.datetime(2015, 5, 31, 22, 0)
>>> parse(u'1 anno 2 mesi')  # Italian (1 year 2 months)
datetime.datetime(2014, 4, 1, 0, 0)
>>> parse(u'yaklaşık 23 saat önce')  # Turkish (23 hours ago)
datetime.datetime(2015, 5, 31, 1, 0)
>>> parse(u'Hace una semana')  # Spanish (a week ago)
datetime.datetime(2015, 5, 25, 0, 0)
>>> parse(u'2小时前')  # Chinese (2 hours ago)
datetime.datetime(2015, 5, 31, 22, 0)
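
Relative dates are resolved against the moment parse is called, so the results above will differ on every run. The values shown appear to correspond to a call made at midnight on 2015-06-01. The following is a minimal sketch of that arithmetic using only the standard library. The resolve_relative helper, its unit table, and its regular expression are hypothetical and cover English only; they are not dateparser's implementation.

```python
import re
from datetime import datetime, timedelta

# Hypothetical unit table for this sketch; not dateparser's own data.
# Months and years are left out because their length varies.
UNIT_SECONDS = {"min": 60, "hour": 3600, "day": 86400, "week": 604800}

# Matches pairs such as "1 min", "2 weeks" or "23 hours".
PATTERN = re.compile(r"(\d+)\s+(min|hour|day|week)s?\b")

def resolve_relative(text, now):
    """Subtract every '<number> <unit>' pair found in text from now."""
    delta = timedelta()
    for amount, unit in PATTERN.findall(text.lower()):
        delta += timedelta(seconds=int(amount) * UNIT_SECONDS[unit])
    return now - delta

reference = datetime(2015, 6, 1, 0, 0)
print(resolve_relative("1 hour ago", reference))            # 2015-05-31 23:00:00
print(resolve_relative("1 week and 1 day ago", reference))  # 2015-05-24 00:00:00
```

Passing the reference time explicitly, as this sketch does, is what makes the output reproducible.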

Dependencies

dateparser translates non-English dates to English and then uses the parser from the dateutil module to parse the translated date.

Also, it requires PyYAML for its language detection module to work.
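
The translate-then-parse idea can be sketched in a few lines. Everything here is an assumption made for illustration: the SPANISH_TO_ENGLISH table is a tiny hypothetical stand-in for the real language data in languages.yaml, and the standard library's strptime stands in for the dateutil parser so that the example has no third-party dependencies.

```python
from datetime import datetime

# Hypothetical, tiny Spanish-to-English table for illustration only.
# An empty string means the word is dropped during translation.
SPANISH_TO_ENGLISH = {
    "enero": "January",
    "julio": "July",
    "agosto": "August",
    "de": "",
}

def translate(text):
    """Replace known Spanish words with English ones, keep the rest."""
    words = (SPANISH_TO_ENGLISH.get(w.lower(), w) for w in text.split())
    return " ".join(w for w in words if w)

def parse_spanish(text):
    # Stand-in for the dateutil step: a fixed-format stdlib parse.
    return datetime.strptime(translate(text), "%d %B %Y")

print(translate("14 de agosto de 2015"))      # 14 August 2015
print(parse_spanish("14 de agosto de 2015"))  # 2015-08-14 00:00:00
```

Keeping the language data in a separate table means a new language can be supported by adding entries, without changing the parsing step.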

Limitations

  • Limited language support.

History

0.3.0 (2015-07-29)

New features:

  • Compatibility with Python 3 and PyPy.

Improvements:

  • languages.yaml data cleaned up to make it human-readable.

  • Improved Spanish date parsing.

0.2.1 (2015-07-13)

  • Support for generic parsing of dates with UTC offset.

  • Support for Filipino dates.

  • Improved support for French and Spanish dates.

0.2.0 (2015-06-17)

  • Easy-to-use parse function.

  • Language definitions using YAML.

  • Translation-based approach for parsing non-English languages. Previously, dateutil.parserinfo was used for language definitions.

  • Better period extraction.

  • Improved tests.

  • Added a number of new simplifications for more comprehensive generic parsing.

  • Improved validation for dates.

  • Support for Polish, Thai and Arabic dates.

  • Support for pytz timezones.

  • Fixed building and packaging issues.

0.1.0 (2014-11-24)

  • First release on PyPI.
