Extract date and location from a list of strings

Date Location Extractor

date_location_extractor retrieves the dates and locations found in a list of strings. The input can be either a JSON file containing a list or a Python list passed directly.

Install & Setup

Grab the package using pip (this may take a few minutes):

pip install date-location-extractor

Date Location Extractor uses the following modules (datetime, ast, and os are part of the Python standard library):

  • datetime
  • dateutil
  • geotext
  • datefinder
  • ast
  • os

Basic Usage

Import the module, give it a JSON file or a list of strings, and presto.

from date_location_extractor import DateLocationExtractor

date_location_extractor = DateLocationExtractor()
print(date_location_extractor.get_date_location_from_json_file("list_to_parse.json", use_simple_parser=True))
  • use_simple_parser=True skips datefinder and parses dates with the simple dateutil parser instead
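
The call above reads list_to_parse.json. The exact file layout is not documented here, so the following is only a minimal sketch, assuming the file is a plain JSON array of strings. The sample strings are made up for illustration, and only the standard library is used.

```python
import json
from pathlib import Path

# Hypothetical sample strings; replace them with your own data.
strings_to_parse = [
    "San Juan Costa Rica 27 November 2009",
    "13 May 2009",
    "12/15/2010",
]

# Write the list as a JSON array (assumed input format, see the note above).
input_path = Path("list_to_parse.json")
input_path.write_text(json.dumps(strings_to_parse, indent=2), encoding="utf-8")

# Read the file back to confirm it holds a plain list of strings.
loaded = json.loads(input_path.read_text(encoding="utf-8"))
```

The resulting file can then be passed by name to get_date_location_from_json_file as shown earlier.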

get_date_location_from_json_file returns a list of dictionaries, e.g.:

[{"address": "San Juan Costa Rica", "date_iso": "2009-11-27", "ranking": 1.0, "normalized_address": {"City": "San Juan", "Country": "CR"}}]
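
Because the result is an ordinary list of dictionaries, it can be filtered and sorted with plain Python. The sketch below assumes every entry carries the keys shown in the example. The helper best_matches, its min_ranking threshold, and the second sample entry are hypothetical and not part of the package.

```python
# First entry copied from the example above; the second is a made-up illustration.
results = [
    {"address": "San Juan Costa Rica", "date_iso": "2009-11-27", "ranking": 1.0,
     "normalized_address": {"City": "San Juan", "Country": "CR"}},
    {"address": "", "date_iso": "2009-05-13", "ranking": 0.5,
     "normalized_address": {}},
]


def best_matches(results, min_ranking=0.8):
    """Return entries at or above min_ranking, highest ranking first."""
    kept = [r for r in results if r.get("ranking", 0.0) >= min_ranking]
    return sorted(kept, key=lambda r: r["ranking"], reverse=True)


top = best_matches(results)
for entry in top:
    # Fall back to a placeholder when no city was recognised.
    city = entry.get("normalized_address", {}).get("City", "unknown city")
    print(entry["date_iso"], city, entry["ranking"])
```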

Without loading a file:

from date_location_extractor import DateLocationExtractor

date_location_extractor = DateLocationExtractor()
print(date_location_extractor.get_date_location_from_list(["13 May 2009", "12/15/2010"]))
print(date_location_extractor.get_date_location_from_list_with_parser(["13 May 2009", "12/15/2010"]))

The ranking algorithm has the following weights set:

  • RANKING_WEIGHT_HAS_DATE = 0.3
  • RANKING_WEIGHT_HAS_DAY = 0.2
  • RANKING_WEIGHT_HAS_COUNTRY = 0.3
  • RANKING_WEIGHT_HAS_CITY = 0.2
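
How these weights are combined is not spelled out here. One plausible reading, offered as an assumption rather than the package's confirmed behaviour, is that the ranking is the sum of the weights of the components that were found. That reading agrees with the example result, where a full date plus a city and a country gives 1.0. The function estimate_ranking and its parameters are hypothetical.

```python
RANKING_WEIGHT_HAS_DATE = 0.3
RANKING_WEIGHT_HAS_DAY = 0.2
RANKING_WEIGHT_HAS_COUNTRY = 0.3
RANKING_WEIGHT_HAS_CITY = 0.2


def estimate_ranking(has_date, has_day, has_country, has_city):
    """Assumed additive score: sum the weight of each component that was found."""
    score = 0.0
    if has_date:
        score += RANKING_WEIGHT_HAS_DATE
    if has_day:
        score += RANKING_WEIGHT_HAS_DAY
    if has_country:
        score += RANKING_WEIGHT_HAS_COUNTRY
    if has_city:
        score += RANKING_WEIGHT_HAS_CITY
    # Round to hide floating-point noise from adding decimal fractions.
    return round(score, 2)


full_match = estimate_ranking(True, True, True, True)
date_and_country = estimate_ranking(True, False, True, False)
```

Under this assumption the four weights sum to 1.0, so the ranking can be read as the fraction of the expected information that was recovered from a string.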


Source Distribution

date_location_extractor-0.1.1.tar.gz (4.1 kB)
