A high-performance map matching tool for GMNS networks
Project description
MapMatcher4GMNS
A high-performance map matching tool for GPS trajectories on GMNS (General Modeling Network Specification) networks.
Features
- High-Performance Map Matching: Efficient Hidden Markov Model (HMM) based map matching algorithm
- GMNS Network Support: Native support for GMNS network format (node.csv, link.csv)
- Multi-Core Processing: Built-in parallel processing support for large-scale GPS data
- Flexible Configuration: Comprehensive parameters for fine-tuning matching quality
- Route Generation: Automatic generation of complete routes between matched points
Installation
From PyPI
pip install mapmatcher4gmns
Quick Start
import mapmatcher4gmns as m4g
def main():
# Load network from GMNS format
net = m4g.LoadNetFromCSV(
folder='path/to/network',
node_file='node.csv',
link_file='link.csv'
)
# Create matcher
matcher = m4g.MapMatcher(
network=net,
time_field='timestamp',
time_format='%Y-%m-%dT%H:%M:%S.%fZ',
out_dir='output',
result_file='matched_result.csv',
route_file='matched_route.csv',
)
# Perform map matching (pass CSV path)
matcher.match('gps_data.csv')
# Note: match(...) accepts CSV path string input.
if __name__ == '__main__':
main()
Input Data Requirements
Network Files (GMNS Format)
node.csv (required fields):
node_id: Unique node identifierx_coord: Longitude (if coordinate_type='lonlat') or X coordinatey_coord: Latitude (if coordinate_type='lonlat') or Y coordinate
link.csv (required fields):
link_id: Unique link identifierfrom_node_id: Starting node IDto_node_id: Ending node IDlanes: Number of lanesgeometry: LineString geometry in WKT format
GPS Data
Required fields:
journey_id(or custom agent_field): Unique identifier for each GPS trajectorylongitude: GPS longitudelatitude: GPS latitude
Optional but recommended:
time(or custom time_field): Timestamp for temporal orderingspeed: Speedheading: Heading direction in degrees
Configuration Parameters
Core Matching Parameters
search_radius(default: 15.0): Search radius in meters for candidate linksnoise_sigma(default: 8.0): GPS noise standard deviation in meterstrans_weight(default: 12.0): Weight for transition probabilitymax_candidates(default: 10): Maximum number of candidate links per GPS point
Movement Consistency
turn_sigma(default: 45.0): Turn angle standard deviation in degreesheading_sigma(default: 30.0): Heading difference standard deviationuse_heading(default: True): Whether to use heading information
Filtering
filter_dwell(default: True): Filter out stationary pointsdwell_dist(default: 5.0): Distance threshold for dwell detection in metersdwell_count(default: 2): Minimum consecutive points to be considered dwellingmax_gap_seconds(default: 45.0): Maximum time gap allowed between consecutive points
Performance
core_num: Number of CPU cores to use (default: 1). If set aboveos.cpu_count(), it is capped to available CPU cores.max_agents(default: None): Maximum number of trajectories to process (useful for testing/debugging). If set, only the first N trajectories will be matched
Output
The tool generates two main output files:
1. Matched Results (matched_result.csv)
Contains the matched GPS points with:
journey_id: Trajectory identifierseq: Sequence numbertime: Timestamplink_id: Matched link IDfrom_node_id,to_node_id: Link endpointslongitude,latitude: Original GPS coordinatesspeed_mph: Speed (if provided)match_heading: Heading of matched linkroute_dis: Culmulative route distance
2. Route File (matched_route.csv)
Contains the complete route for each journey:
journey_id: Trajectory identifierlink_ids: Comma-separated list of link IDs forming the complete route
3. Run Summary (summary.txt)
Contains run-level summary statistics:
- Input/kept/dropped/matched journeys
- Input/matched data points and match rate
- Total elapsed time
Advanced Usage
Note: When using multiprocessing features, wrap your code in if __name__ == '__main__': to avoid issues, especially on Windows.
Multi-Core Processing
matcher = m4g.MapMatcher(
network=net,
core_num=2, # Use 2 CPU cores
# ... other parameters
)
Large CSV (Memory-Safe Streaming)
For very large GPS files (for example, 100M+ rows), avoid loading all rows
into memory at once. Pass CSV path directly to match(...).
In streaming mode, the matcher internally hashes journey_id into temporary
partitions first, then processes partition files. This keeps the same
journey_id in one partition without requiring a global CSV sort.
matcher = m4g.MapMatcher(
network=net,
time_field='local_time',
time_format='%Y-%m-%dT%H:%M:%S%z',
out_dir='output',
result_file='matched_result.csv',
route_file='matched_route.csv',
core_num=1,
)
matcher.match('data.csv')
Custom Field Names
matcher = m4g.MapMatcher(
network=net,
agent_field='vehicle_id', # Custom trajectory ID field
lng_field='lon', # Custom longitude field
lat_field='lat', # Custom latitude field
time_field='timestamp', # Custom time field
# ... other parameters
)
Extra Fields
Keep additional fields from input GPS data in the output:
matcher = m4g.MapMatcher(
network=net,
extra_fields=['vehicle_type', 'driver_id', 'trip_purpose'],
# ... other parameters
)
Requirements
- Python >= 3.8
- numpy >= 1.20.0
- pandas >= 1.3.0
- shapely >= 2.0.0
- geopandas >= 0.10.0
- networkx >= 2.6.0
- tqdm >= 4.60.0
Citation
If you use this tool in your research, please cite this tool.
[Add your citation here]
License
This project is licensed under the MIT License - see the LICENSE file for details.
Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
Acknowledgments
This package was inspired by and references the excellent work of the TrackIt (GoTrackIt) project. We are grateful for their contributions to the open-source map matching community and their innovative approach to HMM-based map matching algorithms.
References
- TrackIt/GoTrackIt: A comprehensive map matching Python package based on Hidden Markov Model (HMM)
- GitHub: https://github.com/zdsjjtTLG/TrackIt
- Documentation: https://gotrackit.readthedocs.io/
- Developed by: TangKai and contributors at Hangzhou Zecheng Data Technology Co., Ltd.
This tool is designed to work with the General Modeling Network Specification (GMNS) format, supporting transportation network analysis and GPS trajectory processing.
Support
For questions, issues, or feature requests, please contact us.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file mapmatcher4gmns-0.1.8.tar.gz.
File metadata
- Download URL: mapmatcher4gmns-0.1.8.tar.gz
- Upload date:
- Size: 46.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
92655cf2d26ccd3af5076f4f2d35d1fc4a148429b10854b95363f41cb9dd159c
|
|
| MD5 |
ace8fd8d78fc39588cefca5b1b6a5788
|
|
| BLAKE2b-256 |
a07ff7a07b741645face3d723845beca84720f5ea571c1af4dd705428a980fb3
|
File details
Details for the file mapmatcher4gmns-0.1.8-py3-none-any.whl.
File metadata
- Download URL: mapmatcher4gmns-0.1.8-py3-none-any.whl
- Upload date:
- Size: 44.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4ccf94484a7e375168dbb6029304a44a8b7fe225a720b095dde19ce0f8c302fc
|
|
| MD5 |
62c46395e9d0d171cfa9a09b9435aa93
|
|
| BLAKE2b-256 |
8d4a8d33f6266cf9a524a0b2f9f65d70c1c1cad895063eba58322870df74f540
|