MCP server for the Workplace Gender Equality Agency (WGEA) public data file. Plain-English access to per-employer workforce composition, gender-equality policy answers, parental leave, flexible work, and harm-prevention data — every WGEA-reporting employer in Australia, every year, with fuzzy employer-name search and a CC-BY 3.0 AU attribution contract.
Project description
wgea-mcp
MCP server for the Workplace Gender Equality Agency (WGEA) public data file. Plain-English access to per-employer workforce composition, gender-equality policy answers, parental leave, flexible work, and harm-prevention data — every WGEA-reporting employer in Australia (~9,600 employers), every year, from a single uvx command.
"What's the gender breakdown at Commonwealth Bank?"
"Which mining companies set gender targets in 2024-25?"
"Workforce composition by occupation at Qantas"
"Sexual harassment policy responses across financial services"
"Promotions to manager by gender at Atlassian"
Sister to abs-mcp, rba-mcp, ato-mcp, apra-mcp, aihw-mcp, asic-mcp, and au-weather-mcp.
Install
uvx --upgrade wgea-mcp
Claude Desktop
{
"mcpServers": {
"wgea": { "command": "uvx", "args": ["--upgrade", "wgea-mcp"] }
}
}
Claude Code
claude mcp add wgea --command uvx --args -- --upgrade wgea-mcp
What it exposes
Five tools, all plain-English in, structured out:
| Tool | Purpose |
|---|---|
search_datasets |
Fuzzy-search the curated catalog by keyword |
describe_dataset |
List a dataset's filterable dimensions and returnable measures |
get_data |
Query with filters, period range, output format |
latest |
Restrict to the latest reporting year |
list_curated |
Enumerate the curated dataset IDs |
Every response is the same shape — dataset_id, dataset_name, query, reporting_year, unit, row_count, records, source_url, download_url, did_you_mean, attribution, stale flag, server_version.
Curated datasets (7 in v0.1)
| ID | What it is | Source CSV |
|---|---|---|
WORKFORCE_COMPOSITION |
Per-employer headcount by occupation × manager category × gender | wgea_workforce_composition_<year>.csv |
WORKFORCE_MANAGEMENT |
Manager movements (promotions, hires, resignations) by gender | wgea_workforce_management_statistics_<year>.csv |
GENDER_EQUALITY_ACTIONS |
Pay-gap analyses, gender targets, governance — Q&A responses | wgea_questionnaire_action_on_gender_equality_<year>.csv |
PARENTAL_LEAVE_FLEX |
Parental leave + flexible-work policy responses | wgea_questionnaire_flexible_work_<year>.csv |
HARM_PREVENTION |
Sexual harassment + domestic-violence policy responses | wgea_questionnaire_harm_prevention_<year>.csv |
EMPLOYEE_SUPPORT |
Carer leave, EAP, mental-health programs | wgea_questionnaire_employee_support_<year>.csv |
WORKPLACE_OVERVIEW |
Board composition, governing-body diversity, CEO + KMP demographics | wgea_questionnaire_workplace_overview_<year>.csv |
Note on the headline gender-pay-gap %. WGEA's Data Explorer publishes a headline per-employer gender pay gap percentage. That specific aggregate is NOT included in the public CSV release — WGEA pre-aggregates remuneration data before public publication. Use this MCP for the underlying workforce composition + policy detail; use WGEA's Data Explorer for the headline pay-gap percentage.
Reliability — 2-tier URL resolution
WGEA publishes the public data file annually under a single CKAN package on data.gov.au. Each annual release gets a fresh resource UUID:
- Live CKAN —
package_show?id=wgea-datasetreturns every resource; the newest "WGEA Data — Public Data File" wins. Cached 6h. - Bundled seed manifest — when CKAN is unreachable, fall back to
data/seed_urls.jsonshipped in the wheel. The response is flaggedstale: truewith an honest reason.
Net effect: a fresh uvx wgea-mcp always gets the current reporting year; a 12-month-old install still works because the seed manifest is refreshed and --upgrade pulls a new wheel.
Fuzzy employer-name search
Pass any abbreviation, alias, or substring and rapidfuzz resolves it:
| You type | Resolved to |
|---|---|
"CBA" |
Commonwealth Bank of Australia |
"Commonwealth Bank" |
Commonwealth Bank of Australia |
"NAB" |
National Australia Bank Limited |
"Westpac" |
Westpac Banking Corporation |
"Woolies" / "woolworths" |
Woolworths Group Limited |
"Atlassian" |
Atlassian Pty Ltd |
"qantas" |
Qantas Airways Limited |
When nothing exact matches, did_you_mean carries the top-5 closest legal names so the agent can ask the user to pick.
Attribution
Data sourced from the Workplace Gender Equality Agency. Licensed under Creative Commons Attribution 3.0 Australia (CC BY 3.0 AU). wgea-mcp is MIT-licensed; WGEA's data carries the upstream CC-BY 3.0 AU licence, echoed in every response's attribution field.
Per-employer reporting is a deliberate disclosure under the Workplace Gender Equality Act 2012 — redistribution is explicitly intended.
Sister MCPs (Australian Public Data portfolio)
- abs-mcp — Australian Bureau of Statistics (CPI, unemployment, ERP, building approvals)
- rba-mcp — Reserve Bank of Australia (cash rate, lending stats, exchange rates)
- ato-mcp — Australian Taxation Office (tax stats, ACNC charities)
- apra-mcp — Australian Prudential Regulation Authority (banking, insurance, super)
- aihw-mcp — Australian Institute of Health and Welfare
- asic-mcp — Australian Securities and Investments Commission (company registers)
- aemo-mcp — Australian Energy Market Operator (NEM dispatch, spot prices, generation)
- au-weather-mcp — Open-Meteo (Bureau of Meteorology aggregator)
- wgea-mcp — this one. Workplace gender equality.
- aus-identity — Postcode / state / ABN normalisation helper used by all sisters
Development
git clone https://github.com/Bigred97/wgea-mcp.git
cd wgea-mcp
uv venv
uv pip install -e ".[dev]"
pytest # unit tests
pytest -m live # integration tests against data.gov.au (downloads the ~71 MB ZIP)
Issues and contributions welcome: github.com/Bigred97/wgea-mcp/issues.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file wgea_mcp-0.1.5.tar.gz.
File metadata
- Download URL: wgea_mcp-0.1.5.tar.gz
- Upload date:
- Size: 385.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
bba44238f487f9059360f3e1bf43240274b3564bc6c91b0212026ba75e6e91ca
|
|
| MD5 |
8badb7d440c6bfe4e27ccc3d74990e99
|
|
| BLAKE2b-256 |
0402c8ec952352f416d2cef2a97b17ed6d545d1ffd3232e3517782aab71b4165
|
Provenance
The following attestation bundles were made for wgea_mcp-0.1.5.tar.gz:
Publisher:
publish.yml on Bigred97/wgea-mcp
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
wgea_mcp-0.1.5.tar.gz -
Subject digest:
bba44238f487f9059360f3e1bf43240274b3564bc6c91b0212026ba75e6e91ca - Sigstore transparency entry: 1542161236
- Sigstore integration time:
-
Permalink:
Bigred97/wgea-mcp@158b883611f32857114a1e4c883060a1c497a31a -
Branch / Tag:
refs/tags/v0.1.5 - Owner: https://github.com/Bigred97
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@158b883611f32857114a1e4c883060a1c497a31a -
Trigger Event:
release
-
Statement type:
File details
Details for the file wgea_mcp-0.1.5-py3-none-any.whl.
File metadata
- Download URL: wgea_mcp-0.1.5-py3-none-any.whl
- Upload date:
- Size: 50.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
53392e01350952e528a077aede5f4f71ad04db93311b055f47f521c1516aea60
|
|
| MD5 |
bed3a07769c8a47ed6c619ceddf819a1
|
|
| BLAKE2b-256 |
7eafe4c89b0611de0b7dd7f49cee2c244b4a6808cd2acac04027feffec701242
|
Provenance
The following attestation bundles were made for wgea_mcp-0.1.5-py3-none-any.whl:
Publisher:
publish.yml on Bigred97/wgea-mcp
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
wgea_mcp-0.1.5-py3-none-any.whl -
Subject digest:
53392e01350952e528a077aede5f4f71ad04db93311b055f47f521c1516aea60 - Sigstore transparency entry: 1542161341
- Sigstore integration time:
-
Permalink:
Bigred97/wgea-mcp@158b883611f32857114a1e4c883060a1c497a31a -
Branch / Tag:
refs/tags/v0.1.5 - Owner: https://github.com/Bigred97
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@158b883611f32857114a1e4c883060a1c497a31a -
Trigger Event:
release
-
Statement type: