Skip to main content

Adversarial estimator for structural models on graphs

Project description

Adversarial estimation on graphs

Adversarial estimator for graph structural models extends the theoretical framework proposed by Kaji et al. (2023) to structural models defined on graphs (strategic communication games, peer-effect models, etc.). With graph data, the entire network is typically a single realization; to create variability for the discriminator we sample subgraphs from both the ground truth and the synthetic graph and label them by origin. Intuitively, local neighborhoods contain enough information about global structure for discrimination; our current implementation uses $k$-hop ego sampling. Additional challenges include multiple equilibria in dynamic network models, lack of closed-form asymptotics, and the need to tailor the discriminator architecture to the structural class.

Formal setup

Observed data. Let $G=(X,Y,N,A)$ where:

  • $X\in\mathbb{R}^{n\times k}$ are exogenous node covariates (row $i$ is $x_i^\top$).
  • $Y\in\mathbb{R}^{n\times \ell}$ are endogenous node outcomes (row $i$ is $y_i^\top$).
  • $N={1,\dots,n}$ is the node index set.
  • $A\in{0,1}^{n\times n}$ is a symmetric adjacency matrix.

We work with a peer operator $P(A)$ that excludes self-links:

P(A) = A - \mathrm{diag}(A)

Optionally, $P(A)$ may be row/degree-normalized; we only require $\mathrm{diag}!\big(P(A)\big)=0$.

Structural model (generator). A structural mapping

m_\theta:\ \big(X,\ P(A),\ Y^{(0)},\ \xi\big)\ \longmapsto\ Y' \in \mathbb{R}^{n\times \ell}

takes covariates, the peer operator, and an initial outcome state $Y^{(0)}$ (e.g., pre-interaction signals, baselines, or zeros) and returns simulated outcomes $Y'$. The innovation $\xi$ captures simulation randomness if present.

  • Single-step (peer-to-peer, no self-loop):
Y' = m_\theta\!\Big(X,\ P(A)\,Y^{(0)},\ \xi\Big),

i.e., each $y_i'$ depends on $x_i$ and peers’ initial outcomes ${y_j^{(0)}: j\in\mathcal{N}(i)}$, but not on $y_i^{(0)}$ directly.

  • Multi-step propagation (optional): for $t=0,\dots,T-1$,
Y^{(t+1)} = m_\theta\!\Big(X,\ P(A)\,Y^{(t)},\ \xi^{(t)}\Big),
\qquad Y' \equiv Y^{(T)}.

Synthetic data. Define $G'(\theta)=(X,\ Y',\ N,\ A)$ with $Y'=m_\theta(X,P(A),Y^{(0)},\xi)$. Exogenous features and topology are held fixed so identification comes from matching the distribution of outcomes over sampled subgraphs.

Subgraph sampling. Let $\mathsf{S}$ denote a randomized sampler (e.g., $k$-hop ego nets or rooted random-walk subgraphs). Sampling from $G$ induces $p_{\mathrm{data}}^{\mathsf{S}}$ over subgraphs $g$; sampling from $G'(\theta)$ induces $p_{\theta}^{\mathsf{S}}$.

Discriminator. A GNN discriminator $D_\phi: g\mapsto[0,1]$ outputs the probability that $g$ came from $p_{\mathrm{data}}^{\mathsf{S}}$.

Adversarial objective (Goodfellow-style). Estimate $\theta$ by

\min_{\theta}\ \max_{\phi}\ 
\mathbb{E}_{g\sim p_{\mathrm{data}}^{\mathsf{S}}}\!\big[\log D_\phi(g)\big]
\;+\;
\mathbb{E}_{g\sim p_{\theta}^{\mathsf{S}}}\!\big[\log\big(1-D_\phi(g)\big)\big].

At the optimal discriminator this minimizes the Jensen–Shannon divergence between $p_{\mathrm{data}}^{\mathsf{S}}$ and $p_{\theta}^{\mathsf{S}}$.

Practical implementation

  • Discriminator. Implement $D_\phi$ with PyTorch Geometric. Use the same sampler $\mathsf{S}$ (e.g., $k$-hop ego nets or rooted random-walk subgraphs) for real and synthetic graphs to keep the target distribution fixed.

  • Generators.

    • Ground-truth generator: sampling manager over $G$ to produce $g\sim p_{\mathrm{data}}^{\mathsf{S}}$.
    • Synthetic generator: wraps $m_\theta$, reuses $X$, $A$ and the chosen initial state $Y^{(0)}$ (passed through the estimator), constructs $P(A)$ with zero diagonal, and exposes generate_outcomes(θ) for counterfactual simulation.
  • Optimization. Treat the outer problem as black-box in $\theta$. Bayesian optimization is a reasonable default; use binary cross-entropy from the objective above (not accuracy) as the scalar loss. The estimator exposes an outer_optimizer switch: keep "gp" (default, via skopt.gp_minimize) or set "nelder-mead" to activate SciPy's derivative-free simplex routine. Additional optimizer arguments can be supplied through outer_optimizer_params (legacy code using gp_params continues to work when the GP optimizer is selected). SciPy is declared as a project dependency.

linear_in_means_model.ipynb

A two-parameter testbed illustrating training curves and objective values. For linear-in-means,

Y = (I-\rho P)^{-1}(X\beta+\varepsilon),

the no-self-loop restriction holds via $P(A)$ and invertibility requires

|\rho| < 1/\lambda_{\max}(P).

Notes

  • Utilities currently target the linear-in-means demo; they should generalize by swapping $m_\theta$ and the sampler $\mathsf{S}$.
  • The demo’s GNN for $D_\phi$ is ad hoc; strong identification often tolerates simple discriminators.
  • Prefer cross-entropy over accuracy for the outer objective.

Reference

Kaji, T., Manresa, E., & Pouliot, G. (2023). An adversarial approach to structural estimation. Econometrica, 91(6), 2041–2063.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

adversarial_nets_lib_econ-0.1.3.tar.gz (23.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

adversarial_nets_lib_econ-0.1.3-py3-none-any.whl (16.7 kB view details)

Uploaded Python 3

File details

Details for the file adversarial_nets_lib_econ-0.1.3.tar.gz.

File metadata

File hashes

Hashes for adversarial_nets_lib_econ-0.1.3.tar.gz
Algorithm Hash digest
SHA256 988fbe88e74fd4ae4ac031437123399d5ec09ecff9eea12a766f0d3431978376
MD5 1ca4e97aae69062fd46433fc77322a70
BLAKE2b-256 6a54345c42ee24094b1fbc013a2f85b94ab5e64cac25f125b6fad4037d661dc1

See more details on using hashes here.

File details

Details for the file adversarial_nets_lib_econ-0.1.3-py3-none-any.whl.

File metadata

File hashes

Hashes for adversarial_nets_lib_econ-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 736abc95ef115ad7ccebf97b7219fd4b8d8f8c5e5c151a9c9598ce87fa13a1a4
MD5 5598536f43fb0d47f747a9e319383662
BLAKE2b-256 149f22d61acb10e110114a19d584bc9ced0d0a78a1a9013e41ce22c204468d70

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page