Skip to main content

Highly Scalable Distributed Node2Vec Algorithm Library

Project description

Node2Vec

PyPI pyversions PyPI license PyPI version Coverage Status

A highly scalable distributed node2vec algorithm

Installation

pip install node2vec-fugue

Release History

0.3.4

  • support checkpointing using fugue for deep traversal
  • add a node2vec implementation in native spark
  • add two working examples in fugue spark and native spark

0.3.1

  • 1st open-source version
  • highly scalable to graph with hundreds of millions of vertices and billions of edges
  • Can handle highly sparse graphs and skewed graphs

0.2.13

  • Refactor and add native PySpark node2vec

0.2.9

  • alternative persist in bfs
  • improve alias calculation

0.2.8

  • Significant improvement on handling hotspot vertices
  • Fix misuse of Fugue compute()

0.2.5

  • Add indexer of graph vertices
  • Allow trimming hotspot vertices

0.2.4

  • Use Apache-2.0 license

0.2.3

  • Add graph indexer for arbitrary vertex names
  • Refactor layout

0.2.2

  • support word2vec on either gensim or spark.ml.feature backend
  • fully tested

0.2.1

  • change the interface to support backend compute engine
  • use iterable to replace most pandas usage

0.2.0

  • support fugue based node2vec
  • not yet support input format validation and vertex indexing

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

node2vec-fugue-0.3.4.tar.gz (21.8 kB view hashes)

Uploaded Source

Built Distribution

node2vec_fugue-0.3.4-py3-none-any.whl (32.1 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page