gensim

Python Framework for Topic Modeling

Project description

Gensim is a Python framework for unsupervised learning from raw, unstructured digital texts.
It learns the hidden (*latent*) structure of a corpus.
Once that structure is found, documents can be expressed succinctly in terms of it, queried
for topical similarity, and so on.

Gensim includes the following features:
* Memory independence -- there is no need for the whole text corpus (or any
intermediate term-document matrices) to reside fully in RAM at any one time.
* Provides implementations for several popular topic inference algorithms,
including Latent Semantic Analysis (LSA, LSI) and Latent Dirichlet Allocation (LDA),
and makes adding new ones simple.
* Contains I/O wrappers and converters around several popular data formats.
* Allows similarity queries across documents in their latent, topical representation.
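
To make the last feature concrete, here is a minimal pure-Python sketch of a similarity query over documents that have already been projected into a latent topic space. It does not use gensim's API: the sparse `(topic_id, weight)` representation, the function names, and the toy vectors are all assumptions made for illustration, and cosine similarity is one common choice of measure rather than a statement of what gensim does internally.

```python
import math
from typing import List, Tuple

# Hypothetical representation: a document is a sparse list of (topic_id, weight).
SparseVector = List[Tuple[int, float]]


def cosine_similarity(a: SparseVector, b: SparseVector) -> float:
    """Cosine of the angle between two sparse vectors; 0.0 if either is empty."""
    weights_b = dict(b)
    dot = sum(w * weights_b.get(topic, 0.0) for topic, w in a)
    norm_a = math.sqrt(sum(w * w for _, w in a))
    norm_b = math.sqrt(sum(w * w for _, w in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_by_similarity(query: SparseVector, documents: List[SparseVector]):
    """Return (document_index, score) pairs, most similar first."""
    scores = [(idx, cosine_similarity(query, doc)) for idx, doc in enumerate(documents)]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Toy data (assumed): three documents described by two latent topics.
documents = [
    [(0, 1.0)],            # only topic 0
    [(0, 1.0), (1, 1.0)],  # both topics equally
    [(1, 1.0)],            # only topic 1
]
query = [(0, 1.0)]
ranking = rank_by_similarity(query, documents)
```

Because the comparison happens in topic space rather than word space, two documents can score as similar even when they share few literal terms.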

The principal design objectives behind gensim are:
1. Straightforward interfaces and low API learning curve for developers,
facilitating modifications and rapid prototyping.
2. Memory independence with respect to the size of the input corpus; all intermediate
steps and algorithms operate in a streaming fashion, processing one document
at a time.
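
The streaming objective can be illustrated with a small pure-Python sketch. It is not gensim code: `StreamedCorpus`, `document_frequencies`, the whitespace tokenizer, and the toy vocabulary are hypothetical names and choices for illustration. The point is only that a statistic over the whole corpus can be computed while holding a single document in memory.

```python
from collections import Counter
from typing import Dict, Iterable, Iterator, List, Tuple


class StreamedCorpus:
    """Hypothetical corpus that yields one bag-of-words document per iteration."""

    def __init__(self, lines: Iterable[str], vocab: Dict[str, int]) -> None:
        # `lines` must be re-iterable (one document per line); a real corpus
        # would typically reopen a file on every pass instead of holding a list.
        self.lines = lines
        self.vocab = vocab

    def __iter__(self) -> Iterator[List[Tuple[int, int]]]:
        for line in self.lines:
            counts = Counter(
                self.vocab[token]
                for token in line.lower().split()
                if token in self.vocab
            )
            # Sparse vector of (term_id, term_count), sorted by term_id.
            yield sorted(counts.items())


def document_frequencies(corpus: Iterable[List[Tuple[int, int]]]) -> Counter:
    """Count, for each term id, how many documents contain it, in one pass."""
    df: Counter = Counter()
    for doc in corpus:  # only the current document is held in memory
        df.update(term_id for term_id, _ in doc)
    return df


# Toy data (assumed) standing in for a large on-disk corpus.
vocab = {"human": 0, "computer": 1, "interface": 2}
lines = ["Human computer interface", "computer computer survey"]
corpus = StreamedCorpus(lines, vocab)
df = document_frequencies(corpus)
```

Any algorithm written against the iteration protocol, as `document_frequencies` is here, works unchanged whether the corpus holds two documents or two million.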

Project details

Release history

Version	Released
4.3.2	Aug 24, 2023
4.3.1	Mar 10, 2023
4.3.0	Dec 21, 2022
4.2.0	May 1, 2022
4.1.2	Sep 17, 2021
4.1.1	Sep 14, 2021
4.1.0	Aug 29, 2021
4.0.1	Apr 1, 2021
4.0.0	Mar 25, 2021
3.8.3	May 4, 2020
3.8.2	Apr 12, 2020
3.8.1	Sep 26, 2019
3.8.0	Jul 9, 2019
3.7.3	May 8, 2019
3.7.2	Apr 10, 2019
3.7.1	Jan 31, 2019
3.7.0	Jan 18, 2019
3.6.0	Sep 20, 2018
3.5.0	Jul 6, 2018
3.4.0	Mar 1, 2018
3.3.0	Feb 2, 2018
3.2.0	Dec 9, 2017
3.1.0	Nov 6, 2017
3.0.0	Sep 27, 2017
2.3.0	Jul 25, 2017
2.2.0	Jun 21, 2017
2.1.0	May 12, 2017
2.0.0	Apr 10, 2017
1.0.1	Mar 3, 2017
1.0.0	Feb 25, 2017
0.13.4	Dec 25, 2016
0.13.3	Oct 21, 2016
0.13.2	Aug 26, 2016
0.13.1	Jun 24, 2016
0.13.0	Jun 22, 2016
0.12.4	Jan 31, 2016
0.12.3	Nov 6, 2015
0.12.2	Sep 19, 2015
0.12.1	Jul 20, 2015
0.12.0	Jul 6, 2015
0.11.1	Apr 11, 2015
0.10.3	Nov 19, 2014
0.10.2	Sep 18, 2014
0.10.1	Jul 22, 2014
0.10.0	Jun 4, 2014
0.9.1	Apr 12, 2014
0.9.0	Mar 15, 2014
0.8.9	Dec 26, 2013
0.8.8	Nov 3, 2013
0.8.7	Sep 18, 2013
0.8.6	Sep 15, 2012
0.8.5	Jul 22, 2012
0.8.4	Mar 9, 2012
0.8.3	Dec 2, 2011
0.8.2	Oct 30, 2011
0.8.1	Oct 10, 2011
0.8.0	Jun 28, 2011
0.7.8	Mar 26, 2011
0.7.7	Feb 13, 2011
0.7.6	Jan 10, 2011
0.7.5	Nov 3, 2010
0.7.4	Sep 13, 2010
0.7.3	Sep 7, 2010
0.7.2	Sep 1, 2010
0.7.1	Aug 28, 2010
0.7.0	Aug 28, 2010
0.6.0	Jun 19, 2010
0.5.0	Apr 28, 2010
0.4.7	Apr 27, 2010
0.4.6	Apr 17, 2010
0.4.5	Apr 5, 2010
0.4.4	Mar 30, 2010
0.4.3	Mar 29, 2010
0.4.2	Mar 28, 2010
0.4.1	Mar 19, 2010
0.4	Mar 19, 2010
0.3.0	Mar 18, 2010 (this version)
0.2	Mar 18, 2010

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gensim-0.3.0.tar.gz (124.4 kB)

Uploaded Mar 18, 2010 Source

Built Distribution

gensim-0.3.0-py2.5.egg (130.8 kB)

Uploaded Mar 18, 2010 Source

Hashes for gensim-0.3.0.tar.gz

Algorithm	Hash digest
SHA256	`9a43e91473b3b7b8471e1ac5d3a2df1615f6d14a343de4bf088504787c24a3b5`
MD5	`1009141ab11f4b6520a83b4300505cd4`
BLAKE2b-256	`0eb23c062f4009408209bb899dceb9093d814c76d5fec141136ae5f9075e9e81`

Hashes for gensim-0.3.0-py2.5.egg

Algorithm	Hash digest
SHA256	`6baeb63d82020307ef43a777f91730230d1f9304a0dc6b0433771f4026be9ff6`
MD5	`a2d0ef0fb9b4a6d7224ec102ddfb6670`
BLAKE2b-256	`526688ec78bc4ea8aa2318ccf44f9fd2349ebfef75e6ab37fdea8b7974b7696e`
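
A downloaded file can be checked against the digests listed above using only Python's standard `hashlib` module. The sketch below is one way to do it, not an official PyPI tool: the function names `file_digests` and `matches_expected` are hypothetical, and the file path in the usage note assumes the archive was saved to the current directory. BLAKE2b-256 is computed as BLAKE2b with a 32-byte digest.

```python
import hashlib
from typing import Dict


def file_digests(path: str, chunk_size: int = 1 << 20) -> Dict[str, str]:
    """Compute the three digests shown in the tables, reading the file in chunks."""
    hashers = {
        "SHA256": hashlib.sha256(),
        "MD5": hashlib.md5(),
        "BLAKE2b-256": hashlib.blake2b(digest_size=32),
    }
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files never need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            for hasher in hashers.values():
                hasher.update(chunk)
    return {name: hasher.hexdigest() for name, hasher in hashers.items()}


def matches_expected(path: str, algorithm: str, expected_hex: str) -> bool:
    """Return True if the file's digest for `algorithm` equals `expected_hex`."""
    return file_digests(path)[algorithm] == expected_hex.strip().lower()
```

For example, `matches_expected("gensim-0.3.0.tar.gz", "SHA256", "9a43e91473b3b7b8471e1ac5d3a2df1615f6d14a343de4bf088504787c24a3b5")` should return `True` for an intact copy of the source distribution. SHA256 or BLAKE2b-256 is the better check, since MD5 is not collision-resistant.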