Building an index of GPT summaries.

Project description

🗂️ ️GPT Index

GPT Index is a project consisting of a set of data structures that are created using LLMs and can be traversed using LLMs in order to answer queries.

PyPi: https://pypi.org/project/gpt-index/.

Documentation: https://gpt-index.readthedocs.io/en/latest/.

🚀 Overview

NOTE: This README is not updated as frequently as the documentation. Please check out the documentation above for the latest updates!

Context

LLMs are a phenomenal piece of technology for knowledge generation and reasoning.
A big limitation of LLMs is context size (e.g. OpenAI's davinci model for GPT-3 has a limit of 4096 tokens. Large, but not infinite).
The ability to feed "knowledge" to LLMs is restricted to this limited prompt size and model weights.
Thought: what if LLMs could have access to potentially a much larger database of knowledge without retraining/finetuning?

Proposed Solution

That's where GPT Index comes in. GPT Index is a simple, flexible interface between your external data and LLMs. It resolves the following pain points:

Provides simple data structures to resolve prompt size limitations.
Offers data connectors to your external data sources.
Offers you a comprehensive toolset trading off cost and performance.

At the core of GPT Index is a data structure. Instead of relying on world knowledge encoded in model weights, a GPT Index data structure does the following:

Uses a pre-trained LLM primarily for reasoning/summarization instead of prior knowledge.
Takes a large corpus of text data as input and builds a structured index over it (using an LLM or heuristics).
Allow users to query the index in order to synthesize an answer to the question - this requires both traversal of the index as well as a synthesis of the answer.

💡 Contributing

Interesting in contributing? See our Contribution Guide for more details.

📄 Documentation

Full documentation can be found here: https://gpt-index.readthedocs.io/en/latest/.

Please check it out for the most up-to-date tutorials, how-to guides, references, and other resources!

💻 Example Usage

pip install gpt-index

Examples are in the examples folder. Indices are in the indices folder (see list of indices below).

To build a tree index:

from gpt_index import GPTTreeIndex, SimpleDirectoryReader
documents = SimpleDirectoryReader('data').load_data()
index = GPTTreeIndex(documents)

To save to and load from disk:

# save to disk
index.save_to_disk('index.json')
# load from disk
index = GPTTreeIndex.load_from_disk('index.json')

To query:

index.query("<question_text>?", child_branch_factor=1)

🔧 Dependencies

The main third-party package requirements are tiktoken, openai, and langchain.

All requirements should be contained within the setup.py file. To run the package locally without building the wheel, simply run pip install -r requirements.txt.

Project details

Release history Release notifications | RSS feed

0.8.42

Oct 10, 2023

0.8.41

Oct 7, 2023

0.8.40

Oct 5, 2023

0.8.39.post2

Oct 4, 2023

0.8.39

Oct 4, 2023

0.8.38

Oct 2, 2023

0.8.37

Sep 30, 2023

0.8.36

Sep 27, 2023

0.8.35

Sep 27, 2023

0.8.34

Sep 26, 2023

0.8.33

Sep 25, 2023

0.8.32

Sep 24, 2023

0.8.31

Sep 22, 2023

0.8.30

Sep 21, 2023

0.8.29.post1

Sep 18, 2023

0.8.29

Sep 18, 2023

0.8.28

Sep 16, 2023

0.8.28a1 pre-release

Sep 15, 2023

0.8.27

Sep 14, 2023

0.8.26.post1

Sep 13, 2023

0.8.26

Sep 13, 2023

0.8.25

Sep 12, 2023

0.8.24.post1

Sep 11, 2023

0.8.24

Sep 11, 2023

0.8.23.post1

Sep 9, 2023

0.8.23

Sep 9, 2023

0.8.22

Sep 8, 2023

0.8.21

Sep 6, 2023

0.8.20

Sep 4, 2023

0.8.19

Sep 3, 2023

0.8.18

Sep 3, 2023

0.8.17

Sep 2, 2023

0.8.16

Sep 1, 2023

0.8.15

Aug 31, 2023

0.8.14

Aug 30, 2023

0.8.13

Aug 29, 2023

0.8.12

Aug 28, 2023

0.8.11.post3

Aug 27, 2023

0.8.11.post2

Aug 27, 2023

0.8.11.post1

Aug 27, 2023

0.8.11

Aug 27, 2023

0.8.10.post1

Aug 26, 2023

0.8.10

Aug 26, 2023

0.8.9

Aug 24, 2023

0.8.8

Aug 23, 2023

0.8.7

Aug 22, 2023

0.8.6

Aug 22, 2023

0.8.5.post2

Aug 21, 2023

0.8.5.post1

Aug 19, 2023

0.8.5

Aug 18, 2023

0.8.4

Aug 17, 2023

0.8.3

Aug 16, 2023

0.8.2.post1

Aug 15, 2023

0.8.2

Aug 14, 2023

0.8.1.post1

Aug 13, 2023

0.8.1

Aug 13, 2023

0.8.0

Aug 11, 2023

0.7.24.post1

Aug 11, 2023

0.7.23

Aug 10, 2023

0.7.22

Aug 8, 2023

0.7.21

Aug 7, 2023

0.7.20

Aug 6, 2023

0.7.19

Aug 4, 2023

0.7.18

Aug 3, 2023

0.7.17

Aug 2, 2023

0.7.16

Jul 30, 2023

0.7.15

Jul 29, 2023

0.7.14

Jul 28, 2023

0.7.13

Jul 26, 2023

0.7.12

Jul 25, 2023

0.7.11.post1

Jul 20, 2023

0.7.11

Jul 20, 2023

0.7.10.post1

Jul 18, 2023

0.7.10

Jul 18, 2023

0.7.9

Jul 16, 2023

0.7.8

Jul 14, 2023

0.7.7

Jul 13, 2023

0.7.6

Jul 12, 2023

0.7.5

Jul 12, 2023

0.7.4

Jul 8, 2023

0.7.3

Jul 8, 2023

0.7.2

Jul 6, 2023

0.7.1

Jul 5, 2023

0.7.0

Jul 4, 2023

0.6.38.post1

Jul 2, 2023

0.6.38

Jul 2, 2023

0.6.37

Jun 30, 2023

0.6.36

Jun 30, 2023

0.6.35

Jun 28, 2023

0.6.34.post1

Jun 26, 2023

0.6.34

Jun 26, 2023

0.6.33

Jun 25, 2023

0.6.32

Jun 23, 2023

0.6.31

Jun 22, 2023

0.6.30

Jun 21, 2023

0.6.29

Jun 20, 2023

0.6.28

Jun 19, 2023

0.6.27

Jun 17, 2023

0.6.26

Jun 14, 2023

0.6.25.post1

Jun 13, 2023

0.6.25

Jun 13, 2023

0.6.24

Jun 12, 2023

0.6.23

Jun 11, 2023

0.6.22

Jun 10, 2023

0.6.21.post1

Jun 7, 2023

0.6.20

Jun 5, 2023

0.6.19

Jun 4, 2023

0.6.18

Jun 3, 2023

0.6.17

Jun 2, 2023

0.6.16.post1

Jun 1, 2023

0.6.16

Jun 1, 2023

0.6.15

May 31, 2023

0.6.14

May 30, 2023

0.6.13

May 28, 2023

0.6.12

May 27, 2023

0.6.11

May 25, 2023

0.6.10.post1

May 24, 2023

0.6.10

May 24, 2023

0.6.9

May 19, 2023

0.6.8

May 16, 2023

0.6.7

May 14, 2023

0.6.6

May 13, 2023

0.6.5

May 11, 2023

0.6.4

May 10, 2023

0.6.3

May 10, 2023

0.6.2

May 8, 2023

0.6.1

May 5, 2023

0.6.0

May 2, 2023

0.6.0a7 pre-release

May 2, 2023

0.6.0a6 pre-release

May 2, 2023

0.6.0a5 pre-release

May 1, 2023

0.6.0a4 pre-release

May 1, 2023

0.6.0a3 pre-release

Apr 30, 2023

0.6.0a2 pre-release

Apr 29, 2023

0.6.0a1 pre-release

Apr 28, 2023

0.5.27

Apr 28, 2023

0.5.26

Apr 28, 2023

0.5.25

Apr 26, 2023

0.5.23.post1

Apr 24, 2023

0.5.23

Apr 23, 2023

0.5.22

Apr 23, 2023

0.5.20

Apr 20, 2023

0.5.19

Apr 20, 2023

0.5.18

Apr 19, 2023

0.5.17.post1

Apr 18, 2023

0.5.17

Apr 18, 2023

0.5.16

Apr 17, 2023

0.5.15

Apr 13, 2023

0.5.13.post1

Apr 12, 2023

0.5.13

Apr 12, 2023

0.5.12

Apr 10, 2023

0.5.11

Apr 9, 2023

0.5.10

Apr 7, 2023

0.5.9

Apr 6, 2023

0.5.8

Apr 5, 2023

0.5.7

Apr 4, 2023

0.5.6

Apr 3, 2023

0.5.5

Apr 2, 2023

0.5.4

Mar 31, 2023

0.5.3

Mar 31, 2023

0.5.2

Mar 30, 2023

0.5.1

Mar 29, 2023

0.5.0

Mar 28, 2023

0.4.40

Mar 27, 2023

0.4.39

Mar 26, 2023

0.4.38

Mar 25, 2023

0.4.37

Mar 24, 2023

0.4.36

Mar 23, 2023

0.4.35.post1

Mar 22, 2023

0.4.35

Mar 22, 2023

0.4.34

Mar 21, 2023

0.4.33

Mar 20, 2023

0.4.32

Mar 19, 2023

0.4.31

Mar 19, 2023

0.4.30

Mar 18, 2023

0.4.29

Mar 16, 2023

0.4.28

Mar 14, 2023

0.4.27

Mar 13, 2023

0.4.26

Mar 11, 2023

0.4.25

Mar 10, 2023

0.4.24

Mar 9, 2023

0.4.23

Mar 8, 2023

0.4.22.post1

Mar 7, 2023

0.4.22

Mar 7, 2023

0.4.21

Mar 6, 2023

0.4.20

Mar 4, 2023

0.4.19

Mar 3, 2023

0.4.18

Mar 2, 2023

0.4.17

Mar 1, 2023

0.4.16

Mar 1, 2023

0.4.15

Feb 28, 2023

0.4.14

Feb 26, 2023

0.4.13

Feb 25, 2023

0.4.12

Feb 24, 2023

0.4.11

Feb 24, 2023

0.4.10

Feb 23, 2023

0.4.9

Feb 23, 2023

0.4.8

Feb 21, 2023

0.4.7

Feb 20, 2023

0.4.6

Feb 19, 2023

0.4.5

Feb 18, 2023

0.4.4

Feb 16, 2023

0.4.1

Feb 10, 2023

0.4.0

Feb 8, 2023

0.3.6

Feb 7, 2023

0.3.5

Feb 5, 2023

0.3.4

Feb 3, 2023

0.3.3

Feb 2, 2023

0.3.2

Feb 1, 2023

0.3.1

Jan 31, 2023

0.3.0

Jan 30, 2023

0.2.17

Jan 29, 2023

0.2.16

Jan 27, 2023

0.2.15

Jan 27, 2023

0.2.14

Jan 26, 2023

0.2.13

Jan 25, 2023

0.2.12

Jan 23, 2023

0.2.11

Jan 22, 2023

0.2.10

Jan 21, 2023

0.2.9

Jan 20, 2023

0.2.8

Jan 19, 2023

0.2.7

Jan 18, 2023

0.2.6

Jan 17, 2023

0.2.5

Jan 16, 2023

0.2.4

Jan 15, 2023

0.2.3

Jan 13, 2023

This version

0.2.2

Jan 12, 2023

0.2.1

Jan 10, 2023

0.2.0

Jan 8, 2023

0.1.18

Jan 7, 2023

0.1.17

Jan 5, 2023

0.1.16

Jan 4, 2023

0.1.15

Jan 4, 2023

0.1.14

Jan 3, 2023

0.1.13

Jan 2, 2023

0.1.12

Dec 31, 2022

0.1.11

Dec 30, 2022

0.1.10

Dec 30, 2022

0.1.9

Dec 27, 2022

0.1.8

Dec 26, 2022

0.1.7

Dec 25, 2022

0.1.6

Dec 24, 2022

0.1.5

Dec 23, 2022

0.1.4

Dec 22, 2022

0.1.3

Dec 21, 2022

0.1.2

Dec 19, 2022

0.1.1

Dec 18, 2022

0.1.0

Dec 9, 2022

0.0.10

Dec 5, 2022

0.0.9

Dec 4, 2022

0.0.8

Dec 4, 2022

0.0.7

Nov 29, 2022

0.0.6

Nov 28, 2022

0.0.5

Nov 23, 2022

0.0.4

Nov 22, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gpt_index-0.2.2.tar.gz (83.7 kB view details)

Uploaded Jan 12, 2023 Source

File details

Details for the file gpt_index-0.2.2.tar.gz.

File metadata

Download URL: gpt_index-0.2.2.tar.gz
Upload date: Jan 12, 2023
Size: 83.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.9.16

File hashes

Hashes for gpt_index-0.2.2.tar.gz
Algorithm	Hash digest
SHA256	`ea832d6a5f6b245bedcdcdfcadc360157c8f20214c557eb081799c9944989c79`
MD5	`adaa72901c09c7ff9a72253ee79eae23`
BLAKE2b-256	`3b0ba483b12432e7f77d98d01dfcebeafb0cbc58af1a0228fd27fa4b805b98f8`