Python Sdk for Milvus

These details have not been verified by PyPI

Project links

Homepage

Project description

Milvus Python SDK

Python SDK for Milvus. To contribute code to this project, please read our contribution guidelines first.

For detailed SDK documentation, refer to API Documentation.

New features
Get started
Basic operations
Connect to the Milvus server
Create/Drop collections
- Create a collection
- Drop a collection
Create/Drop partitions in a collection
- Create a partition
- Drop a partition
Create/Drop indexes in a collection
- Create an index
- Drop an index
Insert/Delete vectors in collections/partitions
Flush data in one or multiple collections to disk
Compact all segments in a collection
Search vectors in collections/partitions
- Search vectors in a collection
- Search vectors in a partition
Disconnect from the Milvus server
FAQ

Get started

Prerequisites

pymilvus only supports Python 3.6 or higher.

Install pymilvus

You can install pymilvus via pip or pip3 for Python3:

$ pip3 install pymilvus

The following collection shows Milvus versions and recommended pymilvus versions:

Milvus version	Recommended pymilvus version
0.10.6	0.4.0
0.10.5	0.2.15, 0.4.0
0.10.1 - 0.10.4	0.2.14
0.10.0	0.2.13
0.9.1	0.2.12
0.9.0	0.2.11
0.8.0	0.2.10
0.7.1	0.2.9
0.7.0	0.2.8
0.6.0	0.2.6, 0.2.7
0.5.3	0.2.5
0.5.2	0.2.3
0.5.1	0.2.3
0.5.0	0.2.3
0.4.0	0.2.2
0.3.1	0.1.25
0.3.0	0.1.13

You can install a specific version of pymilvus by:

$ pip install pymilvus==0.4.0

You can upgrade pymilvus to the latest version by:

$ pip install --upgrade pymilvus

Examples

Refer to examples for more example programs.

Basic operations

Connect to the Milvus server

Import pymilvus.

# Import pymilvus
>>> from milvus import Milvus, IndexType, MetricType, Status

Create a client to Milvus server by using one of the following methods:
```
# Connect to Milvus server
>>> client = Milvus(host='localhost', port='19530')
```
Note: In the above code, default values are used for host and port parameters. Feel free to change them to the IP address and port you set for Milvus server.
```
>>> client = Milvus(uri='tcp://localhost:19530')
```

Create/Drop collections

Create a collection

Prepare collection parameters.

# Prepare collection parameters
>>> param = {'collection_name':'test01', 'dimension':128, 'index_file_size':1024, 'metric_type':MetricType.L2}

Create collection test01 with dimension size as 128, size of the data file for Milvus to automatically create indexes as 1024, and metric type as Euclidean distance (L2).
```
# Create a collection
>>> status = client.create_collection(param)
>>> status
Status(code=0, message='Create collection successfully!')
```

Drop a collection

# Drop collection
>>> status = client.drop_collection(collection_name='test01')
>>> status
Status(code=0, message='Delete collection successfully!')

Create/Drop partitions in a collection

Create a partition

You can split collections into partitions by partition tags for improved search performance. Each partition is also a collection.

# Create partition
>>> status = client.create_partition(collection_name='test01', partition_tag='tag01')
>>> status
Status(code=0, message='OK')

Use list_partitions() to verify whether the partition is created.

# Show partitions
>>> status, partitions = client.list_partitions(collection_name='test01')
>>> partitions
[(collection_name='test01', tag='_default'), (collection_name='test01', tag='tag01')]

Drop a partition

>>> status = client.drop_partition(collection_name='test01', partition_tag='tag01')
Status(code=0, message='OK')

Create/Drop indexes in a collection

Create an index

Note: In production, it is recommended to create indexes before inserting vectors into the collection. Index is automatically built when vectors are being imported. However, you need to create the same index again after the vector insertion process is completed because some data files may not meet the index_file_size and index will not be automatically built for these data files.

Prepare index parameters. The following command uses IVF_FLAT index type as an example.
```
# Prepare index param
>>> ivf_param = {'nlist': 4096}
```

Create an index for the collection.

# Create index
>>> status = client.create_index('test01', IndexType.IVF_FLAT, ivf_param)
Status(code=0, message='Build index successfully!')

Drop an index

>>> status = client.drop_index('test01')
Status(code=0, message='OK')

Insert/Delete vectors in collections/partitions

Insert vectors in a collection

Generate 20 vectors of 128 dimension.

>>> import random
>>> dim = 128
# Generate 20 vectors of 128 dimension
>>> vectors = [[random.random() for _ in range(dim)] for _ in range(20)]

Insert the list of vectors. If you do not specify vector ids, Milvus automatically generates IDs for the vectors.

# Insert vectors
>>> status, inserted_vector_ids = client.insert(collection_name='test01', records=vectors)
>>> inserted_vector_ids 
[1592028661511657000, 1592028661511657001, 1592028661511657002, 1592028661511657003, 1592028661511657004, 1592028661511657005, 1592028661511657006, 1592028661511657007, 1592028661511657008, 1592028661511657009, 1592028661511657010, 1592028661511657011, 1592028661511657012, 1592028661511657013, 1592028661511657014, 1592028661511657015, 1592028661511657016, 1592028661511657017, 1592028661511657018, 1592028661511657019]

Alternatively, you can also provide user-defined vector ids:

>>> vector_ids = [id for id in range(20)]
>>> status, inserted_vector_ids = client.insert(collection_name='test01', records=vectors, ids=vector_ids)

Insert vectors in a partition

>>> status, inserted_vector_ids = client.insert('test01', vectors, partition_tag="tag01")

To verify the vectors you have inserted, use get_vector_by_id(). Assume you have vector with the following ID.

>>> status, vector = client.get_entity_by_id(collection_name='test01', ids=inserted_vector_ids[:10])

Delete vectors by ID

You can delete these vectors by:

>>> status = client.delete_entity_by_id('test01', inserted_vector_ids[:10])
>>> status
Status(code=0, message='OK')

Flush data in one or multiple collections to disk

When performing operations related to data changes, you can flush the data from memory to disk to avoid possible data loss. Milvus also supports automatic flushing, which runs at a fixed interval to flush the data in all collections to disk. You can use the Milvus server configuration file to set the interval.

>>> status = client.flush(['test01'])
>>> status
Status(code=0, message='OK')

Compact all segments in a collection

A segment is a data file that Milvus automatically creates by merging inserted vector data. A collection can contain multiple segments. If some vectors are deleted from a segment, the space taken by the deleted vectors cannot be released automatically. You can compact segments in a collection to release space.

>>> status = client.compact(collection_name='test01')
>>> status
Status(code=0, message='OK')

Search vectors in collections/partitions

Search vectors in a collection

Prepare search parameters.

>>> search_param = {'nprobe': 16}

Search vectors.

# create 5 vectors of 32-dimension
>>> q_records = [[random.random() for _ in range(dim)] for _ in range(5)]
# search vectors
>>> status, results = client.search(collection_name='test01', query_records=q_records, top_k=2, params=search_param)
>>> results
[
[(id:1592028661511657012, distance:19.450458526611328), (id:1592028661511657017, distance:20.13418197631836)],
[(id:1592028661511657012, distance:19.12230682373047), (id:1592028661511657018, distance:20.221458435058594)],
[(id:1592028661511657014, distance:20.423980712890625), (id:1592028661511657016, distance:20.984281539916992)],
[(id:1592028661511657018, distance:18.37057876586914), (id:1592028661511657019, distance:19.366962432861328)],
[(id:1592028661511657013, distance:19.522361755371094), (id:1592028661511657010, distance:20.304216384887695)]
]

Search vectors in a partition

# create 5 vectors of 32-dimension
>>> q_records = [[random.random() for _ in range(dim)] for _ in range(5)]
>>> client.search(collection_name='test01', query_records=q_records, top_k=1, partition_tags=['tag01'], params=search_param)

Note: If you do not specify partition_tags, Milvus searches the whole collection.

close client

>>> client.close()

FAQ

I'm getting random "socket operation on non-socket" errors from gRPC when connecting to Milvus from an application served on Gunicorn

Make sure to set the environment variable GRPC_ENABLE_FORK_SUPPORT=1. For reference, see https://zhuanlan.zhihu.com/p/136619485

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

1.0.0

Feb 24, 2021

0.6.1

Dec 21, 2020

0.6.0

Dec 9, 2020

0.5.4

Dec 1, 2020

0.5.3

Nov 5, 2020

0.5.2

Nov 4, 2020

0.5.1

Nov 3, 2020

0.5.0

Nov 3, 2020

0.4.16

Oct 16, 2020

0.4.15

Oct 15, 2020

0.4.14

Oct 12, 2020

0.4.13

Aug 25, 2020

0.4.12

Aug 25, 2020

0.4.11

Aug 25, 2020

0.4.10

Aug 22, 2020

0.4.9

Aug 13, 2020

0.4.8

Aug 10, 2020

0.4.7

Aug 5, 2020

0.4.6

Aug 5, 2020

0.4.5

Aug 4, 2020

0.4.4

Jul 31, 2020

0.4.3

Jul 28, 2020

0.4.2

Jul 27, 2020

0.4.1

Jul 27, 2020

0.4.0

Jul 27, 2020

0.3.39

Feb 24, 2021

0.3.38

Feb 24, 2021

0.3.37

Dec 25, 2020

0.3.36

Dec 24, 2020

0.3.35

Dec 21, 2020

0.3.34

Jul 9, 2020

0.3.33

Jul 6, 2020

0.3.32

Jun 15, 2020

0.3.31

Jun 13, 2020

0.3.30

Jun 12, 2020

0.3.29

Jun 11, 2020

0.3.28

Jun 10, 2020

0.3.27

Jun 9, 2020

0.3.26

Jun 8, 2020

0.3.25

Jun 2, 2020

0.3.24

Jun 1, 2020

0.3.23

May 29, 2020

0.3.22

May 29, 2020

0.3.21

May 19, 2020

0.3.20

May 19, 2020

0.3.19

May 19, 2020

0.3.18

May 16, 2020

0.3.17

May 15, 2020

0.3.16

May 14, 2020

0.3.15

May 12, 2020

0.3.14

May 11, 2020

0.3.13

May 9, 2020

0.3.12

May 9, 2020

0.3.11

May 8, 2020

0.3.10

Apr 30, 2020

0.3.9

Apr 25, 2020

0.3.8

Apr 13, 2020

0.3.7

Apr 10, 2020

0.3.6

Apr 7, 2020

0.3.5

Apr 3, 2020

0.3.4

Mar 31, 2020

0.3.3

Mar 19, 2020

0.3.2

Mar 18, 2020

0.3.1

Mar 15, 2020

0.2.62

Jun 19, 2020

0.2.61.1

Jun 19, 2020

0.2.61.post1

Jun 19, 2020

0.2.61

Mar 11, 2020

0.2.61rc0 pre-release

Jun 19, 2020

0.2.61a2 pre-release

Jun 19, 2020

0.2.61a1 pre-release

Jun 19, 2020

0.2.60

Mar 7, 2020

0.2.59

Feb 25, 2020

0.2.58

Feb 25, 2020

0.2.57

Feb 24, 2020

0.2.56

Feb 20, 2020

0.2.55

Feb 19, 2020

0.2.54

Feb 16, 2020

0.2.53

Feb 13, 2020

0.2.52

Feb 10, 2020

0.2.51

Feb 9, 2020

0.2.50

Jan 21, 2020

0.2.49

Jan 14, 2020

0.2.48

Jan 9, 2020

0.2.47

Jan 9, 2020

0.2.46

Jan 9, 2020

0.2.45

Jan 9, 2020

0.2.44

Dec 30, 2019

0.2.43

Dec 30, 2019

0.2.42

Nov 29, 2019

0.2.41

Nov 23, 2019

0.2.40

Nov 22, 2019

0.2.39

Nov 19, 2019

0.2.38

Nov 18, 2019

0.2.37

Nov 16, 2019

0.2.36

Nov 16, 2019

0.2.35

Nov 12, 2019

0.2.34

Nov 8, 2019

0.2.33

Nov 8, 2019

0.2.32

Nov 8, 2019

0.2.31

Nov 7, 2019

0.2.30

Nov 7, 2019

0.2.29

Oct 25, 2019

0.2.28

Oct 21, 2019

0.2.27

Oct 19, 2019

0.2.25

Oct 19, 2019

0.2.23

Oct 10, 2019

0.2.22

Sep 28, 2019

0.2.21

Sep 26, 2019

0.2.20

Sep 21, 2019

0.2.19

Sep 20, 2019

0.2.18

Sep 12, 2019

0.2.17

Sep 12, 2019

0.2.16

Sep 12, 2019

0.2.15

Sep 10, 2019

0.2.14

Sep 9, 2019

0.2.13

Sep 7, 2019

0.2.12

Sep 7, 2019

0.2.11

Sep 7, 2019

0.2.10

Sep 6, 2019

0.2.9

Sep 5, 2019

0.2.8

Sep 5, 2019

0.2.7

Sep 5, 2019

0.2.6

Sep 5, 2019

0.2.5

Sep 3, 2019

0.2.4

Sep 2, 2019

0.2.3

Aug 29, 2019

0.2.2

Aug 28, 2019

0.2.1

Aug 26, 2019

0.2.0

Aug 23, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pymilvus-test-1.0.0.tar.gz (50.0 kB view details)

Uploaded Feb 24, 2021 Source

Built Distribution

pymilvus_test-1.0.0-py3-none-any.whl (60.3 kB view details)

Uploaded Feb 24, 2021 Python 3

File details

Details for the file pymilvus-test-1.0.0.tar.gz.

File metadata

Download URL: pymilvus-test-1.0.0.tar.gz
Upload date: Feb 24, 2021
Size: 50.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.1.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.42.1 CPython/3.6.9

File hashes

Hashes for pymilvus-test-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`c250eadf88e59fa7d4827808f04258751f449c1804653db3acf30100d6b45e1d`
MD5	`dba15df36d81f85cfe73ee0e3ddf1117`
BLAKE2b-256	`6b4fffb59bb6311c885a118be434ebd0f24e3c14d779845180399a747e1e3327`

See more details on using hashes here.

File details

Details for the file pymilvus_test-1.0.0-py3-none-any.whl.

File metadata

Download URL: pymilvus_test-1.0.0-py3-none-any.whl
Upload date: Feb 24, 2021
Size: 60.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.1.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.42.1 CPython/3.6.9

File hashes

Hashes for pymilvus_test-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e295dfcc6f26bc819c1ee3f8616a93c368ef8dc34064b428fe88e8817d0f2819`
MD5	`4434db1ad10f73f8180850f5696ec02c`
BLAKE2b-256	`ab9a84fe129e30d3734828e69733fbfc31c5553a7b836fdc3a4ab9f2e636268c`

See more details on using hashes here.

pymilvus-test 1.0.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Milvus Python SDK

Get started

Prerequisites

Install pymilvus

Examples

Basic operations

Connect to the Milvus server

Create/Drop collections

Create a collection

Drop a collection

Create/Drop partitions in a collection

Create a partition

Drop a partition

Create/Drop indexes in a collection

Create an index

Drop an index

Insert/Delete vectors in collections/partitions

Insert vectors in a collection

Insert vectors in a partition

Delete vectors by ID

Flush data in one or multiple collections to disk

Compact all segments in a collection

Search vectors in collections/partitions

Search vectors in a collection

Search vectors in a partition

close client

FAQ

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes