Downloads pretrained Microsoft Vision models
Project description
Microsoft Vision
Installation
pip install microsoftvision
Usage
Input images should be in BGR format of shape (3 x H x W), where H and W are expected to be at least 224. The images have to be loaded in to a range of [0, 1] and then normalized using mean = [0.485, 0.456, 0.406] and std = [0.229, 0.224, 0.225].
Example script:
import microsoftvision
import torch
# This will load pretrained model
model = microsoftvision.models.resnext101_32x8d(pretrained=True)
# Load model to CPU memory, interface is the same as torchvision
model = microsoftvision.resnet50(map_location=torch.device('cpu'))
Example of creating image embeddings:
import microsoftvision
from torchvision import transforms
import torch
from PIL import Image
def get_image():
img = cv2.imread('example.jpg', cv2.IMREAD_COLOR)
img = cv2.resize(img, (256, 256))
img = img[16:256-16, 16:256-16]
preprocess = transforms.Compose([
transforms.ToTensor(),
transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])
return preprocess(image).unsqueeze(0) # Unsqueeze only required when there's 1 image in images batch
model = microsoftvision.models.resnet50(pretrained=True)
features = model(get_image())
print(features.shape)
Should output
...
torch.Size([1, 2048])
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
microsoftvision-1.0.4.tar.gz
(5.1 kB
view hashes)
Built Distribution
Close
Hashes for microsoftvision-1.0.4-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 109ff1c7ee3bb50aee20cc82862d37390a9045891eeb63c6e0dfd6805d334f50 |
|
MD5 | 52f762c74b80a76e56c698463388202a |
|
BLAKE2b-256 | 06d628ee51218c42acf10901a38054d737d68b9093ae8101acac4a545e6a1703 |