A framework for distributed text analysis using gRPC and microservices-based architecture.
Microservice Text Analysis Platform
MTAP is a framework for distributed text analysis built on gRPC and a microservices-based architecture. It is used to create independent, scalable, interoperable text analysis pipeline components in either Python or Java.
- Operating System: We test on Ubuntu 22.04 and macOS Big Sur, but other UNIX-like systems should work.
- Python 3.7+: We test on Python 3.7 and the latest stable version of Python.
- Optional: Java 8+ if you want to create Java processors. We test on Java 8 and the latest stable version of Java.
- Optional: Go 1.13+ if you want to run the RESTful API Gateway.
Ease of Use
MTAP takes care of all the communication between different components, and provides a distributed object model for text analysis artifacts. All you have to worry about is writing the text analysis code.
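As a rough illustration of what "only writing the analysis code" can look like, the sketch below models a text artifact that accumulates named label indices and a component that adds token labels to it. This is plain Python, not the MTAP API: `Label`, `Document`, `add_labels`, and `tokenize` are hypothetical names chosen for this example, and no communication layer is involved.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass(frozen=True)
class Label:
    """Hypothetical span annotation: character offsets plus a tag."""
    start: int
    end: int
    tag: str


@dataclass
class Document:
    """Hypothetical text artifact that accumulates named label indices."""
    text: str
    labels: Dict[str, List[Label]] = field(default_factory=dict)

    def add_labels(self, index_name: str, labels: List[Label]) -> None:
        # Append to the named index, creating it on first use.
        self.labels.setdefault(index_name, []).extend(labels)


def tokenize(document: Document) -> None:
    """The analysis code itself: label each whitespace-delimited token."""
    tokens = []
    position = 0
    for word in document.text.split():
        # Search from the end of the previous token so repeated words
        # map to their own offsets.
        start = document.text.index(word, position)
        end = start + len(word)
        tokens.append(Label(start, end, "token"))
        position = end
    document.add_labels("tokens", tokens)


if __name__ == "__main__":
    doc = Document("MTAP labels text")
    tokenize(doc)
    for label in doc.labels["tokens"]:
        print(label.start, label.end, doc.text[label.start:label.end])
```

In a framework like the one described here, the artifact and its labels would be shared between components by the framework, so a component author would only supply something equivalent to the body of `tokenize`.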
By using the microservice pattern, text analysis components can be deployed once and then mixed and matched in different pipelines. Components written in different languages can interoperate without hassle. We also provide a RESTful API gateway that lets you call components using HTTP.
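The "deploy once, mix and match" idea can be sketched with an in-memory registry standing in for deployed services. This is an illustration under stated assumptions, not MTAP code: `REGISTRY`, `deploy`, `run_pipeline`, and the three example components are hypothetical, and real components would be separate network services rather than local functions.

```python
from typing import Callable, Dict, List

# A component takes a dict of analysis results and returns an updated copy.
Component = Callable[[dict], dict]

# Hypothetical stand-in for deployed services: each component is registered once.
REGISTRY: Dict[str, Component] = {}


def deploy(name: str) -> Callable[[Component], Component]:
    """Register a component under a name so any pipeline can refer to it."""
    def decorator(func: Component) -> Component:
        REGISTRY[name] = func
        return func
    return decorator


@deploy("sentences")
def split_sentences(artifact: dict) -> dict:
    sentences = [s.strip() for s in artifact["text"].split(".") if s.strip()]
    return {**artifact, "sentences": sentences}


@deploy("tokens")
def split_tokens(artifact: dict) -> dict:
    return {**artifact, "tokens": artifact["text"].replace(".", " ").split()}


@deploy("token-count")
def count_tokens(artifact: dict) -> dict:
    # Depends on the output of the "tokens" component.
    return {**artifact, "token_count": len(artifact["tokens"])}


def run_pipeline(component_names: List[str], text: str) -> dict:
    """Run the named components in order over a fresh artifact."""
    artifact = {"text": text}
    for name in component_names:
        artifact = REGISTRY[name](artifact)
    return artifact


if __name__ == "__main__":
    # Two different pipelines reuse the same registered components.
    print(run_pipeline(["tokens", "token-count"], "One two. Three."))
    print(run_pipeline(["sentences", "tokens"], "One two. Three."))
```

Because pipelines refer to components only by name, the same component can appear in any number of pipelines without being redeployed, which is the property the microservice pattern provides across processes and languages.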
MTAP is designed to bridge the gap between prototyping new ideas and deploying them into a production environment. It supports everything from calling components locally, without any network infrastructure, to deploying them as services and building pipelines with service discovery via Consul.
Getting-started tutorials for both Python and Java are available on our project website.
MTAP is developed at the University of Minnesota by the NLP/IE Group in the Institute for Health Informatics.
Funding for this work was provided by:
- 1 R01 LM011364-01 NIH-NLM
- 1 R01 GM102282-01A1 NIH-NIGMS
- U54 RR026066-01A2 NIH-NCRR