No project description provided
Project description
Marian Client
A client for interacting with Marian-NMT websocket server (see MarianNMT).
Marian is a state of the art Neural Machine Translation framework written in C++. Eventually, someone should write Python bindings (not it!). Until then, the recommended way of communicating with Marian is via WebSockets. The authors of Marian provide an example of doing this with this script. If you need Python2 support, you should look at that script.
What this project contributes beyond the above script:
- Persistent connection - This keeps a connection open between Marian Server and Python. This saves a few hundred ms per call, which is significant
- Encapsulation - just import a class, instantiate, and call. Don't think about websockets at all
- Timeout, retries, and error handling - websockets are not the most reliable. Connections fail, timeouts happen. Just pass a value for
timeout
andretries
when you instantiateMarianClient
and this will just be handled for you.
Installation
pip install marian-client
Usage
from marian_client import MarianClient
# These are the default values:
host = "localhost"
port = "8080"
# or give the fully qualified URL
url = "ws://my.marian.server.ip/translate"
# Default values
timeout = 30 # measured in seconds - you may want to make this much lower
retries = 3 # amount of times to retry on error. backs off exponentially.
debug = False # set to True for a little more info on errors
mc = MarianClient(PORT=port, HOST=host, timeout=timeout, retries=retries, debug=debug)
# or if you want to specify url
# mc = MarianClient(url, timeout=timeout, retries=retries, debug=debug)
# if you just want all the default values, and marian-server is running locally:
# mc = MarianClient()
tokenized_sentence = "Alice like cats ."
success, corrected_sentence, error_info = mc(tokenized_sentence)
if success:
print(corrected_sentence)
else:
print(f"Call to MarianClient failed with error code {error_info[0]} and message {error_info[1]}")
# If marian-server is sert up and working, this prints
# >>> "Alice likes cats ."
Notes
- When instantiating a
MarianClient
instance, if we receive aConnectionRefusedError
, we attempt to reconnectconnection_retries
times, with exponential backoff, maxing out atmax_wait_time_between_connection_attempts
. - This means in the default case, if Marian Server is unavailable, we will try to connect, wait 1 second, try to connect again, wait 2 seconds, try to connect again, wait 4 seconds, ... then 8, 16, 32, 64, 128, 256, 300, then actually fail, for a total wait time of 811 seconds.
License
Like Marian, this package is released under the MIT license.
Credits
This package was made by the NLP team at Qordoba. If you are using it, and interested in working on NLP, maybe reach out to Sam?
Thanks to Marcin Junczys-Dowmunt and the rest of the awesome authors of Marian!
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file marian-client-0.16.0.tar.gz
.
File metadata
- Download URL: marian-client-0.16.0.tar.gz
- Upload date:
- Size: 7.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/45.2.0 requests-toolbelt/0.9.1 tqdm/4.43.0 CPython/3.7.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d13a626922acadc5d7b7d27563ce7e7f0e66e1a89eb2f13e8780c2be7cb8bce3 |
|
MD5 | 0d07e31bb8fa5fbf81473210dac541f3 |
|
BLAKE2b-256 | 3ba99c84ea9ab60acee6c070040475b1850780ef96da73e928aae0d20c59a889 |