Skip to main content

Facebook AI Research Sequence-to-Sequence Toolkit

Project description



Support Ukraine MIT License Latest Release Build Status Documentation Status CicleCI Status


Fairseq(-py) is a sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling and other text generation tasks.

We provide reference implementations of various sequence modeling papers:

List of implemented papers

What's New:

Previous updates

Features:

We also provide pre-trained models for translation and language modeling with a convenient torch.hub interface:

en2de = torch.hub.load('pytorch/fairseq', 'transformer.wmt19.en-de.single_model')
en2de.translate('Hello world', beam=5)
# 'Hallo Welt'

See the PyTorch Hub tutorials for translation and RoBERTa for more examples.

Requirements and Installation

  • PyTorch version >= 1.10.0
  • Python version >= 3.8
  • For training new models, you'll also need an NVIDIA GPU and NCCL
  • To install fairseq and develop locally:
git clone https://github.com/pytorch/fairseq
cd fairseq
pip install --editable ./

# on MacOS:
# CFLAGS="-stdlib=libc++" pip install --editable ./

# to install the latest stable release (0.10.x)
# pip install fairseq
  • For faster training install NVIDIA's apex library:
git clone https://github.com/NVIDIA/apex
cd apex
pip install -v --no-cache-dir --global-option="--cpp_ext" --global-option="--cuda_ext" \
  --global-option="--deprecated_fused_adam" --global-option="--xentropy" \
  --global-option="--fast_multihead_attn" ./
  • For large datasets install PyArrow: pip install pyarrow
  • If you use Docker make sure to increase the shared memory size either with --ipc=host or --shm-size as command line options to nvidia-docker run .

Getting Started

The full documentation contains instructions for getting started, training new models and extending fairseq with new model types and tasks.

Pre-trained models and examples

We provide pre-trained models and pre-processed, binarized test sets for several tasks listed below, as well as example training and evaluation commands.

We also have more detailed READMEs to reproduce results from specific papers:

Join the fairseq community

License

fairseq(-py) is MIT-licensed. The license applies to the pre-trained models as well.

Citation

Please cite as:

@inproceedings{ott2019fairseq,
  title = {fairseq: A Fast, Extensible Toolkit for Sequence Modeling},
  author = {Myle Ott and Sergey Edunov and Alexei Baevski and Angela Fan and Sam Gross and Nathan Ng and David Grangier and Michael Auli},
  booktitle = {Proceedings of NAACL-HLT 2019: Demonstrations},
  year = {2019},
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

fairseq_built-0.12.4-cp312-cp312-win_amd64.whl (11.7 MB view details)

Uploaded CPython 3.12Windows x86-64

fairseq_built-0.12.4-cp311-cp311-win_amd64.whl (11.7 MB view details)

Uploaded CPython 3.11Windows x86-64

fairseq_built-0.12.4-cp310-cp310-win_amd64.whl (11.7 MB view details)

Uploaded CPython 3.10Windows x86-64

File details

Details for the file fairseq_built-0.12.4-cp312-cp312-win_amd64.whl.

File metadata

File hashes

Hashes for fairseq_built-0.12.4-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 4c07266e10432e8a9b62bf00cf569fa358841ecd14b0fa3247d13a2e371e43a6
MD5 6a0632ae127639b6d237e219083949c8
BLAKE2b-256 329d89043c25eeee7dde16ef188d64b684faf1a55a37e7f5913faea15714d285

See more details on using hashes here.

File details

Details for the file fairseq_built-0.12.4-cp311-cp311-win_amd64.whl.

File metadata

File hashes

Hashes for fairseq_built-0.12.4-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 32722a8fe12924edfa9a7a7e81677efe82f03e06ac3e8f6eba4d03639db2f57b
MD5 0fbd0df725d10d45e3ac78849d7c6d55
BLAKE2b-256 6988b6ad3ab484b37256ff0ce1b5aaefc3c6a13658168b092ffb1b3deaab36a1

See more details on using hashes here.

File details

Details for the file fairseq_built-0.12.4-cp310-cp310-win_amd64.whl.

File metadata

File hashes

Hashes for fairseq_built-0.12.4-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 73c1dedb0533b8219732f34eae9f6f1b6caee709891b9d627e1b4d8e3aff8c46
MD5 8d8a9744d0cff1c74a444b30638bdba6
BLAKE2b-256 b78b6d952b5dbae19127658463d67e4f9745410bb62780be69718cdd69f98aef

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page