Skip to main content

Simple parser for ITCH messages

Project description

Nasdaq TotalView-ITCH 5.0 Parser

License: MIT

A Python library for parsing binary data conforming to the Nasdaq TotalView-ITCH 5.0 protocol specification. This parser converts the raw byte stream into structured Python objects, making it easier to work with Nasdaq market data.

Overview

The Nasdaq TotalView-ITCH 5.0 protocol is a binary protocol used by Nasdaq to disseminate full order book depth, trade information, and system events for equities traded on its execution system. This parser handles the low-level details of reading the binary format, unpacking fields according to the specification, and presenting the data as intuitive Python objects.

Features

  • Parses ITCH 5.0 Binary Data: Accurately interprets the binary message structures defined in the official specification.
  • Supports All Standard Message Types: Implements classes for all messages defined in the ITCH 5.0 specification (System Event, Stock Directory, Add Order, Trade, etc.).
  • Object-Oriented Representation: Each ITCH message type is represented by a dedicated Python class (SystemEventMessage, AddOrderMessage, etc.), inheriting from a common MarketMessage base class.
  • Flexible Input: Reads and parses messages from:
    • Binary files (.gz or similar).
    • Raw byte streams (e.g., from network sockets).
  • Data Decoding: Provides a .decode() method on each message object to convert it into a human-readable dataclass representation, handling:
    • Byte-to-string conversion (ASCII).
    • Stripping padding spaces.
    • Price decoding based on defined precision.
  • Timestamp Handling: Correctly reconstructs the 6-byte (48-bit) nanosecond timestamps.
  • Price Handling: Decodes fixed-point price fields into floating-point numbers based on the standard 4 or 8 decimal place precision.
  • Pure Python: Relies only on the Python standard library . No external dependencies required.

Installation

You can install this project using pip

  1. Clone the repository (or download the source code):
    pip install itch
    
  2. Import the necessary modules directly into your Python project:
    from itch.parser import MessageParser
    from itch.messages import ModifyOrderMessage
    

Usage

Parsing from a Binary File

This is useful for processing historical ITCH data stored in files. The MessageParser handles buffering efficiently.

from itch import MessageParser
from itch.messages import AddOrderMessage, TradeMessage

# Initialize the parser. Optionally filter messages by type.
# parser = MessageParser(message_type=b"AP") # Only parse AddOrder and NonCrossTrade messages
parser = MessageParser()

# Path to your ITCH 5.0 data file
itch_file_path = 'path/to/your/data'
# you can find sample data [here](https://emi.nasdaq.com/ITCH/Nasdaq%20ITCH/)

try:
    with open(itch_file_path, 'rb') as itch_file:
        # read_message_from_file returns a list of parsed message objects
        parsed_messages = parser.read_message_from_file(itch_file)

        print(f"Parsed {len(parsed_messages)} messages.")

        # Process the messages
        for message in parsed_messages:
            # Access attributes directly
            print(f"Type: {message.message_type.decode()}, Timestamp: {message.timestamp}")

            if isinstance(message, AddOrderMessage):
                print(f"  Add Order: Ref={message.order_reference_number}, "
                      f"Side={message.buy_sell_indicator.decode()}, "
                      f"Shares={message.shares}, Stock={message.stock.decode().strip()}, "
                      f"Price={message.decode_price('price')}") 

            elif isinstance(message, TradeMessage): 
                 print(f"  Trade: Match={message.match_number}")
                 # Access specific trade type attributes...

            # Get a human-readable dataclass representation
            decoded_msg = message.decode()
            print(f"  Decoded: {decoded_msg}")

except FileNotFoundError:
    print(f"Error: File not found at {itch_file_path}")
except Exception as e:
    print(f"An error occurred: {e}")

Parsing from Raw Bytes

This is suitable for real-time processing, such as reading from a network stream.

from itch import MessageParser
from itch.messages import AddOrderMessage
from queue import Queue

# Initialize the parser
parser = MessageParser()

# Simulate receiving a chunk of binary data (e.g., from a network socket)
# This chunk contains multiple ITCH messages, each prefixed with 0x00 and length byte
# Example: \x00\x0bS...\x00\x25R...\x00\x27F...
raw_binary_data: bytes = b"..." # Your raw ITCH 5.0 data chunk

# read_message_from_bytes returns a queue of parsed message objects
message_queue: Queue = parser.read_message_from_bytes(raw_binary_data)

print(f"Parsed {message_queue.qsize()} messages from the byte chunk.")

# Process messages from the queue
while not message_queue.empty():
    message = message_queue.get()

    print(f"Type: {message.message_type.decode()}, Timestamp: {message.timestamp}")

    if isinstance(message, AddOrderMessage):
         print(f"  Add Order: Ref={message.order_reference_number}, "
               f"Stock={message.stock.decode().strip()}, Price={message.decode_price('price')}")

    # Use the decoded representation
    decoded_msg = message.decode(prefix="Decoded")
    print(f"  Decoded: {decoded_msg}")

Supported Message Types

The parser supports the following ITCH 5.0 message types. Each message object has attributes corresponding to the fields defined in the specification. Refer to the class docstrings in itch.messages for detailed attribute descriptions.

Type (Byte) Class Name Description
S SystemEventMessage System Event Message
R StockDirectoryMessage Stock Directory Message
H StockTradingActionMessage Stock Trading Action Message
Y RegSHOMessage Reg SHO Short Sale Price Test Restricted Indicator
L MarketParticipantPositionMessage Market Participant Position message
V MWCBDeclineLeveMessage Market-Wide Circuit Breaker (MWCB) Decline Level
W MWCBStatusMessage Market-Wide Circuit Breaker (MWCB) Status
K IPOQuotingPeriodUpdateMessage IPO Quoting Period Update Message
J LULDAuctionCollarMessage LULD Auction Collar Message
h OperationalHaltMessage Operational Halt Message
A AddOrderNoMPIAttributionMessage Add Order (No MPID Attribution)
F AddOrderMPIDAttribution Add Order (MPID Attribution)
E OrderExecutedMessage Order Executed Message
C OrderExecutedWithPriceMessage Order Executed With Price Message
X OrderCancelMessage Order Cancel Message
D OrderDeleteMessage Order Delete Message
U OrderReplaceMessage Order Replace Message
P NonCrossTradeMessage Trade Message (Non-Cross)
Q CrossTradeMessage Cross Trade Message
B BrokenTradeMessage Broken Trade / Order Execution Message
I NOIIMessage Net Order Imbalance Indicator (NOII) Message
N RetailPriceImprovementIndicator Retail Price Improvement Indicator (RPII)
O DLCRMessage Direct Listing with Capital Raise Message

Data Representation

  • Base Class: All message classes inherit from itch.messages.MarketMessage. This base class provides common attributes like message_type, description, stock_locate, tracking_number, and timestamp.
  • Timestamp: Timestamps are stored as 64-bit integers representing nanoseconds since midnight. The set_timestamp and split_timestamp methods handle the conversion from/to the 6-byte representation used in the raw messages.
  • Prices: Price fields (e.g., price, execution_price, level1_price) are stored as integers in the raw message objects. Use the message.decode_price('attribute_name') method to get the correctly scaled floating-point value (usually 4 or 8 decimal places, defined by message.price_precision).
  • Strings: Alpha fields are stored as bytes. The .decode() method converts these to ASCII strings and removes right-padding spaces.
  • Decoded Objects: The message.decode() method returns a standard Python dataclass instance. This provides a clean, immutable, and easily inspectable representation of the message content with correct data types (float for prices, string for text).

Contributing

Contributions are welcome! If you find a bug, have a suggestion, or want to add a feature:

  1. Check Issues: See if an issue for your topic already exists.
  2. Open an Issue: If not, open a new issue describing the bug or feature request.
  3. Fork and Branch: Fork the repository and create a new branch for your changes.
  4. Implement Changes: Make your code changes, ensuring adherence to the ITCH 5.0 specification. Add tests if applicable.
  5. Submit Pull Request: Open a pull request from your branch to the main repository, referencing the relevant issue.

License

This project is licensed under the MIT License - see the LICENSE file for details.

References

  • Nasdaq TotalView-ITCH 5.0 Specification: The official documentation is the definitive source for protocol details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

itchfeed-1.0.0.tar.gz (29.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

itchfeed-1.0.0-py3-none-any.whl (22.1 kB view details)

Uploaded Python 3

File details

Details for the file itchfeed-1.0.0.tar.gz.

File metadata

  • Download URL: itchfeed-1.0.0.tar.gz
  • Upload date:
  • Size: 29.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for itchfeed-1.0.0.tar.gz
Algorithm Hash digest
SHA256 1542babc2a5df660de9853b4ef58f512b9c33357d7f23a9b6fa904f408de4e47
MD5 43556d539726568b54c5ce735db93551
BLAKE2b-256 81b93e479e149c368783f39503f6e9f1e406ba7616c89ca4a0df20d4412ffb63

See more details on using hashes here.

File details

Details for the file itchfeed-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: itchfeed-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 22.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for itchfeed-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 9ae8becd740fcd090f06ed83533395297106b7910547cb7018adf6c80addec23
MD5 d9aa7c86bee373817339d9e56f5ee424
BLAKE2b-256 163deaf398f7ee109ace6ce017ee68e2f817be5c5d1553d60533da037c101924

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page