Skip to main content

Data Structures for Python

Project description

datastructpy

Documentation Status Python 3.11 codecov

datastructpy is a Python package designed to provide customizable and practical implementations of essential data structures, such as Binary Search Trees (BST). It is tailored to help users prepare for technical interviews, coding challenges, and educational projects by offering intuitive and efficient solutions. The package stands out by emphasizing simplicity and clarity while maintaining detailed documentation and modular implementations suited for both learning and practical use. Unlike visualization-heavy libraries, the package focuses on providing a hands-on, customizable experience for working with binary trees and other data structures. If you require a structured, minimalistic approach without additional dependencies or overhead, datastructpy serves as a valuable alternative!

Data Structures Included:

  • Binary Search Tree

    A Binary Search Tree (BST) is a data structure that organizes data hierarchically, allowing for efficient insertion, deletion, and lookup operations. Each node in the tree contains a key, and the tree is structured such that:

    • Keys in the left subtree of a node are smaller than the node’s key.
    • Keys in the right subtree of a node are larger than the node’s key.
    • Duplicate keys are not allowed.

    Time Complexity: Search, Insert, Delete:

    • Average Case: O(log n) => This occurs when the BST is balanced.
    • Worst Case: O(n) => This occurs when the BST is skewed.

    Space Complexity:

    • Space for the BST: O(n).
    • Recursive Operations: O(h), where h is the height of the tree.

    Methods

    • insert(key):

      • Inserts a specified key into the Binary Search Tree (BST) while maintaining BST properties.
      • If the tree is empty, it creates a new root node.
      • If the key is smaller than the current node’s key, it is inserted into the left subtree; if larger, into the right subtree.
      • Duplicate keys are ignored.
    • search(key):

      • Searches for a specified key in the BST.
      • The search follows the Binary Search Tree (BST) property:
      • If the key is smaller than the current node, search the left subtree.
      • If the key is larger, search the right subtree.
      • If the key matches, return the node.
      • Returns the node containing the key if found, otherwise None.
    • delete(key):

      • Deletes a specified key from the BST while maintaining BST properties.
      • If the node has:
        • No children → It is removed.
        • One child → The child replaces the deleted node.
        • Two children → The node is replaced by its in-order successor (the smallest value in the right subtree), and the successor is then deleted.
    • list_to_tree(elements):

      • Constructs a Binary Search Tree (BST) from a given list of elements.
      • Sequentially inserts each element into the BST, ensuring the tree maintains BST properties.
      • Duplicate values are ignored to preserve the BST structure.
      • Returns a BinarySearchTree object with the provided elements organized as a valid BST.

datastructpy in Python Ecosystem

datastructpy complements Python’s standard library by providing customizable implementations of essential data structures for learning and interview preparation. While modules like collections (e.g., deque) and heapq focus on optimized, ready-to-use structures, datastructpy emphasizes clarity and adaptability, making it ideal for understanding core concepts. Unlike specialized libraries like pyrsistent or sortedcontainers, datastructpy bridges the gap between practical functionality and educational needs, carving out a unique space in the Python ecosystem.

🚀 Installation

You can install datastructpy using pip.

Install via pip:

$ pip install datastructpy

Usage

Example usage:

from datastructpy.non_linear.trees.binary_search_tree import BinarySearchTree

# Create a Binary Search Tree from a list of elements
elements = [10, 5, 15, 8]
bst = BinarySearchTree.list_to_tree(elements)

# Check the structure of the tree
print("Tree Structure After Creation:")
print(bst.root.key)             # Output: 10
print(bst.root.left.key)        # Output: 5
print(bst.root.right.key)       # Output: 15
print(bst.root.left.right.key)  # Output: 8

# Insert new nodes into the BST
print("Inserting New Elements:")
bst.insert(12)  # Insert into right subtree of 10
bst.insert(2)   # Insert into left subtree of 5
print(bst.root.right.left.key)  # Output: 12 (left child of 15)
print(bst.root.left.left.key)   # Output: 2 (left child of 5)

# Search for values in the BST
print("Searching for Keys:")
print(bst.search(8) is not None)   # Output: True (8 exists in the tree)
print(bst.search(20) is None)      # Output: True (20 does not exist in the tree)

# Delete a node
print("Deleting Nodes:")
bst.delete(5)  # Delete the left child of the root
if bst.root.left:
    print(bst.root.left.key)    # Output: 8 (5 replaced by its in-order successor)
else:
    print(bst.root.left)        # Output: None (if no successor is present)

bst.delete(10)  # Delete the root
print(bst.root.key)             # Output: 15 (new root after deletion)

# Final structure of the tree
print("Final Tree Structure:")
print(bst.root.key)             # Output: 15
print(bst.root.left.key)        # Output: 8
print(bst.root.right.left.key)  # Output: 12

Running Tests

To run tests for the datastructpy package, follow these steps:

  1. Ensure dependencies are installed: If you haven't set up the environment yet, install dependencies using Poetry:

    $ poetry install
    
  2. Run tests using pytest: Execute the following command from the root of the project directory:

    $ poetry run pytest
    

    This will automatically discover and execute all test files in the tests/ directory.

  3. Run a specific test file: To test a specific module, specify the file path:

    $ poetry run pytest tests/non-linear/trees/binary_search_tree/test_list_to_tree.py
    
  4. Run tests with coverage (optional): To measure test coverage, use:

    $ poetry run pytest --cov=src/datastructpy
    

For more advanced testing options and configurations, refer to the pytest documentation.

Contributors

Albert Halim @Albert0011, Azin Piran @AzinPiran, Javier Martinez @javiermtzo99, Jessica Kuo @kuo4230

Contributing

Interested in contributing? Check out the contributing guidelines. Please note that this project is released with a Code of Conduct. By contributing to this project, you agree to abide by its terms.

License

datastructpy was created by Albert Halim, Azin Piran, Javier Martinez, Jessica Kuo. It is licensed under the terms of the MIT license.

Credits

datastructpy was created with cookiecutter and the py-pkgs-cookiecutter template.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datastructpy-0.0.0.tar.gz (6.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

datastructpy-0.0.0-py3-none-any.whl (7.1 kB view details)

Uploaded Python 3

File details

Details for the file datastructpy-0.0.0.tar.gz.

File metadata

  • Download URL: datastructpy-0.0.0.tar.gz
  • Upload date:
  • Size: 6.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for datastructpy-0.0.0.tar.gz
Algorithm Hash digest
SHA256 8cdb8ebad78c799e2038231366e1bf55f77b6499798760629080f42acd7e48a2
MD5 db88899535668c06101912b03e99d5dc
BLAKE2b-256 2a7ab28c3387cab0c8287c7b2be7343879c4238b37b1e6a191de2ae6a74def1e

See more details on using hashes here.

File details

Details for the file datastructpy-0.0.0-py3-none-any.whl.

File metadata

  • Download URL: datastructpy-0.0.0-py3-none-any.whl
  • Upload date:
  • Size: 7.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for datastructpy-0.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ef760b430c77ab1cb47a72c9f2ac66593442c37239bf1f001c4febc6a6ab90e5
MD5 042c73a71e42863efeb687e92416592b
BLAKE2b-256 cf361292eeeadb7fdff745e53435f9aa32cfedbb2d4995370829216f855f4787

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page