No project description provided
Project description
PyF Collection
A Python library that brings functional programming collection operations to Python, inspired by monadic operations from languages like Scala and Haskell.
Installation
pip install pyf-collection
Overview
PyFCollection is a generic collection wrapper that provides a fluent API for functional programming operations on iterables. It allows you to chain operations like map, filter, fold, and more in a clean, readable way.
Quick Start
from pyf_collection import PyFCollection
# Create a collection
numbers = PyFCollection([1, 2, 3, 4, 5])
# Chain operations
result = (numbers
.map(lambda x: x * 2)
.filter(lambda x: x > 4)
.to_list())
print(list(result)) # [6, 8, 10]
API Reference
Constructor
PyFCollection(content: Optional[collections.Iterable[T]])
Creates a new PyFCollection instance.
# From a list
collection = PyFCollection([1, 2, 3])
# From any iterable
collection = PyFCollection(range(5))
# Empty collection
collection = PyFCollection(None)
Transformation Operations
map(func: Callable[[T], U]) -> PyFCollection[U]
Transforms each element in the collection using the provided function.
numbers = PyFCollection([1, 2, 3])
doubled = numbers.map(lambda x: x * 2)
print(list(doubled.to_list())) # [2, 4, 6]
# Transform to different type
words = PyFCollection(["hello", "world"])
lengths = words.map(len)
print(list(lengths.to_list())) # [5, 5]
flat_map(func: Callable[[T], PyFCollection[U]]) -> PyFCollection[U]
Maps each element to a PyFCollection and flattens the results.
words = PyFCollection(["hello", "world"])
chars = words.flat_map(lambda word: PyFCollection(list(word)))
print(list(chars.to_list())) # ['h', 'e', 'l', 'l', 'o', 'w', 'o', 'r', 'l', 'd']
# Expand numbers
numbers = PyFCollection([1, 2, 3])
expanded = numbers.flat_map(lambda x: PyFCollection([x, x * 10]))
print(list(expanded.to_list())) # [1, 10, 2, 20, 3, 30]
Filtering Operations
filter(func: Callable[[T], bool]) -> PyFCollection[T]
Keeps only elements that satisfy the predicate function.
numbers = PyFCollection([1, 2, 3, 4, 5, 6])
evens = numbers.filter(lambda x: x % 2 == 0)
print(list(evens.to_list())) # [2, 4, 6]
distinct(dis: T) -> PyFCollection[T]
Removes all occurrences of the specified element.
numbers = PyFCollection([1, 2, 2, 3, 2, 4])
without_twos = numbers.distinct(2)
print(list(without_twos.to_list())) # [1, 3, 4]
Search Operations
find(func: Callable[[T], bool]) -> Optional[T]
Returns the first element that satisfies the predicate, or None if not found.
numbers = PyFCollection([1, 2, 3, 4, 5])
first_even = numbers.find(lambda x: x % 2 == 0)
print(first_even) # 2
not_found = numbers.find(lambda x: x > 10)
print(not_found) # None
exist(func: Callable[[T], bool]) -> bool
Returns True if any element satisfies the predicate.
numbers = PyFCollection([1, 2, 3, 4, 5])
has_even = numbers.exist(lambda x: x % 2 == 0)
print(has_even) # True
has_large = numbers.exist(lambda x: x > 10)
print(has_large) # False
Aggregation Operations
fold(acc: U, func: Callable[[U, T], U]) -> PyFCollection[U]
Reduces the collection to a single value using an accumulator function.
numbers = PyFCollection([1, 2, 3, 4])
sum_result = numbers.fold(0, lambda acc, x: acc + x)
print(list(sum_result.to_list())) # [10]
# String concatenation
words = PyFCollection(["Hello", " ", "World"])
sentence = words.fold("", lambda acc, x: acc + x)
print(list(sentence.to_list())) # ["Hello World"]
Slicing Operations
take(n: int) -> PyFCollection[T]
Returns a new collection with the first n elements.
numbers = PyFCollection([1, 2, 3, 4, 5])
first_three = numbers.take(3)
print(list(first_three.to_list())) # [1, 2, 3]
drop(n: int) -> PyFCollection[T]
Returns a new collection without the first n elements.
numbers = PyFCollection([1, 2, 3, 4, 5])
without_first_two = numbers.drop(2)
print(list(without_first_two.to_list())) # [3, 4, 5]
slice(n: int, m: int) -> PyFCollection[T]
Returns elements from index n to m (exclusive).
numbers = PyFCollection([0, 1, 2, 3, 4, 5])
middle = numbers.slice(2, 4)
print(list(middle.to_list())) # [2, 3]
Output Operations
to_list() -> collections.Iterable[T]
Converts the collection back to its underlying iterable.
collection = PyFCollection([1, 2, 3])
result = collection.to_list()
print(list(result)) # [1, 2, 3]
Chaining Operations
All operations return a new PyFCollection, allowing for fluent method chaining:
result = (PyFCollection([1, 2, 3, 4, 5, 6, 7, 8, 9, 10])
.filter(lambda x: x % 2 == 0) # [2, 4, 6, 8, 10]
.map(lambda x: x * x) # [4, 16, 36, 64, 100]
.take(3) # [4, 16, 36]
.map(lambda x: f"Value: {x}") # ["Value: 4", "Value: 16", "Value: 36"]
.to_list())
print(list(result)) # ["Value: 4", "Value: 16", "Value: 36"]
Type Safety
PyFCollection is fully typed using Python's type hints and generics, providing excellent IDE support and type checking:
from typing import List
# Type inference works correctly
numbers: PyFCollection[int] = PyFCollection([1, 2, 3])
strings: PyFCollection[str] = numbers.map(str) # PyFCollection[str]
Performance
PyFCollection is designed to be performant while maintaining readability. Benchmarks show it can even outperform equivalent vanilla Python implementations in complex pipelines.
Benchmark Results
A comprehensive benchmark comparing PyFCollection against vanilla Python comprehensions shows impressive performance:
Test Setup:
- Dataset: 1,000,000 integers
- Pipeline operations:
map→ double each valuefilter→ keep multiples of 3flat_map→ produce (n, -n) for each elementdistinct→ drop a single value (-6)take→ keep first 100 itemsto_list→ materialize the final result
Results:
PyFCollection → 0.187s (5 runs)
Vanilla Python → 0.235s (5 runs)
Sample output : [6, 12, -12, 18, -18, 24, -24, 30, -30, 36]
PyFCollection demonstrates ~20% better performance than equivalent vanilla Python code, while providing significantly more readable and maintainable code.
Benchmark Code
from timeit import timeit
from typing import List
from pyf_collection import PyFCollection
def pipeline_pyf() -> List[int]:
data = list(range(1, 1_000_001))
result = (
PyFCollection(data)
.map(lambda x: x * 2)
.filter(lambda x: x % 3 == 0)
.flat_map(lambda x: PyFCollection([x, -x]))
.distinct(-6)
.take(100)
.to_list()
)
return result
def pipeline_vanilla() -> List[int]:
data = list(range(1, 1_000_001))
doubled = (x * 2 for x in data) # map
multiples = (x for x in doubled if x % 3 == 0) # filter
flatmapped = (y for x in multiples for y in (x, -x)) # flat-map
distincted = (x for x in flatmapped if x != -6) # distinct
first_100 = [x for *, x in zip(range(100), distincted)] # take
return first_100
# Run benchmark
vanilla_time = timeit("pipeline_vanilla()", globals=globals(), number=5)
pyf_time = timeit("pipeline_pyf()", globals=globals(), number=5)
print(f"PyFCollection → {pyf_time:.3f}s (5 runs)")
print(f"Vanilla Python → {vanilla_time:.3f}s (5 runs)")
Requirements
- Python >= 3.9
- typing support for generics
License
MIT License
Author
Pablo Picouto Garcia
Contributing
Contributions are welcome! Please visit the GitHub repository for more information.
Issues
Report issues at: https://github.com/politrons/Dive-into-Python/issues
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pyf_collection-0.1.4.tar.gz.
File metadata
- Download URL: pyf_collection-0.1.4.tar.gz
- Upload date:
- Size: 6.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0bdc144c2fa7c17ec73303b23bd4d43ad39e0408fede210ce14790db7a21633d
|
|
| MD5 |
4bb1edd71661d601ec9da0e0b33a856a
|
|
| BLAKE2b-256 |
8292f1d1842a360f9a5af26ebb16599de6ddb6bd1736d2bde0056305453064b8
|
File details
Details for the file pyf_collection-0.1.4-py3-none-any.whl.
File metadata
- Download URL: pyf_collection-0.1.4-py3-none-any.whl
- Upload date:
- Size: 6.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d42dbc496e5419182b1876132e85ab96cb90b450b7dc42b40a4b2fce859da1ca
|
|
| MD5 |
3dfb4b1aa27644cb90466984428f37f1
|
|
| BLAKE2b-256 |
eed5afe1af5c1259c0ddbd145fe83f514ed84cd0d7002014473f53b13683af35
|