Advanced Recommendation Systems Library with State-of-the-Art Algorithms
Project description
CoreRec & VishGraphs Manual
Discover the power of graph analysis and recommendation with CoreRec & VishGraphs. Dive into our comprehensive manual and explore the endless possibilities.
Introduction
CoreRec is a modern recommendation engine built for the deep learning era. It implements industry-standard architectures (Two-Tower, Transformers, GNNs) with a multi-stage pipeline approach used by Netflix, YouTube, and other major platforms. CoreRec seamlessly integrates collaborative filtering, content-based methods, and sequential models into a unified framework.
VishGraphs is your ultimate Python library for graph visualization and analysis. Whether you're a data scientist, researcher, or hobbyist, VishGraphs offers intuitive tools to generate, visualize, and analyze graphs effortlessly.
NEW: CoreRec now supports modern embedding-based architectures! See MODERN_RECSYS_GUIDE.md for the complete guide on Two-Tower models, vector databases, and multi-modal fusion.
Our Downloads per month
Note: last updated: 2024-11-20
Features
core_rec.py
-
GraphTransformer(num_layers, d_model, num_heads, d_feedforward, input_dim)- A Transformer model for graph data with customizable parameters.
-
GraphDataset(adj_matrix)- A PyTorch dataset for graph data, streamlining model training.
-
train_model(model, data_loader, criterion=False, optimizer=False, num_epochs=False)- Train your model with ease using our flexible training function.
-
predict(model, graph, node_index, top_k=5)- Predict similar nodes with precision using trained models.
-
aaj_accuracy(graph, node_index, recommended_indices)- Measure the accuracy of your recommendations with our robust metrics.
vish_graphs.py
-
generate_large_random_graph(num_people, file_path="large_random_graph.csv", seed=None)- Generate and save large random graphs effortlessly.
-
draw_graph(adj_matrix, top_nodes=None, recommended_nodes=None, node_labels=None, transparent_labeled=True, edge_weights=None)- Create stunning 2D visualizations of your graphs.
-
draw_graph_3d(adj_matrix, top_nodes=None, recommended_nodes=None, node_labels=None, transparent_labeled=True, edge_weights=None)- Experience your graphs in 3D with customizable features.
-
show_bipartite_relationship(adj_matrix)- Visualize bipartite relationships with clarity.
Installation
Install CoreRec using pip:
pip install --upgrade corerec
Quick Start Examples
Modern Deep Learning (Industry Standard)
from corerec.engines import TwoTower
from corerec.retrieval.vector_store import create_index
# Two-Tower for fast retrieval (like YouTube, Netflix)
model = TwoTower(
user_input_dim=64,
item_input_dim=128,
embedding_dim=256
)
model.fit(user_ids, item_ids, interactions)
# Build vector index for fast search
item_embs = model.get_item_embeddings()
index = create_index("faiss", dim=256)
index.add(item_embs, item_ids)
# Retrieve top candidates
candidates = model.recommend(user_id, top_k=10)
Sequential Models (Time-Aware)
from corerec.engines import BERT4Rec
# Bidirectional transformer for sequences
model = BERT4Rec(hidden_dim=256, num_layers=4)
model.fit(user_ids, item_ids, interactions)
next_items = model.recommend(user_id, top_k=10)
Multi-Modal Fusion
from corerec.multimodal.fusion_strategies import MultiModalFusion
# Combine text, images, and metadata
fusion = MultiModalFusion(
modality_dims={'text': 768, 'image': 2048, 'meta': 32},
output_dim=256,
strategy='attention'
)
item_embedding = fusion({'text': text_emb, 'image': img_emb, 'meta': meta})
Traditional Methods (Still Supported)
# Collaborative Filtering
from corerec.engines import unionized
collab_model = unionized.FastRecommender()
collab_model.fit(user_ids, item_ids, ratings)
# Content-Based Filtering
from corerec.engines import content
content_model = content.TFIDFRecommender()
content_model.fit(item_features)
# Graph-Based Recommendations
from corerec.engines import GNNRec
gnn_model = GNNRec(num_layers=3, embedding_dim=128)
gnn_model.fit(user_ids, item_ids, graph_structure)
Using Optimizers / Boosters
CoreRec provides various optimizers for training models:
from corerec.cr_boosters.adam import Adam
from corerec.cr_boosters.nadam import NAdam
# Use with your model
optimizer = Adam(model.parameters(), lr=0.001)
CoreRec has various in-built optimizers for training models.
Available Optimizers
- Adam
- Nadam
- Adamax
- Adadelta
- Adagrad
- ASGD
- LBFGS
- RMSprop
- SGD
- SparseAdam
CoreRec Manual
Table of Contents
Introduction
- CoreRec: CoreRec is a cutting-edge recommendation engine for graph data analysis and visualization. It excels in recommending similar nodes, training machine learning models, and visualizing complex network structures.
- VishGraphs: VishGraphs is a Python library designed for graph visualization and analysis. It offers tools for generating random graphs, drawing graphs in 2D and 3D, and analyzing graph properties.
Directory Structure
| Description | Quick Access |
|---|---|
CoreRecCore recommendation engine components and utilities |
|
EnginesEngines Parts and Algorithms it Supports | engines/
├── __init__.py
├── content_based.py
├── hybrid.py
│
├── content_based/
│ ├── __init__.py
│ ├── base_recommender.py
│ ├── cr_contentFilterFactory.py
│ ├── tfidf_recommender.py
│ │
│ ├── traditional_ml_algorithms/
│ │ ├── LR.py
│ │ ├── decision_tree.py
│ │ ├── lightgbm.py
│ │ ├── svm.py
│ │ ├── tfidf.py
│ │ └── vw.py
│ │
│ ├── nn_based_algorithms/
│ │ ├── AITM.py
│ │ ├── DSSM.py
│ │ ├── MIND.py
│ │ ├── TDM.py
│ │ ├── WidenDeep.py
│ │ ├── Word2Vec.py
│ │ ├── Youtube_dnn.py
│ │ ├── autoencoder.py
│ │ ├── cnn.py
│ │ ├── dkn.py
│ │ ├── lstur.py
│ │ ├── naml.py
│ │ ├── npa.py
│ │ ├── nrms.py
│ │ ├── rnn.py
│ │ ├── transformer.py
│ │ └── vae.py
│ │
│ ├── context_personalization/
│ │ ├── context_aware.py
│ │ ├── item_profiling.py
│ │ └── user_profiling.py
│ │
│ ├── fairness_explainability/
│ │ ├── explainable.py
│ │ ├── fairness_aware.py
│ │ └── privacy_preserving.py
│ │
│ └── [Other specialized modules...]
│
├── collaborative/
│ ├── __init__.py
│ ├── als_recommender.py
│ ├── base_recommender.py
│ ├── cr_unionizedFactory.py
│ ├── initializer.py
│ ├── matrix_factorization.py
│ │
│ ├── nn_base/
│ │ ├── AFM_base.py
│ │ ├── AutoFI_base.py
│ │ ├── DCN_base.py
│ │ ├── DIEN_base.py
│ │ ├── DIN_base.py
│ │ ├── DeepFM_base.py
│ │ ├── NFM_base.py
│ │ └── [Other neural network models...]
│ │
│ ├── graph_based_base/
│ │ ├── DeepWalk_base.py
│ │ ├── GNN_base.py
│ │ ├── lightgcn_base.py
│ │ └── [Other graph-based models...]
│ │
│ ├── mf_base/
│ │ ├── ALS_base.py
│ │ ├── SVD_base.py
│ │ ├── nmf_base.py
│ │ └── [Other matrix factorization models...]
│ │
│ └── attention_mechanism_base/
│ ├── Attention_based_uf_base.py
│ ├── SASRec_base.py
│ └── Transformer_based_uf_base.py
│
└── test_struct_UF/
├── factory/
├── matrix_factorization_algorithms/
├── neural_network_based_algorithms/
├── graph_based_algorithms/
└── attention_mechanism_based_algorithms/
|
Usage
Generating Random Graphs
Generate random graphs effortlessly with the generate_random_graph function:
import vish_graphs as vg
graph_file = vg.generate_random_graph(10, "random_graph.csv")
The use cases are:-
🔍 Delve into Advanced Graph Analysis and Recommendation with VishGraphs and CoreRec! 🚀
Welcome to a world of cutting-edge graph analysis and recommendation tools brought to you by VishGraphs and CoreRec. Uncover the potential of data visualization and machine learning in a sophisticated manner.
🔗 Explore Detailed UseCases Here 🔗
CoreRec
import core_rec as cs
1. GraphTransformer(num_layers, d_model, num_heads, d_feedforward, input_dim)
Main Algorithm CoreRec Provides Based on Transformer Architecture works fine with PyTorch, CoreRec etc. In Simple terms it uses DNG Score to rank prediction of surrondings of Target node Providing a Enhanced way to compute Attention.
Use case: Providing recommendations for nodes based on their similarity within a graph.
2. GraphTransformerV2(num_layers, d_model, num_heads, d_feedforward, input_dim)
GraphTransformerV2 adds dropout and layer normalization, enhancing robustness compared to GraphTransformer's simpler architecture.
Use case: More Evolved Training machine learning models for graph-related tasks, such as node classification or link prediction.
3. GraphDataset(adj_matrix, weight_matrix)
Defines a PyTorch dataset for graph data, allowing easy integration with DataLoader for model training.
Use case: Preparing graph data for training machine learning models.
4. train_model(model, data_loader, criterion, optimizer, num_epochs)
Trains a given model using the provided data loader, loss function, optimizer, and number of epochs.
Use case: Training machine learning models for graph-related tasks using graph data.
In the test.py file, various functionalities from vish_graphs.py and core_rec.py are utilized and demonstrated:
- Random graph generation (
generate_random_graph). - Identification of top nodes in a graph (
find_top_nodes). - Training a Transformer model for graph data (
GraphTransformerV2,GraphDataset,train_model). - Recommending similar nodes using a trained model (
recommend_similar_nodes). - Visualization of a graph in 3D (
draw_graph_3d).
vishgraphs
import vishgraphs as vg
1. generate_random_graph(num_people, file_path="graph_dataset.csv", seed=None)
Generate a random graph with a specified number of people and save the adjacency matrix to a CSV file.
Use case: Generating synthetic graph data for testing algorithms or simulations.
2. draw_graph(adj_matrix, nodes, top_nodes)
Draw a 2D visualization of a graph based on its adjacency matrix, highlighting top nodes if specified.
Use case: Visualizing relationships within a graph dataset.
3. find_top_nodes(matrix, num_nodes=10)
Identify the top nodes with the greatest number of strong correlations in a graph.
Use case: Identifying influential or highly connected nodes in a network.
4. draw_graph_3d(adj_matrix, nodes, top_nodes)
Create a 3D visualization of a graph based on its adjacency matrix, with optional highlighting of top nodes.
Use case: Visualizing complex network structures in a three-dimensional space.
5. show_bipartite_relationship_with_cosine(adj_matrix)
Visualize bipartite relationships in a graph using cosine similarity and community detection algorithms.
Use case: Analyzing relationships between different sets of nodes in a bipartite graph.
6. bipartite_matrix_maker(csv_path)
Read a CSV file containing a bipartite adjacency matrix and return it as a list.
Use case: Preparing data for analyzing bipartite networks.
Explore the codebase and utilize these functionalities for your graph analysis and recommendation tasks! If you have any questions or need further assistance, don't hesitate to reach out. Happy graph analyzing! 📊🔍
Drawing Graphs
VishGraphs supports drawing graphs in both 2D and 3D:
adj_matrix = vishgraphs.bipartite_matrix_maker(graph_file)
nodes = list(range(len(adj_matrix)))
top_nodes = [0, 1, 2] # Example top nodes
vishgraphs.draw_graph(adj_matrix, nodes, top_nodes)
Core Team 👨💻
These are the humans that form the CoreRec's core team, which runs the project. (Team is not yet decided)
| @vishesh9131 |
|---|
| Founder / Creator |
Troubleshooting
Troubleshooting Guide
For issues with CoreRec and VishGraphs:
- Check Documentation: Ensure you're following the library's guidelines and examples correctly.
- GitHub Issues: Report bugs or seek help by creating an issue on the GitHub repository.
- Verify Data: Confirm that your input data is correctly formatted and compatible.
- Model Parameters: Double-check model configurations and training parameters.
- Visualization Inputs: Ensure correct parameters for graph visualization functions.
- Community Help: Utilize community forums for additional support.
This streamlined approach should help resolve common issues efficiently.
Contributing
We welcome contributions to enhance the functionalities of our graph analysis and recommendation tools. If you're interested in contributing, here are a few ways you can help:
- Bug Fixes: Identify and fix bugs in the existing code.
- Feature Enhancements: Suggest and implement improvements to current features.
- New Features: Propose and develop new features that could benefit users of the libraries.
- Documentation: Help improve the documentation to make the libraries more user-friendly.
To contribute, please follow these steps:
- Fork the repository.
- Create a new branch for your feature or fix.
- Develop your changes while adhering to the coding standards and guidelines.
- Submit a pull request with a clear description of the changes and any relevant issue numbers.
Your contributions are greatly appreciated and will help make these tools more effective and accessible to everyone!
License
VishGraphs is distributed under the following terms:
The library and utilities are only for research purposes. Please do not use it commercially without the author's (@Vishesh9131) consent.
Examples and Demos
-
Dataset note (crlearn): Examples will try to use the IJCAI dataset via
cr_learn.ijcai.load()when available. If not available, they fall back to CSVs insample_data/. You don't need to change the scripts; they auto-detect. -
Deep engines (tiny, runnable demos)
# Deep & Cross Network
python examples/engines_dcn_example.py
# DeepFM
python examples/engines_deepfm_example.py
# GNN-based recommender
python examples/engines_gnnrec_example.py
# MIND (multi-interest) sequential recommender
python examples/engines_mind_example.py
# NASRec
python examples/engines_nasrec_example.py
# SASRec (self-attentive sequential)
python examples/engines_sasrec_example.py
- UnionizedFilterEngine (collaborative/hybrid)
# FastAI-style embedding dot-bias
python examples/unionized_fast_example.py
# FastAI-style recommender variant
python examples/unionized_fast_recommender_example.py
# SAR (item-to-item similarity)
python examples/unionized_sar_example.py
# RLRMC (Riemannian low-rank)
python examples/unionized_rlrmc_example.py
# RBM (Restricted Boltzmann Machine)
python examples/unionized_rbm_example.py
# GeoMLC (geometric matrix completion)
python examples/unionized_geomlc_example.py
- Content Filter
# TF-IDF based content filter
python examples/content_filter_tfidf_example.py
- Frontends (imshow plug-and-play)
# Minimal plug-and-play frontend server
python examples/imshow_connector_example.py
# Then open http://127.0.0.1:8000 in your browser
- Unified Test Runner
# Discover and run the test suite with a compact report
python examples/run_all_algo_tests_example.py
Tips
- All example scripts add the project root to
sys.pathautomatically so you can run them directly. - If
cr_learnis installed, examples will prefer it; otherwise they usesample_data/CSVs bundled in this repo.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file corerec-0.5.2.tar.gz.
File metadata
- Download URL: corerec-0.5.2.tar.gz
- Upload date:
- Size: 1.5 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4bd806b57746d41d8a818b1ffc5d0cccee39305075d939e864d8a3c4584f50da
|
|
| MD5 |
76a7c4da521d204ab7498c47979e5156
|
|
| BLAKE2b-256 |
d2b358ba4a5ccaa3de3a2414b758f5379f9d4ed534deda0b878392e062e0bb96
|
File details
Details for the file corerec-0.5.2-py3-none-any.whl.
File metadata
- Download URL: corerec-0.5.2-py3-none-any.whl
- Upload date:
- Size: 1.9 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c572da0f9e6c20a8450278b7e99fef8e420510e46e9ccb1daa84ec02bb4ee189
|
|
| MD5 |
88f273c3d30ecaaf45dc0999b8c13a06
|
|
| BLAKE2b-256 |
81b34b17859bb4ae083bc6b5bc275da019f5b3a9fa2dd855b27604799c820e58
|