
LocalLab: A lightweight AI inference server for running LLMs locally or in Google Colab with a friendly API.

Project description

🚀 LocalLab: Your Personal AI Lab


Run ChatGPT-like AI on your own computer! LocalLab is a complete AI platform that runs models locally with a powerful chat interface and Python client.

✨ What Makes LocalLab Special?

LocalLab gives you your own personal ChatGPT that runs entirely on your computer:

  • 🎯 Terminal Chat Interface - ChatGPT-like experience in your terminal
  • 🔒 Complete Privacy - Your data never leaves your computer
  • 💰 Zero Cost - No monthly fees or API charges
  • 🌍 Access Anywhere - Use from any device with ngrok tunneling
  • ⚡ Multiple Models - Support for various open-source AI models
  • 🤖 Model Management - Download, organize, and manage AI models locally
  • 🎮 Free GPU - Run on Google Colab for free GPU acceleration

Perfect for developers, students, researchers, or anyone who wants to experiment with AI without privacy concerns or ongoing costs.

🚀 Quick Start (3 Steps)

# 1. Install LocalLab
pip install locallab locallab-client

# 2. Start your AI server
locallab start

# 3. Chat with your AI
locallab chat

That's it! You now have your own ChatGPT running locally.

🤖 Model Management (Optional)

Want to download models ahead of time or manage your local AI models? LocalLab includes powerful model management:

# Discover available models
locallab models discover

# Download a model locally (faster startup)
locallab models download microsoft/phi-2

# List your cached models
locallab models list

# Get detailed model information
locallab models info microsoft/phi-2

📖 Learn More: See the Model Management Guide for complete documentation.

🧠 How LocalLab Works

LocalLab has four main components:

1. 🖥️ LocalLab Server (pip install locallab)

  • Runs AI models on your computer
  • Provides a web API for interactions
  • Handles model loading and optimization
  • Start with: locallab start

2. 💬 Chat Interface (Built-in)

  • Terminal-based ChatGPT-like experience
  • Real-time streaming responses
  • Multiple generation modes
  • Access with: locallab chat

3. 🤖 Model Management (Built-in)

  • Download and organize AI models locally
  • Discover available models from HuggingFace Hub
  • Manage disk space and cache cleanup
  • Use with: locallab models

4. 🐍 Python Client (pip install locallab-client)

  • Programmatic access for your code
  • Both sync and async support
  • Use with: client = SyncLocalLabClient("http://localhost:8000")

```mermaid
graph TD
    A[Terminal Chat] -->|Uses| C[LocalLab Server]
    B[Python Code] -->|Uses| C
    C -->|Runs| D[AI Models]
    C -->|Optional| E[Ngrok Tunnel]
    E -->|Access from| F[Any Device]
    style C fill:#2563eb,stroke:#1e40af,stroke-width:2px,color:#ffffff
    style D fill:#059669,stroke:#047857,stroke-width:2px,color:#ffffff
    style A fill:#7c3aed,stroke:#6d28d9,stroke-width:2px,color:#ffffff
    style B fill:#dc2626,stroke:#b91c1c,stroke-width:2px,color:#ffffff
    style E fill:#ea580c,stroke:#c2410c,stroke-width:2px,color:#ffffff
    style F fill:#0891b2,stroke:#0e7490,stroke-width:2px,color:#ffffff
```

🌟 The Magic: Use --use-ngrok to access your AI from anywhere - your phone, another computer, or share with friends!

🎯 Key Features

📦 Easy Setup         🔒 Privacy First       🎮 Free GPU Access
🤖 Multiple Models    💾 Memory Efficient    🔄 Auto-Optimization
🗂️ Model Management   ⚡ Fast Response       🔧 Simple Server
🌐 Local or Colab     🔌 Client Package      🛡️ Secure Tunneling
🌍 Access Anywhere    📥 Offline Models      🧹 Cache Cleanup

Two-Part System:

  • LocalLab Server: Runs the AI models and exposes API endpoints
  • LocalLab Client: A separate Python package (pip install locallab-client) that connects to the server

Access From Anywhere: With built-in ngrok integration, you can securely access your LocalLab server from any device, anywhere in the world - perfect for teams, remote work, or accessing your models on the go.
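Because the same client code may target either the local server or an ngrok tunnel, it helps to resolve the server URL in one place. A minimal sketch, assuming a hypothetical LOCALLAB_URL environment variable (this variable is not part of LocalLab itself, just an illustration):

```python
import os

def resolve_server_url(default: str = "http://localhost:8000") -> str:
    """Return the LocalLab server URL to connect to.

    Reads a hypothetical LOCALLAB_URL environment variable (e.g. an
    ngrok URL such as https://abc123.ngrok.app) and falls back to the
    local default; any trailing slash is stripped either way.
    """
    return os.environ.get("LOCALLAB_URL", default).rstrip("/")

# Usage (SyncLocalLabClient comes from the locallab-client package):
# client = SyncLocalLabClient(resolve_server_url())
```

This keeps scripts portable between local runs and remote (Colab/ngrok) runs without editing the code.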

🌟 Two Ways to Run

  1. On Your Computer (Local Mode)

    💻 Your Computer
    └── 🚀 LocalLab Server
        └── 🤖 AI Model
            └── 🔧 Auto-optimization

  2. On Google Colab (Free GPU Mode)

    ☁️ Google Colab
    └── 🎮 Free GPU
        └── 🚀 LocalLab Server
            └── 🤖 AI Model
                └── ⚡ GPU Acceleration


📦 Installation & Setup


Windows Setup

  1. Install Required Build Tools

  2. Install Packages

    pip install locallab locallab-client
    
  3. Verify PATH

    • If locallab command isn't found, add Python Scripts to PATH:
      # Find Python location
      where python
      # This will show something like: C:\Users\YourName\AppData\Local\Programs\Python\Python311\python.exe
      

    Adding to PATH in Windows:

    1. Press Win + X and select "System"
    2. Click "Advanced system settings" on the right
    3. Click "Environment Variables" button
    4. Under "System variables", find and select "Path", then click "Edit"
    5. Click "New" and add your Python Scripts path (e.g., C:\Users\YourName\AppData\Local\Programs\Python\Python311\Scripts\)
    6. Click "OK" on all dialogs
    7. Restart your command prompt
    • Alternatively, use: python -m locallab start

๐Ÿ” Having issues? See our Windows Troubleshooting Guide

Linux/Mac Setup

# Install both server and client packages
pip install locallab locallab-client

2. Configure the Server (Recommended)

# Run interactive configuration
locallab config

# This will help you set up:
# - Model selection
# - Memory optimizations
# - GPU settings
# - System resources

3. Start the Server

# Start with saved configuration
locallab start

# Or start with specific options
locallab start --model microsoft/phi-2 --quantize --quantize-type int8
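Scripts that launch `locallab start` in the background may want to wait until the server's port (8000 by default, matching the client examples below) accepts connections before creating a client. A minimal sketch using only the standard library:

```python
import socket
import time

def wait_for_port(host: str = "localhost", port: int = 8000,
                  timeout: float = 60.0) -> bool:
    """Poll until a TCP connection to (host, port) succeeds, meaning
    the server is listening, or return False after `timeout` seconds."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            with socket.create_connection((host, port), timeout=1.0):
                return True
        except OSError:
            time.sleep(0.5)
    return False
```

Once `wait_for_port()` returns True, connecting a client should succeed immediately instead of racing the model load.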

💬 Terminal Chat Interface - Your Personal ChatGPT

The LocalLab Chat Interface is a powerful terminal-based tool that gives you a ChatGPT-like experience right in your command line. It's the easiest way to interact with your AI models.

🎯 Why Use the Chat Interface?

  • Instant AI Access - No coding required, just type and chat
  • Real-time Responses - See AI responses as they're generated
  • Rich Formatting - Markdown rendering with syntax highlighting
  • Smart Features - History, saving, batch processing, and more
  • Works Everywhere - Local, remote, or Google Colab

🚀 Getting Started

# Start your server
locallab start

# Open chat interface
locallab chat

✨ Key Features

| Feature | Description | Example |
| --- | --- | --- |
| Dynamic Mode Switching | Change generation mode per message | `Explain AI --stream` |
| Real-time Streaming | See responses as they're typed | Live text generation |
| Conversation History | Track and save your chats | `/history`, `/save` |
| Batch Processing | Process multiple prompts | `/batch` command |
| Remote Access | Connect to any LocalLab server | `--url https://your-server.com` |
| Error Recovery | Auto-reconnection and graceful handling | Seamless experience |

🎮 Interactive Commands

/help      # Show all available commands
/history   # View conversation history
/save      # Save current conversation
/batch     # Enter batch processing mode
/reset     # Clear conversation history
/exit      # Exit gracefully

🔄 Dynamic Mode Switching (New!)

Override the default generation mode for any message:

You: Write a story --stream          # Use streaming mode
🔄 Using stream mode for this message

You: Remember my name is Alice --chat # Use chat mode with context
🔄 Using chat mode for this message

You: What's 2+2? --simple            # Use simple mode
🔄 Using simple mode for this message

📱 Example Chat Session

$ locallab chat
🚀 LocalLab Chat Interface
✅ Connected to: http://localhost:8000
📊 Server: LocalLab v0.9.0 | Model: qwen-0.5b

You: Hello! Can you help me with Python?

AI: Hello! I'd be happy to help you with Python programming.
What specific topic would you like to explore?

You: Show me how to create a class --stream

AI: Here's how to create a simple class in Python:

```python
class Person:
    def __init__(self, name, age):
        self.name = name
        self.age = age

    def introduce(self):
        return f"Hi, I'm {self.name} and I'm {self.age} years old."

# Usage
person = Person("Alice", 25)
print(person.introduce())
```

You: /save
💾 Conversation saved to: chat_2024-07-06_14-30-15.json

You: /exit
👋 Goodbye!

๐ŸŒ Remote Access

Connect to any LocalLab server from anywhere:

# Connect to remote server
locallab chat --url https://abc123.ngrok.app

# Use with Google Colab
locallab chat --url https://your-colab-ngrok-url.app

📖 Complete Guide: See the Chat Interface Documentation for advanced features, examples, and troubleshooting.

๐Ÿ Python Client - Programmatic Access

For developers who want to integrate AI into their applications, LocalLab provides a powerful Python client package.

🎯 Two Ways to Use LocalLab

| Method | Best For | Getting Started |
| --- | --- | --- |
| Chat Interface | Interactive use, testing, quick questions | `locallab chat` |
| Python Client | Applications, scripts, automation | `from locallab_client import SyncLocalLabClient` |

📦 Synchronous Client (Recommended for Beginners)

from locallab_client import SyncLocalLabClient

# Connect to server - choose ONE of these options:
# 1. For local server (default)
client = SyncLocalLabClient("http://localhost:8000")

# 2. For remote server via ngrok (when using Google Colab or --use-ngrok)
# client = SyncLocalLabClient("https://abc123.ngrok.app")  # Replace with your ngrok URL

try:
    print("Generating text...")
    # Generate text
    response = client.generate("Write a story")
    print(response)

    print("Streaming responses...")
    # Stream responses
    for token in client.stream_generate("Tell me a story"):
       print(token, end="", flush=True)

    print("Chat responses...")
    # Chat with AI
    response = client.chat([
        {"role": "system", "content": "You are helpful."},
        {"role": "user", "content": "Hello!"}
    ])
    print(response.choices[0]["message"]["content"])

finally:
    # Always close the client
    client.close()

💡 Important: When connecting to a server running on Google Colab or with ngrok enabled, always use the ngrok URL (https://abc123.ngrok.app) that was displayed when you started the server.
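For multi-turn conversations with the chat endpoint, the message list grows over time: you append each user message and each assistant reply. A minimal sketch of that bookkeeping, using the same role/content message format as the examples above (the `generate_reply` callable is a stand-in for a real `client.chat` call, so this runs without a server):

```python
def make_history(system_prompt):
    """Start a conversation with a system message."""
    return [{"role": "system", "content": system_prompt}]

def add_turn(history, user_message, generate_reply):
    """Append the user message, get a reply via `generate_reply`
    (a stand-in for a real client.chat call), and record it."""
    history.append({"role": "user", "content": user_message})
    reply = generate_reply(history)
    history.append({"role": "assistant", "content": reply})
    return reply
```

With a real client you would pass something like `lambda msgs: client.chat(msgs)` (adapted to extract the content from the response) as `generate_reply`.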

Asynchronous Client Usage (For Advanced Users)

import asyncio
from locallab_client import LocalLabClient

async def main():
    # Connect to server - choose ONE of these options:
    # 1. For local server (default)
    client = LocalLabClient("http://localhost:8000")

    # 2. For remote server via ngrok (when using Google Colab or --use-ngrok)
    # client = LocalLabClient("https://abc123.ngrok.app")  # Replace with your ngrok URL

    try:
        print("Generating text...")
        # Generate text
        response = await client.generate("Write a story")
        print(response)

        print("Streaming responses...")
        # Stream responses
        async for token in client.stream_generate("Tell me a story"):
            print(token, end="", flush=True)

        print("\nChatting with AI...")
        # Chat with AI
        response = await client.chat([
            {"role": "system", "content": "You are helpful."},
            {"role": "user", "content": "Hello!"}
        ])
        # Extracting Content
        content = response['choices'][0]['message']['content']
        print(content)
    finally:
        # Always close the client
        await client.close()

# Run the async function
asyncio.run(main())

๐ŸŒ Google Colab Usage with Remote Access

Step 1: Set Up the Server on Google Colab

First, you'll set up the LocalLab server on Google Colab to use their free GPU:

# In your Colab notebook:

# 1. Install the server package
!pip install locallab

# 2. Configure with CLI (notice the ! prefix)
!locallab config

# 3. Start server with ngrok for remote access
!locallab start --use-ngrok

# The server will display a public URL like:
# 🚀 Ngrok Public URL: https://abc123.ngrok.app
# COPY THIS URL - you'll need it to connect!

Step 2: Connect to Your Server

After setting up your server on Google Colab, connect to it using the LocalLab client package. The server displays an ngrok URL that you'll use for the connection.

Using the Client Connection Examples

You can now use the client connection examples from the Client Connection & Usage section above.

Just make sure to:

  1. Use your ngrok URL instead of localhost
  2. Install the client package if needed

For example:

# In another cell in the same Colab notebook:

# 1. Install the client package
!pip install locallab-client

# 2. Import the client
from locallab_client import SyncLocalLabClient

# 3. Connect to your ngrok URL (replace with your actual URL from Step 1)
client = SyncLocalLabClient("https://abc123.ngrok.app")  # ← REPLACE THIS with your URL!

# 4. Now you can use any of the client methods
response = client.generate("Write a poem about AI")
print(response)

# 5. Always close when done
client.close()

Access From Any Device

The power of using ngrok is that you can connect to your Colab server from anywhere:

# On your local computer, phone, or any device with Python:
pip install locallab-client

from locallab_client import SyncLocalLabClient
client = SyncLocalLabClient("https://abc123.ngrok.app")  # ← REPLACE THIS with your URL!
response = client.generate("Hello from my device!")
print(response)
client.close()

💡 Remote Access Tip: The ngrok URL lets you access your LocalLab server from any device - your phone, tablet, another computer, or share with teammates. See the Client Connection & Usage section above for more examples of what you can do with the client.

💻 Requirements

Local Computer

  • Python 3.8+
  • 4GB RAM minimum (8GB+ recommended)
  • GPU optional but recommended
  • Internet connection for downloading models

Google Colab

  • Just a Google account!
  • Free tier works fine

🌟 Features

  • Easy Setup: Just pip install and run
  • Multiple Models: Use any Hugging Face model
  • Resource Efficient: Automatic optimization
  • Privacy First: All local, no data sent to cloud
  • Free GPU: Google Colab integration
  • Flexible Client API: Both async and sync clients available
  • Automatic Resource Management: Sessions close automatically
  • Remote Access: Access your models from anywhere with ngrok integration
  • Secure Tunneling: Share your models securely with teammates or access from mobile devices
  • Client Libraries: Python libraries for both synchronous and asynchronous usage
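The "sessions close automatically" point above still depends on `close()` being reached; a context manager makes that automatic. A sketch using `contextlib.closing`, which works with any object exposing `close()` (shown with a stand-in class so the snippet runs without a server; with the real package you would wrap `SyncLocalLabClient(...)` instead):

```python
from contextlib import closing

class FakeClient:
    """Stand-in for SyncLocalLabClient so this sketch runs offline."""
    def __init__(self):
        self.closed = False
    def generate(self, prompt):
        return "response to: " + prompt
    def close(self):
        self.closed = True

# closing() calls .close() on exit, even if an exception is raised,
# replacing the explicit try/finally from the earlier examples:
with closing(FakeClient()) as client:
    text = client.generate("Hello")
```

This is equivalent to the try/finally pattern shown in the client examples, just more concise.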

๐ŸŒ Client-Server Architecture

```mermaid
graph LR
    A[Your Application] -->|Uses| B[LocalLab Client]
    B -->|API Requests| C[LocalLab Server]
    C -->|Runs| D[AI Models]
    C -->|Optional| E[Ngrok Tunnel]
    E -->|Remote Access| F[Any Device, Anywhere]
    style A fill:#7c3aed,stroke:#6d28d9,stroke-width:2px,color:#ffffff
    style B fill:#dc2626,stroke:#b91c1c,stroke-width:2px,color:#ffffff
    style C fill:#2563eb,stroke:#1e40af,stroke-width:2px,color:#ffffff
    style D fill:#059669,stroke:#047857,stroke-width:2px,color:#ffffff
    style E fill:#ea580c,stroke:#c2410c,stroke-width:2px,color:#ffffff
    style F fill:#0891b2,stroke:#0e7490,stroke-width:2px,color:#ffffff
```

โžก๏ธ See All Features

📚 Documentation

🚀 Getting Started

| Guide | Description |
| --- | --- |
| Installation & Setup | Complete installation guide for all platforms |
| CLI Overview | Command-line interface documentation |
| Chat Interface | Terminal chat features and examples |

💻 Using LocalLab

| Guide | Description |
| --- | --- |
| CLI Reference | Complete command documentation |
| Model Management | Download and organize AI models |
| Python Client | Programmatic access guide |
| API Reference | HTTP API documentation |

๐ŸŒ Deployment & Advanced

| Guide | Description |
| --- | --- |
| Google Colab Setup | Free GPU deployment guide |
| Troubleshooting | Common issues and solutions |
| Advanced Features | Power user features |

๐Ÿ” Need Help?

📖 Additional Resources

🌟 Star Us!

If you find LocalLab helpful, please star our repository! It helps others discover the project.


Made with ❤️ by Utkarsh Tiwari • GitHub • Twitter • LinkedIn


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

locallab-0.11.0.tar.gz (118.1 kB)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

locallab-0.11.0-py3-none-any.whl (114.1 kB)

Uploaded Python 3

File details

Details for the file locallab-0.11.0.tar.gz.

File metadata

  • Download URL: locallab-0.11.0.tar.gz
  • Upload date:
  • Size: 118.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.9.23

File hashes

Hashes for locallab-0.11.0.tar.gz

| Algorithm | Hash digest |
| --- | --- |
| SHA256 | f4d9530233c91d57dbbf289fcb21266c9c916b5b42d2754cea17e4f3f2ad1aae |
| MD5 | 40cf1647eb6e0acc041d381d171833dc |
| BLAKE2b-256 | 8432effa2a5f17316949212f6be08bbf5306f7d872c72759e53417f65f1c54aa |


File details

Details for the file locallab-0.11.0-py3-none-any.whl.

File metadata

  • Download URL: locallab-0.11.0-py3-none-any.whl
  • Upload date:
  • Size: 114.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.9.23

File hashes

Hashes for locallab-0.11.0-py3-none-any.whl

| Algorithm | Hash digest |
| --- | --- |
| SHA256 | e1308a673571ec7cde7b4a7fcf40d6374d67f92f73dd55a658833a09c613747a |
| MD5 | 795df28dae0deb20c44a5eae2e2faf85 |
| BLAKE2b-256 | e345063a5eda1c3810bee928f15445dfb81a8db8f8a09d6df3f0b530ac38e92b |

