Skip to main content

Nano Llama

Project description

nanollama

A compact and efficient implementation of the Llama 3.2 in a single file, featuring minimal dependencies—no transformers library required, even for tokenization.

Overview

nanollama provides a lightweight and straightforward implementation of the Llama model. It features:

  • Minimal dependencies
  • Easy-to-use interface
  • Efficient performance suitable for various applications

Quick Start

To get started, clone this repository and install the necessary packages.

pip install nanollama

Here’s a quick example of how to use nanollama32:

>>> from nanollama import Chat

# Initialize the chat instance
>>> chat = Chat()

# Start a conversation
>>> chat("What's the weather like in Busan?")
# Llama responds with information about the weather

# Follow-up question that builds on the previous context
>>> chat("And how about the temperature?")
# Llama responds with the temperature, remembering the previous context

# Another follow-up, further utilizing context
>>> chat("What should I wear?")
# Llama suggests clothing based on the previous responses

Command-Line Interface

You can also run nanollama from the command line:

nlm how to create a new conda env
# Llama responds with ways to create a new conda environment and prompts the user for further follow-up questions

Managing Chat History

  • --history: Specify the path to the JSON file where chat history will be saved and/or loaded from. If the file does not exist, a new one will be created.
  • --resume: Use this option to resume the conversation from a specific point in the chat history.

For example, you can specify 0 to resume from the most recent entry:

nlm "and to list envs?" --resume 0

Or, you can resume from a specific entry in history:

nlm "and to delete env?" --resume 20241026053144

Adding Text from Files

You can include text from any number of external files by using the {...} syntax in your input. For example, if you have a text file named langref.rst, you can include its content in your input like this:

nlm to create reddit bots {langref.rst}

License

This project is licensed under the MIT License. See the LICENSE file for more details.

Acknowledgements

This project builds upon the MLX implementation and Karpathy's LLM.c implementation of the Llama model. Special thanks to the contributors of both projects for their outstanding work and inspiration.

Contributing

Contributions are welcome! Feel free to submit issues or pull requests.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nanollama-0.0.6.tar.gz (11.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nanollama-0.0.6-py3-none-any.whl (12.5 kB view details)

Uploaded Python 3

File details

Details for the file nanollama-0.0.6.tar.gz.

File metadata

  • Download URL: nanollama-0.0.6.tar.gz
  • Upload date:
  • Size: 11.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for nanollama-0.0.6.tar.gz
Algorithm Hash digest
SHA256 4ffd4cbc018cee1300b5f1945c61e1a83d52acfd8cd323382be5a3e6a724cffa
MD5 fc0883f652f4e1b772f31cb048b7dc29
BLAKE2b-256 08d4d6d9819edb30549aaf217d7c86d3c3ceba4e7629f074959fad59f685ca5a

See more details on using hashes here.

File details

Details for the file nanollama-0.0.6-py3-none-any.whl.

File metadata

  • Download URL: nanollama-0.0.6-py3-none-any.whl
  • Upload date:
  • Size: 12.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for nanollama-0.0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 ec78db2e3543edd9776394927e333b17edd3e92678aa18327d6f45fbc24e24bd
MD5 a66c753800fab95480b36116541b3dfa
BLAKE2b-256 39f39eb10b07b64d9ceaf271072d329f8e23d2cb5886fe120c2ae74fa54e1ded

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page