AgentOS - TGSC
Project description
AgentOS
A minimal, production-ready implementation of Andrej Karpathy's Agent Operating System architecture, developed by Swarms.ai and partners.
Overview
AgentOS is a lightweight, single-file implementation that provides a robust foundation for building autonomous AI agents. It implements the core concepts outlined in Karpathy's Agent OS architecture while maintaining simplicity and extensibility. Developed by Swarms.ai and its partners, AgentOS is a production-ready implementation of autonomous AI agents that follows the architectural principles outlined by Andrej Karpathy.
Features
- Unified Model Interface: Seamless integration with multiple LLM providers through LiteLLM
- Support for Anthropic Claude models (Opus, Sonnet, Haiku)
- Integration with OpenAI GPT models
- Access to optimized variants (GPT-4o, GPT-4o-mini)
- Browser Automation: Built-in browser agent capabilities for web interaction using browser-use
- Multi-Modal Support:
- Text processing and generation
- Video analysis through Google's Gemini models
- Audio processing and speech synthesis
- Image handling capabilities
- Resource Management:
- Efficient handling of computational resources
- Dynamic model selection based on task requirements
- Automatic GPU/CPU optimization
- HuggingFace Integration:
- Direct access to open-source models
- Support for text generation and multiple NLP tasks
- Automatic model quantization and optimization
- Extensible Architecture: Easy to add new capabilities and tools
Core Components
- Model Management: Dynamic selection and utilization of language models
- Browser Automation: Autonomous web-based task execution
- Resource Orchestration: Efficient management of computational resources
- Context Management: Maintains system state and task dependencies
Installation
pip3 install -U agentos-sdk
Usage
from agentos import AgentOS
# Initialize AgentOS
agent_os = AgentOS()
# Run a task
result = agent_os.run(
task="Your task description",
img="optional_image.jpg",
video="optional_video.mp4",
audio="optional_audio.mp3"
)
Available Tools
AgentOS comes with a powerful set of built-in tools that enable various capabilities. Here's a comprehensive list of all available tools:
| Tool Name | Description | Use Case Examples |
|---|---|---|
| Browser Agent | Autonomous web browser automation tool that can navigate websites, extract information, and perform web-based tasks | - Web scraping - Form filling - Data extraction - Website testing |
| Hugging Face Model | Interface for using various Hugging Face models for text generation and other NLP tasks | - Text generation - Language translation - Text classification - Custom model inference |
| LiteLLM Model | Unified interface for multiple LLM providers including OpenAI, Anthropic, and others | - Text generation - Chat completion - Content creation - Advanced reasoning |
| Safe Calculator | Secure mathematical expression evaluator with built-in safety checks | - Mathematical calculations - Formula evaluation - Secure computation - Numeric processing |
| Terminal Developer Agent | Advanced agent for performing terminal operations and development tasks | - File operations - Code execution - System commands - Development tasks |
| Generate Speech | Text-to-speech conversion tool supporting multiple voices and models | - Audio content creation - Voice synthesis - Accessibility features - Audio narration |
| Generate Video | AI-powered video generation tool using Google's Veo 3.0 model | - Video content creation - Visual storytelling - Animation generation - Creative content |
Community
Join our community of agent engineers and researchers for technical support, cutting-edge updates, and exclusive access to world-class agent engineering insights!
| Platform | Description | Link |
|---|---|---|
| 📚 Documentation | Official documentation and guides | docs.swarms.world |
| 📝 Blog | Latest updates and technical articles | Medium |
| 💬 Discord | Live chat and community support | Join Discord |
| Latest news and announcements | @kyegomez | |
| Professional network and updates | The Swarm Corporation | |
| 📺 YouTube | Tutorials and demos | Swarms Channel |
| 🎫 Events | Join our community events | Sign up here |
| 🚀 Onboarding Session | Get onboarded with Kye Gomez, creator and lead maintainer of Swarms | Book Session |
Contributing
We welcome contributions from the community. Please see our contributing guidelines for more information.
License
This project is under the MIT License.
Todo
- Add deep research agent or sub agent
- Implement video and audio processing
- Create better system prompt and add multiple shot examples on when to use certain tools and etc
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file agentos_sdk-0.0.3.tar.gz.
File metadata
- Download URL: agentos_sdk-0.0.3.tar.gz
- Upload date:
- Size: 25.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.3 CPython/3.12.3 Darwin/24.5.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c2cb61780057cb329067dac4c000ba47bd49cf88d67e14e92e1003be9541bb0a
|
|
| MD5 |
1fd0c01733e30ffaf5bb502157e90be5
|
|
| BLAKE2b-256 |
c93b65f0e669b981d6530f1dbb147befec61323d6024cb7b051904f00bc69a02
|
File details
Details for the file agentos_sdk-0.0.3-py3-none-any.whl.
File metadata
- Download URL: agentos_sdk-0.0.3-py3-none-any.whl
- Upload date:
- Size: 26.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.3 CPython/3.12.3 Darwin/24.5.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
10269cbfd51c20c401c8829d06a80bc2b2d19e8d200ec235e3de36d530ae50cd
|
|
| MD5 |
3a843d5d36ed9b72242d0246f686e92c
|
|
| BLAKE2b-256 |
54741b1d80d83cca653fddb8aaa2dc334594ac37ab17378f136d9e67b860c2c6
|