A Python package for the Gemini AI generative model
Project description
gemini_ai
gemini_ai
is a Python package that provides an interface to interact with Google’s Gemini AI model. This package enables advanced configuration of the generative model, file handling, media uploading, and response streaming, primarily optimized for Google Colab and Jupyter Notebook environments.
Features
- Flexible Model Configuration: Customize model settings like temperature, top-p, and top-k sampling for generation.
- File Upload and Media Support: Supports uploading various file types (image, audio, text) to Google Colab and Gemini API.
- Chat and Response Management: Easily manage chat sessions with token counting, response streaming, and history display.
- Environment-Specific Optimizations: Automatic detection of Google Colab and Jupyter Notebook environments for optimized performance.
Installation
To install gemini_ai
and its dependencies, use:
pip install gemini_ai
This will install the required packages, including google-generativeai
, pillow
, ipywidgets
, and ipython
.
Usage
1. Initializing GeminiAI
First, you need a Google Gemini API key. Set up your account and get an API key from Google Cloud.
from gemini.gemini import GeminiAI
# Initialize GeminiAI with your API key
gemini = GeminiAI(api_key="YOUR_API_KEY")
2. Configuring the Model
You can configure the model with parameters such as temperature
, top_p
, top_k
, and max_output_tokens
for tailored response generation.
gemini.config(temp=0.7, top_p=0.9, top_k=50, max_output_tokens=1024)
3. Starting a Chat Session
To start a chat session with the AI model, provide an initial instruction. If you’re working in Google Colab, you can also upload files as part of the chat context.
gemini.start_chat(instruction="Tell me about the latest in AI technology.")
4. Sending Messages and Generating Content
Once a session is started, you can send prompts to the AI model and retrieve responses. The send_message
function is useful for quick interactions, while generate
can be used for more complex responses with optional streaming.
# Send a simple message
gemini.send_message("What are the recent advancements in AI?")
# Generate a more elaborate response with optional streaming
gemini.generate(prompt="Can you write a story about space exploration?", stream=True)
5. Handling File Uploads (Google Colab Only)
In Google Colab, you can upload files directly to the Colab environment or to the Gemini API.
Uploading to Colab
file_path = gemini.upload() # Uploads a file in Colab and returns the file path
Uploading to Gemini API
file_uri = gemini.upload_to_gemini(path=file_path, mime_type="text/plain")
print(f"File URI: {file_uri}")
6. Managing Chat History and Token Counts
You can display the chat history or count tokens in the chat session to manage usage effectively.
# Display chat history
gemini.history()
# Count tokens in the chat history
gemini._token_counts()
Environment-Specific Features
GeminiAI
optimizes certain features based on the runtime environment. Here are some environment-specific details:
- Google Colab: Supports file uploads directly to Colab and uses
google.colab
utilities. - Jupyter Notebook: Limits file upload functionality, skipping Colab-specific features gracefully.
Class and Method Overview
Class: GeminiAI
__init__(api_key: str, gemini_model: str = 'gemini-1.5-flash-latest')
Initializes the GeminiAI
object with an API key and model name.
- api_key (str): Your API key for Gemini AI.
- gemini_model (str): Specifies the model version. Default is
'gemini-1.5-flash-latest'
.
config(temp: Optional[int] = 1, top_p: Optional[float] = 0.95, top_k: Optional[int] = 64, max_output_tokens: Optional[int] = 8192, response_mime_type: str = "text/plain", stream: bool = True, silent: bool = True)
Configures the model settings with adjustable parameters.
start_chat(instruction: [str], file_path: Optional[str] = None, meme_type: Optional[str]="text/plain")
Starts a new chat session with the AI, with optional file input.
send_message(prompt: str, stream: bool = False)
Sends a text prompt to the AI and retrieves a response, with optional streaming.
generate(prompt: str, stream: bool = True, chunk_size: int = 80)
Generates content from a prompt, with support for chunked streaming.
upload() -> str
Uploads a file in Google Colab and returns the file path. Raises an error if not in Colab.
upload_to_gemini(path, mime_type=None)
Uploads the specified file directly to the Gemini API.
history()
Displays the chat session history.
_token_counts()
Counts tokens in the entire chat session history for API usage management.
MIME Types Supported
This package supports various MIME types for file uploads:
- Image:
image/jpeg
,image/png
,image/gif
,image/webp
,image/heic
,image/heif
- Audio:
audio/wav
,audio/mp3
,audio/aiff
,audio/aac
,audio/ogg
,audio/flac
- Text:
text/plain
,text/html
,text/css
,text/javascript
,application/json
,text/markdown
Running Tests
To test the gemini_ai
package, use pytest
with coverage:
python3.10 -m pytest --cov=gemini --cov-report=term-missing
Example Code
Here’s a complete example demonstrating the initialization, configuration, chat session setup, and file upload:
from gemini.gemini import GeminiAI
# Initialize the AI with your API key
gemini = GeminiAI(api_key="YOUR_API_KEY")
# Configure model settings
gemini.config(temp=0.7, top_p=0.9)
# Start a chat session
gemini.start_chat(instruction="Tell me about recent advancements in AI")
# Send a prompt and generate a response
gemini.send_message("What's the future of AI?")
gemini.generate("Can you explain the role of AI in healthcare?")
# Display chat history
gemini.history()
# Upload a file to Gemini (Colab only)
file_uri = gemini.upload_to_gemini("/path/to/your/file.txt")
print(f"File uploaded to Gemini with URI: {file_uri}")
Contribution
Contributions to gemini_ai
are welcome! Please feel free to open issues or submit pull requests on the GitHub repository.
License
This project is licensed under the MIT License. See the LICENSE file for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file gemini_ai-0.0.5.tar.gz
.
File metadata
- Download URL: gemini_ai-0.0.5.tar.gz
- Upload date:
- Size: 9.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.10.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 351c1ba23c8d129370d149fe824e56c7a3e51c0724285fd7babbbac265a1d17b |
|
MD5 | a9cc0e59b687490a689b19ac30b0ee09 |
|
BLAKE2b-256 | b1381b6d2a854504e225eabfe711f00c55e27011e8750d5eb8a0991c69ac3890 |
File details
Details for the file gemini_ai-0.0.5-py3-none-any.whl
.
File metadata
- Download URL: gemini_ai-0.0.5-py3-none-any.whl
- Upload date:
- Size: 8.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.10.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | cb7198cc2d30ed23a5c524a65cd8cfbf37492207025627eaf1d6ef58e222974b |
|
MD5 | 5a41b10b41d7c517b1a43ed1b79f23ff |
|
BLAKE2b-256 | 67c007e14a0d6b07facbb3722c66d80ca2ad5743fce31618685f5545d72edc28 |