Magma API Service

April 29, 2025 · View on GitHub

This directory contains a FastAPI server for the Microsoft Magma multimodal model. The API service provides endpoints for inference using the Magma model, including processing images and generating text and action responses.

Quick Start

The fastest way to run the server is:

./magma-server.sh run

This command will:

Create or activate a conda environment named "magma"
Install PyTorch first
Install the Magma package with all dependencies (including server requirements)
Start the server directly

Note: You must have conda (Miniconda or Anaconda) installed on your system. The script will automatically detect and use it.

All dependency versions are specified in pyproject.toml, making it easy to update them in a single place.

Directory Structure

The server is organized as follows:

main.py - Main FastAPI application for the Magma model
magma-server.sh - Unified script for managing all deployment methods
test_api.py - Script for testing the API
/docker/ - Files for Docker-based deployment
- Dockerfile - Container definition
- docker-compose.yml - Docker Compose configuration
/native/ - Files for native system service deployment
- run_magma_api.sh - Script to run the API directly
- manage_magma_service.sh - Script to manage the service
- magma-api.service - Systemd service definition

API Functionality

This API provides:

Vision and language processing via a REST API
Action prediction for robotics applications
Health check endpoint
Support for both base64-encoded images and file uploads

Installation & Usage

This server leverages the main Magma package and installs server-specific dependencies through the optional [server] dependencies in pyproject.toml. All server-specific dependencies are defined in one place, avoiding duplicate requirements files.

Note on Dependency Installation: Some dependencies like flash-attn require PyTorch to be installed first. Our installation scripts follow a simple two-step process: first install PyTorch, then install the rest of the dependencies. This simple approach ensures proper build order for packages that need PyTorch to be present during installation.

Prerequisites

NVIDIA GPU with CUDA support
Python 3.10+
Conda (Miniconda or Anaconda) for environment management

Using the Unified Management Script

We provide a unified script to manage all deployment methods:

# Quick start - simplest option
./magma-server.sh run            # Install dependencies and run directly

# For Docker deployment
./magma-server.sh docker up      # Start Docker container
./magma-server.sh docker down    # Stop Docker container
./magma-server.sh docker logs    # View Docker logs
./magma-server.sh docker build   # Build Docker image

# For native deployment
./magma-server.sh native setup    # Set up conda environment
./magma-server.sh native install  # Install as systemd service
./magma-server.sh native start    # Start the service
./magma-server.sh native stop     # Stop the service
./magma-server.sh native run      # Run directly without service

Manual Usage

Option 1: Using Docker

Navigate to the docker directory:
```
cd docker
```
Build and start the container:
```
docker compose up -d
```
Check the logs:
```
docker compose logs -f
```

Option 2: Running Directly

Install the package with server dependencies:

# From the repository root
pip install torch torchvision  # Install PyTorch first
pip install -e ".[server]"     # Then install Magma with server dependencies

Run the server directly:
```
cd server
python main.py
```

Test the API (in a separate terminal):

cd server
# Basic test - just check if the server is running
./test_api.py --url http://localhost:8080

# Full test with an image
./test_api.py --url http://localhost:8080 --image /path/to/image.jpg

Option 3: Installing as a Service

Navigate to the native directory:
```
cd native
```
Setup the conda environment:
```
./manage_magma_service.sh setup-conda
```

Edit the service file to replace the placeholder with your username:

# Replace USER with your username in magma-api.service
sed -i 's/User=USER/User=your_username/' magma-api.service
sed -i 's/Group=USER/Group=your_username/' magma-api.service

Install and start the service:

sudo ./manage_magma_service.sh install
sudo ./manage_magma_service.sh start

API Endpoints

The API will be available on port 8080 by default.

Health Check

GET /health

Predict from Base64 Image

POST /predict

Request body:

{
  "image": "base64_encoded_image_data",
  "prompt": "What can you see in this image and what action should I take?"
}

Predict from File Upload

POST /predict_from_file

Use multipart/form-data with:

file: Image file
prompt: Text prompt

Response Format

{
  "text_response": "Text description from the model",
  "normalized_actions": [-0.25, 0.42, 0.13, 0.0, 0.0, 0.0, 1.0],
  "delta_values": [-0.025, 0.042, 0.013, 0.0, 0.0, 0.0, 1.0]
}