AudioMuse-AI - Where Music Takes Shape

July 25, 2026 · View on GitHub

GitHub license Latest Tag

⭐ Leave a star on this project: One shines alone; together, they make it visible and keep it alive.

💛 Donate to shape AudioMuse-AI future by supporting AI licenses, homelab infrastructure, and continuous development.

AudioMuse-AI - Where Music Takes Shape

AudioMuse-AI Logo

AudioMuse-AI is an opensource and self-hosted tool that uses sonic analysis to rediscover forgotten songs in your music library and generate groove-aware playlists that also capture the meaning behind each track, without relying on metadata or external APIs.

You can run it locally with Docker Compose or Podman, deploy it at scale in a Kubernetes cluster (AMD64 and ARM64 supported), or use native applications available for macOS, Windows, and Linux. It integrates with major self-hosted music servers including Navidrome, Jellyfin, LMS, Lyrion, Emby, and Plex, with more integrations planned.

Prefer not to self-host? Elestio offers AudioMuse-AI as a managed cloud service, and their YouTube video is a good introduction to the project and its features.

AudioMuse-AI lets you explore your music library in innovative ways, just start with an initial analysis, and you’ll unlock features like:

Multiple Music Servers (from v3.0.0): connect several media servers - any mix of Navidrome, Jellyfin, LMS, Lyrion, Emby and Plex - to a single AudioMuse-AI deployment. Built-in duplicate detection recognizes the same song across servers, so each track is analyzed only once and every server shares the result.
Clustering: Automatically groups sonically similar songs, creating genre-defying playlists based on the music's actual sound.
Instant Playlists: Simply tell the AI what you want to hear-like "high-tempo, low-energy music" and it will instantly generate a playlist for you.
Music Map: Discover your music collection visually with a vibrant, genre-based 2D map.
Playlist from Similar Songs: Pick a track you love, and AudioMuse-AI will find all the songs in your library that share its sonic signature, creating a new discovery playlist.
Song Paths: Create a seamless listening journey between two songs. AudioMuse-AI finds the perfect tracks to bridge the sonic gap.
Sonic Fingerprint: Generates playlists based on your listening habits, finding tracks similar to what you've been playing most often.
Song Alchemy: Mix your ideal vibe, mark tracks as "ADD" or "SUBTRACT" to get a curated playlist and a 2D preview. Export the final selection directly to your media server.
Text Search: search your song with simple text that can contains mood, instruments and genre like calm piano songs.
Lyrics Search: search your library by theme, story or meaning, like love songs, not just the sound.

Lyrics language support: the Lyrics Search feature works only with the 72 languages listed below.

Show the 72 supported languages

Afrikaans, Albanian, Arabic, Armenian, Azerbaijani, Basque, Belarusian, Bengali, Bulgarian, Burmese, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, Galician, Georgian, German, Greek, Gujarati, Haitian Creole, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Javanese, Kannada, Kazakh, Khmer, Korean, Lao, Latvian, Lithuanian, Macedonian, Malay, Malayalam, Marathi, Mongolian, Nepali, Norwegian, Persian, Polish, Portuguese, Punjabi, Romanian, Russian, Serbian, Sinhala, Slovak, Slovenian, Somali, Spanish, Swahili, Swedish, Tagalog, Tamil, Telugu, Thai, Turkish, Ukrainian, Urdu, Vietnamese, Welsh, Yoruba.

More information can be found in the docs folder: ARCHITECTURE, ALGORITHM DESCRIPTION, MULTIPLE MUSIC SERVERS, DEPLOYMENT STRATEGY, NAVIDROME SETUP, GPU DEPLOYMENT, CONFIGURATION PARAMETERS, AUTHENTICATION, PLUGINS, ERROR CODES and FAQ.

The full list of AudioMuse-AI related repository are:

AudioMuse-AI: the core application, it run Flask and Worker containers to actually run all the feature;

AudioMuse-AI Helm Chart: helm chart for easy installation on Kubernetes;

AudioMuse-AI Plugin for Navidrome: Navidrome Plugin;

AudioMuse-AI Plugin for Jellyfin: Jellyfin Plugin;

lyrion-audiomuseai-plugin: Unofficial Lyrion Plugin by JameZUK;

AudioMuse-AI MusicServer: Open Subosnic like Music Sever with integrated sonic functionality.

And now just some NEWS:

Version 3.0.0 added multiple music server support on a single deployment, with duplicate detection so a song shared by more servers is analyzed only once.

Version 2.6.0 added support for third party plugin. Give a look to the plugin documentation to know how to develop one and to the official 3rd party catalog. The plugin system requires a persistent volume mounted on both the Flask and worker containers, otherwise installed plugins are lost whenever the containers restart; the deployment example has been updated accordingly.

Version 2.5.0 added Plex Music Server support.

Disclaimer

Important

Despite the similar name, this project (AudioMuse-AI) is an independent, community-driven effort. It has no official connection to the website audiomuse.ai.

We are not affiliated with, endorsed by, or sponsored by the owners of audiomuse.ai.

Quick Start Deployment (Containerized)
Native Deployment
Hardware Requirements
Docker Image Tagging Strategy
How To Contribute
Code Mirror
Star History

Quick Start Deployment (Containerized)

Get AudioMuse-AI running in minutes with Docker Compose. For more deployment examples see the DEPLOYMENT page.

From v1.0.0, only PostgreSQL, Redis and TZ are configured via environment variables. Everything else is managed through the browser Setup Wizard and persisted in the database (legacy environment variables are imported automatically on first startup). The Setup Wizard is the landing page of a clean installation and stays available under Administration > Setup Wizard.

Prerequisites:

Docker and Docker Compose installed
A running media server (Navidrome, Jellyfin, Lyrion, Emby, or Plex)
See Hardware Requirements

Steps:

Create your environment file:
```
cp deployment/.env.example deployment/.env
```
You can customize the setup by editing deployment/.env before startup. As a minimum, it is suggested to change the default database user and password, but you can also override other PostgreSQL and Redis connection parameters if needed:
```
POSTGRES_PASSWORD=your-secure-password
```

Start the services:

docker compose -f deployment/docker-compose.yaml up -d

Access the application:
- Web UI: http://localhost:8000
- Interactive API documentation (Swagger UI): http://localhost:8000/apidocs/ (when authentication is enabled, log in via the Web UI first - /apidocs/ is gated by the same JWT cookie as the rest of the app.)
Run your first analysis:
- Navigate to "Analysis and Clustering" page
- Click "Start Analysis" to scan your library
- Wait for completion, then explore features like clustering and music map
Stopping the services:

docker compose -f deployment/docker-compose.yaml down

Important

AudioMuse-AI is designed to work with PostgreSQL v15 as in the deployment example. Different versions could cause errors.

Native Deployment

Prefer not to use Docker? We ship native packages for macOS, Linux and Windows, attached to each release. Each bundles the whole stack (embedded PostgreSQL, Redis, web UI and workers), so you don't need Docker or an external database. Once started, open http://127.0.0.1:8000.

The apps are not signed, so your OS may warn you on first launch, see the per-platform notes below for how to allow them.

macOS - Apple Silicon, AudioMuse-AI-arm64.zip (from v2.1.2)

Unzip and move AudioMuse-AI.app to /Applications.
Remove the quarantine flag (the app is unsigned), either way:
- Terminal: xattr -dr com.apple.quarantine /Applications/AudioMuse-AI.app, then double-click - the icon appears in your menu bar.
- No Terminal: double-click and dismiss the warning, then System Settings → Privacy & Security → "Open Anyway", authenticate, and launch again.
Runs only on Apple Silicon (ARM) on recent macOS (tested on macOS 15.3.1, Mac Mini M4 / 16 GB).

Files: data (database, Redis, temp audio) in ~/Library/AudioMuse-AI, log at ~/Library/Logs/AudioMuse-AI/audiomuse.log

Linux - x86_64 / arm64, .deb or .rpm (from v2.1.3)

Install as root (writes to /opt and the system app/service dirs):
- Debian/Ubuntu: sudo dpkg -i AudioMuse-AI-<arch>-linux.deb (where <arch> is x86_64 or aarch64)
- Fedora/RHEL: sudo rpm -i AudioMuse-AI-<arch>-linux.rpm (where <arch> is x86_64 or aarch64)
Run as your normal user (never with sudo/root - it stores data in your home and won't start as root):
- audiomuse-ai start (stop with audiomuse-ai stop), or auto-start on login with systemctl --user enable --now audiomuse-ai.
Verified on Debian 12 (bookworm) (glibc 2.36). The .rpm is the same payload, expected to work on recent Fedora / RHEL 9, but too old for RHEL/Rocky/Alma 8 (glibc 2.28). Feedback on RPM-based distros is welcome.

Files (under the launching user's home): data (database, Redis, temp audio) in ~/.local/share/AudioMuse-AI, log at ~/.local/state/AudioMuse-AI/logs/audiomuse.log (newest entries first)

Windows - x86_64, AudioMuse-AI-amd64-windows.zip (from v2.1.4)

Unzip the portable archive anywhere.
From a terminal you can start with AudioMuse-AI.exe start and stop with AudioMuse-AI.exe stop.
Runs only on x86_64 (Intel/AMD) on Windows 10/11.

Files: data (database, Redis, temp audio) in %LOCALAPPDATA%\AudioMuse-AI, log at %LOCALAPPDATA%\AudioMuse-AI\logs\audiomuse.log (newest entries first)

Important

Before updating a native version, first stop any running instance.

Hardware Requirements

AudioMuse-AI has been tested on:

Intel: HP Mini PC with Intel i5-6500, 16 GB RAM and NVMe SSD
ARM: Raspberry Pi 5, 8 GB RAM and NVMe SSD / Mac Mini M4 16GB / Amphere based VM with 4core 8GB ram

Minimum requirements:

CPU: 4-core Intel with AVX2 support (usually produced in 2015 or later) or ARM
RAM: 8 GB RAM
DISK: NVME SSD storage

For more information about the GPU deployment requirements have a look to the GPU page.

Important

If you use virtualization (e.g. Proxmox), make sure to pass through the host CPU. QEMU's virtual CPU lacks AVX2 support, which will prevent AudioMuse-AI from starting.

Docker Image Tagging Strategy

Our GitHub Actions workflow automatically builds and publishes Docker images with the following tags:

:latest Last released image. Use it for automatic update.
:X.Y.Z (e.g. :1.0.0, :0.1.4-alpha) Immutable images built from Git release tags. Recommended for most users. Pinned deployments: you decide when to update by changing the version manually.
:devel Build from main on each commit/pr merged. It's a less stable build. Recommended only for testing and early adopters.
:pr-<NUMBER> (e.g. :pr-661) Build generated for a specific open pull request (non-draft), to preview its changes before they are merged. For reviewing and testing that single PR
-noavx2 variants Experimental images for CPUs without AVX2 support, using legacy dependencies. Not recommended unless required for compatibility.
-nvidia variants Images that support the use of GPU for both Analysis and Clustering. Not recommended for old GPU.

Versioning is Major.Minor.Patch release. Eventually (rare) model change that could require a new analysis could happen in Major and Minor release. Read the release note before any update especially for Major and Minor release.

How To Contribute

Contributions, issues, and feature requests are welcome!

For more details on how to contribute please follow the Contributing Guidelines

Code Mirror

AudioMuse-AI repository code is mirrored here:

https://codeberg.org/NeptuneHub/AudioMuse-AI

DO NOT USE MIRROR TO RAISE ISSUE, PR OTHER ACTION DIFFERENT FROM GET THE CODE