Audio to JSON for AI

Drop a song and get structured data — BPM, key, loudness, duration, format — to paste into ChatGPT, Claude, Gemini, Grok, Perplexity, Suno or Udio so the AI actually understands the track. Everything runs in your browser; your file is never uploaded.

🎧

Drop an audio file or click to choose

It's analyzed on your device — nothing leaves your browser.

MP3 · WAV · OGG · FLAC · AAC · M4A

Analyzing…

audio

Include energy-over-time

Segments 16

Coarse loudness timeline that helps AI read the arrangement (intro / build / drop).

Tempo and key are estimated by in-browser DSP — accurate for most music with a clear beat and tonal center, less so for ambient, spoken word, or tempo-changing tracks. Loudness uses the ITU-R BS.1770-4 (LUFS) standard.

What's this for?

AI music tools like Suno and Udio — and chat models like ChatGPT, Claude, Gemini and Grok — can't "listen" to an audio file. If you want to recreate the feel of a reference track, or ask an AI to reason about it, you have to describe it. This tool does that for you: it measures the track's tempo, key, loudness and format locally in your browser and hands you clean JSON (or Markdown) you can paste straight into a chat. No account, files never leave your device. It works with any AI chat or music generator — ChatGPT, Claude, Gemini, Grok, Perplexity, Copilot, DeepSeek, Suno, Udio and more.

FAQ

Is my audio uploaded anywhere?▼

No. Decoding and analysis happen entirely in your browser with the Web Audio API. Your file never leaves your device.

How accurate are the BPM and key?▼

They're estimates from signal analysis — autocorrelation for tempo and a Krumhansl-Schmuckler chromagram for key (the same engines as our BPM Finder and Key Finder). For most music with a clear beat and tonal center they're reliable to within a BPM or two and the right key or its relative; ambient, spoken, or tempo-shifting material is harder. Loudness (LUFS) is a precise ITU-R BS.1770-4 measurement.

How do I use it with Suno or ChatGPT?▼

Pick JSON or Markdown (some chats read Markdown more cleanly), click "Copy for AI chats", paste it into the chat, and ask something like "describe the vibe of a track with these specs" or "write a Suno prompt for something similar." The structured fields give the model concrete, accurate detail to work from instead of guessing.

What fields are included?▼

Duration, sample rate, channels, file format and approximate bitrate; tempo (BPM); musical key, scale, Camelot code and a confidence score; and integrated LUFS loudness plus peak dBFS.