MusicData Lab

Your Music Data, Finally Unified

Aggregating millions of streaming records from Spotify, Apple Music, YouTube and more into a unified royalty data integration system.

Use Cases

Prototypes for Every Corner of the Music Industry

Solutions for Labels
Stop drowning in distributor spreadsheets. We build custom prototypes that ingest messy data from ADA, Orchard, or Merlin and turn it into real-time profitability dashboards. We can even integrate AI models to predict catalog performance or automate artist statement generation.
Solutions for Publishers
Bridge the gap between ISRC and ISWC automatically. We build POCs for automated CWR/DDEX parsing and AI-driven auditing tools to find unclaimed royalties. Turn weeks of manual data verification into minutes of automated precision.
Solutions for Distributors
Scale your delivery pipeline with custom ingestion engines. We prototype automated metadata validation tools and AI-powered sonic analysis for auto-tagging genres, moods, and BPMs, ensuring your deliveries to DSPs are flawless and enriched.
Solutions for Rights Owners & Managers
Whether managing a legacy estate or a boutique sync library, we build the tech to track your assets. From AI-based 'sounds-like' search engines for sync, to custom audit tools that cross-reference performance logs, we build the bespoke tools that off-the-shelf software misses.
By the Numbers

Built for Scale

€5,490

Flat price (no surprises)

3

Music & Tech Experts Involved

14

Days to Deliver Project

8

Working days total

Approaches

How Does It Compare?

There are many ways to handle royalty data. Only one scales reliably.
Approach | Setup | Maintenance | Format Changes | Scales
Manual spreadsheets | None | Hours/month | Breaks silently | More hours per source
Generic ETL | Medium | Low | Limited | Only if connector exists
Custom Python scripts | High | High/fragile | Depends on dev | New script per source
Adapter-based pipeline | High upfront | Low | Adapter update only | Add adapter, done
Supported Platforms

17+ Distributors and Growing

Each distributor gets a dedicated adapter that handles its quirks. Adding a new one takes 2-3 hours.
Major Distributors
ADA (Warner), The Orchard (Sony), Merlin, FUGA (Downtown), Believe, TuneCore, DistroKid, CD Baby
Platforms
Bandcamp, Qello, SoundCloud, YouTube Music, TikTok, Meta
Regional
Phonofile (Nordic), Zebralution (Germany), IDOL (France), Altafonte (Spain/LatAm)
Architecture

Universal Base + Specialised Adapters

Every distributor report maps to 20 standardised fields. The adapter handles the quirks; downstream systems get clean, uniform data.
Identification
Artist, Track, Product, Label, UPC, EAN, ISRC
Location
Country code (ISO), Territory
Time
Year, Month, Period begin/end
Financial
Currency, Units, Unit price, Income, Income type, Retailer, Service type
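The base-plus-adapter idea above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the MusicData Lab implementation: the class names (`RoyaltyRecord`, `DistributorAdapter`, `ExampleCsvAdapter`), the raw column names, and the sample row are all hypothetical. Only the 20 field names come from the list above, with "Period begin/end" counted as two fields.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass, fields
from datetime import date
from decimal import Decimal


@dataclass(frozen=True)
class RoyaltyRecord:
    """The 20 standardised fields, grouped as listed on this page."""
    # Identification
    artist: str
    track: str
    product: str
    label: str
    upc: str
    ean: str
    isrc: str
    # Location
    country_code: str
    territory: str
    # Time
    year: int
    month: int
    period_begin: date
    period_end: date
    # Financial
    currency: str
    units: int
    unit_price: Decimal
    income: Decimal
    income_type: str
    retailer: str
    service_type: str


class DistributorAdapter(ABC):
    """Base class: one subclass per distributor report format."""

    @abstractmethod
    def to_record(self, row: dict) -> RoyaltyRecord:
        """Map one raw report row to the standard record."""


class ExampleCsvAdapter(DistributorAdapter):
    """Hypothetical adapter; the column names below are invented."""

    def to_record(self, row: dict) -> RoyaltyRecord:
        begin = date.fromisoformat(row["Start Date"])
        units = int(row["Qty"])
        income = Decimal(row["Net Revenue"])
        return RoyaltyRecord(
            artist=row["Artist Name"].strip(),
            track=row["Track Title"].strip(),
            product=row["Release"].strip(),
            label=row["Label"].strip(),
            upc=row["UPC"],
            ean=row["EAN"],
            # Quirk handling: strip hyphens and normalise case.
            isrc=row["ISRC"].replace("-", "").upper(),
            country_code=row["Country"].upper(),
            territory=row["Territory"],
            year=begin.year,
            month=begin.month,
            period_begin=begin,
            period_end=date.fromisoformat(row["End Date"]),
            currency=row["Currency"].upper(),
            units=units,
            # Derive the unit price; guard against zero-unit rows.
            unit_price=income / units if units else Decimal("0"),
            income=income,
            income_type=row["Type"],
            retailer=row["Store"],
            service_type=row["Service"],
        )


# Made-up sample row, for illustration only.
raw_row = {
    "Artist Name": " Example Artist ", "Track Title": "Example Track",
    "Release": "Example Album", "Label": "Example Label",
    "UPC": "000000000000", "EAN": "0000000000000", "ISRC": "xx-abc-24-00001",
    "Country": "gb", "Territory": "Europe",
    "Start Date": "2024-03-01", "End Date": "2024-03-31",
    "Currency": "gbp", "Qty": "250", "Net Revenue": "12.50",
    "Type": "Stream", "Store": "Example Store", "Service": "Subscription",
}
record = ExampleCsvAdapter().to_record(raw_row)
print(record.isrc, record.unit_price, record.currency)
```

In a design like this, supporting a new distributor means writing one more subclass of the base adapter, and downstream code only ever sees `RoyaltyRecord`.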
Built-in Tools

More Than a Data Pipeline

MusicData Lab comes with specialized tools that solve real problems music companies face every day.
  • Scout: Batch ISRC Enrichment
    Upload 40,000 tracks, get back clean ISRCs, artist data, and confidence scores. Matches against Spotify and MusicBrainz APIs. Days of manual work in minutes.
  • AI Audio Similarity Search
    Find sounds by how they sound, not how they're tagged. CLAP embeddings and vector search discover SFX by acoustic similarity in seconds.
  • AI Analytics Dashboard
    Ask plain-English questions like 'top 5 artists by income' and get instant charts. Powered by ClickHouse and a local LLM. No data leaves your servers.
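To make the similarity-search idea concrete, here is a rough sketch of ranking sounds by cosine similarity between embedding vectors. It assumes the embeddings already exist. In a real system they would come from a CLAP model and be stored in a vector index, whereas the three-dimensional vectors, file names, and function names below are invented for illustration, and the search is a plain brute-force scan.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # Treat a zero vector as similar to nothing.
    return dot / (norm_a * norm_b)


def most_similar(query, library, top_k=2):
    """Return the top_k (name, score) pairs, best match first."""
    scored = [
        (name, cosine_similarity(query, vector))
        for name, vector in library.items()
    ]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:top_k]


# Toy embeddings (hypothetical); real audio embeddings have hundreds of dimensions.
library = {
    "door_slam.wav": [0.9, 0.1, 0.0],
    "thunder_clap.wav": [0.7, 0.6, 0.1],
    "bird_song.wav": [0.0, 0.2, 0.9],
}
query = [0.8, 0.3, 0.0]  # Embedding of the sound being searched for.

results = most_similar(query, library)
for name, score in results:
    print(f"{name}: {score:.3f}")
```

A brute-force scan like this is fine for a small library. Larger catalogues typically rely on an approximate nearest-neighbour index, which is where the "vector search" part comes in.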
Experimental

AI-Native Tools for Music Data

We're exploring a new way to interact with music data: open-source MCP (Model Context Protocol) servers that let AI assistants query, analyse, and manage your catalogue directly. No dashboards, no clicks. Just ask.
MCP Bandcamp
Query Bandcamp sales reports, revenue by artist, top items, and fee breakdowns — all from your AI assistant.
Open source, available on GitHub
MCP Metadata
Read and write audio metadata tags (title, artist, album, ISRC, genre) in MP3, FLAC, and OGG files programmatically.
Open source, available on GitHub
MCP FUGA
Query FUGA (Downtown) royalty reports, catalogue data, and distribution analytics through your AI assistant.
Coming Soon
Team

Two Experts. One Killer Combo.

A rare blend of deep technical skill and 30+ years of music industry knowledge. Together, they cover every angle of your data challenge.
Mariusz Smenżyk

Lead AI Developer & System Architect

PYTHON, DATA PIPELINES, AI/ML, ELASTICSEARCH, SYSTEM ARCHITECTURE

Experience

Full-stack developer and AI engineer with 15+ years of building production systems. Created the entire MusicData Lab pipeline from scratch, including 17+ distributor adapters, the Elasticsearch engine, and the automated ingestion workflow. Runs MusicTech Lab, a music and sport tech studio.

Role in Project

Designs and builds the complete data pipeline, from raw distributor files to clean, queryable data. Handles architecture decisions, adapter development, AI-powered data matching, and infrastructure on DigitalOcean.

Fatiha Ben Brahim

Music Rights & Metadata Strategist

DIGITAL RIGHTS MANAGEMENT, ROYALTY ADMINISTRATION, METADATA STRATEGY, RIGHTS-TECH

Experience

Over 30 years in music rights, royalty administration, and catalogue intelligence. Led global rights operations across major publishers and rights-tech organizations. Creator of MetadataIQ, a service focused on rights-data audits and valuation insights for rights-holders, investors, and musictech companies.

Role in Project

Ensures the data model captures what matters for royalty calculations and rights management. Validates field mappings, defines business rules for edge cases, and provides domain expertise that keeps the system aligned with how the music industry actually works.

Clients

Trusted by

Case Study

Universal Music Data Parser for 20+ Platforms

200M+ streaming records from Spotify, Apple Music, YouTube and 20+ distributors unified into a single queryable system. 25+ adapters handle format chaos (CSV, XLS, XLSX, XLSB), inconsistent naming, date quirks, and mixed currencies. Reports generated in under 15 seconds. Built for Cherry Red Records (UK).

Jack Clough

Digital Manager @ Cherry Red Records

Working with MusicTech Lab has been a pleasure. The team provided us with helpful regular updates across the development process, and went above and beyond in delivering additional features and ideas. Their ability to problem-solve and resolve any issues swiftly helped deliver an excellent final product.

Matt Bristow

Director @ Cherry Red Records

MusicTech Lab built us an incredibly useful piece of software and delivered on budget — doing a thoroughly professional and diligent job throughout the whole process.

Get Started

Struggling with Messy Royalty Data?

We built this system and we can tailor it for your label. Tell us about your data challenges.
Maciej Dulski

Business Partner

+48 661 713 000

Book a call
Questions

Got Questions? We've Got Answers