SIGNAL.DAT // Specialized Musicology Datasets for Audio AI

specialized training data

High-Fidelity Musicology Datasets for Generative Audio AI

Training competitive music generators and recommendation engines requires structured, precise audio annotations made by real musicians. We supply human-annotated, multi-dimensional musicology datasets — every record reviewed and QA-certified before export.

Explore Dataset Preview How We Annotate

HUMAN ANNOTATED MUSICOLOGIST VERIFIED 4-PLANE NARRATIVE MELODIC CONTOUR LEXICON QA CERTIFIED ESSENTIA KEY+BPM SECTION-LEVEL DETAIL STRUCTURED JSON EXPORT HUMAN ANNOTATED MUSICOLOGIST VERIFIED 4-PLANE NARRATIVE MELODIC CONTOUR LEXICON QA CERTIFIED ESSENTIA KEY+BPM SECTION-LEVEL DETAIL STRUCTURED JSON EXPORT

Dataset Distribution & Metrics // Live Analytics

ANALYTICS_DASHBOARD.LOG

TOTAL TRACKS 0

TOTAL ANNOTATED TIME 0h 0m

AVERAGE BPM 0

UNIQUE INSTRUMENTS 0

Top Instruments Distribution

Top Vibe Tags

Key Frequencies

The Annotation Pipeline // Verified Local & Semantic Metadata

SIGNAL_PROCESS.LOG

Acoustic Verification

Every track is analyzed locally using Essentia.js WASM extractors. Key and BPM are extracted directly from the PCM audio buffer and verified against the annotation before the record is accepted.

Human Expert Annotation

A trained musicologist listens to each track and manually annotates every section: instrumentation, melodic contour, vibe, narrative description, and production characteristics. No AI generates the annotations — every judgment is human.

QA Review & Export

Every annotation is reviewed against strict quality constraints before approval: no cross-section references, complete instrument lists, validated vibe tags, and mandatory production assessment. Approved records are exported as structured JSON.

Dataset Anatomy // Rich Structured Annotations

METRIC_SCHEMAS.JSON

4-Plane Narrative Timeline

Every timeline block contains a strict 4-sentence structure describing: 1) Foreground melody/vocals, 2) Middle-ground keys/rhythm, 3) Background bass/drums, and 4) Dynamic energy shifts.

Mix & Production Parameters

Detailed semantic tracking of panning distributions, sidechaining, effects (reverb/delay wetness), harmonic saturation, EQ filtering, and overall fidelity standards (studio vs. Lo-Fi).

Melodic Contour Lexicon

Vocals and instruments are mapped using formal contours (undulating, conjunct, disjunct, static, ascending/descending arches) allowing generative models to train on target melodic curves.

Rigorous QA Constraints

Strict QA checks guarantee 0:00 alignment, prevent cross-reference shortcuts ("same as", "repeats"), mandate specific outro closures, and verify instrument presence per-block.

Interactive Database Explorer // Explore Metadatos

EXPLORER_WIDGET.EXE

0 matching tracks found

Bohemian Rhapsody

Queen

SIG-000000 Ab Minor 120 BPM 0:00

Global Characteristics

Genre & Era: ...

Groove & Rhythm: ...

Harmony: ...

Instrumentation: ...

Wow Factor: ...

Narrative Timeline Segmentations

Acoustic & Production Specs

Mix & Production: ...

Sonic Fidelity: ...

Vocal Expression: ...

Lyrical Evocation: ...

dataset_record_preview.json

Loading database records...

Licensing & Datasets // Pricing Models

PRICING_TIERS.DAT

SAMPLE.DAT

$0 / free

50 full track JSON entries
Human-annotated musicology
Mix & timeline segmentations
Evaluation license

STARTER.DAT

$599 / one-time

200 track dataset
Complete musicology annotations
Melodic contour & vocal style logs
Commercial training license

★ Recommended FULL.DAT

$1,499 / one-time

Full 710-track catalog
Verified clean provenance
Embeddings + stems metadata
Commercial training license
Regular catalog updates

ENTERPRISE.DAT

$2k+ / custom

Custom catalog & volume
Real-time annotation API access
Custom musicological constraints
Exclusive licensing options
Dedicated curation team support