Music. Co-Creative. AI.
We read ML papers about music, try to actually understand them, and document what we learn. Interactive demos, code, and a lot of questions along the way.
Reading Schedule
What we're currently reading
Interactive Notes
Notes from past sessions
Music Generation
From masked acoustic tokens to anticipatory symbolic infilling
An overview of two approaches to music generation: VampNet's masked acoustic token modeling for audio-domain generation, and the Anticipatory Music Transformer's interleaved infilling for symbolic MIDI.
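For a flavor of the masked-token idea, here is a rough PyTorch sketch: mask a random subset of discrete codec tokens and train a bidirectional transformer to fill them back in. Every size here (codebook, model width, masking rate) is an illustrative assumption, not VampNet's actual configuration.

```python
# A minimal sketch of masked acoustic token modeling in the spirit of
# VampNet. All sizes below are placeholder assumptions.
import torch
import torch.nn as nn

VOCAB = 1024          # assumed codec codebook size
MASK_ID = VOCAB       # extra token id reserved for [MASK]
D_MODEL = 256

class MaskedTokenModel(nn.Module):
    def __init__(self, max_len=512):
        super().__init__()
        self.embed = nn.Embedding(VOCAB + 1, D_MODEL)  # +1 for [MASK]
        self.pos = nn.Parameter(torch.zeros(1, max_len, D_MODEL))
        layer = nn.TransformerEncoderLayer(D_MODEL, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=4)
        self.head = nn.Linear(D_MODEL, VOCAB)

    def forward(self, tokens):                          # (batch, time)
        x = self.embed(tokens) + self.pos[:, : tokens.size(1)]
        return self.head(self.encoder(x))               # (batch, time, VOCAB)

model = MaskedTokenModel()
tokens = torch.randint(0, VOCAB, (2, 128))   # stand-in codec tokens
mask = torch.rand(tokens.shape) < 0.5        # mask ~50% of positions
inputs = tokens.masked_fill(mask, MASK_ID)
logits = model(inputs)
# Cross-entropy only on the masked positions, as in masked modeling.
loss = nn.functional.cross_entropy(logits[mask], tokens[mask])
loss.backward()
```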
Music Transcription
From dual-objective CNNs to transformers to a 16K-parameter model
An overview of three papers that shaped the trajectory of automatic music transcription, covering Onsets and Frames, MT3, and the lightweight NMP (Basic Pitch) model, with interactive visualizations of the main ideas.
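The dual-objective idea from Onsets and Frames fits in a few lines: one shared acoustic trunk over a mel spectrogram, two binary heads (note onsets and active frames), one joint loss. This is a toy sketch with assumed layer sizes; the real model also runs BiLSTMs and conditions the frame head on onset predictions.

```python
# A minimal sketch of the Onsets and Frames dual objective.
# Shapes are illustrative; 229 mel bins and 88 keys follow the paper.
import torch
import torch.nn as nn

N_MELS, N_KEYS = 229, 88

class OnsetsAndFramesSketch(nn.Module):
    def __init__(self):
        super().__init__()
        self.trunk = nn.Sequential(
            nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(),
            nn.Conv2d(16, 16, 3, padding=1), nn.ReLU(),
        )
        self.onset_head = nn.Linear(16 * N_MELS, N_KEYS)
        self.frame_head = nn.Linear(16 * N_MELS, N_KEYS)

    def forward(self, mel):                    # mel: (batch, time, n_mels)
        x = self.trunk(mel.unsqueeze(1))       # (batch, 16, time, n_mels)
        x = x.permute(0, 2, 1, 3).flatten(2)   # (batch, time, 16 * n_mels)
        return self.onset_head(x), self.frame_head(x)

model = OnsetsAndFramesSketch()
mel = torch.randn(2, 100, N_MELS)
onset_logits, frame_logits = model(mel)
onset_tgt = torch.zeros(2, 100, N_KEYS)        # stand-in labels
frame_tgt = torch.zeros(2, 100, N_KEYS)
bce = nn.functional.binary_cross_entropy_with_logits
# Joint loss over both objectives: the "dual" in dual-objective.
loss = bce(onset_logits, onset_tgt) + bce(frame_logits, frame_tgt)
loss.backward()
```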
DDSP From Scratch
A minimal differentiable synthesizer in PyTorch (trying to understand the paper)
Building a differentiable digital signal processing synthesizer from scratch, with interactive visualizations and audio experiments from our reading group session on Engel et al. (2020).
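The core DDSP trick is a synthesizer built from differentiable ops, so gradients flow back into its controls. Below is a minimal additive sketch with assumed sizes and a stand-in loss; the paper also has filtered noise, reverb, and an encoder that predicts the controls.

```python
# A minimal sketch of DDSP's harmonic (additive) synthesizer: sinusoids
# at integer multiples of f0 with learnable per-harmonic amplitudes,
# built entirely from differentiable torch ops.
import torch

SR = 16000
N_HARMONICS = 32

def harmonic_synth(f0, amps, sr=SR):
    """f0: (time,) in Hz; amps: (n_harmonics,) linear amplitudes."""
    harmonics = torch.arange(1, amps.shape[0] + 1)          # 1, 2, 3, ...
    # Integrate instantaneous frequency to get phase, then scale per harmonic.
    phase = 2 * torch.pi * torch.cumsum(f0, dim=0) / sr     # (time,)
    phases = phase[:, None] * harmonics[None, :]            # (time, n_h)
    # Zero out harmonics above Nyquist to avoid aliasing.
    audible = (f0[:, None] * harmonics[None, :]) < (sr / 2)
    return (torch.sin(phases) * amps * audible).sum(dim=-1) # (time,)

f0 = torch.full((SR,), 220.0)                  # one second of A3
amps = torch.rand(N_HARMONICS, requires_grad=True)
audio = harmonic_synth(f0, amps)
loss = audio.pow(2).mean()                     # stand-in for a spectral loss
loss.backward()                                # gradients reach the amplitudes
```

Because every op is differentiable, swapping the stand-in loss for a multi-scale spectral loss against a target recording lets the amplitudes (or a network predicting them) be trained end to end.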
What we read
ML papers about music: synthesis, transcription, generative models, anything with a loss function and an audio output. We work through one or two a week.
How we share
Each session gets interactive notes with visualizations and audio demos (and usually some PyTorch). The goal is real intuition, not just a summary of the abstract.
Who it's for
Researchers, engineers, musicians, students. Anyone who wants to actually dig into the papers rather than just hear about them. Show up, ask questions, bring coffee.
Want in, have a paper to suggest, or want to present one?
Join the Google Group