TutorialMIDIAndrew Carlins7 min read

How to Clean Up MIDI After Converting Audio

Converting audio to MIDI gives you a first draft of the notes, not a finished part. Here is how to clean up MIDI after conversion: quantizing loose timing, fixing octave and stray-note errors, splitting parts onto their own tracks, and getting it ready for your DAW or notation program.

How to clean up MIDI after converting audio, quantizing loose timing, fixing octave and stray-note errors, and splitting parts onto separate tracks

Part of our guide to converting audio to MIDI.

When you convert a recording to MIDI, what comes back is a first draft of the notes, not a finished part. The notes are detected from sound, and that detection is imperfect, so the raw result usually has a few timing wobbles, a stray note or two, and the odd pitch caught an octave off. None of that means the conversion failed. It means there is a short cleanup pass between the conversion and a part you would actually play from or hand to another musician. Here is how to clean up MIDI after converting audio, step by step.

Why the first pass needs cleanup

Audio-to-MIDI conversion works by listening to a recording and estimating which notes are sounding, when they start, and how long they last. Because it is reading pitch and timing out of a messy real-world signal (reverb, overtones, overlapping instruments), the estimate is close but not exact. That is normal for the whole category, not a quirk of any one tool. The practical upshot is that you treat the converted file as a starting point: most of the notes are right, and your job is to find the handful that are not and tidy the timing. If you are new to the format itself, what is MIDI explains why MIDI is editable note data rather than a fixed audio recording, which is exactly what makes this cleanup possible. The same flow applies whether you started from a full mix or a single stem, and the broader path is covered in converting MP3 to MIDI.

Quantize the timing

The first cleanup move is usually quantizing. Quantizing snaps note starts to a rhythmic grid, such as sixteenth notes, so loose or slightly early and late notes line up cleanly. The catch is to quantize with care. Over-quantizing a human performance flattens its feel, turning a part that breathed into one that sounds mechanical, so match the grid to the music and leave expressive playing some room. If a part actually moves in sixteenths, a sixteenth-note grid is right; if it is mostly quarter notes, a coarse grid avoids forcing detail that is not there. A good habit is to quantize the rhythmically strict sections (a steady comping pattern, a programmed-feeling bass line) more firmly, and go lighter on the parts where the timing is part of the performance. The detection of how busy and polyphonic a part is feeds into this, which is why monophonic vs polyphonic transcription is worth understanding before you start nudging notes around.

Fix octaves, stray notes, and durations

With the timing tidied, scan for the three errors that show up most. First, octave errors: notes detected an octave too high or low. These are frequent on bass notes and come from overtones, where a strong harmonic gets picked up instead of the fundamental, so check the low end first and move any misplaced note up or down one octave. Second, short spurious or ghost notes: brief blips that are artifacts of the detection rather than real notes. Delete them so they do not clutter the part or trigger an unwanted sound. Third, note durations that overlap or run too long, where one note bleeds into the next. Trim the lengths so each note releases where it should and the part stops sounding muddy. Working through these in the piano-roll editor is fast once you know what to look for, and it is the same skill covered in fixing AI transcription errors.

Split parts, set tempo, and export

For a multi-instrument capture, split the parts onto separate tracks so each instrument is editable on its own, rather than leaving everything stacked on one track where edits collide. Then set the correct tempo and time signature so the bars line up with the music. Get these right and the grid you quantized to actually matches the song; get them wrong and every later edit fights the bar lines. Do the work in a DAW's piano-roll editor if the goal is production, or in a notation program if the goal is sheet music. Because Songscription returns an editable result, some of this cleanup is already handled before you export, so you are refining a part rather than rebuilding one from scratch. When you are ready to move the cleaned notation into a score, converting MIDI to sheet music picks up from here.

Start with a cleaner first pass

Upload a recording and get editable notes back, so the timing and parts are already organized before you start tidying. The free tier is enough to convert your first song.

Frequently Asked Questions

Why does converted MIDI need cleaning up?

Converting audio to MIDI produces a first draft of the notes, not a finished part, because the notes are detected from sound and the detection is imperfect. Expect a few notes caught an octave off, short ghost notes that are artifacts rather than real notes, and durations that overlap or run too long. Cleanup turns that first pass into a part you can play from or hand to a player. Because Songscription returns an editable result, some of this work is already handled before you export.

How do I quantize MIDI without losing the feel?

Quantizing snaps note starts to a rhythmic grid (such as sixteenth notes) to tidy loose timing, but over-quantizing a human performance flattens its feel. Match the grid to the music: use a sixteenth-note grid only if the part actually moves in sixteenths, and leave expressive playing some room rather than locking every note hard to the line. Quantize the rhythmically strict parts and go lighter on the parts that breathe.

How do I fix notes that are an octave off?

Octave errors are common on bass notes and come from overtones in the recording, where a high harmonic gets detected instead of the fundamental. Select the wrong note in the piano-roll editor and move it down (or up) one octave so it sits where it belongs. Scan the bass line first, since that is where octave jumps hide most often, then check any spot where the melody suddenly leaps a full octave for no musical reason.

Should I clean up MIDI in a DAW or in notation software?

It depends on the goal. If you want the MIDI for production (a software instrument, a remix, a backing track), do the work in a DAW's piano-roll editor. If you want readable sheet music, do it in a notation program so the spelling, beaming, and layout come out right. Either way the cleanup tasks are the same: quantize the timing, fix octaves and stray notes, set the tempo and time signature, and split the parts so each is editable on its own.

The fastest way to start is on a song you already have a recording of. Upload it with Songscription and get editable notes you can clean up.

About the author

Andrew Carlins

Written by

Andrew Carlins

Co-Founder & CEO, Songscription

Andrew co-founded Songscription at Stanford with a few fellow musicians who were tired of not finding the notes to the songs they wanted to play. He grew up playing piano and baritone saxophone and performing in musical theater, and though he hasn't performed in years, he likes to think he's still pretty sharp. He writes about getting a song off the recording and onto the page.

More about the team

Keep exploring more posts on the same topics.