Two Ways to Isolate Vocals with AI

Last updated on September 12, 2024 by , Posted to edit audio

isolate vocal from the song

Any musical composition is made up of many components. The vocals are mixed with the instrumental elements of the song such as drums, guitars and synthesizers. But sometimes the opposite operation is required when you want to isolate the vocals from the instrumental component.This could be because you want to create a mashup or karaoke version of a song. With the advent of neural networks, various implementations of isolating vocals from the accompanying music in a song with a fairly high quality have appeared.

Isolating vocals can be considered the most difficult task when separating musical sources, as there is a lot of frequency overlap in a song, and the song may contain many other instruments that may not make the vocals sound clear enough. And let's not forget harmonics, the individual timbral features of the voice, and additional effects like reverb and echo.

In this article, we'll discuss why it's important to emphasize vocals, how to do it, and some techniques for doing so.

1 What is vocal isolation needed for?

There are many applications for vocal isolation tools in many industries, not necessarily related to music.

Musicians most often need a separate vocal track to create remixes. And while famous DJs can get the necessary recording from the artist directly, as part of a joint creative project, their less fortunate colleagues extract the vocal part themselves.

Vocal lessons also require separating the voice from the accompaniment, which distorts the perception of sound. For example, on some Guns'n'roses compositions, Axl's vocals merge with the guitar part and it is difficult to find the boundary between them. This gives the student the impression that the vocal range is much wider than it actually is.

A rather unexpected application of vocal extraction has found the field of karaoke. Everyone is used to the fact that for karaoke, vocals need to be removed, creating a backing track. However, after extracting the vocals, you can run it through a speech recognition system and get ready-made lyrics synchronized with the music.

As I mentioned at the beginning, vocal extraction is used not only in music. Removing all extraneous sounds can be considered as a system of noise reduction and speech enhancement. Such algorithms are used in audio forensics to improve speech intelligibility, speaker identification and dialogue decoding.

2 Online Vocal Isolator

LALAL.AI is perhaps one of the most popular and highly rated vocal splitters on the internet. Apart from vocals, this tool can also separate different elements of a song in just a few simple clicks. This includes drums, bass, voice and noise, acoustic guitar, electric guitar, piano, strings and more. Not to mention, this app has a mobile app that you can easily download from your Android and iOS mobile phones. If you are wondering how to split vocals and instrumentals online, follow these steps.

online vocal isolator

Step 1: go to the official website of the tool using any web browser on your computer.

Step 2: click on the “Stems” drop-down menu to select the element you want to extract from the file you want to upload.

Step 3: click on the “Select Files” button to upload an audio file from your computer’s local drive.

Step 4: click on the “Process the Entire File” button to render and save the output on your computer.

Thanks to artificial intelligence and machine learning technologies, web splitters have caught up with and some have even surpassed professional software. But what about the price? The minimum 90-minute package is priced at $18 and does not include a fast processing queue. To understand whether it is expensive or not, let's look at an alternative solution.

3 Desktop Vocal Isolation tool

VocalRip AI is a cutting-edge software specifically designed to isolate vocals from songs. Thanks to advances in machine learning technology, the music source separation model has been condensed into a very compact size. Why waste time getting 5 stems when in most cases you only need two? The result is a fast and effective music and vocal rebalancing tool that works even on a low-end computer without sacrificing quality.

vocal isolation AI tool

Step 1: download and install the application. For this, 50 MB of free disk space is enough.

Step 2: load the song, check that it plays and click the "Split" button. Wait for the process to complete.

Step 3: now you can use the Vocal and Music controls to balance the sound. In our case, vocal isolation is required, so we set it to maximum and music to zero.

Step 4: click the "Save" button, select a file name and compression settings. Wait for the processing to complete.

To my amateur ear, the difference in the sound of the vocals after splitting is imperceptible. I would even say that VocalRip leaves fewer artifacts. In terms of processing time, there is also approximate parity. The online service splits faster, but requires a lot of time to transfer data and wait in line. In both cases, it took a couple of minutes to split the song. The determining factor is the price, and here VocalRip is a confident leader, providing a lifetime license for $15.

4 Final words

In this article, we looked at two approaches suitable for both the average user and advanced techies. The rapid development of neural networks gives us a window of opportunity that was not available five years ago. There are a huge number of AI models that divide songs into stems, but they all handle vocals differently. Whether you are a beginner musician, a DJ creating remixes, or just a music lover who wants to focus on the instrumental elements of a track, these two options will not be superfluous in your arsenal.


Related Articles