Audio data as video data representation

@Tajnost, are you still interested in this topic ?

I found an interesting video where a group of researchers made a neural network system, using visuals of audio spectrums, and trained it to predict the settings on the u-he Diva soft synth, to create an equivalent sound. This video goes through the details of this system, it’s a little technical, but mostly understandable, if you have some background with neural network systems.

The video runs nearly 20 minutes.

This would be a neat product, for instance if you could grab a sample of a sound you’d like to make, and this system would generate a patch for your synth. It would be great if it was capable of creating patches for a group of your favorite synths, and all be that easy to do. It also seems to me something really useful for manufacturers to generate sound libraries.

ADDED : Seems to me this sort of thing would potentially be a way around some sorts of copyright restrictions.

It’s a different way to think about a “sampler”.

4 Likes