The idea is not that simple as an audio visualization. Actually it doesn’t even presuppose visualization at all : )
So what he does with visuals is: he analyze pictures with neural network algorithms, network finds some patterns and learn “characteristics” of pictures, and after it’s possible to morph between different pictures of adjust their characteristics.
For example he analyze woman faces of different ages and man faces with beards and after he can generate a new woman face, tweak her age as needed and bring a “man beard” elements to it or it morph into man, adjust “age”, “race” or any other characteristics, and it looks really natural, hi end and scaring.
Or he analyzes Van Gogh paintings and Leonardo paintings, neural network finds patterns and after can generate random art which is 40% Leonardo and 60% Van Gogh : )
It’s all super slow and rendering takes dayz : )
So the idea is to use his algorithms and apply them to audio. For example, to make a model of human voice and morph it into elephant talk, or analyze Autechre and Amanda Lear music and generate completely new music which will be 30% Lear 70% Autechre. And it’s not basic mixing, it’s generating new stuff based on neural network analyze.
It’s a bit more complicated, but I’ve tried to explain it as easy as possible.
So, the idea is to interpret audio as visual information. It’s possible to do so with some spectral analyze. I’m looking for some ways and most of them are super scientific and there’s no ready tool for that…