I don’t really see how FM can be “an alternative to field recording”. What we refer to when talking about “FM synthesis” is actually a combination of phase modulation and additive synthesis. The “operators” were traditionally a pitched sine with an envelope, these days this concept is a little more flexible (any wavetable with any arbitrarily complex envelope). And the “algorithms” that combine them usually end up carrying the sound of one of the operators or an additively summed output of a few.
Theoretically you can map any sound to a finite combination of sines. In practice this isn’t feasible to achieve because it requires non-linear envelope modulation and in-flight modulation of FM algorithms. That’s not only prohibitive computationally but also super unnatural in terms of human manipulation of such sounds. Plus, as you observed yourself, such sounds would not track the keyboard in expected ways. Instead, traditional FM synthesis opts for tracking pitches of all operators in a relative way, allowing you to specify ratios between them.
Maybe a different synthesis approach would work better?
The reverse FFT approach that @chalkwalk is describing above is one interesting approach here that can be used for a subset of sounds that have linear harmonic responses. For other purposes, you have other synthesis methods like physical string modelling (used very successfully for example by Pianoteq to synthesize acoustic piano sounds; or by Sculpture in Logic Pro X), formant synthesis (used for speech synthesis), or hidden Markov chains synthesis (also speech, and granular).
In fact, there’s too many modern techniques to enumerate in a single comment on the Internet. It seems to me that the most interesting synthesis method for what could plausibly be treated as “an alternative to field recording” would be digital waveguide synthesis. That was commercially used in many general-purpose synthesizers and PC sound cards in the 1990s and early 2000s. AFAICT it’s the most versatile synthesis method.
Coming back to Yamaha-style “FM synthesis”, it’s become its own thing where the goal isn’t to fool the listener that they’re listening to an acoustic instrument.
It’s a certain flavor, a certain color in a musician’s palette. For this purpose, even 2 operators already give interesting results. Minilogue XD’s multi-engine (also available on the NTS-1) supports this architecture for example. 4 operators are very popular, as used by Yamaha Reface DX, Elektron Digitone, and a large number of FM synthesizers from the 1980s and 1990s.
Beyond 4 operators the number of possible connections between them is raising exponentially, especially if you allow for arbitrary feedback loops. But unless you also enable very complex non-linear envelope modulation and variable frequency ratios, increasing the number of operators is sort of a diminishing return, considering the computational power needed.
This is why commercial FM synthesizers pretty much all stop at 6 operators, with just a few exceptions going over that. Now, assuming six operators, the theoretical number of single-connection combinations is 64. But allowing multiplexing and arbitrary feedback it quickly gets unwieldy (1,530,843). Only a subset of those is musically interesting so the DX7 shipped with 32, and the KORG opsix defaults to 40. Even the 8-operator Yamaha FVX-1 only gave you 63.
This means the number of possible timbres you can achieve is limited. To be fair, that limit is just the “color” or “flavor” I talked about above. You can still design plenty of unique sounds with that. And with a touch of chorus, flanging, phasing, delays, or reverbs (especially convolution reverbs) you can actually get enough variety to spend years exploring. It’s what Brian Eno did with his DX7 back in the day. Just keep in mind it’s not a silver bullet that can shape any sound.
My recommendations
A few recommended implementations that are interesting for sound design:
-
Native Instruments FM8 - not only is it compatible with classic DX7, it’s also very usable in terms of UI and goes way beyond the “glass e-piano” brand of FM synthesis;
-
KORG opsix - I haven’t used one myself but looking at the existing review videos and the manual this looks like a sound designer’s dream;
-
Waves Flow Motion - simpler than the first two (“only” 4 oscillators) but unique because the envelopes are decoupled from the oscillators, and modulation is brought to the front and center of the UI. Along with its p-lock style sequencing, you can achieve very interesting sound designs with it.
Alternatively, if you’re looking for more flexibility (in turn losing a lot of immediacy), you can model FM synthesis in Reaktor, with a good introductory example shown in this Benn Jordan video. Finally, you could further complicate your life with Max/MSP, SuperCollider, or straight up C++ (or Python as I did
).