Ah, time-stretching is FFT based in most cases, and in that sense oversampling can lead to more accurate FFT windows and thus a cleaner reconstruction in the inverse transform. However, mathematically speaking it doesn’t matter if you oversample the source or just upsample from the existing data - if the time stretching algorithm is implemented correctly it should be able to get an arbitrarily high-resolution stretch from a 48kHz source as well as from a 96kHz source - again, the mathematics of perfect reconstruction play out here too.
However, some stretching algorithms don’t oversample, and thus you’re effectively doing this work for them by operating at 96kHz. This is not a benefit of 96kHz, though, it’s just a deficiency in the implementation of the algorithm.