have you considered using a uC with two USB ports, instead of gluing together two uCs? (though i certainly appreciate the expediency of the latter)
there are several that have multiple OTG-compatible ports, so they can act as 2 devices, 2 hosts, or any combination. e.g. stm32f4x7. (in that case only one port can be high-speed, which is typical, but full-speed is 12 Mbps, which is still far more than MIDI needs.)
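for a rough sense of the headroom, here's the arithmetic as a small sketch. it assumes classic DIN MIDI at 31250 baud with 10 bits on the wire per byte (start + 8 data + stop), and compares raw signaling rates only, ignoring USB protocol overhead. the function names are made up for illustration.

```c
#include <assert.h>
#include <stdint.h>

/* Assumed figures: DIN MIDI at 31250 baud, 10 wire bits per byte
   (start + 8 data + stop); USB full-speed signaling at 12 Mbit/s.
   These are raw line rates, not achievable USB throughput. */
#define MIDI_BAUD            31250u
#define MIDI_BITS_PER_BYTE   10u
#define USB_FS_BITS_PER_SEC  12000000u

/* Bytes per second that one DIN MIDI link can carry. */
uint32_t midi_bytes_per_sec(void)
{
    assert(MIDI_BITS_PER_BYTE > 0u);
    return MIDI_BAUD / MIDI_BITS_PER_BYTE;
}

/* How many times faster the full-speed bit rate is than the MIDI bit rate. */
uint32_t usb_fs_to_midi_ratio(void)
{
    assert(MIDI_BAUD > 0u);
    return USB_FS_BITS_PER_SEC / MIDI_BAUD;
}
```

so even before high-speed comes into it, the raw full-speed rate is a few hundred times what a single MIDI stream uses.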
it also seems a pity to use so much horsepower as a simple pipe. if you are interested, i've done some firmware prototyping on a device which can be reconfigured at runtime for <dev/host>/<dev/host> and for protocol transform selection (e.g. grid -> MIDI), with the transform logic also being scriptable. but there's no way i would get this finished and out the door as hardware without collaboration.
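to make the idea concrete, here's a minimal sketch of what "roles and transform selectable at runtime" could look like. this is not the actual prototype firmware: every name is hypothetical, the 3-byte grid key packet {x, y, state} is an invented layout rather than any real grid protocol, and the note mapping (8-wide grid, row-major, starting at note 36) is just an assumption for the example.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical types: each of the two ports is either a USB device or a host. */
typedef enum { ROLE_DEVICE, ROLE_HOST } port_role_t;

/* A transform converts one incoming packet into one outgoing packet.
   Returns the number of bytes written to out, or 0 to drop the packet. */
typedef size_t (*transform_fn)(const uint8_t *in, size_t in_len,
                               uint8_t *out, size_t out_cap);

typedef struct {
    port_role_t  role[2];    /* role of port A and port B */
    transform_fn transform;  /* currently selected protocol transform */
} bridge_config_t;

/* Simple pipe: copy the packet through unchanged. */
size_t xf_passthrough(const uint8_t *in, size_t in_len,
                      uint8_t *out, size_t out_cap)
{
    if (in_len > out_cap) return 0;
    memcpy(out, in, in_len);
    return in_len;
}

/* grid -> MIDI, assuming an invented {x, y, state} key packet and an
   8-wide, row-major mapping onto note numbers starting at 36. */
size_t xf_grid_to_midi(const uint8_t *in, size_t in_len,
                       uint8_t *out, size_t out_cap)
{
    if (in_len != 3 || out_cap < 3) return 0;
    unsigned x = in[0], y = in[1], down = in[2];
    unsigned note = 36u + y * 8u + x;
    if (x >= 8u || note > 127u) return 0;   /* outside the assumed grid or MIDI range */
    out[0] = down ? 0x90 : 0x80;            /* note on / note off, channel 1 */
    out[1] = (uint8_t)note;
    out[2] = down ? 100 : 0;                /* fixed velocity */
    return 3;
}

/* Swap roles and transform at runtime. */
void bridge_reconfigure(bridge_config_t *cfg, port_role_t a, port_role_t b,
                        transform_fn xf)
{
    assert(cfg != NULL);
    cfg->role[0] = a;
    cfg->role[1] = b;
    cfg->transform = xf;
}

/* Run one packet through whichever transform is currently selected. */
size_t bridge_forward(const bridge_config_t *cfg, const uint8_t *in,
                      size_t in_len, uint8_t *out, size_t out_cap)
{
    assert(cfg != NULL);
    return cfg->transform ? cfg->transform(in, in_len, out, out_cap) : 0;
}
```

in a scriptable version, the function pointer would presumably be replaced by a call into an embedded interpreter, but the shape of the config stays the same.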