I found another solution as well. It seems to work and is less complicated:
- To add multiple text-to-speech clips to one slide, insert a temporary blank slide and add the text-to-speech there. Convert it to audio, then assign that audio from the Library to a non-displayable text caption on the desired slide. Repeat as needed, reusing the same blank slide for each new piece of audio.