theoretically, using the Session Initiation Protocol (SIP), it is possible to set up a multimedia connection in an IP network to transmit text files as well as audio and video using various digital modulation techniques. Of great importance here is the description of sessions with SDP (Session Description Protocol) for grouping of media streams.
My chapter "SIP Guide" (in German) may help you - see:
In principle you can use three different channels with different carrier frequencies to transmit the three streams the video stream, the audio stream and the data stream. You can divide the overall bandwidth into three subbands, one fro each signal. The first step is to determine the bit rate demand for each signal. If you define the modulation technique say 4 QAM, then you can calculate the bandwidth for each signal and consequently you can choose the center frequency as carrier frequency . Then you need to have three modulators and then you combine the output of the three modulators. In the demodulator you perform the inverse process.
There are many other solutions using different modulation techniques like the OFDM. In this case you can time domain multiplex the three signals and send them as one stream.