Abstract: The objective of this work is to extract the target speaker’s voice from a mixture of voices using visual cues. Existing works on audio-visual speech separation have demonstrated their ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results