If headphones are connected to the device, the video return is unmuted and can be muted/unmuted from the video return thumbnail.
If no headphones are connected, the video return is muted and cannot be unmuted. This is to avoid creating an audio feedback loop where the audio from the video return is captured by the main microphone.