Video calls

The Voximplant platform enables high-quality video calls with various features.
This guide explains how it works and the differences between server video calls and other types.

Server video calls

Server video calls are common one-to-one video calls between Voximplant users or SIP addresses. They offer a wide range of server-based features, including call recording, speech processing, and integration of AI-powered robot Avatar. Billing is based on the destination and call duration.

Here are the key characteristics of server video calls:

Video calls between two participants.
Client-based features available: screen sharing, muting microphone/speakers, holding a call, transferring a call, sending tone signals.
Server-based features available: call recording (video from one user, audio from both users), streaming (audio and video from one user), speech synthesis and recognition, IVR, avatar.
Client-server encryption. Data is decrypted and processed at the server side before being encrypted and sent to the client.
You can convert a server video call into a conference. You cannot convert the call into a peer-to-peer call.
Billing according to the price list per minutes and traffic transferred.

You can read how to implement server video calls here.

Peer-to-peer video calls

Peer-to-peer video calls are available for calling Voximplant users with SDK clients. These calls offer enhanced security since the data is end-to-end encrypted, and the traffic is transmitted directly between the clients. If the clients are located on the same local network, the connection bypasses the internet and operates entirely within the local network. Notably, P2P calls are completely free when using a TURN server.

However, since the traffic is direct, server-based features such as recording, streaming, speech processing, and so on are not available.

Here are the key characteristics of peer-to-peer video calls:

Video calls between two participants.
The traffic goes directly between the clients. If the clients are within the same local network, the connection goes throughout the local network without the internet.
Client-based features available: screen sharing, muting microphone/speakers, holding a call, transferring a call.
No server-based features available, such as call recording, speech processing, etc.
End-to-end encryption.
You cannot convert a peer-to-peer video call into a server video call or a conference.
Billing applies only if using a TURN server, peer-to-peer video calls are free.

A TURN server is a media relay/proxy that enables peer-to-peer media traffic exchange, especially when one or both parties are behind Network Address Translation (NAT). When using a TURN server, peer-to-peer calls incur billing based on the price list.

For more information on implementing peer-to-peer video calls, refer to the guide.

Video conferences

Video conferences are multi-participant video calls that allow simultaneous viewing and audio exchange. They offer a wide range of server-based features, including simulcast, conference recording, where all participants’ audios and videos are saved in a single file, speech processing, joining as screen sharing only users or as viewers only, and the ability to modify separate participants’ settings, such as visibility and resolution. Video conferences are billed based on the number of participants and the duration of the conference.

Here are the key characteristics of video conferences:

Video calls between up to 50 participants.
Client-based features available: screen sharing, screen sharing as a separate call, joining as viewer only, muting microphone/speakers, sending tone signals.
Server-based features available: conference recording (video and sound from all the participants in the single file), speech synthesis and recognition, IVR, avatar, simulcast.
Client-server encryption. The data is decrypted and processed at the server side, then encrypted back and sent to the client.
You can manage the participants' visibility and resolution.
You can convert a conference into a server video call.
Billing according to the price list per minutes and the number of participants.

Simulcast is the process of broadcasting video in multiple resolutions and/or bitrates. It aims to enhance video performance for users with poor connection or hardware, while also improving video quality for users with good connection and hardware. For more information, refer to this article.

You can find detailed instructions on how to implement video conferences here.