litecord/docs/lvsp.md

# Litecord Voice Server Protocol (LVSP)

LVSP is a protocol for Litecord to communicate with an external component
dedicated for voice data. The voice server is responsible for the
Voice Websocket Discord and Voice UDP connections.

LVSP runs over a *long-lived* websocket with TLS. The encoding is JSON.

## OP code table

"client" is litecord. "server" is the voice server.

| opcode | name | sent by |
| --: | :-- | :-- |
| 0 | HELLO | server |
| 1 | IDENTIFY | client |
| 2 | RESUME | client |
| 3 | READY | server |
| 4 | HEARTBEAT | client |
| 5 | HEARTBEAT\_ACK | server |
| 6 | INFO | client / server |

## Message structure

Message data is defined by each opcode.

**Note:** the `snowflake` type follows the same rules as the Discord Gateway's
snowflake type: A string encoding a Discord Snowflake.

| field | type | description |
| --: | :-- | :-- |
| op | integer, opcode | operator code |
| d | map[string, any] | message data |
| s | Optional[int] | sequence number |

 - The `s` field is explained in the `RESUME` message.

## High level overview

 - connect, receive HELLO
 - send IDENTIFY or RESUME
 - if RESUME, process incoming messages as they were post-ready
 - receive READY
 - start HEARTBEAT'ing
 - send INFO / VSU\_REQUEST messages as needed

## Error codes

| code | meaning |
| --: | :-- |
| 4000 | general error. Reconnect |
| 4001 | authentication failure |
| 4002 | decode error, given message failed to decode as json |

## HELLO message

Sent by the server when a connection is established.

| field | type | description |
| --: | :-- | :-- |
| heartbeat\_interval | integer | amount of milliseconds to heartbeat with |
| nonce | string | random 10-character string used in authentication |

## IDENTIFY message

Sent by the client to identify itself.

| field | type | description |
| --: | :-- | :-- |
| token | string | `HMAC(SHA256, key=[secret shared between server and client]), message=[nonce from HELLO]` |

## RESUME message

Sent by the client to resume itself from a failed websocket connection.

The server will resend its data, then send a READY message.

| field | type | description |
| --: | :-- | :-- |
| token | string | same value from IDENTIFY.token |
| seq | integer | last sequence number to resume from |

### Sequence numbers

Sequence numbers are used to resume a failed connection back and make the
voice server replay its missing events to the client.

They are **positive** integers, **starting from 0.** There is no default
upper limit. A "long int" type in languages will probably be enough for most
use cases.

Replayable messages MUST have sequence numbers embedded into the message
itself with a `s` field. The field lives at the root of the message, alongside
`op` and `d`.

## READY message

 - The `health` field is described with more detail in the `HEARTBEAT_ACK`
    message.

| field | type | description |
| --: | :-- | :-- |
| `health` | Health | server health |

## HEARTBEAT message

Sent by the client as a keepalive / health monitoring method.

The server MUST reply with a HEARTBEAT\_ACK message back in a reasonable
time period.

| field | type | description |
| --: | :-- | :-- |
| s | integer | sequence number |

## HEARTBEAT\_ACK message

Sent by the server in reply to a HEARTBEAT message coming from the client.

The `health` field is a measure of the servers's overall health. It is a
float going from 0 to 1, where 0 is the worst health possible, and 1 is the
best health possible.

Servers SHOULD use the same algorithm to determine health, it CAN be based off:
 - Machine resource usage (RAM, CPU, etc), however they're too general and can
    be unreliable.
 - Total users connected.
 - Total bandwidth used in some X amount of time.

Among others.

| field | type | description |
| --: | :-- | :-- |
| s | integer | sequence number |
| health | float | server health |

## INFO message

Sent by either client or a server to send information between eachother.
The INFO message is extensible in which many request / response scenarios
are laid on.

| field | type | description |
| --: | :-- | :-- |
| type | InfoType | info type |
| data | Any | info data, varies depending on InfoType |

### InfoType Enum

| value | name | description |
| --: | :-- | :-- |
| 0 | CHANNEL\_REQ | channel assignment request |
| 1 | CHANNEL\_ASSIGN | channel assignment reply |
| 2 | CHANNEL\_UPDATE | channel update |
| 3 | CHANNEL\_DESTROY | channel destroy |
| 4 | VST\_CREATE | voice state create request |
| 5 | VST\_UPDATE | voice state update |
| 6 | VST\_LEAVE | voice state leave |

**TODO:** finish all infos

### CHANNEL\_REQ

Request a channel to be created inside the voice server.

The Server MUST reply back with a CHANNEL\_ASSIGN when resources are
allocated for the channel.

| field | type | description |
| --: | :-- | :-- |
| channel\_id | snowflake | channel id |
| guild\_id | Optional[snowflake] | guild id, not provided if dm / group dm |
| channel\_properties | ChannelProperties | channel properties |

#### ChannelProperties

| field | type | description |
| --: | :-- | :-- |
| bitrate | integer | channel bitrate |

### CHANNEL\_ASSIGN

Sent by the Server to signal the successful creation of a voice channel.

| field | type | description |
| --: | :-- | :-- |
| channel\_id | snowflake | channel id |
| guild\_id | Optional[snowflake] | guild id, not provided if dm / group dm |

### CHANNEL\_UPDATE

Sent by the client to signal an update to the properties of a channel,
such as its bitrate.

Same data as CHANNEL\_REQ.

### CHANNEL\_DESTROY

Sent by the client to signal the destruction of a voice channel. Be it
a channel being deleted, or all members in it leaving.

Same data as CHANNEL\_ASSIGN.

## Common logic scenarios

### User joins an unitialized voice channel

Since the channel is unitialized, both logic on initialization AND
user join is here.

 - Client will send a CHANNEL\_REQ.
 - Client MAY send a VST\_CREATE right after as well.
 - The Server MUST process CHANNEL\_REQ first, so the Server can keep
    a lock on channel operations while it is initialized.
 - Reply with CHANNEL\_ASSIGN once initialization is done.
 - Process VST\_CREATE **TODO**

### Updating a voice channel

 - Client sends CHANNEL\_UPDATE.
 - Server DOES NOT reply.

### Destroying a voice channel

 - Client sends CHANNEL\_DESTROY.
 - Server MUST disconnect any users currently connected with its
    voice websocket.

### User joining an (initialized) voice channel

 - Client sends VST\_CREATE
 - **TODO**

### User moves a channel

**TODO**

### User leaves a channel

**TODO**