
Local Network Traversal - Multicast Discovery #1

@joshuakarp

Description


Created by @CMCDragonkai

Specification

There are two types of data flow in the MDNS system: polling (pull) and announcements/responses (push). When a node joins the MDNS group, its records are pushed to all other nodes. However, for the joined node to discover other nodes, it needs to conduct polling queries that other nodes respond to.

Sending Queries

The mDNS spec states that query packets can carry additional records, but we won't do this as it isn't necessary.
Our queries won't carry any records other than their questions, much like a standard DNS query (although an mDNS query packet can contain multiple questions).

In the case that a responder is bound to two interfaces that are connected to the same network (such as a laptop with both WiFi and Ethernet connected), a query asking for the IP address of one of the responder's hostnames will receive multiple responses with different IP addresses.

This behavior is documented in RFC 6762, Section 14.
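
For illustration, a query under this scheme carries only question records. This is a minimal sketch; the packet shape and the service/host names are assumptions based on the parser section later in this spec, not the final structure.

// A minimal sketch of a question-only query packet; field names are illustrative.
const queryPacket = {
  id: 0,
  flags: { type: 'query' },
  questions: [
    // An mDNS query may pack multiple questions into one packet.
    { name: '_polykey._udp.local', type: 'PTR', class: 'IN', unicast: false },
    { name: 'somehost.local', type: 'A', class: 'IN', unicast: false },
  ],
  answers: [],
  additionals: [],
  authorities: [],
};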

Control Flow

Unlike other mDNS libraries, we're going to use an AsyncIterator so that the consumer has more control over the querying. An example of this would be:

async function* query(
  service: Service,
  minimumDelay: number = 1,
  maximumDelay: number = 3600,
) {
  let delay = minimumDelay;
  while (true) {
    // Send a query packet for `service`, then tell the consumer how long to
    // wait before the next one, backing off exponentially up to maximumDelay.
    await this.sendPacket(/* ... */);
    yield delay;
    delay = Math.min(delay * 2, maximumDelay);
  }
}
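
For example, a consumer could drive the schedule from the yielded delays. This usage is a sketch; it assumes query is exposed as a method on MDNS, and the setTimeout-based sleep is just a stand-in.

// Hypothetical consumer-driven polling: the caller decides how to honour each
// yielded delay before resuming the generator (which sends the next query).
async function pollService(mdns: MDNS, service: Service) {
  for await (const delay of mdns.query(service)) {
    await new Promise<void>((resolve) => setTimeout(resolve, delay * 1000));
  }
}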

It has been decided that the query system's runtime will be contained within MDNS rather than being consumer-driven. This means that scheduled background queries will have to be managed by a TaskManager (similar to Polykey).
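
A minimal sketch of what that internal scheduling could look like, using setTimeout as a stand-in for a TaskManager-style scheduler; the ServiceQuery shape and the sendQuery callback are assumptions.

// Hypothetical internal scheduling of background queries with exponential
// backoff; a real implementation would delegate to a TaskManager rather than
// raw setTimeout.
type ServiceQuery = { type: string; protocol: 'udp' | 'tcp' }; // assumed shape

function scheduleBackgroundQuery(
  sendQuery: (service: ServiceQuery) => Promise<void>,
  service: ServiceQuery,
  delaySeconds: number = 1,
  maximumDelaySeconds: number = 3600,
): void {
  setTimeout(async () => {
    await sendQuery(service);
    // Re-schedule with doubled delay, capped at the maximum.
    scheduleBackgroundQuery(
      sendQuery,
      service,
      Math.min(delaySeconds * 2, maximumDelaySeconds),
      maximumDelaySeconds,
    );
  }, delaySeconds * 1000);
}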

Data Flow

[Diagram: data flow for sending queries]

Receiving Announcements/Responses (Push)

Data Flow

Because queries are basically fire-and-forget, the main work comes in the form of receiving query responses from the multicast group. Hence, our querier needs to be able to collect records with a fan-in approach, using a muxer that is reactive:

[Diagram: reactive fan-in muxer collecting records from multiple responses]

This can also be interpreted as a series of state transitions to completely build a service.
[Diagram: state transitions for building a complete service]
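
A minimal sketch of such a muxer, assuming a service is complete once it has PTR, SRV, TXT, and at least one address (A/AAAA) record; the record and service shapes here are assumptions.

// Hypothetical fan-in muxer: partial services are built up as records arrive
// from different responses, and a service is only returned once every record
// type it needs has been seen.
type PartialService = {
  name?: string;             // from PTR
  port?: number;             // from SRV
  txt?: Map<string, string>; // from TXT
  addresses: string[];       // from A/AAAA
};

class ServiceMuxer {
  protected partials: Map<string, PartialService> = new Map();

  // Returns the completed service if this record finished it, otherwise null.
  public addRecord(
    serviceKey: string,
    record: { type: string; data: any },
  ): PartialService | null {
    const partial: PartialService =
      this.partials.get(serviceKey) ?? { addresses: [] };
    switch (record.type) {
      case 'PTR':
        partial.name = record.data;
        break;
      case 'SRV':
        partial.port = record.data.port;
        break;
      case 'TXT':
        partial.txt = record.data;
        break;
      case 'A':
      case 'AAAA':
        partial.addresses.push(record.data);
        break;
    }
    this.partials.set(serviceKey, partial);
    if (
      partial.name != null &&
      partial.port != null &&
      partial.txt != null &&
      partial.addresses.length > 0
    ) {
      this.partials.delete(serviceKey);
      return partial;
    }
    return null;
  }
}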

We also need to consider that, if the threshold for a muxer to complete is not reached, additional queries are sent off in order to reach the finished state.

The decision tree for such would be as follows:
[Diagram: decision tree for re-querying incomplete services]
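
As a hedged sketch of that decision, the record types still missing from a partial service determine which follow-up questions get sent; the PartialService shape is the same assumption used above.

// Illustrative re-query decision: determine which record types are still
// missing so that follow-up questions can be sent for them.
function missingRecordTypes(partial: {
  name?: string;
  port?: number;
  txt?: Map<string, string>;
  addresses: string[];
}): string[] {
  const missing: string[] = [];
  if (partial.name == null) missing.push('PTR');
  if (partial.port == null) missing.push('SRV');
  if (partial.txt == null) missing.push('TXT');
  if (partial.addresses.length === 0) missing.push('A', 'AAAA');
  return missing;
}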

Control Flow

Instances of MDNS will extend EventTarget in order to emit events for service discovery/removal/etc.

class MDNS extends EventTarget {
}
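
For instance, discovery and removal could be surfaced as custom event subclasses; the event names and payloads below are assumptions, not the final API.

// Hypothetical event classes dispatched by MDNS; names and detail shapes are
// illustrative only.
class MDNSServiceEvent extends Event {
  constructor(public detail: { name: string; addresses: string[] }) {
    super('service');
  }
}

class MDNSServiceRemovedEvent extends Event {
  constructor(public detail: { name: string }) {
    super('service-removed');
  }
}

// Consumers subscribe like any other EventTarget:
// mdns.addEventListener('service', (evt) => { /* ... */ });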

The cache will be managed using a timer that is set to the soonest record TTL, rather than a timer for each record. The cache will also need to be an LRU in order to make sure that malicious responders cannot overwhelm it.
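
A minimal sketch of that single-timer expiry scheme; LRU eviction and the multi-keyed lookups from the Tasks list are omitted here, and the API is an assumption.

// Hypothetical record cache that keeps one timer armed for the soonest expiry
// instead of one timer per record.
class RecordCache {
  protected expiries: Map<string, number> = new Map(); // key -> expiry timestamp (ms)
  protected timer?: ReturnType<typeof setTimeout>;

  public set(key: string, ttlSeconds: number): void {
    this.expiries.set(key, Date.now() + ttlSeconds * 1000);
    this.rearm();
  }

  protected rearm(): void {
    if (this.timer != null) clearTimeout(this.timer);
    if (this.expiries.size === 0) return;
    // Arm a single timer for the soonest record TTL.
    const soonest = Math.min(...this.expiries.values());
    this.timer = setTimeout(() => this.expire(), Math.max(soonest - Date.now(), 0));
  }

  protected expire(): void {
    const now = Date.now();
    for (const [key, expiry] of this.expiries) {
      if (expiry <= now) this.expiries.delete(key); // an expiry event would be emitted here
    }
    this.rearm();
  }
}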

Sending Announcements

Control Flow

This will need to be experimented with a little. Currently the decisions are:

  • registerService cannot be called before start is called.
  • create should take in services in place of this (i.e. services can be passed at creation instead of registering them before start).
  • stop should deregister all services.
  • destroy should remove everything from the instance.
class MDNS extends EventTarget {
  create()      // can take in services to register immediately
  start()
  stop()        // deregisters all services
  register()    // cannot be called before start()
  deregister()
  destroy()     // removes everything from the instance
}
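
Under those decisions, usage would look roughly like this; the method signatures, and whether create is static, are assumptions.

// Hypothetical lifecycle usage following the decisions above.
const service = { name: 'polykey', port: 1314 }; // assumed service shape
const mdns = await MDNS.create();   // could also take services to register immediately
await mdns.start();
await mdns.register(service);       // only valid after start()
await mdns.stop();                  // deregisters all services
await mdns.destroy();               // removes everything from the instance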

Types

Messages can be queries, announcements, or responses.
This can be expressed as:

type MessageType = "query" | "announcement" | "response";
type Message =
  | ["query", QuestionRecord]
  | ["announcement" | "response", ResourceRecord];
const message: Message = ["query", { /* ... */ }];

Parser / Generator

Parsing and generation together are not isomorphic, as different parsed Uint8Array packets can result in the same packet structure.

Every worker parser function will return the value wrapped in an object of this type:

type Parsed<T> = {
  data: T;
  remainder: Uint8Array;
};

The point of this is that whatever hasn't been parsed gets returned in .remainder, so we don't have to keep track of the offset manually. This also means that each worker function needs to take in a second Uint8Array representing the original data structure, so that parsers which follow name-compression pointers can dereference into the full message.
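
For example, a worker like parseId might look like the following sketch; it reuses the Parsed type above, and the error message format is illustrative.

// Sketch of a worker parser: consumes the 16-bit packet ID and returns the
// unconsumed remainder so that callers never track offsets manually. The
// `original` parameter is unused here, but name parsers need it to follow
// compression pointers into the full message.
function parseId(input: Uint8Array, original: Uint8Array = input): Parsed<number> {
  if (input.byteLength < 2) {
    throw new Error('ErrorDNSParse: id parse failed, not enough bytes');
  }
  const view = new DataView(input.buffer, input.byteOffset, input.byteLength);
  return {
    data: view.getUint16(0, false), // DNS header fields are big-endian
    remainder: input.subarray(2),
  };
}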

  1. DNS Packet Parser Generator Utilities
  • Parser - parsePacket(Uint8Array): Packet (sketched after this outline)
    • Headers - parseHeader(Uint8Array): {id: ..., flags: PacketFlags, counts: {...}}
    • Id - parseId(Uint8Array): number
    • Flags - parseFlags(Uint8Array): PacketFlags
    • Counts - parseCount(Uint8Array): number
    • Question Records - parseQuestionRecords(Uint8Array): {...}
      • parseQuestionRecord(Uint8Array): {...}
    • Resource Records - parseResourceRecords(Uint8Array): {...}
      • parseResourceRecord(Uint8Array): {...}
      • parseResourceRecordName(Uint8Array): string
      • parseResourceRecordType(Uint8Array): A/CNAME
      • parseResourceRecordClass(Uint8Array): IN
      • parseResourceRecordLength(Uint8Array): number
      • parseResourceRecordData(Uint8Array): {...}
        • parseARecordData(Uint8Array): {...}
        • parseAAAARecordData(Uint8Array): {...}
        • parseCNAMERecordData(Uint8Array): {...}
        • parseSRVRecordData(Uint8Array): {...}
        • parseTXTRecordData(Uint8Array): Map<string, string>
        • parseOPTRecordData(Uint8Array): {...}
        • parseNSECRecordData(Uint8Array): {...}
    • String Pointer Cycle Detection
      • Every time a string is parsed, we take a reference to its beginning and end so that pointers cannot point to the start of a string in a way that would cause an infinite loop. A separate index table tracks the path of dereferences to make sure cycles don't happen.
    • Errors at each parsing function instead of letting the DataView fail
      • ErrorDNSParse - generic error whose message carries information for the different failure cases, i.e. id parse failed at ...
    • Record Keys - parseResourceRecordKey and parseQuestionRecordKey and parseRecordKey - parseLabels
  • Generator - generatePacket(Packet): Uint8Array
    • Header - generateHeader(id, flags, counts, ...)
      • Id
      • Flags - generateFlags({ ... }): Uint8Array
      • Counts - generateCount(number): Uint8Array
    • Question Records - generateQuestionRecords(): Uint8Array - flatMap(generateQuestion)
      • generateQuestionRecord(): Uint8Array
    • Resource Records (KV) - generateResourceRecords()
      • generateRecord(): Uint8Array
      • generateRecordName - "abc.com" - ...RecordKey
      • generateRecordType - A/CNAME
      • generateRecordClass - IN
      • generateRecordLength
      • generateRecordData
        • generateARecordData(string): Uint8Array
        • generateAAAARecordData(string): Uint8Array
        • generateCNAMERecordData(string): Uint8Array
        • generateSRVRecordData(SRVRecordValue): Uint8Array
        • generateTXTRecordData(Map<string, string>): Uint8Array
        • generateOPTRecordData(Uint8Array): Uint8Array
        • generateNSECRecordData(): Uint8Array
  • Integrated into MDNS
  2. MDNS
  • Querying
    • MDNS.query()
    • Query services of a type
    • MDNS.registerService()
    • MDNS.unregisterService()
  • Responding
    • Listening to queries
    • Responding to all queries with all records
    • Responding to unicast queries
    • Handling the truncated (TC) bit
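
To show how the parser workers compose (as referenced in the Parser item above), here is an illustrative parsePacket skeleton; the worker functions are only declared with assumed signatures, and the returned shape is partial.

// Illustrative composition of worker parsers; only the ID and flags stages are
// shown. Each worker consumes from the previous worker's remainder.
declare function parseId(input: Uint8Array, original: Uint8Array): Parsed<number>;
declare function parseFlags(input: Uint8Array, original: Uint8Array): Parsed<unknown>;

function parsePacket(input: Uint8Array) {
  const original = input;
  const { data: id, remainder: afterId } = parseId(input, original);
  const { data: flags, remainder: afterFlags } = parseFlags(afterId, original);
  // Counts, question records, and resource records would continue the same
  // pattern from `afterFlags`.
  return { id, flags };
}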

Testing

We can use two MDNS instances, bound to separate ports, interacting with each other to test both querying and responding.
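
A rough shape for such a test, assuming a Jest-style runner and the lifecycle methods sketched above; the port options, event names, and service shape are assumptions.

// Hypothetical two-instance test: one instance registers a service, the other
// queries for it and waits for a discovery event.
test('query and respond between two MDNS instances', async () => {
  const testService = { name: 'test', type: 'polykey', protocol: 'udp', port: 1314 }; // assumed shape
  const mdns1 = await MDNS.create();
  const mdns2 = await MDNS.create();
  await mdns1.start({ port: 64023 }); // separate ports so both can bind locally
  await mdns2.start({ port: 64024 });
  await mdns1.register(testService);
  const found = new Promise((resolve) => {
    mdns2.addEventListener('service', resolve, { once: true });
  });
  mdns2.query(testService); // trigger polling; exact query API is an assumption
  await found;
  await mdns1.stop();
  await mdns2.stop();
});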

Additional Context

The following discussion from 'Refactoring Network Module' MR should be addressed:

Tasks

  • Parser - 5.5 days
    • Packet Header - 0.5 days
    • Packet Flags - 0.5 days
    • Questions - 0.5 days
    • Resource Records - 4 days
  • Generator - 5.5 days
    • Packet Header - 0.5 days
    • Packet Flags - 0.5 days
    • Questions - 0.5 days
    • Resource Records - 4 days
  • MDNSCache - 2.5 days
    • Multi-Keyed Maps for ResourceRecord Querying - 0.5 days
    • TTL Expiry Record Invalidation - 0.5 days
    • Reverse Host to Record Mapping - 0.5 days
    • LRU to Prevent DoS - 0.5 days
    • Support use as local resource record in-memory database - 0.5 days
  • TaskManager - ? days
    • Migrate to in-memory - ? days
  • MDNS - 11.5 days
    • UDP Networking Logic - 2 days
      • Socket Binding to Multiple Interfaces - 1 day
      • Error Handling - 1 day
    • Querier - 4 days
      • Service Querying - 2.5 days
        • Record Aggregation for Muxing Together Services - 0.5 days
        • Querying For a Service's Missing Records - 0.5 days
        • Emitting Expired Services - 0.5 days
      • Unicast - 1.5 days
        • Checking for Unicast Availability - 1 day
        • Sending queries with unicast enabled - 0.5 days
    • Responder - 5.5 days
      • Service Registration - 0.5 days
      • Filter Messages Received from Multicast Interface Loopback - 0.5 days
      • Unicast
        • Responding to Unicast Queries - 0.5 days

Metadata

Labels

  • epic (Big issue with multiple subissues)
  • r&d:polykey:core activity 4 (End to End Networking behind Consumer NAT Devices)
