§ Decentralized Web Node Companion Guide

========

Status: Draft

Latest Draft: identity.foundation/decentralized-web-node/guide

Chairs

Editors:

Contributors:

Participate:


Note: This document is a WORKING DOCUMENT and IN PROGRESS.

Table of Contents

§ Overview

The Decentralized Web Node (DWN) companion guide is a non-normative guide that provides an overview of the functional requirements and design processes for implementing the DWN specification developed by the Decentralized Identity Foundation (DIF). This guide is intended to be used by developers, architects, and solution providers who are interested in building decentralized web applications and services that conform to the DWN specification.

This companion guide is not a formal specification, but rather a practical resource that provides guidance on implementing the DWN specification in a way that promotes best practices and ensures interoperability with other decentralized web nodes. The guide covers a range of topics, including functional requirements, design considerations, and best practices for building and deploying decentralized web nodes.

The contents of this companion guide include:

This companion guide is intended to supplement the formal DWN specification developed by the DIF. By providing practical guidance on implementing the specification, this guide can help developers, architects, and solution providers to build decentralized web applications and services that promote greater privacy, security, and user control over their data.

Overall, the Decentralized Web Node companion guide is a valuable resource for anyone who is interested in building decentralized web nodes that conform to the DWN specification.

STATUS: PRE-DRAFT / IN PROGRESS

§ What Are Decentralized Web Nodes?

The DWN specification is a set of standards for building and deploying decentralized web nodes, which are the building blocks of a decentralized web infrastructure.

The DWN specification defines a set of protocols and APIs that enable decentralized web nodes to communicate and work together in a secure and interoperable way. This includes standards for data sharing, node discovery, and peer-to-peer networking.

The DWN specification is designed to enable developers to build decentralized web applications and services that can operate independently of centralized infrastructure. This can help to improve the privacy, security, and resilience of the web, while also promoting greater user control over their data.

The functional advantages of DWN’s are that they are very good at scaling decentralized web apps. They enable multi-party data transactions with minimal overhead.

Overall, the DWN specification is an important part of the DIF’s work to promote the development of decentralized web technologies and standards. By providing a clear set of guidelines and best practices for building and deploying decentralized web nodes, the DWN specification can help to accelerate the adoption of a more decentralized and open web.

§ Target Audience

This target audience for this document are those that have a strong technical background and experience in building web applications, as well as a good understanding of decentralized systems and protocols. They may also have experience with blockchain technologies, distributed computing, and peer-to-peer networking.

Developers who intend to implement the DWN specification will need to have a good understanding of the protocols and APIs defined in the specification, as well as the underlying technologies that support it. This may include familiarity with decentralized storage systems like IPFS, as well as cryptographic protocols for secure data sharing and verification. This guide is intended to provide descriptive and functional color around some of the more formal specifications provided by the core specs.

Architects and solution providers will also need to have a good understanding of the broader decentralized web ecosystem, including emerging standards and best practices. This can help to inform the design of decentralized web applications and services that are secure, scalable, and interoperable.

Overall, the target audience for the DWN companion guide is a technical community that is committed to building a more decentralized and open web. By leveraging the DWN specification, developers, architects, and solution providers can help to accelerate the adoption of decentralized web technologies, and promote greater privacy, security, and user control over their data.

§ Scope

This non-normative guide is intended to provide an overview of the functional requirements and design processes for implementing the Decentralized Web Node (DWN) specification developed by the Decentralized Identity Foundation (DIF). This guide is intended to be used by developers, architects, and solution providers who are interested in building decentralized web applications and services that conform to the DWN specification.

The guide covers the following topics:

This guide is intended to be a non-normative companion to the formal DWN specification developed by the DIF. While it is not a formal specification, this guide is intended to provide practical guidance for implementing the DWN specification in a way that promotes best practices and ensures interoperability with other decentralized web nodes.

Overall, the scope of this non-normative guide is to provide developers, architects, and solution providers with a clear and practical overview of the functional requirements and design processes for implementing the DWN specification developed by the DIF.

§ Disclaimer

This Decentralized Web Node (DWN) companion guide is a non-normative resource that is intended to provide practical guidance on implementing the DWN specification developed by the Decentralized Identity Foundation (DIF). This guide is not a formal specification, and as such, it is not intended to replace or supersede the DWN specification.

The contents of this guide are based on the opinions and experiences of the authors, and are not necessarily endorsed by the DIF or any other organization. The guide is intended to be opinionated in the sense that it represents a particular perspective on how best to implement the DWN specification, based on the authors’ experiences and insights.

Readers are encouraged to use their own judgment and discretion when implementing the DWN specification, and to consider a range of approaches and best practices. This companion guide is not intended to be prescriptive or comprehensive, and readers are encouraged to consult other resources and experts in the field to inform their decisions.

Overall, this companion guide is intended to provide a helpful resource for those interested in implementing the DWN specification, but it should be understood that the opinions and recommendations expressed in this guide are not the only or definitive way to approach decentralized web node design and implementation

§ Terminology

The Terminology section of the Decentralized Web Node (DWN) companion guide is intended to provide a comprehensive and accessible reference for the key terms and concepts related to the DWN specification. This section aims to define important technical terms and concepts in a clear and concise manner, and to provide examples and illustrations where appropriate. The Terminology section is designed to be a useful resource for developers, architects, and solution providers who are new to the world of decentralized web technologies, as well as for those who are more experienced and looking for a refresher or clarification on certain terms and concepts.

§ Technology Comparision

There has been so much rapid development of Decentralized Storage technologies that it’s important to highlight the common aspects, and the differences with the goal of matching their unique features with the Use Case at hand.

We will use the term “Personal and Application Data Storage” to denote the compared technologies whether they are a stack, libraries, protocols, or frameworks.

This is by no means a comprehensive comparison, and we did not test these technologies at scale.

§ Technologies that are not Personal Data Stores

§ DIDCommhttps://didcomm.org/

A DID-based, secured, transport-agnostic, peer-to-peer communications protocol. It lays the foundation to build domain/vertical/application specific protocols.

§ KERIhttps://keri.one/

Enables the portability of Self-Sovereign Identities by eliminating the need for a ledger to establish a root of trust.

§ Nostrhttps://nostr.com/

Nostr has gained some popularity as an open protocol that offers a censorship-resistant alternative to Twitter. It relies on relay servers that accept and store posts. A client or Dapp signs messages with the user’s private key and posts messages to as many relay servers as possible in order to keep the user’s content from being banned. Relay servers do not communicate with each other; thus the responsibility of replication is delegated to the Client application. Users are identified by their public key. That is, every post that is signed can be cryptographically verified.

§ Decentralized Storages that are not intrinsically Personal Data Stores

§ ChainSafe Storagehttps://storage.chainsafe.io/

ChainSafe is an end-to-end, file-encrypting storage application. It persists symmetric-encrypted information on the IPFS/FileCoin network. It is meant to transition traditional Web 2.0 integrations with AWS S3 buckets to Web 3.0.

§ Fleekhttps://docs.fleek.co/

Fleek is a multi-purpose set of technologies that allow Dapp Developers to host web applications on IPFS/FileCoin. It also provides general IPFS/FileCoin storage management. It is geared toward builders rather than individuals. Fleek offers Space and Space Daemon which are intended for building Privacy preserving Dapps. It is currently in Alpha.

§ Protocol Labs IPFS, FileCoin, FVMhttps://fvm.filecoin.io/

IPFS is without a doubt the most successful storage protocol that decouples data from well-known servers, cloud storage, or any type of centralized storage. This is accomplished using Content Addressing (CID) and the segmenting of data in Direct Acyclic Graphs. In IPFS, the location of the data is its CID. FileCoin runs on top of IPFS and offers an incentive-based model for cold storage so that any entity that wants to profit from offering hardware resources may easily do so.

The biggest drawback with IPFS/FileCoin is that once a rogue party has a hold of CIDs, the corresponding data is fully accessible. This paradigm forces client processes to encrypt data prior to storing it. Until now…

Protocol Labs has now released the FileCoin Virtual Machine (FVM) network, an Ethereum-compatible VM. This means that Solidity developers can also develop in the new FVM.

This technology offers the basic L1 plumbing that unleashes the potential for a new open data economy. In essence, this works as a decentralized operating system that orchestrates how data is persisted, retrieved, and governed. One of the basic features is the ability to bring computation to decentralized data. This means that L2 Compute Networks can encrypt and decrypt sensitive information, act as a gatekeeper, and offer the same features as the various Personal Data Stores discussed herein.

It is worth mentioning that FVM uses WebAssembly as the bytecode for Smart Contracts. This means that any program that can be compiled into WebAssembly can be used for on-chain development.

One of the most powerful features of these FVM smart contracts is the ability to define rules for data to obey, most importantly region and location for the storage of that data. This is important in order to remain compliant with regulations such as GDPR; e.g., data about EU citizens must remain within the borders of the European Community.

FVM Consensus is achieved using their Interplanetary Consensus, and it is estimated that FVM will be able to handle transactions in the realm of one billion transactions per second (tps).

§ Personal Data Stores

Solid Pods https://solidproject.org/
Description Decentralized Data Stores that rely on Linked Data to express identity and semantic data. The Pod is owned by an agent (Person, Organization, Group, Device, etc.) that is globally identified by a WebID.
Specification Open Specification incubated by Inrupt and now the W3C.
Deployment A Pod can be hosted by a Solid Pod Provider, or it can be user-deployed using any of its implementations in Node.js, PHP, or as a plugin for Nextcloud.
Identity WebID and its corresponding WebID Profile document. The WebID comes in the form of an HTTP URI, and it allows the linking of many agents in a web of trust using vocabularies such as Friend of a Friend (FOAF) semantics.
Authentication An agent uses its WebID to authenticate using the SOLID-OIDC specification. A Solid Pod server becomes an OpenID provider.
Authorization Access to resources is managed by the Web Access Control system and its underlying Access Control List model. Authorizations are described using the ACL Ontology which is granular at the graph subject level.
Transport HTTP/1.1 through GET, PUT, POST, PATCH, and DELETE HTTP Methods.
Schema / Data Representation Data is encoded in graphs using N3 notation (a superset of RDF triples). Schemas are innate in RDF semantic ontologies. Any graphed relationship denotes a schema in its definition.
Query Capabilities The HTTP GET method allows for N3 Path Syntax. It is very similar to SPARQL 1.1 Property Path Syntax.
License MIT license and the Creative Commons Attribution 4.0
Ceramic and ComposeDB https://ceramic.network/
Description Ceramic is a decentralized data network. Its foundations are laid on top of the Ceramic Event Driven Protocol. The infrastructure to build Personal Data Stores is offered by the Ceramic ComposeDB. ComposeDB replaces IDX and DID Data Store.
Specification Open Specification curated by Ceramic.Network
Deployment A ComposeDB instance is installed as part of Ceramic Node deployment. It can only be hosted in a Cloud environment.
Identity Decentralized Identifiers (DIDs)
Authentication Web3 Wallets and DID.
Authorization Object Capabilities
Transport GraphQL API over HTTP/1.1
Schema / Data Representation API models are defined as GraphQL Schemas. The underlying data store uses graph nodes: Accounts and Documents. Relations are expressed as Edges.
Query Capabilities Partial GraphQL Queries. As of this writing, a query cannot be made against any data attributes.
License
MIT
and Apache
Atomic Data and Atomic Server https://docs.atomicdata.dev/
Description Atomic offers a specification and a server to build JSON-LD for building privacy preserving applications.
Specification Open-Source Specification. The Atomic Server implementation in Rust is also open sourced.
Deployment It can be deployed in a Cloud environment or User-Hosted
Identity PKI
Authentication Json-AD Authentication Resource
Authorization Atomic Hierarchy Model
Transport WebSockets, HTTP 1/1
Schema / Data Representation JSON-AD (JSON-Atomic Data). A variation of JSON-LD which supports the definition of schemas to provide type-safety.
Query Capabilities Atomic Paths, SPARQL
License MIT
Encrypted Data Vaults https://identity.foundation/edv-spec/
Description A specification with the goal of ensuring the privacy of an entity’s data by encrypting the data at rest
Specification Open-Source Specification incubated by DIF
Deployment [Pending]
Identity Support for various Identity models, DIDs being one such.
Authentication [Pending]
Authorization Authorization Capabilities
Transport HTTP 1/1, gRPC, Bluetooth
Schema / Data Representation [Pending]
Query Capabilities The goal is to provide Indexing and Querying capabilities. The working group is in the process of how deciding how this will be done.
License Apache 2.0
MyDex Personal Data Store https://dev.mydex.org/connection-api/personal-data-store.html
Description The MyDex Personal Data Store is a secure data vault residing in the cloud and hosted by MyDex Community Interest Company. An individual’s data is encrypted at rest using the individual’s key. MyDex does not have access to any key for decryption.
Specification Proprietary Specification
Deployment Offered as a SaaS solution
Identity MyDexID derived from PKI
Authentication SAML and OIDC
Authorization Proprietary Data Sharing Agreement
Transport REST over HTTP/1.1
Schema / Data Representation JSON Formatted
Query Capabilities [Not found in documentation]
The Hub of All Things https://www.hubofallthings.com/
Description The Hub of All Things is a service provided by DataSwift who developed the HAT Microserver, a personal web server and its accompanying PostgresQL database. A Hat Microserver segments data in namespaces, such that data from various verticals/domains/apps can live under the same instance.
Specification Proprietary Specification. HAT Microserver implementation in Scala is open sourced.
Deployment Offered as a SaaS solution
Identity HAT Universal ID
Authentication DataSwift One SSO
Authorization HAT Microserver Instructions Contract (HMIC)
Transport REST over HTTP 1.1
Schema / Data Representation JSON Formatted
Query Capabilities [Not found in documentation]
Peergos https://https://peergos.org/
Description Peergos is a decentralised protocol and open-source platform for storage, social media and applications
Specification Open source specification and implementations
Deployment Self Hosted or as a SaaS Multi-Tenant Service
Identity PKI + random keypairs
Authentication Self-authenticated (signed and content addressed) & S3 V4 Signatures for block level access control
Authorization Cryptree based encryption and Block access controls
Transport Transport agnostic. Apps have a local HTTP RESTful API served from a ServiceWorker
Schema / Data Representation DAG CBOR Encoded IPLD Objects and Raw Objects. JSON Schema for app configuration.
Query Capabilities Peergos offers a RESTFul API with various capabilities described here. A few endpoints are directly specified.
License GNU Affero General Public License v3.0
Decentralized Web Nodes https://identity.foundation/decentralized-web-node/spec/
Description Decentralized Web Nodes are a mesh-like datastore construction that enable an entity to operate multiple nodes that sync to the same state across one another, enabling the owning entity to secure, manage, and transact their data with others without reliance on location or provider-specific infrastructure, interfaces, or routing mechanisms.
Specification Open-Source Specification incubated by DIF
Deployment Self Hosted or as a SaaS Multi-Tenant Service
Identity Decentralized Identifiers
Authentication DWN Aware Wallets / DID based
Authorization Permissions employ a capabilities-based architecture that allows for DID-based authorization and delegation of authorized capabilities to others. Derived key encryption with cryptree like encryption scheme.
Transport Transport Agnostic. Currently mostly implemented with HTTP.
Schema / Data Representation Messages committed as IPLD DAG CBOR Encoded Object with attached JSON Schema
Query Capabilities Protocols, Hooks, Records, Permissions
License
WebNative Filesystem https://github.com/wnfs-wg/spec
Description The Web Native File System (WNFS) is a distributed file system. It is versioned, logged, programmable, has strong-yet-flexible security, and is fully controlled by the end user. Service providers can validate writes without reading the contents of the file system, and minimal metadata is leaked.
Specification Open-Source Specification
Deployment Self Hosted or as a SaaS Multi-Tenant Service
Identity Decentralized Identifiers
Authentication Web3 Wallets and DIDs
Authorization Object Capabilities via UCANs
Transport IPFS
Schema / Data Representation Files
Query Capabilities Filesystem Like
License Apache 2.0 and MIT

§ Architecture and Components

This section provides an overview of the high-level architecture of a DWN, including the different components that make up a typical DWN, such as the network layer, data storage layer, identity and access control layer, and the application layer. The section could also provide guidance on how to design and implement each of these components to conform to the DWN specification.

§ Node Discovery and Peer-to-Peer Networking

This section provides detailed guidance on how to implement the node discovery and peer-to-peer networking protocols that are required for a DWN to function properly. This section could cover topics such as how to bootstrap a new node onto the network, how to maintain a list of known nodes, how to discover and connect to new peers, and how to propagate data across the network.

§ Data Sharing and Interoperability

This section provides guidance on how to design and implement data sharing protocols that conform to the DWN specification, including the use of decentralized storage systems like IPFS and the InterPlanetary Linked Data (IPLD) format. This section could also cover strategies for promoting interoperability between different decentralized web nodes and data sharing protocols, such as the use of standardized data formats and metadata.

§ Security and Privacy

This section provides guidance on how to design and implement security and privacy features that conform to the DWN specification, including the use of cryptographic protocols like Public Key Infrastructure (PKI) and Self-Sovereign Identity (SSI) for secure data sharing and verification. This section could also cover best practices for securing DWN infrastructure and protecting user data against common attacks and threats.

§ Testing and Debugging

This section provides guidance on how to test and debug a DWN implementation, including strategies for testing individual components and the network as a whole, as well as tools and techniques for troubleshooting issues that may arise during development or deployment.

§ Deployment and Operations

This section provides guidance on how to deploy and operate a DWN implementation in a production environment, including best practices for scaling and managing a distributed network, as well as tools and techniques for monitoring and managing network performance and reliability. This section could also cover strategies for maintaining backward compatibility and promoting interoperability with other decentralized web nodes and protocols.

§ Local Nodes, Remote Nodes, and Relays

This section clarifies the role of a remote node, a local node, and a relay, with respect to a deployment. It is important to note that they are actually all the same thing, in that each is actually a DWN with no feature differences across these deployment types, but in practice a local node may be used slightly differently than a remote node.

This section clarifies the difference in use between local and remote nodes, and what it means for a DWN to be a “relay”.

§ Example Deployment (Simple)

In this simple example, each actor has a remote (i.e a server) and local node (i.e a phone). As an example, you have a chat app with a remote and local node. Alice wants to send a message to Bob in this case, and Bob will reply with a message back.

DWN Simple
Connection

Steps

  1. Bob shares DID to Alice (via a QR code or some other transport)
  2. Alice Resolve’s Bob’s DID
  3. Alice sends a message to Bob’s node discovered via a Service Endpoint in the DID Document
  4. Bob’s Node relays the Alice’s message from the remote note to the local node.
  5. Bob resolves Alice’s DID and finds the service endpoints
  6. Bob’s local node ACTs on the message, sending a message back to Alice’s Node
  7. Alice’s remote node receives the message and relays it locally.

§ Example Deployment (Complex)

§ Miscellaneous

§ Example Use Cases

§ Real World Applications

§ DWN Adoption

§ Ecosystem interplay

§ Limitations and Other Considerations

§ Q&A

§ General Questions

§ Security Questions

§ Specification Questions

§ Technical Questions

§ Reference Implementations

Table of Contents