From c73909df0f23fa62f90ff7e893fe05236cb4ab31 Mon Sep 17 00:00:00 2001 From: Danny Salman Date: Sun, 15 Jan 2023 12:00:18 -0500 Subject: [PATCH 1/7] populate content routing overview --- content/concepts/content-routing/overview.md | 54 +++++++++++++++++++- 1 file changed, 53 insertions(+), 1 deletion(-) diff --git a/content/concepts/content-routing/overview.md b/content/concepts/content-routing/overview.md index 149fea02..610008bf 100644 --- a/content/concepts/content-routing/overview.md +++ b/content/concepts/content-routing/overview.md @@ -4,4 +4,56 @@ description: "Learn about Noise in libp2p." weight: 218 --- -Coming soon! +## Overview + +Content routing refers to the process of directing data to its intended recipient within a +P2P network. In a traditional client-server network, routing is a straightforward process +because a central authority manages the flow of data. However, in a P2P network, there is +no central authority, so routing becomes more complex. + +One of the main challenges of content routing in P2P networks is scalability. As the number +of peers in a network increases, the number of possible routes for data to travel also increases. +This can lead to a situation where the routing table becomes excessively large and difficult to +manage. + +Another challenge is fault tolerance. In a centralized network, if the central server goes down, +the entire network goes down. In a P2P network, if a peer goes down, the network can still +function, but losing that peer can create a bottleneck and make routing more difficult. + +In general, specific characteristics of P2P networks complicate this process, including: + +- The lack of universal orchestration that a central server can provide when + querying and retrieving content. +- Not having a central directory that contains information about reaching every peer + in the network. +- The presence of high node churn. +- Creating a resilient, scalable, and optimal routing protocol that is resistant to +- Resistance against Sybil attacks. +- Forward compatibility. + +## Content Routing in libp2p + +libp2p provides a set of modules for different network-level functionality, including +a content routing interface. + +```shell +interface ContentRouting { + Provide(CID, bool) error + FindProviders(CID) [Multiaddr] +} +``` + +In libp2p, content routing is based on a +[Distributed Hash Table (DHT) called Kademlia](../introduction/protocols/kaddht.md). Kademlia assigns +each piece of content a unique identifier and stores the content on the peer whose identifier is +closest to the content's identifier. This allows for efficient routing, reducing the number of possible +routes. The content router is simply an index of the peer serving the content of interest, +and a DHT is used to maintain a P2P index. More information can be found in the +[Kad-DHT content routing document](kaddht.md). + +{{< alert icon="" context="note">}} +While there are different design approaches for a content routing protocol, such as +Kademlia DHT, DNS, and BitTorrent trackers, the libp2p +documentation will focus on a DHT-based approach that implements the content routing +interface: Kad-DHT-libp2p. +{{< /alert >}} From d0b175c6e1f8167491066afb2498cfe8fadcab77 Mon Sep 17 00:00:00 2001 From: Danny Salman Date: Sat, 28 Jan 2023 10:52:15 -0500 Subject: [PATCH 2/7] incorporate PR feedback & general improvements --- content/concepts/content-routing/overview.md | 37 +++++++++----------- 1 file changed, 17 insertions(+), 20 deletions(-) diff --git a/content/concepts/content-routing/overview.md b/content/concepts/content-routing/overview.md index 610008bf..5b801750 100644 --- a/content/concepts/content-routing/overview.md +++ b/content/concepts/content-routing/overview.md @@ -9,16 +9,12 @@ weight: 218 Content routing refers to the process of directing data to its intended recipient within a P2P network. In a traditional client-server network, routing is a straightforward process because a central authority manages the flow of data. However, in a P2P network, there is -no central authority, so routing becomes more complex. +no central authority, so routing becomes more complex as data only travels point-to-point +(except when relayed). One of the main challenges of content routing in P2P networks is scalability. As the number -of peers in a network increases, the number of possible routes for data to travel also increases. -This can lead to a situation where the routing table becomes excessively large and difficult to -manage. - -Another challenge is fault tolerance. In a centralized network, if the central server goes down, -the entire network goes down. In a P2P network, if a peer goes down, the network can still -function, but losing that peer can create a bottleneck and make routing more difficult. +of peers increase, it becomes more difficult to efficiently find and retrieve the desired +content due to the larger number of potential sources. In general, specific characteristics of P2P networks complicate this process, including: @@ -27,16 +23,18 @@ In general, specific characteristics of P2P networks complicate this process, in - Not having a central directory that contains information about reaching every peer in the network. - The presence of high node churn. -- Creating a resilient, scalable, and optimal routing protocol that is resistant to -- Resistance against Sybil attacks. +- Creating a resilient, scalable, and optimal routing protocol resistant to Sybil attacks. - Forward compatibility. +Content routing protocols attempt to addresses these challenges by providing an effective way +to identify a specific peer that holds the requested data. + ## Content Routing in libp2p libp2p provides a set of modules for different network-level functionality, including a content routing interface. -```shell +```go interface ContentRouting { Provide(CID, bool) error FindProviders(CID) [Multiaddr] @@ -44,16 +42,15 @@ interface ContentRouting { ``` In libp2p, content routing is based on a -[Distributed Hash Table (DHT) called Kademlia](../introduction/protocols/kaddht.md). Kademlia assigns -each piece of content a unique identifier and stores the content on the peer whose identifier is -closest to the content's identifier. This allows for efficient routing, reducing the number of possible -routes. The content router is simply an index of the peer serving the content of interest, -and a DHT is used to maintain a P2P index. More information can be found in the -[Kad-DHT content routing document](kaddht.md). +[Distributed Hash Table (DHT) called Kademlia](../introduction/protocols/kaddht.md). +Kademlia assigns each piece of content a unique identifier and stores the content on the +peer whose identifier is closest to the content's identifier. This allows for efficient +routing, reducing the number of possible routes. A content router is used to find peers +that have requested content and tells the network that a peer can provide certain content. +More information can be found in the [Kad-DHT content routing document](kaddht.md). {{< alert icon="" context="note">}} While there are different design approaches for a content routing protocol, such as -Kademlia DHT, DNS, and BitTorrent trackers, the libp2p -documentation will focus on a DHT-based approach that implements the content routing -interface: Kad-DHT-libp2p. +Kademlia DHT, DNS, and BitTorrent trackers, the +documentation will focus on the DHT-based approach that libp2p takes. {{< /alert >}} From 11c799cac1a4e82d41e12fd38449765413c93a4b Mon Sep 17 00:00:00 2001 From: Danny Salman Date: Sat, 28 Jan 2023 10:59:06 -0500 Subject: [PATCH 3/7] incorporate PR feedback & general improvements --- content/concepts/content-routing/overview.md | 15 +++++++++------ 1 file changed, 9 insertions(+), 6 deletions(-) diff --git a/content/concepts/content-routing/overview.md b/content/concepts/content-routing/overview.md index 5b801750..038f22eb 100644 --- a/content/concepts/content-routing/overview.md +++ b/content/concepts/content-routing/overview.md @@ -6,8 +6,9 @@ weight: 218 ## Overview -Content routing refers to the process of directing data to its intended recipient within a -P2P network. In a traditional client-server network, routing is a straightforward process +Content routing refers to the process of identifying a specific peer that holds +certain data and the means to connect to them within a P2P network. +In a traditional client-server network, routing is a straightforward process because a central authority manages the flow of data. However, in a P2P network, there is no central authority, so routing becomes more complex as data only travels point-to-point (except when relayed). @@ -43,10 +44,12 @@ interface ContentRouting { In libp2p, content routing is based on a [Distributed Hash Table (DHT) called Kademlia](../introduction/protocols/kaddht.md). -Kademlia assigns each piece of content a unique identifier and stores the content on the -peer whose identifier is closest to the content's identifier. This allows for efficient -routing, reducing the number of possible routes. A content router is used to find peers -that have requested content and tells the network that a peer can provide certain content. +Kademlia assigns each piece of content a unique identifier and stores the information +of the peer holding the content whose identifier is closest to the content's +identifier. This allows for efficient routing, reducing the number of possible routes. +A content router is used to find peers that have requested content and tells the network +that a peer can provide certain content. + More information can be found in the [Kad-DHT content routing document](kaddht.md). {{< alert icon="" context="note">}} From c75563098e53d06105aa54ad1c872eb3ae2c4d9c Mon Sep 17 00:00:00 2001 From: Danny Salman Date: Wed, 1 Feb 2023 08:05:00 -0500 Subject: [PATCH 4/7] incorporate PR feedback and enhance --- content/concepts/content-routing/overview.md | 34 ++++++++++++-------- 1 file changed, 21 insertions(+), 13 deletions(-) diff --git a/content/concepts/content-routing/overview.md b/content/concepts/content-routing/overview.md index 038f22eb..7af7671e 100644 --- a/content/concepts/content-routing/overview.md +++ b/content/concepts/content-routing/overview.md @@ -6,16 +6,24 @@ weight: 218 ## Overview -Content routing refers to the process of identifying a specific peer that holds -certain data and the means to connect to them within a P2P network. -In a traditional client-server network, routing is a straightforward process -because a central authority manages the flow of data. However, in a P2P network, there is -no central authority, so routing becomes more complex as data only travels point-to-point -(except when relayed). +The most widely used method for accessing data today is via addressing, +where the desired content can be located by its name on a specific server. +This makes routing simple as the server information is readily available. +For example, the URL "example.com/cat.jpg" informs us that the server with +the domain name "example.com" has a file named "cat.jpg". -One of the main challenges of content routing in P2P networks is scalability. As the number -of peers increase, it becomes more difficult to efficiently find and retrieve the desired -content due to the larger number of potential sources. +However, when accessing content-addressed data, where the content itself +serves as the addressing mechanism, routing becomes a challenge when only +the hash value of the content is known. This is where content routing comes +into play, providing a solution to the problem of determining who has the +content given only its hash value. + +Content routing refers to the process of identifying a specific peer that +holds certain data and the means to connect to them within a P2P network. + +One of the main challenges of content routing in P2P networks is scalability. +As the number of peers increase, it becomes more difficult to efficiently find +and retrieve the desired content due to the larger number of potential sources. In general, specific characteristics of P2P networks complicate this process, including: @@ -27,8 +35,8 @@ In general, specific characteristics of P2P networks complicate this process, in - Creating a resilient, scalable, and optimal routing protocol resistant to Sybil attacks. - Forward compatibility. -Content routing protocols attempt to addresses these challenges by providing an effective way -to identify a specific peer that holds the requested data. +Content routing protocols have to deal with these problems while still providing a way to +find out who has the requested data. ## Content Routing in libp2p @@ -54,6 +62,6 @@ More information can be found in the [Kad-DHT content routing document](kaddht.m {{< alert icon="" context="note">}} While there are different design approaches for a content routing protocol, such as -Kademlia DHT, DNS, and BitTorrent trackers, the -documentation will focus on the DHT-based approach that libp2p takes. +Kademlia DHT, DNS, and BitTorrent trackers, libp2p provides a Kademlia +implementation that applications can use to find and provide content. {{< /alert >}} From 4be9e5651655228d9e85c4f8c448cfe76c1d95e7 Mon Sep 17 00:00:00 2001 From: Danny Salman Date: Wed, 1 Feb 2023 08:06:59 -0500 Subject: [PATCH 5/7] edits --- content/concepts/content-routing/overview.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/content/concepts/content-routing/overview.md b/content/concepts/content-routing/overview.md index 7af7671e..1a9a91e1 100644 --- a/content/concepts/content-routing/overview.md +++ b/content/concepts/content-routing/overview.md @@ -8,7 +8,7 @@ weight: 218 The most widely used method for accessing data today is via addressing, where the desired content can be located by its name on a specific server. -This makes routing simple as the server information is readily available. +This makes routing simple, as the server information is readily available. For example, the URL "example.com/cat.jpg" informs us that the server with the domain name "example.com" has a file named "cat.jpg". @@ -22,7 +22,7 @@ Content routing refers to the process of identifying a specific peer that holds certain data and the means to connect to them within a P2P network. One of the main challenges of content routing in P2P networks is scalability. -As the number of peers increase, it becomes more difficult to efficiently find +As the number of peers increases, it becomes more difficult to efficiently find and retrieve the desired content due to the larger number of potential sources. In general, specific characteristics of P2P networks complicate this process, including: From 30467606496d320915aca380bbf5b4b7f75f65b1 Mon Sep 17 00:00:00 2001 From: Marco Munizaga Date: Sat, 25 Feb 2023 00:23:03 +0000 Subject: [PATCH 6/7] Edits --- content/concepts/content-routing/overview.md | 40 +++++++++----------- 1 file changed, 17 insertions(+), 23 deletions(-) diff --git a/content/concepts/content-routing/overview.md b/content/concepts/content-routing/overview.md index 1a9a91e1..0393f6d6 100644 --- a/content/concepts/content-routing/overview.md +++ b/content/concepts/content-routing/overview.md @@ -6,19 +6,18 @@ weight: 218 ## Overview -The most widely used method for accessing data today is via addressing, +The most widely used method for accessing data today is via location addressing, where the desired content can be located by its name on a specific server. This makes routing simple, as the server information is readily available. For example, the URL "example.com/cat.jpg" informs us that the server with the domain name "example.com" has a file named "cat.jpg". -However, when accessing content-addressed data, where the content itself -serves as the addressing mechanism, routing becomes a challenge when only -the hash value of the content is known. This is where content routing comes -into play, providing a solution to the problem of determining who has the -content given only its hash value. +However, with content-addressed data the content itself serves as the address. +Routing becomes a challenge since we only know what the data looks like, not who +has it. This is where content routing comes into play, giving us a way to +figure out who has the content and how we can communicate with them. -Content routing refers to the process of identifying a specific peer that +Content routing is the abstract process of identifying a specific peer that holds certain data and the means to connect to them within a P2P network. One of the main challenges of content routing in P2P networks is scalability. @@ -29,20 +28,18 @@ In general, specific characteristics of P2P networks complicate this process, in - The lack of universal orchestration that a central server can provide when querying and retrieving content. -- Not having a central directory that contains information about reaching every peer - in the network. +- Not having a central directory that contains information about how to reach + every peer in the network. - The presence of high node churn. -- Creating a resilient, scalable, and optimal routing protocol resistant to Sybil attacks. +- Creating a resilient, scalable, and optimal routing protocol resistant to + Sybil attacks. - Forward compatibility. -Content routing protocols have to deal with these problems while still providing a way to -find out who has the requested data. +Content routing protocols have to deal with these problems while still providing +a way to find out who has the requested data. ## Content Routing in libp2p -libp2p provides a set of modules for different network-level functionality, including -a content routing interface. - ```go interface ContentRouting { Provide(CID, bool) error @@ -50,15 +47,12 @@ interface ContentRouting { } ``` -In libp2p, content routing is based on a -[Distributed Hash Table (DHT) called Kademlia](../introduction/protocols/kaddht.md). -Kademlia assigns each piece of content a unique identifier and stores the information -of the peer holding the content whose identifier is closest to the content's -identifier. This allows for efficient routing, reducing the number of possible routes. -A content router is used to find peers that have requested content and tells the network -that a peer can provide certain content. +In libp2p, content routing is based on a [Distributed Hash Table (DHT) called +Kademlia](../introduction/protocols/kaddht.md). At a high level, the DHT stores +the content ID (CID) as a key and the value is the list of peers who can provide +it. The DHT handle spreading the load across the network and building in resiliency. -More information can be found in the [Kad-DHT content routing document](kaddht.md). +More information on Kademlia can be found in the [Kad-DHT content routing document](kaddht.md). {{< alert icon="" context="note">}} While there are different design approaches for a content routing protocol, such as From 29ddad17539afbf484c86668365544637892e0f2 Mon Sep 17 00:00:00 2001 From: Prithvi Shahi <50885601+p-shahi@users.noreply.github.com> Date: Fri, 24 Feb 2023 17:38:29 -0800 Subject: [PATCH 7/7] Update overview.md --- content/concepts/content-routing/overview.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/content/concepts/content-routing/overview.md b/content/concepts/content-routing/overview.md index 0393f6d6..c3393e48 100644 --- a/content/concepts/content-routing/overview.md +++ b/content/concepts/content-routing/overview.md @@ -9,8 +9,8 @@ weight: 218 The most widely used method for accessing data today is via location addressing, where the desired content can be located by its name on a specific server. This makes routing simple, as the server information is readily available. -For example, the URL "example.com/cat.jpg" informs us that the server with -the domain name "example.com" has a file named "cat.jpg". +For example, the URL `example.com/cat.jpg` informs us that the server with +the domain name `example.com` has a file named `cat.jpg`. However, with content-addressed data the content itself serves as the address. Routing becomes a challenge since we only know what the data looks like, not who