Batch call for retrieving blobs from RPC #8952

StefanBratanov · 2024-12-21T09:56:26Z

PR Description

When we need to retrieve blobs from RPC when following the head, we do one request per a sidecar. This PR changes this functionality to do one batch call. It will save bandwith (when used) and could help with rate limiting. Ideally, this should be merged after #8927

Fixed Issue(s)

fixes #8928

Documentation

I thought about documentation and added the doc-change-required label to this PR if updates are required.

Changelog

I thought about adding a changelog entry, and added one if I deemed necessary.

tbenr · 2025-01-07T14:37:01Z

beacon/sync/src/main/java/tech/pegasys/teku/beacon/sync/fetch/FetchBlobSidecarsTask.java

 import tech.pegasys.teku.spec.datastructures.blobs.versions.deneb.BlobSidecar;
 import tech.pegasys.teku.spec.datastructures.networking.libp2p.rpc.BlobIdentifier;

-public class FetchBlobSidecarTask extends AbstractFetchTask<BlobIdentifier, BlobSidecar> {
+public class FetchBlobSidecarsTask extends AbstractFetchTask<Bytes32, List<BlobSidecar>> {


I have doubts here.
In principle K type should uniquely identify a task, so I think we should have a record(blockroot, List<BlobIdentifier>).

tbenr · 2025-01-07T14:48:16Z

...src/main/java/tech/pegasys/teku/beacon/sync/gossip/blobs/RecentBlobSidecarsFetchService.java

-    if (allTasks.putIfAbsent(blobIdentifier, task) != null) {
+    final FetchBlobSidecarsTask task =
+        fetchTaskFactory.createFetchBlobSidecarsTask(blockRoot, requiredBlobIdentifiers);
+    if (allTasks.putIfAbsent(blockRoot, task) != null) {


so here we can have problems if we change our mind and we initiate a request with a different List<blobIdentifier> for the same block root.

we could argue that the number of blobs identifier can only decrease with time, so we don't care if we take the initial request and cancel the new one, but:

introduces an assumption that needs to be handled here, and it is risky (if we don't check we may cancel the task containing what we really need)

anyway this approach breaks the design of the abstract class IMO

so if we move to the theoretical correct key we could and up requesting the same blob(s) in two different tasks, but at least we know that we will have them all executed despite the order of arrival

so at the end:

option 1:

change the key and allow multiple request for same blockRoot (potential duplication of blobs download)

cancel method still based on blockRoot (looping on allTasks)

option 2:

leave the key as blockRoot, adding some checks when adding a task for a blockRoot already present (logging an error if this is not supposed to happen, or some logic that compares the identifier list, cancel one and leave the other)

cancel method remains as is

StefanBratanov · 2025-01-08T10:26:33Z

Had an internal discussion and decided to potentially revisit this idea in the future

StefanBratanov force-pushed the batch_call_blobs_rpc branch 2 times, most recently from 6d1e538 to 68fa5c2 Compare December 30, 2024 11:01

StefanBratanov force-pushed the batch_call_blobs_rpc branch from 6006719 to 96bf28a Compare January 7, 2025 08:36

tbenr reviewed Jan 7, 2025

View reviewed changes

StefanBratanov added 12 commits January 8, 2025 10:52

Batch call for retrieving blobs from RPC

6ab31e9

fix assemble

2d1a0a5

fix assemble

c074702

fix test

cf31e8a

Remove unused method

cc9ac9f

fix assemble

c05f1d5

add a fix

fbd4fb0

small optimization

8d6fe11

fix logging

9eca8e1

simplify

9cb900b

change error to debug

1e83524

change key for the task

cc702fb

StefanBratanov force-pushed the batch_call_blobs_rpc branch from 542d0a9 to cc702fb Compare January 8, 2025 08:52

StefanBratanov closed this Jan 8, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Batch call for retrieving blobs from RPC #8952

Batch call for retrieving blobs from RPC #8952

StefanBratanov commented Dec 21, 2024 •

edited

Loading

tbenr Jan 7, 2025

tbenr Jan 7, 2025

tbenr Jan 7, 2025 •

edited

Loading

tbenr Jan 7, 2025

StefanBratanov commented Jan 8, 2025

Batch call for retrieving blobs from RPC #8952

Batch call for retrieving blobs from RPC #8952

Conversation

StefanBratanov commented Dec 21, 2024 • edited Loading

PR Description

Fixed Issue(s)

Documentation

Changelog

tbenr Jan 7, 2025

Choose a reason for hiding this comment

tbenr Jan 7, 2025

Choose a reason for hiding this comment

tbenr Jan 7, 2025 • edited Loading

Choose a reason for hiding this comment

tbenr Jan 7, 2025

Choose a reason for hiding this comment

StefanBratanov commented Jan 8, 2025

StefanBratanov commented Dec 21, 2024 •

edited

Loading

tbenr Jan 7, 2025 •

edited

Loading