Add option to get execution data from AN/Observer by koko1123 · Pull Request #89 · onflow/flow-archive

koko1123 · 2023-04-05T02:05:06Z

Goal of this PR

Introduces a second way to get execution data: through the execution data API of a trusted node.

Fixes #32

peterargue · 2023-04-05T22:45:59Z

 	pflag.StringVar(&flagSeedAddress, "seed-address", "", "host address of seed node to follow consensus")
 	pflag.StringVar(&flagSeedKey, "seed-key", "", "hex-encoded public network key of seed node to follow consensus")
 	pflag.BoolVarP(&flagTracing, "tracing", "t", false, "enable tracing for this instance")
+	pflag.BoolVarP(&flagDisableGCP, "disable-cloud-streaming", "g", false, "disable streaming exec data from GCP and use the Access Node instead")


Rather than having a separate option to disable GCP, could we just enable it only if the bucket is set?
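A minimal sketch of that suggestion, assuming the decision can be reduced to whether a bucket name was supplied. The flag name `bucket`, the `streamerSource` type, and `chooseSource` are hypothetical names for illustration and are not taken from the PR.

```go
package main

import (
	"flag"
	"fmt"
)

// streamerSource names where execution data would be read from.
// It is a hypothetical type used only in this sketch.
type streamerSource string

const (
	sourceGCP       streamerSource = "gcp-bucket"
	sourceAccessAPI streamerSource = "access-api"
)

// chooseSource enables GCP streaming only when a bucket name is configured;
// otherwise it falls back to the Access Node's execution data API.
func chooseSource(bucket string) streamerSource {
	if bucket != "" {
		return sourceGCP
	}
	return sourceAccessAPI
}

func main() {
	// "bucket" is an assumed flag name; an empty value disables GCP streaming.
	bucket := flag.String("bucket", "", "GCP bucket holding execution data (empty disables GCP streaming)")
	flag.Parse()

	// Printing the choice at startup shows which streamer is in use.
	fmt.Printf("execution data source: %s\n", chooseSource(*bucket))
}
```

With this shape there is one fewer flag to keep consistent, since an unset bucket already implies that GCP cannot be used.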

peterargue · 2023-04-05T22:48:26Z

+	ctx := context.Background()
+
+	// initialize clients
+	opts := []grpc.DialOption{grpc.WithDefaultCallOptions(grpc.MaxCallRecvMsgSize(grpcutils.DefaultMaxMsgSize)),


This should be configurable. grpcutils.DefaultMaxMsgSize most likely won't be enough for large blocks.

peterargue · 2023-04-05T23:02:14Z

+// data is available at the moment.
+func (e *ExecDataStreamer) Next() (*uploader.BlockData, error) {
+	// same implementation as GCPStreamer
+	go e.poll()


You could simplify this logic a bit by running a fixed-size worker pool that continuously reads from e.queue and fills e.buffer up to the configured size. Then there is no need for explicit polling, and you can tune the number of workers needed to keep up with indexing.
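The worker-pool idea could look roughly like the sketch below. It models `e.queue` and `e.buffer` as Go channels, which is an assumption about their types; `blockData`, `fetchFunc`, and `startWorkers` are hypothetical names, with `blockData` standing in for `uploader.BlockData`.

```go
package main

import (
	"fmt"
	"sync"
)

// blockData is a placeholder for uploader.BlockData.
type blockData struct {
	BlockID string
}

// fetchFunc retrieves the data for one block (hypothetical signature).
type fetchFunc func(blockID string) (*blockData, error)

// startWorkers launches n workers that drain queue and push results into a
// buffered channel of capacity bufSize. A full buffer blocks the workers,
// which gives backpressure without any explicit polling. The returned
// channel is closed once queue has been closed and drained.
func startWorkers(n, bufSize int, queue <-chan string, fetch fetchFunc) <-chan *blockData {
	buffer := make(chan *blockData, bufSize)
	var wg sync.WaitGroup
	for i := 0; i < n; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			for id := range queue {
				data, err := fetch(id)
				if err != nil {
					// A real implementation would retry or surface the error.
					fmt.Printf("fetch %s failed: %v\n", id, err)
					continue
				}
				buffer <- data
			}
		}()
	}
	// Close the buffer only after every worker has finished.
	go func() {
		wg.Wait()
		close(buffer)
	}()
	return buffer
}

func main() {
	queue := make(chan string, 3)
	for _, id := range []string{"a", "b", "c"} {
		queue <- id
	}
	close(queue)

	fetch := func(id string) (*blockData, error) { return &blockData{BlockID: id}, nil }
	for data := range startWorkers(2, 2, queue, fetch) {
		fmt.Println("received", data.BlockID)
	}
}
```

One caveat with this design: with more than one worker, results can arrive out of order, so a consumer that needs blocks in height order would have to reorder them or key the buffer by block ID.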

peterargue · 2023-04-05T23:04:46Z

+	b, err := e.accessApi.GetBlockByID(e.ctx, br)
+	if err != nil {
+		errs = multierror.Append(errs, fmt.Errorf("failed to get block data for blockID (%s): %w", blockID, err))
+	}


Could we just get this from local storage?

koko1123 · 2023-05-17

We'd only have the header in local storage.

peterargue · 2023-05-18

Why only the header? The archive node is running a consensus follower, which should be indexing full blocks.

peterargue · 2023-04-05T23:12:36Z

+	tx, err := e.accessApi.GetTransactionResultsByBlockID(e.ctx, txr)
+	if err != nil {
+		errs = multierror.Append(errs, fmt.Errorf("failed to get transaction results for blockID (%s): %w", blockID, err))
+	}


Are we planning to serve the transaction result data? If not, maybe we can just stop indexing it?

If we want to keep it, this approach is fine for now, but we'll have to handle the case where the response size is too large. In that case, we need to get the total tx count (from the list of collections in the exec data), and call GetTransactionResultByIndex for each index to fetch the results individually.

koko1123 · 2023-05-17

We now serve transactions from the access API implemented in Archive, but yes, I want to remove this call entirely. Left as a todo.
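For reference, the fallback described above (bulk fetch first, then one call per index when the response is too large) can be sketched as follows. Everything here is hypothetical: `resultsClient` is a trimmed-down stand-in for the Access API client, `errTooLarge` is a sentinel invented for the sketch (a real gRPC client would more likely check for a `ResourceExhausted` status), and `fakeClient` exists only to exercise the code.

```go
package main

import (
	"errors"
	"fmt"
)

// txResult is a placeholder for a transaction result.
type txResult struct{ Index uint32 }

// errTooLarge models a response that exceeds the maximum message size.
var errTooLarge = errors.New("response too large")

// resultsClient is a hypothetical, reduced view of the Access API.
type resultsClient interface {
	ResultsByBlock(blockID string) ([]txResult, error)
	ResultByIndex(blockID string, index uint32) (txResult, error)
}

// fetchResults tries the bulk call first and falls back to one call per
// index when the bulk response is too large. txCount is assumed to come
// from the collections listed in the execution data.
func fetchResults(c resultsClient, blockID string, txCount uint32) ([]txResult, error) {
	results, err := c.ResultsByBlock(blockID)
	if err == nil {
		return results, nil
	}
	if !errors.Is(err, errTooLarge) {
		return nil, fmt.Errorf("failed to get transaction results for block %s: %w", blockID, err)
	}
	results = make([]txResult, 0, txCount)
	for i := uint32(0); i < txCount; i++ {
		r, err := c.ResultByIndex(blockID, i)
		if err != nil {
			return nil, fmt.Errorf("failed to get transaction result %d for block %s: %w", i, blockID, err)
		}
		results = append(results, r)
	}
	return results, nil
}

// fakeClient is a stub used only to exercise both paths.
type fakeClient struct{ tooLarge bool }

func (f fakeClient) ResultsByBlock(string) ([]txResult, error) {
	if f.tooLarge {
		return nil, errTooLarge
	}
	return []txResult{{Index: 0}}, nil
}

func (f fakeClient) ResultByIndex(_ string, i uint32) (txResult, error) {
	return txResult{Index: i}, nil
}

func main() {
	results, err := fetchResults(fakeClient{tooLarge: true}, "block-1", 3)
	fmt.Println(len(results), err)
}
```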

zhangchiqing · 2023-04-05T23:53:15Z

+		streamer = cloud.NewGCPStreamer(log, bucket,
+			cloud.WithCatchupBlocks(blockIDs),
+		)
+	}


We need to log which streamer ends up being used.

zhangchiqing · 2023-04-05T23:54:54Z

+		}()
+		bucket := client.Bucket(flagBucket)
+		streamer = cloud.NewGCPStreamer(log, bucket,
+			cloud.WithCatchupBlocks(blockIDs),


I'm afraid the number of blockIDs might be too big, especially if indexing is far behind.

Could we at least log the total number of blockIDs here?

zhangchiqing · 2023-04-05T23:57:03Z

+			return fmt.Errorf("could not pull execution record (name: %s): %w", blockID, err)
+		}
+
+		e.log.Debug().


This log is worth promoting to INFO level.

Suggested change

-		e.log.Debug().
+		e.log.Info().


koko1123 added 3 commits April 4, 2023 20:07

Create streamer to poll from API

ed42c36

leave todos

38d4c1f

add multierror

542174d

koko1123 changed the title ~~Amlandeep/add exec api option~~ Add option to get execution data from AN/Observer Apr 5, 2023

koko1123 marked this pull request as ready for review April 5, 2023 19:59

j1010001 mentioned this pull request Apr 5, 2023

Create new Polling function to poll AN GetExecutionDataByBlockID API instead of gcp streamer #32

Open

peterargue reviewed Apr 5, 2023


zhangchiqing approved these changes Apr 5, 2023


koko1123 added 5 commits May 14, 2023 12:41

merge conflicts

733562f

progress

36d088c

address comments

2441976

Merge branch 'master' of https://github.com/onflow/flow-dps into amla…

be2ca10

…ndeep/add-exec-api-option

update streamer library

1e87bf3

koko1123 mentioned this pull request Jul 13, 2023

Execution node uploader only uploads last event instead of all events. onflow/flow-go#4558

Closed

koko1123 wants to merge 8 commits into master from amlandeep/add-exec-api-option
