Add metrics to monitor the query costs #51

guerremdq · 2023-10-30T14:52:08Z

Expose metrics for each query to monitor the query cost.

- Add keepAlive flag to not kill the app when a query fails

stephen-soltesz · 2023-11-01T19:17:08Z

@guerremdq -- adding these metrics would be welcomed. Thank you! Please be mindful of unit tests, code coverage, and the build status.

guerremdq · 2023-11-03T15:56:32Z

Hey @stephen-soltesz, I need to implement a few more methods in the cloudtest library to be able to test this: m-lab/go#174

stephen-soltesz · 2023-11-08T00:43:50Z

Changes from m-lab/go#174 are available in https://github.com/m-lab/go/releases/tag/v0.1.67

- Add metrics for successful and failed queries
- Add keepAlive flag to not kill the app when a query fails
- Add metric names prefix
- Remove duplicated metric calls
- Add extra label to update duration metric

guerremdq · 2023-11-08T09:54:45Z

Thanks @stephen-soltesz, let me know what you think about this PR.

stephen-soltesz

Thank you @guerremdq -- I've added a few suggestions that I think will make the change a little simpler.

Reviewed 1 of 7 files at r1, 1 of 7 files at r3, 1 of 1 files at r5, 2 of 2 files at r7.
Reviewable status: 0 of 1 LGTMs obtained

query/bigquery_runner.go line 22 at r7 (raw file):

}

func (b *bigQueryImpl) Query(query string, visit func(row map[string]bigquery.Value) error) (*bigquery.QueryStatistics, error) {

Rather than returning bigquery.QueryStatistics here (and for all existing cases calling Query), consider extending the BQRunner interface instead.

Then the bigQueryImpl struct could include a bigquery.QueryStatistics field that's set after a successful query and could be read by callers that need it from a new method, say, LastStats() or similar.

Then the *Collector.Update() loop would call Query() followed by LastStats() to update the metrics -- and all the return type changes throughout this PR are unnecessary.
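A minimal sketch of the shape suggested above, written against stand-in types so it compiles without the BigQuery client. `QueryStats`, `Row`, `fakeRunner`, and `update` are hypothetical names for illustration only; in the real change the stats type would be `bigquery.QueryStatistics` and the implementation would live in `bigQueryImpl`. Clearing the stored stats at the start of each query is an assumption of this sketch, not something the review asks for.

```go
package main

import "errors"

// QueryStats is a stand-in for bigquery.QueryStatistics (assumption:
// only the fields needed for cost metrics are modelled here).
type QueryStats struct {
	TotalBytesBilled int64
	SlotMillis       int64
}

// Row is a stand-in for map[string]bigquery.Value.
type Row map[string]interface{}

// BQRunner keeps the original Query signature and gains LastStats,
// the (hypothetically named) accessor proposed in the review.
type BQRunner interface {
	Query(query string, visit func(row Row) error) error
	LastStats() *QueryStats
}

// fakeRunner plays the role of bigQueryImpl for this sketch.
type fakeRunner struct {
	rows  []Row
	stats QueryStats  // what the backend would report for the query
	fail  bool        // when true, Query returns an error
	last  *QueryStats // set only after a successful query
}

func (f *fakeRunner) Query(query string, visit func(row Row) error) error {
	f.last = nil // assumption: never expose stats from an earlier query
	if f.fail {
		return errors.New("query failed")
	}
	for _, r := range f.rows {
		if err := visit(r); err != nil {
			return err
		}
	}
	s := f.stats
	f.last = &s
	return nil
}

func (f *fakeRunner) LastStats() *QueryStats { return f.last }

// update mirrors the call order suggested for Collector.Update:
// Query first, then LastStats to feed the cost metrics.
func update(r BQRunner, query string) (rows int, billed int64, err error) {
	err = r.Query(query, func(Row) error { rows++; return nil })
	if err != nil {
		return rows, 0, err
	}
	if s := r.LastStats(); s != nil {
		billed = s.TotalBytesBilled
	}
	return rows, billed, nil
}
```

With this shape, existing callers of `Query` keep their signature, and only the code that records metrics needs to call `LastStats`.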

query/bigquery_runner.go line 31 at r7 (raw file):

	}

	if job == nil {

It should not be possible for Query.Run to return a nil err and a nil job. Please remove this check.

query/bigquery_runner.go line 38 at r7 (raw file):

	status, err := job.Wait(context.Background())
	if err != nil {
		return nil, status.Err()

status is not guaranteed to be non-nil when err != nil. Prefer simply returning err here.

query/bigquery_runner.go line 44 at r7 (raw file):

	it, err := job.Read(context.Background())

nit: please remove extra new line between job.Read and the err check.

sql/collector.go line 155 at r7 (raw file):

func setSlotMillis(ch chan<- prometheus.Metric, slotMillis int64, metricName string) {
	desc := prometheus.NewDesc("bqx_slot_seconds_utilized", "slot milliseconds utilized", []string{"filename"}, nil)

If these metrics were part of the Collector they would need to be registered in Describe, iirc.

Each set of Collector metrics may be distinct from one another. These metrics are constant for the package. So, I don't think these should be part of the Collector. These are package level metrics collected about the different queries run.

Please redefine these at the package level as standard Gauges.

.travis.yml line 14 at r6 (raw file):

- make
- go test -short -v ./... -cover=1 -coverprofile=_c.cov
- $GOPATH/bin/goveralls -service=travis-ci -coverprofile=_c.cov

If you'd be willing to change "travis-ci" to "travis-pro" we may be able to get coverage stats for this PR. (But it might not work until a new PR is created)

guerremdq added 6 commits September 22, 2023 10:38

- Add metrics for successful and failed queries

421fa95

- Add keepAlive flag to not kill the app when a query fails

add GCP Billed Bytes

952effa

Add billed bytes by BigQuery

f0491b4

fix tests

ed67ccc

Add Metrics for slot utilization and total bytes:

8d86c9d

Remove travis CI

cd06fd8

stephen-soltesz self-requested a review November 8, 2023 00:44

guerremdq and others added 3 commits November 8, 2023 10:48

fix test, change metric name and unit to pass the current test

9da0787

Add metrics and keep app alive (m-lab#50)

6f68c89

- Add metrics for successful and failed queries
- Add keepAlive flag to not kill the app when a query fails
- Add metric names prefix
- Remove duplicated metric calls
- Add extra label to update duration metric

rebase main

b6547f0

guerremdq and others added 3 commits November 8, 2023 11:11

Set same labels name for metrics

69fadbd

restore deleted file

33dab1e

Merge branch 'main' into billed-bytes

e0e62fb

stephen-soltesz requested changes Nov 13, 2023
