pmdastatsd new QA and small agent changes by Erbenos · Pull Request #836 · performancecopilot/pcp

Merged: 14 commits into performancecopilot:master on Feb 18, 2020

Conversation

@Erbenos (Collaborator) commented on Feb 6, 2020:

The new QA has better coverage than the old one and includes a Valgrind test.

pmdastatsd received small changes to the handling of double metric values and improvements to its cleanup procedures.

@Erbenos requested a review from natoscott on February 6, 2020, 22:38
# - statsd.pmda.dropped (before: 0, after: 40)
# - statsd.pmda.time_spent_parsing (before: 0, after: non-zero)
# - statsd.pmda.time_spent_aggregating (before: 0, after: non-zero)
# - statsd.pmda.metrics_tracked, with its counter, gauge, duration and total instances (before: 0, 0, 0, 0; after: 4, 4, 2, 10)
Collaborator:
I don't know how these tests work, but is this supposed to also check the before/after state?

expected_percentile90 = 18000000
expected_percentile95 = 19000000
expected_percentile99 = 19800000
expected_stddev = 5773500.278068
Collaborator:
Ah disregard, I see how this is done here.
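For readers following along, the before/after pattern resolved here is roughly the following (a sketch only: utils.print_metric and the metric name appear in the tests themselves, while the traffic-sending helper is hypothetical):

utils.print_metric("statsd.pmda.dropped")   # before: expected 0
send_unparseable_payloads()                 # hypothetical helper firing UDP datagrams at the agent
utils.print_metric("statsd.pmda.dropped")   # after: expected 40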

"cache_cleared:1|msa",
"session_started:|ms",
":20|ms"
]
Collaborator:
Nitpick: if these "bad" metric names are not needed for this test, I'd probably drop them, as they were already tested above.

Author (@Erbenos):

12.py checks that the specificity of the various verbose levels is ordered correctly. Payloads like those above would generate additional error logs.
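As a sketch of how such payloads reach the agent (the socket usage mirrors the sendto() calls elsewhere in these tests; the address and port are assumptions):

import socket

sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
for payload in ["cache_cleared:1|msa", "session_started:|ms", ":20|ms"]:
    # invalid type suffix, missing value, missing name - each should
    # produce a parse-error line in the agent's verbose log
    sock.sendto(payload.encode("utf-8"), ("127.0.0.1", 8125))  # assumed agent address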


run_test()


Collaborator:
This test only checks whether there are any errors, I assume? Would it be feasible to also check the metrics that were created?

Author (@Erbenos):

08.py checks whether the labels are OK; the payload is nearly identical to the one in 13.py.

@lzap (Collaborator) commented on Feb 11, 2020:

Tests look good.

@natoscott (Member) left a review:

Looks good - small comments follow in-line.

qa/1599 Outdated
which ruby >/dev/null 2>&1 || _notrun "ruby not installed"
which python >/dev/null 2>&1 || _notrun "python not installed"

which valgrind >/dev/null 2>&1 || _notrun "valgrind not installed"
Member (natoscott):
Switch over to '_check_valgrind' here (from qa/common.check)
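The suggested form would then be a one-liner (sketch; _check_valgrind comes from qa/common.check and, per its purpose here, _notrun's when valgrind is unavailable):

# replaces the 'which valgrind' probe above
_check_valgrind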

Author (@Erbenos), Feb 17, 2020:

Addressed by 78b11bf

if utils.check_is_in_bounds(expected_average, number_value):
status = True
elif k == "/count":
# TODO: Ask Nathan, if this is OK
Member (natoscott):

Which part? The 0.5?

Author (@Erbenos), Feb 12, 2020:

Yes, that one. Basically I am not sure how to accurately test the datagrams lost/processed ratio, so I just check whether the value is within a somewhat expected margin.

Member (natoscott):

Yes, that's fine - it's a common pattern. When writing shell tests we even have a shared _within_tolerance() function available for this purpose.
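In a shell test that pattern looks roughly like this (a sketch only; check _within_tolerance in qa/common.check for the exact calling convention before copying):

# $value holds the observed metric value, 40 is the expectation,
# and the last argument expresses the tolerance as a percentage
_within_tolerance "dropped datagrams" $value 40 50% -v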

Author (@Erbenos):

Alright, I will leave it as is and remove the TODO comment, thanks.

Author (@Erbenos), Feb 13, 2020:

I had to set up a new VM to work on this, and on it I am encountering the following phenomenon: even a 0.5 tolerance margin is not enough, particularly for the count of received UDP payloads, even after setting sysctl net.core.rmem_max and net.core.rmem_default to 26214400. This makes me worry about the reliability of such a test - it is UDP, after all. What would your approach to this be?

Member (natoscott):

Set the acceptable boundaries sufficiently wide that any reasonable scenario is catered for, and add comments to the test discussing possible causes for values outside these boundaries, in case people have to come along later and widen them further. Even with wide boundaries, it's often still a worthwhile test.
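Applied to the Python helpers used in these tests, that advice might look like this (the helper name mirrors utils.check_is_in_bounds from the tests, but this standalone version and its margin are illustrative only):

def check_is_in_bounds(expected, actual, tolerance=0.5):
    """True when actual lands within +/- tolerance * expected of expected."""
    margin = expected * tolerance
    return expected - margin <= actual <= expected + margin

# UDP is best-effort: receive-buffer overruns (net.core.rmem_max /
# net.core.rmem_default) or a slow test VM can drop datagrams, so the
# accepted window is deliberately generous.
received = 612  # e.g. a value read back from a statsd.pmda counter
assert check_is_in_bounds(1000, received)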

Author (@Erbenos):

Will do, thanks.

sock.sendto("test_gauge2:+{}|g".format(overflow_payload).encode("utf-8"), (ip, port))
utils.print_metric("statsd.pmda.dropped")
utils.print_metric("statsd.test_gauge2")
# TODO: check if this is truly the desired behavior
Member (natoscott):

Now would be a good time to check, and figure out an answer (before this is merged) - otherwise, we're all likely to forget over time.

Author (@Erbenos), Feb 12, 2020:

I actually already did the other day (we had a small chat in Slack about it), confirmed over email with Lukas that it is OK, and we resolved it. I will remove the comment in the code.
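For context, the behavior under discussion is the signed-gauge syntax; a sketch of the three forms (the address and port are assumptions, as above):

import socket

ip, port = "127.0.0.1", 8125      # assumed agent address
sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
sock.sendto(b"test_gauge2:100|g", (ip, port))  # plain value: set the gauge
sock.sendto(b"test_gauge2:+5|g", (ip, port))   # leading '+': increment
sock.sendto(b"test_gauge2:-3|g", (ip, port))   # leading '-': decrement

The test above then sends a "+{overflow_payload}" delta and checks how the agent reacts.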

$(INSTALL) -m 755 -d $(TESTDIR)
$(INSTALL) -m 644 $(RBFILES) $(TESTDIR)
# I wish I knew how to extract such listing from $(TESTFILES), not sure why this is needed yet
$(INSTALL) -m 644 src/cases/01.py $(TESTDIR)/src/cases/01.py
Member (natoscott):

In the same way TESTDIR is installed above, each new directory used also needs to be $(INSTALL)'d ... otherwise the packaging ends up slightly incorrect (nothing 'owns' a directory if it's not explicitly installed).
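Concretely, the pattern being asked for is (sketch):

# install the directory first, so the package records an owner for it ...
$(INSTALL) -m 755 -d $(TESTDIR)/src/cases
# ... and only then the files within it
$(INSTALL) -m 644 src/cases/01.py $(TESTDIR)/src/cases/01.py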

Author (@Erbenos):
The reason will be displayed to describe this comment to others. Learn more.

I added GNUmakefiles to each qa/statsd dir in 78b11bf (as inspired by qa/slurm). I guess GNUmakefile.install is for post-install steps once all test files are $(INSTALL)-ed? I will leave only the default setup/install/clean/check targets in them, since Python is interpreted.

Member (natoscott):

> I added GNUmakefiles to each qa/statsd dir in 78b11bf (as inspired by qa/slurm).

Perfect.

> I guess GNUmakefile.install is for post-install steps once all test files are $(INSTALL)-ed?

This is the makefile that will ship in the pcp-testsuite (rpm/deb) package and is used when the tests are run from /var/lib/pcp/testsuite.

$(INSTALL) -m 644 src/cases/13.py $(TESTDIR)/src/cases/13.py
$(INSTALL) -m 644 src/cases/14.py $(TESTDIR)/src/cases/14.py
$(INSTALL) -m 644 src/cases/15.py $(TESTDIR)/src/cases/15.py
$(INSTALL) -m 644 src/configs/complex/0/pmdastatsd.ini $(TESTDIR)/src/configs/complex/0/pmdastatsd.ini
Member (natoscott):

Likewise for each of these new ini file dirs :|

Since these configs are so small, I'd consider generating them at runtime instead of committing them as individual files in the repo.
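A minimal sketch of that approach - the option names here are placeholders, not the actual pmdastatsd.ini schema:

import os, tempfile

options = {"verbose": "2", "port": "8125"}        # illustrative keys only
path = os.path.join(tempfile.mkdtemp(), "pmdastatsd.ini")
with open(path, "w") as f:
    for key, value in options.items():
        f.write("{} = {}\n".format(key, value))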

@@ -15,8 +15,9 @@ echo "QA output created by $seq"

test -e $PCP_PMDAS_DIR/statsd/pmdastatsd || _notrun "statsd PMDA not installed"

Member (natoscott):

The comment at the start of this file referring to 'ruby' needs updating.

@@ -15,8 +15,9 @@ echo "QA output created by $seq"

Member (natoscott):

Consider replacing the above three lines with common.python now.

qa/1599 Outdated
@@ -15,8 +15,9 @@ echo "QA output created by $seq"

test -e $PCP_PMDAS_DIR/statsd/pmdastatsd || _notrun "statsd PMDA not installed"

# NOTE: Miroslav is planning to re-work this in python/shell
which ruby >/dev/null 2>&1 || _notrun "ruby not installed"
which python >/dev/null 2>&1 || _notrun "python not installed"
Member (natoscott):

common.python will have checked this (python not installed) for you already.

qa/1599 Outdated
for script in $scripts
do
ruby $script
python $script $here/statsd/output
Member (natoscott):

Using common.python, this should then be $python rather than just python.

This lets us test both python2 and python3 more easily (we prefer python3 - whenever it is available - over python2).
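Putting both suggestions together, the loop might end up as follows (a sketch; the ruby invocation is left alone since the note above says it is being reworked separately):

. ./common.python    # sets $python (python3 when available) and _notrun's otherwise
for script in $scripts
do
    ruby $script
    $python $script $here/statsd/output
done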

@Erbenos (author) commented on Feb 12, 2020:

Thank you for your review so far; I will address the issues you brought up.

…s, hardcoded various test configs to replace static files, added _check_valgrind check, removed some TODOs, updated the comment at the start of 1599, removed the python check as common.python checks it, cleaned up the debug_output_filename output file's output to be more syntactically consistent, disabled the 13.py test case
@Erbenos (author) commented on Feb 17, 2020:

The commit above addressed the issues mentioned, but it also disabled 13.py, as I noticed the test case is not deterministic and its output will need further filtering/processing in Python.

@natoscott merged commit 78b11bf into performancecopilot:master on Feb 18, 2020