2362 bb get agent logs #2384

mssalvatore · 2022-10-02T19:31:32Z

What does this PR do?

Download agent logs from new endpoint
Fixes #2362

Builds https://jenkins.guardi/view/Monkey/job/run_appimage_ete/730/ and https://jenkins.guardi/view/Monkey/job/run_msi_ete/603/ contain logs.

PR Checklist

Have you added an explanation of what your changes do and why you'd like to include them?
Is the TravisCI build passing?
~~Was the CHANGELOG.md updated to reflect the changes?~~
~~Was the documentation framework updated to reflect the changes?~~
Have you checked that you haven't introduced any duplicate code?

Testing Checklist

~~Added relevant unit tests?~~
Have you successfully tested your changes locally? Elaborate:

Tested by running ETE tests from Jenkins and examining the log archive.
~~If applicable, add screenshots or log transcripts of the feature working~~

Reduces time to download logs by approx. 40%, but may be unnecessary after resolving #2383
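
As a rough illustration of the threaded download mentioned above, a minimal sketch using `concurrent.futures.ThreadPoolExecutor` might look like the following. `download_logs_concurrently`, `fake_fetch_log`, and the agent IDs are hypothetical stand-ins for the downloader and `MonkeyIslandClient.get_agent_log()`; the actual commit may use a different threading primitive.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, Iterable


def download_logs_concurrently(
    agent_ids: Iterable[str],
    fetch_log: Callable[[str], str],
    max_workers: int = 8,
) -> Dict[str, str]:
    """Fetch each agent's log on a worker thread and return {agent_id: log}."""
    ids = list(agent_ids)
    with ThreadPoolExecutor(max_workers=max_workers) as executor:
        # executor.map() yields results in input order, so zip() pairs them correctly.
        logs = list(executor.map(fetch_log, ids))
    return dict(zip(ids, logs))


def fake_fetch_log(agent_id: str) -> str:
    # Hypothetical stand-in for an HTTP request to the Island.
    return f"log contents for {agent_id}"


downloaded = download_logs_concurrently(["agent-a", "agent-b"], fake_fetch_log)
```

Because each download is I/O-bound, threads can overlap the waiting time even under the GIL, which is presumably where the reported speedup comes from.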


VakarisZ · 2022-10-03T08:10:29Z

envs/monkey_zoo/blackbox/log_handlers/monkey_logs_downloader.py

+
+    def _get_log_file_path(self, agent: Agent, machines: Mapping[MachineID, Machine]) -> Path:
+        try:
+            machine_ip = machines[agent.machine_id].network_interfaces[0].ip


It would be best if we could somehow distinguish between the IP that got scanned and internal IPs. The map, logs, etc. should use the IP that the agent found in the network.
Imagine you are running a fully remote company. All developers use remote desktop to access their dev environments. All of these servers are created from the same base image, which contains VirtualBox. Now you decide to scan this network, and all machines are vulnerable to the same SSH credentials. But when it comes to the map, all machines are displayed with the same IP address, because VirtualBox has an adapter and all machines were spawned with the same adapter IP (because they all come from the same base image). I wonder what's the best way to differentiate the IP that got scanned from other, internal IP addresses that are less relevant to the display. Putting it at the start of the list is not explicit.

Agreed. This is a hard problem and I'm not sure how to solve it yet. To the best of my knowledge, v1.13.0 and prior all have this issue, so doing this doesn't add a new bug/deficiency.

The way to do it with our current API is to get the exploitation/scan events that target this machine and take the IP from the target. There are 2 problems: 1. Machines might have the same IPs that the agent scans. 2. We use irrelevant IPs gathered from machines.
A machine should be identified by the IP that got scanned, not by some arbitrary IP that it has.

But since this is for the zoo, where machines are clean, I don't think it matters yet.
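
To make the event-based idea above concrete, here is a minimal sketch that picks the IP a machine was most recently reached on from a list of scan events. `ScanEvent`, its fields, and `get_scanned_ip` are hypothetical and only illustrate the approach; the real Island event models may look quite different, and this does not address the duplicate-IP problem raised above.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class ScanEvent:
    # Hypothetical event shape, not the Island's actual model.
    target_machine_id: int
    target_ip: str
    timestamp: float


def get_scanned_ip(machine_id: int, events: Iterable[ScanEvent]) -> Optional[str]:
    """Return the IP most recently used to reach the machine, or None."""
    matching = [event for event in events if event.target_machine_id == machine_id]
    if not matching:
        return None
    # Prefer the latest event in case the machine was reached on several IPs.
    return max(matching, key=lambda event: event.timestamp).target_ip
```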

VakarisZ · 2022-10-03T08:14:24Z

envs/monkey_zoo/blackbox/log_handlers/monkey_logs_downloader.py

+        try:
+            machine_ip = machines[agent.machine_id].network_interfaces[0].ip
+        except IndexError:
+            machine_ip = "UNKNOWN"


How is this possible? How did the agent communicate with the island if it doesn't have an IP address? If it was launched on the island, then how did the downloader access it? I think if this happens, then it's a bug we want to know about instantly, not when we check the logs and see a bunch of ...UNKNOWN... files.

Other than logging an error (which I can do), I'm not sure what this component can do about that.

If the machine has no IP, then the test will not be able to determine that it communicated back, so the test will fail.

This component probably shouldn't extract the IP in the first place. But the more important thing is to allow this component to crash and burn rather than silently writing UNKNOWN files, when they are probably the result of some error.

It's not silent. It logs an error and the test will fail, but it's not this component's responsibility to pass or fail the test.
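
The compromise discussed in this thread (fall back to `UNKNOWN`, but log an error so the problem is visible) could be sketched roughly as follows. `Machine` here is a simplified, hypothetical stand-in that stores IPs as plain strings, and `get_machine_ip` is an illustrative helper, not the code that was merged.

```python
import logging
from dataclasses import dataclass, field
from typing import List, Mapping

logger = logging.getLogger(__name__)


@dataclass
class Machine:
    # Simplified stand-in: the real model holds interface objects with an `ip` attribute.
    id: int
    network_interfaces: List[str] = field(default_factory=list)


def get_machine_ip(machine_id: int, machines: Mapping[int, Machine]) -> str:
    """Return the machine's first interface IP, or "UNKNOWN" after logging an error."""
    try:
        return machines[machine_id].network_interfaces[0]
    except IndexError:
        # Surface the anomaly in the logs instead of failing silently.
        logger.error("Machine %d has no network interfaces", machine_id)
        return "UNKNOWN"
```

A stricter variant would re-raise instead of returning `"UNKNOWN"`, which matches the "crash and burn" preference voiced above.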

envs/monkey_zoo/blackbox/log_handlers/monkey_logs_downloader.py

monkey/monkey_island/cc/resources/agent_logs.py

VakarisZ

Minor fixes required

ilija-lazoroski · 2022-10-03T08:24:54Z

envs/monkey_zoo/blackbox/log_handlers/monkey_logs_downloader.py

+
+        start_time = agent.start_time.strftime("%Y-%m-%d-%H-%M-%S")
+
+        return self.log_dir_path / f"agent-{start_time}-{machine_ip}.log"


Maybe we can move the agent log name to a const?

It's not a constant, it's an f-string. It'll change for each agent for each run.

I think @ilija-lazoroski meant to move AGENT_PREFIX = "agent", and then f"{AGENT_PREFIX}-{start_time}-{machine_ip}.log". Not sure it's that necessary though.
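
For reference, the suggestion would amount to something like the sketch below. `AGENT_LOG_PREFIX`, `TIMESTAMP_FORMAT`, and `build_log_file_path` are hypothetical names, and the file name pattern is the one shown in the diff above, which a later commit in this PR changed.

```python
from datetime import datetime
from pathlib import Path

AGENT_LOG_PREFIX = "agent"  # hypothetical constant name
TIMESTAMP_FORMAT = "%Y-%m-%d-%H-%M-%S"


def build_log_file_path(log_dir: Path, start_time: datetime, machine_ip: str) -> Path:
    """Build a per-agent log path from the agent's start time and machine IP."""
    timestamp = start_time.strftime(TIMESTAMP_FORMAT)
    return log_dir / f"{AGENT_LOG_PREFIX}-{timestamp}-{machine_ip}.log"
```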

ilija-lazoroski · 2022-10-03T08:40:45Z

envs/monkey_zoo/blackbox/log_handlers/monkey_logs_downloader.py

+        log_file_path = self._get_log_file_path(agent, machines)
+        log_contents = self.island_client.get_agent_log(agent.id)
+
+        MonkeyLogsDownloader._write_log_to_file(log_file_path, log_contents)
+
+        self.monkey_log_paths.append(log_file_path)


Zerologon agent logs are empty (as are those of some other exploiters). We should not be outputting empty log files. We should probably check whether we got any log at all.

Sounds like a bug, but a different bug. Create an issue.
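
If such a check were added to the downloader, a minimal sketch might look like this. `write_log_if_not_empty` is a hypothetical helper, not part of this PR, and whether whitespace-only logs should count as empty is an assumption.

```python
from pathlib import Path
from typing import Optional


def write_log_if_not_empty(log_file_path: Path, log_contents: str) -> Optional[Path]:
    """Write the log and return its path, or return None if there is nothing to write."""
    # Assumption: a log containing only whitespace is treated as empty.
    if not log_contents.strip():
        return None
    log_file_path.write_text(log_contents)
    return log_file_path
```

The caller would then append the returned path to its list of log paths only when it is not `None`.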


codecov · 2022-10-03T11:57:56Z

Codecov Report

Base: 61.15% // Head: 61.20% // Increases project coverage by +0.04% 🎉

Coverage data is based on head (cfd49db) compared to base (de435e2).
Patch coverage: 100.00% of modified lines in pull request are covered.

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #2384      +/-   ##
===========================================
+ Coverage    61.15%   61.20%   +0.04%     
===========================================
  Files          550      550              
  Lines        14399    14410      +11     
===========================================
+ Hits          8806     8819      +13     
+ Misses        5593     5591       -2

Impacted Files	Coverage Δ
monkey/monkey_island/cc/resources/agent_logs.py	`100.00% <100.00%> (ø)`
.../cc/agent_event_handlers/handle_ping_scan_event.py
...land/cc/agent_event_handlers/scan_event_handler.py	`100.00% <0.00%> (ø)`
...key/monkey_island/cc/setup/agent_event_handlers.py	`42.10% <0.00%> (+4.01%)`	⬆️


mssalvatore added 6 commits October 1, 2022 19:21

BB: Add a TODO about parse_log()

3db3df8

BB: Add MonkeyIslandClient.get_agents()

b335601

BB: Add MonkeyIslandClient.get_machines()

99c2c5c

BB: Add MonkeyIslandClient.get_agent_log()

c706466

Island: Return empty str, not dict on 404 in AgentLogs.get()

07a6f49

BB: Download agent logs from new endpoints

e415564

mssalvatore requested review from cakekoa, ilija-lazoroski, VakarisZ and shreyamalviya October 2, 2022 19:31

mssalvatore added 2 commits October 2, 2022 16:57

BB: Use threading to download logs

6a783d9

Reduces time to download logs by approx. 40%, but may be unnecessary after resolving #2383

BB: Remove disused MonkeyLog

e369ef2

mssalvatore force-pushed the 2362-bb-get-agent-logs branch from 99d9052 to e369ef2 October 2, 2022 20:58

VakarisZ reviewed Oct 3, 2022

envs/monkey_zoo/blackbox/log_handlers/monkey_logs_downloader.py

VakarisZ reviewed Oct 3, 2022

envs/monkey_zoo/blackbox/log_handlers/monkey_logs_downloader.py

VakarisZ reviewed Oct 3, 2022

envs/monkey_zoo/blackbox/log_handlers/monkey_logs_downloader.py Outdated

VakarisZ reviewed Oct 3, 2022

monkey/monkey_island/cc/resources/agent_logs.py Outdated

VakarisZ requested changes Oct 3, 2022

ilija-lazoroski reviewed Oct 3, 2022

mssalvatore added 3 commits October 3, 2022 07:51

BB: Change agent log file name

fc24d80

Use underscores to improve readability

BB: Remove disused MonkeyIslandClient.find_log_in_db()

477e80b

BB: Add type hints to MonkeyLogsDownloader.__init__()

378e8d5

mssalvatore added 2 commits October 3, 2022 10:14

BB: Add error message when machine is missing interfaces

d922d71

Island: Use logger.exception()

cfd49db

VakarisZ approved these changes Oct 3, 2022


mssalvatore merged commit eb16969 into develop Oct 3, 2022

mssalvatore deleted the 2362-bb-get-agent-logs branch October 3, 2022 14:42
