Hatchet multi-process problem when handling large graphs

@slabasan

Problem

The following code hangs indefinitely when g_range.filter runs with the default number of processes. With num_procs=1, however, g_range.filter returns reasonable results.

Code

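import json
import hatchet

# Load the literal graph from the attached database file.
with open('db-range.json', 'r') as f:
    g_range = hatchet.GraphFrame.from_literal(json.load(f))
g_range.show_metric_columns()
# This call hangs with the default number of processes.
g_range_filtered = g_range.filter(lambda x: x['GINS (I)'] > 0.0)
print(g_range_filtered.tree('GINS (E)'))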

Tested with hatchet 0283a68 and Python 3.8.5.
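
For reference, the num_procs=1 variant mentioned under Problem could be written as in the sketch below. The helper name filter_single_process is hypothetical and only wraps the existing filter call; it assumes the GraphFrame object accepts the num_procs keyword, as described above.

```python
def filter_single_process(graphframe, predicate):
    """Run graphframe.filter in a single process.

    Hypothetical helper: it only forwards to the object's own filter
    method with num_procs=1, the setting reported to return results.
    """
    return graphframe.filter(predicate, num_procs=1)


# Usage with the objects from the reproduction above (not executed here):
# g_range_filtered = filter_single_process(
#     g_range, lambda x: x['GINS (I)'] > 0.0)
```

This only sidesteps the hang; it does not explain why the multi-process path stalls on this database.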

Please rename db-range.txt to db-range.json before running the code.

db-range.txt

@slabasan Could you take a look? It's a database generated by our new hpcprof implementation.
