[data/preprocessors] Support flattening vector features in concatenator · Issue #51757 · ray-project/ray · GitHub
[data/preprocessors] Support flattening vector features in concatenator #51757
Closed
@rclough

Description

When you use Concatenator in combination with preprocessors that create vector feature columns (such as OneHotEncoder or MultiHotEncoder), the output of Concatenator is not flattened. This is arguably correct behavior, but it is not really documented. However, the goal of Concatenator is typically to provide tensor inputs to models, which in many cases expect flat tensors of floats.

Based on offline discussions in the Ray Slack, I'd like to propose supporting a flatten flag for the Concatenator that optionally flattens any vector columns in place within the output vector. I will follow up soon with an implementation and tests in a PR.
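
To make the difference concrete, here is a minimal pure-Python sketch of the two behaviors. It is not Ray's Concatenator code: `concatenate_row`, its `flatten` argument, and the sample row are hypothetical names chosen for illustration, and the row is only an assumption about what data might look like after one-hot encoding a column.

```python
from typing import Any, Dict, List, Sequence


def concatenate_row(
    row: Dict[str, Any], columns: Sequence[str], flatten: bool = False
) -> List[Any]:
    """Combine the selected columns of one row into a single output list.

    Hypothetical helper for illustration; not Ray's Concatenator code.
    """
    output: List[Any] = []
    for name in columns:
        value = row[name]
        is_vector = isinstance(value, (list, tuple))
        if flatten and is_vector:
            # Splice the vector's elements in place, preserving column order.
            output.extend(float(v) for v in value)
        elif is_vector:
            # Without flattening, the vector stays a nested element.
            output.append(list(value))
        else:
            output.append(float(value))
    return output


# One row as it might look after one-hot encoding a "color" column (assumed data).
row = {"age": 31, "color": [0, 1, 0], "income": 52.5}
columns = ["age", "color", "income"]

nested = concatenate_row(row, columns)
flat = concatenate_row(row, columns, flatten=True)
print(nested)
print(flat)
```

With `flatten=False` the encoded vector remains a nested element of the output, while `flatten=True` yields a single flat list of floats in column order, which is the shape most models expect as input.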

Use case

When using encoder preprocessors that output a vector column, we want to flatten the columns in the final concatenate step for input to the model.

Metadata

Assignees

No one assigned

    Labels

    P1: Issue that should be fixed within a few weeks
    bug: Something that is supposed to be working; but isn't
    community-backlog
    data: Ray Data-related issues

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests
