Open
Description
Description
Right now, with "strict mode" enabled by default, users need to care about using the right schema when they passing in functions into map
or map_batches
Ideally, we should infer the schema (I guess we already do that in ray.data.read_xxx
calls). Another reference is Dask DataFrame: https://docs.dask.org/en/stable/generated/dask.dataframe.DataFrame.apply.html
Use case
This will simplify users' mental model in doing batch processing with Ray Data