-
Notifications
You must be signed in to change notification settings - Fork 6.5k
Insights: ray-project/ray
Overview
Could not load contribution data
Please try again later
90 Pull requests merged by 43 people
-
[ci] add cibase tags for ci base envs
#53755 merged
Jun 27, 2025 -
Remove
botocore
dependency in Ray Serve LLM#54156 merged
Jun 27, 2025 -
(serve.llm) Remove test leakage from placement bundle logic
#53723 merged
Jun 27, 2025 -
[data] split dask and modin tests
#54122 merged
Jun 26, 2025 -
[Data] Fixing PyArrow overflow handling
#53971 merged
Jun 26, 2025 -
[serve] split call_user_method
#54104 merged
Jun 26, 2025 -
[Data] Handle Huggingface Integration CI test failures
#54128 merged
Jun 26, 2025 -
[Data] Fix ActorPool autoscaler to properly scale up
#53983 merged
Jun 26, 2025 -
use gtm datalayer directly, fix format
#54144 merged
Jun 26, 2025 -
[package] remove
__api__
insetup.py
#54143 merged
Jun 26, 2025 -
[Minor][Fix][Core/Test] Fix test_actor_restart_on_node_failure wrong test logic without waiting
#54088 merged
Jun 26, 2025 -
[data] fix repartitioning empty datasets
#54107 merged
Jun 26, 2025 -
[Doc][KubeRay] revert kuberay-gcs-ft.ipynb to markdown
#54084 merged
Jun 26, 2025 -
Fix sort_benchmark release test arg
#54145 merged
Jun 26, 2025 -
[Doc][KubeRay] Convert rayjob-quick-start.ipynb back to markdown docs
#54093 merged
Jun 26, 2025 -
[core] split dask and modin tests
#54121 merged
Jun 26, 2025 -
[Core] Remove Unnecessary Checks in GRPC Server Shutdown Process
#53910 merged
Jun 26, 2025 -
[core] Delete unused env vars
#54095 merged
Jun 26, 2025 -
[Doc][KubeRay] Remove
rayserve-dev-doc.md
#54057 merged
Jun 26, 2025 -
[core] Bump timeout in
test_ray_init
#54136 merged
Jun 26, 2025 -
[core] Clean up unused FFs
#54139 merged
Jun 26, 2025 -
[core] Fix GCS crash on duplicate MarkJobFinished RPCs due to network failures
#53951 merged
Jun 26, 2025 -
[train] Remove usage of
ray._private.state
#54142 merged
Jun 26, 2025 -
[core] Deflake
test_scheduling.py
in client mode#54137 merged
Jun 26, 2025 -
[core] Fix
test_basic_3.py
in client mode#54135 merged
Jun 26, 2025 -
[serve] refactor _run_user_code
#54103 merged
Jun 26, 2025 -
[Doc] vale ignores anchors of headers
#53580 merged
Jun 26, 2025 -
set config for ua tag
#54112 merged
Jun 26, 2025 -
[Serve.llm] Add a doc snippet to inform users about existing diffs between vllm serve and ray serve llm.
#54042 merged
Jun 26, 2025 -
[ci][docs] Add test tag rule for Vale files
#54118 merged
Jun 26, 2025 -
[train] update beginner pytorch example
#54124 merged
Jun 26, 2025 -
[Data] Bumped latest PA version to 20.0
#54123 merged
Jun 26, 2025 -
[ci] fix missing
dask
tag in all tags list#54113 merged
Jun 26, 2025 -
[core][test] fix data races in NodeManagerTest
#54097 merged
Jun 25, 2025 -
[core] Remove experimental "array" library
#54105 merged
Jun 25, 2025 -
[core] Clean up
test_locality_aware_leasing_borrowed_objects
#54086 merged
Jun 25, 2025 -
[core][refactor] replace unnecessary shared_ptrs with unique_ptrs and references in raylet
#54062 merged
Jun 25, 2025 -
[ci] fix mac ci by pinning cython version
#54061 merged
Jun 25, 2025 -
[core] Deflake
test_basic_3.py
#54083 merged
Jun 25, 2025 -
remove final references to plasma_event_handler
#54085 merged
Jun 25, 2025 -
[core] Deflake
test_ray_init
#54094 merged
Jun 25, 2025 -
[core] Deflake
test_actor_restart
#54087 merged
Jun 25, 2025 -
Updated stalebot to run every 12 hours.
#54041 merged
Jun 25, 2025 -
[serve] Prefer localhost instead of host ip for microbenchmarks
#54092 merged
Jun 25, 2025 -
[train] Driver SIGINT calls controller abort
#53600 merged
Jun 25, 2025 -
[data] Split out long running scaling test
#54045 merged
Jun 25, 2025 -
[core] Deflake
test_actor_unavailable_conn_broken
#54090 merged
Jun 25, 2025 -
[V2][Autoscaler] Fix
numOfHosts
> 1 slice termination logic#54063 merged
Jun 25, 2025 -
[V2][Autoscaler] Add
cloud_instance_id
to all V2 Austoscaler termination requests#53938 merged
Jun 25, 2025 -
Fix autoscaler recovery docker config to use node-specific settings
#53992 merged
Jun 25, 2025 -
[data/preprocessors] Improve execution perf for One Hot encoding
#54022 merged
Jun 25, 2025 -
[Docs][KubeRay] Update changes from KubeRay 1.3.2 to 1.4.0
#53886 merged
Jun 25, 2025 -
[core] Fix comment
#53853 merged
Jun 25, 2025 -
[ci] add
-sSL
for curl on node install#54060 merged
Jun 25, 2025 -
updating compile comment
#54058 merged
Jun 25, 2025 -
Revert "remove extraneous index.rst file for e2e examples (part 2)"
#54051 merged
Jun 25, 2025 -
[data] fix lint error in conftest.py
#54053 merged
Jun 25, 2025 -
[serve] Use
get_application_url
in test_metrics#54050 merged
Jun 24, 2025 -
[ci] update anyscale layer
#54043 merged
Jun 24, 2025 -
[serve.llm] Prefix aware router eviction thread improvements
#53957 merged
Jun 24, 2025 -
[serve] Remove hardcoded urls from serve microbenchmarks
#54026 merged
Jun 24, 2025 -
[core] fix detached actor being unexpectedly killed
#53562 merged
Jun 24, 2025 -
[POC] fix test_metrics
#54037 merged
Jun 24, 2025 -
[serve] Handle request with Semaphore
#54019 merged
Jun 24, 2025 -
remove extraneous index.rst file for e2e examples (part 2)
#54023 merged
Jun 24, 2025 -
[☀️] Fix repr for ray.ObjectRef, ray.ObjectRefGenerator types
#54011 merged
Jun 24, 2025 -
[core][ci] Disable test db for container tests
#54031 merged
Jun 24, 2025 -
[docker] Update latest Docker dependencies for 2.47.1 release
#54016 merged
Jun 23, 2025 -
[core] improve assertion check in test_task_metrics
#53958 merged
Jun 23, 2025 -
remove extraneous index.rst file for e2e-multimodal-ai-workloads
#54017 merged
Jun 23, 2025 -
[Serve.llm] Remove ImageRetriever class and related tests from the LLM deployment module.
#53980 merged
Jun 23, 2025 -
fix test_request_timeout timeout mismatch issue
#54010 merged
Jun 23, 2025 -
fix gsat global
#54012 merged
Jun 23, 2025 -
[train] Fix release test missing data key
#53963 merged
Jun 23, 2025 -
[data] remove schema from release tests
#53956 merged
Jun 23, 2025 -
[kuberay] log actionable err msg when required TPU node selectors missing
#53914 merged
Jun 23, 2025 -
[core] Fix flaky
test_state_api
#53975 merged
Jun 23, 2025 -
[data] remove operator_fusion_benchmark
#53962 merged
Jun 23, 2025 -
[Data] Add reading from Delta Lake tables and from Unity Catalog
#53701 merged
Jun 23, 2025 -
test: refactor
test_observability_helpers
#53875 merged
Jun 23, 2025 -
[core] Remove actor task path in normal task submitter
#53996 merged
Jun 23, 2025 -
[core] Rename
GcsFunctionManager
and use fake in test#53973 merged
Jun 23, 2025 -
[Serve.llm][P/D] Fix health check in prefill disagg
#53937 merged
Jun 22, 2025 -
[Test][KubeRay] Update KubeRay version to v1.4.0 for autoscaler tests
#53974 merged
Jun 22, 2025 -
[core] Fix ActorClass.remote return typing and expose Actor class methods to static analysis
#53986 merged
Jun 21, 2025 -
[core] Use core worker client pool in GCS
#53654 merged
Jun 21, 2025 -
[core] Revert container tests to medium size instance
#53966 merged
Jun 21, 2025 -
Fix ray import error when both ROCR_VISIBLE_DEVICES and HIP_VISIBLE_DEVICES are set
#53757 merged
Jun 20, 2025 -
[core] Making NodeManager use ILocalTaskManager instead of TaskManager.
#53961 merged
Jun 20, 2025
58 Pull requests opened by 43 people
-
docs(data): fix broken Parameters table
#53972 opened
Jun 20, 2025 -
Feature/sac discrete
#53982 opened
Jun 20, 2025 -
[CI][KubeRay] Update KubeRay CI Tests branch for KubeRay v1.4.0 release
#53984 opened
Jun 21, 2025 -
[Core] Add AcceleratorManager implementation for Rebellions NPU
#53985 opened
Jun 21, 2025 -
[Doc] Update Istio service mesh graph
#53988 opened
Jun 21, 2025 -
[Serve] Make replica scheduler backoff configurable #52871
#53991 opened
Jun 21, 2025 -
[core] Recover intermediate objects if needed while generator running
#53999 opened
Jun 22, 2025 -
[Data] Add TooManyRequests catch to BQ writer
#54000 opened
Jun 23, 2025 -
[ci][core] Fix timeouts in `test_scheduling` when run in debug mode
#54003 opened
Jun 23, 2025 -
Fixes default_dqn_torch_rl_module assuming the device is 'cpu'
#54004 opened
Jun 23, 2025 -
[RLlib] Fix shapes in `explained_variance` for recurrent policies.
#54005 opened
Jun 23, 2025 -
Added openssl support for PPC64LE.
#54006 opened
Jun 23, 2025 -
[dashboard] Clean up naming for GPU profiling module
#54009 opened
Jun 23, 2025 -
[docker] Update latest Docker dependencies for 2.47.1 release
#54015 opened
Jun 23, 2025 -
[core] test out wait_for_condition exceptions
#54018 opened
Jun 23, 2025 -
[DONOTMERGE] Proof-of-concept for GPU objects + NIXL
#54024 opened
Jun 24, 2025 -
Bump mlflow from 2.19.0 to 3.1.0 in /doc/source/ray-overview/examples/e2e-xgboost
#54027 opened
Jun 24, 2025 -
[Data] Fix `test_binary` setup fixture that doesn't close file handles
#54028 opened
Jun 24, 2025 -
Multimodal ai
#54029 opened
Jun 24, 2025 -
[core][autoscaler][v1] add heartbeat timeout logic to determine node activity status
#54030 opened
Jun 24, 2025 -
Bump mlflow from 2.22.0 to 3.1.0 in /python
#54032 opened
Jun 24, 2025 -
[core] Delete asyncio actor logic in in-order scheduling code
#54033 opened
Jun 24, 2025 -
[core] Don't order retries at all for in-order actors
#54034 opened
Jun 24, 2025 -
gen test
#54046 opened
Jun 24, 2025 -
[DNR]
#54048 opened
Jun 24, 2025 -
update all 'Run on Anyscale' buttons to redirect to respective template preview pages
#54049 opened
Jun 24, 2025 -
[data] Use `write_dataset` for partitioning & writing to file instead of custom implementation
#54052 opened
Jun 24, 2025 -
Add Azure Files support to persistent storage documentation
#54055 opened
Jun 24, 2025 -
[train] Add broadcast_from_rank_zero and barrier collectives
#54066 opened
Jun 25, 2025 -
[core][refactor] move NodeManager::KillWorker to WorkerInterface::Kill for better testability
#54068 opened
Jun 25, 2025 -
[RLlib] Fix env runners not being marked healthy if there is no local env runner
#54071 opened
Jun 25, 2025 -
[ci] remove `ci/keep_alive`
#54079 opened
Jun 25, 2025 -
[Docs][KubeRay] Delete KubeRay doctests
#54080 opened
Jun 25, 2025 -
[RLlib] Bug fix: Failed EnvRunners are not restored if there is no local EnvRunner.
#54091 opened
Jun 25, 2025 -
Adapt to vLLM reducing exports from the top level
#54099 opened
Jun 25, 2025 -
Feat/middleware callback support
#54106 opened
Jun 25, 2025 -
Adapt Dask on Ray to the new Dask Task class
#54108 opened
Jun 25, 2025 -
[data] Remove asserts that test internal `ds._block_num_rows()`
#54109 opened
Jun 25, 2025 -
[Doc] Convert configuring-autoscaling.ipynb back to markdown docs
#54111 opened
Jun 25, 2025 -
[Doc][KubeRay] verl example
#54114 opened
Jun 25, 2025 -
vLLM ZMQ KVEvent Router
#54115 opened
Jun 25, 2025 -
[core] Fix "Check failed: it->second.num_retries_left == -1"
#54116 opened
Jun 25, 2025 -
[core][cgraph] Export classes related to NCCL communicator
#54117 opened
Jun 26, 2025 -
[Doc][KubeRay] Convert raycluster-quick-start.ipynb back to markdown docs
#54125 opened
Jun 26, 2025 -
[serve] Increase default uvicorn keep alive timeout
#54127 opened
Jun 26, 2025 -
[core][test] fix flaky data races in NodeManagerTest
#54129 opened
Jun 26, 2025 -
[Docs][KubeRay] Convert rayservice-quick-start.ipynb back to markdown docs
#54138 opened
Jun 26, 2025 -
[serve] Remove usage of `ray._private.state`
#54140 opened
Jun 26, 2025 -
[core] fix checking for uv existence during ray_runtime setup
#54141 opened
Jun 26, 2025 -
[data] Handle HuggingFace parquet dataset resolve URLs
#54146 opened
Jun 26, 2025 -
[DO NOT MERGE] [RLlib] Fix checkpoints not having correct num_env_steps_sampled_lifetime
#54148 opened
Jun 26, 2025 -
[core] Deflake `test_spread_scheduling_overrides_locality_aware_scheduling`
#54154 opened
Jun 26, 2025 -
[data] Add timeout for `test_arrow_block_scaling.py`
#54155 opened
Jun 26, 2025 -
Correct asyncio ref documentation for Python 3.11+
#54157 opened
Jun 26, 2025 -
[Data] Fix examples in some Data user guides
#54158 opened
Jun 27, 2025 -
Revert "[core][refactor] replace unnecessary shared_ptrs with unique_ptrs and references in raylet (#54062)"
#54159 opened
Jun 27, 2025 -
[doc] fix broken links in the vllm guide
#54161 opened
Jun 27, 2025 -
[data] gather dask tests into single test files
#54163 opened
Jun 27, 2025
58 Issues closed by 25 people
-
CI test linux://python/ray/data:test_block_sizing is consistently_failing
#54164 closed
Jun 27, 2025 -
[Core] Exiting because this node manager has mistakenly been marked as dead by the GCS
#54035 closed
Jun 27, 2025 -
CI test windows://python/ray/serve/tests:test_logging is consistently_failing
#46043 closed
Jun 27, 2025 -
[bug][serve.llm] AssertionError: failed to get the hash of the compiled graph (VLM, batch, TP=2)
#53824 closed
Jun 27, 2025 -
[Serve, LLM] missing botocore dependency!
#53052 closed
Jun 27, 2025 -
Error Handling Large Pyarrow Chunk
#53536 closed
Jun 26, 2025
10000
-
CI test linux://python/ray/train/v2:test_controller is consistently_failing
#54147 closed
Jun 26, 2025 -
[Serve][LLM] Qwen3 models “enable_thinking: False” still returns thinking process
#52979 closed
Jun 26, 2025 -
[Core] Ray fails to fulfill request due to node being annotated by IP address
#54150 closed
Jun 26, 2025 -
[Docs][KubeRay] Convert kuberay-gcs-ft.ipynb back to markdown docs
#54078 closed
Jun 26, 2025 -
[Docs][KubeRay] Convert rayjob-quick-start.ipynb back to markdown docs
#54075 closed
Jun 26, 2025 -
CI test darwin://python/ray/tests:test_basic_3_client_mode is consistently_failing
#54126 closed
Jun 26, 2025 -
CI test windows://python/ray/tests:test_basic_3_client_mode is consistently_failing
#54132 closed
Jun 26, 2025 -
[Core] Transient network failure on RPC `MarkJobFinished` causes node crash
#53645 closed
Jun 26, 2025 -
CI test linux://python/ray/tests:test_basic_3_client_mode is consistently_failing
#54119 closed
Jun 26, 2025 -
[Doc] The anchors of headers doesn't follow Vale rules.
#53516 closed
Jun 26, 2025 -
CI test linux://:local_object_manager_test is flaky
#54130 closed
Jun 26, 2025 -
[Core] Could not connect to socket
#54067 closed
Jun 26, 2025 -
[core] TSAN failing on `node_manager_test`
#54096 closed
Jun 25, 2025 -
CI test windows://python/ray/tests:test_object_store_metrics is consistently_failing
#49514 closed
Jun 25, 2025 -
CI test linux://python/ray/data:test_arrow_block is flaky
#48859 closed
Jun 25, 2025 -
CI test linux://python/ray/data:test_huggingface is consistently_failing
#44516 closed
Jun 25, 2025 -
CI test linux://python/ray/train:accelerate_torch_trainer_no_raydata is consistently_failing
#48939 closed
Jun 25, 2025 -
CI test linux://python/ray/train:deepspeed_torch_trainer is consistently_failing
#44517 closed
Jun 25, 2025 -
Release test training_ingest_benchmark-task=image_classification.full_training.jpeg failed
#53953 closed
Jun 25, 2025 -
[Core] Autoscaler Node Recovery Ignores Node-Specific Docker Config
#53987 closed
Jun 25, 2025 -
[Doc][KubeRay] Run doctest `user-guides/configuring-autoscaling.ipynb` in CI
#53989 closed
Jun 25, 2025 -
CI test windows://python/ray/tests:test_basic is consistently_failing
#51497 closed
Jun 25, 2025 -
[CI] Migrate from flake8 to ruff
#34889 closed
Jun 25, 2025 -
[Docker] Upgrade the base image from ubuntu:focal to ubuntu:22.04LTS
#35514 closed
Jun 25, 2025 -
CI test linux://python/ray/data:test_backpressure_e2e is flaky
#49963 closed
Jun 25, 2025 -
CI test linux://python/ray/tests:test_runtime_env_complicated is consistently_failing
#49674 closed
Jun 25, 2025 -
CI test linux://python/ray/data:test_execution_optimizer is consistently_failing
#44410 closed
Jun 25, 2025 -
[Dashboard] Decorator that exposes attribute to dashboard for display in grid
#33188 closed
Jun 24, 2025 -
[serve] AttributeError when attempting to use serve with cluster and FastAPI
#54008 closed
Jun 24, 2025 -
[gcp] Node mistakenly marked dead: increase heartbeat timeout?
#16945 closed
Jun 24, 2025 -
Docs on Cython extensions and install requirements
#7094 closed
Jun 24, 2025 -
[core] Detached actor being killed when its parent actor crashes
#40864 closed
Jun 24, 2025 -
CI test linux://doc:doctest[data] is consistently_failing
#54036 closed
Jun 24, 2025 -
CI test linux://python/ray/data:doctest is consistently_failing
#44570 closed
Jun 24, 2025 -
[data/proprocessors] Support flattening vector features in concatenator
#51757 closed
Jun 24, 2025 -
[Docs][KubeRay] Don't sleep for a long time in `kuberay-gcs-ft.ipynb`
#54040 closed
Jun 24, 2025 -
Release test many_nodes_actor_test_on_v2.aws failed
#53990 closed
Jun 24, 2025 -
CI test linux://doc/source/train/examples/lightning:lightning_cola_advanced is consistently_failing
#44545 closed
Jun 24, 2025 -
CI test linux://python/ray/train:accelerate_torch_trainer is consistently_failing
#44513 closed
Jun 24, 2025 -
CI test linux://python/ray/train:deepspeed_torch_trainer_no_raydata is consistently_failing
#44932 closed
Jun 24, 2025 -
CI test windows://python/ray/serve/tests:test_request_timeout is flaky
#48417 closed
Jun 24, 2025 -
[old]
#54020 closed
Jun 23, 2025 -
CI test linux://rllib:learning_tests_multi_agent_stateless_cartpole_ppo_multi_cpu is consistently_failing
#47313 closed
Jun 23, 2025 -
CI test windows://python/ray/serve/tests:test_batching is consistently_failing
#46016 closed
Jun 23, 2025 -
CI test linux://python/ray/tests:test_runtime_env_container is consistently_failing
#45223 closed
Jun 23, 2025 -
[CI] `linux://python/ray/tests:test_state_api` is failing/flaky on master.
#54001 closed
Jun 23, 2025 -
Ability to select a disk for ray workers
#8607 closed
Jun 23, 2025 -
Conflict between ROCR_VISIBLE_DEVICES and HIP_VISIBLE_DEVICES environment variables causes Ray import error
#53737 closed
Jun 21, 2025 -
CI test linux://python/ray/serve/tests:test_multiplex is flaky
#48378 closed
Jun 21, 2025 -
[RLlib] MAML does not work with TF2 in Ray 2.3.1
#34620 closed
Jun 20, 2025
45 Issues opened by 33 people
-
[Serve] UnboundLocalError: local variable 'stopped' in deployment state
#54169 opened
Jun 27, 2025 -
[core][gpu-objects] Hide the details of constructing process groups
#54168 opened
Jun 27, 2025 -
[core][gpu-objects] Support streaming generator
#54167 opened
Jun 27, 2025 -
[core][gpu-objects] Support DTensor
#54166 opened
Jun 27, 2025 -
ERROR services.py:1355 -- Failed to start the dashboard , return code 3221226505
#54165 opened
Jun 27, 2025 -
Assessment of the difficulty in porting CPU architecture for Ray
#54162 opened
Jun 27, 2025 -
CI test linux://python/ray/tests:test_scheduling_client_mode is flaky
#54160 opened
Jun 27, 2025 -
[core] ray.util.state.api.get_actor with timeout = 1s does not work
#54153 opened
Jun 26, 2025 -
[Core] Ray fails to fulfill request due to node being annotated by IP address
#54152 opened
Jun 26, 2025 -
Ray component: Core ray.init() fails on windows since #51731
#54151 opened
Jun 26, 2025 -
[core] Improving Ray Typing annotation
#54149 opened
Jun 26, 2025 -
[Core] bug in _check_uv_existence() method of uv runtime backend breaks installing packages in ray runtimes
#54134 opened
Jun 26, 2025 -
Release test air_example_gptj_deepspeed_fine_tuning failed
#54133 opened
Jun 26, 2025 -
CI test linux://:local_object_manager_test is flaky
#54131 opened
Jun 26, 2025 -
[Core] ray job submit may hang in some scenarios
#54120 opened
Jun 26, 2025 -
CI test linux://python/ray/data:test_arrow_block_scaling is consistently_failing
#54110 opened
Jun 25, 2025 -
[Docker] [CI] Bump the GPU base image to a newer version
#54102 opened
Jun 25, 2025 -
[Data] `ArrowInvalid` during `ray.data.from_huggingface`: Parquet magic bytes not found in footer
#54101 opened
Jun 25, 2025 -
[Core] ray._raylet.CoreWorker.put_file_like_object, parameter owner_address unused
#54100 opened
Jun 25, 2025 -
[Data] Allow parameterized queries in `read_sql`
#54098 opened
Jun 25, 2025 -
[RLlib] num_env_steps_sampled_lifetime is wrong after checkpoint loaded - bug changed in 2.47
#54089 opened
Jun 25, 2025 -
[Core] When pinning object, transient error on RPC `PubsubLongPolling` causes job stuck
#54081 opened
Jun 25, 2025 -
[Docs][KubeRay] Convert configuring-autoscaling.ipynb back to markdown docs
#54077 opened
Jun 25, 2025 -
[Docs][KubeRay] Convert rayservice-quick-start.ipynb back to markdown docs
#54076 opened
Jun 25, 2025 -
[Docs][KubeRay] Convert raycluster-quick-start.ipynb back to markdown docs
#54074 opened
Jun 25, 2025 -
[Docs][KubeRay] Delete KubeRay doctests
#54073 opened
Jun 25, 2025 -
[Epic][Docs/KubeRay] Convert doctests back to normal markdown docs
#54072 opened
Jun 25, 2025 -
[serve.llm] vLLM engine became unhealthy under high incoming traffic
#54070 opened
Jun 25, 2025 -
[data] support streaming writes for `write_lance`
#54069 opened
Jun 25, 2025 -
[train] Can not start training on more than one node
#54065 opened
Jun 25, 2025 -
CI test linux://:node_manager_test is flaky
#54059 opened
Jun 25, 2025 -
[train] Add Azure Files support to persistent storage documentation
#54054 opened
Jun 24, 2025 -
[Core] ray cannot start under macos + anaconda + python 3.13 + bash
#54047 opened
Jun 24, 2025 -
[Core] Ray postmortem debugging does not work with python 3.12
#54044 opened
Jun 24, 2025 -
[RFC] Improving Ray for Post-Training / RL for LLM Projects
#54021 opened
Jun 23, 2025 -
[Core] Multi-threaded ray.get can hang in certain situations.
#54007 opened
Jun 23, 2025 -
[CI] `linux://python/ray/tests:test_scheduling_debug_mode` is failing/flaky on master.
#54002 opened
Jun 23, 2025 -
Ray worker resolves module to __init__.py instead of actual file for nested package class
#53998 opened
Jun 22, 2025 -
[Data] When writing on BigQuery, Google's "TooManyRequests" exceptions is not retried
#53997 opened
Jun 22, 2025 -
[data] Slow fetching of metadata for large number of parquet files
#53995 opened
Jun 22, 2025 -
[Rllib] Bug in TorchMultiDistribution logp prevents policy mapping from being used
#53994 opened
Jun 22, 2025 -
[core][gpu-objects] Allow sending ObjectRefs to other processes
#53978 opened
Jun 20, 2025 -
[core][gpu-objects] Support ray.put
#53977 opened
Jun 20, 2025 -
[core][gpu-objects] RDMA support for data transfer
#53976 opened
Jun 20, 2025
224 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
[train] TrainStateActor periodically checks controller status and sets aborted
#53818 commented on
Jun 26, 2025 • 36 new comments -
[Core] Add Logic to Emit Task Events to Event Aggregator
#53402 commented on
Jun 26, 2025 • 29 new comments -
[core] Fix race condition b/w object eviction & repinning for recovery.
#53934 commented on
Jun 27, 2025 • 20 new comments -
[Docs][KubeRay] Update all KubeRay version references for KubeRay 1.4.0 release
#53884 commented on
Jun 25, 2025 • 17 new comments -
python depsets tool
#53904 commented on
Jun 27, 2025 • 14 new comments -
[Core] Add default Ray Node labels at Node init
#53360 commented on
Jun 25, 2025 • 10 new comments -
Add progress bars to hash operators
#53175 commented on
Jun 26, 2025 • 10 new comments -
[doc][kuberay] state `rayStartParams` is optional starting with KubeRay 1.4.0
#53943 commented on
Jun 25, 2025 • 9 new comments -
[Data] - write_parquet enable both partition by & min_rows_per_file, max_rows_per_file
#53930 commented on
Jun 24, 2025 • 9 new comments -
BLD: Automatically patch ``.bazelrc`` file for Windows 11 build
#53586 commented on
Jun 26, 2025 • 8 new comments -
[Refactor]Rename NCCL-related items to comm_backend
#51061 commented on
Jun 24, 2025 • 8 new comments -
[core] Add as_completed and map_unordered APIs
#53461 commented on
Jun 27, 2025 • 7 new comments -
[Feat][Core] Implement Event Aggregator Agent
#53182 commented on
Jun 26, 2025 • 6 new comments -
[core] Fix gcs register actor callback check
#53634 commented on
Jun 25, 2025 • 5 new comments -
Add `pin_memory` to `iter_torch_batches`
#53792 commented on
Jun 23, 2025 • 4 new comments -
feat(runtime_env): add Azure Blob Storage support
#53135 commented on
Jun 26, 2025 • 4 new comments -
[core][telemetry/11] support histogram metric on worker side
#53740 commented on
Jun 24, 2025 • 4 new comments -
Pass parameters to custom routers through LLMConfig
#53870 commented on
Jun 27, 2025 • 4 new comments -
[core][compiled graphs] Supporting allreduce on list of input nodes
#51047 commented on
Jun 25, 2025 • 3 new comments -
[WIP][core][gpu-objects] GC
#53911 commented on
Jun 24, 2025 • 3 new comments -
[core][telemetry/10] support custom gauge+counter+sum metrics
#53734 commented on
Jun 24, 2025 • 3 new comments -
Update V2 Autoscaler to support scheduling using Node labels and LabelSelector API
#53578 commented on
Jun 25, 2025 • 2 new comments -
[core][telemetry/09] record sum metric e2e
#53512 commented on
Jun 24, 2025 • 2 new comments -
[doc][kuberay] add version skew warning for plugin and RayCluster
#53950 commented on
Jun 26, 2025 • 2 new comments -
[Doc][KubeRay] Add doc for running KubeRay dashboard
#53830 commented on
Jun 24, 2025 • 1 new comment -
[core] Ungracefully exit if the agent dies unexpectedly
#53847 commented on
Jun 23, 2025 • 1 new comment -
[core] adding additional stats to the dump object store usage api.
#53856 commented on
Jun 26, 2025 • 1 new comment -
Bump flask-cors from 4.0.0 to 6.0.0 in /python
#53116 commented on
Jun 24, 2025 • 0 new comments -
[Data] Fixing null-safety when converting to `TensorArray`
#52977 commented on
Jun 26, 2025 • 0 new comments -
[core] Use GetResourceLoadRequest as a substitute liveness check
#52971 commented on
Jun 25, 2025 • 0 new comments -
[RLlib; Offline RL] - Use `iter_torch_batches` in learner
#52968 commented on
Jun 20, 2025 • 0 new comments -
[deps] upgrade pandas to always use 2+
#52961 commented on
Jun 26, 2025 • 0 new comments -
[Serve] Prioritize stopping most recently scaled-up replicas during downscaling
#52929 commented on
Jun 26, 2025 • 0 new comments -
[Chore][Dashboard] Move `TrainHead` to `python/ray/train` folder
#52014 commented on
Jun 25, 2025 • 0 new comments -
macos wheel build debug
#53119 commented on
Jun 25, 2025 • 0 new comments -
[data] fix lance dataset schema
#53134 commented on
Jun 24, 2025 • 0 new comments -
[docs] updating broken links on rllib torch doc
#53161 commented on
Jun 26, 2025 • 0 new comments -
Bump tornado from 6.1 to 6.5.1 in /python
#53274 commented on
Jun 24, 2025 • 0 new comments -
[data] Add GroupedData.random_sample() for group-wise sampling
#53313 commented on
Jun 25, 2025 • 0 new comments -
[WIP] Fix daft test
#53338 commented on
Jun 24, 2025 • 0 new comments -
fix: Type of AlgorithmConfig.training(learner_connector
#53369 commented on
Jun 25, 2025 • 0 new comments -
[serve.llm] DO NOT REVIEW, IN DRAFT
#53391 commented on
Jun 24, 2025 • 0 new comments -
try running things with protobuf 4
#53442 commented on
Jun 24, 2025 • 0 new comments -
[Data] Make `from_items` lineage serializable
#52026 commented on
Jun 24, 2025 • 0 new comments -
[WIP] Ray Data doc updates
#52062 commented on
Jun 25, 2025 • 0 new comments -
[Data,Train] Add helpful errors when running forbidden methods on sharded datasets
#52079 commented on
Jun 27, 2025 • 0 new comments -
[Dashboard] Add GPU component usage
#52102 commented on
Jun 24, 2025 • 0 new comments -
upgrade path to python protobuf 4
#52194 commented on
Jun 24, 2025 • 0 new comments -
[train] upgrade tensorflow-datasets
#52195 commented on
Jun 24, 2025 • 0 new comments -
[build] warning when username or homedir include @ character
#52274 commented on
Jun 25, 2025 • 0 new comments -
[core] Static Priority Scheduling (1/N)
#52439 commented on
Jun 23, 2025 • 0 new comments -
[core] Static Priority scheduling (2/N)
#52465 commented on
Jun 24, 2025 • 0 new comments -
[core] Static Priority scheduling (4/N)
#52489 commented on
Jun 24, 2025 • 0 new comments -
[core] Static Priority Scheduling (3/N)
#52506 commented on
Jun 24, 2025 • 0 new comments -
Adapt Dask on Ray to the new Dask Task class
#52589 commented on
Jun 27, 2025 • 0 new comments -
[core] [easy] readability improvements for IO Workers
#52590 commented on
Jun 26, 2025 • 0 new comments -
[Dashboard] Allow getting dashboard URL via RuntimeContext
#52676 commented on
Jun 25, 2025 • 0 new comments -
check if ray is installed when using conda env
#52677 commented on
Jun 25, 2025 • 0 new comments -
[core] Minor pull manager cleanup
#52724 commented on
Jun 24, 2025 • 0 new comments -
[Core][Refactor] Create separate RPCs for cancelling prepared PG bundle and removing PG
#52751 commented on
Jun 24, 2025 • 0 new comments -
[core] Remove copy when receiving small object returns
#52777 commented on
Jun 24, 2025 • 0 new comments -
[core] Remove small task output copy on task execution path
#52778 commented on
Jun 24, 2025 • 0 new comments -
[core][refactor] Move `to_resubmit_` from CoreWorker to TaskManager to avoid an abstraction leak
#52779 commented on
Jun 25, 2025 • 0 new comments -
[ci] try running cicd unit tests in forge env
#52792 commented on
Jun 27, 2025 • 0 new comments -
Train Tests: Use map_batches for image_classification
#52837 commented on
Jun 24, 2025 • 0 new comments -
[core] Synchronize locations with pinned_at_raylet_id
#52920 commented on
Jun 25, 2025 • 0 new comments -
[core] Add sync get node info to NodeInfoAccessor
#52928 commented on
Jun 26, 2025 • 0 new comments -
[core] Adding a nightly benchmark for continuous, bidirectional object transfer on two nodes.
#53657 commented on
Jun 24, 2025 • 0 new comments -
[refactor] Install uv from test-requirements.txt
#53685 commented on
Jun 26, 2025 • 0 new comments -
[WIP] Remove old uv runtime env plugin
#53690 commented on
Jun 25, 2025 • 0 new comments -
Bump requests from 2.32.3 to 2.32.4 in /python
#53691 commented on
Jun 23, 2025 • 0 new comments -
[RLlib] Examples folder do-over (vol 53): Learning 2-agent cartpole with global observation, 1 policy outputting all agents' actions, and individual rewards.
#53697 commented on
Jun 26, 2025 • 0 new comments -
[RLlib; Offline RL] Implement Offline Policy Evaluation (OPE) via Importance Sampling.
#53702 commented on
Jun 23, 2025 • 0 new comments -
(serve.llm): Refactor/Consolidate LoRA downloading
#53714 commented on
Jun 26, 2025 • 0 new comments -
Bump scikit-learn from 1.3.2 to 1.5.1 in /doc/source/ray-overview/examples/e2e-timeseries
#53721 commented on
Jun 25, 2025 • 0 new comments -
[RLlib; docs] Docs do-over (new API stack): `ConnectorV2` documentation.
#53732 commented on
Jun 26, 2025 • 0 new comments -
[WIP] Remove test cases for `gcs_actor_based_scheduling`
#53733 commented on
Jun 26, 2025 • 0 new comments -
[core] upgrade opentelemetry-sdk
#53745 commented on
Jun 26, 2025 • 0 new comments -
Test
#53746 commented on
Jun 26, 2025 • 0 new comments -
Add example gpt2 tuning script
#53750 commented on
Jun 27, 2025 • 0 new comments -
[core] Add switch for the cache of runtime env
#53775 commented on
Jun 25, 2025 • 0 new comments -
[serve] Add telemetry for users with Pydantic version < 2
#53779 commented on
Jun 27, 2025 • 0 new comments -
Bump tqdm from 4.64.1 to 4.66.3 in /python
#53820 commented on
Jun 23, 2025 • 0 new comments -
[RLlib] Mixin Layer Design Sketch Up
#53850 commented on
Jun 26, 2025 • 0 new comments -
[core] Don't queue in flight submissions by attempt number
#53866 commented on
Jun 22, 2025 • 0 new comments -
[ci] add python 3.13 ray docker image build
#53894 commented on
Jun 23, 2025 • 0 new comments -
Update deletion policy for rayjob quick start
#53929 commented on
Jun 20, 2025 • 0 new comments -
[core][GPU objects] Attach tensor transport to task args protobuf
#53935 commented on
Jun 27, 2025 • 0 new comments -
[Data] Replaced `get_object_locations` with `get_local_object_locations`
#53942 commented on
Jun 26, 2025 • 0 new comments -
finishing commit for issue #52113
#53964 commented on
Jun 20, 2025 • 0 new comments -
feat: Add QPS-based autoscaling policy for Ray Serve
#53445 commented on
Jun 24, 2025 • 0 new comments -
Bump torch from 2.0.1 to 2.7.0 in /doc/source/templates/testing/docker/03_serving_stable_diffusion
#53447 commented on
Jun 24, 2025 • 0 new comments -
[Data] Add fillna function
#53459 commented on
Jun 24, 2025 • 0 new comments -
[Data] Added distinct function
#53460 commented on
Jun 26, 2025 • 0 new comments -
[Serve] Set the docs path after app is initialized on the replica
#53463 commented on
Jun 22, 2025 • 0 new comments -
[core][compiled graphs] Unify and simplify NCCL operation nodes
#53470 commented on
Jun 21, 2025 • 0 new comments -
[RLlib] Wrapper which allows EnvRunners to operate on environments with Repeated observation spaces
#53519 commented on
Jun 24, 2025 • 0 new comments -
[core] Turn executed task inserted into a RAY_CHECK
#53522 commented on
Jun 23, 2025 • 0 new comments -
ray: fix handling large chunks
#53535 commented on
Jun 26, 2025 • 0 new comments -
[RLlib] Upgrade RLlink protocol for external env/simulator training.
#53550 commented on
Jun 25, 2025 • 0 new comments -
Bump torch from 2.3.0 to 2.7.1 in /python
#53558 commented on
Jun 26, 2025 • 0 new comments -
[Data] [Draft] user guide for aggregations
#53568 commented on
Jun 21, 2025 • 0 new comments -
[Not for Merge] Event Aggregator Perf
#53576 commented on
Jun 27, 2025 • 0 new comments -
[CI] Re-enable isort for all remaining files
#53583 commented on
Jun 22, 2025 • 0 new comments -
[Do not merge] Run ray data release tests with export API
#53594 commented on
Jun 21, 2025 • 0 new comments -
[core] Cleanup retryable grpc client
#53599 commented on
Jun 21, 2025 • 0 new comments -
Fix 53605
#53607 commented on
Jun 24, 2025 • 0 new comments -
[core] Remove experimental `max_cpu_frac_per_node`
#53610 commented on
Jun 25, 2025 • 0 new comments -
[rllib] IMPALA fix no attribute '_minibatch_size'
#53620 commented on
Jun 21, 2025 • 0 new comments -
[core] Support broadcast and reduce collective for compiled graphs
#53625 commented on
Jun 24, 2025 • 0 new comments -
[core] Gcs actor manager cleanup
#53633 commented on
Jun 22, 2025 • 0 new comments -
[Air] Add Video FPS Support for `WandbLoggerCallback`
#53638 commented on
Jun 27, 2025 • 0 new comments -
[Serve] Check multiple FastAPI ingress deployments in a single application
#53647 commented on
Jun 27, 2025 • 0 new comments -
[Core] Starting multiple local instances on one node may result in errors due to randomly selecting t E5BA he same port.
#53906 commented on
Jun 24, 2025 • 0 new comments -
[Azure] Ray up for Azure fails
#48976 commented on
Jun 24, 2025 • 0 new comments -
[Dashboard] Support for List Tasks Filter Pushdown
#53970 commented on
Jun 24, 2025 • 0 new comments -
CI test linux://rllib:learning_tests_cartpole_dqn_multi_cpu is flaky
#47214 commented on
Jun 24, 2025 • 0 new comments -
[Serve] Proxy issues: Request cancellation, intermittent 503 backpressure, and max_queued_requests configuration not applied
#53794 commented on
Jun 24, 2025 • 0 new comments -
[Docker][CI] Add Python 3.13 Ray Image to CI
#53923 commented on
Jun 25, 2025 • 0 new comments -
[Core] [Observability] Add PID to structured logs
#52840 commented on
Jun 25, 2025 • 0 new comments -
[core] Ray fails to reuse GPU to create new actor when CUDA_VISIBLE_DEVICES is set
#44821 commented on
Jun 25, 2025 • 0 new comments -
[Core] Ray 2.47 regression: All tasks hang when using `uv`
#53848 commented on
Jun 25, 2025 • 0 new comments -
[Autoscaler, data] Ray starts `AutoscalingRequester` even when using `enableInTreeAutoscaling`
#51559 commented on
Jun 25, 2025 • 0 new comments -
TypeError: Descriptors cannot not be created directly.
#36417 commented on
Jun 25, 2025 • 0 new comments -
[Ray serve] StopAsyncIteration error thrown by ray when the client cancels the request
#51598 commented on
Jun 25, 2025 • 0 new comments -
[Ray Data]Pylint detection found some Python code defects in ray data
#53881 commented on
Jun 25, 2025 • 0 new comments -
[Core] Ray Label Selector API Implementation Tracker
#51564 commented on
Jun 26, 2025 • 0 new comments -
CI test linux://rllib:learning_tests_multi_agent_cartpole_ppo_multi_gpu is flaky
#46226 commented on
Jun 26, 2025 • 0 new comments -
[Core] Transient network failure on RPC `WaitForActorRefDeleted` causes actor registration fail
#53797 commented on
Jun 26, 2025 • 0 new comments -
[Core] `InternalKVPut` retries incorrectly when encountering transient error
#53946 commented on
Jun 26, 2025 • 0 new comments -
Release test random_shuffle_fixed_size failed
#53806 commented on
Jun 26, 2025 • 0 new comments -
[<Ray component: Core|RLlib|etc...>] Ray Timeout Error running VLLM Multi-Node(tp_size=2) Online Server with Acl_Graph when handling curl request
#53845 commented on
Jun 26, 2025 • 0 new comments -
[RLlib] Checkpoint metrics loading with Tune is broken in 2.47.0
#53877 commented on
Jun 26, 2025 • 0 new comments -
Windows VS WSL2
#53924 commented on
Jun 26, 2025 • 0 new comments -
Check failed: WarmupStore() when starting process
#53094 commented on
Jun 26, 2025 • 0 new comments -
[Serve][llm] Make Serve LLM endpoint 100% compatible with the engine's native server.
#53533 commented on
Jun 26, 2025 • 0 new comments -
[core][ray client] fetch_local flag to ray.wait is not respected for ray client
#52401 commented on
Jun 26, 2025 • 0 new comments -
[Core] Ray causes a 25% slower GPU performance compared with manually written Multi-processing program on 8 Hopper GPUs
#53799 commented on
Jun 26, 2025 • 0 new comments -
[core][compiled graphs] Slow NCCL init on H200 server
#53619 commented on
Jun 26, 2025 • 0 new comments -
[Serve] Unable to load meta-llama/Llama-3.3-70B-Instruct
#53571 commented on
Jun 27, 2025 • 0 new comments -
Update multi-agent-envs.rst
#50075 commented on
Jun 22, 2025 • 0 new comments -
[Data/Preprocessors]: Preprocessors do not work with nested records
#53920 commented on
Jun 20, 2025 • 0 new comments -
[Core] BUG: Cluster crashes when using temp_dir "could not connect to socket" raylet.x [since 2.7+]
#44431 commented on
Jun 20, 2025 • 0 new comments -
CI test linux://rllib:examples/metrics/custom_metrics_in_algorithm_training_step is flaky
#51870 commented on
Jun 21, 2025 • 0 new comments -
[core|serve] Migrate shared utilities from `ray._private` to `ray._common`
#53478 commented on
Jun 21, 2025 • 0 new comments -
[Serve] Make replica scheduler backoff configurable
#52871 commented on
Jun 21, 2025 • 0 new comments -
[Core] Ray Does Not Detect GPU
#53919 commented on
Jun 21, 2025 • 0 new comments -
[core][gpu-objects] Support streaming to overlap computation / communication
#51643 commented on
Jun 23, 2025 • 0 new comments -
[Core] Ray hangs with vllm0.8.5 v1 api for tp8+pp4
#53758 commented on
Jun 23, 2025 • 0 new comments -
[serve][dashboard] Show last line instead of first line in Serve app status message
#35600 commented on
Jun 23, 2025 • 0 new comments -
[VM launcher] Document how to set up the cluster when there is UFW firewall
#35254 commented on
Jun 23, 2025 • 0 new comments -
[Core] ux issues of ray state cli for tasks
#30805 commented on
Jun 23, 2025 • 0 new comments -
Ray kill actor API is a GET request
#18411 commented on
Jun 23, 2025 • 0 new comments -
[Core] Support setting options to the pip install command
#52679 commented on
Jun 23, 2025 • 0 new comments -
[Dashboard] A button to shut down the ray cluster from the dashboard UI
#29208 commented on
Jun 23, 2025 • 0 new comments -
[core] Get IP Address of Actor
#7431 commented on
Jun 23, 2025 • 0 new comments -
[Data] `dataset.write_iceberg` error
#52967 commented on
Jun 23, 2025 • 0 new comments -
[Ray Core/Dashboard] - Installing Ray via UV breaks dashboard.
#53608 commented on
Jun 23, 2025 • 0 new comments -
[RFC] GPU object store support in Ray Core
#51173 commented on
Jun 23, 2025 • 0 new comments -
[Autoscaler][v1] Autoscaler launches extra nodes despite fulfilled resource demand
#52864 commented on
Jun 24, 2025 • 0 new comments -
[Serve] DeploymentResponse._to_object_ref() blocks untill final results from actor
#46893 commented on
Jun 24, 2025 • 0 new comments -
[Core] Submitted containerized job is stuck in pending mode
#37293 commented on
Jun 24, 2025 • 0 new comments -
Clusters (AWS) - SSH Access to head node via AWS Session Manager
#38885 commented on
Jun 24, 2025 • 0 new comments -
StreamSplitDataIterator(epoch=-1, split=0) blocked waiting on other clients for more than 30s.
#42008 commented on
Jun 24, 2025 • 0 new comments -
[Serve.llm] Clean up output logs and give option to opt out of different verbosity levels
#53492 commented on
Jun 24, 2025 • 0 new comments -
[Core] ray._raylet.ObjectRef and ray.types.ObjectRef type compabtibility
#53591 commented on
Jun 24, 2025 • 0 new comments -
Ray Serve Replica Initialization Timeout: STDOUT "Failed to load", RequestCancelledError, Likely Due to Slow/Crashing RLModule.from_checkpoint()
#53079 commented on
Jun 24, 2025 • 0 new comments -
[Core] ASSERTION FAILED: queue.num_items() == 0
#53510 commented on
Jun 24, 2025 • 0 new comments -
[Data] Aggregation is doing internal conversions that breaks on list-like AggType
#52257 commented on
Jun 24, 2025 • 0 new comments -
[core][collective] Avoid creation of `gloo_queue` in race condition
#50132 commented on
Jun 22, 2025 • 0 new comments -
[Autoscaler][V2] Use running node instances to rate-limit upscaling
#50414 commented on
Jun 22, 2025 • 0 new comments -
[RLlib] Enable spliting and zero padding of Dict observation
#50589 commented on
Jun 22, 2025 • 0 new comments -
[Core] Split stats_metric into smaller targets to improve build performance
#50595 commented on
Jun 23, 2025 • 0 new comments -
[core] Cover cpplint for ray/src/ray/stats
#50678 commented on
Jun 23, 2025 • 0 new comments -
[CI] Enable pretty-format-java pre-commit hook
#50957 commented on
Jun 25, 2025 • 0 new comments -
fix restore BUG "RuntimeError: Expected scalars to be on CPU, got cud…
#50983 commented on
Jun 22, 2025 • 0 new comments -
Suppress type error
#50994 commented on
Jun 23, 2025 • 0 new comments -
[doc] add jax example
#51040 commented on
Jun 22, 2025 • 0 new comments -
[core] Always create a default executor
#51058 commented on
Jun 23, 2025 • 0 new comments -
[CI] Replace `black` with `ruff format`
#51332 commented on
Jun 25, 2025 • 0 new comments -
[Dashboard] Support reporting AMD GPU usage
#51345 commented on
Jun 27, 2025 • 0 new comments -
[Core] Cover cpplint for ray/src/ray/common
#51551 commented on
Jun 24, 2025 • 0 new comments -
[Docs][wip] Feature: adopt llms.txt convention
#51605 commented on
Jun 26, 2025 • 0 new comments -
update to protbuf-28.2, absl-20240722, grpc-1.67 and patch for windows
#51673 commented on
Jun 23, 2025 • 0 new comments -
windows dev setup
#51678 commented on
Jun 24, 2025 • 0 new comments -
[Docs][KubeRay] Add guide for writing KubeRay doctests
#51708 commented on
Jun 25, 2025 • 0 new comments -
[core] Lazily subscribe to node changes from workers
#51718 commented on
Jun 23, 2025 • 0 new comments -
[Core] Native CPU affinity support for accelerators
#51719 commented on
Jun 27, 2025 • 0 new comments -
[core] Remove object store runner
#51766 commented on
Jun 23, 2025 • 0 new comments -
[core] Get cloud provider with ray on kubernetes
#51793 commented on
Jun 23, 2025 • 0 new comments -
[core][wip] Trying bzlmod
#51834 commented on
Jun 23, 2025 • 0 new comments -
[core] Support `.options` chaining in `actor.options`
#51836 commented on
Jun 25, 2025 • 0 new comments -
[Data] Fix bug where pandas blocks don't use tensor extension
#51868 commented on
Jun 24, 2025 • 0 new comments -
[Fix][Core] Fail fast if the dashboard agent fails to launch the HTTP server
#51960 commented on
Jun 25, 2025 • 0 new comments -
test for raycirun
#52012 commented on
Jun 24, 2025 • 0 new comments -
[Chore][Dashboard] Move DataHead to python/ray/data/ folder
#52013 commented on
Jun 25, 2025 • 0 new comments -
[RLlib] `TorchMultiCategorical.to_deterministic()` cannot handle Multi-agent + LSTM case
#52177 commented on
Jun 27, 2025 • 0 new comments -
[serve.llm] Ray LLM serving not respecting max_completion_tokens parameter
#53922 commented on
Jun 27, 2025 • 0 new comments -
[serve.llm] LLM serving seems not working with mistral tokenizer.
#53873 commented on
Jun 27, 2025 • 0 new comments -
CI test linux://rllib:examples/evaluation/evaluation_parallel_to_training_multi_agent_duration_auto is flaky
#53255 commented on
Jun 27, 2025 • 0 new comments -
[Dashboard] Decoupling dashboard and dashboard lifetime from Ray Cluster
#46444 commented on
Jun 27, 2025 • 0 new comments -
[core] Implement runtime plugins for additional package managers (mamba, micromamba, pixi, etc.)
#45572 commented on
Jun 27, 2025 • 0 new comments -
CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_cpu is flaky
#47264 commented on
Jun 27, 2025 • 0 new comments -
CI test linux://python/ray/data:test_json is flaky
#48150 commented on
Jun 27, 2025 • 0 new comments -
Add Apple silicon GPU(mps) support to ray
#38464 commented on
Jun 26, 2025 • 0 new comments -
verify windows wheels.
#43442 commented on
Jun 24, 2025 • 0 new comments -
remove flaky marker from test
#44033 commented on
Jun 26, 2025 • 0 new comments -
[data] add better support for list-typed fields when using `write_bigquery`
#44564 commented on
Jun 25, 2025 • 0 new comments -
Enable setting OS disk size in Azure
#45867 commented on
Jun 25, 2025 • 0 new comments -
Fix malformed `temp_dir` path when connecting Windows workers to cluster with Linux head
#45930 commented on
Jun 25, 2025 • 0 new comments -
[URL] Change the absolute path to a relative path to solve the ingres…
#45933 commented on
Jun 24, 2025 • 0 new comments -
Fix mlflow artifact logging
#46570 commented on
Jun 25, 2025 • 0 new comments -
[bazel] move python rules up
#47260 commented on
Jun 27, 2025 • 0 new comments -
:bug: do not modify user-provided runtime_env
#48021 commented on
Jun 22, 2025 • 0 new comments -
[Core]: Fix ConnectionError on Autoscaler CR lookups in K8s clusters …
#48675 commented on
Jun 22, 2025 • 0 new comments -
[Fix][GCS] Implement reconnection for RedisContext
#48781 commented on
Jun 25, 2025 • 0 new comments -
[Build][Deps] Add new `ray[azure]` extra package
#48847 commented on
Jun 25, 2025 • 0 new comments -
Update azure.md - Missing azure dependency
#49104 commented on
Jun 26, 2025 • 0 new comments -
[Fix][Core] Periodically check log message queue cleared before shutdown
#49337 commented on
Jun 25, 2025 • 0 new comments -
[core][cgraph] Use threadpool and one io_context for mutable object provider
#49500 commented on
Jun 22, 2025 • 0 new comments -
[core][cgraph] Use cv instead of busy wait for next version
#49542 commented on
Jun 23, 2025 • 0 new comments -
[core] Don't get dashboard address after each dashboard connection failure
#49584 commented on
Jun 22, 2025 • 0 new comments -
[DATA]Add custom resources in data autoscaling
#49756 commented on
Jun 24, 2025 • 0 new comments -
[core] Thread-safe gcs node manager
#50024 commented on
Jun 22, 2025 • 0 new comments