8000 test: speeding up tests by using dummy small models by terrykong · Pull Request #233 · NVIDIA-NeMo/RL · GitHub

test: speeding up tests by using dummy small models #233


Closed · wants to merge 3 commits

Conversation

terrykong
Contributor
@terrykong terrykong commented Apr 19, 2025

Issue

Our unit tests have been acting more like functional tests, which ballooned our unit-test time. We need to shift gradually toward mocking and testing smaller units, deferring the longer, more involved tests to functional tests.

For now, this is a stopgap since the CI time was untenable.
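The "shift to mocking" direction mentioned above could look roughly like the sketch below. The class and function names are hypothetical, not from the NeMo-RL codebase; the point is only that a test can verify the unit's logic against an autospecced stand-in instead of spinning up a real model.

```python
from unittest import mock

# Hypothetical stand-in for an expensive model-backed client; in the real
# codebase this would be whatever component the test currently exercises
# end to end.
class SlowModelClient:
    def generate(self, prompt: str) -> str:
        raise RuntimeError("would spin up a large model")

def summarize(client: SlowModelClient, prompt: str) -> str:
    # Unit under test: post-processing logic, independent of the model.
    return client.generate(prompt).strip().lower()

def test_summarize_uses_client():
    # Autospec keeps the mock's call signature in sync with the real class.
    client = mock.create_autospec(SlowModelClient, instance=True)
    client.generate.return_value = "  HELLO  "
    assert summarize(client, "hi") == "hello"
    client.generate.assert_called_once_with("hi")
```

A test like this runs in milliseconds, which is the property the PR is chasing.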

Proposal

Switch all tests to using small dummy (deterministic) models.
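A "small dummy (deterministic) model" could be built around a tiny config and a fixed RNG seed, so every test run and worker sees identical weights. This is an illustrative sketch only; the field names and sizes below are hypothetical, not the configs this PR actually uses.

```python
import random

# Hypothetical tiny-model config for fast tests; values are illustrative.
TINY_CONFIG = {
    "hidden_size": 64,
    "num_layers": 2,
    "num_attention_heads": 2,
    "vocab_size": 512,
}

def init_dummy_weights(config: dict, seed: int = 42) -> dict:
    """Deterministically initialize per-layer weights from a fixed seed,
    so repeated runs produce bit-identical parameters."""
    rng = random.Random(seed)
    return {
        f"layer_{i}.weight": [
            rng.uniform(-0.02, 0.02) for _ in range(config["hidden_size"])
        ]
        for i in range(config["num_layers"])
    }
```

Determinism is what lets assertions on log probs (rather than exact logits) stay stable across CI runs.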

TL;DR

Before

=========== 184 passed, 3 skipped, 9 warnings in 1763.80s (0:29:23) ============

After

============ 182 passed, 5 skipped, 6 warnings in 764.30s (0:12:44) ============

@terrykong terrykong added the CI:L0 Run doctests and unit tests label Apr 19, 2025
@terrykong terrykong changed the title Unit mock ray tests: speeding up tests by using dummy small models Apr 19, 2025
@terrykong terrykong changed the title tests: speeding up tests by using dummy small models test: speeding up tests by using dummy small models Apr 19, 2025
@terrykong terrykong added CI:L0 Run doctests and unit tests Run CICD Set to run CI (unset + set to rerun) and removed CI:L0 Run doctests and unit tests Run CICD Set to run CI (unset + set to rerun) labels Apr 19, 2025
Commits (each signed off by Terry Kong <terryk@nvidia.com>):

- make native checkpoints faster
- fix tokenizer size
- add pytest line logger for debugging
- vllm wip
- wip
- fix
- wip
- durations=30
- breakpoint
- fix missing test_data
- fix
- fix vocab sizes
- fix api issues and change the dtensor check to log prob, since logit checking is too strict for this oddly initialized play model
- simplify tests fsdp1 too
- cleanup
@github-actions github-actions bot added the CI Relating to CI label Apr 20, 2025
@terrykong terrykong marked this pull request as ready for review April 20, 2025 01:06
@terrykong terrykong added CI:L1 Run doctests, unit tests, and functional tests and removed CI:L0 Run doctests and unit tests labels Apr 20, 2025
"Who is the president of the USA? Who is the president of the USA? Who is",
"What is the capital of the moon? A. Houston, Texas B. New York City",
"Where is the sun? Where is the moon? Where is the earth?",
"a b B B B B B B B B B B",
Contributor
Will we really catch bugs with this mock test? Should we rather switch to 126m param real model?

Contributor Author

@parthchadha That's true. The focus of my PR was just to drive down the unit-test time, so admittedly the fidelity of the test signal is worsened by doing so.

I still think 126M is too large, especially if we want to add more tests. One option is that I could SFT the toy model on sequences like:

a b c d e ....
0 1 2 3 4....
2 4 6 8 ....
n v i d i a .....

and maybe push it to a personal hub for now, since we don't want to commit the binary. wdyt?
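Those patterned sequences could also be generated on the fly rather than pushed anywhere as a binary. A minimal sketch (function name and default length are illustrative, not from the PR):

```python
import string

def toy_sft_sequences(length: int = 12) -> list[str]:
    """Build the simple patterned sequences suggested above, so a toy
    model can be SFT'd on fully predictable data."""
    return [
        " ".join(string.ascii_lowercase[:length]),           # a b c d ...
        " ".join(str(i) for i in range(length)),             # 0 1 2 3 ...
        " ".join(str(2 * i) for i in range(1, length + 1)),  # 2 4 6 8 ...
        " ".join(("nvidia" * length)[:length]),              # n v i d i a ...
    ]
```

Generating the data inside the test keeps the repo free of committed model or dataset binaries.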

@yuki-666
Contributor

After merging the latest main branch, the threshold and refit_buffer_size_gb in test_vllm_weight_update_memory in tests/unit/models/generation/test_vllm_generation.py should also be updated; otherwise the test is meaningless.

Do you need my help to update it after you resolve the conflicts with main?

@terrykong
Contributor Author

closing in favor of #315

@terrykong terrykong closed this May 5, 2025
Labels
CI:L1 Run doctests, unit tests, and functional tests CI Relating to CI
4 participants