10000 Workflow runs · Qihoo360/360-LLaMA-Factory · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

Actions: Qihoo360/360-LLaMA-Factory

Actions

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
11 workflow runs
11 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

对于sp, 当用ulysses替换ring_flash_attn时,为什么loss很大
label_issue #11: Issue #55 opened by cangshuli
May 13, 2025 07:45 7s
May 13, 2025 07:45 7s
Please update the QR code for wechat
label_issue #10: Issue #54 opened by BeverYang
May 12, 2025 17:04 8s
May 12, 2025 17:04 8s
[Help] can support Qwen3
label_issue #8: Issue #51 opened by dubeno
April 30, 2025 03:27 9s
April 30, 2025 03:27 9s
Sequence_parrallel 导致nccl通信timeout
label_issue #7: Issue #49 opened by wongzhenhao
April 23, 2025 09:03 8s
April 23, 2025 09:03 8s
关于华为NPU的支持
label_issue #5: Issue #47 opened by githubwqj
April 17, 2025 03:21 7s
April 17, 2025 03:21 7s
Non-shuffled order sample per-epoch
label_issue #4: Issue #46 opened by xiye17
April 14, 2025 19:32 10s
April 14, 2025 19:32 10s
序列并行为什么不支持attenstion dropout?
label_issue #3: Issue #43 opened by duomicoding
April 8, 2025 03:34 13s
April 8, 2025 03:34 13s
SP does NOT work with liger kernel
label_issue #2: Issue #36 opened by XD-BDIV-NLP
March 22, 2025 16:55 9s
March 22, 2025 16:55 9s
后面可以支持deepseek3的template吗
label_issue #1: Issue #32 opened by KevinFan0
March 21, 2025 07:28 8s
March 21, 2025 07:28 8s
0