prefill only models roadmap · Issue #1 · noooop/wde · GitHub
prefill only models roadmap #1
Open
@noooop

Description

Inference performance optimization

  1. torch.compile
  2. CUDA graphs
  3. SageAttention
  4. FP8

Serving performance optimization

Usability improvements

  1. Evaluation
    • MTEB & C-MTEB

New models

Documentation improvements

Logging and monitoring
