-
Baidu
Lists (1)
Sort Name ascending (A-Z)
Stars
Arena-Hard-Auto: An automatic LLM benchmark.
[ACL 2025] Official resources of "FinanceReasoning: Benchmarking Financial Numerical Reasoning More Credible, Comprehensive and Challenging".
High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle
The official repository for ERNIE 4.5 and ERNIEKit – its industrial-grade development toolkit based on PaddlePaddle.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…
国家中小学智慧教育平台 电子课本下载工具,帮助您从智慧教育平台中获取电子课本的 PDF 文件网址并进行下载,让您更方便地获取课本内容。
Apache Atlas - Open Metadata Management and Governance capabilities across the Hadoop platform and beyond
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
The Metadata Platform for your Data and AI Stack
🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…
Interactively Chat with arXiv AI Papers
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…
Our code for ICLR'25 paper "DataMan: Data Manager for Pre-training Large Language Models".
A visual playground for agentic workflows: Iterate over your agents 10x faster
A fluent, scalable, and easy-to-use LLM data processing framework.
ByteCheckpoint: An Unified Checkpointing Library for LFMs
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
OpenRefine is a free, open source power tool for working with messy data and improving it
A comprehe 53BD nsive benchmark for data cleaning methods and their impact of ML models
pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。
DeepSeek-VL: Towards Real-World Vision-Language Understanding